utf-16 encoded tags in id3v2 are not extracted

Started by kiefermat, March 17, 2016, 08:36:22 AM

Previous topic - Next topic

kiefermat

The attached mp3 file with id3v2.4 tags contains utf-16 encoded text in some fields. Exiftool does not show these tags:
exiftool umlaute_utf16.mp3

ExifTool Version Number         : 10.13
File Name                       : umlaute_utf16.mp3
Directory                       : .
File Size                       : 3.3 kB
Resource Fork Size              : 1338 bytes
File Modification Date/Time     : 2016:02:22 14:53:22+01:00
File Access Date/Time           : 2016:03:17 11:48:30+01:00
File Inode Change Date/Time     : 2016:02:22 14:54:57+01:00
File Permissions                : rw-r--r--
File Type                       : MP3
File Type Extension             : mp3
MIME Type                       : audio/mpeg
MPEG Audio Version              : 1
Audio Layer                     : 3
Sample Rate                     : 44100
Channel Mode                    : Joint Stereo
MS Stereo                       : On
Intensity Stereo                : Off
Copyright Flag                  : False
Original Media                  : True
Emphasis                        : None
VBR Frames                      : 11
VBR Bytes                       : 3284
VBR Scale                       : 60
Encoder                         : LAME3.99r
Lame VBR Quality                : 4
Lame Quality                    : 0
Lame Method                     : VBR (new/mtrh)
Lame Low Pass Filter            : 17.5 kHz
Lame Bitrate                    : 32 kbps
Lame Stereo Mode                : Joint Stereo
ID3 Size                        : 93
Audio Bitrate                   : 91.4 kbps
Duration                        : 0.29 s (approx)


Using mutagen-inspect (https://mutagen.readthedocs.org), I get the following output:
mutagen-inspect umlaute_utf16.mp3

-- umlaute_utf16.mp3
- MPEG 1 layer 3, 108077 bps (VBR, LAME 3.99.1+), 44100 Hz, 2 chn, 0.24 seconds (audio/mp3)
TALB=älbüm
TCON=gènré
TIT2=title
TPE1=ärtist


The exiftool version I tested with is 10.13 and the result is the same on Mac and on Debian Wheezy.

Phil Harvey

Thanks for the sample.

This is going to require some work.  What wrote this file?  It looks to me as if this ID3v2.4 format is incorrect.  The flag for the extended header is set, but there appears to be no extended header, which causes problems for ExifTool.

- Phil
...where DIR is the name of a directory/folder containing the images.  On Mac/Linux/PowerShell, use single quotes (') instead of double quotes (") around arguments containing a dollar sign ($).

kiefermat

Sorry, I don't know with which software this file is written. It is part of an test set (that's why it is so short) and it is at least 2 years old.