Problem Extracting Metadata on PDF Files on RHEL

Started by dmphan2, July 21, 2017, 08:45:26 AM

Previous topic - Next topic

dmphan2

Hi,

With a same command and parameters, I can extract metadata on Windows, but not on Red Hat Enterprise Linux.  Would anyone able to help me figuring why? Output from the two platforms as well as the PDF file are attached

Windows: exiftool --ZIP:* --Track1:* --PDF:* --XMP-xmpMM:Subject --XMP-photoshop:History --XMP-crss:SavedSettings* --IFD0:ProfileHueSatMapData1 --IFD0:ProfileHueSatMapData2 --XMP-crs:all --ExifIFD:ImageWidth --ExifIFD:ImageHeight -m -X -c "%.6f Degrees" x:\temp\AHB2014r54037.pdf > x:\temp\AHB2014r54037.pdf.windows.txt

RHEL: exiftool --ZIP:* --Track1:* --PDF:* --XMP-xmpMM:Subject --XMP-photoshop:History --XMP-crss:SavedSettings* --IFD0:ProfileHueSatMapData1 --IFD0:ProfileHueSatMapData2 --XMP-crs:all --ExifIFD:ImageWidth --ExifIFD:ImageHeight -m -X -c "%.6f Degrees" /dev/temp/AHB2014r54037.pdf > /dev/temp/nmah_pdf/AHB2014r54037.pdf.linux.txt

dmphan2


dmphan2

My pdf file is over 14 MB, I could not upload it.  :(

Phil Harvey

You should quote the arguments containing wildcards if you don't want the shell to expand them on Linux.

Also, if the PDF is encrypted, you may need to install Digest::SHA.  Use "exiftool -ver -v" to see the available optional libraries.

- Phil
...where DIR is the name of a directory/folder containing the images.  On Mac/Linux/PowerShell, use single quotes (') instead of double quotes (") around arguments containing a dollar sign ($).

dmphan2

Hi Phil,

Thank you for looking at this issue.  Here is the output of that command.  Do you know which library I have to install?

[tuser@si-dams401 mm16]$ exiftool -ver -v
ExifTool version 10.48
Perl version 5.016003 (-C0)
Platform: linux
Optional libraries:
  Archive::Zip                 (not installed)
  Compress::Zlib               (not installed)
  Digest::MD5                  (not installed)
  Digest::SHA                  (not installed)
  IO::Compress::Bzip2          (not installed)
  POSIX::strptime              (not installed)
  Unicode::LineBreak           (not installed)
  IO::Compress::RawDeflate     (not installed)
  IO::Uncompress::RawInflate   (not installed)
[tuser@si-dams401 mm16]$

Phil Harvey

Check the ExifTool output for warnings.  It should tell you which one(s) it wants.

- Phil
...where DIR is the name of a directory/folder containing the images.  On Mac/Linux/PowerShell, use single quotes (') instead of double quotes (") around arguments containing a dollar sign ($).

dmphan2

Thank you Phil!  After my system admin added those missing Perl libraries, exiftool was able to extract PDF metadata.

-Dewey