ExifTool Forum

General => Metadata => Topic started by: obetz on May 15, 2025, 01:06:24 AM

Title: Reveal old PDF metadata
Post by: obetz on May 15, 2025, 01:06:24 AM
Hi all,

AFAIK, ExifTool only appends PDF metadata, the old metadata stays in the document but is inactive.

How can I read the previous metadata after such a change (assuming that I didn't re-linearize the PDF)?

Or completely revert the change done by ExifTool?
Title: Re: Reveal old PDF metadata
Post by: StarGeek on May 15, 2025, 01:27:59 AM
From the PDF Tags page (https://exiftool.org/TagNames/PDF.html)
QuoteIf ExifTool was used to modify a PDF file, the original may be recovered by deleting the PDF-update pseudo-group (with -PDF-update:all= on the command line).

This is the link I keep handy to describe the incremental update. "Incremental Updates in PDF files", Debenu Foxit (https://www.debenu.com/kb/incremental-updates/)

Phil has said in the past that exiftool can only remove updates it has made, not those made by other programs if they also use incremental updates.
Title: Re: Reveal old PDF metadata
Post by: obetz on May 15, 2025, 02:27:11 AM
Thanks, "incremental update" is the description I was missing.


Title: Re: Reveal old PDF metadata
Post by: obetz on May 15, 2025, 09:07:07 AM
the search for suitable tools resulted in a few hits

https://blog.didierstevens.com/programs/pdf-tools/ -> pdftool.py "can check if a PDF contains incremental updates, and select the versions you want"

pdftool.py iu source.pdf lists all incremental updates

pdftool.py -d -s 2 -o out.pdf iu source.pdf extracts version 2

There is also a Java tool "Multivalent" on Sourceforge, but the author removed the "undo" part a few years ago.
Title: Re: Reveal old PDF metadata
Post by: StarGeek on May 15, 2025, 10:30:36 AM
Oh, nice. A new tool for the toolkit.