Exiftool based duplicate finder - open source script

Started by krzysiu, March 02, 2025, 01:45:16 PM

Previous topic - Next topic

krzysiu

Hey! I made it some time ago for myself, but ultimately I found some people need such tool, so I'm sharing it:
https://github.com/Krzysiu/ImageDataDupes
One day I will ask GPT to rewrite it in Python, so I'll be able to compile it. There's more things to do, all described in the link under "todo".

Main points:
* this tool finds duplicates of images using Exiftool $imagedatamd5, which means that it looks for identical image data, metadata may differ. So file with and without GPS will be marked as duplicate.
* it just lists files, no changes to files are made, nor deletions - i.e. it's safe!
* in short, if you already have PHP and Exiftool, you run it by php digest.php dirwhere dir is optional parameter for path (or starting path, in recursive mode)

Preview (these things on gray bg are "flags" - metadata blocks that are (or aren't) present, we can see base file and ~copy.cr2 lacks "I-2", which means "two fields in ITPC".



"We would use teleporters and live on another planets, if only ExifTool would be present when I was researching cosmos and physics"
Albert Einstein