repair meta data error and bad chars

Started by Lanthony, March 29, 2021, 07:12:33 AM

Previous topic - Next topic

Lanthony

PRELUDE:

Over the last twenty plus years have been digitising photos and slides using various scanners and any application software that is provided. This was done mainly with windows XP and later Win 7.

Eventually digital cameras and now smartphones have relegated all that to the scrap heap.

Currently the main computer (new) has win10 with all the digitised media.(with multi backups)

Note also have Linux mint on the old computer to use for experimenting with exiftool.

While using ffmpeg with some of the photos (and videos) found the results OK but not exactly as expected. (still learning to use that as well)

PROBLEM:

Long story short it seems the scanner SW has been very recalcitrant in what it did especially when it allowed adding extra details like comments etc.

All the supplied SW and its behaviours has been left behind when migrated to win7. Until now with exiftool exposing them all.

Exiftool -r -s -G -csv -sort DIR > all.csv

or due to majority being jpg

Exiftool -r -s -G -csv -sort -ext jpg DIR > alljpg.csv

The csv file opened in libre office calc and immediately could see odd characters in many places.

1-Some bad chars where in the rows associated with an image and in many places that can be linked to the SW used in scanning and subsequent editing.  But not all can be though. Some even using modern SW on both win10 and Linux. Then again it might be that it was there in the first place (maybe but still looking).

2-Some bad characters where found scattered in the first column (source file). Maybe leaked from the previous image but easy enough to delete the rows..

Opened the csv file in linux mint standard text editor and it made sure it was know that it had bad chars and not to be used saved etc.

Tracked bad chars to different columns that could then (in calc) have the content cleared leaving the header which then satisfied the text editor.

My thought experiment was to use the said csv file to update images after deleting everything.
Exiftool -r -ext jpg -all= DIR
then
Exiftool -r -ext jpg -csv=alljpg.csv DIR

That way it would repair any images --- but no.

exiftool -validate -error -warning -a DIR

did not provide any errors only warnings with minor number in brackets.
Same before and after above.


QUESTION:

Is it possible for exiftool to repair or even delete the errors  (bad chars).
Anything that is not done right etc..

exiftool  -r -all= -tagsfromfile @ -all:all -unsafe -icc_profile DIR

Still left bad chars in the csv file.


END:

most important fields are

dateTimeOriginal (if any)
comments within old photos (if any)

make and model would be advantageous to help identify who (camera used) and event location etc.

of course all location data in GPS enabled devices

If all goes well then will add (change) other fields in csv file to update each image.
EG: dates to reflect when photo was taken rather than when scanned.


StarGeek

What are these "bad characters"?  This is very vague and gives us no information to work with.  Can you provide an example image with this problem?

Also, have you made adjustments for the command line code page?  See FAQ #18.  Windows has extremely poor support for non-ascii characters and UTF8 characters can show up as weird even though that actual data in the file is correct.  This is because Windows would be displaying 2 byte-1 character symbols as 2 characters, something like é instead of just © for the copyright symbol.

Try checking one of these files on the linux machine to see if they're actually bad.  Also check with a program with good metadata support such as Adobe Bridge or DigiKam, both of which are free. 
* Did you read FAQ #3 and use the command listed there?
* Please use the Code button for exiftool code/output.
 
* Please include your OS, Exiftool version, and type of file you're processing (MP4, JPG, etc).

Lanthony

As mentioned I  use Linux.

I use Linux with exif tools installed so as to learn about exiftools and what it can do and cannot do.

I use Linux with a copy of photos so any mistakes (by me) do not cause any long term grief.

Previously all my work flow has been with a progression of windows versions and currently all photos are stored on the latest windows machine.

I am aware of windows behaviours when using the cli but if you mean using and storing on windows machine is the problem, well anything is possible in the digital age.

Digicam
Have use it in the past and it was one of those apps that tries to be everything for everybody and very annoying at that. Had to eventually remove it due to its behaviour that could not be tamed.

Will have to resurrect it now to see if it can recognise my problems and help with answers.

BTW: did I mention that I use Linux.