How to count number of files with two specific keywords?

Started by Lagrandeimage, December 12, 2020, 06:53:29 PM

Previous topic - Next topic

Lagrandeimage

Hello,

I would like to count the number of files who contain two specific keywords.

Added difficulty: one of the keywords is identical to the other part from accented characters  => Bérénice and B?r?nice.

The next step will probably be to delete B?r?nice (If as I suspect the two keywords are present in the same files), I would use :

exiftool -subject-=B?r?nice -r DIR

But that would delete the Bérénice as well no? How to make the "?" be treated as normal character?

Thanks in advance,

Cheers.

Phil Harvey

Question marks are not special in the -= argument.  They are treated as any other character.

- Phil
...where DIR is the name of a directory/folder containing the images.  On Mac/Linux/PowerShell, use single quotes (') instead of double quotes (") around arguments containing a dollar sign ($).

Lagrandeimage

Great thanks Phil, that settles a part of the problem.

Is there a solution for counting the  number of files where both Bérénice and B?r?nice are set?

Luuk2005

Greetings Lagrandeimage, if on Windows it can be like ...
exiftool -q -if "$keywords =~ /\bWord1\b/i and $keywords = ~/\bWord2\b/i" -p "$filepath" "YourFolderPath" |findstr /N "^"
This to present all the filepaths, with a number in the front.

If on Linux, it can be like ...
exiftool -q -if '$keywords =~ /(?=.*\bWord1\b)(?=.*\bWord2\b)/i' -p '$filename' 'YourFolderPath' |sed -n '$='
This to only count the files, not showing any names or paths.

If Word1 and Word2 must be case-sensitive matching, please to destroy i after the /regexes/.
Windows8.1-64bit,  exiftool-v12.92(standalone),  sed-v4.0.7

StarGeek

Note that this will also match keywords that contain the indicated word.  And since an apostrophe is considered a word boundry, using the above to search for Bérénice would also match Bérénice's daughter, as well as any keyword that contains Bérénice as a separate word.

Also, sed isn't available on Windows unless you install a ported version.

Try this if you want to match the keyword exactly (change double quotes to single if on Mac/Linux)
exiftool -sep "##" -if "$Subject=~m/(?:##|^)Bérénice(?:$|##)/ and $Subject=~m/(?:##|^)B?r?nice(?:$|##)/" -subject /path/to/files/

I use the -sep option with an unlikely character combination because the default comma separator would split a keyword of "Smith, John"
"It didn't work" isn't helpful. What was the exact command used and the output.
Read FAQ #3 and use that cmd
Please use the Code button for exiftool output

Please include your OS/Exiftool version/filetype

Luuk2005

#5
Yes, Im liking the ##-separator much better, because it lets you also search for \b chars in the keywords!
You can even use something crazy like "###||###" if its really to be needed. Many thanks, StarGeek!

So with Windows, it can be like ...
exiftool -q -sep "##" -if "$subject=~/(^|##)Word1(##|$)/i and $subject=~/(^|##)Word2(##|$)/i" -p "$filepath" "YourFolderPath" |findstr /N "^"
This to present all the filepaths, with a number in the front.

If on Linux, it can be like ...
exiftool -q -sep '##' -if '$subject=~/(?=(^|.*##)Word1(##|$))(?=(^|.*##)Word2(##|$))/i' -p '$filename' 'YourFolderPath' |sed -n '$='
This to only count the files, not show any names or paths.

If Word1 and Word2 must be case sensitive match, please to destroy i after the /regexes/.
There is also ways to only show total count# in Windows, but its much longer commands.
Windows8.1-64bit,  exiftool-v12.92(standalone),  sed-v4.0.7

Lagrandeimage