Scan Type: This option determines what aspect of the files will be compared in the duplicate scan. If you select Filename, dupeGuru will compare every filenames word-by-word and, depending on the other settings below, it will determine if enough words are matching to consider 2 files duplicates. If you select Content, only files with the exact same content will match.
The Folders scan type is a bit special. When you choose it, dupeGuru will scan for duplicate folders instead of duplicate files. To determine whether two folders are duplicates, all files contained in the folders will be scanned, and if the contents of all files in the folders match, the folders will be considered duplicates.
Filter Hardness: If you chose the Filename scan type, this option determines how similar two filenames must be for dupeGuru to consider them duplicates. If the filter hardness is, for example 80, it means that 80% of the words of two filenames must match. To determine the matching percentage, dupeGuru first counts the total number of words in both filenames, then count the number of words matching (every word matching count as 2), and then divide the number of words matching by the total number of words. If the result is higher or equal to the filter hardness, we have a duplicate match. For example, “a b c d” and “c d e” have a matching percentage of 57 (4 words matching, 7 total words).
Word weighting: If you chose the Filename scan type, this option slightly changes how matching percentage is calculated. With word weighting, instead of having a value of 1 in the duplicate count and total word count, every word have a value equal to the number of characters they have. With word weighting, “ab cde fghi” and “ab cde fghij” would have a matching percentage of 53% (19 total characters, 10 characters matching (4 for “ab” and 6 for “cde”)).
Match similar words: If you turn this option on, similar words will be counted as matches. For example “The White Stripes” and “The White Stripe” would have a match % of 100 instead of 66 with that option turned on. Warning: Use this option with caution. It is likely that you will get a lot of false positives in your results when turning it on. However, it will help you to find duplicates that you wouldn’t have found otherwise. The scan process also is significantly slower with this option turned on.
Can mix file kind: If you check this box, duplicate groups are allowed to have files with different extensions. If you don’t check it, well, they aren’t!
Ignore duplicates hardlinking to the same file: If this option is enabled, dupeGuru will verify duplicates to see if they refer to the same inode. If they do, they will not be considered duplicates. (Only for OS X and Linux)
Use regular expressions when filtering: If you check this box, the filtering feature will treat your filter query as a regular expression. Explaining them is beyond the scope of this document. A good place to start learning it is regular-expressions.info.
Remove empty folders after delete or move: When this option is enabled, folders are deleted after a file is deleted or moved and the folder is empty.
Copy and Move: Determines how the Copy and Move operations (in the Action menu) will behave.
In all cases, dupeGuru nicely handles naming conflicts by prepending a number to the destination filename if the filename already exists in the destination.
Custom Command: This preference determines the command that will be invoked by the “Invoke Custom Command” action. You can invoke any external application through this action. This can be useful if, for example, you have a nice diffing application installed.
The format of the command is the same as what you would write in the command line, except that there are 2 placeholders: %d and %r. These placeholders will be replaced by the path of the selected dupe (%d) and the path of the selected dupe’s reference file (%r).
If the path to your executable contains space characters, you should enclose it in “” quotes. You should also enclose placeholders in quotes because it’s very possible that paths to dupes and refs will contain spaces. Here’s an example custom command:
"C:\Program Files\SuperDiffProg\SuperDiffProg.exe" "%d" "%r"