Scan for duplicate files.
Double File Scanner
1. Introduction

This tool scans the selected directory or directories for duplicate files,
i.e. files with identical content. Duplicate files are identified by first
calculating the SHA-1 digest of each file and then looking for digests that
appear more than once. Files with identical content are guaranteed to have
the same SHA-1 digest, while files with differing content will have different
SHA-1 digests with very high probability.
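
The digest step can be illustrated with a short Python sketch. This is not the
program's own code; the function name sha1_of_file and the 1 MiB chunk size are
assumptions made for the example.

```python
import hashlib


def sha1_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal SHA-1 digest of the file at `path`.

    Hypothetical helper for illustration; `chunk_size` is an arbitrary choice.
    """
    digest = hashlib.sha1()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large files never have to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()
```

Two files with the same bytes always yield the same string here, so comparing
the returned digests stands in for comparing the file contents.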

All computed SHA-1 values are stored in a hash table, so repeated digests are
found quickly and we do NOT need to compare every digest to every other one.
Also, the files are processed concurrently in multiple "worker" threads in
order to parallelize and speed up the SHA-1 computations on multi-core
processors. On our test machine it took ~15 minutes to analyse all ~260,000
files on the system drive (~63.5 GB). During this scan ~44,000 duplicates
were found.
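
The combination of a hash table and worker threads described above can be
sketched in Python as follows. This is a minimal illustration under stated
assumptions, not the program's actual implementation: the names
find_duplicates and sha1_of_file are hypothetical, a dict plays the role of
the hash table, and a thread pool plays the role of the worker threads.

```python
import hashlib
import os
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def sha1_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal SHA-1 digest of one file (hypothetical helper)."""
    digest = hashlib.sha1()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def find_duplicates(root, workers=None):
    """Map each SHA-1 digest that occurs more than once to its file paths."""
    paths = [
        os.path.join(folder, name)
        for folder, _subdirs, names in os.walk(root)
        for name in names
    ]
    groups = defaultdict(list)  # digest -> list of paths (the "hash table")
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map hashes files concurrently and yields results in input order.
        for path, digest in zip(paths, pool.map(sha1_of_file, paths)):
            groups[digest].append(path)
    # Keep only digests shared by two or more files.
    return {d: sorted(files) for d, files in groups.items() if len(files) > 1}
```

Each file is hashed once and inserted into the table once, so the work grows
with the number of files rather than with the number of file pairs. The sketch
does not handle unreadable files or symbolic links.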

Once the scan is completed, the program provides commands to review, rename or
delete the duplicate files manually. Alternatively, the "automatic clean-up"
wizard may be invoked in order to remove ALL redundant files in a single step.
Optionally, the list of duplicates can be exported in XML or INI format.

2. Platform Support

The following platforms are supported:
- Windows 8 and 8.1, 32-Bit and 64-Bit versions
- Windows 7, 32-Bit and 64-Bit versions
- Windows XP, Service Pack 3 is required!

Platforms that should work too, but are not tested extensively:
- Windows Vista, 32-Bit and 64-Bit versions
- Windows XP Professional x64 Edition, Service Pack 2 is required!

Download Service Pack 3 for Windows XP here:

3. Anti-Virus Warning

Anti-Virus programs can interfere with the Double File Scanner software and
significantly slow down the scan! It is therefore highly recommended to turn
off the "real time scanner" or "guard" feature of your Anti-Virus program
while the Double File Scanner is running. But don't forget to re-enable it
afterwards!

4. Command-Line Options

The following command-line options are available:
  --console           Enable the debug console
  --scan <directory>  Scan the specified directory; may be given multiple times

List of influential environment variables:
  DBLSCAN_THREADS     Set the number of worker threads (default: auto detect)
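
As an illustration of how these options combine, the following Python sketch
assembles an argument list and an environment for launching the program from
a script. The helper name build_invocation and the executable name
"DoubleFileScanner.exe" are placeholders that do not come from this document,
and it is an assumption that DBLSCAN_THREADS takes a plain integer value.

```python
import os


def build_invocation(executable, directories, threads=None, console=False):
    """Return (args, env) for one launch of the scanner (hypothetical helper)."""
    args = [executable]
    if console:
        args.append("--console")
    for directory in directories:
        # --scan may be repeated, once per directory.
        args += ["--scan", directory]
    env = dict(os.environ)  # copy, so the caller's environment is untouched
    if threads is not None:
        # Assumption: the variable holds a plain integer thread count.
        env["DBLSCAN_THREADS"] = str(threads)
    return args, env


# Placeholder executable name and directories, for illustration only.
args, env = build_invocation(
    "DoubleFileScanner.exe", ["C:\\Photos", "D:\\Backup"], threads=4
)
```

The resulting pair could then be passed to subprocess.run(args, env=env).
Leaving threads unset keeps the default auto-detection.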

5. Frequently Asked Questions

Q: Can the scanning process be aborted?
A: Yes, simply press the 'ESC' key!

Q: Can the scanning process be suspended/resumed?
A: Yes, simply press the 'Pause' key!