Find duplicate photos (or other file types) on your computer.
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
web clean up. Mar 31, 2018
.gitignore update project structure. Mar 21, 2018
LICENSE
README.md update readme Jun 30, 2018
auto.py update auto script. Oct 21, 2018
config.py clean up. Mar 31, 2018
db.py update read me and clean up. Mar 27, 2018
requirements.txt add web project. Mar 25, 2018
scan.py update read me and clean up. Mar 27, 2018
utils.py add delete file handler. Mar 26, 2018
web.py

README.md

photodup

Find duplicate photos (or any other file types) on your computer. Python 3.x required.

Step 0

Setup workspace and install requirements.

pip install -r requirements.txt

Step 1

Create a database for later scanning.

python db.py

This action will create a photo.db in workspace.

Step 2

Run command to scan all photos from somewhere, multiple dirs accepted.

python scan.py dir1 dir2

This action will scan all photo files into database (*.jpg by default, check config.py).

Step3

Run command to start a web page to help you identify duplicate photos / files.

python web.py

Launch browser to http://127.0.0.1:5001 to cleanup the duplicates. You will be able to clean up files by hash or file name.

dup_by_hash

dup_by_name