.. toctree:: :maxdepth: 2 :hidden: :caption: Getting started /getting_started/installation /getting_started/basic_usage /getting_started/cli_usage /getting_started/output
.. toctree:: :maxdepth: 2 :hidden: :caption: Methods and parameters /methods/build /methods/search /methods/search_moveto /methods/search_delete
.. toctree:: :maxdepth: 2 :hidden: :caption: Contributing /contributing/contributing /contributing/support
.. toctree:: :maxdepth: 2 :hidden: :caption: Further Resources /resources/desktop /resources/large_datasets /resources/supported_filetypes /resources/report_bug
difPy is a Python package that automates the search for duplicate/similar images.
difPy searches for images in one or more directories, compares the images it found and checks whether these are duplicates. It then outputs the image files classified as duplicates, as well as the images having the lowest resolutions, so that you know which of the duplicate images are safe to be moved/deleted. You can then either move/delete them manually, or let difPy do this for you.
difPy does not compare images based on their hashes. It compares them based on their tensors i. e. the image content. This allows you to let difPy not only search for duplicate images, but also for similar images.
difPy leverages Python's multiprocessing capabilities and is therefore able to perform at high performance even on large datasets.
Note
✨ difPy will soon be available as an app for your desktop! Learn more.
View difPy on GitHub and PyPi.