Identifies and processes duplicate files between a scan directory and a reference directory.
Updated Jun 11, 2024 - Python
Interact, analyze and structure massive text, image, embedding, audio and video datasets
CLI utility to find near duplicate images and remove all but the best copy.
Duplicates Detector is a cross-platform GUI utility for finding duplicate files, allowing you to delete or link them to save space. Duplicate files are displayed and processed on two synchronized panels for efficient and convenient operation.
An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.
Search for duplicate files based on extension.
Language of Vectors (LangVec) is a simple Python library designed for transforming numerical vector data into a language-like structure using a predefined set of words (lexicon).
A Python library for scanning a file system, finding duplicated files, etc.
Find and delete duplicate files in a folder using regex
Duplicate files viewer
This repository contains scripts for detecting duplicate files in a specified directory. The scripts use hash functions to identify duplicates by comparing file contents, ensuring accuracy regardless of file names. Available for Bash, PowerShell, and Python environments.
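The content-hash approach described above can be sketched roughly as follows. This is an illustrative example only, not code from that repository: the function names `file_digest` and `find_duplicates`, the choice of SHA-256, and the size pre-filter are all assumptions made for the sketch.

```python
import hashlib
from collections import defaultdict
from pathlib import Path


def file_digest(path, chunk_size=65536):
    """Return the SHA-256 hex digest of a file, read in chunks (hypothetical helper)."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_duplicates(root):
    """Group files under `root` whose contents hash identically (hypothetical helper)."""
    # Assumed optimization: files of different sizes cannot be identical,
    # so only files that share a size are hashed.
    by_size = defaultdict(list)
    for p in Path(root).rglob("*"):
        if p.is_file():
            by_size[p.stat().st_size].append(p)

    by_hash = defaultdict(list)
    for paths in by_size.values():
        if len(paths) < 2:
            continue
        for p in paths:
            by_hash[file_digest(p)].append(p)

    # Keep only groups with at least two files; file names play no role.
    return [sorted(group) for group in by_hash.values() if len(group) > 1]
```

Calling `find_duplicates("some/directory")` returns a list of groups, each a sorted list of paths with identical contents. Grouping by size first avoids reading files that cannot match anything.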
Detect duplicate images locally
Simple Python script to get rid of duplicated files
RDMP3: a simple tool for removing duplicate MP3 files
GUI application for finding image duplicates
Find, remove and avoid duplicates with dugu: The Duplicates Guru