Bachelor's Thesis on Near-Duplicate Image Detection. This repo contains all resources, code, and documentation developed during the process.
-
Updated
May 22, 2024 - Python
Bachelor's Thesis on Near-Duplicate Image Detection. This repo contains all resources, code, and documentation developed during the process.
Language of Vectors (LangVec) is a simple Python library designed for transforming numerical vector data into a language-like structure using a predefined set of words (lexicon).
Python library for detecting near duplicate texts in a corpus at scale using Locality Sensitive Hashing, as described in chapter three of Mining Massive Datasets.
A Simple Image Clustering Script using CLIP and Hierarchial Clustering
ISCC: International Standard Content Code
Add a description, image, and links to the near-duplicate-detection topic page so that developers can more easily learn about it.
To associate your repository with the near-duplicate-detection topic, visit your repo's landing page and select "manage topics."