Holds code for near-duplicate image parser using optimized image classifiers.
Updated Sep 16, 2021 - Jupyter Notebook
Simple library for finding duplicate and near-duplicate text documents in massive sets/libraries/databases
Exploiting the PyTerrier library to build a Search Engine and resolve the Near Duplicate Detection tasks.
A Simple Image Clustering Script using CLIP and Hierarchical Clustering
First homework for the Advanced Data Mining course
Fast image similarity search with hash tables (Golang). Version 1
Image similarity in Golang. Version 4 (LATEST)
ISCC: International Standard Content Code
Bachelor's Thesis on Near-Duplicate Image Detection. This repo contains all resources, code, and documentation developed during the process.
Fast image similarity search with hash tables (Golang). Version 2 (LATEST)
Multi module project focused on near-duplicate search for images.
Language of Vectors (LangVec) is a simple Python library designed for transforming numerical vector data into a language-like structure using a predefined set of words (lexicon).
An application for comparing images using various image hashing algorithms
Python library for detecting near duplicate texts in a corpus at scale using Locality Sensitive Hashing, as described in chapter three of Mining Massive Datasets.
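For readers unfamiliar with the technique named in the last entry, here is a minimal sketch of MinHash signatures with LSH banding, roughly following the outline in chapter three of Mining Massive Datasets. It is not taken from any of the repositories listed here. The function names (`shingles`, `minhash_signature`, `candidate_pairs`, `estimated_jaccard`) and the parameter values (`k=5`, 64 hash functions, 16 bands) are assumptions chosen for illustration only.

```python
import hashlib
from collections import defaultdict
from itertools import combinations


def shingles(text, k=5):
    """Return the set of k-character shingles of a lowercased, whitespace-normalised text."""
    text = " ".join(text.lower().split())
    return {text[i:i + k] for i in range(max(len(text) - k + 1, 1))}


def _hash(seed, shingle):
    """Deterministic 64-bit hash of a shingle; the seed selects the hash function."""
    digest = hashlib.blake2b(
        shingle.encode("utf-8"),
        digest_size=8,
        salt=seed.to_bytes(8, "little"),
    ).digest()
    return int.from_bytes(digest, "little")


def minhash_signature(shingle_set, num_hashes=64):
    """One minimum per hash function; similar sets agree in many positions."""
    return [min(_hash(seed, s) for s in shingle_set) for seed in range(num_hashes)]


def candidate_pairs(signatures, bands=16):
    """Return pairs of document ids whose signatures agree on at least one band.

    `signatures` maps a document id to its MinHash signature (hypothetical layout).
    """
    rows = len(next(iter(signatures.values()))) // bands
    buckets = defaultdict(set)
    for doc_id, sig in signatures.items():
        for b in range(bands):
            band = tuple(sig[b * rows:(b + 1) * rows])
            buckets[(b, band)].add(doc_id)
    pairs = set()
    for ids in buckets.values():
        pairs.update(combinations(sorted(ids), 2))
    return pairs


def estimated_jaccard(sig_a, sig_b):
    """Fraction of matching signature positions, an estimate of Jaccard similarity."""
    return sum(a == b for a, b in zip(sig_a, sig_b)) / len(sig_a)


if __name__ == "__main__":
    docs = {
        "a": "The quick brown fox jumps over the lazy dog",
        "b": "the quick  brown fox jumps over the lazy dog",  # same text after normalisation
        "c": "An entirely unrelated sentence about image hashing",
    }
    sigs = {doc_id: minhash_signature(shingles(text)) for doc_id, text in docs.items()}
    print(candidate_pairs(sigs))
    print(estimated_jaccard(sigs["a"], sigs["b"]))
```

Only pairs that share a bucket in some band are compared further, which is what lets this approach avoid the quadratic all-pairs comparison on large corpora. More bands with fewer rows each make the filter more permissive, and fewer bands with more rows make it stricter.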