Deduplicate Google Calendar events that were created by Fastmail import
-
Updated
Feb 1, 2022 - Python
Entity resolution (also known as data matching, data linkage, record linkage, and many other terms) is the task of finding entities in a dataset that refer to the same entity across different data sources (e.g., data files, books, websites, and databases). Entity resolution is necessary when joining different data sets based on entities that may or may not share a common identifier (e.g., database key, URI, National identification number), which may be due to differences in record shape, storage location, or curator style or preference.
Deduplicate Google Calendar events that were created by Fastmail import
ATBU Cloud/Local Backup & File Integrity/Duplication Management Utility
a collection of image deduplication repositories
Big Data Analysis
A tool to enrich any OCDM compliant Knowledge Graph, finding new identifiers and deduplicating entities.
Implementation of text classification, duplicate question recognition and text deduplication in Python.
Research Project of Image de duplication
Model for data deduplication assignment.
python script to analyze dedup usage in btrfs
Deduplication/backup tool with extremely high 'compression' rate
Removes repeating pages with same page number in PDFs prepared for presentation purposes.
An extension for ASReview Lab to preprocess the dataset before importing in ASReview
A dictionary that de-duplicates values.
RDF Graph Compression Tool. Hash RDF subjects based on a checksum of their triples, effectively consolidating together subjects that contain identical definitions. Reduce time taken to mint URIs. Use Blank Nodes to your Advantage
🧱 blocking methods for entity resolution
Configurable and lightweight backup utility with deduplication and encryption
Deduplicate photos in macOS library (or standalone)
Scan one or more directories for duplicated audio files.
Created by Halbert L. Dunn
Released 1946