deduplication
Entity resolution (also known as data matching, data linkage, record linkage, and many other terms) is the task of finding entities in a dataset that refer to the same entity across different data sources (e.g., data files, books, websites, and databases). Entity resolution is necessary when joining different data sets based on entities that may or may not share a common identifier (e.g., database key, URI, National identification number), which may be due to differences in record shape, storage location, or curator style or preference.
Here are 30 public repositories matching this topic...
Let the goddess keep your data on cloud
-
Updated
Jan 22, 2018 - C
zfs data obfuscator/compressor
-
Updated
Mar 6, 2018 - C
Rust local/cloud backup program, supporting compression, encryption, and deduplication.
-
Updated
Mar 8, 2018 - C
A local/cloud backup program in C, supporting compression, encryption, and deduplication.
-
Updated
Mar 8, 2018 - C
a data deduplication tool for container images
-
Updated
Oct 29, 2018 - C
Continuous data protection for GNU/Linux (cdpfgl).
-
Updated
Mar 15, 2019 - C
A simple tool for cataloging/deduplication/other backup preparation tasks.
-
Updated
Aug 21, 2019 - C
This is a mirror of https://gitlab.com/gob-backup/gob
-
Updated
Mar 1, 2020 - C
A command-line tool for deduplicating entries in a file or stream with constant memory usage
-
Updated
Apr 11, 2022 - C
-
Updated
Aug 2, 2022 - C
Alignment-free FASTQ deduplication.
-
Updated
Oct 18, 2022 - C
SuperREP: huge-dictionary LZ77 preprocessor
-
Updated
Mar 10, 2023 - C
CLI utility to find duplicate files
-
Updated
Jun 7, 2023 - C
Fast block device sync with digest, designed to improve block-based backups.
-
Updated
Jun 15, 2023 - C
Files associated with my blog post on memory deduplication attacks.
-
Updated
Jun 18, 2023 - C
Deduplicating filesystem via Python3, FUSE and SQLite
-
Updated
Jun 27, 2023 - C
Created by Halbert L. Dunn
Released 1946
- Followers
- 37 followers
- Organization
- entity-resolution
- Wikipedia
- Wikipedia