A simple tool for cataloging/deduplication/other backup preparation tasks.
-
Updated
Aug 21, 2019 - C
Entity resolution (also known as data matching, data linkage, record linkage, and many other terms) is the task of finding entities in a dataset that refer to the same entity across different data sources (e.g., data files, books, websites, and databases). Entity resolution is necessary when joining different data sets based on entities that may or may not share a common identifier (e.g., database key, URI, National identification number), which may be due to differences in record shape, storage location, or curator style or preference.
A simple tool for cataloging/deduplication/other backup preparation tasks.
Fast block device sync with digest, designed to improve block-based backups.
Let the goddess keep your data on cloud
User-guided Page Merging: Memory Deduplication for Serverless
Variable-sized block deduplication archival backed by Plan9's venti
A local/cloud backup program in C, supporting compression, encryption, and deduplication.
In-band deduplication via LD_PRELOAD for any filesystem that supports reflinks!
a data deduplication tool for container images
Rust local/cloud backup program, supporting compression, encryption, and deduplication.
zfs data obfuscator/compressor
Files associated with my blog post on memory deduplication attacks.
This is a mirror of https://gitlab.com/gob-backup/gob
A command-line tool for deduplicating entries in a file or stream with constant memory usage
A backup suite. Supports FLZMA2, bzip3, LZ4, Zstandard, LSH i-node ordering deduplicating archiver, long range deduplication, encryption and recovery records
Alignment-free FASTQ deduplication.
Quick and Dirty Deduplication Analyzer
Created by Halbert L. Dunn
Released 1946