A kernel module which provide a pool of deduplicated and/or compressed block storage.
-
Updated
Nov 6, 2024 - C
Entity resolution (also known as data matching, data linkage, record linkage, and many other terms) is the task of finding entities in a dataset that refer to the same entity across different data sources (e.g., data files, books, websites, and databases). Entity resolution is necessary when joining different data sets based on entities that may or may not share a common identifier (e.g., database key, URI, National identification number), which may be due to differences in record shape, storage location, or curator style or preference.
A kernel module which provide a pool of deduplicated and/or compressed block storage.
Ultra fast file archiver that supports data deduplication and differential backups
Userspace tools for managing VDO volumes.
Fast block device sync with digest, designed to improve block-based backups.
A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.
In-band deduplication via LD_PRELOAD for any filesystem that supports reflinks!
Variable-sized block deduplication archival backed by Plan9's venti
Extremely fast tool to remove duplicates and other lint from your filesystem
Quick and Dirty Deduplication Analyzer
User-guided Page Merging: Memory Deduplication for Serverless
A backup suite. Supports FLZMA2, bzip3, LZ4, Zstandard, LSH i-node ordering deduplicating archiver, long range deduplication, encryption and recovery records
Deduplicating filesystem via Python3, FUSE and SQLite
Files associated with my blog post on memory deduplication attacks.
CLI utility to find duplicate files
SuperREP: huge-dictionary LZ77 preprocessor
Alignment-free FASTQ deduplication.
A command-line tool for deduplicating entries in a file or stream with constant memory usage
This is a mirror of https://gitlab.com/gob-backup/gob
Created by Halbert L. Dunn
Released 1946