deduplication
Entity resolution (also known as data matching, data linkage, record linkage, and many other terms) is the task of finding records in a dataset that refer to the same entity across different data sources (e.g., data files, books, websites, and databases). Entity resolution is necessary when joining datasets based on entities that may or may not share a common identifier (e.g., database key, URI, national identification number), which may be due to differences in record shape, storage location, or curator style or preference.
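As a rough illustration of matching records that share no common identifier, the following minimal sketch compares normalized names from two sources and keeps pairs above a similarity threshold. The record data, the function names (`normalize`, `similarity`, `match_records`), and the 0.85 threshold are all assumptions made for this example; real entity-resolution tools typically add blocking, multiple fields, and tuned scoring.

```python
from difflib import SequenceMatcher


def normalize(name: str) -> str:
    """Lowercase, replace punctuation with spaces, and collapse whitespace."""
    cleaned = "".join(ch if ch.isalnum() or ch.isspace() else " " for ch in name.lower())
    return " ".join(cleaned.split())


def similarity(a: str, b: str) -> float:
    """Similarity ratio in [0, 1] between two normalized names."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def match_records(left, right, threshold=0.85):
    """Return (left_id, right_id, score) for pairs whose names look alike.

    Compares every pair, which is only practical for small inputs.
    """
    matches = []
    for left_id, left_name in left:
        for right_id, right_name in right:
            score = similarity(left_name, right_name)
            if score >= threshold:
                matches.append((left_id, right_id, round(score, 2)))
    return matches


# Hypothetical records from two sources whose keys are unrelated.
crm = [("c1", "Maria Gonzalez-Ruiz"), ("c2", "Jane  Smith")]
billing = [("b7", "maria gonzalez ruiz"), ("b9", "John Smythe")]

matches = match_records(crm, billing)
print(matches)
```

The threshold trades missed matches against false matches, so it would normally be chosen by checking a labeled sample rather than fixed in advance.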
Here are 12 public repositories matching this topic...
A systemd service for backing up your root drive to an external disk using borg with support for LVM Snapshots.
Updated Dec 30, 2018 - Shell
Incremental backup automation for LVM-based virtual machines using chunk-based deduplication
Updated Mar 8, 2019 - Shell
Just a simple incremental backup script for Linux using tar, pigz, etc. Useful for Bacula.
Updated Jul 4, 2019 - Shell
Incremental backup via rsync with hard links for instant deduplication. Works on both Linux (tested on Red Hat, openSUSE, and Ubuntu) and FreeBSD (tested on FreeNAS 9.10).
Updated Dec 1, 2022 - Shell
Wrapper for the deduplicating archiver BorgBackup. It simplifies performing everyday tasks on multiple repositories.
Updated Mar 14, 2023 - Shell
Collection of scripts for various backup scenarios.
Updated Jun 12, 2023 - Shell
Quick and dirty backup tool benchmark with reproducible results
Updated Dec 19, 2023 - Shell
(Mirror repository) -- Poda is a tool to find similar and duplicate content between disconnected storage units. It can be used for connected ones too.
Updated May 27, 2024 - Shell
Enable deduplication with non-Synology SSDs and unsupported NAS models
Updated Oct 21, 2024 - Shell
Created by Halbert L. Dunn
Released 1946
- Followers: 38
- Organization: entity-resolution
- Wikipedia