A Python script for generating duplicate data to test the performance of record linkage and master data management systems.
-
Updated
Jun 12, 2024 - Python
A Python script for generating duplicate data to test the performance of record linkage and master data management systems.
Library to provide functions for Securities Master data (aka instrument reference data).
A Python package designed to allow health, biomedical and other researchers to clean (standardise) and deduplicate or link data sets of all sizes faster, with less effort and with improved quality.
Add a description, image, and links to the master-data-management topic page so that developers can more easily learn about it.
To associate your repository with the master-data-management topic, visit your repo's landing page and select "manage topics."