Commit

Merge 626a2d4 into 3e94405
RobinL committed Mar 12, 2022
2 parents 3e94405 + 626a2d4 commit 2fc8c4b
Showing 1 changed file with 6 additions and 1 deletion.
7 changes: 6 additions & 1 deletion README.md
@@ -4,7 +4,12 @@
![issues-status](https://img.shields.io/github/issues-raw/moj-analytical-services/splink)
![python-version-dependency](https://img.shields.io/badge/python-%3E%3D3.6-blue)

# splink: Probabilistic record linkage and deduplication at scale
✨✨ **Note to new users:** ✨✨

Version 3 of Splink is in development; it will make Splink simpler and more intuitive to use. It also removes the need for PySpark for smaller data linkages of up to around 1 million records. You can try it by installing a [pre-release](https://pypi.org/project/splink/#history) or by working through the [new demos](https://github.com/moj-analytical-services/splink_demos/tree/splink3_demos). For new users, it may make sense to start with the new version because it is quicker to learn. Note, however, that the new code is not yet fully tested.


# Splink: Probabilistic record linkage and deduplication at scale

`splink` implements Fellegi-Sunter's canonical model of record linkage in Apache Spark, including the EM algorithm to estimate parameters of the model.
