multilink
is an R
package which performs Bayesian probabilistic
record linkage. It handles the general problem of multifile record
linkage and duplicate detection, where any number of files are to be
linked, and any of the files may have duplicates. multilink
implements
methodology presented in the manuscript “Multifile Partitioning for
Record Linkage and Duplicate Detection” by Serge Aleshin-Guendel and
Mauricio Sadinle, published in the Journal of the American Statistical
Association and
available on arXiv. For more
information on using multilink
, please see the multilink_tutorial
github
repository.
The package is now available on CRAN:
install.packages("multilink")
You can install the development version of multilink from GitHub with:
install.packages("devtools")
devtools::install_github("aleshing/multilink")