Senior Project

Fuzzy Record Linkage

Gregory Smith | Axel Solano

Summary

Our goal is to devise a probabilistic (fuzzy) data-matching algorithm to be used in a client data platform. This platform acts as a central hub for data from multiple sources. This algorithm will generate functionality to evaluate various field values between a master record, and a record to be matched. A metric of confidence of a match will be returned to the master record indicating potential matches.

Note

This program is not suitable for general data matching purposes. This was designed for a use-case specific to our client. However, the underlying methods of matching (similarity calculation & indexing method) are useful for general purposes.

To use in a project

Download the JAR file above (DataMatching_v1.jar) and add it as a dependency in your project. Then simply call the classes and methods as needed.

Name		Name	Last commit message	Last commit date
Latest commit History 162 Commits
docs		docs
project		project
.gitignore		.gitignore
DataMatching_v1.jar		DataMatching_v1.jar
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs

docs

project

project

.gitignore

.gitignore

DataMatching_v1.jar

DataMatching_v1.jar

LICENSE

LICENSE

README.md

README.md

Repository files navigation

Senior Project

Fuzzy Record Linkage

Gregory Smith | Axel Solano

Summary

Note

To use in a project

About

Releases

Packages

Contributors 2

Languages

License

OryGregS/SeniorProject

Folders and files

Latest commit

History

Repository files navigation

Senior Project

Fuzzy Record Linkage

Gregory Smith | Axel Solano

Summary

Note

To use in a project

About

Resources

License

Stars

Watchers

Forks

Languages