bmahaj2/Data-cleaning-and-integration

Data-cleaning-and-integration

• A Python project that analyzes a database containing uncertain and imprecise references (dirty data).
• Cleans the dataset using transformation rules and spelling checks.
• Queries the dataset using Edit Distance and Jaccard Similarity.
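As a rough illustration of the two similarity measures named above, here is a minimal Python sketch. It is not taken from the repository's code: the function names (`edit_distance`, `jaccard_similarity`, `query`), the word-level tokenization, and the thresholds are all assumptions chosen for the example. Edit distance is computed as Levenshtein distance, and Jaccard similarity is computed over sets of lowercase words.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum number of single-character
    insertions, deletions, and substitutions turning a into b."""
    # Keep the shorter string in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution or match
            ))
        previous = current
    return previous[-1]


def jaccard_similarity(a: str, b: str) -> float:
    """Jaccard similarity of the word sets of a and b: |A & B| / |A | B|."""
    tokens_a = set(a.lower().split())
    tokens_b = set(b.lower().split())
    if not tokens_a and not tokens_b:
        return 1.0  # assumption: two empty strings count as identical
    return len(tokens_a & tokens_b) / len(tokens_a | tokens_b)


def query(records, text, max_distance=2, min_jaccard=0.5):
    """Return records close to text under either measure.

    The thresholds are hypothetical defaults, not values from the project.
    """
    matches = []
    for record in records:
        close_by_edits = edit_distance(record.lower(), text.lower()) <= max_distance
        close_by_tokens = jaccard_similarity(record, text) >= min_jaccard
        if close_by_edits or close_by_tokens:
            matches.append(record)
    return matches
```

For example, `query(["Jon Smith", "John Smith", "Jane Doe"], "John Smith")` keeps the misspelled "Jon Smith" alongside the exact match and drops "Jane Doe". Edit distance tends to catch character-level typos, while Jaccard similarity tolerates reordered or missing words, so combining them is one plausible way to query dirty references.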
