Replies: 1 comment
-
Your problem seems very similar to the one here: here is my suggested answer |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I am trying to cluster potential match on address but have not been successful, does Splink work well with Fuzzy matchicing?
Here is my use case:
Company Id, Name, Address, State, Postal Code
10000000, John Medical Services, 2639 Main St, Texas, 65983
10000048, John Medical Services, 2639 Main Street, Texas, 65983
10000056, John Medical Services, 2693 Main Str, Texas, 65983
Is there a way to say that: I set in my config that:
-State, Postal Code should be exact match.
-Address should be: Fuzzy (Levenstein).
And I can get the three records within a same cluster? They're same entity just variation in the address (should be considered potential duplicate, due to variation in the string)
Specify a data linkage model
Beta Was this translation helpful? Give feedback.
All reactions