You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
First I apologize for having to ask a question here.
The question is, is there an example of record linkage MySQL (large dataset)? I have tried about six times to convert the record linkage example to MySQL without success.
Our specific usage scenario is:
We have a large dataset which we have deduped successfully using dedupe (about 500k records).
However, every other week, new messy data is acquired, which we dedupe separately using our trained settings. Now we need to link these two canonical dataset, the first being about 500k records and the second being maybe 1k records. Now as you can see, we cannot use CSV to link such large data (or can we?) So we're desperate for a sample MySQL record linkage.
I am fairly new to Python, started learning it only so I can use this library (Dedupe), but I am not yet that good to understand the nuances of this awesome lib and be able to create the needed MySQL record linkage.
PLEASE HELP!
The text was updated successfully, but these errors were encountered:
First I apologize for having to ask a question here.
The question is, is there an example of record linkage MySQL (large dataset)? I have tried about six times to convert the record linkage example to MySQL without success.
Our specific usage scenario is:
I am fairly new to Python, started learning it only so I can use this library (Dedupe), but I am not yet that good to understand the nuances of this awesome lib and be able to create the needed MySQL record linkage.
PLEASE HELP!
The text was updated successfully, but these errors were encountered: