Skip to content

3 Way Match - Find Best Match - OCR invoice line level detail & Purchase Order Receipt Data #1206

Answered by NickCrews
DeanBek asked this question in Q&A
Discussion options

You must be logged in to vote

If I understand you correctly (link a table of 300 records with another table of 300 records, where each record in A will match exactly one record in B), then this exactly what splink is designed for. You will have to write a bit of custom SQL to implement the comparisons that you are talking about, but it would work.

For the scale that you are talking about, there are only 300x300=90_000 possible comparisons, so honestly I think it might be easier to just write this logic in pure python and skip splink entirely.

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@RobinL
Comment options

Answer selected by RossKen
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants