Replies: 1 comment 1 reply
-
|
Yeah - I don't think there's an easy/clean way to do this in Splink 4. In Splink 5 (forthcoming), we deliberately make each part of the workflow (computing tf tables, computing df_concat_with_tf, blocking, predict) more like a lego block, so this kind of thing is explicitly supported - see here. |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
I'm wondering if there is existing functionality within Splink to score a provided table of edges?
e.g. if I provide a table like
__splink__blocked_id_pairs:a linker could then expand that to
__splink__df_comparison_vectors, and score them to produce adf_predicttable only for those ID pairs?This is useful to me where the records exist within the linker object's input_tables already, and I have a target list of edges I'd like to generate a df_predict for. I'd like to avoid pulling the data out to records and then using
compare_two_records(), I'd just like to score this table of candidate edges.I've bastardised the
inference.predict()function to kind of do what I want by not computing the blocked_id_pairs and instead taking it in as an argument, but:The most important parts are:
Any insights would be welcome :)
Related: #2822
Beta Was this translation helpful? Give feedback.
All reactions