
introduce node_input param #1854

Merged
merged 4 commits into master from eval_multi_node_output on Dec 14, 2021

Conversation

@tstadel (Member) commented Dec 7, 2021

Proposed changes:

  • introduce a node_input parameter to EvaluationResult.calculate_metrics() and EvaluationResult.wrong_examples() (see the usage sketch below)
  • introduce node_input as an eval dataframe column with default value 'prediction'
  • (the node_input value for a 'perfect retriever' would be 'label')
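
A minimal usage sketch of the proposed parameter. It assumes an eval_result already produced by pipeline.eval() on a pipeline containing a node named "Reader"; the node name and variable names are illustrative only:

```python
# Sketch only: `eval_result` is assumed to be an EvaluationResult
# returned by pipeline.eval().

# Default: compute metrics on rows where node_input == 'prediction',
# i.e. each node was fed the real output of its predecessor.
metrics = eval_result.calculate_metrics(node_input="prediction")

# 'Perfect retriever' view: compute metrics on rows where the node
# received the gold labels as input (node_input == 'label').
upper_bound_metrics = eval_result.calculate_metrics(node_input="label")

# The same filter applies when listing the worst examples for a node.
wrong = eval_result.wrong_examples(node="Reader", node_input="prediction")
```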

Status (please check what you already did):

  • First draft (up for discussion & feedback)
  • Final code
  • Added tests
  • Updated documentation

Closes #1850 (Support more than one answer/document output per node in pipeline.eval())

@tstadel marked this pull request as ready for review on December 8, 2021 09:23
@tstadel (Member, Author) commented Dec 8, 2021

@julian-risch @tholor
What do you think about the 'node_input' parameter? To me it sounds more intuitive than anything containing 'open/closed domain' for choosing between the different inputs we evaluated on. However, it might be harder to refer to if we talk about 'open/closed domain' in the docs.

@julian-risch (Member) commented

I like the idea of that parameter. Maybe we can find names for the parameter values that explain both what happens (as of now: with pipeline predictions or with labels) and when to use it (open/closed-domain eval).
However, let's still consider alternatives. For example, we could store separate EvalResults for open- and closed-domain eval. In that case no change would be necessary in the calculate_metrics() and wrong_examples() methods, right? Only in the pipeline.eval() method and in the generation of the eval report. (A sketch of this alternative follows below.)
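
A rough sketch of this alternative, purely for illustration; the two-result return shape is hypothetical and was not what the PR adopted:

```python
# Hypothetical alternative (not implemented): pipeline.eval() returns one
# EvaluationResult per input mode instead of a single result carrying a
# node_input column. `eval_labels` is an assumed list of gold labels.
result_from_predictions, result_from_labels = pipeline.eval(labels=eval_labels)

# calculate_metrics() and wrong_examples() keep their current signatures;
# the caller simply picks which result to inspect.
metrics = result_from_labels.calculate_metrics()
```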

@tholor (Member) commented Dec 13, 2021

What would the signature of pipeline.eval() look like with this parameter design?
Would we, by default, always create both predictions in the dataframe (from prediction and from label)?
I am just thinking where else this node_input param might appear and if it sounds intuitive in these places 🤔


> For example, we could store separate EvalResults for open- and closed-domain eval. In that case no change would be necessary in the calculate_metrics() and wrong_examples() methods, right? Only in the pipeline.eval() method and in the generation of the eval report.

I think there should be at least the option to run pipeline.eval() only once and have all results you need available. Returning two dataframes in this case sounds a bit unintuitive to me.

@tstadel (Member, Author) commented Dec 13, 2021

> What would the signature of pipeline.eval() look like with this parameter design?
> Would we, by default, always create both predictions in the dataframe (from prediction and from label)?
> I am just thinking where else this node_input param might appear and if it sounds intuitive in these places

Yes, node_input might not be what we want to pass to pipeline.eval(). There we might rather pass a flag that enables 'label' node_input evaluation, with 'prediction' only as the default. So node_input is designed to be an advanced param that you don't need in the most common workflow: call pipeline.eval() and pass the result to pipeline.print_eval_report(); if there are 'label' node_input rows, they'll be used, otherwise not. (See the sketch below.)
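
A sketch of that common workflow under this design. The flag name add_label_input is invented here for illustration and is not part of this PR:

```python
# Hypothetical flag name (invented for this sketch): add_label_input.
eval_result = pipeline.eval(
    labels=eval_labels,     # gold labels to evaluate against (assumed)
    add_label_input=True,   # additionally evaluate nodes on gold-label inputs
)

# The report picks up 'label' node_input rows automatically if present;
# no node_input argument is needed in the common case.
pipeline.print_eval_report(eval_result)
```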

@julian-risch (Member) left a comment

Looks good to me (as already discussed via call). 👍

@tstadel merged commit 57a0463 into master on Dec 14, 2021
@tstadel deleted the eval_multi_node_output branch on December 14, 2021 09:34