refactor: Generate eval result in separate method #5001
Conversation
Pull Request Test Coverage Report for Build 5067539591
💛 - Coveralls
```python
        custom_document_id_field: Optional[str] = None,
        context_matching_min_length: int = 100,
        context_matching_boost_split_overlaps: bool = True,
        context_matching_threshold: float = 65.0,
```
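For context on what these knobs do, here is a minimal sketch of threshold-based fuzzy context matching on a 0–100 similarity scale; the helper name and the use of rapidfuzz are assumptions for illustration, not code from this PR:

```python
from rapidfuzz import fuzz

def contexts_match(predicted: str, gold: str, threshold: float = 65.0) -> bool:
    # Fuzzy similarity is scored on a 0-100 scale; two contexts count as a
    # match once the score reaches the threshold (65.0 mirrors the default
    # context_matching_threshold above).
    return fuzz.partial_ratio(predicted, gold) >= threshold
```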
Out of curiosity - where did these defaults like context_matching_threshold and their values come from?
They come from eval_batch.
LGTM 🚀
```python
        context_matching_boost_split_overlaps: bool = True,
        context_matching_threshold: float = 65.0,
        use_auth_token: Optional[Union[str, bool]] = None,
    ) -> EvaluationResult:
        eval_result = EvaluationResult()
        if add_isolated_node_eval:
```
this would need to happen before we do the batch evaluation
@bogdankostic This causes isolated evaluation to be omitted
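To illustrate the ordering concern (all names below are assumed, not taken from the PR): the isolated-node flag changes what the batch run produces, so checking it only while assembling the result silently drops isolated evaluation.

```python
# Sketch: the flag must shape the batch run itself, not the post-processing.

def run_batch(labels, params):
    # Stand-in for the pipeline's batch run; isolated predictions only exist
    # if the flag was passed in at run time.
    preds = {"integrated": labels}
    if params.get("add_isolated_node_eval"):
        preds["isolated"] = labels  # nodes also run on gold labels as input
    return preds

def eval_batch(labels, add_isolated_node_eval: bool = False):
    params = {"add_isolated_node_eval": True} if add_isolated_node_eval else {}
    predictions = run_batch(labels, params)  # flag applied BEFORE the run
    return predictions  # turned into an EvaluationResult afterwards (omitted)
```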
Related Issues
Proposed Changes:
This PR creates a separate method for generating eval results from batch predictions.
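In outline, the shape of the change looks roughly like the following; the method names and trimmed signatures are illustrative, not necessarily the ones used in the PR:

```python
class EvaluationResult:  # stub for illustration
    pass

class Pipeline:
    def run_batch(self, labels):
        return []  # stand-in for the expensive batch inference

    def eval_batch(
        self, labels, context_matching_threshold: float = 65.0
    ) -> EvaluationResult:
        # Public entry point keeps its signature; result generation is delegated.
        predictions = self.run_batch(labels)
        return self._generate_eval_result(
            predictions, context_matching_threshold=context_matching_threshold
        )

    def _generate_eval_result(
        self, predictions, context_matching_threshold: float = 65.0
    ) -> EvaluationResult:
        # Metric calculation lives here; its defaults mirror eval_batch's,
        # which is where values like 65.0 come from (see the review thread).
        return EvaluationResult()
```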
How did you test it?
Manual tests + CI
Notes for the reviewer
This is needed for benchmarking: we want performance metrics for Pipelines, but we also want to measure pure inference time without the overhead of calculating the metrics.
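A hedged sketch of how the split enables this (the harness below is hypothetical): inference is timed on its own, and metric calculation happens outside the timed region.

```python
import time

def benchmark(pipeline, labels):
    start = time.perf_counter()
    predictions = pipeline.run_batch(labels)  # timed: pure inference
    inference_seconds = time.perf_counter() - start

    # Metrics are computed afterwards and do not pollute the timing.
    eval_result = pipeline._generate_eval_result(predictions)
    return inference_seconds, eval_result
```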
Checklist
The PR title uses one of the conventional commit types: fix:, feat:, build:, chore:, ci:, docs:, style:, refactor:, perf:, test:.