refactor: Adapt running benchmarks #5007

bogdankostic · 2023-05-23T22:26:18Z

Related Issues

fixes Adapt benchmarking script #4902

Related PRs

depends on refactor: Generate eval result in separate method #5001
depends on refactor: Adapt benchmarking utils #5003
depends on refactor: Adapt retriever benchmarks script #5004
depends on refactor: Adapt reader benchmarks #5005
depends on refactor: Add reader-retriever benchmark script #5006

Proposed Changes:

This PR adapts the way benchmarks can be run.
run.py takes as input a config file and optionally an output path.
The config file is a Pipeline YAML file containing a querying pipeline and, in case the querying pipeline contains a retriever, an indexing pipeline. Additionally, beside the pipeline configuration, the config file contains a section benchmark_config where the following information should be specified:

labels_file: A SQuAD-formatted json or csv containing the labels to benchmark on.
documents_directory: A Path to a directory containing the files that should be indexed into the document store (Only needed for retriever and retriever-reader pipelines.)
data_url: Optionally. Allows to download data from this URL and save it in the directory data/.

How did you test it?

Manual tests.

Notes for the reviewer

Checklist

I have read the contributors guidelines and the code of conduct
I have updated the related issue with new insights and changes
I added tests that demonstrate the correct behavior of the change
I've used one of the conventional commit types for my PR title: fix:, feat:, build:, chore:, ci:, docs:, style:, refactor:, perf:, test:.
I documented my code
I ran pre-commit hooks and fixed any issue

coveralls · 2023-05-23T23:12:20Z

Pull Request Test Coverage Report for Build 5092460742

0 of 0 changed or added relevant lines in 0 files are covered.
No unchanged relevant lines lost coverage.
Overall coverage remained the same at 39.621%

Totals
Change from base Build 5090502610:	0.0%
Covered Lines:	8876
Relevant Lines:	22402

💛 - Coveralls

… adapt_retriever_benchmarks

…nchmarks

…_benchmarks

…_run

…etriever

test/benchmarks/run.py

vblagoje

🚀

bogdankostic added 10 commits May 23, 2023 23:10

Generate eval result in separate method

74270a3

Adapt benchmarking utils

eeccb9e

Merge branch 'adapt_benchmarking_utils' into adapt_retriever_benchmarks

791f7fa

Adapt running retriever benchmarks

bc05e47

Adapt error message

766435d

Merge branch 'adapt_benchmarking_utils' into adapt_retriever_benchmarks

62dd309

Adapt running reader benchmarks

17c2ba9

Adapt retriever reader benchmark script

e409360

Merge branch 'adapt_reader_benchmarks' into adapt_benchmark_run

f13fd2f

Adapt running benchmarks script

9ff1ba6

github-actions bot added topic:pipeline topic:tests type:documentation Improvements on the docs labels May 23, 2023

bogdankostic added 4 commits May 24, 2023 00:37

Merge branch 'main' into adapt_retriever_benchmarks

83796ba

Merge branch 'main' into adapt_reader_benchmarks

7dad35f

Merge branch 'main' into adapt_reader-retriever_benchmarks

4766da4

Merge branch 'main' into adapt_benchmark_run

fd14ea2

bogdankostic mentioned this pull request May 23, 2023

Adapt benchmarking script #4902

Closed

bogdankostic and others added 11 commits May 24, 2023 23:46

Adapt README.md

a9bf07b

Merge branch 'main' into adapt_retriever_benchmarks

272f84c

Merge branch 'main' into adapt_reader_benchmarks

ef4b287

Raise error if file doesn't exist

a6c1996

Merge remote-tracking branch 'origin/adapt_retriever_benchmarks' into…

e6fd0db

… adapt_retriever_benchmarks

Merge branch 'main' into adapt_retriever_benchmarks

b14edfd

Raise error if path doesn't exist or is a directory

e8f73de

Merge remote-tracking branch 'origin/adapt_retriever_benchmarks' into…

5c28c02

… adapt_retriever_benchmarks

minor readme update

4e6a91b

Merge branch 'adapt_reader_benchmarks' into adapt_reader-retriever_be…

5e9b3ae

…nchmarks

Merge branch 'adapt_retriever_benchmarks' into adapt_reader-retriever…

fe5d710

…_benchmarks

Merge branch 'main' into adapt_benchmark_run

6adc955

github-actions bot removed the topic:pipeline label May 25, 2023

bogdankostic added 3 commits May 25, 2023 15:58

Merge branch 'adapt_retriever_benchmarks' into adapt_benchmark_run

b5b3ea1

Merge branch 'adapt_reader-retriever_benchmarks' into adapt_benchmark…

411b6d3

…_run

Merge branch 'main' into adapt_benchmark_run

512b414

bogdankostic marked this pull request as ready for review May 26, 2023 11:57

bogdankostic requested review from a team as code owners May 26, 2023 11:57

bogdankostic requested review from silvanocerza and vblagoje and removed request for a team and silvanocerza May 26, 2023 11:57

Create separate methods for checking if pipeline contains reader or r…

1f9a6ec

…etriever

vblagoje reviewed May 26, 2023

View reviewed changes

test/benchmarks/run.py Outdated Show resolved Hide resolved

Fix reader pipeline case

20173fb

vblagoje approved these changes May 26, 2023

View reviewed changes

bogdankostic merged commit b8ff105 into main May 26, 2023
47 checks passed

bogdankostic deleted the adapt_benchmark_run branch May 26, 2023 16:48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor: Adapt running benchmarks #5007

refactor: Adapt running benchmarks #5007

bogdankostic commented May 23, 2023 •

edited

Loading

coveralls commented May 23, 2023 •

edited

Loading

vblagoje left a comment

refactor: Adapt running benchmarks #5007

refactor: Adapt running benchmarks #5007

Conversation

bogdankostic commented May 23, 2023 • edited Loading

Related Issues

Related PRs

Proposed Changes:

How did you test it?

Notes for the reviewer

Checklist

coveralls commented May 23, 2023 • edited Loading

Pull Request Test Coverage Report for Build 5092460742

💛 - Coveralls

vblagoje left a comment

Choose a reason for hiding this comment

bogdankostic commented May 23, 2023 •

edited

Loading

coveralls commented May 23, 2023 •

edited

Loading