feat: Add notebook for RAG eval harness #11

shadeMe · 2024-06-06T13:20:32Z

Proposed Changes:

Add a notebook to showcase the RAG evaluation harness.

Notes for the reviewer

Depends on the following: deepset-ai/haystack#7818

Without the above PR, the execution of the harness will break since it attempts to serialize the evaluation pipeline. To test it locally, either pull the above PR and build haystack-ai locally or remove the answer faithfulness metric from the harness.

Checklist

I have read the contributors guidelines and the code of conduct
I have updated the related issue with new insights and changes
I added unit tests and updated the docstrings
I've used one of the conventional commit types for my PR title: fix:, feat:, build:, chore:, ci:, docs:, style:, refactor:, perf:, test:.
I documented my code
I ran pre-commit hooks and fixed any issue

coveralls · 2024-06-06T13:24:16Z

Pull Request Test Coverage Report for Build 9401655773

Details

20 of 22 (90.91%) changed or added relevant lines in 1 file are covered.
No unchanged relevant lines lost coverage.
Overall coverage remained the same at 97.764%

Changes Missing Coverage	Covered Lines	Changed/Added Lines	%
haystack_experimental/evaluation/harness/rag/harness.py	20	22	90.91%

Totals
Change from base Build 9352626696:	0.0%
Covered Lines:	612
Relevant Lines:	626

💛 - Coveralls

TuanaCelik · 2024-06-07T12:30:48Z

PSA, you can view the notebook here: https://colab.research.google.com/github/shadeMe/haystack-experimental/blob/feat/rag-eval-harness-notebook/examples/rag_eval_harness.ipynb

davidsbatista · 2024-06-10T09:04:22Z

I think the code is already there, and it shows how to use it. Nevertheless, this notebook needs some improvement regarding "documentation" and the code's organisation. Here are my comments:

Create an intro section shortly describing what the notebook is about
Do not assume anything is installed; explicitly state what dependencies are needed and the commands to install them. (I think you have most of it already there.)
Add textual descriptions/instructions to each section/snippet (i.e.: dataset prep., indexing, retrieval, eval., results analysis), instead of just the title
Make sure to have any relevant links to Documentation and/or blog posts related to the snippets/sections if needed
Move the imports to their related sections/snippets of code instead of having everything at the beginning
See some of the tutorials here for guidance: https://github.com/deepset-ai/haystack-tutorials?tab=readme-ov-file
One example: https://github.com/deepset-ai/haystack-tutorials/blob/main/tutorials/39_Embedding_Metadata_for_Improved_Retrieval.ipynb

davidsbatista · 2024-06-10T10:52:00Z

just noticed that the keyword_eval_harness is not used anywhere

davidsbatista · 2024-06-11T09:54:05Z

PSA, you can view the notebook here: https://colab.research.google.com/github/shadeMe/haystack-experimental/blob/feat/rag-eval-harness-notebook/examples/rag_eval_harness.ipynb

@TuanaCelik, did you manage to run this collab/notebook?

davidsbatista · 2024-06-11T09:54:50Z

@shadeMe can you open another PR only with the pytoml ?

coveralls · 2024-06-13T11:02:01Z

Pull Request Test Coverage Report for Build 9498393193

Details

20 of 22 (90.91%) changed or added relevant lines in 1 file are covered.
No unchanged relevant lines lost coverage.
Overall coverage remained the same at 97.764%

Changes Missing Coverage	Covered Lines	Changed/Added Lines	%
haystack_experimental/evaluation/harness/rag/harness.py	20	22	90.91%

Totals
Change from base Build 9465387731:	0.0%
Covered Lines:	612
Relevant Lines:	626

💛 - Coveralls

shadeMe requested a review from a team as a code owner June 6, 2024 13:20

shadeMe requested review from julian-risch, davidsbatista, bilgeyucel and a team and removed request for a team and julian-risch June 6, 2024 13:20

shadeMe force-pushed the feat/rag-eval-harness-notebook branch from 29d2bfc to 7b48f64 Compare June 13, 2024 10:57

shadeMe force-pushed the feat/rag-eval-harness-notebook branch from 7b48f64 to abd14f5 Compare July 5, 2024 11:21

shadeMe marked this pull request as draft July 5, 2024 11:21

shadeMe force-pushed the feat/rag-eval-harness-notebook branch from abd14f5 to 81f1bba Compare July 5, 2024 13:27

feat: Add notebook for RAG eval harness

8a2a3eb

shadeMe force-pushed the feat/rag-eval-harness-notebook branch from 81f1bba to 8a2a3eb Compare July 8, 2024 11:17

shadeMe marked this pull request as ready for review July 8, 2024 11:17

bilgeyucel approved these changes Jul 8, 2024

View reviewed changes

shadeMe merged commit 7592cc5 into deepset-ai:main Jul 8, 2024

shadeMe deleted the feat/rag-eval-harness-notebook branch July 8, 2024 11:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Add notebook for RAG eval harness #11

feat: Add notebook for RAG eval harness #11

shadeMe commented Jun 6, 2024 •

edited

Loading

coveralls commented Jun 6, 2024 •

edited

Loading

TuanaCelik commented Jun 7, 2024

davidsbatista commented Jun 10, 2024

davidsbatista commented Jun 10, 2024

davidsbatista commented Jun 11, 2024

davidsbatista commented Jun 11, 2024

coveralls commented Jun 13, 2024 •

edited

Loading

feat: Add notebook for RAG eval harness #11

feat: Add notebook for RAG eval harness #11

Conversation

shadeMe commented Jun 6, 2024 • edited Loading

Proposed Changes:

Notes for the reviewer

Checklist

coveralls commented Jun 6, 2024 • edited Loading

Pull Request Test Coverage Report for Build 9401655773

Details

💛 - Coveralls

TuanaCelik commented Jun 7, 2024

davidsbatista commented Jun 10, 2024

davidsbatista commented Jun 10, 2024

davidsbatista commented Jun 11, 2024

davidsbatista commented Jun 11, 2024

coveralls commented Jun 13, 2024 • edited Loading

Pull Request Test Coverage Report for Build 9498393193

Details

💛 - Coveralls

shadeMe commented Jun 6, 2024 •

edited

Loading

coveralls commented Jun 6, 2024 •

edited

Loading

coveralls commented Jun 13, 2024 •

edited

Loading