Rag Evaluation #24
Conversation
Commits:

- …add rag evaluation to nav
- …eflect chosen palette
- …d table, add SDK example
# RAG Evaluation

## Introduction
> **Review comment:** You could also drop this heading.
RAG (Retrieval-Augmented Generation) is a way of building AI models that enhances their ability to generate accurate and contextually relevant responses by combining two main steps: **retrieval** and **generation**.
1. **Retrieval**: The model first searches through a large set of documents or pieces of information to "retrieve" the most relevant ones based on the user query.
> **Review comment:** I would add: "from a specific knowledge base defined by the system designer".
2. **Generation**: It then uses these retrieved documents as context to generate a response, which is typically more accurate and aligned with the question than if it had generated text from scratch without specific guidance.
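To make the two steps concrete, here is a minimal sketch of a retrieve-then-generate loop. It is an illustration only: `embed` and `generate` are hypothetical stand-ins for an embedding model and an LLM call, not part of any specific SDK.

```python
# A minimal sketch of the retrieve-then-generate loop described above.
# `embed` and `generate` are hypothetical stand-ins for an embedding
# model and an LLM call; they are not part of any specific SDK.
from typing import Callable, List, Sequence


def retrieve(
    query: str,
    documents: Sequence[str],
    embed: Callable[[str], List[float]],
    top_k: int = 3,
) -> List[str]:
    """Step 1 (Retrieval): rank documents by similarity to the query."""

    def dot(a: List[float], b: List[float]) -> float:
        return sum(x * y for x, y in zip(a, b))

    query_vec = embed(query)
    ranked = sorted(documents, key=lambda d: dot(embed(d), query_vec), reverse=True)
    return ranked[:top_k]


def answer(
    query: str,
    documents: Sequence[str],
    embed: Callable[[str], List[float]],
    generate: Callable[[str], str],
) -> str:
    """Step 2 (Generation): condition the LLM on the retrieved context."""
    context = retrieve(query, documents, embed)
    prompt = "Context:\n" + "\n".join(context) + f"\n\nQuestion: {query}\nAnswer:"
    return generate(prompt)
```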
Evaluating RAG involves assessing how well the model does in both retrieval and generation.

> **Review comment:** I would add a short motivation for why evaluation is important, and after the table I would close the introduction.

The RAG evaluation module analyzes the three main components of a RAG framework:

> **Review comment:** It would be nice to add an example block or an additional column here. Your choice.
| Component | Description |
| --------- | ----------- |
| Context   | The retrieved documents or information that the model uses to generate a response. |
| Response  | The generated answer or output provided by the model. |
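For illustration, a single evaluated interaction could be represented like this; the field names and values are hypothetical, not a prescribed schema.

```python
# Hypothetical example of the components for a single interaction;
# field names and values are illustrative, not a prescribed schema.
rag_sample = {
    "user_query": "What is the warranty period for model X?",
    "context": [  # retrieved chunks, in retrieval order
        "Model X ships with a 24-month limited warranty.",
        "Warranty claims require a proof of purchase.",
    ],
    "response": "Model X is covered by a 24-month limited warranty.",
}
```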
In particular, the analysis is performed on the relationships between these components:
> **Review comment:** From here I would start a new section that talks explicitly about our module. I would open by saying that it is available only for our Tasks (with a link) of RAG type, and that it is structured as an evaluation report over a set of data. I would add an info block explaining that it can be run both from the web app and from the SDK. Then I would pick the thread back up with the sentence from this comment ("In particular, the analysis…"). I would also add something about the report being viewable in the web app and exportable to Excel.
<figure>
  <figcaption>ML cube Platform RAG Evaluation</figcaption>
</figure>
The evaluation is performed through an LLM-as-a-Judge approach, where a Large Language Model (LLM) acts as a judge to evaluate the quality of a RAG model.
> **Review comment:** We could omit this, so as not to reveal too much about the approach we use.
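As a rough illustration of the LLM-as-a-Judge idea (not the platform's actual implementation), a judge model can be prompted to score a response on a fixed scale. In this sketch, `llm` is a hypothetical stand-in for any chat-completion call that returns a string.

```python
import json
from typing import Callable


def judge_satisfaction(llm: Callable[[str], str], query: str, response: str) -> int:
    """Ask a judge LLM for a 1-5 satisfaction rating of a generated response."""
    prompt = (
        "You are an impartial judge. Rate how satisfied a user would be with "
        "the response, from 1 (does not address the query) to 5 (fully "
        "answers the query).\n"
        f"Query: {query}\n"
        f"Response: {response}\n"
        'Reply with JSON only: {"satisfaction": <1-5>}'
    )
    # `llm` is a hypothetical stand-in for any chat-completion call
    # that returns the model's reply as a string.
    return int(json.loads(llm(prompt))["satisfaction"])
```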
| Metric      | Description | Score Range (Lowest-Highest) |
| ----------- | ----------- | ---------------------------- |
| Utilization | The percentage of the retrieved context that contains information for the response. A higher utilization score indicates that more of the retrieved context is useful for generating the response. | 0-100 |
| Attribution | Which of the chunks of the retrieved context can be used to generate the response. | List of indices of the used chunks; the first chunk has index 1 |
| !!! example |
> **Review comment:** This is a note block rather than an example.
| Metric       | Description | Score Range (Lowest-Highest) |
| ------------ | ----------- | ---------------------------- |
| Satisfaction | How satisfied the user would be with the generated response. A low score indicates a response that does not address the user query; a high score indicates a response that fully addresses and answers the user query. | 1-5 |
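Putting the metrics together, one evaluated sample might yield a record like the following; the values and the record layout are purely illustrative.

```python
# Illustrative result for one evaluated sample, combining the metrics above;
# the values and the record layout are hypothetical.
evaluation_result = {
    "satisfaction": 4,      # 1-5: the response largely answers the query
    "utilization": 62.5,    # 0-100: share of the retrieved context that was useful
    "attribution": [1, 3],  # 1-based indices of the chunks supporting the response
}
```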
> **Review comment:** At the end of the section, I would add an example of a query.