[patch] Reduce NaN Occurrences by Simple Prompt Modification for JSON Output for context_precision #581
Overview

During the calculation of `context_precision`, an issue was observed where increasing the amount of context led to a surge in NaN occurrences. By comparison, `context_recall` does not exhibit this problem. An investigation into the cause of the difference uncovered that the issue stems from whether the prompt specifies outputting in JSON format.
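The failure mode can be illustrated with a minimal sketch (the `parse_verdict` helper and the `{"verdict": ...}` response shape are hypothetical, not ragas's actual implementation): when the model replies in free text instead of JSON, parsing fails and the score surfaces as NaN.

```python
import json
import math

def parse_verdict(output: str) -> float:
    """Hypothetical parser: expects a model reply like {"verdict": 1}.

    Free-text replies (common when the prompt never asks for JSON)
    fail to parse and surface as NaN in the aggregated metric.
    """
    try:
        return float(json.loads(output)["verdict"])
    except (json.JSONDecodeError, KeyError, TypeError, ValueError):
        return float("nan")

print(parse_verdict('{"verdict": 1}'))                   # 1.0
print(math.isnan(parse_verdict("Yes, it was useful.")))  # True
```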
Discovery

It was found that simply specifying JSON output for `context_precision`, as is already done for `context_recall`, significantly reduces the incidence of NaN. Utilizing JSON mode appears to be crucial, as noted in the OpenAI reference for text generation in JSON mode.
Solution

To align with best practices and address the NaN generation issue, I propose updating the prompt for `context_precision` to explicitly instruct the model to generate its output in JSON format. This small but impactful change brings `context_precision` in line with how `context_recall` operates and ensures more stable and predictable outcomes when handling larger context volumes.
Impact

By making this explicit switch to JSON output, we not only follow the guideline provided by OpenAI but also prevent the malformed, unparseable responses that currently surface as a flood of NaN values. This improvement should increase the reliability of metric calculations and significantly reduce the time spent debugging NaN-related issues.
I look forward to your review and approval of this change, which will help us maintain robustness in our context precision calculations.
Best,
i-w-a