fix groundedness with no supporting evidence #1193

nicoloboschi · 2024-06-11T08:01:10Z

Items to add to release announcement:

Heading: fix groundedness feedback with no supporting evidence

Currently if you use feedback groundedness_measure_with_cot_reasons and one of the hypothesis is returning no supporting evidence, the feedback is completely broken and not even the score is visible. This is due to the fact this feedback implies to always have a "reason" behind the score.

🚀	This description was created by Ellipsis for commit `1c9a729`

Summary:

This PR fixes an issue in the groundedness_measure_with_cot_reasons feedback function to handle cases with no supporting evidence without breaking.

Key points:

Modified evaluate_hypothesis function in Provider class to handle missing 'reason' key gracefully.
Ensures feedback system remains operational even when no supporting evidence is provided.

Generated with ❤️ by ellipsis.dev

ellipsis-dev

👍 Looks good to me! Reviewed everything up to 1c9a729 in 33 seconds

More details

Looked at 15 lines of code in 1 files
Skipped 0 files when reviewing.
Skipped posting 1 drafted comments based on config settings.

1. trulens_eval/trulens_eval/feedback/provider/base.py:1225

Draft comment:
The change effectively handles cases where no 'reason' is provided, preventing potential KeyErrors and ensuring the feedback system remains robust. This is a necessary fix for the groundedness feedback functionality.
Reason this comment was not posted:
Confidence changes required: 0%
The PR aims to address a bug where the feedback system breaks if no supporting evidence is provided for a statement's groundedness. The change checks if the 'reason' key exists in the 'reason' dictionary before attempting to access it, which prevents a KeyError and allows the feedback system to handle cases where no reason is provided. This is a crucial fix for robustness in feedback generation.

Workflow ID: wflow_x9g9iRRlgxs4Fudq

You can customize Ellipsis with 👍 / 👎 feedback, review rules, user-specific overrides, quiet mode, and more.

trulens_eval/trulens_eval/feedback/provider/base.py

epinzur · 2024-06-11T12:00:04Z

i wonder if when running in deferred mode, this change would prevent trulens from seeing this as a failed evaluation and then would not re-run it.

nicoloboschi · 2024-06-12T07:24:23Z

i wonder if when running in deferred mode, this change would prevent trulens from seeing this as a failed evaluation and then would not re-run it.

It might be as side effect. Anyway all other feedback providers don't fail if the reason can't be found so I think it's better to align to the others' behaviour

* fix groundedness with no supporting evidence * use reason not generated --------- Co-authored-by: Josh Reini <josh.reini@snowflake.com>

fix groundedness with no supporting evidence

1c9a729

dosubot bot added the size:XS This PR changes 0-9 lines, ignoring generated files. label Jun 11, 2024

ellipsis-dev bot reviewed Jun 11, 2024

View reviewed changes

epinzur reviewed Jun 11, 2024

View reviewed changes

trulens_eval/trulens_eval/feedback/provider/base.py Outdated Show resolved Hide resolved

use reason not generated

5643374

Merge branch 'main' into fix-no-supporting-evidence

b1458ff

sfc-gh-jreini merged commit f24aeb2 into truera:main Jun 13, 2024
9 checks passed

sfc-gh-jreini mentioned this pull request Jun 21, 2024

0.32.0 release #1240

Merged

sfc-gh-dhuang pushed a commit that referenced this pull request Jun 28, 2024

fix groundedness with no supporting evidence (#1193)

d9a58a6

* fix groundedness with no supporting evidence * use reason not generated --------- Co-authored-by: Josh Reini <josh.reini@snowflake.com>

sfc-gh-dhuang pushed a commit that referenced this pull request Jul 1, 2024

fix groundedness with no supporting evidence (#1193)

7ef4818

* fix groundedness with no supporting evidence * use reason not generated --------- Co-authored-by: Josh Reini <josh.reini@snowflake.com>

sfc-gh-chu pushed a commit that referenced this pull request Sep 25, 2024

fix groundedness with no supporting evidence (#1193)

62508c1

* fix groundedness with no supporting evidence * use reason not generated --------- Co-authored-by: Josh Reini <josh.reini@snowflake.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix groundedness with no supporting evidence #1193

fix groundedness with no supporting evidence #1193

nicoloboschi commented Jun 11, 2024 •

edited by ellipsis-dev bot

Loading

ellipsis-dev bot left a comment

epinzur commented Jun 11, 2024

nicoloboschi commented Jun 12, 2024

fix groundedness with no supporting evidence #1193

fix groundedness with no supporting evidence #1193

Conversation

nicoloboschi commented Jun 11, 2024 • edited by ellipsis-dev bot Loading

Summary:

ellipsis-dev bot left a comment

Choose a reason for hiding this comment

epinzur commented Jun 11, 2024

nicoloboschi commented Jun 12, 2024

nicoloboschi commented Jun 11, 2024 •

edited by ellipsis-dev bot

Loading