Add Vectara Hallucination Detection Model #950

Josephrp · 2024-03-02T11:53:20Z

Added vectara hallucination detection model to the huggingface class

adding an exposition/model example using huggingface models end to end for demo

this is a draft PR , just need to add some text to explain the examples we have chosen

	🚀 This PR description was created by Ellipsis for commit `4f3353e`.

Summary:

The PR adds a new method hallucination_evaluator to the Huggingface class in hugs.py for evaluating the hallucination score of a combined input of two statements using the Huggingface hallucination evaluation model.

Key points:

Added hallucination_evaluator method to Huggingface class in hugs.py.
The method uses the HUGS_HALLUCINATION_API_URL endpoint for the hallucination evaluation model from Huggingface.
The method takes two arguments: model_output and retrieved_text_chunks, combines them, and sends a POST request to the API.
The response is parsed to extract the hallucination score.

Generated with ❤️ by ellipsis.dev

review-notebook-app · 2024-03-02T11:53:25Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

with all my thanks !

ellipsis-dev

❌ Changes requested.

Reviewed the entire pull request up to 4f3353e
Looked at 61 lines of code in 1 files
Took 1 minute and 3 seconds to review

More info

Skipped 4 files when reviewing.
Skipped posting 1 additional comments because they didn't meet confidence threshold of 50%.

1. trulens_eval/trulens_eval/feedback/provider/hugs.py:485:

Assessed confidence : 100%
Grade: 40%
Comment:
The method hallucination_evaluator does not handle the case when the response from the API is not a list and not a proper HTTP response. This could lead to unexpected behavior. Consider adding an else clause to handle this case.
Reasoning:
The new method hallucination_evaluator is not handling the case when the response from the API is not a list and not a proper HTTP response. This could lead to unexpected behavior.

Workflow ID: wflow_I5ako30SCD4tU8DH

Want Ellipsis to fix these issues? Tag @ellipsis-dev in a comment. We'll respond in a few minutes. Learn more here.

trulens_eval/trulens_eval/feedback/provider/hugs.py

joshreini1 · 2024-03-04T20:21:58Z

trulens_eval/examples/expositional/models/hugging__Face_.ipynb

@@ -0,0 +1,300 @@
+{


Vectra -> Vectara?

Please remove the stray "or."

Reply via ReviewNB

joshreini1 · 2024-03-04T20:23:36Z

Can you remove the .ipynb checkpoints from this change?

Also can you elaborate on why there are two separate notebooks to show this capability?

joshreini1 · 2024-03-04T20:27:21Z

trulens_eval/examples/expositional/models/hugging__Face_.ipynb

@@ -0,0 +1,300 @@
+{


since this notebook is focused on the Vectara HHEM evaluator, can you rename the notebook to reflect that?

Suggestion: vectara_hallucination_evaluator.ipynb

Reply via ReviewNB

joshreini1 · 2024-03-04T20:27:21Z

trulens_eval/examples/expositional/models/hugging__Face_.ipynb

@@ -0,0 +1,300 @@
+{


It'd be useful here to show usage of this evaluator as part of a recorded app, e.g. as shown in https://www.trulens.org/trulens_eval/langchain_quickstart/

Reply via ReviewNB

joshreini1 · 2024-03-04T20:27:21Z

trulens_eval/examples/expositional/use_cases/vectra_hallucination_evaluation_model.ipynb

@@ -0,0 +1,260 @@
+{


This notebook seems similar to the one under /models - what is the distinction?

Reply via ReviewNB

joshreini1 · 2024-03-04T20:27:55Z

Thanks for all the work on this @MN-Noor @Josephrp - this is close! Just added a few small comments before it can go in

Josephrp · 2024-03-09T12:53:27Z

hi @MN-Noor would you like to take the last reviews above ?

@joshreini1 , thanks for the comments, should we keep both notebooks renamed or remove one or the other?

sorry to bother you but i'll be able to wrap this up :-)

joshreini1 · 2024-03-09T13:48:49Z

@Josephrp keep the one in /models and remove the other. Thanks!

Josephrp · 2024-03-13T17:26:03Z

@joshreini1 thanks ! @MN-Noor wrapped it up nicely , hope that's us done - until next time !

joshreini1 · 2024-03-23T16:30:11Z

@Josephrp @MN-Noor do you have a Twitter/x account? I’ll shoutout from the TruLens handle if so!

Josephrp · 2024-03-23T16:49:00Z

ha! i sure do , here's me .
Noor's doesnt work well where she is , so that's why she prefers linked-in

hope that's okay :-)

MN-Noor added 2 commits February 29, 2024 18:27

added vectra hallucination feedback function in hugs.py

0281a22

rag notebook to evaluate responces using vectra HHEM

c62a20f

tonic and others added 5 commits March 2, 2024 13:09

Create hugging__Face_.ipynb

0649db9

with all my thanks !

formated hhem notebook

405875f

formatting

4011ac3

moved hhem notebook

81e71ef

Merge branch 'main' into main

4f3353e

Josephrp marked this pull request as ready for review March 2, 2024 15:21

dosubot bot added size:XXL This PR changes 1000+ lines, ignoring generated files. documentation Improvements or additions to documentation labels Mar 2, 2024

ellipsis-dev bot reviewed Mar 2, 2024

View reviewed changes

trulens_eval/trulens_eval/feedback/provider/hugs.py Show resolved Hide resolved

MN-Noor added 2 commits March 2, 2024 23:57

handling incorrect responce formats

50dce01

Merge branch 'main' of https://github.com/MN-Noor/trulens

61756b4

joshreini1 reviewed Mar 4, 2024

View reviewed changes

joshreini1 and others added 4 commits March 9, 2024 08:48

Merge branch 'main' into main

56e2382

added vectara hhem evaluate

aefc5b3

documentataion added

2f2d4cd

Merge branch 'main' of https://github.com/MN-Noor/trulens

061f879

dosubot bot added size:XL This PR changes 500-999 lines, ignoring generated files. and removed size:XXL This PR changes 1000+ lines, ignoring generated files. labels Mar 13, 2024

Merge branch 'truera:main' into main

10eda07

remove hugging face.file

573499d

joshreini1 self-requested a review March 15, 2024 17:28

joshreini1 approved these changes Mar 15, 2024

View reviewed changes

joshreini1 added 7 commits March 15, 2024 12:28

Merge branch 'main' into main

0f300e6

Merge branch 'main' into main

bb07b7f

Merge branch 'main' into main

053a2fb

Merge branch 'main' into main

bbed0e4

Merge branch 'main' into main

88a7b5f

Merge branch 'main' into main

b5d0a0e

Merge branch 'main' into main

755b627

joshreini1 merged commit bff1cdc into truera:main Mar 23, 2024
9 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Vectara Hallucination Detection Model #950

Add Vectara Hallucination Detection Model #950

Josephrp commented Mar 2, 2024 •

edited by ellipsis-dev bot

review-notebook-app bot commented Mar 2, 2024

ellipsis-dev bot left a comment

joshreini1 Mar 4, 2024 •

edited

joshreini1 commented Mar 4, 2024

joshreini1 Mar 4, 2024 •

edited

joshreini1 Mar 4, 2024 •

edited

joshreini1 Mar 4, 2024 •

edited

joshreini1 commented Mar 4, 2024

Josephrp commented Mar 9, 2024

joshreini1 commented Mar 9, 2024

Josephrp commented Mar 13, 2024

joshreini1 commented Mar 23, 2024

Josephrp commented Mar 23, 2024

Add Vectara Hallucination Detection Model #950

Add Vectara Hallucination Detection Model #950

Conversation

Josephrp commented Mar 2, 2024 • edited by ellipsis-dev bot

Added vectara hallucination detection model to the huggingface class

adding an exposition/model example using huggingface models end to end for demo

Summary:

review-notebook-app bot commented Mar 2, 2024

ellipsis-dev bot left a comment

Choose a reason for hiding this comment

joshreini1 Mar 4, 2024 • edited

Choose a reason for hiding this comment

joshreini1 commented Mar 4, 2024

joshreini1 Mar 4, 2024 • edited

Choose a reason for hiding this comment

joshreini1 Mar 4, 2024 • edited

Choose a reason for hiding this comment

joshreini1 Mar 4, 2024 • edited

Choose a reason for hiding this comment

joshreini1 commented Mar 4, 2024

Josephrp commented Mar 9, 2024

joshreini1 commented Mar 9, 2024

Josephrp commented Mar 13, 2024

joshreini1 commented Mar 23, 2024

Josephrp commented Mar 23, 2024

Josephrp commented Mar 2, 2024 •

edited by ellipsis-dev bot

joshreini1 Mar 4, 2024 •

edited

joshreini1 Mar 4, 2024 •

edited

joshreini1 Mar 4, 2024 •

edited

joshreini1 Mar 4, 2024 •

edited