instance level metric outputs #6

Open
cabreraalex opened this issue Nov 15, 2023 · 4 comments

Comments

@cabreraalex

This is fantastic work!

I was wondering if you all could release the instance-level outputs from the analysis. We'd love to visualize the results using Zeno.

@simonhughes22
Contributor

Can you expand on what you mean by this? You can interact with the model right now on Hugging Face, if that helps: https://huggingface.co/vectara/hallucination_evaluation_model

@cabreraalex
Author

Ah, sorry, I meant the outputs of the hallucination evaluation model for each instance, e.g., a new column with the model's output in this file: https://github.com/vectara/hallucination-leaderboard/blob/main/leaderboard_summaries.csv
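
For illustration only, here is a rough sketch of what appending such a per-instance column could look like. Everything named here is an assumption: the `source` and `summary` column names, the new `hallucination_score` column, and especially `placeholder_score`, which is a trivial word-overlap stand-in and not the actual hallucination evaluation model.

```python
import csv
import io
from typing import Callable, Dict, Iterable, List


def placeholder_score(source: str, summary: str) -> float:
    """Stand-in scorer (NOT the real model): share of summary words found in the source."""
    words = summary.lower().split()
    if not words:
        return 0.0
    source_words = set(source.lower().split())
    return sum(w in source_words for w in words) / len(words)


def add_score_column(
    rows: Iterable[Dict[str, str]],
    score_fn: Callable[[str, str], float],
    source_col: str = "source",             # assumed column name
    summary_col: str = "summary",           # assumed column name
    score_col: str = "hallucination_score", # hypothetical new column
) -> List[Dict[str, str]]:
    """Return copies of the rows with one extra per-instance score column."""
    scored = []
    for row in rows:
        new_row = dict(row)
        new_row[score_col] = f"{score_fn(row[source_col], row[summary_col]):.4f}"
        scored.append(new_row)
    return scored


def rewrite_csv(
    in_text: str,
    score_fn: Callable[[str, str], float] = placeholder_score,
    score_col: str = "hallucination_score",
) -> str:
    """Read CSV text, append the score column, and return the new CSV text."""
    reader = csv.DictReader(io.StringIO(in_text))
    fieldnames = list(reader.fieldnames or []) + [score_col]
    rows = add_score_column(reader, score_fn, score_col=score_col)
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return out.getvalue()


# Tiny made-up input; the real file's columns may differ.
demo = "model,source,summary\nm1,the cat sat,the cat\nm2,the cat sat,a dog ran\n"
print(rewrite_csv(demo))
```

With one score per row, a tool like Zeno could slice and compare the summarization models instance by instance rather than only through the aggregate leaderboard numbers.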

Would love to dive in and compare the summarization models in more depth, similar to the reports we've recently published on the HF leaderboard and Whisper transcription models:

https://twitter.com/gneubig/status/1724872160144171104
https://twitter.com/a_a_cabrera/status/1722698009094529118

@amin2718
Contributor

Hello, I'm sorry I didn't see this earlier. Tragically, Simon passed away over Thanksgiving, and other members of the team are picking this up. We'll try to get the new column added soon.

@cabreraalex
Author

Oh no, I'm so sorry! Best wishes to the family and team. Of course, no rush at all!

3 participants