Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We鈥檒l occasionally send you account related emails.

Already on GitHub? Sign in to your account

Priority Task : start using trulens to evaluate Gemini #1

Closed
Josephrp opened this issue Dec 15, 2023 · 6 comments 路 Fixed by #19
Closed

Priority Task : start using trulens to evaluate Gemini #1

Josephrp opened this issue Dec 15, 2023 · 6 comments 路 Fixed by #19
Assignees
Labels
good first issue Good for newcomers help wanted Extra attention is needed

Comments

@Josephrp
Copy link
Member

Josephrp commented Dec 15, 2023

馃How To

Check our References

trulens github + notebooks : https://github.com/truera/trulens/tree/main/trulens_eval/examples

Ideas for Evaluation

  • RAG
  • System Prompt
  • Data Processing Pipeline
  • Image Inputs

Work

What it takes : literally just running a notebook.

  • chroma , or embeddings to test
  • list of prompts to test
  • test combinations of prompts
  • multimodal evaluations

we will include the notebooks in the submission and write up

@Josephrp
Copy link
Member Author

hey there @mie-h and @Zochory : https://github.com/Tonic-AI/DataTonic/tree/main/evaluation this is a folder where we will first start working on the trulens evaluations which are a hackathon requirement + good practice while building an app 馃

@Josephrp
Copy link
Member Author

hey there @mie-h & @Zochory : i added default prompts to the baseline prompts folder we can use those in a trulens evaluation.

@Josephrp Josephrp pinned this issue Dec 16, 2023
@Josephrp
Copy link
Member Author

consider using this to generate "system prompts" for gemini

@Josephrp
Copy link
Member Author

@Josephrp Josephrp added the help wanted Extra attention is needed label Dec 17, 2023
@Josephrp Josephrp assigned jsaluja and unassigned mie-h Dec 17, 2023
@Josephrp
Copy link
Member Author

big thank you to 馃弳馃槑 @MN-Noor for producing the first TruLens with gemini on RAG using open ai!

Open tasks :

  • Make a notebook to test Gemini MultiModal (image inputs)
  • Make a notebook to test more models against Gemini
  • Make a notebook to test the "new features of Gemini" like the censorship level.

we'll all work on this together, normally if everyone does one, or at least contributes to a good one we will have secured this task.

@Josephrp Josephrp added the good first issue Good for newcomers label Dec 18, 2023
@MN-Noor MN-Noor linked a pull request Dec 18, 2023 that will close this issue
@Josephrp Josephrp reopened this Dec 18, 2023
@Zochory
Copy link
Contributor

Zochory commented Dec 18, 2023

Est-ce que l'on ajouterait pas d'autres multimodal LLM ?
comme celui ci dans les evals ? https://huggingface.co/sshh12/Mistral-7B-LoRA-ImageBind-LLAVA

@Josephrp Josephrp changed the title start using trulens to evaluate Gemini Priority Task : start using trulens to evaluate Gemini Dec 19, 2023
@Tonic-AI Tonic-AI deleted a comment from twilwa Dec 21, 2023
@Josephrp Josephrp unpinned this issue Dec 22, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
good first issue Good for newcomers help wanted Extra attention is needed
Projects
Archived in project
Development

Successfully merging a pull request may close this issue.

7 participants