Skip to content

valeman/LLM-Evaluation

 
 

Repository files navigation

Resources for Evaluation of LLMs / Generative AI

This repository includes the slides and some of the notebooks that are used in my Evaluation workshops.

Some of the notebooks do require an OpenAI API key.

These notebooks are intended for explaining key points of the talk, please don't try to bring them to production use. If you want to dig deeper or have issues, go to the source for each of these projects.

About the workshop

image

Notebook links

Prompting a Chatbot: Colab notebook

Testing Properties of a System: Guidance AI

Langtest tutorials from John Snow Labs: Colab Notebooks

LLM Evaluation Harness from EleutherAI: Github or Colab notebook

Ragas showing Model as an evaluator: Github or Colab notebook

About

Sample notebooks and prompts for LLM evaluation

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Jupyter Notebook 100.0%