A test suite to see if LLMs can correctly guess the correct answer in quizzes from Open Trivia database.
Install dependencies:
pip install -r requirements.txt
Minimal run:
python main.py your-project-id model-id
Run with all options:
python main.py your-project-id model-id --num_iterations=4 --no_questions=25 --google_search_grounding
Example runs:
python main.py genai-atamel gemini-1.0-pro
python main.py genai-atamel gemini-1.5-pro
python main.py genai-atamel gemini-1.5-flash
You can find some run outputs in runs folder.