This is the exact code used for our submission, which scores #2 on the test set and #1 on the validation set.
Check out the current leaderboard here.
First, install requirements:
pip install -r requirements.txt
Set up your secrets in a .env file:
HUGGINGFACEHUB_API_TOKEN
SERPAPI_API_KEY
OPENAI_API_KEY
ANTHROPIC_API_KEY
And optionally, if you want to use Anthropic models via AWS Bedrock:
AWS_BEDROCK_ID
AWS_BEDROCK_KEY
Then run gaia.py to launch tests!
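
Before launching, it can help to confirm that the secrets are actually visible to the process. The following is a minimal sketch, not part of this repository: `missing_secrets` is a hypothetical helper, the split into required and optional names is an assumption based on the list above, and it assumes the values from the .env file have already been loaded into the environment.

```python
import os

# Names copied from the list above. Treating the first four as required
# and the AWS Bedrock pair as optional is an assumption of this sketch.
REQUIRED = [
    "HUGGINGFACEHUB_API_TOKEN",
    "SERPAPI_API_KEY",
    "OPENAI_API_KEY",
    "ANTHROPIC_API_KEY",
]
OPTIONAL_BEDROCK = ["AWS_BEDROCK_ID", "AWS_BEDROCK_KEY"]


def missing_secrets(names, env=None):
    """Return the names that are unset or empty in the given mapping.

    `env` defaults to the current process environment (os.environ).
    """
    env = os.environ if env is None else env
    return [name for name in names if not env.get(name)]


# Illustration with an explicit mapping, so the result does not depend
# on the machine this runs on. The values are placeholders, not real keys.
example_env = {"OPENAI_API_KEY": "placeholder", "SERPAPI_API_KEY": ""}
print(missing_secrets(REQUIRED, example_env))
# → ['HUGGINGFACEHUB_API_TOKEN', 'SERPAPI_API_KEY', 'ANTHROPIC_API_KEY']
```

Calling `missing_secrets(REQUIRED)` with no second argument checks the real environment; an empty list means every required name is set to a non-empty value.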
