- First download and extract the CTF challenges from https://huggingface.co/datasets/yuuwwang/aicrypto/blob/main/CTF.tar.gz into the
data/folder. Such that thedata/folder containsdata/01-Classic/,data/02-Block/etc. - Build Docker images:
- In
sandbox/:docker build . -t aicrypto-sandbox:latest - In
server/:docker build . -t aicrypto-sandbox:latest
- In
- Python dependencies:
python3 -m pip install -r requirements.txt
Set your API key and run benchmark.py.
Example:
export OPENROUTER_API_KEY="..."
python3 benchmark.py --agent cot --all --model openai/gpt-5.2 --output results_cot.json
Other useful options include --challenge, --category, --parallel, --timeout, --type, and --api-key (see benchmark.py --help).