A compiler optimization benchmark for evaluating LLM performance on code optimization tasks.
bench/- Benchmark runner that tests AI models on C code optimizationvisualizer/- Next.js app for visualizing benchmark results
Install dependencies:
bun installRun a benchmark:
cd bench
bun run optimUpdate visualizer data:
cd bench
bun run update-vizRun visualizer:
cd visualizer
bun run dev