Richter is a LLM benchmarking tool
Test multiple LLM models simultaneously
Measure response times, token usage, and success rates
Support for concurrent and sequential execution
Export results to JSON or CSV format
Real-time progress tracking with ETA
Graceful error handling and recovery
- Go 1.25
- OpenRouter API key (get one at https://openrouter.ai)
- Set your API key:
export OPENROUTER_API_KEY="your-api-key-here"- Test connectivity:
go run . --test- Run a basic benchmark:
go run . # Simple benchmark with default model
go run .
# Multiple models
go run . --models "openai/gpt-oss-20b:free,mistralai/mistral-small-3.2-24b-instruct:free"
# Custom prompts
go run . --prompts "Hello,How are you?,Tell me a joke"
# Multiple iterations for reliability
go run . --iterations 3
# Concurrent testing
go run . --concurrent 3 --iterations 2
# Export to CSV
go run . --export csv --output my_results.csv