Evaluate_LLM
2 repositories
Supercharge Your LLM Application Evaluations 🚀
Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line …