v0.5.11
Breaking Changes
- Remove deprecated Claude 3.5 model from LLM juries (#3926)
- Switch Gemini 2.5 to use
GoogleGenAIClient(#3944)
Models
- Add Claude Haiku 4.5 (#3922)
- Add GPT-5.1 model (#3927, 3937)
- Add
GoogleGenAIClient(#3925) - Add Gemini 3 Pro (#3936, 3937)
- Add Gemini Robotics-ER 1.5 (#3939)
- Add Kimi K2 model on Together (#3943)
- Switch Gemini 2.5 to use
GoogleGenAIClient(#3944) - Add more Qwen3 and Qwen3-Next models on Together AI (#3947)
Scenarios
- Convert AraTrust to use multiple_choice_joint adapter (#3920)
- Remove deprecated Claude 3.5 model from LLM juries (#3926)
- Remove stop sequences in ALRAGE scenario (#3948)
Framework
Contributors
Thank you to the following contributors for your work on this HELM release!