v0.5.9
Models
- Add Arabic language models
- Add Qwen3-Next 80B A3B Thinking model (#3875)
- Add DeepSeek-R1-Distill-Llama-70B and DeepSeek-R1-Distill-Qwen-14B (#3873)
- Add Qwen3-Next 80B A3B Thinking model (#3891)
- Add Falcon3 models (#3894)
- Add Claude 4.5 Sonnet (#3897)
Scenarios
- Fix breakages in
shc_privacy_medandshc_proxy_med(#3876) - Allow applying regular expression pattern before mapping output (#3882)
- Add output mapping patterns for Arabic MCQA scenarios (#3885)
- Update speech language pathology scenarios to use Hugging Face Datasets (#3835)
Framework
- Make priority optional for run entries (#3865)
- Add
herror()andhexception()to logger (#3789) - Log error stack traces from within various clients (#3880)
Contributors
Thank you to the following contributors for your work on this HELM release!