It would be nice to test the model on more benchmarks #44
Labels
doc-required
Your PR changes impact docs and you will update later.
enhancement
New feature or request
regression
MT-Bench | AGIEval | BBH MC | TruthfulQA | MMLU | HumanEval | BBH CoT | GSM8K
The text was updated successfully, but these errors were encountered: