-
Notifications
You must be signed in to change notification settings - Fork 1
Feat: Add accuracy evaluation for LLMs (GPQA, AIME, HLE etc.) #4
Copy link
Copy link
Open
Feature
0 / 40 of 4 issues completed
Copy link
Labels
ShowStopperPriority !!!: Something that is critically important to implement or fixPriority !!!: Something that is critically important to implement or fixaccuracyAccuracy evaluation and scoringAccuracy evaluation and scoringfeatureA new featureA new feature
Metadata
Metadata
Assignees
Labels
ShowStopperPriority !!!: Something that is critically important to implement or fixPriority !!!: Something that is critically important to implement or fixaccuracyAccuracy evaluation and scoringAccuracy evaluation and scoringfeatureA new featureA new feature