Skip to content

Pull requests: microsoft/eureka-ml-insights

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Read me updates + docstr pipeline
#170 by safooray was merged Jul 10, 2025 Loading…
Userconf
#167 by safooray was merged Jul 8, 2025 Loading…
Add V*Bench dataset
#165 by tmlabonte was merged Jun 28, 2025 Loading…
mmlu benchmark
#164 by jyotianeja was merged Jun 28, 2025 Loading…
small fix to llm extraction file
#162 by vibhav-vineet was merged Jun 23, 2025 Loading…
updated sat and tsp HF paths
#161 by vibhav-vineet was merged Jun 17, 2025 Loading…
adding vision caption evals
#160 by michaelharrisonmai was merged Jun 25, 2025 Loading…
Update scoring_prompt.jinja
#159 by tmlabonte was merged Jun 11, 2025 Loading…
Benushi/offline model
#157 by nushib was merged Jun 20, 2025 Loading…
Readme
#156 by safooray was merged May 23, 2025 Loading…
Readme
#155 by safooray was merged May 23, 2025 Loading…
aime readme
#154 by lchen001 was closed May 22, 2025 Loading…
Updated hf file locations.
#153 by vibhav-vineet was merged May 22, 2025 Loading…
Adding BoN+WoN reporting to SAT config file.
#152 by vibhav-vineet was merged May 22, 2025 Loading…
Minorfixes
#151 by safooray was merged May 22, 2025 Loading…
TSP and SAT specs
#150 by vibhav-vineet was merged May 22, 2025 Loading…
update aime spec
#149 by lchen001 was merged May 22, 2025 Loading…
LLM extractor for SAT problem
#148 by vibhav-vineet was merged May 22, 2025 Loading…
adding specs for omni math and ba_calendar
#147 by vidhishanair was merged May 22, 2025 Loading…
llm extraction
#146 by lchen001 was merged May 21, 2025 Loading…
Small PR to add Sat metrics
#145 by vibhav-vineet was merged May 20, 2025 Loading…
Benushi/kitab updates
#144 by nushib was merged May 15, 2025 Loading…
Llm extract
#143 by safooray was merged May 16, 2025 Loading…
ProTip! Filter pull requests by the default branch with base:main.