repositories Search Results · topic:benchmark org:microsoft
Filter by
0 results
(181 ms)0 results
inmicrosoft (press backspace or delete to remove)A Contamination-free Multi-task Language Understanding Benchmark [Official, ACL 2025]
- 117
- Updated on May 17
[ICLR'25] OpenRCA: Can Large Language Models Locate the Root Cause of Software Failures?
- Python
- 109
- Updated 11 days ago
The ORBIT dataset is a collection of videos of objects in clean and cluttered scenes recorded by people who are blind/low-vision on a mob…
- Python
- 106
- Updated 10 days ago
PEACE: Empowering Geologic Map Holistic Understanding with MLLMs [Official, CVPR 2025]
- Python
- 47
- Updated on May 7

Sponsor open source projects you depend on
Contributors are working behind the scenes to make open source better for everyone—give them the help and recognition they deserve.Explore sponsorable projectsProTip!
Press the /
key to activate the search input again and adjust your query.
Sponsor open source projects you depend on
Contributors are working behind the scenes to make open source better for everyone—give them the help and recognition they deserve.Explore sponsorable projectsProTip!
Press the /
key to activate the search input again and adjust your query.