Skip to content

repositories Search Results · topic:benchmark org:microsoft

Filter by

0 results
 (181 ms)

0 results

inmicrosoft (press backspace or delete to remove)

A unified evaluation framework for large language models
  • Python
  • 2.7k
  • Updated
    29 days ago

A validation and profiling tool for AI infrastructure
  • Python
  • 318
  • Updated
    yesterday

A Contamination-free Multi-task Language Understanding Benchmark [Official, ACL 2025]
  • 117
  • Updated
    on May 17

[ICLR'25] OpenRCA: Can Large Language Models Locate the Root Cause of Software Failures?
  • Python
  • 109
  • Updated
    11 days ago

The ORBIT dataset is a collection of videos of objects in clean and cluttered scenes recorded by people who are blind/low-vision on a mob…
  • Python
  • 106
  • Updated
    10 days ago

Latency Benchmarking tool
  • Go
  • 93
  • Updated
    on May 11, 2024

🚀 SWE-bench Goes Live!
  • Python
  • 83
  • Updated
    6 days ago

PEACE: Empowering Geologic Map Holistic Understanding with MLLMs [Official, CVPR 2025]
  • Python
  • 47
  • Updated
    on May 7

SQL-ProcBench is an open benchmark for procedural workloads in RDBMSs.
  • TSQL
  • 47
  • Updated
    on Sep 25, 2021

A Benchmark Tool for Natural Language-based Network Management
  • Python
  • 26
  • Updated
    on Jun 18, 2024
Package icon

Sponsor open source projects you depend on

Contributors are working behind the scenes to make open source better for everyone—give them the help and recognition they deserve.Explore sponsorable projects
ProTip! 
Press the
/
key to activate the search input again and adjust your query.
Package icon

Sponsor open source projects you depend on

Contributors are working behind the scenes to make open source better for everyone—give them the help and recognition they deserve.Explore sponsorable projects
ProTip! 
Press the
/
key to activate the search input again and adjust your query.