Repositories
Showing 10 of 190 repositories
- SWELancer-Benchmark Public
This repo contains the dataset and code for the paper "SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?"
- simple-evals Public