SWE-Bench Dataset - 8,712 files

The dataset comprises 8,712 files across 6 programming languages, featuring verified tasks and benchmarks for evaluating coding agents and language models. It supports coding agents, language models, and developer tools with verified benchmark scores and multi-language test sets. - Get the data

Dataset characteristics:

Characteristic	Data
Description	An extended benchmark of real-world software engineering tasks with enhanced artifacts and broader language coverage
Data types	Text
Tasks	Bug fixing, code completion, pull request generation, automated code review
Total number of files	8,712
Total number of people	30
Labeling	Annotated with golden patches, test patches, post-patch reference states, and metadata stored in parquet files (e.g., repository name, issue/PR identifier, diffs, test results)
Programming languages	C#, Go, PHP, Rust, Kotlin, Ruby

📊 Sample dataset available! For full access, contact us to discuss purchase terms.

Dataset structure

Go - Files in Go
Scala - Files in Scala

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
Go		Go
Scala		Scala
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

SWE-Bench Dataset - 8,712 files

Dataset characteristics:

📊 Sample dataset available! For full access, contact us to discuss purchase terms.

Dataset structure

🧩 Like the dataset but need different data? We can collect a custom dataset just for you - learn more about our data collection services here

Similar Datasets:

🌐 UniData - your trusted data partner. Unique, accurate, thoroughly collected and annotated data designed to fuel your AI/ML success.

About

Uh oh!

Releases

Packages

Languages

UniData-NLP/swe-bench-coding-tasks

Folders and files

Latest commit

History

Repository files navigation

SWE-Bench Dataset - 8,712 files

Dataset characteristics:

📊 Sample dataset available! For full access, contact us to discuss purchase terms.

Dataset structure

🧩 Like the dataset but need different data? We can collect a custom dataset just for you - learn more about our data collection services here

Similar Datasets:

🌐 UniData - your trusted data partner. Unique, accurate, thoroughly collected and annotated data designed to fuel your AI/ML success.

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages