GitHub - princeton-nlp/SWE-agent: SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.29% of bugs in the SWE-bench evaluation set and takes just 1.5 minutes to run.

Website & Demo | Documentation | Discord | Preprint

👋 Overview

SWE-agent turns LMs (e.g. GPT-4) into software engineering agents that can fix bugs and issues in real GitHub repositories.

On SWE-bench, SWE-agent resolves 12.29% of issues, achieving the state-of-the-art performance on the full test set.

We accomplish our results by designing simple LM-centric commands and feedback formats to make it easier for the LM to browse the repository, view, edit and execute code files. We call this an 🤖 Agent-Computer Interface (ACI). Read more about it in our paper!

SWE-agent is built and maintained by researchers from Princeton University.

If you found this work helpful, please consider using the following citation:

@misc{yang2024sweagent,
      title={SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering}, 
      author={John Yang and Carlos E. Jimenez and Alexander Wettig and Kilian Lieret and Shunyu Yao and Karthik Narasimhan and Ofir Press},
      year={2024},
}

✨ Use SWE-agent as a dev tool

We provide a command line tool and a graphical web interface:

🚀 Get started!

All information is provided in our documentation:

and many more topics.

💫 Contributions

If you'd like to ask questions, learn about upcoming features, and participate in future development, join our Discord community!
If you'd like to contribute to the codebase, we welcome issues and pull requests!
If you'd like to see a post or tutorial about some topic, please let us know via an issue.

Contact person: John Yang and Carlos E. Jimenez (Email: {jy1682, carlosej}@princeton.edu).

🪪 License

MIT. Check LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 438 Commits
.devcontainer		.devcontainer
.github		.github
assets		assets
config		config
docker		docker
docs		docs
evaluation		evaluation
inspector		inspector
make_demos		make_demos
scripts		scripts
sweagent		sweagent
tests		tests
trajectories		trajectories
.dockerignore		.dockerignore
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
build_deploy.sh		build_deploy.sh
codecov.yml		codecov.yml
environment.yml		environment.yml
mkdocs.yml		mkdocs.yml
mlc_config.json		mlc_config.json
pyproject.toml		pyproject.toml
release_dockerhub.sh		release_dockerhub.sh
requirements.txt		requirements.txt
run.py		run.py
run_replay.py		run_replay.py
setup.sh		setup.sh
start_web_ui.sh		start_web_ui.sh

License

princeton-nlp/SWE-agent

Folders and files

Latest commit

History

Repository files navigation

👋 Overview

✨ Use SWE-agent as a dev tool

🚀 Get started!

💫 Contributions

🪪 License

About

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Languages