How to run OpenDevin with the SWEBench full dataset? #1858

Closed
ramsey-coding opened this issue May 17, 2024 · 2 comments
Labels
question Further information is requested

Comments

@ramsey-coding

I am trying to run OpenDevin using the full SWEBench dataset and need some guidance on the process.

Specifically, I would like to know:

  • What are the steps to set up the environment for OpenDevin to work with the SWEBench dataset?
  • Are there any specific configuration files or settings required for this integration?
  • Are there any sample scripts or examples available that demonstrate how to run OpenDevin with a subset of the SWEBench dataset?
@ramsey-coding ramsey-coding added the question Further information is requested label May 17, 2024
@xingyaoww
Collaborator

xingyaoww commented May 17, 2024

We don't yet support running on the full SWEBench dataset, but you can check this document for running it on a subset (SWEBench-lite).

@xingyaoww
Collaborator

We haven't tried running the full set and haven't validated that it will work (it is too expensive), but the Docker image we pre-build should contain all the test beds: https://github.com/OpenDevin/OpenDevin/blob/main/evaluation/swe_bench/run_infer.py#L283-L285 If you change this dataset to SWE-Bench full, it might just work out of the box! Feel free to re-open this issue if it does not work for you!
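
For anyone trying this, here is a rough sketch of what the dataset swap could look like. It assumes the linked lines load the benchmark through the Hugging Face `datasets` library and use the `test` split; that is an assumption about `run_infer.py`, not something confirmed in this thread. The helper names (`resolve_dataset_name`, `load_swe_bench_tests`) are hypothetical and are not part of OpenDevin. The dataset IDs are the public Princeton NLP ones on the Hugging Face Hub.

```python
# Hypothetical sketch, not OpenDevin code: pick the SWE-Bench variant by name
# instead of hard-coding the Lite dataset.

# Public Hugging Face Hub dataset IDs for the two variants.
SWE_BENCH_DATASETS = {
    "lite": "princeton-nlp/SWE-bench_Lite",
    "full": "princeton-nlp/SWE-bench",
}


def resolve_dataset_name(variant: str) -> str:
    """Map a variant name ('lite' or 'full') to its Hub dataset ID."""
    try:
        return SWE_BENCH_DATASETS[variant.lower()]
    except KeyError:
        raise ValueError(
            f"unknown variant {variant!r}; expected one of {sorted(SWE_BENCH_DATASETS)}"
        ) from None


def load_swe_bench_tests(variant: str = "lite"):
    """Download the chosen variant and return its test split as a DataFrame.

    Assumes `datasets` and `pandas` are installed; needs network access.
    """
    from datasets import load_dataset  # imported lazily: pip install datasets

    dataset = load_dataset(resolve_dataset_name(variant))
    return dataset["test"].to_pandas()


if __name__ == "__main__":
    tests = load_swe_bench_tests("full")
    print(len(tests), "instances loaded")
```

As noted above, the full set has not been validated, so it is probably worth running a handful of instances first before committing to the cost of the whole benchmark.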
