Add detailed tutorial for adding new evaluation benchmarks #1827

Merged
merged 34 commits into from
May 18, 2024

Conversation

xingyaoww
Collaborator

Co-authored by OpenDevin :)

@xingyaoww xingyaoww marked this pull request as ready for review May 17, 2024 11:04
@neubig neubig self-requested a review May 17, 2024 11:12
@neubig neubig assigned neubig and unassigned xingyaoww May 17, 2024
evaluation/TUTORIAL.md Outdated Show resolved Hide resolved
evaluation/TUTORIAL.md Outdated Show resolved Hide resolved
evaluation/TUTORIAL.md Outdated Show resolved Hide resolved
Contributor

@neubig neubig left a comment

This is exciting!

One suggestion is that we can probably make the README a bit less verbose without hurting understandability at all. Personally, when I see a long README I hesitate a bit to read it, so making it more concise might make it seem less intimidating.

evaluation/TUTORIAL.md Outdated Show resolved Hide resolved
evaluation/TUTORIAL.md Show resolved Hide resolved
evaluation/TUTORIAL.md Outdated Show resolved Hide resolved
evaluation/TUTORIAL.md Outdated Show resolved Hide resolved
evaluation/TUTORIAL.md Outdated Show resolved Hide resolved
evaluation/TUTORIAL.md Outdated Show resolved Hide resolved
evaluation/TUTORIAL.md Outdated Show resolved Hide resolved
evaluation/TUTORIAL.md Outdated Show resolved Hide resolved
evaluation/TUTORIAL.md Outdated Show resolved Hide resolved
evaluation/swe_bench/run_infer.py Outdated Show resolved Hide resolved
@neubig neubig assigned xingyaoww and unassigned neubig May 17, 2024
xingyaoww and others added 9 commits May 17, 2024 22:54
Co-authored-by: Graham Neubig <neubig@gmail.com>
Co-authored-by: Graham Neubig <neubig@gmail.com>
Co-authored-by: Graham Neubig <neubig@gmail.com>
Co-authored-by: Graham Neubig <neubig@gmail.com>
Co-authored-by: Graham Neubig <neubig@gmail.com>
Co-authored-by: Graham Neubig <neubig@gmail.com>
Co-authored-by: Graham Neubig <neubig@gmail.com>
Co-authored-by: Graham Neubig <neubig@gmail.com>
Co-authored-by: Graham Neubig <neubig@gmail.com>
xingyaoww and others added 6 commits May 17, 2024 22:59
Co-authored-by: Graham Neubig <neubig@gmail.com>
Co-authored-by: Graham Neubig <neubig@gmail.com>
Co-authored-by: Graham Neubig <neubig@gmail.com>
Co-authored-by: Graham Neubig <neubig@gmail.com>
Co-authored-by: Graham Neubig <neubig@gmail.com>
@xingyaoww
Collaborator Author

Thanks a ton, @neubig, for the feedback! I've cleaned up the README and moved those comments to the actual code file :)

@xingyaoww xingyaoww enabled auto-merge (squash) May 17, 2024 15:59
Collaborator

@li-boxuan li-boxuan left a comment

This is very helpful! I appreciate your attention to detail. I left some suggestions. I have already applied all the typo fixes, but the rest are up to you.

evaluation/TUTORIAL.md Outdated Show resolved Hide resolved
evaluation/TUTORIAL.md Outdated Show resolved Hide resolved
evaluation/TUTORIAL.md Outdated Show resolved Hide resolved
evaluation/TUTORIAL.md Outdated Show resolved Hide resolved
evaluation/swe_bench/run_infer.py Outdated Show resolved Hide resolved
evaluation/swe_bench/run_infer.py Outdated Show resolved Hide resolved
@li-boxuan li-boxuan disabled auto-merge May 18, 2024 07:23
@neubig
Contributor

neubig commented May 18, 2024

@xingyaoww ping me when this is ready to review (maybe also considering @li-boxuan's comments), and I'll make a final pass through!

xingyaoww and others added 5 commits May 19, 2024 00:44
Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>
Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>
Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>
Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>
@xingyaoww
Collaborator Author

@neubig Should be ready for another round of review now :)

Collaborator

@li-boxuan li-boxuan left a comment

LGTM, thank you!

@neubig
Contributor

neubig commented May 18, 2024

Looks great!

@neubig neubig merged commit b2fdb96 into OpenDevin:main May 18, 2024
25 checks passed

4 participants