Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature: -- plan flag #12

Open
swyxio opened this issue May 17, 2023 · 2 comments
Open

Feature: -- plan flag #12

swyxio opened this issue May 17, 2023 · 2 comments

Comments

@swyxio
Copy link
Contributor

swyxio commented May 17, 2023

this is an easy one

@swyxio
Copy link
Contributor Author

swyxio commented May 18, 2023

image
image

@swyxio swyxio changed the title Feature: smol plan Feature: -- plan flag May 30, 2023
@swyxio
Copy link
Contributor Author

swyxio commented Jun 19, 2023

The discovery that only GPT-4 can self-improve, while weaker models cannot, is very intriguing, indicating a new type of emergent ability (i.e. to improve upon natural language feedback) may only exist when the model is "mature" (large and well-aligned) enough
https://twitter.com/Francis_YAO_/status/1670618013089820674

Large Language Models (LLMs) have shown remarkable aptitude in code generation but still struggle on challenging programming tasks. Self-repair -- in which the model debugs and fixes mistakes in its own code -- has recently become a popular way to boost performance in these settings. However, only very limited studies on how and when self-repair works effectively exist in the literature, and one might wonder to what extent a model is really capable of providing accurate feedback on why the code is wrong when that code was generated by the same model. In this paper, we analyze GPT-3.5 and GPT-4's ability to perform self-repair on APPS, a challenging dataset consisting of diverse coding challenges.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant