
API responded with status code: 429. Rate limit reached for 10KTPM-200RPM in organization org-WyXXXXXXXXX #17

Closed
akamalov opened this issue Aug 28, 2023 · 8 comments

Comments

@akamalov

Getting this immediately:

API responded with status code: 429. Response text: {
    "error": {
        "message": "Rate limit reached for 10KTPM-200RPM in organization org-WyXXXXXXXXX on tokens per min. Limit: 10000 / min. Please try again in 6ms. Contact us through our help center at help.openai.com if you continue to have issues.",
        "type": "tokens",
        "param": null,
        "code": "rate_limit_exceeded"
    }
}

This is my first attempt to access the OpenAI API today, and I am already getting this error. I am running other applications that generate Python code and I am not getting this error there.

@zenchantlive

I am having the same issue.

@hafizSiddiq7675

Same issue

@zvone187
Contributor

This happens when you have a small tokens-per-minute limit. OpenAI sets a default of 10k tokens per minute, which is too low for GPT Pilot, but you can request a limit increase from OpenAI.

@CyKiller

CyKiller commented Aug 30, 2023

We should add a step: to improve the pilot and allow for user feedback during confirmations and error handling, we can modify the create_gpt_chat_completion function.

Firstly, instead of just asking the user to press ENTER to confirm, we can use the questionary library to create a more interactive prompt. Secondly, when an error occurs, we can ask the user for advice or feedback before deciding whether to retry the request. When an exception is raised, the code would ask the user for input via questionary.text and print it out; that print statement can be replaced with whatever action we want to perform with the feedback. This halts the process, which might otherwise be stuck in a loop or burning through tokens, and gives us a control point in the terminal before the next request goes out after an error; a rough sketch follows below.

We can then use the user's feedback however the task requires: log it, use it to alter the program's behavior, or even send it back to the server for further analysis. These changes should make the program more interactive and responsive to our input, and could help avoid issues like infinite loops or excessive token usage, rate limit or not.
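
A minimal sketch of that flow, assuming the questionary package is installed; the wrapper name call_with_feedback and its retry shape are illustrative placeholders, not GPT Pilot's actual create_gpt_chat_completion:

import questionary

def call_with_feedback(api_call):
    """Run an API call; on failure, pause and ask the user for input before retrying."""
    while True:
        try:
            return api_call()
        except Exception as e:
            # Instead of a bare "press ENTER" confirmation, collect optional feedback.
            feedback = questionary.text(
                f"Request failed with: {e}. Press ENTER to retry, or type advice first:"
            ).ask()
            if feedback:
                # Placeholder action: log it, alter behavior, or send it upstream.
                print(f"User feedback: {feedback}")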

@Zate

Zate commented Sep 1, 2023

> This happens when you have a small tokens-per-minute limit. OpenAI sets a default of 10k tokens per minute, which is too low for GPT Pilot, but you can request a limit increase from OpenAI.

No, they do not raise the limit on GPT-4, according to their own docs and request forms.

I'd love to see a combination of using GPT-3.5 Turbo for places where it doesn't matter, with GPT-4 used just for the important pieces.

It would be nice to have it implement some kind of automated handling of the rate limit, such as the approach described at https://help.openai.com/en/articles/5955604-how-can-i-solve-429-too-many-requests-errors or similar.

@zenchantlive

zenchantlive commented Sep 1, 2023 via email

@CyKiller

CyKiller commented Sep 1, 2023

> It would be nice to have it implement some kind of automated handling of the rate limit, such as the approach described at https://help.openai.com/en/articles/5955604-how-can-i-solve-429-too-many-requests-errors or similar.

We can test this, I guess. We likely need to update the llm_connection.py file to include an exponential backoff mechanism similar to the one described in the OpenAI article. We can try adding a while loop in stream_gpt_completion that wraps the existing API request code: if a "429: Too Many Requests" error is encountered, the code waits for a set sleep time and then retries the request. I haven't thought this through fully, so any advice would be helpful, but the sleep time could double with each retry, up to a maximum number of retries, so that from our end the run feels uninterrupted; a sketch is below.
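
A minimal sketch of that loop; the retry cap, base delay, and the make_request callable are assumptions for illustration, not the actual stream_gpt_completion internals:

import time

def request_with_backoff(make_request, max_retries=6, base_sleep=1.0):
    """Retry a request on 429 responses, doubling the sleep time each attempt."""
    sleep_time = base_sleep
    attempts = 0
    while attempts < max_retries:
        response = make_request()
        if response.status_code != 429:
            return response
        # Rate limited: wait, then double the delay before the next attempt.
        time.sleep(sleep_time)
        sleep_time *= 2
        attempts += 1
    raise RuntimeError("Still rate limited after maximum retries")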

The article does note: "we will not increase limits on gpt-4, text-davinci-003, gpt-3.5-turbo-16k, or fine-tuned models at this time."

@nalbion
Contributor

nalbion commented Sep 9, 2023

This is fixed now at https://github.com/Pythagora-io/gpt-pilot/blob/main/pilot/utils/llm_connection.py#L152

I do like @CyKiller's suggestion of exponential back-off. Currently it follows the instructions in the response, which is always "Please try again in 6ms".
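
For illustration, a hedged sketch of honoring the hinted wait while enforcing a sensible floor; the regex and the minimum_wait parameter are assumptions, not what llm_connection.py actually does:

import re
import time

def sleep_from_hint(error_message, minimum_wait=1.0):
    """Parse a 'Please try again in 6ms' style hint, but never sleep less than
    the floor, since the hinted value can be too small to be useful."""
    match = re.search(r"try again in (\d+(?:\.\d+)?)(ms|s)\b", error_message)
    if match:
        value, unit = float(match.group(1)), match.group(2)
        hinted = value / 1000.0 if unit == "ms" else value
    else:
        hinted = minimum_wait
    time.sleep(max(hinted, minimum_wait))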

@Zate also suggests using "GPT 3.5 Turbo for places where it doesn't matter", which is also a good idea.

@nalbion nalbion closed this as completed Sep 28, 2023
LeonOstrez added a commit that referenced this issue Oct 3, 2024