
Timeout Error OpenAI #3512

Closed
shreyabhadwal opened this issue Apr 25, 2023 · 30 comments

Comments

@shreyabhadwal

I am facing a warning similar to the one described in #3005:

WARNING:langchain.embeddings.openai:Retrying langchain.embeddings.openai.embed_with_retry.<locals>._completion_with_retry in 4.0 seconds as it raised Timeout: Request timed out: HTTPSConnectionPool(host='api.openai.com', port=443): Read timed out. (read timeout=600).
It just keeps retrying. How do I get around this?
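The warning above comes from langchain's retry wrapper, which backs off and retries when the request times out. As a rough, hypothetical stdlib sketch of that behaviour (not the actual langchain source; the function name, delays, and retry count here are illustrative):

```python
import time

def retry_with_backoff(fn, max_retries=6, base_delay=4.0, max_delay=10.0):
    """Retry fn with exponential backoff on TimeoutError, roughly
    mirroring the 'Retrying ... in 4.0 seconds' behaviour in the log."""
    delay = base_delay
    for attempt in range(max_retries):
        try:
            return fn()
        except TimeoutError:
            if attempt == max_retries - 1:
                raise  # out of retries, surface the timeout
            time.sleep(delay)
            delay = min(delay * 2, max_delay)
```

If the underlying call never finishes within its own read timeout, this loop just keeps cycling, which matches the "it just keeps retrying" symptom; the fix discussed later in the thread is to shrink the per-request timeout rather than the retry policy.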

@dnrico1

dnrico1 commented Apr 26, 2023

Same for me as well

@La1c

La1c commented Apr 27, 2023

Getting the same error with the map-reduce summarizing chain.
The vanilla OpenAI API works as expected.

@gabacode

Same, following 👀

@shreyabhadwal
Author

shreyabhadwal commented Apr 28, 2023

@dnrico1 @La1c @gabacode
When are y'all getting the error?
For instance, I am getting it through my websocket app deployed on Azure (it's a chatbot application). Weirdly enough, I don't face it when I run the application locally.

@bkamapantula
Contributor

bkamapantula commented Apr 28, 2023

+1

OpenAI chat endpoint always seems to time out when using the summarization chain.

It works with the anthropic endpoint though.

hwchase17 pushed a commit that referenced this issue May 1, 2023
With longer context and completions, gpt-3.5-turbo and, especially,
gpt-4, will more often than not take > 60 seconds to respond.

Based on some other discussions, it seems like this is an increasingly
common problem, especially with summarization tasks.
- #3512
- #3005

OpenAI's max 600s timeout seems excessive, so I settled on 120, but I do
run into generations that take >240 seconds when using large prompts and
completions with GPT-4, so maybe 240 would be a better compromise?
@Binb1

Binb1 commented May 2, 2023

+1

@shreyabhadwal Experiencing the exact same behaviour. Local works well but it timeouts on Azure.

@shreyabhadwal
Author

@Binb1 do the timeouts happen every time for you or occasionally? Also, are you using websockets or SSE?

@Binb1

Binb1 commented May 3, 2023

@shreyabhadwal Strangely enough, every time I deploy a new version of my app it seems to work well. But after a few minutes I get timeouts and I can't really understand why so far. I'm using SSE.
I've tested a lot of different options and I have the same problem whether I call the OpenAI Python SDK directly or go through LangChain.

@shreyabhadwal
Author

@Binb1 I experience the exact same behavior. It works well if I restart the app, and then after a few minutes when I try again I get timeouts. Very weird.

Interestingly, I have tried doing it without streaming and it seems to be working well. I don't quite understand it.

@Binb1

Binb1 commented May 3, 2023

@shreyabhadwal This makes me think that it is more Azure than langchain/openai related then 😕

I have not tried streaming yet as I don't really need it but it fails for me even without it. So strange.

It feels like the webapp needs a "warmup" before being able to make the calls.

@gabacode

gabacode commented May 3, 2023

Increasing the timeout fixes it for me! Thanks @timothyasp !

@firezym

firezym commented May 5, 2023

+1
I set the timeout to 300s, but each time, after 3 to 5 requests, it still fails with a timeout...

@timothyasp
Contributor

timothyasp commented May 5, 2023 via email

@ColinTitahi

@shreyabhadwal @Binb1 any luck with Azure?

Same issue: local is fine and fast, but on Azure there are problems.
Something seems to fall asleep after 4-10 minutes of idling.
For me, "Retrying langchain.chat_models.openai.ChatOpenAI.completion_with_retry" seems to get logged before the call for chat etc.
After it times out, it returns and is good until it has been idle for 4-10 minutes again.
So increasing the timeout just increases the wait until it does time out and calls again.

Driving me nuts and suspect there is a simple configuration I'm missing.

@shreyabhadwal
Author

Nope, nothing yet.
@Binb1 @ColinTitahi are y'all using async calls to OpenAI?

@ColinTitahi

ColinTitahi commented May 10, 2023

@shreyabhadwal Not explicitly, so I don't think so. I'm using generate on the ChatOpenAI so I can get the llm_output tokens etc., and another run call to a chat-conversational-react-description agent with some additional tools.
These endpoints in my Flask app are called from the client JavaScript, which uses async to wait for the response.
It's like something gets set up when the Flask app initially starts, then falls asleep or disconnects after say 4-5 minutes, and then has to wait for the timeout to occur before it reconnects when the user calls it. Hence upping the timeout just increases that initial wait.

I'm using the OpenAI Chat model and hosting on an Azure web service.

@sagardspeed2

I am getting the same error with model gpt-4-0314, max_tokens = 2048 and request_timeout = 240, both locally and on the live server.
Yesterday this was working fine.

@DennisSchwartz

DennisSchwartz commented May 22, 2023

Same issue here. Running it in a Kubernetes Pod deployed to an AWS cluster and using async calls.
Works perfectly locally but times out as soon as it's in the cluster.

Weirdly, calling the OpenAI LLM directly works, but running the Agent it gets stuck.

This works:

agent_executor = get_agent(user_token)
driver = agent_executor.agent.llm_chain.llm
cl = driver.client()
print(cl.create(model=driver.model_name, prompt='Tell me a poem'))

But this does not:

await agent_executor.arun(query)

@DennisSchwartz

DennisSchwartz commented May 22, 2023

Ok so from the comments above I realised I was testing async in one case and blocking in the other.

print(await cl.acreate(model=driver.model_name, prompt='Tell me a poem'))

Does indeed also time out and fail to run!
So there definitely seems to be an issue with the Async running of OpenAI. I'm going to try Anthropic for now. :)


UPDATE

I still can't make it run, neither for OpenAI nor Anthropic - but I think I know what's going on.

Our Kubernetes cluster running the application is blocking access to the internet using Squid Proxy. The OpenAI API is allowed, but only for HTTP requests.
I think the OpenAI client is probably using web sockets to stream the responses and this is blocked by our proxy/firewall.
I have resorted to using the sync application for now, until we can figure out how to fix our proxy.
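If the cluster does force all egress through a proxy such as Squid, one workaround is to route the client through it explicitly. A sketch, assuming the 0.x openai client (which uses requests/urllib3 and so honours the standard proxy environment variables); the proxy URL below is hypothetical:

```python
import os

# Hypothetical in-cluster Squid endpoint; replace with your proxy's URL.
# Must be set before the client makes its first request.
os.environ["HTTPS_PROXY"] = "http://squid.internal:3128"
os.environ["HTTP_PROXY"] = "http://squid.internal:3128"

# Plain HTTPS calls (including SSE streaming, which runs over the same
# HTTPS connection) are now routed through the proxy by requests/urllib3.
```

Whether streaming survives depends on the proxy buffering long-lived responses, so it is still worth testing the streaming and non-streaming paths separately.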

@jpsmartbots

jpsmartbots commented Jun 4, 2023

I have the same issue. I am trying to hit the completions API on the text-davinci-003 engine. I am unable to replicate the issue locally, as it always works there. When I containerize and deploy it in AWS Lambda, I sometimes (I don't know when) get the following error:
Request timed out: HTTPSConnectionPool(host='instanceid.openai.azure.com', port=443): Max retries exceeded with url: //openai/deployments/textdavinci003/completions?api-version=2022-12-01 (Caused by ConnectTimeoutError(<urllib3.connection.HTTPSConnection object at XXXX>, 'Connection to instanceid.openai.azure.com timed out. (connect timeout=5)')

Any resolution?

@maxmarkov

It could be a problem with the SSL certificate. Point requests at your CA bundle via an environment variable:
os.environ["REQUESTS_CA_BUNDLE"] = "PATH_TO_YOUR_CERTIFICATE/YOUR_CERTIFICATE.crt"

@bigrig2212

bigrig2212 commented Jun 14, 2023

Same issue here. Works for a bit and then starts timing out.
I just can't nail down when it happens or why; there doesn't seem to be a rhyme or reason. It seems to happen a lot more in production (GCP) than locally, although it happens in both, and with short sentences more than long ones, although not exclusively. It happens a LOT though, like 1 out of 4 requests.

@flake9

flake9 commented Aug 8, 2023

+1

@HaochenQ

I have the same issue. Works well locally but faces timeout issues when the app is deployed to Azure App Service for Linux (Python or custom container).

@jpsmartbots

Hi HaochenQ

Maybe deploying your solution in a virtual machine will solve your problem. When I moved from AWS Lambda to EC2, the problem was resolved.

@HaochenQ

HaochenQ commented Aug 31, 2023

Hi HaochenQ

Maybe deploying your solution in a virtual machine will solve your problem. When I moved from AWS Lambda to EC2, the problem was resolved.

Thank you @jpsmartbots, I tried to deploy my container on an Azure VM, but the issue persists.

For those of you who are facing 504 gateway timeout issues (Retrying langchain.embeddings.openai.embed_with_retry.<locals>._embed_with_retry in 4.0 seconds as it raised Timeout: Request timed out: HTTPSConnectionPool(host='api.openai.com', port=443): Read timed out. (read timeout=600).) with Azure App Services: the issue is that the default HTTP timeout of Azure App Service is 230/240 seconds, while the default timeout of the OpenAI APIs is 600 seconds. Before langchain hears back from OpenAI and does a retry, Azure returns an error and our app appears down. You can use request_timeout (e.g. OpenAIEmbeddings(request_timeout=30)) to avoid the timeout on the Azure side, and somehow the retry call to OpenAI from langchain then always works.

Not sure why the langchain call to OpenAI after a period of inactivity fails and causes a timeout.
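The budgeting logic described above can be sketched as follows, assuming Azure App Service's roughly 230-second gateway limit (the helper name, margin, and retry count are hypothetical, but the idea is that one request plus at least one retry must fit inside the gateway's window):

```python
AZURE_APP_SERVICE_LIMIT_S = 230  # Azure App Service cuts HTTP responses around 230-240s

def safe_request_timeout(platform_limit_s, margin_s=10, retries=1):
    """Pick a per-request timeout so the initial attempt plus `retries`
    retry attempts all finish before the platform gateway gives up."""
    budget = platform_limit_s - margin_s   # leave headroom for overhead
    return max(1, budget // (retries + 1)) # split budget across attempts
```

With the defaults this yields 110 seconds for one retry; the 30-second value in the comment above is simply a much more conservative choice that leaves room for several retries.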

@ShantanuNair
Contributor

Hey all, I believe getting this fixed in the openai-python client should also help with this issue, and with generations:

openai/openai-python#387

The async and sync request_timeouts are NOT identical.

@luoqingming110

same problem

@ryoung562

I'm running into the same issue. I am running a proxy container that talks to the OpenAI API; it works locally, but not when I deploy it to Railway.

@mallapraveen

Did anyone fix this? I'm running into the same issue when I use the summarize map-reduce chain from LangChain on AWS Lambda.

@dosubot dosubot bot added the "stale" label (Issue has not had recent activity or appears to be solved. Stale issues will be automatically closed) on May 3, 2024
@dosubot dosubot bot closed this as not planned (Won't fix, can't repro, duplicate, stale) on May 10, 2024
@dosubot dosubot bot removed the "stale" label on May 10, 2024