Connection timeout strategy causing disconnections #295

dotansimha · 2017-10-24T16:45:26Z

Since the PR that adds the following:
https://github.com/apollographql/subscriptions-transport-ws/blob/v0.8.3/src/client.ts#L407-L416

The default connection timeout starts at 1000ms, which isn't enough for low-performance servers (for example, the free plan on heroku or the free plan on now).
This causes the client to disconnect whenever the connection takes more than 1000ms to establish.

For example, with my remote server, I expect the client to connect in about 1500ms or more - in this case I'm seeing one or more disconnections every time I recreate my socket.

The 1000ms should be configurable, and I think it should be more than 1000ms by default.
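
To make the numbers concrete, here is a minimal sketch of a connect timeout that grows with each attempt. It is not the library's code: the helper names `connectTimeoutFor` and `attemptsAbandoned`, the growth factor of 1.2, and the 30000ms cap are assumptions for illustration. Only the 1000ms starting value comes from this report.

```typescript
// Hypothetical helper: time allowed for the n-th connection attempt (0-based).
// The factor and max defaults are illustrative assumptions, not library constants.
function connectTimeoutFor(
  attempt: number,
  min: number = 1000,
  max: number = 30000,
  factor: number = 1.2
): number {
  const grown = min * Math.pow(factor, attempt);
  return Math.min(Math.round(grown), max);
}

// Counts how many attempts are abandoned before one is given enough time,
// for a server that needs `connectMs` to complete the connection.
function attemptsAbandoned(connectMs: number, min: number = 1000): number {
  const attemptLimit = 100; // safety bound in case connectMs exceeds the cap
  let attempt = 0;
  while (attempt < attemptLimit && connectTimeoutFor(attempt, min) < connectMs) {
    attempt++;
  }
  return attempt;
}
```

Under these assumed values, a server that needs about 1500ms would have several attempts cut off before one is allowed to finish, while a configurable starting value of 2000ms would let the first attempt succeed.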

apollo-client: 1.9.3
transport: 0.8.3


pandemosth · 2017-10-25T00:04:20Z

I'm seeing this at the moment too (also on heroku). To add to the issue, it appears that the reconnect logic is broken if the max connect timeout fires.

In my scenario the lazy flag is set, so the first subscription triggers the websocket connection attempt. If this times out and is reattempted, I'm seeing two different behaviours after the connection is opened:

- No subscription has been established (on Android real device)
- Two instances of the subscription that both then fire on new data from the server (on iOS Simulator and real device)

Haven't got any further on why this is happening but will continue to investigate.

This is react-native with the following versions:

apollo-client: 1.9.3
react-apollo: 1.4.16

pandemosth · 2017-10-25T13:05:35Z

OK, so I've managed to fix both issues above. The changes are here for comment and advice on whether this should be a PR.

@dotansimha I can create a separate issue if I'm hijacking this one..?

master...pandemosth:issue295

Explanation follows:-

Issue 1 - No subscription is made on reconnect, after initial connect times out (only seen on Android).
This is a strange one: the WebSocket onOpen callback is being called but the readyState is still 0 (connecting). This causes the following (apologies for the verbose explanation):

1. sendMessage is called from onOpen, then in sendMessageRaw the status is CONNECTING, so the message is pushed onto the unsent queue.
2. The next line in onOpen calls flushUnsentMessagesQueue. The subscription message and the init message in the unsent queue are attempted again, but since the status is CONNECTING they are not sent and are added back to the unsent queue. However, flushUnsentMessagesQueue then sets the queue to an empty array, so the unsent messages are discarded.
3. onOpen is called again, this time with status 1. The websocket is connected, but no graphql subscription is established since the unsent queue is empty.

This is probably an issue with React Native / WebSocket on Android. It seems odd that onOpen would be called with readyState still 0, but it's easy enough to guard against that in code, which is what I've done.
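
The sequence described above can be reproduced with a small stand-in for the client. This is a simplified sketch, not the library's code: `QueueingClient`, `simulate`, and the message strings are hypothetical, and the guard shown is only one plausible form of the check described.

```typescript
const CONNECTING = 0;
const OPEN = 1;

// Minimal stand-in for the client's send/flush logic (hypothetical class).
class QueueingClient {
  public readyState: number = CONNECTING;
  public sent: string[] = [];
  public unsent: string[] = [];

  constructor(private readonly guardOnOpen: boolean) {}

  send(message: string): void {
    if (this.readyState === OPEN) {
      this.sent.push(message);
    } else {
      this.unsent.push(message); // still connecting: queue for later
    }
  }

  flushUnsent(): void {
    const pending = this.unsent.slice();
    pending.forEach((m) => this.send(m)); // re-queues if still CONNECTING
    this.unsent = [];                     // ...and then discards what was re-queued
  }

  onOpen(): void {
    if (this.guardOnOpen && this.readyState !== OPEN) {
      return; // guard: ignore an open event that fires before the socket is ready
    }
    this.send("init");
    this.flushUnsent();
  }
}

// Replays the Android scenario: onOpen fires once with readyState 0, then with 1.
function simulate(guardOnOpen: boolean): string[] {
  const client = new QueueingClient(guardOnOpen);
  client.send("start"); // lazy subscription queued before the socket opens
  client.onOpen();      // premature open event, readyState still CONNECTING
  client.readyState = OPEN;
  client.onOpen();      // real open event
  return client.sent;
}
```

Without the guard, `simulate(false)` ends with only the init message sent and the subscription lost. With the guard, `simulate(true)` sends the init message followed by the queued subscription.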

Issue 2 - Duplicate subscription made on reconnect, after initial connect timeout (both platforms).
After fixing issue 1, this one was observed consistently on both platforms. The max connect timeout calls close which calls tryReconnect. In tryReconnect all current operations are added onto the unsent queue, which is causing the duplication. The simple fix here is to set this.reconnecting = true in the max connect timeout callback in order to bypass this step.
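
A rough sketch of why the re-queue step duplicates the subscription, and how setting the flag first avoids it. The class and method names here (`ReconnectingClient`, `onConnectTimeout`, `queuedAfterTimeout`) are hypothetical simplifications, not the library's implementation. The key assumption is that the original start message is still sitting in the unsent queue because the socket never opened.

```typescript
// Simplified stand-in for the reconnect bookkeeping (hypothetical class).
class ReconnectingClient {
  public operations: string[] = [];
  public unsent: string[] = [];
  public reconnecting: boolean = false;

  subscribe(id: string): void {
    this.operations.push(id);
    this.unsent.push("start:" + id); // socket not open yet, so the message waits
  }

  tryReconnect(): void {
    if (!this.reconnecting) {
      // Re-queue every active operation. This makes sense after an open
      // socket drops, but not when the first connect never completed.
      this.operations.forEach((id) => this.unsent.push("start:" + id));
      this.reconnecting = true;
    }
    // (a real client would schedule the next connection attempt here)
  }

  // Called when the max connect timeout fires.
  onConnectTimeout(markReconnecting: boolean): void {
    if (markReconnecting) {
      this.reconnecting = true; // the fix: skip the re-queue step
    }
    this.tryReconnect();
  }
}

function queuedAfterTimeout(markReconnecting: boolean): string[] {
  const client = new ReconnectingClient();
  client.subscribe("1");
  client.onConnectTimeout(markReconnecting);
  return client.unsent;
}
```

In this sketch, `queuedAfterTimeout(false)` leaves two copies of the start message in the queue, while `queuedAfterTimeout(true)` leaves one.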

I couldn't see any tests in the project related to timeouts so haven't updated tests.

Btw, Charles proxy with throttling (+1000ms latency) was used to reproduce these issues and verify.

perrosnk · 2017-12-11T23:46:31Z

Any updates on this? I am having the same issues

ilijaNL · 2018-01-12T11:52:25Z

@pandemosth I can confirm this on Android: sometimes a double subscription is made, or none at all

ash0080 · 2018-03-05T03:04:55Z

Same issues, no subscription or duplicated subscriptions, any news here?

Amareis · 2018-05-16T07:28:29Z

#377 solves this

nwronski · 2018-05-24T18:13:10Z

I was also experiencing Issue 2 described by @pandemosth and can confirm his changes fixed the issue. I observed that multiple copies of the initial operation(s) were sent to the server whenever a websocket connection took longer than 1000ms to connect.

NeoPhi · 2018-06-11T14:41:16Z

@pandemosth or @nwronski Please open a PR with the changes. The existing logic definitely sounds buggy.

jpgarcia · 2018-06-18T20:12:34Z

@NeoPhi If you want I can create a PR with the changes @pandemosth suggested against master.

I've just forked the repo and pushed underscopeio@15c0d70, which is pretty much the same as @pandemosth's changes except for src/server.ts, which AFAIK is already updated on master.

I've tested the fix in a react-native project and it seems to be working fine now! (thank you @pandemosth !)

Once the new version is published, we should also create an issue in apollo-link-ws to upgrade the subscriptions-transport-ws dependency.

mxstbr · 2018-06-18T20:27:08Z

@jpgarcia PR would be much appreciated!

jpgarcia · 2018-06-18T21:04:43Z

@mxstbr done!

NeoPhi · 2018-06-29T18:21:21Z

Included in: https://github.com/apollographql/subscriptions-transport-ws/releases/tag/v0.9.12

jedwards1211 · 2019-10-17T21:04:16Z

@NeoPhi not everything in the OP is fixed. It still appears we don't have a good way to raise the initial connect timeout, do we?

I bet even just an expensive initial page load can overwhelm the 1 second initial connect timeout, even if the server is performing just fine.

jedwards1211 · 2019-10-17T21:07:30Z

Also, is applying the backoff to the connection timeout really the best way to go about it? Seems to me the backoff should apply to the time waited before attempting a reconnect, instead of the time waited for the connection to succeed.
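
As a sketch of that alternative, the plan below keeps the connect timeout fixed and applies the backoff only to the delay before each retry. The names (`AttemptPlan`, `planReconnect`) and all the numeric defaults are hypothetical, chosen for illustration rather than taken from the library.

```typescript
// What the client would do for one connection attempt (hypothetical shape).
interface AttemptPlan {
  delayBeforeMs: number;    // how long to wait before starting this attempt
  connectTimeoutMs: number; // how long this attempt is allowed to take
}

// Backoff applies to the wait between attempts; the connect timeout stays fixed.
// All defaults below are illustrative assumptions.
function planReconnect(
  attempt: number,
  connectTimeoutMs: number = 5000,
  minDelayMs: number = 1000,
  maxDelayMs: number = 30000
): AttemptPlan {
  const delayBeforeMs =
    attempt === 0
      ? 0 // the first attempt starts immediately
      : Math.min(minDelayMs * Math.pow(2, attempt - 1), maxDelayMs);
  return { delayBeforeMs, connectTimeoutMs };
}
```

With this split, a slow but healthy server gets the same generous window on every attempt, and the growing delay only protects a struggling server from being hammered with retries.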

* feat(client): add minTimeout option actually fix #295 completely * chore(CHANGELOG.md): document minTimeout change * Changelog updates Co-authored-by: hwillson <hugh@octonary.com>

jpgarcia added a commit to underscopeio/subscriptions-transport-ws that referenced this issue Jun 18, 2018

Apply fix suggested by @pandemosth in apollographql#295

15c0d70

jpgarcia added a commit to underscopeio/subscriptions-transport-ws that referenced this issue Jun 18, 2018

Apply fix suggested by @pandemosth in apollographql#295

48d73a1

damour mentioned this issue Jun 20, 2018

Memory leaks in duplicate subscriptions #433

Closed

jpgarcia mentioned this issue Jun 26, 2018

Fix websocket reconnection #428

Closed

4 tasks

damour pushed a commit to damour/subscriptions-transport-ws that referenced this issue Jun 29, 2018

Fix no subscription is made on reconnect && duplicate subscription ma…

46b15d1

…de on reconnect. apollographql#295 (comment)

NeoPhi pushed a commit that referenced this issue Jun 29, 2018

Fix no subscription is made on reconnect && duplicate subscription ma…

57874d2

…de on reconnect. (#439) #295 (comment)

NeoPhi closed this as completed Jun 29, 2018

jedwards1211 added a commit to jedwards1211/subscriptions-transport-ws that referenced this issue Oct 17, 2019

feat(client): add minTimeout option

0f39b0d

actually fix apollographql#295 completely

jedwards1211 mentioned this issue Oct 17, 2019

feat(client): add minTimeout option #675

Merged

4 tasks

snyk-bot mentioned this issue Apr 24, 2020

[Snyk] Upgrade subscriptions-transport-ws from 0.8.2 to 0.9.16 walmartlabs/lacinia-pedestal#96

Closed

hwillson pushed a commit to jedwards1211/subscriptions-transport-ws that referenced this issue Aug 10, 2020

feat(client): add minTimeout option

ed5cf65

actually fix apollographql#295 completely
