-
Notifications
You must be signed in to change notification settings - Fork 17.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
x/tools/internal/jsonrpc2_v2: failures with "connection refused" #49387
Comments
Change https://go.dev/cl/388598 mentions this issue: |
Change https://go.dev/cl/388597 mentions this issue: |
Change https://go.dev/cl/388134 mentions this issue: |
Change https://go.dev/cl/388600 mentions this issue: |
Change https://go.dev/cl/388775 mentions this issue: |
…n case of error The JSON-RPC 2.0 protocol requires that responses objects have either a "result" or an "error", but not both. In Go, this corresponds to a non-nil result interface value or a non-nil error. However, the generated wrappers for the LSP protocol were passing non-nil values for both in case of error, due to passing typed-nil pointers as (non-nil) interfaces (see https://go.dev/doc/faq#nil_error). This change fixes the generator to explicitly pass only one or the other, and re-runs the generator at the existing commit. For golang/go#49387 For golang/go#46520 Change-Id: I582b52820bdac15d9f947e8d6c1e9daa70c53e40 Reviewed-on: https://go-review.googlesource.com/c/tools/+/388600 Run-TryBot: Bryan Mills <bcmills@google.com> Reviewed-by: Ian Cottrell <iancottrell@google.com> gopls-CI: kokoro <noreply+kokoro@google.com> TryBot-Result: Gopher Robot <gobot@golang.org>
Found new matching dashboard test flakes for:
2022-07-25 19:36 openbsd-amd64-68 tools@2a6393fe go@d9242f7a x/tools/internal/jsonrpc2_v2.TestIdleTimeout (log)
2022-08-05 17:04 openbsd-386-68 tools@06d96ee8 go@fefac44a x/tools/internal/jsonrpc2_v2.TestIdleTimeout (log)
2022-08-11 16:19 openbsd-386-70 tools@37a81b68 go@3c200d6c x/tools/internal/jsonrpc2_v2.TestIdleTimeout (log)
2022-08-12 12:39 openbsd-amd64-68 tools@88d981ef go@2cf49a76 x/tools/internal/jsonrpc2_v2.TestIdleTimeout (log)
2022-08-15 17:37 openbsd-386-68 tools@987de349 go@a1390356 x/tools/internal/jsonrpc2_v2.TestIdleTimeout (log)
2022-08-25 21:25 openbsd-386-70 tools@2f38e1de go@d7a3fa12 x/tools/internal/jsonrpc2_v2.TestIdleTimeout (log)
2022-08-26 17:59 freebsd-amd64-12_3 tools@717a6716 go@846c378b x/tools/internal/jsonrpc2_v2.TestIdleTimeout (log)
2022-08-31 18:33 linux-ppc64-buildlet tools@cb91d6c8 go@86e9e0ea x/tools/internal/jsonrpc2_v2.TestIdleTimeout (log)
2022-09-01 22:03 openbsd-386-68 tools@40cfafff go@0592ce5f x/tools/internal/jsonrpc2_v2.TestIdleTimeout (log)
2022-09-02 18:17 linux-s390x-ibm tools@5ba85415 go@0cf996a9 x/tools/internal/jsonrpc2_v2.TestIdleTimeout (log)
2022-09-07 14:10 openbsd-386-70 tools@c1dd25e8 go@88149ed4 x/tools/internal/jsonrpc2_v2.TestIdleTimeout (log)
2022-09-08 19:07 openbsd-386-68 tools@4754f75d go@76c94eb7 x/tools/internal/jsonrpc2_v2.TestIdleTimeout (log)
2022-09-08 19:07 openbsd-386-70 tools@4754f75d go@218294f1 x/tools/internal/jsonrpc2_v2.TestIdleTimeout (log)
2022-09-13 15:21 openbsd-amd64-68 tools@9250e22a go@9503bcae x/tools/internal/jsonrpc2_v2.TestIdleTimeout (log)
2022-09-17 00:45 openbsd-amd64-68 tools@4d18923f go@19d792c1 x/tools/internal/jsonrpc2_v2.TestIdleTimeout (log)
2022-09-19 18:18 linux-ppc64-buildlet tools@fdf581fd go@31d06b58 x/tools/internal/jsonrpc2_v2.TestServe (log)
|
Found new dashboard test flakes for:
2022-09-20 12:41 openbsd-386-68 tools@f9016238 go@225bcec9 x/tools/internal/jsonrpc2_v2.TestIdleTimeout (log)
|
Found new dashboard test flakes for:
2022-09-21 16:48 linux-s390x-ibm tools@2f047133 go@e40a130c x/tools/internal/jsonrpc2_v2.TestIdleTimeout (log)
|
Found new dashboard test flakes for:
2022-09-27 19:33 linux-s390x-ibm tools@10e9d3ce go@d268504f x/tools/internal/jsonrpc2_v2.TestIdleTimeout (log)
|
Found new dashboard test flakes for:
2022-10-05 17:59 openbsd-386-68 tools@c5514b75 go@947091d3 x/tools/internal/jsonrpc2_v2.TestIdleTimeout (log)
2022-10-10 18:26 openbsd-386-70 tools@150b5f8b go@742e0a97 x/tools/internal/jsonrpc2_v2.TestServe (log)
|
Found new dashboard test flakes for:
2022-10-15 19:01 openbsd-386-68 tools@9b5e55b1 go@61f0409c x/tools/internal/jsonrpc2_v2.TestIdleTimeout (log)
|
Change https://go.dev/cl/443355 mentions this issue: |
…r breaks Otherwise, the Await method on the corresponding AsyncCall will never unblock, leading to a deadlock (detected by the test changes in CL 388597). For golang/go#46047 For golang/go#46520 For golang/go#49387 Change-Id: I7954f80786059772ddd7f8c98d8752d56d074919 Reviewed-on: https://go-review.googlesource.com/c/tools/+/388775 Run-TryBot: Bryan Mills <bcmills@google.com> Reviewed-by: Ian Cottrell <iancottrell@google.com> gopls-CI: kokoro <noreply+kokoro@google.com> Auto-Submit: Bryan Mills <bcmills@google.com> TryBot-Result: Gopher Robot <gobot@golang.org>
With the suffixes I end up a little lost in the “box” noise while I'm reading — the channel ops alone suffice to make the storage mechanism clear. (To me, the mechanism of storing a value in a 1-buffered channel is conceptually similar to storing it in a pointer, atomic.Pointer, or similar — and we don't generally name those with a suffix either.) For golang/go#46047. For golang/go#46520. For golang/go#49387. Change-Id: I7f58a9ac532f597fe49ed70606d89bd8cbe33b55 Reviewed-on: https://go-review.googlesource.com/c/tools/+/443355 TryBot-Result: Gopher Robot <gobot@golang.org> Run-TryBot: Bryan Mills <bcmills@google.com> Auto-Submit: Bryan Mills <bcmills@google.com> Reviewed-by: Alan Donovan <adonovan@google.com> gopls-CI: kokoro <noreply+kokoro@google.com>
This eliminates a race between a successful Accept call and a concurrent Close call, which previously could have shut down the 'run' goroutine before Accept sent to the newConns channel, causing Accept to deadlock. In fact, it eliminates the long-running 'run' goroutine entirely (replacing it with a time.Timer), and in the process avoids leaking O(N) closed connections when only the very first one is long-lived. It also eliminates a potential double-Close bug: if the run method had called l.wrapped.Close due to an idle timeout, a subsequent call to Close would invoke l.wrapped.Close again. The io.Closer method explicitly documents doubled Close calls as undefined behavior, and many Closer implementations (especially test fakes) panic or deadlock in that case. It also eliminates a timer leak if the Listener rapidly oscillates between active and idle: previously the implementation used time.After, but it now uses an explicit time.Timer which can be stopped (and garbage-collected) when the listener becomes active. Idleness is now tracked based on the connection's Close method rather than Read: we have no guarantee in general that a caller will ever actually invoke Read (if, for example, they Close the connection as soon as it is dialed), but we can reasonably expect a caller to at least try to ensure that Close is always called. We now also verify, using a finalizer on a best-effort basis, that the Close method on each connection is called. We use the finalizer to verify the Close call — rather than to close the connection implicitly — because closing the connection in a finalizer would delay the start of the idle timer by an arbitrary and unbounded duration after the last connection is actually no longer in use. Fixes golang/go#46047. Fixes golang/go#51435. For golang/go#46520. For golang/go#49387. Change-Id: If173a3ed7a44aff14734b72c8340122e8d020f26 Reviewed-on: https://go-review.googlesource.com/c/tools/+/388597 Run-TryBot: Bryan Mills <bcmills@google.com> TryBot-Result: Gopher Robot <gobot@golang.org> Reviewed-by: Ian Cottrell <iancottrell@google.com> Auto-Submit: Bryan Mills <bcmills@google.com> gopls-CI: kokoro <noreply+kokoro@google.com>
For golang/go#46047 For golang/go#49387 For golang/go#46520 Change-Id: Ib72863a024d74f45c70a6cb53482cb606510f7e4 Reviewed-on: https://go-review.googlesource.com/c/tools/+/388598 Run-TryBot: Bryan Mills <bcmills@google.com> Reviewed-by: Ian Cottrell <iancottrell@google.com> TryBot-Result: Gopher Robot <gobot@golang.org> gopls-CI: kokoro <noreply+kokoro@google.com> Auto-Submit: Bryan Mills <bcmills@google.com>
Found new dashboard test flakes for:
2022-10-19 20:55 linux-amd64 tools@9eda97bc go@947091d3 x/tools/internal/jsonrpc2_v2.TestIdleTimeout (log)
|
This change fixes the semantics of Close to actually wait for in-flight requests before closing the ReadWriteCloser. (Previously, the Close method closed the ReadWriteCloser immediately, which I suspect is what led to many of the failures observed in golang/go#49387 and golang/go#46520.) It achieves this by explicitly tracking the number of in-flight requests, including requests with pending async responses, and explicitly rejecting new Call requests (while keeping the read loop open!) once Close has begun. To make it easier for me to reason about the request lifetimes, I reduced the number of long-lived goroutines from three to just one (the Read loop), with an additional Handler goroutine that runs only while the Handler queue is non-empty. Now, it is clearer (I hope!) that the number of in-flight async requests strictly decreases after Close has begun, even though the Read goroutine continues to read requests (and, importantly, responses) and to forward Notifications to the preempter. For golang/go#49387 For golang/go#46520 Change-Id: Idf5960f848108a7ced78c5382099c8692e9b181e Reviewed-on: https://go-review.googlesource.com/c/tools/+/388134 gopls-CI: kokoro <noreply+kokoro@google.com> Run-TryBot: Bryan Mills <bcmills@google.com> TryBot-Result: Gopher Robot <gobot@golang.org> Reviewed-by: Alan Donovan <adonovan@google.com>
Change https://go.dev/cl/446315 mentions this issue: |
This should be fixed as of CL 443678 or earlier. |
Prior to this CL we already shut down a jsonrpc2_v2.Conn when its Reader breaks, which we expect to be the common shutdown path. However, with certain kinds of connections (notably those over stdin+stdout), it is possible for the Writer side to fail while the Reader remains working. If the Writer has failed, we have no way to return the required Response messages for incoming calls, nor to write new Request messages of our own. Since we have no way to return a response, we will now mark those incoming calls as canceled. However, even if the Writer has failed we may still be able to read the responses for any outgoing calls that are already in flight. When our in-flight calls complete, we could in theory even continue to process Notification messages from the Reader; however, those are unlikely to be useful with half the connection broken. It seems more helpful — and less surprising — to go ahead and shut down the connection completely when it becomes idle. Updates golang/go#46520. Updates golang/go#49387. Change-Id: I713f172ca7031f4211da321560fe7eae57960a48 Reviewed-on: https://go-review.googlesource.com/c/tools/+/446315 TryBot-Result: Gopher Robot <gobot@golang.org> Reviewed-by: Alan Donovan <adonovan@google.com> Auto-Submit: Bryan Mills <bcmills@google.com> Run-TryBot: Bryan Mills <bcmills@google.com>
Change https://go.dev/cl/447035 mentions this issue: |
Prior to this CL we already shut down a jsonrpc2_v2.Conn when its Reader breaks, which we expect to be the common shutdown path. However, with certain kinds of connections (notably those over stdin+stdout), it is possible for the Writer side to fail while the Reader remains working. If the Writer has failed, we have no way to return the required Response messages for incoming calls, nor to write new Request messages of our own. Since we have no way to return a response, we will now mark those incoming calls as canceled. However, even if the Writer has failed we may still be able to read the responses for any outgoing calls that are already in flight. When our in-flight calls complete, we could in theory even continue to process Notification messages from the Reader; however, those are unlikely to be useful with half the connection broken. It seems more helpful — and less surprising — to go ahead and shut down the connection completely when it becomes idle. This is a redo of CL 446315, with additional fixes for bugs exposed on the -race builders and some extra code cleanup from the process of diagnosing those bugs. Updates golang/go#46520. Updates golang/go#49387. Change-Id: I746409a7aa2c22d5651448ed0135b5ac21a9808e Reviewed-on: https://go-review.googlesource.com/c/tools/+/447035 Auto-Submit: Bryan Mills <bcmills@google.com> Run-TryBot: Bryan Mills <bcmills@google.com> TryBot-Result: Gopher Robot <gobot@golang.org> gopls-CI: kokoro <noreply+kokoro@google.com> Reviewed-by: Alan Donovan <adonovan@google.com>
Found new dashboard test flakes for:
2022-11-04 18:06 openbsd-386-70 tools@39c2fd8b go@156bf3dd x/tools/internal/jsonrpc2_v2.TestIdleTimeout (log)
|
Ah, that failure mode is new, at least! The test failure here isn't because of the |
Change https://go.dev/cl/448096 mentions this issue: |
Found new dashboard test flakes for:
2022-11-09 13:57 openbsd-amd64-68 tools@d41a43b9 go@3a410941 x/tools/internal/jsonrpc2_v2.TestIdleTimeout (log)
|
That last flake was just before https://go.dev/cl/448096 landed, so I'm considering this likely to have been fixed by that CL. |
greplogs --dashboard -md -l -e '(?ms)connection refused.*FAIL\s+golang\.org/x/tools/internal/jsonrpc2_v2' --since=2021-06-03
2021-11-05T04:20:33-6f75aad-0a5ca24/solaris-amd64-oraclerel
2021-11-03T20:51:16-84e69e7-88407a8/openbsd-amd64-68
2021-10-09T14:53:12-ee04797-eba91e8/dragonfly-amd64
2021-10-08T19:19:09-c5188f2-b9e1e1b/openbsd-amd64-64
2021-09-24T07:22:13-6d1e33f-d413908/plan9-arm
2021-09-19T16:34:52-7559231-771b8ea/freebsd-amd64-race
2021-08-18T22:16:57-bf6c7f2-eda3de0/solaris-amd64-oraclerel
2021-08-02T20:18:28-bb69444-8a7ee4c/freebsd-386-11_4
2021-07-13T20:15:39-ef97713-a985897/solaris-amd64-oraclerel
2021-07-08T19:56:33-febfa9d-3d1d066/linux-amd64-wsl
2021-07-08T19:53:41-c979f92-296ddf2/linux-amd64-wsl
2021-07-08T17:53:43-55cd480-296ddf2/linux-amd64-wsl
2021-06-24T04:20:08-4833ac5-0e7012e/darwin-amd64-nocgo
Likely related to #46520.
The text was updated successfully, but these errors were encountered: