New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[NETBEANS-5744] Avoid java.io deadlock of close vs. pending read. #2986
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please remove the unused Writer
class in the test, it looks good otherwise.
...est/unit/src/org/netbeans/modules/java/lsp/server/debugging/launch/NbProcessConsoleTest.java
Outdated
Show resolved
Hide resolved
...a.lsp.server/src/org/netbeans/modules/java/lsp/server/debugging/launch/NbProcessConsole.java
Show resolved
Hide resolved
a5040c7
to
ba5bb8a
Compare
Sharing the deadlock stacktraces would help me to investigate the sources at the right locations. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Assuming the test fails before and now succeeds it seems like a step forward. Let's make it.
@Override | ||
public void close() throws IOException { | ||
// the underlying StreamDecoder would just block on synchronized read(); close the underlying stream. | ||
ioCtx.getStdIn().close(); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Without this explicit close, the following lockup occurs when LSP client initiates the process:
"pool-2-thread-1" #52 prio=5 os_prio=0 cpu=151.58ms
elapsed=22.34s tid=0x00007f3b388b3000 nid=0x43c3 waiting for monitor entry [0x00007f3b5d1bb000]
java.lang.Thread.State: BLOCKED (on object monitor)
at sun.nio.cs.StreamDecoder.close(java.base@11.0.5/StreamDecoder.java:191)
- waiting to lock <0x000000061a4d2068> (a org.netbeans.modules.java.lsp.server.ui.AbstractLspInputOutputProvider$LspIO$1)
at java.io.InputStreamReader.close(java.base@11.0.5/InputStreamReader.java:200)
at org.netbeans.modules.java.lsp.server.ui.AbstractLspInputOutputProvider$LspIO$1.close(AbstractLspInputOutputProvider.java:152)
at org.openide.util.io.ReaderInputStream.close(ReaderInputStream.java:130)
at org.netbeans.modules.gradle.execute.GradleDaemonExecutor.closeInOutErr(GradleDaemonExecutor.java:348)
- locked <0x000000061a4c7440> (a org.netbeans.modules.gradle.execute.GradleDaemonExecutor)
at org.netbeans.modules.gradle.execute.GradleDaemonExecutor.cancel(GradleDaemonExecutor.java:368)
at org.netbeans.modules.progress.spi.InternalHandle.requestCancel(InternalHandle.java:364)
at org.netbeans.modules.java.lsp.server.progress.LspInternalHandle.forceRequestCancel(LspInternalHandle.java:93)
at org.netbeans.modules.java.lsp.server.debugging.DebugAdapterContext.requestProcessTermination(DebugAdapterContext.java:140)
at org.netbeans.modules.java.lsp.server.debugging.launch.NbDisconnectRequestHandler.destroyDebugSession(NbDisconnectRequestHandler.java:49)
at org.netbeans.modules.java.lsp.server.debugging.launch.NbDisconnectRequestHandler.disconnect(NbDisconnectRequestHandler.java:35)
at org.netbeans.modules.java.lsp.server.debugging.NbProtocolServer.disconnect(NbProtocolServer.java:180)
at jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(java.base@11.0.5/Native Method)
at jdk.internal.reflect.NativeMethodAccessorImpl.invoke(java.base@11.0.5/NativeMethodAccessorImpl.java:62)
at jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(java.base@11.0.5/DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(java.base@11.0.5/Method.java:566)
at org.eclipse.lsp4j.jsonrpc.services.GenericEndpoint.lambda$null$0(GenericEndpoint.java:65)
at org.eclipse.lsp4j.jsonrpc.services.GenericEndpoint$$Lambda$213/0x0000000800468440.apply(Unknown Source)
at org.eclipse.lsp4j.jsonrpc.services.GenericEndpoint.request(GenericEndpoint.java:120)
at org.eclipse.lsp4j.jsonrpc.RemoteEndpoint.handleRequest(RemoteEndpoint.java:261)
at org.eclipse.lsp4j.jsonrpc.RemoteEndpoint.consume(RemoteEndpoint.java:190)
at org.netbeans.modules.java.lsp.server.debugging.Debugger$ConsumeWithLookup.lambda$consume$0(Debugger.java:92)
at org.netbeans.modules.java.lsp.server.debugging.Debugger$ConsumeWithLookup$$Lambda$771/0x0000000800aa7c40.run(Unknown Source)
at org.netbeans.modules.openide.util.GlobalLookup.execute(GlobalLookup.java:45)
at org.openide.util.lookup.Lookups.executeWith(Lookups.java:278)
at org.netbeans.modules.java.lsp.server.debugging.Debugger$ConsumeWithLookup.consume(Debugger.java:90)
at org.eclipse.lsp4j.jsonrpc.json.StreamMessageProducer.handleMessage(StreamMessageProducer.java:194)
at org.eclipse.lsp4j.jsonrpc.json.StreamMessageProducer.listen(StreamMessageProducer.java:94)
at org.eclipse.lsp4j.jsonrpc.json.ConcurrentMessageProcessor.run(ConcurrentMessageProcessor.java:113)
at java.util.concurrent.Executors$RunnableAdapter.call(java.base@11.0.5/Executors.java:515)
at java.util.concurrent.FutureTask.run(java.base@11.0.5/FutureTask.java:264)
at java.util.concurrent.ThreadPoolExecutor.runWorker(java.base@11.0.5/ThreadPoolExecutor.java:1128)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(java.base@11.0.5/ThreadPoolExecutor.java:628)
at java.lang.Thread.run(java.base@11.0.5/Thread.java:834)
....
"DisconnectableInputStream source reader" #63 daemon prio=5 os_prio=0 cpu=1.19ms elapsed=20.00s tid=0x00007f3a796c2800 nid=0x43f2 in Object.wait() [0x00007f3a993e7000]
java.lang.Thread.State: TIMED_WAITING (on object monitor)
at java.lang.Object.wait(java.base@11.0.5/Native Method)
- waiting on <no object reference available>
at java.io.PipedInputStream.read(java.base@11.0.5/PipedInputStream.java:326)
- waiting to re-lock in wait() <0x000000061a485ed8> (a org.netbeans.modules.java.lsp.server.debugging.launch.NbProcessConsole$1)
at java.io.PipedInputStream.read(java.base@11.0.5/PipedInputStream.java:377)
- waiting to re-lock in wait() <0x000000061a485ed8> (a org.netbeans.modules.java.lsp.server.debugging.launch.NbProcessConsole$1)
at sun.nio.cs.StreamDecoder.readBytes(java.base@11.0.5/StreamDecoder.java:284)
at sun.nio.cs.StreamDecoder.implRead(java.base@11.0.5/StreamDecoder.java:326)
at sun.nio.cs.StreamDecoder.read(java.base@11.0.5/StreamDecoder.java:178)
- locked <0x000000061a4d2068> (a org.netbeans.modules.java.lsp.server.ui.AbstractLspInputOutputProvider$LspIO$1)
at sun.nio.cs.StreamDecoder.read0(java.base@11.0.5/StreamDecoder.java:127)
- locked <0x000000061a4d2068> (a org.netbeans.modules.java.lsp.server.ui.AbstractLspInputOutputProvider$LspIO$1)
at sun.nio.cs.StreamDecoder.read(java.base@11.0.5/StreamDecoder.java:112)
at java.io.InputStreamReader.read(java.base@11.0.5/InputStreamReader.java:168)
at org.openide.util.io.ReaderInputStream.read(ReaderInputStream.java:65)
at org.openide.util.io.ReaderInputStream.read(ReaderInputStream.java:88)
at org.gradle.util.DisconnectableInputStream$1.run(DisconnectableInputStream.java:98)
at java.lang.Thread.run(java.base@11.0.5/Thread.java:834)
The cancel request waits in sun.nio.cs.StreamDecoder.close
blocked on monitor held by pending sun.nio.cs.StreamDecoder.read
Because of NETBEANS-2038, streams are closed before Gradle is instructed to stop the build. It could be possible to flag + filter the streams in Gradle, or close input stream after instructing Gradle to stop. The PipedInputStream
waits on an input from LSP client which might never come.
Full thread dump:
threaddump-streamdecoder.txt
// Bug in Piped*Stream: in.close() closes, but does | ||
// not unblock waiters, nor returns -1 from writer to | ||
// stop waiting. Close the writer - will also notifyAll(). | ||
inputSource.close(); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Without this explicit close, the PipedInputStream
won't wake up or receive -1
from the writer (which causes its read
to return and release lock):
"pool-2-thread-1" #48 prio=5 os_prio=0 cpu=305.23ms elapsed=44.59s tid=0x00007feefc052800 nid=0x54c4 waiting for monitor entry [0x00007fef03703000]
java.lang.Thread.State: BLOCKED (on object monitor)
at sun.nio.cs.StreamDecoder.close(java.base@11.0.5/StreamDecoder.java:190)
- waiting to lock <0x000000060d8752c0> (a org.netbeans.modules.java.lsp.server.ui.AbstractLspInputOutputProvider$LspIO$1)
at java.io.InputStreamReader.close(java.base@11.0.5/InputStreamReader.java:200)
at org.netbeans.modules.java.lsp.server.ui.AbstractLspInputOutputProvider$LspIO$1.close(AbstractLspInputOutputProvider.java:152)
at org.openide.util.io.ReaderInputStream.close(ReaderInputStream.java:130)
at org.netbeans.modules.gradle.execute.GradleDaemonExecutor.closeInOutErr(GradleDaemonExecutor.java:348)
- locked <0x000000060ccc1400> (a org.netbeans.modules.gradle.execute.GradleDaemonExecutor)
at org.netbeans.modules.gradle.execute.GradleDaemonExecutor.cancel(GradleDaemonExecutor.java:368)
at org.netbeans.modules.progress.spi.InternalHandle.requestCancel(InternalHandle.java:364)
at org.netbeans.modules.java.lsp.server.progress.LspInternalHandle.forceRequestCancel(LspInternalHandle.java:93)
at org.netbeans.modules.java.lsp.server.debugging.DebugAdapterContext.requestProcessTermination(DebugAdapterContext.java:140)
at org.netbeans.modules.java.lsp.server.debugging.launch.NbDisconnectRequestHandler.destroyDebugSession(NbDisconnectRequestHandler.java:49)
at org.netbeans.modules.java.lsp.server.debugging.launch.NbDisconnectRequestHandler.disconnect(NbDisconnectRequestHandler.java:35)
at org.netbeans.modules.java.lsp.server.debugging.NbProtocolServer.disconnect(NbProtocolServer.java:180)
at jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(java.base@11.0.5/Native Method)
at jdk.internal.reflect.NativeMethodAccessorImpl.invoke(java.base@11.0.5/NativeMethodAccessorImpl.java:62)
at jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(java.base@11.0.5/DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(java.base@11.0.5/Method.java:566)
at org.eclipse.lsp4j.jsonrpc.services.GenericEndpoint.lambda$null$0(GenericEndpoint.java:65)
at org.eclipse.lsp4j.jsonrpc.services.GenericEndpoint$$Lambda$216/0x000000080043d440.apply(Unknown Source)
at org.eclipse.lsp4j.jsonrpc.services.GenericEndpoint.request(GenericEndpoint.java:120)
at org.eclipse.lsp4j.jsonrpc.RemoteEndpoint.handleRequest(RemoteEndpoint.java:261)
at org.eclipse.lsp4j.jsonrpc.RemoteEndpoint.consume(RemoteEndpoint.java:190)
at org.netbeans.modules.java.lsp.server.debugging.Debugger$ConsumeWithLookup.lambda$consume$0(Debugger.java:92)
at org.netbeans.modules.java.lsp.server.debugging.Debugger$ConsumeWithLookup$$Lambda$259/0x00000008005ab040.run(Unknown Source)
at org.netbeans.modules.openide.util.GlobalLookup.execute(GlobalLookup.java:45)
at org.openide.util.lookup.Lookups.executeWith(Lookups.java:278)
at org.netbeans.modules.java.lsp.server.debugging.Debugger$ConsumeWithLookup.consume(Debugger.java:90)
at org.eclipse.lsp4j.jsonrpc.json.StreamMessageProducer.handleMessage(StreamMessageProducer.java:194)
at org.eclipse.lsp4j.jsonrpc.json.StreamMessageProducer.listen(StreamMessageProducer.java:94)
at org.eclipse.lsp4j.jsonrpc.json.ConcurrentMessageProcessor.run(ConcurrentMessageProcessor.java:113)
at java.util.concurrent.Executors$RunnableAdapter.call(java.base@11.0.5/Executors.java:515)
at java.util.concurrent.FutureTask.run(java.base@11.0.5/FutureTask.java:264)
at java.util.concurrent.ThreadPoolExecutor.runWorker(java.base@11.0.5/ThreadPoolExecutor.java:1128)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(java.base@11.0.5/ThreadPoolExecutor.java:628)
at java.lang.Thread.run(java.base@11.0.5/Thread.java:834)
...
"DisconnectableInputStream source reader" #58 daemon prio=5 os_prio=0 cpu=2.40ms elapsed=42.47s tid=0x00007feee14f8800 nid=0x54d6 in Object.wait() [0x00007fef03503000]
java.lang.Thread.State: TIMED_WAITING (on object monitor)
at java.lang.Object.wait(java.base@11.0.5/Native Method)
- waiting on <no object reference available>
at java.io.PipedInputStream.read(java.base@11.0.5/PipedInputStream.java:326)
- waiting to re-lock in wait() <0x000000060d8752e0> (a org.netbeans.modules.java.lsp.server.debugging.launch.NbProcessConsole$1)
at java.io.PipedInputStream.read(java.base@11.0.5/PipedInputStream.java:377)
- waiting to re-lock in wait() <0x000000060d8752e0> (a org.netbeans.modules.java.lsp.server.debugging.launch.NbProcessConsole$1)
at sun.nio.cs.StreamDecoder.readBytes(java.base@11.0.5/StreamDecoder.java:284)
at sun.nio.cs.StreamDecoder.implRead(java.base@11.0.5/StreamDecoder.java:326)
at sun.nio.cs.StreamDecoder.read(java.base@11.0.5/StreamDecoder.java:178)
- locked <0x000000060d8752c0> (a org.netbeans.modules.java.lsp.server.ui.AbstractLspInputOutputProvider$LspIO$1)
at sun.nio.cs.StreamDecoder.read0(java.base@11.0.5/StreamDecoder.java:127)
- locked <0x000000060d8752c0> (a org.netbeans.modules.java.lsp.server.ui.AbstractLspInputOutputProvider$LspIO$1)
at sun.nio.cs.StreamDecoder.read(java.base@11.0.5/StreamDecoder.java:112)
at java.io.InputStreamReader.read(java.base@11.0.5/InputStreamReader.java:168)
at org.openide.util.io.ReaderInputStream.read(ReaderInputStream.java:65)
at org.openide.util.io.ReaderInputStream.read(ReaderInputStream.java:88)
at org.gradle.util.DisconnectableInputStream$1.run(DisconnectableInputStream.java:98)
at java.lang.Thread.run(java.base@11.0.5/Thread.java:834)
super.close() would block on monitor of this Piped stream that would be still waiting on (never coming) input.
Full thread dump:
threaddump-pipedstream.txt
During testing of NETBEANS-5744 I have found that there are two lockups related to stream closing, which is done during process termination:
StreamDecoder
synchronizes around itsread()
(reading the underlying InputStream) and so doesclose
. So closing anInputStreamReader
while read is pending will block the close() thread.PipedInputStream
close() just sets a flag, but does not actually wake up potentially waiting reader and does not cause writer to return-1
EOF indicator.The patch attempts to fix it, as close() to streams bound to the process I/O may happen asynchronously when the client terminates the process. I've attempted to write tests that check the close can be done from other thread while read is pending / waiting for the data.