Possible Socket Regression in .NET Core 5.0 on Linux #30895

Closed
halter73 opened this issue Sep 18, 2019 · 4 comments · Fixed by dotnet/corefx#41250

Comments

@halter73
Member

ASP.NET Core has tests where the client reads response data at an artificially slow rate, but above the configured minimum rate enforced by the Kestrel HTTP server. These tests started becoming flaky, with the client observing a “Connection reset by peer” SocketException on Linux, when we updated the AspNetCore repo to depend on .NET Core 5.0.
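
(For context, Kestrel's enforced minimum is configured via `Limits.MinResponseDataRate`; the setup and rate values below are illustrative, not the ones the tests actually use.)

```csharp
using System;
using Microsoft.AspNetCore.Server.Kestrel.Core;

// Inside the usual IWebHostBuilder setup; the numbers here are hypothetical.
webBuilder.ConfigureKestrel(options =>
{
    options.Limits.MinResponseDataRate =
        new MinDataRate(bytesPerSecond: 240, gracePeriod: TimeSpan.FromSeconds(5));
});
```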

In these tests, Kestrel calls Socket.Shutdown(SocketShutdown.Both) and then Socket.Dispose() immediately after the last Socket.SendAsync() Task completes. There isn’t any special LingerState or anything like that. I know the standard way to close a socket is to close the sending side, wait to receive a FIN (a 0-length read with a timeout), and then dispose the socket, but this is the logic the Socket transport has used since 2.0 and the libuv transport since 1.0, and these tests weren’t flaky before and still aren’t flaky on Windows or macOS.
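
A minimal sketch of the two close sequences described above (illustrative helper methods, not Kestrel's actual code):

```csharp
using System;
using System.Net.Sockets;
using System.Threading.Tasks;

static class CloseSequences
{
    // Kestrel's sequence as described above: Shutdown(Both) and Dispose
    // immediately after the last send completes.
    public static async Task CloseImmediatelyAsync(Socket socket, ReadOnlyMemory<byte> lastChunk)
    {
        await socket.SendAsync(lastChunk, SocketFlags.None);
        socket.Shutdown(SocketShutdown.Both);
        socket.Dispose();
    }

    // The "standard" graceful sequence: close only the send side, then wait
    // for the peer's FIN (a 0-byte read; a real version would add a timeout)
    // before disposing.
    public static async Task CloseGracefullyAsync(Socket socket, ReadOnlyMemory<byte> lastChunk)
    {
        await socket.SendAsync(lastChunk, SocketFlags.None);
        socket.Shutdown(SocketShutdown.Send);
        var drain = new byte[512];
        while (await socket.ReceiveAsync(drain.AsMemory(), SocketFlags.None) != 0) { }
        socket.Dispose();
    }
}
```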

The PR (dotnet/aspnetcore#13532) where I cleaned up the flaky tests a couple of days after taking the .NET Core 5.0 dependency goes into more detail about them. @jkotalik looked through changes made to Sockets after 3.0 that might explain this regression, and he found dotnet/corefx#38804, a PR titled “Socket: improve cross-platform behavior on Dispose.” I agree that this PR looks pretty suspicious.

I tried creating a minimal repro for this issue without Kestrel or any testing infrastructure, but so far I haven’t been successful. I thought that simply reading response data slowly from a Socket that was already shut down and disposed by the peer would be sufficient, but apparently there’s something more to this regression than I realize. Here’s a gist with my minimal repro attempt (that doesn’t repro yet).
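
(The gist itself isn't included above; the following is an illustrative reconstruction of that kind of repro attempt, not the actual gist.)

```csharp
using System;
using System.Net;
using System.Net.Sockets;
using System.Threading.Tasks;

// Illustrative only, not the actual gist: the server sends a payload, shuts
// down both directions, and disposes immediately; the client then reads slowly.
class SlowReadRepro
{
    static async Task Main()
    {
        using var listener = new Socket(AddressFamily.InterNetwork, SocketType.Stream, ProtocolType.Tcp);
        listener.Bind(new IPEndPoint(IPAddress.Loopback, 0));
        listener.Listen(1);

        using var client = new Socket(AddressFamily.InterNetwork, SocketType.Stream, ProtocolType.Tcp);
        await client.ConnectAsync(listener.LocalEndPoint);
        Socket server = await listener.AcceptAsync();

        // Server: send, Shutdown(Both), Dispose, as Kestrel does.
        await server.SendAsync(new byte[64 * 1024].AsMemory(), SocketFlags.None);
        server.Shutdown(SocketShutdown.Both);
        server.Dispose();

        // Client: read slowly. The question is whether this observes a clean
        // FIN (0-byte read) or throws "Connection reset by peer".
        var buffer = new byte[4096];
        while (await client.ReceiveAsync(buffer.AsMemory(), SocketFlags.None) != 0)
        {
            await Task.Delay(100); // artificially slow consumer
        }
        Console.WriteLine("Clean FIN observed; no repro.");
    }
}
```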

@tmds @stephentoub

@davidsh
Contributor

davidsh commented Sep 18, 2019

@wfurt

@wfurt
Member

wfurt commented Sep 18, 2019

cc: @tmds

@tmds
Member

tmds commented Sep 19, 2019

These tests started becoming flaky with the client observing a “Connection reset by peer”

This is caused by calling Disconnect:

https://github.com/dotnet/corefx/blob/19b304f7815894b13cb61e87e1c9eac49a474c7e/src/System.Net.Sockets/src/System/Net/Sockets/SafeSocketHandle.Unix.cs#L397

This happens in TryUnblockSocket, to cancel ongoing operations on the socket when the handle isn't released immediately on Dispose:

https://github.com/dotnet/corefx/blob/b49a8a9be1d53cd9e50cb68fd8540be25c65d433/src/System.Net.Sockets/src/System/Net/Sockets/SafeSocketHandle.cs#L176-L184

Although you've called Send and Shutdown in the server, the data may still be in the kernel send buffer (so the peer hasn't observed a FIN close). Then when we call Disconnect, the data gets thrown away and the peer immediately sees a RST close.
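
A self-contained way to observe that difference from the peer's side (plain user code, not corefx internals; a zero-timeout linger is used here to force the same kind of abortive close that Disconnect produces):

```csharp
using System;
using System.Net;
using System.Net.Sockets;
using System.Threading.Tasks;

// Demonstrates FIN vs RST from the peer's perspective.
class FinVsRst
{
    static async Task Main()
    {
        using var listener = new Socket(AddressFamily.InterNetwork, SocketType.Stream, ProtocolType.Tcp);
        listener.Bind(new IPEndPoint(IPAddress.Loopback, 0));
        listener.Listen(1);

        using var client = new Socket(AddressFamily.InterNetwork, SocketType.Stream, ProtocolType.Tcp);
        await client.ConnectAsync(listener.LocalEndPoint);
        Socket server = await listener.AcceptAsync();

        await server.SendAsync(new byte[64 * 1024].AsMemory(), SocketFlags.None);

        // Abortive close: buffered data is discarded and a RST is sent. A
        // graceful close (Shutdown(Send), drain, Dispose) would instead let
        // the client consume the data and then observe a 0-byte read.
        server.LingerState = new LingerOption(true, 0);
        server.Close();

        var buffer = new byte[4096];
        try
        {
            while (await client.ReceiveAsync(buffer.AsMemory(), SocketFlags.None) != 0) { }
            Console.WriteLine("Graceful FIN observed.");
        }
        catch (SocketException e)
        {
            // On Linux this typically reports ConnectionReset,
            // i.e. "Connection reset by peer".
            Console.WriteLine($"RST observed: {e.SocketErrorCode}");
        }
    }
}
```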

The fix is to call Shutdown instead of Disconnect in TryUnblockSocket in case the user has already made an explicit call to Shutdown for the Send/Both end.
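
A rough sketch of that direction, expressed against the public Socket API (the real change belongs inside corefx's SafeSocketHandle; the flag and method here are hypothetical):

```csharp
using System.Net.Sockets;

static class UnblockSketch
{
    // 'userCalledShutdownSend' stands in for state the handle would track
    // when the user calls Shutdown(SocketShutdown.Send) or Shutdown(Both).
    public static void Unblock(Socket socket, bool userCalledShutdownSend)
    {
        if (userCalledShutdownSend)
        {
            // The user already asked for a graceful FIN; shutting down rather
            // than disconnecting unblocks pending operations without a RST.
            socket.Shutdown(SocketShutdown.Both);
        }
        else
        {
            // Previous behavior: abortive disconnect, which discards buffered
            // send data and makes the peer see a RST.
            socket.Disconnect(reuseSocket: false);
        }
    }
}
```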

@halter73 does this match with the test?

@halter73
Member Author

@halter73 does this match with the test?

I think so. In the test, the client is still receiving data for several seconds after the server calls Socket.Shutdown(SocketShutdown.Both). Then again, so does the client in my gist, which I still haven't seen repro the issue.

@msftgits msftgits transferred this issue from dotnet/corefx Feb 1, 2020
@msftgits msftgits added this to the 5.0 milestone Feb 1, 2020
@ghost ghost locked as resolved and limited conversation to collaborators Dec 12, 2020