Socket.Connect methods that connect to multiple endpoints are broken on Unix #16263

ericeil · 2016-02-01T21:31:44Z

Unlike Windows, the sockets API on Linux and OSX does not allow a single socket to attempt more than one connection. We had attempted to do this on Linux, but it did not work (see #16238).

Socket has multiple methods that depend on being able to try connections to multiple endpoints on a single socket:

public void Connect(IPAddress[] addresses, int port)

...attempts to connect to each of addresses, returning when one of them is successful.

public void Connect(string host, int port)

...does a DNS lookup, which may return multiple addresses, then attempts to connect to each of them until successful.

public void Connect(EndPoint remoteEP)

If this is passed a DnsEndPoint, then this behaves the same as Connect(string, int), trying all resolved addresses.

Additionally, we support async variants of all of these.

On Unix we currently "fake" this by creating multiple temporary sockets behind the scenes, trying to connect each of them to a different address. If one succeeds, we immediately disconnect it and connect the "real" socket to that address. This workaround has some possibly unacceptable implications:

Whichever endpoint we finally choose, we will end up connecting to it twice, once for the "probe" connection, to see if it will work, and again for the "real" connection. This will cause issues for servers that do not expect more than one connection, for load balancers that try to distribute connections, etc. It's also a lot of undesired network overhead.
Even though the first connection attempt succeeded, the second will not necessarily succeed. This is especially true in scenarios like DNS-based load balancing. So we have a functional issue here, where we may fail to connect the "real" socket in cases where we would have succeeded.

I do not think we should leave this as-is, as it will create too many unexpected strange behaviors in code that uses these, expecting them to behave the same way as they do on Windows. So I propose doing one of the following:

a) Remove the Connect overloads that take multiple endpoints, and the DnsEndpoint type, from the public contracts.

-- or --

b) Change these Connect overloads to throw PlatformNotSupportedException on Unix, Also, change Connect(EndPoint, int) to throw PlatformNotSupportedException if passed a DnsEndPoint.

In either case, we should provide sample code for how to code equivalent behavior using the remaining APIs.

@stephentoub, @pgavlin, @CIPop, @davidsh, @SidharthNabar, please provide feedback.

The text was updated successfully, but these errors were encountered:

pgavlin · 2016-02-01T21:56:18Z

I feel like I should also point out another option: we could replace all of the instance Connect methods with static equivalents that return a connected Socket instance. This design would be implementable on any POSIX-compliant sockets implementation and would not imply the removal of DnsEndpoint. Unfortunately, it has the distinct downside of dramatically changing the surface area of the S.N.Sockets contract.

I'm personally in favor of option (b) with the possible addition of a toolable notification that the existing APIs should not be used for cross-platform code (e.g. a Roslyn diagnostic or some appropriate attribute).

davidsh · 2016-02-01T23:38:54Z

b) Change these Connect overloads to throw PlatformNotSupportedException on Unix, Also, change Connect(EndPoint, int) to throw PlatformNotSupportedException if passed a DnsEndPoint.

I would vote for b).

stephentoub · 2016-02-02T02:47:30Z

Sounds like we should go for (b), ideally where the message in the PlatformNotSupportedException provides some guidance.

bgrainger · 2016-04-08T18:48:17Z

I just tried to run a .NET Core app (that uses sockets) on Ubuntu 14.04. Calling Socket.Connect(hostname, port) threw a PlatformNotSupportedException : Sockets on this platform are invalid for use after a failed connection attempt, and as a result do not support attempts to connect to multiple endpoints.

I did not find this exception message to be helpful in explaining the underlying issue or advising how to avoid it. At first I thought the message meant that I was attempting to reuse a socket after a failed connection attempt (which didn't make sense). Eventually I found 30bd4b7 and (thanks to the very helpful commit message) understood the problem (and the workaround).

I propose two improvements:

Rewrite the exception message to clearly indicate the specific problem and ideally a workaround: e.g., "Connect(string) and Connect(IPAddress[]) are not supported on this platform; use Connect(IPAddress) or Connect(EndPoint) instead", or "This overload of (Begin)Connect is not supported on this platform; see aka.ms/... for more information" (which could link to an issue or wiki page here?).
Mark the non-supported Connect overloads as [Obsolete], even for Windows builds. If I'm writing a library that I intend to be cross-platform, I'd like notice as soon as possible that the code I'm writing will not work on other platforms, especially if I'm typically building and testing on Windows.

I'd be happy to open a PR for either/both of these suggestions if you're in favour of them.

stephentoub · 2016-04-08T18:56:36Z

@bgrainger, sorry the error message wasn't as helpful as it could have been (but I'm glad the commit message was). You're welcome to submit a PR to improve it. As for marking the overloads as [Obsolete], that's a question for @davidsh or @CIPop.

Socket.Connect overloads that might attempt to create connections to multiple IP addresses are not supported: https://github.com/dotnet/corefx/issues/5829. As per dotnet/corefx@30bd4b7, the workaround is to resolve the name to a collection of IP addresses and try them sequentially. Also implemented support for comma-delimited hostnames (even though this appears to have been dropped from the official connector in mysql/mysql-connector-net@03f3a23.)

NickCraver · 2016-05-22T11:23:22Z

@stephentoub Was the intent here really to disable connecting to sockets via hostname on some platforms? Option 2 in that commit rather needlessly breaks StackExchange.Redis on OS X for example, even when trying to connect to a single endpoint (as we always do).

We just got this issue filed: StackExchange/StackExchange.Redis#410 All of the socket code I've written goes to a single host and that's all broken cross-platform now, without merit I think.

IMO, we can and should break far less code with Option 3 and a clearer commit message here. There's no need to break people aside from erroring on what would actually break.

stephentoub · 2016-05-22T13:37:26Z

@NickCraver, if you open a new issue to @ericeil, we can reevaluate.

NickCraver · 2016-05-22T13:42:37Z

@stephentoub k, will do that when time allows today - thanks.

…forms. Having the socket do the DNS lookup is apparently not supported by corefx. See issue: https://github.com/dotnet/corefx/issues/5829 and https://github.com/dotnet/corefx/issues/8768.

ericeil self-assigned this Feb 1, 2016

stephentoub assigned stephentoub and unassigned ericeil Feb 3, 2016

stephentoub closed this as completed in dotnet/corefx#5867 Feb 6, 2016

shayhatsor referenced this issue in shayhatsor/zookeeper Apr 16, 2016

new DynamicHostProvider in reaction to dotnet/corefx#5829

7fdc872

msftgits transferred this issue from dotnet/corefx Jan 31, 2020

msftgits added this to the 1.0.0-rtm milestone Jan 31, 2020

cocowalla mentioned this issue Jan 31, 2020

PlatformNotSupportedException with TcpClient.ConnectAsync #24917

Closed

ghost locked as resolved and limited conversation to collaborators Jan 3, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Socket.Connect methods that connect to multiple endpoints are broken on Unix #16263

Socket.Connect methods that connect to multiple endpoints are broken on Unix #16263

ericeil commented Feb 1, 2016

pgavlin commented Feb 1, 2016

davidsh commented Feb 1, 2016

stephentoub commented Feb 2, 2016

bgrainger commented Apr 8, 2016

stephentoub commented Apr 8, 2016

NickCraver commented May 22, 2016

stephentoub commented May 22, 2016

NickCraver commented May 22, 2016

Socket.Connect methods that connect to multiple endpoints are broken on Unix #16263

Socket.Connect methods that connect to multiple endpoints are broken on Unix #16263

Comments

ericeil commented Feb 1, 2016

pgavlin commented Feb 1, 2016

davidsh commented Feb 1, 2016

stephentoub commented Feb 2, 2016

bgrainger commented Apr 8, 2016

stephentoub commented Apr 8, 2016

NickCraver commented May 22, 2016

stephentoub commented May 22, 2016

NickCraver commented May 22, 2016