
Connection counts climbing far past HighWater setting #6286

Closed
leerspace opened this issue May 1, 2019 · 18 comments

@leerspace
Contributor

Version information:

$ ipfs version --all
go-ipfs version: 0.4.20-
Repo version: 7
System version: amd64/linux
Golang version: go1.12.4

Type:

bug

Description:

While using my node, the connection counts started climbing rapidly past the HighWater connection setting and seemed stuck in a climb -- reaching 2000+ connections before I shut down the daemon (see the ipfs.peers file in the link below). At the time of the climb I think I was pinning a few hashes and publishing an IPNS entry.

It looks like there are a couple of old issues that sound similar (e.g., #4718, #5248) but they are closed. I wonder if this could be related to #3532, but it's not clear to me if connections are building rapidly in that issue.

My node's LowWater and HighWater connection counts are set to the defaults (see ipfs.config for full output from ipfs config show). QUIC and EnableAutoRelay are both enabled, but EnableRelayHop is not.
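
For reference, the stock ConnMgr section looks roughly like this (the exact default values are from memory of the 0.4.x era, so treat them as an assumption rather than gospel):

    "ConnMgr": {
      "Type": "basic",
      "LowWater": 600,
      "HighWater": 900,
      "GracePeriod": "20s"
    }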

Debug data gathered using these instructions, along with ipfs swarm peers and ipfs config show output from after the issue started and before the daemon was killed, is available here:

https://ipfs.io/ipfs/QmZqhucUHSoW3WzXsu7gepmQX9NqQzkjcsQAN1r4kjQYBH

@vyzo
Contributor

vyzo commented May 1, 2019

Have you recently advertised as a relay hop with autorelay?
The provider record stays for 24hrs, and that would certainly inundate you with new connections.
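
If you want to double-check, both relay flags can be read straight from the config (assuming the standard Swarm keys):

    $ ipfs config Swarm.EnableRelayHop
    $ ipfs config Swarm.EnableAutoRelay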

@leerspace changed the title from "Connection" to "Connection counts climbing far past HighWater setting" May 1, 2019
@leerspace
Contributor Author

leerspace commented May 1, 2019

@vyzo Assuming I would do that using the EnableRelayHop setting, I have not done that on this node as far as I can remember (I'd say 99% sure).

@Stebalien
Member

Stebalien commented May 1, 2019

@leerspace could you post the output of ipfs swarm peers -v? The -v will help me figure out whether these are inbound or outbound connections and which protocols you're speaking.

@leerspace
Contributor Author

I didn't think to get verbose output from swarm peers before, but once I notice it happening again I'll grab it and add it to this issue.

@leerspace
Contributor Author

I got this to happen again to some extent (2000+ peers), but I think this is probably just a duplicate of #6283.

Here's the ipfs swarm peers -v output in case I'm wrong (see 1556850569 for example, file names are unix timestamps from date +%s): https://ipfs.io/ipfs/QmSzBfiMwU5ChkMtZpFbUtWWXQPQBhbPSg6kh5GZQxBx6x

I should probably be using ipfs daemon --routing=dhtclient on this node as suggested in another issue since the *Water connection thresholds don't seem to keep connections under control with default routing.
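
For anyone else landing here, that's just a daemon flag; if memory serves there is also a persistent Routing.Type config option that does the same thing (treat the config key as an assumption and check ipfs config show on your version):

    $ ipfs daemon --routing=dhtclient
    # or, persistently:
    $ ipfs config Routing.Type dhtclient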

@Stebalien
Member

Unfortunately, we don't (yet) have anything to simply stop new connections. Libp2p needs to feed the connection manager down through to the transports themselves.

But still, that's a lot of inbound connections.

@momack2 momack2 added this to Done in ipfs/go-ipfs May 9, 2019
@swedneck
Contributor

I seem to still be running into this with v0.4.21: my node consistently has around 8000 peers even though my HighWater is set to 900.

@dokterbob
Contributor

dokterbob commented Aug 25, 2019

Same problem with 0.4.22. With the following settings:

    "ConnMgr": {
      "GracePeriod": "30s",
      "HighWater": 15000,
      "LowWater": 10000,
      "Type": "basic"
    },

I'm consistently seeing 35-50k connections, which is essentially bringing our server to its knees.

Lowering it down to:

    "ConnMgr": {
      "Type": "basic",
      "LowWater": 3000,
      "HighWater": 5000,
      "GracePeriod": "30s"
    }

Still yields about 35-40k connections.

Lowering it further to:

    "ConnMgr": {
      "Type": "basic",
      "LowWater": 1000,
      "HighWater": 3000,
      "GracePeriod": "30s"
    }

Still gives around 35k connections!

@Stebalien There really seems to be an issue here!

This seems to be somewhat of a runaway feedback loop: once the DHT starts routing through a node and it proves to be a good peer, more peers use it and it gets overloaded. Or something like that.

Note that we're consistently fetching 100-150 files (ipfs-search).

ipfs version --all:

go-ipfs version: 0.4.22-
Repo version: 7
System version: amd64/linux
Golang version: go1.12.7

@dokterbob
Contributor

Additional note: this seems to have started right after I enabled RelayHop and AutoRelay (second vertical white line), which I then quickly disabled (third vertical line); the first line is the 0.4.22 upgrade. Could the removal not have propagated well throughout the DHT?

[screenshot, 2019-08-25: connection count graph with the three events marked as vertical lines]

@dokterbob
Contributor

Sadly, this problem persists. I think it's time to reopen this issue.
[screenshot: connection count graph]

On an 8-core CPU it soaks up a good 700% of load (purple here is IPFS).
[screenshot: CPU usage graph]

@aschmahmann
Contributor

@dokterbob @Stebalien could confirm, but I think that enabling both EnableRelayHop (I'm willing to serve as a relay) and EnableAutoRelay (I'm looking for relays) together is a bad idea. In theory your advertisements should have disappeared a day after you turned them off, but it looks like some nodes in the network have decided to continue advertising for you.

My understanding is that the plan is to remove serving as a relay node from IPFS and make it available through the libp2p daemon instead, to minimize confusion and people shooting themselves in the foot. In the meantime, if you can, I'd recommend rotating your node's peerID to a new one, which should restore your traffic to normal.

If you need any help figuring out how to do peerID rotation that's probably best asked on discuss.ipfs.io.
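
For completeness, a blunt sketch of such a rotation (not an official procedure, and it assumes you can afford to re-add or re-fetch your pinned data):

    # save the current pin set
    $ ipfs pin ls --type=recursive -q > pins.txt
    # stop the daemon, move the old repo (and its identity) aside, re-init
    $ mv ~/.ipfs ~/.ipfs.bak
    $ ipfs init
    # start the daemon again, then re-pin from the saved list
    $ xargs -n1 ipfs pin add < pins.txt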

@Stebalien
Member

^^ That's the issue.

@dokterbob
Contributor

dokterbob commented Aug 27, 2019

@aschmahmann Great, thanks for the quick feedback!

Note that the [config doc] specifically states:

EnableAutoRelay Enables automatic relay for this node. If the node is a HOP relay (EnableRelayHop is true) then it will advertise itself as a relay through the DHT.

Note that, by now, my server is slowly returning to normal.

@Stebalien
Member

Note that the [config doc] specifically states:

Are you noting that the current behavior is documented or is the documentation confusing?

@dokterbob
Contributor

dokterbob commented Aug 28, 2019 via email

@Stebalien
Member

Got it. I agree that the way these flags interact with each other is really confusing, and I'll try to improve the documentation to make it less so.

@Stebalien
Member

Actually, @vyzo, could you take a pass at this?

@dokterbob
Contributor

dokterbob commented Aug 28, 2019 via email
