
eth, les, light: enforce CHT checkpoints on fast-sync too #19468

Merged 1 commit on Apr 17, 2019

Conversation

karalabe
Member

Fast sync is susceptible to a griefing variant of an eclipse attack, where a malicious remote node tries to get a freshly started Geth node to fast sync onto some tiny chain before the real, heavy chain is discovered in the network. This results in Geth falling back to full sync for the main chain, which takes far too long.

This attack can only be meaningfully mounted against nodes that are properly exposed on a public IP address (i.e. not firewalled, not NATed). Even then, it is a race against the node finding good peers fast enough; if it does, the attack no longer works.

There is no economic advantage in pulling this attack off; it only causes sync annoyance. That said, there are currently a number of Parity nodes (at least 4 identified, maybe more) at 207.148.5.229 doing variations of this attack. It might be deliberate, or it might be leftover nodes from some experiment that only have a few blocks on their chain and subsequently disabled sync, though that seems a bit improbable.


This PR repurposes the DAO challenge into a checkpoint challenge based on the recently hard-coded CHTs. It also makes the challenge stricter: while the node is doing a fast sync, remote peers are not permitted to be synced below the checkpoint block. This should also help sync the chain faster by getting rid of stalling or useless peers.
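
(For illustration, a rough Go sketch of the challenge flow described above; the identifiers checkpointNumber, syncChallengeTimeout and removePeer are approximations, not necessarily the exact ones introduced by this PR:)

// After the eth handshake, ask the peer for the single header at the
// hard-coded checkpoint number and arm a timer that drops the peer if
// no valid answer arrives in time.
func (pm *ProtocolManager) challengePeer(p *peer) error {
        // Request exactly one header, at the checkpoint block number.
        if err := p.RequestHeadersByNumber(pm.checkpointNumber, 1, 0, false); err != nil {
                return err
        }
        // If the peer stays silent past the deadline, disconnect it.
        p.syncDrop = time.AfterFunc(syncChallengeTimeout, func() {
                p.Log().Warn("Checkpoint challenge timed out, dropping", "addr", p.RemoteAddr(), "type", p.Name())
                pm.removePeer(p.id)
        })
        return nil
}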

@karalabe karalabe added this to the 1.9.0 milestone Apr 16, 2019
@karalabe karalabe requested a review from holiman April 16, 2019 10:33
@karalabe karalabe force-pushed the enforce-fastsync-checkpoints branch from f64d13d to 4a07d42 on April 16, 2019 12:34
@holiman
Contributor

holiman commented Apr 16, 2019

This looks clean and nice; however, it has the drawback that peers undergoing sync won't help other unsynced nodes. Would it be possible instead to allow unsynced peers to connect to us once we have already found a good pivot? Once we're settled on a pivot and have started syncing, we are no longer vulnerable to the tarpit-fastsync attack described above.

@karalabe
Member Author

It makes things more complicated. Currently the eth protocol manager that handles connecting peers doesn't know much about the downloader, and definitely nothing about its internal state. I'm also not sure about the change in general, as we were considering doing exactly this: disconnecting peers that can't serve fast-sync data.

Whilst I agree that we're going from one end of the spectrum (allow everyone) to the opposite (allow only useful peers), I'm not sure it makes sense to complicate things. There are a few thousand peers in the network, and maybe a handful that are currently syncing. Perhaps let's use the thousand synced ones first and only then try to help the remaining joiners. Yes, it puts a bit more burden on the existing peers (that is, if I have a joining peer too), but it also makes us sync faster, so we can in turn meaningfully help the network sooner.

@holiman
Contributor

holiman commented Apr 16, 2019

Yes, I agree that it's probably the sanest choice for now.

@karalabe
Member Author

Damn, found a tiny bug. I didn't stop the challenge timer on non-fast sync :P Bleah, need to add a test.

@karalabe
Member Author

diff --git a/eth/handler.go b/eth/handler.go
index 9d29f8cb1..bd414e068 100644
--- a/eth/handler.go
+++ b/eth/handler.go
@@ -447,13 +447,13 @@ func (pm *ProtocolManager) handleMsg(p *peer) error {
                }
                // If no headers were received, but we're expecting a checkpoint header, maybe it's that
                if len(headers) == 0 && p.syncDrop != nil {
+                       p.syncDrop.Stop()
+                       p.syncDrop = nil
+
                        // If we're doing a fast sync, we must enforce the checkpoint block to avoid
                        // eclipse attacks. Unsynced nodes are welcome to connect after we're done
                        // joining the network
                        if atomic.LoadUint32(&pm.fastSync) == 1 {
-                               p.syncDrop.Stop()
-                               p.syncDrop = nil
-
                                p.Log().Warn("Dropping unsynced node during fast sync", "addr", p.RemoteAddr(), "type", p.Name())
                                return errors.New("unsynced node cannot serve fast sync")
                        }
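
(For illustration only, a self-contained toy, not the actual test added in this PR, demonstrating the regression the hunk above fixes: with the Stop call inside the fast-sync branch, a full-sync node leaves the challenge timer armed and later drops an innocent peer when it fires.)

package main

import (
        "fmt"
        "sync/atomic"
        "time"
)

func main() {
        for _, fastSync := range []uint32{0, 1} {
                dropped := make(chan struct{}, 1)

                // Arm the challenge timer, as done right after the handshake.
                syncDrop := time.AfterFunc(50*time.Millisecond, func() { dropped <- struct{}{} })

                // An empty header response arrives: stop the timer
                // unconditionally, before checking the sync mode (the fix).
                syncDrop.Stop()
                if atomic.LoadUint32(&fastSync) == 1 {
                        fmt.Println("fast sync: unsynced peer would be dropped immediately")
                }

                // The timer must never fire, even on a full-sync node.
                select {
                case <-dropped:
                        fmt.Println("BUG: challenge timer fired after the response")
                case <-time.After(100 * time.Millisecond):
                        fmt.Println("ok: timer stopped with fastSync =", fastSync)
                }
        }
}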

@karalabe
Member Author

Fixed and tested. @holiman PTAL

@karalabe
Member Author

@matthalp This PR might interest you. It changes the fork challenge semantics: instead of requiring a peer to be on the same side of the DAO fork, it requires it to serve the same header as the one contained in the remote peer's hard-coded CHT. I think you had some nodes that aggressively pruned headers too; this might make things more complicated for them, since the challenged header can be arbitrary, not a specific fork block header.
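
(Schematically, the difference is which header the challenge requests; the numbers below are illustrative, using mainnet's DAO fork block and go-ethereum's CHT section size:)

package main

import "fmt"

const (
        daoForkBlock   = 1920000 // old challenge: one fixed, well-known block
        chtSectionSize = 32768   // each CHT section covers 32768 blocks
)

// checkpointBlock is the last block covered by CHT section `index`; the new
// challenge targets this moving number instead of a fixed fork block.
func checkpointBlock(index uint64) uint64 {
        return (index+1)*chtSectionSize - 1
}

func main() {
        fmt.Println("DAO challenge header: ", daoForkBlock)
        fmt.Println("CHT section 200 head:", checkpointBlock(200)) // 6586367
}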

@holiman
Contributor

holiman commented Apr 17, 2019

I approve

@karalabe
Member Author

Fixed the linter and squashed.

@karalabe karalabe force-pushed the enforce-fastsync-checkpoints branch from 234bcbf to 38f6b85 on April 17, 2019 10:16
@rjl493456442
Member

LGTM

@Matthalp-zz
Contributor

Thanks for keeping me in the loop @karalabe! Fortunately we do keep all headers along the canonical chain, so I don't anticipate any problems.
