Headers-based synchronization and parallel block download #2964

sipa · 2013-08-31T20:47:14Z

(see individual commits, in particular "Switch to headers-based synchronization", for detailed explanation of the changes).

This implements headers-based synchronization and parallel block download. Summary of the changes:

Use getheaders instead of getblocks based synchronization, so that the best block in the chain is always known before actually downloading it. No more orphan blocks, ever.
Use a moving-window based block downloading mechanism (the heuristics are simple and behave badly for very small blocks, but except for the very beginning of the chain, it works very well).
Allow blocks to be stored out-of-order in the block database files.
Reorganize the block connection logic significantly.
Add a getblockheader RPC command.

gmaxwell · 2013-09-01T11:31:50Z

Pulltester where are you?

sipa · 2013-09-01T13:04:26Z

Dang. @TheBlueMatt: how does pulltester send/announce blocks, does it answer getheaders, and how does it know a block was received?

TheBlueMatt · 2013-09-03T20:24:19Z

Currently block-tester doesnt support getheaders requests, so that needs fixed

theuni · 2013-09-12T01:10:03Z

@sipa fyi, now you can easily run the same scripts as pull-tester to check locally, in case you're looking into fixing up the block-tester. Just add --with-comparison-tool to your configure, and this test will be enabled during 'make check':

./configure --with-comparison-tool=/tmp/BitcoindComparisonTool.jar
make
make check

If you replace the .jar, just 'make check' again.

sipa · 2013-09-13T11:08:09Z

Rebased, though I haven't investigated the pulltester failure further.

I assume it has to do with actually changed semantics, as the best known not-known-to-be-invalid header chain is always considered the main chain, and pindexBest is always within this chain. As the decision is not just made per-received-block, you could have a valid chain A-B-C, and another chain A-B-C'-D, where D is invalid, in which case C' is reported as best chain instead of C (assuming they have the same amount of work, so the choice should be just as good). Reverting to the old behaviour here would require a hack like reducing the scoring of blocks that have invalid successors for example, but that seems pointless to me, and there are likely other differences as well.

I suppose that during reorganizations, pulltester should just accept any not-invalid block chain with maximum PoW as correct?

sipa · 2013-09-14T11:42:44Z

@TheBlueMatt Any help decoding pulltester's output?

TheBlueMatt · 2013-09-14T17:56:26Z

Have you tried the latest comparison tool jar from https://github.com/TheBlueMatt/test-scripts ?

sipa · 2013-09-17T21:27:18Z

@TheBlueMatt (as discussed on IRC) Yes, this gets us further already, but it seems the getheaders implementation in the comparison tool is too incomplete to do a successful multi-block reorg with headers-first. So this means fixing that becomes sort of a blocker for this; unfortunately, I don't have the time any time soon to look into that.

SendMessages() tries to acquire a cs_main lock now, but this isn't nessecary for much of its functionality. Move those parts out of the locked section, so they can always be performed, and we hold cs_main for a shorter time.

This fixes some weird reorganization issues that probably don't normally occur, but can be triggered when blacklisting arbitrary blocks is enabled.

As this is a fairly dangerous operation, it is normally disabled when building. Define ENABLE_BLOCK_BLACKLISTING to enable compiling it in.

Instead of using "getblocks" to learn about the hashes of blocks peers have, use "getheaders" to retrieve the headers of those blocks entirely. This allows the client to immediately build an internal representation of the block tree. This means the best chain (or at least its most likely candidate, disregarding invalid transactions) is almost instantly known, and block download can proceed as a secondary (though in practice simultaneous) and very robust process to acquire the transaction data along this already-known best chain. Advantages include: * No more vulnerabilities related to orphan blocks, as they simply don't exist anymore. We only fetch blocks along the best chain; any other block we're not interested in. * No more need for checkpoints to determine bounds on difficulties of received blocks, as we never receive a block without knowing (the headers of) its ancestry. For a new, smaller vulnerability (sending many long low-difficulty header chains, consuming memory), checkpoints are still useful. * No more need for checkpoints for safety when disabling signature check. A rule like "Do not check signatures in blocks covered by N weeks worth of hash power" could be used, though this is currently not yet implemented. * The IsInitialBlockDownload() become very precises and safe: we're syncing a backlog as long as there are headers in our best chain for which blocks are missing. There is no need for relying on peers' reported number of blocks anymore. * It can serve as a basis to implement SPV mode, which is basically the same logic, but with the blockchain-driven block fetching replaced by wallet-driven filtered block fetching. * Efficient parallel block download, and generally avoiding many edge cases that were possible during the previous getblocks-based synchronization (including frequent duplicate downloads, orphans and and needing to wait for the next block to be announced before resuming). Additionally, good synchronization speed does not rely on guessing a good initial peer to fetch from anymore - a single slow peer does not hurt you anymore. This also means that throttling upload speeds will no longer be a problem to the network. For now, block sync is only done from outbound peers, to retain the "no listening == less bandwidth" property that currently exist. As soon as enough nodes upgrade to headers-first sync, this can be removed. Potentially significant semantical changes: * In case a new best chain is known, but its tip blocks are not yet known, the best known ancestor of the new tip is used instead of the old tip. On the other hand, during this period it is known that we're not up to date. The effects of this on mining should be considered, probably. More in detail, changes include: * All redundant globals are gone (nBestHeight, nBestChainWork, nBestInvalidWork, hashBestChain) - instead, we just have 4 pointers into the block tree (pindexGenesisBlock, pindexBest, pindexBestHeader, pindexBestInvalid). * ProcessBlock and AcceptBlock are split into ProcessBlockHeader and AcceptBlockHeader. * Instead of 'getblocks', push out 'getheaders' to the sync node. * Implement 'headers' P2P handler, causing headers to be included in the block index. * pindexBest remains as pointer to the "current state of the UTXO set" (chainstate), meaning the point until where we are synchronized. pindexBestHeader is added as a pointer to the tip of the currently best known (and validated, except for transaction validity) header. pindexBest is guaranteed to be always an ancestor of (or equal to) pindexBestHeader. * A new function SwitchToBestHeader finds the best header in the database, and switches to it. SwitchToBestBlock connects blocks along this best chain. These replace ConnectBestBlock. * The block tree internally can deal with orphan blocks (whose parents are unknown), but this mechanism is only used during reindexing, as block may be stored out-of-order. A new function LinkOrphans connects orphans to the tree, and replaces some code in LoadBlockIndex. * Actual blocks are only fetched along the known-best chain, using parallel block downloading logic in SendMessages. For each peer, a (bounded) list of requested blocks is maintained. New requests are pipelined, as the requested blocks arrive. As the tree they belong to is already known, we can receive and store them out-of-order. This also means that the concept of orphan blocks disappears. If one peer stalls us, it is disconnected and the blocks requested from it are sent elsewhere. This is nicer both for us and for them. * One exception to the headers-first syncing, is when we are fully synced (pindexBest==pindexBestHeader), and a new block 'inv' is announced. In this case, the block is fetched immediately ('getdata') after asking for the headers preceding it, to avoid the extra roundtrip. * SetBestChain (which handled reorganizations) is no longer necessary, as SwitchToBestHeader and SwitchToBestBlock only ever connect/disconnect individual blocks. It is therefore replaced by two simpler functions, ConnectTip and DisconnectTip, implented using helpers WriteChainState and UpdateTip. * In several places code is added to deal with the case that a CBlockIndex entry does not have corresponding block data available.

BitcoinPullTester · 2013-10-04T07:03:18Z

Automatic sanity-testing: FAILED MERGE, see http://jenkins.bluematt.me/pull-tester/bd49d084540ebd15c19b3dcb660802aa40da5f2d for test log.

This pull does not merge cleanly onto current master
This test script verifies pulls every time they are updated. It, however, dies sometimes and fails to test properly. If you are waiting on a test, please check timestamps to verify that the test.log is moving at http://jenkins.bluematt.me/pull-tester/current/
Contact BlueMatt on freenode if something looks broken.

sipa · 2013-11-16T16:57:49Z

Closing. This is outdated, as I've been reworking this as more managable patches/pull requests.

haltingstate · 2014-05-01T11:41:04Z

The blockchain takes weeks to download. Please merge this in ASAP. This is the absolutely most important thing right now.

laanwj · 2014-05-01T12:00:13Z

@haltingstate Then help reviewing and testing #3884, which is a step towards this.

sipa mentioned this pull request Sep 23, 2013

Don't store or send blocks forked before last checkpoint. #2910

Merged

sipa and others added 11 commits September 28, 2013 16:05

Allow SendMessages to run partially without cs_main

ad3f970

SendMessages() tries to acquire a cs_main lock now, but this isn't nessecary for much of its functionality. Move those parts out of the locked section, so they can always be performed, and we hold cs_main for a shorter time.

Run node deletions outside of cs_vNodes

6b6f089

Push down cs_main locking in ProcessMessage

7826f7e

Improve handling of obscure reorganizations

1646093

This fixes some weird reorganization issues that probably don't normally occur, but can be triggered when blacklisting arbitrary blocks is enabled.

Add blacklistblock RPC

dee601d

As this is a fairly dangerous operation, it is normally disabled when building. Define ENABLE_BLOCK_BLACKLISTING to enable compiling it in.

RPC updates after header-first synchronization

8f05f6f

Enforce first not-invalid best-pow chain

da5cbf6

Only prevent historic blocks from being downloaded from inbound peers

924e992

Print incoming peer's service bits

668a9b1

Debugging

bd49d08

laanwj mentioned this pull request Oct 26, 2013

already have block - causing bandwidth to be wasted - vicious circle? #1120

Closed

sipa closed this Nov 16, 2013

sipa mentioned this pull request Mar 18, 2014

Speed limit / throttle network usage #273

Closed

sipa mentioned this pull request May 9, 2014

Split headers-only versions off CheckBlock/AcceptBlock #3884

Merged

bitcoin locked as resolved and limited conversation to collaborators Sep 8, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Headers-based synchronization and parallel block download #2964

Headers-based synchronization and parallel block download #2964

sipa commented Aug 31, 2013

gmaxwell commented Sep 1, 2013

sipa commented Sep 1, 2013

TheBlueMatt commented Sep 3, 2013

theuni commented Sep 12, 2013

sipa commented Sep 13, 2013

sipa commented Sep 14, 2013

TheBlueMatt commented Sep 14, 2013

sipa commented Sep 17, 2013

BitcoinPullTester commented Oct 4, 2013

sipa commented Nov 16, 2013

haltingstate commented May 1, 2014

laanwj commented May 1, 2014

Headers-based synchronization and parallel block download #2964

Headers-based synchronization and parallel block download #2964

Conversation

sipa commented Aug 31, 2013

gmaxwell commented Sep 1, 2013

sipa commented Sep 1, 2013

TheBlueMatt commented Sep 3, 2013

theuni commented Sep 12, 2013

sipa commented Sep 13, 2013

sipa commented Sep 14, 2013

TheBlueMatt commented Sep 14, 2013

sipa commented Sep 17, 2013

BitcoinPullTester commented Oct 4, 2013

sipa commented Nov 16, 2013

haltingstate commented May 1, 2014

laanwj commented May 1, 2014