fix possible block db breakage during re-index #5864

theuni · 2015-03-07T01:48:08Z

I could really use some more eyes on this. I discussed it briefly with @sipa on IRC a few days ago.

I noticed this while looking into #5668 . This is one possible explanation I can come up with for overlapping block data. Whether it has anything to do with that issue or not, I think this still needs to be addressed.

When re-indexing, there are a few cases where garbage data may be skipped in the block files. In these cases, the indices are correctly written to the index db, however the pointer to the next position for writing in the current block file is calculated by adding the sizes of the valid blocks found.

As a result, when the re-index is finished, the index db is correct for all existing blocks, but the next block will be written to an incorrect offset, likely overwriting existing blocks.

Rather than using the sum of all valid blocks to determine the next write position, use the end of the last block written to the file. Don't assume that the current block is the last one in the file, since they may be read out-of-order.

I was able to trigger this problem by inserting some garbage data between two valid blocks on disk in the last .dat file, then reindexing. After that, run normally for a few min in order to write a few new blocks to disk, then run with -checkblocks=0.

Before this change, I would get different errors (de-serialization, eof, etc) depending on what garbage i add and where. After the change, it appears to survive the re-index without issue regardless of the garbage.

sipa · 2015-03-07T12:11:28Z

Seems very reasonable to me. Even if this doesn't fix anything, it should be safe.

pstratem · 2015-03-09T05:37:32Z

I ran into this issue today.

Power failured during IBD.

Restarted with -reindex=1

Waited for IBD to finish and then called getblock RPC call for all blocks and got about a dozen ReadBlockFromDisk

laanwj · 2015-03-09T08:26:57Z

src/main.cpp

    vinfoBlockFile[nFile].AddBlock(nHeight, nTime);
+    if (fKnown && vinfoBlockFile[nFile].nSize < (pos.nPos + nAddSize))
+        vinfoBlockFile[nFile].nSize = pos.nPos + nAddSize;


Looks good to me, although I'm not sure about the fKnown && vinfoBlockFile[nFile].nSize >= (pos.nPos + nAddSize) case. Would we want to increase nSize at all? Or is this better:

if (fKnown) vinfoBlockFile[nFile].nSize = std::max(pos.nPos + nAddSize, vinfoBlockFile[nFile].nSize); else vinfoBlockFile[nFile].nSize += nAddSize;

ACK this suggested change.

When can I fetch latest source code, compile and test?

On Mon, Mar 9, 2015 at 1:01 PM, Jeff Garzik notifications@github.com
wrote:

In src/main.cpp
#5864 (comment):

vinfoBlockFile[nFile].AddBlock(nHeight, nTime);

if (fKnown && vinfoBlockFile[nFile].nSize < (pos.nPos + nAddSize))

vinfoBlockFile[nFile].nSize = pos.nPos + nAddSize;

+1

—
Reply to this email directly or view it on GitHub
https://github.com/bitcoin/bitcoin/pull/5864/files#r26029884.

Mvh
-fredrik-normann-

Sent from my Gmail Account

@laanwj Yes, thanks for catching that.

morcos · 2015-03-09T18:54:25Z

I tested this in the this little RPC test, https://gist.github.com/morcos/ae506817284cd776d5b2.
Warning it does some raw file IO to munge your block files.
Tested and works with and without the suggested change, but fails on master.

theuni · 2015-03-09T19:55:07Z

Updated for @laanwj's suggestion.

@morcos Big thanks for that, much nicer than my manual testnet dd+hexedit hackery :)

When re-indexing, there are a few cases where garbage data may be skipped in the block files. In these cases, the indices are correctly written to the index db, however the pointer to the next position for writing in the current block file is calculated by adding the sizes of the valid blocks found. As a result, when the re-index is finished, the index db is correct for all existing blocks, but the next block will be written to an incorrect offset, likely overwriting existing blocks. Rather than using the sum of all valid blocks to determine the next write position, use the end of the last block written to the file. Don't assume that the current block is the last one in the file, since they may be read out-of-order.

bb6acff fix possible block db breakage during re-index (Cory Fields)

When re-indexing, there are a few cases where garbage data may be skipped in the block files. In these cases, the indices are correctly written to the index db, however the pointer to the next position for writing in the current block file is calculated by adding the sizes of the valid blocks found. As a result, when the re-index is finished, the index db is correct for all existing blocks, but the next block will be written to an incorrect offset, likely overwriting existing blocks. Rather than using the sum of all valid blocks to determine the next write position, use the end of the last block written to the file. Don't assume that the current block is the last one in the file, since they may be read out-of-order. Rebased-From: bb6acff Github-Pull: #5864

laanwj · 2015-03-11T07:48:09Z

Cherry-picked to 0.10 as 002c8a2

When re-indexing, there are a few cases where garbage data may be skipped in the block files. In these cases, the indices are correctly written to the index db, however the pointer to the next position for writing in the current block file is calculated by adding the sizes of the valid blocks found. As a result, when the re-index is finished, the index db is correct for all existing blocks, but the next block will be written to an incorrect offset, likely overwriting existing blocks. Rather than using the sum of all valid blocks to determine the next write position, use the end of the last block written to the file. Don't assume that the current block is the last one in the file, since they may be read out-of-order. Rebased-From: bb6acff Github-Pull: bitcoin#5864 (cherry picked from commit 002c8a2)

laanwj reviewed Mar 9, 2015
View reviewed changes

laanwj added the UTXO Db and Indexes label Mar 9, 2015

laanwj added this to the 0.10.0 milestone Mar 9, 2015

theuni mentioned this pull request Mar 9, 2015

ERROR: ReadBlockFromDisk : Deserialize or I/O error - ReadCompactSize() : size too large #5668

Closed

theuni force-pushed the fix-reindex-corruption branch from 9add379 to 8eb826f Compare March 9, 2015 18:55

theuni force-pushed the fix-reindex-corruption branch from 8eb826f to bb6acff Compare March 10, 2015 17:50

laanwj merged commit bb6acff into bitcoin:master Mar 11, 2015

laanwj added a commit that referenced this pull request Mar 11, 2015

Merge pull request #5864

45b7dc2

bb6acff fix possible block db breakage during re-index (Cory Fields)

laanwj mentioned this pull request Jul 31, 2015

"Read attempted past buffer limit" error while reindexing #6492

Closed

prairiefalcon mentioned this pull request Mar 7, 2021

Reindex subtly corrupts blknnnnn.dat files and causes subsequent deserialization errors #21379

Closed

bitcoin locked as resolved and limited conversation to collaborators Sep 8, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix possible block db breakage during re-index #5864

fix possible block db breakage during re-index #5864

theuni commented Mar 7, 2015

sipa commented Mar 7, 2015

pstratem commented Mar 9, 2015

laanwj Mar 9, 2015

sipa Mar 9, 2015

jgarzik Mar 9, 2015

eN0Rm Mar 9, 2015

theuni Mar 9, 2015

morcos commented Mar 9, 2015

theuni commented Mar 9, 2015

laanwj commented Mar 11, 2015

fix possible block db breakage during re-index #5864

fix possible block db breakage during re-index #5864

Conversation

theuni commented Mar 7, 2015

sipa commented Mar 7, 2015

pstratem commented Mar 9, 2015

laanwj Mar 9, 2015

Choose a reason for hiding this comment

sipa Mar 9, 2015

Choose a reason for hiding this comment

jgarzik Mar 9, 2015

Choose a reason for hiding this comment

eN0Rm Mar 9, 2015

Choose a reason for hiding this comment

theuni Mar 9, 2015

Choose a reason for hiding this comment

morcos commented Mar 9, 2015

theuni commented Mar 9, 2015

laanwj commented Mar 11, 2015