DB/Blockchain speed optimizations #337

NoodleDoodleNoodleDoodleNoodleDoodleNoo · 2015-07-15T08:31:04Z

Please refer to 4de799e for modification notes.

…hread to lock up.

…ction start This will assist in a DB resize check.

This currently only affects blockchain_import and blockchain_converter. When the number of blocks expected for the batch transaction is provided, make an estimate of the DB space needed. If not enough free space remains, resize the DB. The estimate is made based on: - the average size of the last 500 blocks, or if larger, a min. block size of 4k - a factor for the expanded size a block occupies in the DB across the sub-dbs/tables - a safety factor (1.7) to allow for a "reasonable" average block size increase over the batch Increase the DB size by whichever is greater: the estimated size needed or a minimum increase size, currently 128 MB. The conservative factors in the estimate help in testing that the resize occurs when needed, and without gratuitous size increases. For common use, the safety factor and minimum increase size could reasonably be increased. For testing, setting DEFAULT_MAPSIZE (blockchain_db/lmdb/db_lmdb.h) to 1 << 27 (128 MB) and recompiling will ensure DB resizes take place sooner and more frequently.

The system is mostly the Qt system, but we don't use Qt to avoid the dependencies. See README.i18n for details.

5d304ca Fix loop bug when calling core::get_block_template, causing calling thread to lock up. (NoodleDoodleNoodleDoodleNoodleDoodleNoo)

fd73d9c Check and resize if needed at batch transaction start (warptangent) f9e4afd blockchain_utilities: Increase debug statement's log level (warptangent) 699e4b3 blockchain_utilities: Pass expected number of blocks when starting batch (warptangent) 6e170c8 Optionally allow DB to know expected number of blocks at batch transaction start (warptangent)

78b2eab Translatable strings for simplewallet (moneromooo-monero)

ea58576 Add missing file - i18n.cpp (moneromooo-monero)

- bugfix: prevent re-entering db->get when current buffer contains all possible index values.

Fix compilation error

Bockchain: 1. Optim: Multi-thread long-hash computation when encountering groups of blocks. 2. Optim: Cache verified txs and return result from cache instead of re-checking whenever possible. 3. Optim: Preload output-keys when encoutering groups of blocks. Sort by amount and global-index before bulk querying database and multi-thread when possible. 4. Optim: Disable double spend check on block verification, double spend is already detected when trying to add blocks. 5. Optim: Multi-thread signature computation whenever possible. 6. Patch: Disable locking (recursive mutex) on called functions from check_tx_inputs which causes slowdowns (only seems to happen on ubuntu/VMs??? Reason: TBD) 7. Optim: Removed looped full-tx hash computation when retrieving transactions from pool (???). 8. Optim: Cache difficulty/timestamps (735 blocks) for next-difficulty calculations so that only 2 db reads per new block is needed when a new block arrives (instead of 1470 reads). Berkeley-DB: 1. Fix: 32-bit data errors causing wrong output global indices and failure to send blocks to peers (etc). 2. Fix: Unable to pop blocks on reorganize due to transaction errors. 3. Patch: Large number of transaction aborts when running multi-threaded bulk queries. 4. Patch: Insufficient locks error when running full sync. 5. Patch: Incorrect db stats when returning from an immediate exit from "pop block" operation. 6. Optim: Add bulk queries to get output global indices. 7. Optim: Modified output_keys table to store public_key+unlock_time+height for single transaction lookup (vs 3) 8. Optim: Used output_keys table retrieve public_keys instead of going through output_amounts->output_txs+output_indices->txs->output:public_key 9. Optim: Added thread-safe buffers used when multi-threading bulk queries. 10. Optim: Added support for nosync/write_nosync options for improved performance (*see --db-sync-mode option for details) 11. Mod: Added checkpoint thread and auto-remove-logs option. 12. *Now usable on 32-bit systems like RPI2. LMDB: 1. Optim: Added custom comparison for 256-bit key tables (minor speed-up, TBD: get actual effect) 2. Optim: Modified output_keys table to store public_key+unlock_time+height for single transaction lookup (vs 3) 3. Optim: Used output_keys table retrieve public_keys instead of going through output_amounts->output_txs+output_indices->txs->output:public_key 4. Optim: Added support for sync/writemap options for improved performance (*see --db-sync-mode option for details) 5. Mod: Auto resize to +1GB instead of multiplier x1.5 ETC: 1. Minor optimizations for slow-hash for ARM (RPI2). Incomplete. 2. Fix: 32-bit saturation bug when computing next difficulty on large blocks. [PENDING ISSUES] 1. Berkely db has a very slow "pop-block" operation. This is very noticeable on the RPI2 as it sometimes takes > 10 MINUTES to pop a block during reorganization. This does not happen very often however, most reorgs seem to take a few seconds but it possibly depends on the number of outputs present. TBD. 2. Berkeley db, possible bug "unable to allocate memory". TBD. [NEW OPTIONS] (*Currently all enabled for testing purposes) 1. --fast-block-sync arg=[0:1] (default: 1) a. 0 = Compute long hash per block (may take a while depending on CPU) b. 1 = Skip long-hash and verify blocks based on embedded known good block hashes (faster, minimal CPU dependence) 2. --db-sync-mode arg=[[safe|fast|fastest]:[sync|async]:[nblocks_per_sync]] (default: fastest:async:1000) a. safe = fdatasync/fsync (or equivalent) per stored block. Very slow, but safest option to protect against power-out/crash conditions. b. fast/fastest = Enables asynchronous fdatasync/fsync (or equivalent). Useful for battery operated devices or STABLE systems with UPS and/or systems with battery backed write cache/solid state cache. Fast - Write meta-data but defer data flush. Fastest - Defer meta-data and data flush. Sync - Flush data after nblocks_per_sync and wait. Async - Flush data after nblocks_per_sync but do not wait for the operation to finish. 3. --prep-blocks-threads arg=[n] (default: 4 or system max threads, whichever is lower) Max number of threads to use when computing long-hash in groups. 4. --show-time-stats arg=[0:1] (default: 1) Show benchmark related time stats. 5. --db-auto-remove-logs arg=[0:1] (default: 1) For berkeley-db only. Auto remove logs if enabled. **Note: lmdb and berkeley-db have changes to the tables and are not compatible with official git head version. At the moment, you need a full resync to use this optimized version. [PERFORMANCE COMPARISON] **Some figures are approximations only. Using a baseline machine of an i7-2600K+SSD+(with full pow computation): 1. The optimized lmdb/blockhain core can process blocks up to 585K for ~1.25 hours + download time, so it usually takes 2.5 hours to sync the full chain. 2. The current head with memory can process blocks up to 585K for ~4.2 hours + download time, so it usually takes 5.5 hours to sync the full chain. 3. The current head with lmdb can process blocks up to 585K for ~32 hours + download time and usually takes 36 hours to sync the full chain. Averate procesing times (with full pow computation): lmdb-optimized: 1. tx_ave = 2.5 ms / tx 2. block_ave = 5.87 ms / block memory-official-repo: 1. tx_ave = 8.85 ms / tx 2. block_ave = 19.68 ms / block lmdb-official-repo (0f4a036) 1. tx_ave = 47.8 ms / tx 2. block_ave = 64.2 ms / block **Note: The following data denotes processing times only (does not include p2p download time) lmdb-optimized processing times (with full pow computation): 1. Desktop, Quad-core / 8-threads 2600k (8Mb) - 1.25 hours processing time (--db-sync-mode=fastest:async:1000). 2. Laptop, Dual-core / 4-threads U4200 (3Mb) - 4.90 hours processing time (--db-sync-mode=fastest:async:1000). 3. Embedded, Quad-core / 4-threads Z3735F (2x1Mb) - 12.0 hours processing time (--db-sync-mode=fastest:async:1000). lmdb-optimized processing times (with per-block-checkpoint) 1. Desktop, Quad-core / 8-threads 2600k (8Mb) - 10 minutes processing time (--db-sync-mode=fastest:async:1000). berkeley-db optimized processing times (with full pow computation) 1. Desktop, Quad-core / 8-threads 2600k (8Mb) - 1.8 hours processing time (--db-sync-mode=fastest:async:1000). 2. RPI2. Improved from estimated 3 months(???) into 2.5 days (*Need 2AMP supply + Clock:1Ghz + [usb+ssd] to achieve this speed) (--db-sync-mode=fastest:async:1000). berkeley-db optimized processing times (with per-block-checkpoint) 1. RPI2. 12-15 hours (*Need 2AMP supply + Clock:1Ghz + [usb+ssd] to achieve this speed) (--db-sync-mode=fastest:async:1000).

Fixed OSX compilation issues due to random lmdb resize points. Fixed infinite loop bug when calling core::get_block_template(..).

Added option to cache tx-input verification results.

…oodleDoodleNoo/bitmonero

df94d02 Removed on_idle() calls to Blockchain::store_blockchain() for lmdb. Added option to cache tx-input verification results. (NoodleDoodleNoodleDoodleNoodleDoodleNoo) df4f42e Fixed binary size issue due to embedded checkpoint data. Fixed OSX compilation issues due to random lmdb resize points. Fixed infinite loop bug when calling core::get_block_template(..). (NoodleDoodleNoodleDoodleNoodleDoodleNoo) 12bdc01 Pause miner before preparing for incoming blocks (NoodleDoodleNoodleDoodleNoodleDoodleNoo) 4de799e ** CHANGES ARE EXPERIMENTAL (FOR TESTING ONLY) (NoodleDoodleNoodleDoodleNoodleDoodleNoo) f90b3b8 Update blockchain.cpp (NoodleDoodleNoodleDoodleNoodleDoodleNoo) 0d62d5b Update db_bdb.cpp (NoodleDoodleNoodleDoodleNoodleDoodleNoo) 81a0c31 Update db_bdb.cpp (NoodleDoodleNoodleDoodleNoodleDoodleNoo) c564e7d Update db_bdb.cpp (NoodleDoodleNoodleDoodleNoodleDoodleNoo) d495968 Experimental BDB workaround optimizations (NoodleDoodleNoodleDoodleNoodleDoodleNoo) 432feb0 Removed on_idle() calls to Blockchain::store_blockchain() for lmdb. Added option to cache tx-input verification results. (NoodleDoodleNoodleDoodleNoodleDoodleNoo) dec523f Fixed binary size issue due to embedded checkpoint data. Fixed OSX compilation issues due to random lmdb resize points. Fixed infinite loop bug when calling core::get_block_template(..). (NoodleDoodleNoodleDoodleNoodleDoodleNoo) 2c7384a Pause miner before preparing for incoming blocks (NoodleDoodleNoodleDoodleNoodleDoodleNoo) 4aea07c ** CHANGES ARE EXPERIMENTAL (FOR TESTING ONLY) (NoodleDoodleNoodleDoodleNoodleDoodleNoo) f67cb31 Update blockchain.cpp (NoodleDoodleNoodleDoodleNoodleDoodleNoo) 6109dd6 Update db_bdb.cpp (NoodleDoodleNoodleDoodleNoodleDoodleNoo) ea4c6e9 Update db_bdb.cpp (NoodleDoodleNoodleDoodleNoodleDoodleNoo) 474ac31 Update db_bdb.cpp (NoodleDoodleNoodleDoodleNoodleDoodleNoo) ee8bb5c Experimental BDB workaround optimizations (NoodleDoodleNoodleDoodleNoodleDoodleNoo)

….0.0 Release/3.1.0.0

NoodleDoodleNoodleDoodleNoodleDoodleNoo and others added 21 commits July 10, 2015 22:09

Fix loop bug when calling core::get_block_template, causing calling t…

5d304ca

…hread to lock up.

Optionally allow DB to know expected number of blocks at batch transa…

6e170c8

…ction start This will assist in a DB resize check.

blockchain_utilities: Pass expected number of blocks when starting batch

699e4b3

blockchain_utilities: Increase debug statement's log level

f9e4afd

Translatable strings for simplewallet

78b2eab

The system is mostly the Qt system, but we don't use Qt to avoid the dependencies. See README.i18n for details.

Merge pull request monero-project#333

10e50a4

5d304ca Fix loop bug when calling core::get_block_template, causing calling thread to lock up. (NoodleDoodleNoodleDoodleNoodleDoodleNoo)

Merge pull request monero-project#335

ad841cb

78b2eab Translatable strings for simplewallet (moneromooo-monero)

Add missing file - i18n.cpp

ea58576

Merge pull request monero-project#336

b484931

ea58576 Add missing file - i18n.cpp (moneromooo-monero)

Experimental BDB workaround optimizations

ee8bb5c

Update db_bdb.cpp

474ac31

Update db_bdb.cpp

ea4c6e9

Update db_bdb.cpp

6109dd6

- bugfix: prevent re-entering db->get when current buffer contains all possible index values.

Update blockchain.cpp

f67cb31

Fix compilation error

Pause miner before preparing for incoming blocks

2c7384a

Fixed binary size issue due to embedded checkpoint data.

dec523f

Fixed OSX compilation issues due to random lmdb resize points. Fixed infinite loop bug when calling core::get_block_template(..).

Removed on_idle() calls to Blockchain::store_blockchain() for lmdb.

432feb0

Added option to cache tx-input verification results.

Merge branch 'master' of https://github.com/NoodleDoodleNoodleDoodleN…

f0c8ff3

…oodleDoodleNoo/bitmonero

fluffypony merged commit f0c8ff3 into monero-project:master Jul 15, 2015

lxop pushed a commit to lxop/monero that referenced this pull request Jan 27, 2021

Merge pull request monero-project#337 from ChrisCharrison/release/3.1…

0b0d8b2

….0.0 Release/3.1.0.0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DB/Blockchain speed optimizations #337

DB/Blockchain speed optimizations #337

NoodleDoodleNoodleDoodleNoodleDoodleNoo commented Jul 15, 2015

DB/Blockchain speed optimizations #337

DB/Blockchain speed optimizations #337

Conversation

NoodleDoodleNoodleDoodleNoodleDoodleNoo commented Jul 15, 2015