Reduce Zebra disk usage for mining pools #5718

teor2345 · 2022-11-25T02:11:58Z

Motivation

Some mining pools have asked us to reduce Zebra's disk usage.

Alternative Designs

Here are some different things we could try, in rough order of effort/disruptiveness:

Stop storing duplicate state data
- Store only the first tree state in each series of identical tree states #4784
- Change the finalized sprout_note_commitment_tree to a key of () and a value of sprout::Root. Look up the actual note commitment tree in sprout_anchors.
Improve database compression using:
- a different level 0 compression algorithm, like zstd
- the maximum compression rate
- this probably doesn't need a state version change, but:
  - old states will have less compression, and
  - old versions of Zebra might not be able to open new states, if they don't have all the algorithms we're using

We might want to delay this work until after the audit, because it could change a lot of code:

Add a config to Zebra that doesn't create unused indexes:
- delete balance_by_transparent_addr
- delete tx_loc_by_transparent_addr_loc
- delete utxo_loc_by_transparent_addr_loc
- delete sprout_note_commitment_tree lower than the finalized tip
- delete sapling_note_commitment_tree lower than the finalized tip
- delete orchard_note_commitment_tree lower than the finalized tip
- delete history_tree lower than the finalized tip
- This will cause errors in RPCs that use these indexes, but that's ok if they aren't called
Add a config to Zebra that deletes blocks below finalized tip - how far we look back to check for legacy chains:
- block_header_by_height
- tx_by_loc
- maybe hash_by_height
- maybe height_by_hash
- maybe hash_by_tx_loc
- maybe tx_loc_by_hash
- This could cause a lot of errors, we should try a quick and dirty implementation first

The text was updated successfully, but these errors were encountered:

teor2345 added C-bug Category: This is a bug S-needs-triage Status: A bug report needs triage I-heavy Problems with excessive memory, disk, or CPU usage P-Optional ✨ A-rpc Area: Remote Procedure Call interfaces A-state Area: State / database changes labels Nov 25, 2022

mpguerra mentioned this issue Aug 22, 2023

Tracking: Official support for mining RPCs in Zebra #7366

Closed

17 tasks

teor2345 mentioned this issue Aug 29, 2023

diagnostic: Log column family and database size on startup and shutdown #7416

Open

mpguerra removed the P-Optional ✨ label Jan 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reduce Zebra disk usage for mining pools #5718

Reduce Zebra disk usage for mining pools #5718

teor2345 commented Nov 25, 2022

Reduce Zebra disk usage for mining pools #5718

Reduce Zebra disk usage for mining pools #5718

Comments

teor2345 commented Nov 25, 2022

Motivation

Alternative Designs