Merkle tree structure for UTXO set #10

ignopeverell · 2016-10-31T16:50:04Z

The current Merkle tree implementation is mostly a placeholder and is going to be inadequate to handle the UTXO set tree. The following are highly desirable property:

cheap lookups, additions and deletions
efficient store and update operations in db (likely rocksdb)
simple
an immutable data structure (pruned on a later pass) may be easier to reason about

The MMR [1] [2] algorithm has been proposed and Merklix trees [3] offer interesting possibilities as well (i.e. p2p querying, sharding).

[1] https://github.com/opentimestamps/opentimestamps-server/blob/master/python-opentimestamps/opentimestamps/core/timestamp.py#L324
[2] https://github.com/opentimestamps/opentimestamps-server/blob/master/doc/merkle-mountain-range.md
[3] http://www.deadalnix.me/2016/09/29/using-merklix-tree-to-checkpoint-an-utxo-set/

@GarrickOllivander @merope07 and @apoelstra have expressed interest.

merope07 · 2016-10-31T21:11:38Z

Thanks for finding Merklix, I will study it. Something I see is the prefixes are variable-length, which means that an implementation is likely to be as complex as one of a Patricia tree. Also, I'm uncertain about serialization of Merklix trees, since the article does not go into this, although I think it's easy to do in a malleability-free way since the prefixes force the structure of the tree. Not a big deal, just something to think about.

It appears straightforward to create a sum-Merklix tree which is good.

As for the benefits, we never need deletions or proof-of-absense for standard MW operation. For the purpose of a UTXO set, we can simulate this with a "spent count" on each output which is always 1 or 0, then a proof-of-absense is simply a proof-of-presense where the spent-count reveals whether or not the output has been spent. This isn't exactly a proof-of-absense since it does not allow proving that a UTXO never existed, but it does allow to prove that a UTXO existed then was spent, which is all that's required for fraud proofs.

Both addition and in-place updates are O(ln(n)) for both MMR and Merklix.

~M

ignopeverell · 2016-10-31T22:23:19Z

One advantage I saw for Merklix trees is the possibility of sharding. Maintaining the UTXO set is not really standard MW (in the Jedusor version at least), but it's really handy when it comes to block cut-through. It should allow us to do full cut-through initial sync without too much worry and I think will also let us discard rangeproofs in the full cut-through horizon (something like head minus 1000 blocks), which is a huge win.

With all that in mind, the ability to shard that UTXO set would be pretty cool :)

apoelstra · 2016-10-31T22:39:53Z

I think because of https://www.reddit.com/r/Bitcoin/comments/4vub3y/mimblewimble_noninteractive_coinjoin_and_better/d62cux6/ committing to the UTXO set is a necessary extension to the Jedusor scheme (like, there are consensus failures if you don't).

I like the sound of sharding .. though I think if the UTXOs that users store are randomly chosen we should be able to do this with MMRs. Unsure. Maybe @merope07 knows more about this.

Discarding rangeproofs means SPV security of noninflation. I definitely think this should be a supported mode of operation but I don't think we can really get rid of the data for full validators.

merope07 · 2016-10-31T22:53:59Z

With a MMR, since you never delete data, only update, and updating always requires only the rightmost branch of the tree, I think sharding is fine. Any data you retain, you need to know enough auxilary hashes to compute the root, and these hashes are exactly the ones you need to update the data (e.g. to increment a spent-count).

It's probably more efficient to store and transmit, e.g. "all the outputs whose hashes start with 0x0e" in a Merklix tree, but I don't have a clear idea of by how much.

GarrickOllivander · 2016-11-02T07:17:57Z

I am probably missing the obvious, but how would MMR support lookup?

merope07 · 2016-11-02T13:00:00Z

You would need to maintain a mapping from UTXO to its index in the tree.

gellert-grindelwald · 2016-12-12T18:16:50Z

LMDB is a more suitable embedded-database for fast operations and incredible resiliency to corruption than RocksDB. Not sure if any of the database work has already underway, but it is definitely something worth considering.

ignopeverell added the enhancement label Oct 31, 2016

ignopeverell added enhancement help wanted and removed enhancement labels Dec 27, 2016

MoaningMyrtle mentioned this issue Jan 14, 2017

External interface of UTXO set #29

Closed

ignopeverell modified the milestone: Testnet Jun 15, 2017

ignopeverell closed this as completed Oct 10, 2017

hashmap mentioned this issue Aug 15, 2018

Transaction deserialisation allows an invalid input to aggsig::verify_single #1356

Closed

dwayneem mentioned this issue Feb 3, 2019

Pre-built grin 1.0.1 binary fails with illegal instruction vmovdqa in ra_portable_serialize() #2519

Closed

bladedoyle mentioned this issue Nov 2, 2020

API thread poor performance getting (old) block data #3483

Open

bladedoyle mentioned this issue May 18, 2021

Grin node 5.1.0 won't launch on Linux #3641

Closed

This issue was closed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Merkle tree structure for UTXO set #10

Merkle tree structure for UTXO set #10

ignopeverell commented Oct 31, 2016

merope07 commented Oct 31, 2016 •

edited

Loading

ignopeverell commented Oct 31, 2016

apoelstra commented Oct 31, 2016

merope07 commented Oct 31, 2016

GarrickOllivander commented Nov 2, 2016

merope07 commented Nov 2, 2016

gellert-grindelwald commented Dec 12, 2016

Merkle tree structure for UTXO set #10

Merkle tree structure for UTXO set #10

Comments

ignopeverell commented Oct 31, 2016

merope07 commented Oct 31, 2016 • edited Loading

ignopeverell commented Oct 31, 2016

apoelstra commented Oct 31, 2016

merope07 commented Oct 31, 2016

GarrickOllivander commented Nov 2, 2016

merope07 commented Nov 2, 2016

gellert-grindelwald commented Dec 12, 2016

merope07 commented Oct 31, 2016 •

edited

Loading