Apex merkle commitment #562

Merged 1 commit on Apr 16, 2019
Conversation

@porcuquine (Collaborator) commented Mar 18, 2019

This PR implements a component we can use to avoid rehashing the top (apex) part of a merkle tree repeatedly when performing many inclusion proofs. I will write up the larger plan (which @nicola and I have been discussing in detail since Friday) separately.

I'm submitting this PR now because what it contains is independently meaningful and can be reviewed on its own, even though it is not yet used in our proofs. We can iterate on the interface, etc. if necessary when we get to the full optimization.

The idea here is that we can create a vector commitment to a power-of-2 number of Fr values and hash them together to form a single root more cheaply than by creating a complete binary merkle tree. (If/when we have more generators, we can do this in fewer Pedersen hashes. For now, we still have to use Merkle-Damgård compression.)
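(Purely as an illustration, not code from this PR: one way to picture that Merkle-Damgård-style chaining. The fold_root helper is hypothetical, and the compress closure is a placeholder for the Pedersen hash used in this codebase.)

/// Illustrative sketch only: fold a power-of-2 number of committed values into a
/// single root by Merkle-Damgard-style chaining. `compress` stands in for a real
/// 2-to-1 compression function (Pedersen hashing here).
fn fold_root<T: Copy>(values: &[T], compress: impl Fn(T, T) -> T) -> T {
    assert!(values.len().is_power_of_two());
    // Chain left to right: root = H(...H(H(v0, v1), v2)..., v_{n-1}),
    // which costs (number of values - 1) compressions.
    values[1..].iter().fold(values[0], |acc, v| compress(acc, *v))
}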

If we implemented an equivalent vector commitment using a binary merkle tree with height L and size S (number of committed values = 2^L - 1), we would need S - 1 hashes outside the circuit to construct it, and L - 1 hashes inside the circuit to prove inclusion.

Instead, we perform S - 1 hashes outside the circuit and the same S - 1 hashes once inside the circuit (shared across all inclusion proofs), with an additional (2^L + L) - 1 constraints per inclusion proof. Since each Pedersen hash uses more than 1,000 constraints, this is a savings for L up to 10.
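(A rough, illustrative tabulation of those numbers, not code from this PR, assuming the ~1152-constraints-per-Pedersen-hash figure quoted later in the review. For each L it prints the per-proof selection cost (2^L + L) - 1, the cost of the L path hashes that selection replaces, and 2^L, roughly the cost of the widest apex row, which is the quantity the review comment below weighs against the cost of one hash.)

// Illustrative arithmetic only, assuming ~1152 constraints per 32-byte Pedersen hash.
const PEDERSEN_CONSTRAINTS: u64 = 1152;

fn main() {
    for l in 1u32..=12 {
        let apex_per_proof = (1u64 << l) + l as u64 - 1; // (2^L + L) - 1 selection constraints
        let path_per_proof = l as u64 * PEDERSEN_CONSTRAINTS; // L path hashes replaced
        let widest_row = 1u64 << l; // rough cost of the widest apex row, vs. one hash
        println!(
            "L = {:2}: apex {:>5} vs path {:>5} constraints; widest row {:>5} (one hash ~ {})",
            l, apex_per_proof, path_per_proof, widest_row, PEDERSEN_CONSTRAINTS
        );
    }
}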

In order to make this work, we use num::AllocatedNum::conditionally_reverse, which we repurpose to select one of two alternative allocated numbers at the cost of only two constraints.
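(A minimal sketch of that trick, assuming the sapling-crypto / bellman gadget API of that era; pick_num is a hypothetical helper name, not necessarily what the PR itself defines.)

use bellman::{ConstraintSystem, SynthesisError};
use pairing::Engine;
use sapling_crypto::circuit::boolean::Boolean;
use sapling_crypto::circuit::num::AllocatedNum;

/// Hypothetical helper: select `b` when `condition` is true, otherwise `a`,
/// by reusing `conditionally_reverse` and keeping only its first output.
/// The swap itself costs two constraints; nothing extra is added here.
fn pick_num<E: Engine, CS: ConstraintSystem<E>>(
    mut cs: CS,
    condition: &Boolean,
    a: &AllocatedNum<E>,
    b: &AllocatedNum<E>,
) -> Result<AllocatedNum<E>, SynthesisError> {
    // `conditionally_reverse` returns (a, b) when the condition is false
    // and (b, a) when it is true, so the first element is the selected value.
    let (selected, _other) =
        AllocatedNum::conditionally_reverse(cs.namespace(|| "pick"), a, b, condition)?;
    Ok(selected)
}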

I ended up implementing this in two different ways in order to verify the approach. FlatApexCommitment will be more efficient at synthesis time than BinaryApexCommitment. It could be refined further if need be.

If anyone knows of a way to verify positional inclusion in fewer constraints, we could also use that. For now, this stands as the best candidate.

See more design detail in #563.

@nicola (Contributor) commented Mar 18, 2019

Can you attach a very brief spec/intuition doc of what is happening?

porcuquine mentioned this pull request on Mar 18, 2019
@porcuquine (Collaborator, Author)

@nicola I wrote something very brief in #563. See if that helps you get the idea.

@porcuquine (Collaborator, Author)

@nicola @dignifiedquire

It seems that we will be able to use the fantastic optimization of challenge bucketing, with each apex leaf representing a bucket. This has some wrinkles to sort out, but it also means we will not need this code now.

However, since this is a standalone implementation and has some value to our proving toolkit, even if only informational, I'd like to merge it so it's not lost.

@porcuquine (Collaborator, Author)

Unfortunately, it seems unlikely that we can make challenge bucketing work for most inclusion proofs, since parents (which represent most inclusion proofs) are random by their nature. We could still eliminate the need for this code for data and replica inclusion proofs, but some variation would be needed for the parents.

Also note that a distinct apex is required for each merkle tree. Since committing to the apex is somewhat expensive, we should avoid repeating this unnecessarily in each partition. Although we cannot get rid of the need to commit to each apex once, we can reduce the cost of duplication by segregating partitions by layer. For example, the last (largest, with tapering) layer might have its own partition.

This would require distinct circuits for each such group of layers, but would avoid the overhead of paying setup costs for every layer in every partition.

dignifiedquire previously approved these changes Apr 16, 2019

@dignifiedquire (Contributor) left a comment


one small change requested; other than that, meeeerge it

* This sets an upper bound on the size of the apex. When the cost of including another row in the apex
* exceeds the cost (in constraints) of one hash, there's no potential savings. (Reference: 1 32-byte pedersen hash
* requires ~1152 constraints).
*/

please use module level comments //!
