add new function Walk() to trie #146

p4u · 2020-10-29T10:05:47Z

Walk allows iteration over all values stored in the SMT.

This is useful for exporting the tree (and importing it afterwards) and for searching for specific values.

p4u · 2020-10-30T15:19:49Z

This is not 100% correct... Sometimes the test will fail and I don't know why (yet).

Maybe someone with more knowledge on this SMT implementation can find the problem.

p4u · 2020-10-30T15:39:21Z

So with this last implementation, go test -timeout 30s . -run ^TestTrieWalk$ -count=1 always works, but -count=10 crashes... I'm not sure whether this comes from my changes or from something wrong in trie_test.go.

Walk allows iteration over all key/values stored into the SMT. If the callback function returns true, walk will stop. Signed-off-by: p4u <pau@dabax.net>

Signed-off-by: p4u <pau@dabax.net>

paouvrard · 2020-11-02T01:55:20Z

Thank you @p4u for your input. Have you checked this repository, which exports an Aergo state snapshot?
https://github.com/aergoio/state-tools

p4u · 2020-11-02T08:06:34Z

I did not know about the existence of that repository, I'll take a look.

However, from the perspective of an SMT API consumer who has used several Go Merkle tree implementations for blockchain development, I think the following methods are missing:

A Walk() as proposed here
A Snapshot(root) in order to make an immutable tree (one that will not change over time)
Get and Walk should allow specifying a root, something like smt.Get(key, root []byte)
A Count(root) method in order to get the number of leaves
A way to export and import the tree so that it generates exactly the same Root hash

Does it make sense to you?
To this end, some functions from the state-tools repository might be added to the SMT trie package. I think it would help the adoption of this trie implementation by other open source blockchain projects.

This is our interface and our current implementations (where I want to add Aergo SMT): https://gitlab.com/vocdoni/go-dvote/-/tree/master/statedb
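As a rough illustration of the methods listed above, here is a minimal sketch of what such an interface could look like. Every name and signature in it (RootedStore, memStore, errUnknownRoot) is hypothetical and does not come from the Aergo trie package or from the linked statedb interface. The in-memory implementation is only a stand-in to show the intended semantics; export and import are left out.

```go
package main

import "errors"

// RootedStore is a hypothetical interface collecting some of the methods
// proposed in this thread. None of these signatures exist in the trie package.
type RootedStore interface {
	// Get returns the value stored under key in the tree identified by root.
	Get(root, key []byte) ([]byte, error)
	// Walk calls fn for each key/value pair; it stops when fn returns true.
	Walk(root []byte, fn func(key, value []byte) bool) error
	// Count returns the number of leaves under root.
	Count(root []byte) (int, error)
	// Snapshot returns a store whose content no longer changes.
	Snapshot(root []byte) (RootedStore, error)
}

var errUnknownRoot = errors.New("unknown root")

// memStore is a toy stand-in: each root maps to its own key/value set.
type memStore struct {
	versions map[string]map[string][]byte
}

// Compile-time check that memStore satisfies the interface.
var _ RootedStore = (*memStore)(nil)

func (m *memStore) Get(root, key []byte) ([]byte, error) {
	kv, ok := m.versions[string(root)]
	if !ok {
		return nil, errUnknownRoot
	}
	return kv[string(key)], nil // nil value when the key is absent
}

func (m *memStore) Walk(root []byte, fn func(key, value []byte) bool) error {
	kv, ok := m.versions[string(root)]
	if !ok {
		return errUnknownRoot
	}
	// Map iteration order is random; a real trie would walk in tree order.
	for k, v := range kv {
		if fn([]byte(k), v) {
			break
		}
	}
	return nil
}

func (m *memStore) Count(root []byte) (int, error) {
	n := 0
	err := m.Walk(root, func(_, _ []byte) bool { n++; return false })
	return n, err
}

func (m *memStore) Snapshot(root []byte) (RootedStore, error) {
	kv, ok := m.versions[string(root)]
	if !ok {
		return nil, errUnknownRoot
	}
	// Deep-copy so later writes to the source do not affect the snapshot.
	cp := make(map[string][]byte, len(kv))
	for k, v := range kv {
		cp[k] = append([]byte(nil), v...)
	}
	return &memStore{versions: map[string]map[string][]byte{string(root): cp}}, nil
}
```

In this sketch Count is built on top of Walk, which is one reason a Walk primitive is useful: other helpers can be layered on it without touching the tree internals.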

paouvrard · 2020-11-03T02:57:39Z

Definitely makes sense.

I know some projects used the same trie but usually adapt it to their needs. So if there is consensus on an interface/usage that is useful to many, I'm happy to add features. We might have to refactor outside of the Aergo client code.
The interface you linked mentions Iterate. Is that the same as Walk, but for iterating from a subtree?

There is also the new approach in https://github.com/ledgerwatch/turbo-geth, which stores the leaves directly instead of querying from the root in O(log N), and reconstructs the root when a leaf is updated. It brings performance benefits but changes how we think about the trie (we need to keep track of the current version of the trie).

paouvrard

Thank you for your contribution! Nice work navigating the trie serialization 👍
Please see comments in the code for solving the random bug.

paouvrard · 2020-11-27T02:12:16Z

pkg/trie/trie_test.go

+	// Walk over the whole tree and compare the values
+	i := 0
+	if err := smt.Walk(func(v *WalkResult) bool {
+		if string(v.Value) != string(values[i]) {


Byte slices can be compared with bytes.Equal().
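A minimal sketch of that suggestion, assuming both values are []byte as in the test above. The helper name sameValue is hypothetical and only wraps the standard library call.

```go
package main

import "bytes"

// sameValue reports whether two byte slices hold identical contents.
// bytes.Equal avoids the string conversions used in the original test
// and treats a nil slice and an empty slice as equal.
func sameValue(got, want []byte) bool {
	return bytes.Equal(got, want)
}
```

In the test, the condition would then read something like !bytes.Equal(v.Value, values[i]).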

paouvrard · 2020-11-27T02:36:09Z

pkg/trie/trie_tools.go

+		return err
+	}
+	if isShortcut {
+		var key []byte


The key can be accessed directly as:
lnode[:HashLength]
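A small sketch of that slicing, under two assumptions that are not confirmed by this thread: that HashLength is 32, and that the left child of a shortcut node holds the key in its first HashLength bytes followed by extra bytes such as a flag. The helper name shortcutKey is hypothetical.

```go
package main

// HashLength is assumed to be 32 here; the real constant lives in the
// trie package.
const HashLength = 32

// shortcutKey returns the key stored at the start of a shortcut node's
// left child. Assumption: the key occupies the first HashLength bytes.
// The result aliases lnode, so copy it if the caller keeps or modifies it.
func shortcutKey(lnode []byte) []byte {
	if len(lnode) < HashLength {
		return nil // malformed node: too short to contain a key
	}
	return lnode[:HashLength]
}
```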

paouvrard · 2020-11-27T02:40:51Z

pkg/trie/trie_tools.go

+// walk fetches the value of a key given a trie root
+func (s *Trie) walk(walkc chan (*WalkResult), stop *bool, root []byte, batch [][]byte, iiBatch, height int) error {
+	if len(root) == 0 || *stop {
+		// the trie does not contain the key or stop bool is set tu true


// the sub tree is empty or stop walking

paouvrard · 2020-11-27T02:41:01Z

pkg/trie/trie_tools.go

+		return nil
+	}
+	// Fetch the children of the node
+	nbatch, iBatch, lnode, rnode, isShortcut, err := s.loadChildren(root, height, iiBatch, batch)


For consistency, nbatch and iiBatch can be renamed to batch and iBatch.

paouvrard · 2020-11-27T02:46:52Z

pkg/trie/trie_tools.go

+		for {
+			select {
+			case <-close:
+				break


I tracked down the random bug you mentioned to this function. There are cases where Walk() finishes iterating the tree and returns while the callback hasn't finished executing. So when Walk() is called twice in a row, the i++ from the first walk can overwrite the i = 1 reset for the second walk.

To prevent this, you need confirmation that the goroutine has finished executing before Walk() returns, by using a blocking channel that waits for the goroutine to return for example.

Also, this break should be a return; otherwise only the select is exited and not the for loop.
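The difference between break and return inside a select can be seen in isolation with the sketch below. The function names and the iteration limit are hypothetical and exist only to make the behaviour observable; both functions expect a channel that is already closed.

```go
package main

// drainWithBreak shows the pitfall: break inside select leaves only the
// select statement, so the surrounding for loop keeps running. The loop
// ends here only because of the explicit iteration limit.
func drainWithBreak(done <-chan struct{}, limit int) (iterations int) {
	for iterations < limit {
		iterations++
		select {
		case <-done:
			break // exits the select, not the for loop
		}
	}
	return iterations
}

// drainWithReturn leaves the function as soon as done is readable.
func drainWithReturn(done <-chan struct{}, limit int) (iterations int) {
	for iterations < limit {
		iterations++
		select {
		case <-done:
			return iterations // ends the loop by leaving the function
		}
	}
	return iterations
}
```

With a closed channel and a limit of 5, the first function runs all five iterations while the second returns after one. A labeled break on the for loop would be another way to get the second behaviour.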

Using WaitGroup doesn't seem necessary here.

The close channel should be renamed, since close is a Go builtin.

For example:

s.lock.RLock()
defer s.lock.RUnlock()
s.atomicUpdate = false
walkc := make(chan *WalkResult)
exit := make(chan (bool))
finishedWalk := make(chan (bool))
stop := false
go func() {
	for {
		select {
		case <-finishedWalk:
			exit <- true
			return
		case value := <-walkc:
			if stop = callback(value); stop {
				// break and loop to case <- finishedWalk
				break
			}
		}
	}
}()
err := s.walk(walkc, &stop, s.Root, nil, 0, s.TrieHeight)
finishedWalk <- true
<-exit // wait for goroutine to return
return err
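The same idea, that Walk() must not return before the callback goroutine has finished, can be tried out in a self-contained form. The sketch below is not the code from this PR: it replaces the trie with a plain slice, and the names walkValues, items, stop and done are hypothetical. It also uses a separate stop channel instead of a shared bool, which is one possible way to avoid a data race on the stop flag.

```go
package main

// walkValues sends each value to a consumer goroutine that runs callback,
// and returns only after that goroutine has exited. If callback returns
// true, the walk stops early.
func walkValues(values [][]byte, callback func(v []byte) bool) {
	items := make(chan []byte)
	stop := make(chan struct{}) // closed by the consumer to request an early stop
	done := make(chan struct{}) // closed once the consumer goroutine has returned

	go func() {
		defer close(done)
		for v := range items {
			if callback(v) {
				close(stop)
				return
			}
		}
	}()

producer:
	for _, v := range values {
		select {
		case items <- v:
		case <-stop:
			break producer // labeled break: leaves the for loop, not just the select
		}
	}
	close(items)
	<-done // every callback invocation has completed at this point
}
```

Because the caller blocks on done, a counter updated inside the callback is safe to read or reset as soon as walkValues returns, which is the property the failing test depended on.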

p4u · 2020-12-10T18:51:21Z

Hey @paouvrard I have been out for some days but now I'm back. Thank you for your review, I am going to incorporate the changes.

p4u · 2020-12-10T19:55:54Z

I applied your suggestions and now go test -timeout 30s . -run=^TestTrieWalk$ -count=100 works fine, thank you!

I added another function named GetWithRoot in order to Get a value for a specific Root. In addition, I added the root as a parameter to Walk() so the caller can choose which Root to walk.

I kept everything in the same PR because I did not split it in my local repository. I hope that's not a problem.

Looking forward to your review :)

p4u · 2020-12-10T20:00:54Z

Damn it, go test -timeout 30s . -count=100 fails... I'll look into it.

p4u · 2020-12-11T08:50:07Z

Fixed! Now go test -timeout 120s . -count=100 works for me.

Now Walk() is data race safe and does not return until all callbacks have finished. Signed-off-by: p4u <pau@dabax.net>

p4u · 2020-12-12T18:25:14Z

The last version solves concurrency problems and it is data race safe. Tested with go test -timeout 120s . -run ^TestTrieWalk$ -count=100 -race

p4u · 2020-12-12T19:02:43Z

If this PR finally gets merged, the next step could be to add String(), Export() and Import() functions, which IMO would make the API more powerful. Something like:

func (t *trie) String() string {
	buf := bytes.Buffer{}
	t.Walk(nil, func(k, v []byte) int32 {
		buf.WriteString(fmt.Sprintf("%x => %x\n", k, v))
		return 0
	})
	return buf.String()
}

p4u · 2021-03-27T17:39:40Z

So I see there is probably no interest in having these new methods in the SMT. That's fine. I forked the code and applied some changes (including a new abstraction layer). If anyone is interested, the code is here: https://github.com/p4u/asmt

kroggen · 2023-05-04T17:50:23Z

The aergoio/SMT repo appears to be better for this

The aergoio/state-tools is also related

kroggen requested a review from paouvrard October 30, 2020 03:38

p4u force-pushed the develop branch from ba71d81 to b126a7e Compare October 30, 2020 15:38

p4u force-pushed the develop branch from b126a7e to f40d923 Compare October 30, 2020 18:48

p4u added 2 commits October 30, 2020 19:48

add new function Walk() to trie

f40d923

allow Get and Walk on a specific Root hash

c186626

paouvrard reviewed Nov 27, 2020

p4u force-pushed the develop branch from 06bb134 to 89ee6db Compare December 11, 2020 08:49

fix Walk() test small improvements

61ce780

p4u force-pushed the develop branch from 89ee6db to 61ce780 Compare December 12, 2020 18:23

kroggen mentioned this pull request May 4, 2023

add new function Walk() to trie aergoio/SMT#4

Open

kroggen closed this May 4, 2023
