Move HexaryTrieSync from py-trie here, and make it async #1124

gsalgado · 2018-07-31T14:07:44Z

Closes: #1074

gsalgado · 2018-08-06T06:26:39Z

@pipermerriam I think this may have slipped under your radar?

pipermerriam

Maybe rename trinity/sync/full/trie.py to be hexary_trie since we kind of know there is going to be a BinaryTrie making it's way into here at some point.

pipermerriam · 2018-08-06T16:14:00Z

tests/trinity/core/p2p-proto/test_state_sync.py

+        for key, value in contents.items():
+            assert dest_trie[key] == value
+
+    asyncio.get_event_loop().run_until_complete(_test_trie_sync())


Should we instead use the event_loop fixture provided by pytest? I know these are supposed to be equivalent during runtime, but it seems slightly more correct to use the fixture version of this.

pipermerriam

I'd like to hear your thoughts on the proposed SyncResponse pattern, or any alternate ideas you might have.

pipermerriam · 2018-08-06T16:21:34Z

trinity/sync/full/trie.py

+                       is_raw: bool = False) -> None:
+        """Schedule a request for the node with the given key."""
+        if node_key in self._existing_nodes:
+            self.logger.debug("Node %s already exists in db" % encode_hex(node_key))


Logging statements should not do string formatting. Can you change to self.logger.debug("the %s message", encode_hex(nodekey))

pipermerriam · 2018-08-06T16:21:57Z

trinity/sync/full/trie.py

+            return
+        if await self.db.coro_exists(node_key):
+            self._existing_nodes.add(node_key)
+            self.logger.debug("Node %s already exists in db" % encode_hex(node_key))


Same here, drop the string formatting and just provide the hex encoded node_key as a positional argument.

pipermerriam · 2018-08-06T16:26:32Z

trinity/sync/full/trie.py

+        :rtype: A two-tuple with one list containing the children that reference other nodes and
+        another containing the leaf children.
+        """
+        node = decode_node(request.data)


This stands out as a potential source of trouble. request.data can be None in cases where the SyncRequest has not been processed. I trust that this code path is only called after the data attribute has been populated, but the implicitness of this approach makes me think we should look for an alternate pattern, maybe something like a SyncResponse object which takes the request and the data as an argument and exposes response.data which is always guaranteed to be populated.

get_children() is called precisely to find the children of the node we're processing, and process() does set request.data immediately before calling it. I guess if I inlined this method into process() it wouldn't raise a red flag, and we wouldn't make process() any more complex or less readable as this is a two-line method that just decodes the request data and passes it to _get_children()

👍 if it gets rid of the mutation of the request object and thus the implicit availability of the request.data attribute.

In order to completely get rid of the mutation of request.data I'll have to get rid of the check in process() that raises SyncRequestAlreadyProcessed when request.data is not None, but that is useless anyway as it is caught and ignored in StateDownloader

Hmm, spoke too soon. I thought I'd be able to have process() pass the data to commit() but that doesn't work because we don't always commit at the end of process() -- we only commit once all the child nodes have been retrieved and committed. IOW, we need to keep (somewhere) the data received for every request, until that request is eventually committed. I could get rid of request.data, but then I'd have to store the data for every request in an instance variable of HexaryTrieSync. The SyncResponse you suggest would also have to be stored somewhere, and since it's disjoint from the actual request I'm not sure it gives us any guarantees really -- we could still try to commit a request for which there's no SyncResponse in the same way we can try to commit one which has .data set to None

If you can't find a way to do it that seems good, my fallback request would be to add some comments to document this deficiency. Given that if request.data is None the two methods that access it will very likely fail loudly I'm ok with it staying in.

Ok, I've added a docstring to commit(), and the other method is gone as there was no point in having it anyway

pipermerriam · 2018-08-06T16:28:25Z

trinity/sync/full/trie.py

+                await self.commit(request)
+                continue
+
+            references, leaves = self.get_children(request)


Related to my other comment about request.data being potentially problematic. At this call-site we'd instead pass in a response object so that we have a guarantee that the data attribute is present.

pipermerriam · 2018-08-06T16:29:17Z

trinity/sync/full/trie.py

+            if request.dependencies == 0:
+                await self.commit(request)
+
+    async def commit(self, request: SyncRequest) -> None:


This function would also change to take a SyncResponse object.

gsalgado · 2018-08-07T08:48:48Z

@pipermerriam I think I've addressed all your comments. wanna have another look?

pipermerriam · 2018-08-07T13:23:40Z