[feature] [BIT 578] speed up metagraph storage query #933

Closed
wants to merge 44 commits

Conversation

@camfairchild (Collaborator) commented Sep 29, 2022:

This PR speeds up the storage call made by subtensor.neurons in the subtensor.use_neurons_fast function.

This feature works by bundling a nodejs binary with the polkadotjs API.
This binary is a CLI that implements the sync_and_save --filename <default:~/.bittensor/metagraph.json> --block_hash <default:latest> command.
This syncs the metagraph at the given block hash and saves it to a JSON file.
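For illustration, here is a hedged sketch of how Python code might shell out to this CLI. The subcommand and flag spellings come from the description above; the binary name ("subtensor-node-api", after the source repo) and the helper names are assumptions, not part of this PR:

```python
# Sketch only: invoking the bundled node CLI from Python.
# The "subtensor-node-api" binary name is an assumption; the
# sync_and_save subcommand and flags follow the PR description.
import subprocess
from pathlib import Path

def build_sync_command(binary="subtensor-node-api",
                       filename=str(Path.home() / ".bittensor" / "metagraph.json"),
                       block_hash="latest"):
    """Assemble the argv for a sync_and_save call."""
    return [binary, "sync_and_save",
            "--filename", filename,
            "--block_hash", block_hash]

def sync_metagraph(**kwargs):
    # check=True surfaces a non-zero exit from the binary as CalledProcessError
    subprocess.run(build_sync_command(**kwargs), check=True)
```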

The speed-up is quite significant; below is a test run of the manual sync without the fix, with the IPFS cache, and with the fix.

[image: output]

And below is the IPFS cache sync versus the manual sync (with the fix):

[image: output_cach_vs_fixed_manual]

A pro of this is that it removes the need for a centralized IPFS cache of the metagraph.

A downside of this fix is that the binaries with nodejs bundled are ~50 MB each (one for Linux, one for macOS).
There is currently no binary for Windows, but I'm not certain one should be included anyway, as we only support Linux/macOS.

Another pro of this fix is that it works on both nobunaga and nakamoto, and can be adapted to any network.
It also leaves room for adding other large substrate queries and for working further with the polkadot js API.

@camfairchild camfairchild requested review from unconst, Eugene-hu and shibshib and removed request for Eugene-hu October 3, 2022 15:21
@camfairchild camfairchild marked this pull request as ready for review October 3, 2022 15:21
setup.py — outdated review comment, resolved
Comment on lines 1554 to 1580
"""
We expect a JSON array of:
{
"uid": int,
"ip": str,
"ip_type": int,
"port": int,
"stake": str(int),
"rank": str(int),
"emission": str(int),
"incentive": str(int),
"consensus": str(int),
"trust": str(int),
"dividends": str(int),
"modality": int,
"last_update": str(int),
"version": int,
"priority": str(int),
"last_update": int,
"weights": [
[int, int],
],
"bonds": [
[int, str(int)],
],
}
"""
Contributor:
This is helpful given that you are including an external binary.

What about moving this specification to a document alongside the binary, and also translating this comment into a set of unit tests?

@eduardogr (Contributor) commented Oct 5, 2022:
Both documents, the specification and the tests, could be referenced here with links.

Contributor:
Adding unit tests for the binary file is good to:

  • localize files and ensure that you have all of them in the repo
  • write a specification
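As a hedged illustration of this suggestion (the test names and sample record are ours, not from the PR), a minimal unit test of the JSON shape could look like:

```python
# Hypothetical unit tests for the expected metagraph JSON shape.
# The field list follows the docstring spec quoted in this review thread;
# everything else here is illustrative.
REQUIRED_FIELDS = {"uid", "ip", "ip_type", "port", "stake", "rank",
                   "emission", "incentive", "consensus", "trust",
                   "dividends", "modality", "last_update", "version",
                   "priority", "weights", "bonds"}

# A minimal record that satisfies the spec, used as a stand-in for a
# record loaded from the file the binary writes.
sample = {
    "uid": 0, "ip": "0.0.0.0", "ip_type": 4, "port": 8091,
    "stake": "0", "rank": "0", "emission": "0", "incentive": "0",
    "consensus": "0", "trust": "0", "dividends": "0", "modality": 0,
    "last_update": "0", "version": 1, "priority": "0",
    "weights": [[0, 0]], "bonds": [[0, "0"]],
}

def test_required_fields_present():
    # every field named by the spec must be in the record
    assert REQUIRED_FIELDS <= set(sample)

def test_stringified_ints_parse():
    # str(int) fields must round-trip through int() without error
    for field in ("stake", "rank", "emission", "last_update"):
        assert isinstance(int(sample[field]), int)
```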

Contributor:
Agreed, @camfairchild, this should be moved to proper documentation as well. Nicely done!

@eduardogr (Contributor):

> The binaries are from https://github.com/opentensor/subtensor-node-api Not sure where to write docs about this though

If we are getting binaries from there I would suggest to comment it somewhere so we can wire the components and generate the knowledge

@camfairchild very nice job!!

@camfairchild (Collaborator, Author):

> The binaries are from https://github.com/opentensor/subtensor-node-api Not sure where to write docs about this though
>
> If we are getting binaries from there I would suggest to comment it somewhere so we can wire the components and generate the knowledge

Good idea. I'll add it in the setup.py file. Perhaps, though, we should maintain a docs website in the future.

@shibshib (Contributor):

Perhaps I misunderstood your graphs, but it looks like the IPFS cache is still a little faster despite being centralized, is that right?

Also, you have conflicts ;)

@camfairchild (Collaborator, Author):

> Perhaps I misunderstood your graphs, but it looks like the IPFS cache is still a little faster despite being centralized, is that right?
>
> Also, you have conflicts ;)

Nope, you're right. The IPFS cache is faster, but only when it's up ;)

@shibshib (Contributor):

> Nope, you're right. The IPFS cache is faster, but only when it's up ;)

ouch. 🗡️

@shibshib (Contributor):

@camfairchild why is this still marked do-not-merge? Any blockers?

@camfairchild (Collaborator, Author):

Going to refactor into an external PyPI package so the binary can be distributed more easily.

4 participants