Aggregate metrics from other discovery nodes #1266
Conversation
Just did a first pass, will review again in more depth on Monday. This is so cool.
overall structure looks solid. just had a few questions and comments
def upgrade():
    op.create_table('daily_unique_users_metrics',
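The migration excerpt above only shows the table name. As a rough sketch of what the new aggregate table's shape could look like (the column set here is an assumption, not taken from the diff), expressed as plain DDL so it is easy to inspect:

```python
import sqlite3

# Hypothetical column set for the aggregate daily-unique-users table; the
# actual migration's columns are not visible in the excerpt above.
DDL = """
CREATE TABLE daily_unique_users_metrics (
    id INTEGER PRIMARY KEY,
    count INTEGER NOT NULL,
    timestamp DATE NOT NULL,
    created_at DATETIME NOT NULL,
    updated_at DATETIME NOT NULL
)
"""

# Create the table in an in-memory database and list its columns.
conn = sqlite3.connect(":memory:")
conn.execute(DDL)
cols = [row[1] for row in conn.execute("PRAGMA table_info(daily_unique_users_metrics)")]
```

In the real migration this would be written with `op.create_table(...)` and SQLAlchemy column types, as the quoted line suggests.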
We should try to distinguish the old per-node tables from the new all-discovery-node tables. If "aggregate" is the wording we're using for all the discovery nodes together, I'd suggest putting that in the table names too, because otherwise I'm sure we'll get confused between app_name_metrics and daily_app_name_metrics. Conversely, we could rename the existing app_name_metrics and route_metrics to app_name_metrics_single_node or something like that.
100%
start_time = int(start_time_obj.timestamp())
new_route_metrics, new_app_metrics = get_metrics(node, start_time)

logger.info(f"received route metrics: {new_route_metrics}")
this will probably be a massive object to print, do we want to output this?
good point
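One way to act on this feedback is to log a compact summary instead of the full payload. This is a hedged sketch, not the PR's code; it assumes (hypothetically) that `new_route_metrics` is a list of dicts each carrying a `count`:

```python
# Sketch: log a summary of the metrics payload rather than the whole object.
# The shape of new_route_metrics (list of dicts with a "count" key) is an
# assumption for illustration only.
def summarize_route_metrics(new_route_metrics):
    total = sum(m.get("count", 0) for m in new_route_metrics)
    return f"received {len(new_route_metrics)} route metrics (total count {total})"
```

Then `logger.info(summarize_route_metrics(new_route_metrics))` keeps the log line small no matter how many records come back.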
num_discovery_providers = sp_factory_inst.functions.getTotalServiceTypeProviders(discovery_node_service_type).call()
logger.info(f"number of discovery providers: {num_discovery_providers}")
service_infos = [sp_factory_inst.functions.getServiceEndpointInfo(discovery_node_service_type, i).call()
                 for i in range(1, num_discovery_providers + 1)]
just an optimization, don't need to add this, but if performance is/was an issue we can use threadpoolexecutors to parallelize this like https://github.com/AudiusProject/audius-protocol/blob/master/discovery-provider/src/tasks/index_network_peers.py#L66-L85
i think doing this synchronously is probably better
I like it!
update_app_metrics_count(monthly_app_metrics, historical_metrics['apps']['monthly'])

logger.info("synchronizing historical metrics")
logger.info(f"daily historical route metrics to update: {daily_route_metrics}")
same thing about printing large objects here, do we want this here?
These shouldn't be super large though, at least for the route metrics: daily should be about 30 records, each with a unique and total count, and monthly would be (12 * num_total_years) records. For apps it's harder to predict, because the number of apps used in a given day/month can vary.
But we can surely pull it for now until we see a real need for it.
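For context, the quoted `update_app_metrics_count` call merges historical per-day app counts into the freshly aggregated map. Its body is not shown in this excerpt, so the following is only a hedged sketch of what such a merge could look like, with an assumed `{day: {app_name: count}}` shape:

```python
# Hypothetical merge of historical app metrics into the aggregated map.
# The dict shapes here are assumptions; the PR's actual helper may differ.
def update_app_metrics_count(app_metrics, historical):
    # historical: {day: {app_name: count}}, e.g. historical_metrics['apps']['daily']
    for day, apps in historical.items():
        day_totals = app_metrics.setdefault(day, {})
        for app_name, count in apps.items():
            day_totals[app_name] = day_totals.get(app_name, 0) + count
```

This keeps the merge idempotent per input batch: each historical record simply adds its count into the matching day/app bucket.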
Great work @sddioulde! I love how all of this is net new too
all_other_nodes = []

# fetch all discovery nodes info in parallel
with concurrent.futures.ThreadPoolExecutor(max_workers=5) as executor:
nice 😃
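The parallel-fetch pattern quoted above can be sketched in isolation. This is a minimal stand-alone version, with a placeholder `fetch_node_info` (hypothetical, not the PR's function) in place of the real chain call:

```python
import concurrent.futures

def fetch_node_info(node_id):
    # Placeholder for the real per-node lookup, e.g.
    # getServiceEndpointInfo(discovery_node_service_type, i).call()
    return {"id": node_id, "endpoint": f"https://node-{node_id}.example.com"}

def fetch_all_nodes(node_ids, max_workers=5):
    # executor.map preserves input order, so results line up with node_ids
    with concurrent.futures.ThreadPoolExecutor(max_workers=max_workers) as executor:
        return list(executor.map(fetch_node_info, node_ids))
```

Because the per-node calls are network-bound, a small thread pool like this gives near-linear speedup without the complexity of async code.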
mergeeeeeeeeeeeeeeeeeeeee!!!!!!
Merged :)
Description
We want each discovery node to give us aggregated metrics across all discovery nodes, so that obtaining the metrics does not depend on all nodes being up.
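The core aggregation step can be illustrated with a short sketch (not the PR's actual code): each node reports its own daily counts, and any single node can sum them into a network-wide view. The `{day: count}` shape is an assumption for illustration:

```python
from collections import Counter

def aggregate_daily_counts(per_node_metrics):
    # per_node_metrics: list of {day: count} dicts, one per discovery node
    # (hypothetical shape). Counter.update adds counts for matching days.
    total = Counter()
    for node_metrics in per_node_metrics:
        total.update(node_metrics)
    return dict(total)
```

With this, a client only needs to reach one healthy node to get metrics for the whole network, which is exactly the availability property the description calls for.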
Tests
... also working on adding unit tests
Clients for the metrics data (e.g. the dashboard) will need to update the endpoints they make requests to in order to use the new aggregated metrics.