api: Reduce amount of updates done by DB metrics logic #1179
Conversation
Force-pushed from 624de1b to 637b9ef
Codecov Report

```
@@             Coverage Diff              @@
##              master      #1179     +/- ##
============================================
+ Coverage   50.39654%  50.51423%  +0.11768%
============================================
  Files             66         66
  Lines           4161       4181        +20
  Branches         736        740         +4
============================================
+ Hits            2097       2112        +15
- Misses          1816       1820         +4
- Partials         248        249         +1
```
Continue to review full report at Codecov.
First steps towards creating the flush logic on shutdown.
Will be called on clean up.
Force-pushed from a51b716 to 94d59cc
Force-pushed from 42e06f6 to c63a23e
These update some other properties in the stream, like `isActive`, etc., so we don't want to change those. Just make sure that we do flush the updates before deleting the entries.
Apparently the primary key is not really suitable for that; just changing it makes some queries much faster. This is unrelated to the change here, I'm just including it because I'm already touching the context.
Avoids a round-trip and a query in the DB.
```ts
q.append(sql`) WHERE id = ${id}`);
q.append(`)`);
if (set) {
  q.append(sql` || ${JSON.stringify(set)}`);
```
The thinking here is just to do as much as possible in one query?
Yeah exactly! This small change also cuts the number of transactions/updates in half. It was somewhat necessary, since we didn't get as much of a reduction from buffering the updates alone: I expected 30x in the best case, but it was more like ~8x instead. With this we get ~16x fewer transactions, which I think will be enough for now.
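A rough sketch of the "as much as possible in one query" idea: the optional `set` fields get merged into the same `UPDATE` that applies the counter, using JSONB concatenation (`||`), instead of issuing a second `UPDATE`. The query builder below is a toy stand-in for illustration only, not the repo's actual `sql` helper, and the column/table names are assumptions.

```typescript
// Build one parameterized UPDATE that both bumps a counter stored in the
// JSONB `data` column and (optionally) merges extra `set` fields into it.
type Query = { text: string; values: unknown[] };

function buildUpdate(
  id: string,
  addedSegments: number,
  set?: Record<string, unknown>
): Query {
  const values: unknown[] = [addedSegments];
  // Increment the buffered segment count inside the JSONB column.
  let data =
    `jsonb_set(data, '{sourceSegments}', ` +
    `to_jsonb((data->>'sourceSegments')::int + $1))`;
  if (set) {
    // `||` merges the buffered `set` fields into the same JSONB value,
    // so add + set happen in a single statement and round-trip.
    values.push(JSON.stringify(set));
    data += ` || $${values.length}`;
  }
  values.push(id);
  return {
    text: `UPDATE stream SET data = ${data} WHERE id = $${values.length}`,
    values,
  };
}
```

With a `set`, this produces one statement ending in `... || $2 WHERE id = $3`; without one, the `||` clause is simply omitted.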
This reverts commit 0d6d759.
```ts
@@ -15,7 +15,7 @@ import {
  FieldSpec,
} from "./types";

const DEFAULT_SORT = "id ASC";
```
FTR: This was not really worth it. It (luckily) broke tests, and when I went to do some further testing I noticed it's not really making queries faster (sometimes they're worse, because not all objects have an indexed `data->>'id'`). Let's stick to just the `id` column.
LGTM
What does this pull request do? Explain your changes. (required)
This is to reduce the amount of `UPDATE` queries that we do in our database. These queries are performed mostly for some "metric" functionalities that we created on top of it, mainly:

- ~100k/hour: The logic to track the `lastSeen` timestamp of both users and API keys (but especially API keys).
- ~600k/hour: The logic to track the amount of transcoded segments that have been streamed for a given Stream object.

The goal of this pull request is to fix both of those update logics by creating some "buffer" in memory and then combining the updates done on the database.

The expectation is that this will also fix a couple of issues we've been having with our databases regarding replication lag every time a VACUUM operation is run on the `streams` table. By doing 30x fewer updates on it we'll hopefully be able to tune it better and avoid disruptions in replication.
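The buffering idea described above can be sketched roughly as follows: accumulate per-stream segment counts in memory and flush the combined totals on a timer, so many observations become a single database update. The `MetricsBuffer` name and `flushFn` callback are illustrative assumptions, not the actual implementation in this PR.

```typescript
// Callback that receives the combined per-stream totals to persist.
type FlushFn = (updates: Map<string, number>) => Promise<void>;

class MetricsBuffer {
  private pending = new Map<string, number>();
  private timer: ReturnType<typeof setInterval>;

  constructor(private flushFn: FlushFn, intervalMs = 60_000) {
    // Flush the accumulated totals periodically (every 60s by default).
    this.timer = setInterval(() => void this.flush(), intervalMs);
  }

  // Called on every transcoded segment; only touches memory.
  add(streamId: string, segments: number) {
    this.pending.set(streamId, (this.pending.get(streamId) ?? 0) + segments);
  }

  // Sends one combined update per stream, then resets the buffer.
  async flush() {
    if (this.pending.size === 0) return;
    const batch = this.pending;
    this.pending = new Map();
    await this.flushFn(batch);
  }

  // Stop the timer and flush what's left (e.g. from a shutdown handler).
  async close() {
    clearInterval(this.timer);
    await this.flush();
  }
}
```

The key property is that `add` never touches the database, so the update rate is bounded by the flush interval rather than the observation rate.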
Specific updates (required)

- Make `stream-info-service` not send updates for every transcoded segment. Keep them in memory instead and only flush each record every 60 seconds.
- Make `tracking` not send a transaction on the database on every observation of the API key. Instead, keep the last seen value in memory and flush every 60s as well.
- Flush any pending updates on shutdown (`SIGTERM` handler).
- Combine `add` and `set` into a single query on `stream-info-service`.
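The shutdown flush mentioned above could look roughly like the sketch below: run the pending-buffer flush when the process is asked to terminate, bounded by a timeout so a stuck database can't block termination forever. The `flush` callback and timeout value are assumptions standing in for whatever drains the `lastSeen` / segment-count buffers in the services.

```typescript
// Run the flush, racing it against a timeout so shutdown can't hang.
// Resolves true if the flush completed, false if the timeout won.
async function gracefulShutdown(
  flush: () => Promise<void>,
  timeoutMs = 5000
): Promise<boolean> {
  const timeout = new Promise<boolean>((resolve) =>
    setTimeout(() => resolve(false), timeoutMs)
  );
  const flushed = flush().then(() => true);
  return Promise.race([flushed, timeout]);
}

// A SIGTERM handler would then look roughly like:
//   process.once("SIGTERM", () => {
//     gracefulShutdown(flushAllBuffers).then(() => process.exit(0));
//   });
```

Without this step, any metrics observed since the last timer tick would be lost whenever the service restarts.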
- Check metrics are still updated on the database with these versions running.
Does this pull request close any open issues?
Hopefully fixes https://github.com/livepeer/livepeer-infra/issues/851
Checklist: