Add Kubernetes Monitoring Mixin #5
Conversation
ishanjainn commented Aug 26, 2023 (edited)
Some leading questions about the type of metrics you're using, and how exactly tokens are handled (and thus, what sort of query and thresholds make sense for the alerts).
I know very little about OpenAI in particular, which is why I ask the questions I do. If you've already considered these things, you can ignore my comments :)
Overall, no notes on the design of the dashboard.
kubernetes-mixin/README.md
Outdated
- HighCompletionTokensUsage
- HighPromptTokensUsage
- HighTotalTokensUsage
- LongRequestDuration
- HighUsageCost
The alerts for these have (apparently?) arbitrary numeric values. Could these be configurable?
Taking a quick look at the OpenAI tokens system, it looks like the maximum number of tokens per-request is 4097.
Some questions for which I do not have answers, but could inform these numbers.
Are the token metrics gauges or counters? If they're counters, a rate should be used instead of sum.
Can a single model process more than one request at once? If so, the sum of these over 5m could handle roughly 2 requests every 5min (assuming they're gauges, with a 5+min range).
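The two cases lead to different alert queries. A sketch under the assumptions above, using the metric names from this PR; the `$promptTokensThreshold` placeholder is hypothetical, standing in for whatever configurable threshold the mixin ends up exposing:

```promql
# If the token metrics were counters, alert on the per-second rate:
sum by (job) (rate(openai_promptTokens{job=~"$job"}[5m])) > $promptTokensThreshold

# If they are gauges (per-request snapshots), average over the window instead:
avg_over_time(openai_promptTokens{job=~"$job"}[5m]) > $promptTokensThreshold
```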
All of the metrics are gauges.
No, it can only process a single request at a time.
"values": false
"disableTextWrap": false,
"editorMode": "code",
"expr": "sum by(job) (count_over_time(openai_promptTokens{job=~\"$job\"}[$__interval]))",
Prompt tokens are not the same as requests. Is there a bespoke request counter?
Sorry, I didn't get this.
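To illustrate the reviewer's distinction: counting samples of a token gauge is a proxy for request count, not a request count itself. A sketch of what a direct request-rate query could look like, assuming a hypothetical `openai_requests_total` counter that is not present in this integration:

```promql
# Hypothetical: if the integration exposed a dedicated request counter
# (openai_requests_total -- NOT part of this PR), requests per second
# could be graphed directly instead of counting token-gauge samples:
sum by (job) (rate(openai_requests_total{job=~"$job"}[$__rate_interval]))
```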
"useBackend": false
}
],
"title": "Total Tokens vs Request Duration",
Does it make sense to have both of these on the same timeseries? Request duration is likely to be less than 100 seconds, while average total tokens per request may be over 1000. A 10x scale difference will be hard to grok on the same graph.
Yeah, axis-wise I agree it looks a bit off, but I wanted to have a correlation panel. Is there a way we can do this better?
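One common way to keep a correlation panel readable in Grafana is a field override that moves one series onto a right-hand y-axis, so each quantity gets its own scale. A sketch of the override fragment for the panel's `fieldConfig`; the series name `Request Duration` is an assumption about how the query is labelled:

```json
"fieldConfig": {
  "overrides": [
    {
      "matcher": { "id": "byName", "options": "Request Duration" },
      "properties": [
        { "id": "custom.axisPlacement", "value": "right" },
        { "id": "unit", "value": "s" }
      ]
    }
  ]
}
```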
"useBackend": false
"disableTextWrap": false,
"editorMode": "code",
"expr": "openai_totalTokens{job=~\"$job\"}",
Again, it's not clear if this is a counter or a gauge.
If it is a gauge, it will only have the value of total available tokens when the metric is sampled?
If it is a counter, some sort of increase or rate function needs to be applied.
> If it is a gauge, it will only have the value of total available tokens when the metric is sampled?

Yup, this is a gauge. totalTokens is the sum of completion tokens and prompt tokens, hence I used it directly against duration in this panel.
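Given that `openai_totalTokens` is a gauge whose samples are per-request sums, a smoothed variant of the panel query can reduce single-request spikes; the window choice here is a judgment call, not part of the PR:

```promql
# Gauge: each sample is the token count of the most recent request.
# Averaging over the dashboard interval smooths single-request spikes:
avg_over_time(openai_totalTokens{job=~"$job"}[$__interval])
```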