Need a dashboard #396

Closed
8 tasks done
aronchick opened this issue Jul 20, 2022 · 6 comments · Fixed by #438

@aronchick
Collaborator

aronchick commented Jul 20, 2022

We need a dashboard that walks our traces and produces the following things (daily):

  • Number of jobs
  • Running time of each job
  • Spec of each job (e.g. whether a GPU was used, amount of input data, amount of output data)
  • Number of unique users submitting jobs
  • Top users and jobs they submitted (in aggregate)
  • Lots more... just start here :)

Design

  • Internal dashboard, no auth needed
  • A 'unique user' is basically the public key used by the CLI to submit a job (find out more)
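
To make the requested daily metrics concrete, here is a minimal Python sketch of the aggregation, assuming each job has already been extracted from the traces into a flat record. The `JobRecord` shape, its field names, and the `daily_summary` helper are hypothetical and are not taken from the codebase; the only thing borrowed from the design notes is that a user is identified by the submitting public key.

```python
from collections import Counter, defaultdict
from dataclasses import dataclass
from datetime import datetime


@dataclass
class JobRecord:
    # Hypothetical record shape; the real trace fields may differ.
    job_id: str
    client_public_key: str  # stands in for the "unique user"
    started_at: datetime
    finished_at: datetime
    used_gpu: bool
    input_bytes: int
    output_bytes: int


def daily_summary(jobs, top_n=3):
    """Group job records by start date and compute per-day metrics."""
    by_day = defaultdict(list)
    for job in jobs:
        by_day[job.started_at.date()].append(job)

    summary = {}
    for day, day_jobs in sorted(by_day.items()):
        # Jobs submitted per public key on this day.
        per_user = Counter(j.client_public_key for j in day_jobs)
        summary[day] = {
            "job_count": len(day_jobs),
            "durations_s": {
                j.job_id: (j.finished_at - j.started_at).total_seconds()
                for j in day_jobs
            },
            "gpu_jobs": sum(1 for j in day_jobs if j.used_gpu),
            "input_bytes": sum(j.input_bytes for j in day_jobs),
            "output_bytes": sum(j.output_bytes for j in day_jobs),
            "unique_users": len(per_user),
            "top_users": per_user.most_common(top_n),
        }
    return summary
```

Grouping by start date is just one possible choice; a job that runs across midnight could equally be attributed to the day it finished.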

Tasks

Follow up: implement more metrics ---> #582

@enricorotundo enricorotundo self-assigned this Jul 28, 2022
@enricorotundo enricorotundo linked a pull request Aug 3, 2022 that will close this issue
@lukemarsden
Contributor

Is this finished? Where's the dashboard?

@lukemarsden lukemarsden reopened this Aug 24, 2022
@enricorotundo
Contributor

Waiting on the PL infra team to provide us with a Grafana Cloud key. For some reason our accounts (mine and @aronchick's) don't seem to be able to generate one.

Code-wise it's all set up. We may want to add new metrics, though.

@enricorotundo
Contributor

enricorotundo commented Sep 7, 2022

Update: still busy configuring the Grafana dashboard to display the actual running total of the metrics. These are counters and they reset every now and then, so a simple sum is not enough. The closest I've got so far is this Grafana query: sum(increase(jobs_completed[720h])). WIP. Stay tuned.
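
To illustrate why a plain sum or a last-minus-first difference falls short for a counter that resets, here is a small Python sketch of reset-aware accumulation. It only approximates the idea behind `increase()`: the real function also extrapolates to the edges of the time window, which this sketch leaves out. The `counter_increase` helper and the sample values are made up for illustration.

```python
def counter_increase(samples):
    """Total increase of a monotonic counter across resets.

    `samples` is a chronologically ordered list of raw counter values.
    A drop between consecutive samples is treated as a reset to zero,
    so the value seen after the reset counts in full.
    """
    total = 0.0
    for prev, cur in zip(samples, samples[1:]):
        if cur >= prev:
            total += cur - prev
        else:
            # Counter restarted: assume it started again from 0.
            total += cur
    return total


# The counter climbs to 10, resets, then climbs to 4.
samples = [0, 5, 10, 2, 4]
naive = samples[-1] - samples[0]         # ignores the reset
reset_aware = counter_increase(samples)  # accounts for it
```

Here the naive difference is 4, while the reset-aware total is 14.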

@enricorotundo
Contributor

Update: As per this thread, I've switched to using irate.
[Screenshot 2022-09-08 at 16:12:00]
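
For readers unfamiliar with `irate`: it yields a per-second rate computed from the last two samples in the range, rather than a total over the whole window. The Python sketch below mirrors that idea under simplifying assumptions; `instant_rate` is a hypothetical helper, not the dashboard's actual query logic.

```python
def instant_rate(samples):
    """Per-second rate from the last two (timestamp_s, value) samples.

    `samples` must be in chronological order. Returns None when fewer
    than two samples are available or the timestamps do not advance.
    """
    if len(samples) < 2:
        return None
    (t0, v0), (t1, v1) = samples[-2], samples[-1]
    if t1 <= t0:
        return None
    # A drop in value is treated as a counter reset to zero.
    delta = v1 - v0 if v1 >= v0 else v1
    return delta / (t1 - t0)
```

Because only the last two points matter, the result reacts quickly to changes but says nothing about the total accumulated over a long window.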

Public link: https://protocollabs.grafana.net/public-dashboards/9fe7717277e540a7be88f1e8e8339bd8
Private link (for editing): https://protocollabs.grafana.net/d/x6I6OcG4z/bacalhau?orgId=1&from=now-7d&to=now


Follow up issue: #582 (Add more metrics to the codebase).

@enricorotundo
Contributor

Note: Grafana Cloud public dashboards have only alpha support; sometimes it's quite painful to keep the public dashboard in sync with the private one :(

4 participants