[Metrics] Observability requirements for Hosted Che #13270

l0rd · 2019-05-02T09:15:41Z

l0rd · 2019-05-02T09:17:45Z

I have added this issue to epic #10329

ibuziuk · 2019-05-02T10:00:32Z

I believe the first step would still be adding what we already have available on dsaas - redhat-developer/rh-che#1336

l0rd · 2019-05-02T13:29:38Z

@ibuziuk are you talking about the monitoring infrastructure (prometheus, grafana etc...)? This issue is about implementing the prometheus endpoints.

ibuziuk · 2019-05-02T14:10:40Z

@l0rd Of course, this is not a blocker for implementing those endpoints upstream, but until we have a proper setup in downstream we will not be able to take full advantage of those metrics for Hosted Che

mkuznyetsov · 2019-05-15T12:41:23Z

we already have in Grafana "Workspace Detailed" section with heatmaps, which could be used for getting "The % of workspaces started in under N seconds" (if we want to do exactly that, we have metric endpoints specifically for it)

we also have "The % of workspaces started successfully" displayed on Grafana, yet we don't have the "The % of workspaces stopped successfully"

aditya-konarde · 2019-07-22T08:22:33Z

/cc @skryzhny can you please provide some feedback on the metrics and provide suggestions here?

Beyond the classic USE and RED metrics, we can look at some application specific metrics other than the current ones that either:

Provide business insights to someone running Eclipse Che (number of workspaces, number of users, number of signups)
Provide feedback into development (Workspace start time, average runtime of a workspace, workspace aggregate errors)

I believe we're already well covered looking at the dashboard. But may have scope fro some more :)

che-bot · 2021-02-04T14:39:49Z

Issues go stale after 180 days of inactivity. lifecycle/stale issues rot after an additional 7 days of inactivity and eventually close.

Mark the issue as fresh with /remove-lifecycle stale in a new comment.

If this issue is safe to close now please do so.

Moderators: Add lifecycle/frozen label to avoid stale mode.

l0rd added kind/enhancement A feature request - must adhere to the feature request template. team/platform labels May 2, 2019

yarivlifchuk mentioned this issue May 2, 2019

Che Monitoring #10329

Closed

21 tasks

skabashnyuk added team/ide2 severity/P1 Has a major impact to usage or development of the system. labels May 15, 2019

skabashnyuk mentioned this issue May 21, 2019

Platform-2019-06-12 (Sprint: 167) #13337

Closed

16 tasks

This was referenced May 21, 2019

Investigate how to report metrics from workspaces #13375

Closed

Add successful stopped workspaces metric #13404

Merged

Add workspace stop rate metric eclipse-che/che-docs#721

Merged

l0rd added this to the 7.x milestone Jul 18, 2019

l0rd added the team/languages label Jul 18, 2019

tsmaeder mentioned this issue Jul 25, 2019

Investigate Language server metrics #14017

Closed

tsmaeder mentioned this issue Aug 15, 2019

Implement Language Server Metrics #14245

Closed

tsmaeder modified the milestones: 7.x, Backlog - Languages Sep 23, 2019

skabashnyuk added the kind/epic A long-lived, PM-driven feature request. Must include a checklist of items that must be completed. label Oct 15, 2019

skabashnyuk modified the milestones: Backlog - Languages, Backlog - Epics Oct 15, 2019

This was referenced Oct 15, 2019

[MVP] Collecting Che Workspace metrics in single cluster mode with Prometheus #14888

Closed

Collecting Che Workspace metrics in multi-cluster mode with help of Prometheus Federation mechanism #14889

Closed

sparkoo mentioned this issue Oct 29, 2019

add metrics plugin eclipse-che/che-theia#520

Merged

This was referenced Oct 30, 2019

Test Prometheus federation in simplified mode. #15030

Closed

Investigate alternative possibilities to reduce Prometheus's permissions required to monitor workspaces #15031

Closed

skabashnyuk added this to In progress in Platform Epics Nov 20, 2019

skabashnyuk moved this from In progress to To do in Platform Epics Nov 25, 2019

azatsarynnyy removed the team/editors label Feb 7, 2020

ibuziuk changed the title ~~[Metrics] Observability requirements for hosted Che~~ [Metrics] Observability requirements for Hosted Che Jun 23, 2020

ericwill added team/plugins and removed team/languages labels Jul 6, 2020

l0rd added the area/che-server label Jul 30, 2020

che-bot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Feb 4, 2021

skabashnyuk added lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Feb 8, 2021

skabashnyuk closed this as completed Aug 17, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Metrics] Observability requirements for Hosted Che #13270

[Metrics] Observability requirements for Hosted Che #13270

l0rd commented May 2, 2019 •

edited by skabashnyuk

Loading

l0rd commented May 2, 2019

ibuziuk commented May 2, 2019

l0rd commented May 2, 2019

ibuziuk commented May 2, 2019

mkuznyetsov commented May 15, 2019 •

edited

Loading

aditya-konarde commented Jul 22, 2019

che-bot commented Feb 4, 2021

[Metrics] Observability requirements for Hosted Che #13270

[Metrics] Observability requirements for Hosted Che #13270

Comments

l0rd commented May 2, 2019 • edited by skabashnyuk Loading

Description

Che Server metrics

Workspace metrics

l0rd commented May 2, 2019

ibuziuk commented May 2, 2019

l0rd commented May 2, 2019

ibuziuk commented May 2, 2019

mkuznyetsov commented May 15, 2019 • edited Loading

aditya-konarde commented Jul 22, 2019

che-bot commented Feb 4, 2021

l0rd commented May 2, 2019 •

edited by skabashnyuk

Loading

mkuznyetsov commented May 15, 2019 •

edited

Loading