Monitoring #103

gazwald · 2022-08-20T23:18:48Z

darkshade9 · 2022-08-30T21:57:09Z

Free third party accounts to check availability like Checkly?
Prometheus for metrics gathering internally?

gazwald · 2022-09-06T10:29:33Z

Initially I thinking of exporting metrics to CloudWatch and generating alarms from those. Prometheus would be a good option as well. Depends on the pricing model in AWS. I figure the interface would be similar to how Storage is implemented so it's relatively simple to swap between services.

Synthetic tests (e.g. something sending a ping and expecting an ack) was another route I was planning with the client CLI. Probably spawned from a scheduled AWS Lambda or something.

Tracing would be awesome (X-Ray/Jaeger) but definitely something for later on. Need to divide all the options up into the minimum required for now, and required for if enough people start relying on the service.

darkshade9 · 2022-09-06T14:44:50Z

My day job is as an SRE/DevOps, so let me know what you think would work out best. Running an internal Prometheus container could fit the bill to keep tabs on utilization, and external synthetics for user-facing uptime/performance, integrate them into an alerting service for specific triggers, etc.

gazwald added the enhancement label Aug 20, 2022

gazwald added this to the Alpha milestone Aug 20, 2022

gazwald mentioned this issue Aug 24, 2022

Query caching and rate limiting #102

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Monitoring #103

Monitoring #103

gazwald commented Aug 20, 2022 •

edited

darkshade9 commented Aug 30, 2022

gazwald commented Sep 6, 2022

darkshade9 commented Sep 6, 2022

Monitoring #103

Monitoring #103

Comments

gazwald commented Aug 20, 2022 • edited

darkshade9 commented Aug 30, 2022

gazwald commented Sep 6, 2022

darkshade9 commented Sep 6, 2022

gazwald commented Aug 20, 2022 •

edited