Horreum observability #845

Open
johnaohara opened this issue Oct 30, 2023 · 1 comment
Labels
area/backend branch/master The master branch type/enhancement An enhancement to an existing feature

Comments

@johnaohara
Member

The OpenTelemetry extension allows us to view telemetry data and export it to Prometheus.

I was thinking about #342 and observability in general. I was able to identify the root cause of that issue using custom OpenTelemetry spans to track the tasks placed in the task queue.
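
To illustrate the idea of tracking queued tasks with spans, here is a minimal, library-free sketch. It is not the code used to diagnose #342 and it does not use the OpenTelemetry API; `TaskSpan` and `TracedTaskQueue` are hypothetical names invented for this example. It only shows which points in a task's life a span would capture: when the task was enqueued, when it started, when it finished, and whether it failed. With OpenTelemetry, those same points would correspond to starting a span, recording an exception on it, and ending it.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.List;
import java.util.Queue;

// Hypothetical stand-in for a tracing span; not an OpenTelemetry type.
final class TaskSpan {
    final String taskName;
    final long enqueuedNanos;
    long startedNanos;
    long finishedNanos;
    Throwable error; // stays null if the task completed normally

    TaskSpan(String taskName, long enqueuedNanos) {
        this.taskName = taskName;
        this.enqueuedNanos = enqueuedNanos;
    }

    // Time the task spent waiting in the queue before it ran.
    long queueWaitNanos() { return startedNanos - enqueuedNanos; }

    // Time the task spent executing.
    long runNanos() { return finishedNanos - startedNanos; }
}

// Hypothetical single-threaded queue that records one span per task.
final class TracedTaskQueue {
    private static final class Entry {
        final TaskSpan span;
        final Runnable task;
        Entry(TaskSpan span, Runnable task) { this.span = span; this.task = task; }
    }

    private final Queue<Entry> pending = new ArrayDeque<>();
    private final List<TaskSpan> finished = new ArrayList<>();

    // The span is created at submit time so that queue wait is measurable.
    void submit(String name, Runnable task) {
        pending.add(new Entry(new TaskSpan(name, System.nanoTime()), task));
    }

    // Runs every pending task in order. A failing task is recorded on its
    // span rather than rethrown, so later tasks still run.
    void drain() {
        Entry e;
        while ((e = pending.poll()) != null) {
            e.span.startedNanos = System.nanoTime();
            try {
                e.task.run();
            } catch (RuntimeException ex) {
                e.span.error = ex;
            } finally {
                e.span.finishedNanos = System.nanoTime();
                finished.add(e.span);
            }
        }
    }

    List<TaskSpan> finishedSpans() { return finished; }
}
```

Querying `finishedSpans()` for entries with a non-null `error` or a large `queueWaitNanos()` is the kind of question that is hard to answer by grepping logs, and that a tracing backend answers directly.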

How do we want to observe a running prod instance of Horreum, what info do we want to track, and how do we want to access it? For example, the logs do not contain enough information to understand what is happening in #342, even at debug level.

Do we want to rely on grepping debug logs to find information, or do we want some form of observability tool that lets us query recorded events and obtain the information needed to understand what is happening in the running instance?

It is relatively simple to add OpenTelemetry as a telemetry backend, but it requires the necessary infrastructure to process the telemetry data (OTel Collector, Prometheus, Jaeger, etc.). On the flip side, the Quarkus OpenTelemetry extension provides insight and error tracing in some of the Quarkus subsystems, including parts of the system we would not naturally think to instrument, e.g.:

Screenshot from 2023-02-07 19-00-12

Originally posted by @johnaohara in #365

@johnaohara johnaohara added type/enhancement An enhancement to an existing feature branch/master The master branch area/backend labels Oct 30, 2023
@stalep
Member

stalep commented Oct 30, 2023

I think this would help us a lot when we encounter issues in Horreum and have trouble finding the errors in the logs. Log reporting should already be better with 0.10, but this would improve it further.


2 participants