Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature Request: Move from Monitoring Platform to Observability Platform #65

Closed
5 tasks done
ricsanfre opened this issue Aug 10, 2022 · 2 comments
Closed
5 tasks done
Milestone

Comments

@ricsanfre
Copy link
Owner

ricsanfre commented Aug 10, 2022

Feature Request

Improve current Monitoring platform, based on Prometheus (metrics) and EFK (logs) to a Observability Platform adding traces monitoring and a single plane of glass.

Explore solution based on Loki (logs), Tempo (traces), Prometheus (metrics) and Grafana as single plane of glass for monitoring.

Main requirements

  • Loki as complement of the current EFK platform, not a replacement. ES is used mainly for Log Analytics (log content is completely indexed) while Loki can be used for Observability (only log labels are indexed) having together logs, metrics and traces in the same Grafana Dashboards.

  • Current logs collection/distrution layer based on fluentbit/fluentd should be used to feed logs to ES and to Loki, instead of deploying a separate collector (Loki promtail)

    Fluentbit/Fluentd Loki plugins are available, so current forwarder/aggregator architecture could be maintained just adding Loki as backend for logs.

  • Linkerd distributed tracing mechanism should be enabled. See this Grafana's blog entry where the process is explained.

Issue Progress

  • Add Loki
  • Reconfigure Fluentbit/Fluentd to send logs to ES and Loki
  • Add Grafana Tempo
  • Configure Linkerd distributed tracing
  • Configure, Ingress Controller, Traefik traces
@ricsanfre ricsanfre added this to the backlog milestone Aug 10, 2022
@ricsanfre
Copy link
Owner Author

ricsanfre commented Oct 31, 2022

References:

@ricsanfre ricsanfre mentioned this issue Nov 1, 2022
@ricsanfre
Copy link
Owner Author

About Traefik tracing

  • Activate tracing using B3 propagation protocol (only one supported by Linkerd).

    --tracing.zipkin=true
    --tracing.zipkin.httpEndpoint=http://zipkin:9411/api/v2/spans
    --tracing.zipkin.sameSpan=true
    --tracing.zipkin.id128Bit=true
    --tracing.zipkin.sampleRate=1
    
  • In order to correlate logs with traces in Grafana, Traefik access log should be configured so, trace ID is also present as a field in the logs. By default no header is included in Traefik's access log. See https://doc.traefik.io/traefik/observability/access-logs/#limiting-the-fieldsincluding-headers

@ricsanfre ricsanfre mentioned this issue Nov 20, 2022
@ricsanfre ricsanfre modified the milestones: backlog, release 1.6 Nov 20, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant