Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

relay: send stats to portal #4451

Open
thomaseizinger opened this issue Apr 1, 2024 · 2 comments
Open

relay: send stats to portal #4451

thomaseizinger opened this issue Apr 1, 2024 · 2 comments
Labels
area/observability Logging, metrics, OpenTelemetry, tracing area/relay Issues involving the Firezone Relay complexity/low Something that should not take more than a few hours. kind/feature New feature or request

Comments

@thomaseizinger
Copy link
Member

The portal needs to know the utilisation of a relay in order to load-balance them for new connections.

The critical stats here are:

  • Relative utilisation of allocations (available port range / used port range)
  • Relative CPU utilisation of the entire machine (/proc/stat?)

Other related metrics are:

  • Active channel bindings: Those are per peer, so more difficult to report and not as useful for load balancing because the conclusion would have to be "you can still use this relay, just not for this one client / gateway". We have lots of channels per peer and we should do snownet: Check if direct connection is possible again after some time X #4164 to be more considerate with this resource.
  • Throughout: Not really useful for load balancing because we don't know the max. possible throughput.
@thomaseizinger thomaseizinger added kind/feature New feature or request area/relay Issues involving the Firezone Relay area/observability Logging, metrics, OpenTelemetry, tracing complexity/low Something that should not take more than a few hours. labels Apr 1, 2024
@AndrewDryga
Copy link
Collaborator

We might use the same metrics for #4415.

@thomaseizinger
Copy link
Member Author

The portal needs to know the utilisation of a relay in order to load-balance them for new connections.

How valid is this usecase given that we now send relays to the clients / gateways outside of connections?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/observability Logging, metrics, OpenTelemetry, tracing area/relay Issues involving the Firezone Relay complexity/low Something that should not take more than a few hours. kind/feature New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants