Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Publish storage worker and execute worker pool size in prometheus #1606

Merged
merged 4 commits into from
Jan 12, 2024

Conversation

amishra-u
Copy link
Contributor

Publish storage worker and execute worker pool size in prometheus

@amishra-u amishra-u requested a review from werkt as a code owner January 5, 2024 23:58
@amishra-u
Copy link
Contributor Author

Had dependency on #1605, buildfails due to missing newly added field.

.register();
private static final Gauge executeWorkerPoolSize =
Gauge.build()
.name("execute_worker_pool_size")
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Would be useful to add a labelNames("queue_name") here as you can have multiple pools of execute workers. Also can you please update the config docs with new metrics.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This will require changes at multiple places.

  1. Change type of active_workers property in BackplaneStatus. Which will trigger changes in multiple places.
  2. Include worker queues in ShardWorker

Are we ok with these changes?

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is fine as is for now. There is a different way to get the workers by type (if running in k8s) with something like this:
count(completed_operations_total) by (job)

where job is a label assigned to a kubernetes deployment.

Copy link
Collaborator

@80degreeswest 80degreeswest left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for updating the doc!

.register();
private static final Gauge executeWorkerPoolSize =
Gauge.build()
.name("execute_worker_pool_size")
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is fine as is for now. There is a different way to get the workers by type (if running in k8s) with something like this:
count(completed_operations_total) by (job)

where job is a label assigned to a kubernetes deployment.

@80degreeswest 80degreeswest merged commit d155c0b into bazelbuild:main Jan 12, 2024
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants