fix(buffers): Optimize buffer usage metric tracking by bruceg · Pull Request #24911 · vectordotdev/vector

bruceg · 2026-03-12T21:02:03Z

Summary

The buffer usage metrics, in particular the value of the current utilization levels, were tracked using an atomic u64 which was updated using a fetch_update mechanism in order to protect against underflowing. This same mechanism was extended to all of the atomics as well for consistency. The problem with that is that fetch_udpate internally uses a loop around a compare-and-exchange operation which is very expensive, particularly when contended. In comparison, the base fetch_add is typically a single locked instruction which completes in many fewer cycles.

This change returns these atomics to only ever use fetch_add and then calculate the current level by subtracting the count of increments from the count of decrements.

Vector configuration

N/A

How did you test this PR?

Unit tests

Change Type

Is this a breaking change?

Yes
No

Does this PR include user facing changes?

Yes. Please add a changelog fragment based on our guidelines.
No. A maintainer will apply the no-changelog label to this PR.

References

#24058

Notes

Please read our Vector contributor resources.
Do not hesitate to use @vectordotdev/vector to reach out to us regarding this PR.
Some CI checks run only after we manually approve them.
- We recommend adding a pre-push hook, please see this template.
- Alternatively, we recommend running the following locally before pushing to the remote branch:
  - make fmt
  - make check-clippy (if there are failures it's possible some of them can be fixed with make clippy-fix)
  - make test
After a review is requested, please avoid force pushes to help us review incrementally.
- Feel free to push as many commits as you want. They will be squashed into one before merging.
- For example, you can run git merge origin master and git push.
If this PR introduces changes Vector dependencies (modifies Cargo.lock), please
run make build-licenses to regenerate the license inventory and commit the changes (if any). More details here.

The buffer usage metrics, in particular the value of the current utilization levels, were tracked using an atomic `u64` which was updated using a `fetch_update` mechanism in order to protect against underflowing. This same mechanism was extended to all of the atomics as well for consistency. The problem with that is that `fetch_udpate` internally uses a loop around a `compare-and-exchange` operation which is very expensive, particularly when contended. In comparison, the base `fetch_add` is typically a single locked instruction which completes in many fewer cycles. This change returns these atomics to only ever use `fetch_add` and then calculate the current level by subtracting the count of increments from the count of decrements.

bruceg · 2026-03-12T21:07:51Z

Regression test run indicates no change

pront · 2026-03-13T18:29:36Z

changelog.d/24911-buffer-usage-metric-performance.fix.md

@@ -0,0 +1,3 @@
+Fixed regression in performance of buffer usage metric tracking.


Did we validate if this PR fixes the regression?

Ref #24911 (comment)

pznamensky · 2026-03-16T11:04:42Z

Thanks for the effort, @bruceg!
I'd be happy to try this out in our production env with real workload if needed.
However I have some problems building proper docker image. So if you could trigger a docker build job, I could use that image to use in our setup.

pront · 2026-03-16T13:28:19Z

Thanks for the effort, @bruceg! I'd be happy to try this out in our production env with real workload if needed. However I have some problems building proper docker image. So if you could trigger a docker build job, I could use that image to use in our setup.

Hi @pznamensky I kicked off a custom build: https://github.com/vectordotdev/vector/actions/runs/23146056109

You can use those builds once they are published. Looking forward to hearing back from after you test this 🤞

pznamensky · 2026-03-16T19:09:25Z

@pront, thank you for preparing the images.
Bad news is that in our case Vector from this PR uses CPU on the same level as v0.54.0.
Average CPU usage in our cluster:

So it looks like the original issue might be not in metrics.

bruceg requested a review from a team as a code owner March 12, 2026 21:02

bruceg added meta: regression This issue represents a regression domain: performance Anything related to Vector's performance labels Mar 12, 2026

bruceg force-pushed the bruceg/optimize-buffer-usage-data branch from 7e9a011 to 4a15afc Compare March 12, 2026 21:04

bruceg mentioned this pull request Mar 12, 2026

Vector CPU Usage increased 0.50.0 #24058

Open

bruceg force-pushed the bruceg/optimize-buffer-usage-data branch from 4a15afc to 3936121 Compare March 12, 2026 21:06

pront reviewed Mar 13, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(buffers): Optimize buffer usage metric tracking#24911

fix(buffers): Optimize buffer usage metric tracking#24911
bruceg wants to merge 1 commit intomasterfrom
bruceg/optimize-buffer-usage-data

bruceg commented Mar 12, 2026 •

edited by pront

Loading

Uh oh!

bruceg commented Mar 12, 2026

Uh oh!

pront Mar 13, 2026

Uh oh!

pznamensky commented Mar 16, 2026

Uh oh!

pront commented Mar 16, 2026

Uh oh!

pznamensky commented Mar 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		@@ -0,0 +1,3 @@
		Fixed regression in performance of buffer usage metric tracking.

Conversation

bruceg commented Mar 12, 2026 • edited by pront Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Vector configuration

How did you test this PR?

Change Type

Is this a breaking change?

Does this PR include user facing changes?

References

Notes

Uh oh!

bruceg commented Mar 12, 2026

Uh oh!

pront Mar 13, 2026

Choose a reason for hiding this comment

Uh oh!

pznamensky commented Mar 16, 2026

Uh oh!

pront commented Mar 16, 2026

Uh oh!

pznamensky commented Mar 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

bruceg commented Mar 12, 2026 •

edited by pront

Loading