chore(dataobj): Improve performance of dataset.Value API #18339


Merged
4 commits merged into main on Jul 8, 2025

Conversation

@rfratto (Member) commented Jul 4, 2025

This PR makes several small changes to improve the performance of the dataset.Value API:

  1. Reduce the complexity of methods so that they can be inlined. The previous complexity of Value.Type propagated down to all the methods that call it, making them hard to inline.

    Using error types for panics also reduces the complexity cost of the panic lines while retaining the same error messages; the cost of both string concatenation and fmt.Sprintf inflated the inlining budget, further making it difficult to inline functions.

  2. Use pointer receivers wherever possible for Value methods. As each Value is 28 bytes (previously 24, but still large), copying the Value on the stack to call a non-inlined method was expensive. This was seen most prominently in CompareValues, which still can't be inlined. Using pointers for CompareValues significantly improves its performance.

  3. Replace use of any for the final field with a plain *byte type. This ensures that there are absolutely no allocations for that field. Some usages of the old field have been replaced with the explicit kind field.
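The three changes above can be sketched together. This is a hypothetical shape of the type, not the actual dataobj code: the field and constant names here (`num`, `data`, `kind`, `KindInt64`, `errWrongKind`) are illustrative.

```go
package main

import "fmt"

// ValueKind is an illustrative kind tag; the real package defines its own.
type ValueKind uint8

const (
	KindNull ValueKind = iota
	KindInt64
)

// errWrongKind is a sentinel error type. Panicking with a typed error keeps
// the panic line cheap for the inliner, unlike building a message with
// string concatenation or fmt.Sprintf at the panic site; the formatting
// cost moves into Error(), which only runs if the panic is observed.
type errWrongKind struct{ got, want ValueKind }

func (e errWrongKind) Error() string {
	return fmt.Sprintf("dataset: value kind is %d, not %d", e.got, e.want)
}

// Value keeps an explicit kind field and a raw *byte payload instead of an
// `any` field, so storing data never forces an interface allocation.
type Value struct {
	num  uint64
	data *byte
	kind ValueKind
}

// Int64 uses a pointer receiver: per the PR, Value is 28 bytes, so copying
// it for every non-inlined call showed up in profiles.
func (v *Value) Int64() int64 {
	if v.kind != KindInt64 {
		panic(errWrongKind{got: v.kind, want: KindInt64})
	}
	return int64(v.num)
}

func main() {
	v := Value{num: 42, kind: KindInt64}
	fmt.Println(v.Int64()) // 42
}
```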

Additionally, smaller changes have been made:

  • All methods that mutate the state of Value have been removed, with the exception of Zero. The functionality of these methods has been pushed down to the callers. This was done to reduce the complexity of the code, but also to manually inline that logic into the plain value encoder.

  • Usage of cmp.Compare has been replaced with a hand-rolled compare method for integers, which is slightly less expensive due to not needing to handle the possibility of floating-point numbers.
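The hand-rolled compare is presumably along these lines. The generic cmp.Compare must also order NaNs correctly for floating-point types, which a dedicated integer version can skip entirely (a sketch, not the actual Loki code):

```go
package main

import "fmt"

// compareInt64 returns -1, 0, or +1 ordering a against b. Unlike the
// generic cmp.Compare, it has no NaN handling, since int64 has no NaN.
func compareInt64(a, b int64) int {
	switch {
	case a < b:
		return -1
	case a > b:
		return 1
	default:
		return 0
	}
}

func main() {
	fmt.Println(compareInt64(3, 7), compareInt64(7, 3), compareInt64(5, 5)) // -1 1 0
}
```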

As a result of these changes, the synthetic logql/bench benchmarks show a 10-15% speed improvement in all queries.

rfratto added 2 commits July 4, 2025 13:58
Using dataset.Value is a sizeable portion of total query time. This
commit adds benchmarks to more directly test improvements to
dataset.Value.
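A direct micro-benchmark of this kind typically follows the standard testing.B pattern. The names below (`zeroCheck`, `BenchmarkZeroCheck`) are hypothetical stand-ins, not the benchmarks added by this commit:

```go
package main

import (
	"fmt"
	"testing"
)

// zeroCheck is a stand-in for a cheap dataset.Value operation whose cost
// we want to measure in isolation.
func zeroCheck(num uint64, kind uint8) bool {
	return num == 0 && kind == 0
}

// BenchmarkZeroCheck uses the usual testing.B shape: the loop body is the
// operation under test, and ReportAllocs surfaces per-op allocations.
func BenchmarkZeroCheck(b *testing.B) {
	b.ReportAllocs()
	var sink bool
	for i := 0; i < b.N; i++ {
		sink = zeroCheck(uint64(i), 0)
	}
	_ = sink // keep the result live so the loop isn't optimized away
}

func main() {
	// testing.Benchmark runs a benchmark function outside `go test`.
	res := testing.Benchmark(BenchmarkZeroCheck)
	fmt.Println(res.N > 0)
}
```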
This commit makes several small changes to improve the performance of
the dataset.Value API; the changes are the same as those detailed in the
PR description above.

As a result of these changes, the synthetic logql/bench benchmarks show
a global 2x speed improvement in queries, alongside a 4% reduction in
allocations.
@rfratto rfratto requested a review from a team as a code owner July 4, 2025 18:31
@rfratto (Member, Author) commented Jul 4, 2025

I made a few smaller changes since the last time I ran the benchmarks. I'm running them one final time, and I'll share the results here.

@rfratto (Member, Author) commented Jul 4, 2025

The results are a little hard to verify using the CI, since it seems pretty inconsistent with its results today. It initially looked like this PR improved the speed by 2x, but it seems like it's probably closer to 10-15%.

The correctness tests are failing too, but they are also failing on the baseline I used, so I think that can be ignored here.

@ashwanthgoli (Contributor) left a comment:
lgtm

rfratto and others added 2 commits July 8, 2025 09:23
@rfratto rfratto merged commit 26cb50e into main Jul 8, 2025
65 checks passed
@rfratto rfratto deleted the dataset-value-perf branch July 8, 2025 13:40
andrejshapal pushed a commit to andrejshapal/loki that referenced this pull request Jul 8, 2025