feat(wal): Benchmark and improve WAL writes using Reset. #13272

cyriltovena · 2024-06-20T13:43:46Z

What this PR does / why we need it:

This adds a benchmark and some improvement to the write path of wal segment.

Which issue(s) this PR fixes:
Fixes https://github.com/grafana/loki-private/issues/1005 https://github.com/grafana/loki-private/issues/1004

❯ benchstat before.txt after.txt                                                                                                                                 
name       old time/op    new time/op    delta
Writes-16    11.9ms ±29%     6.5ms ± 2%  -44.84%  (p=0.008 n=5+5)

name       old alloc/op   new alloc/op   delta
Writes-16    12.9MB ± 9%     0.1MB ± 1%  -98.84%  (p=0.008 n=5+5)

name       old allocs/op  new allocs/op  delta
Writes-16     2.36k ±19%     0.30k ± 0%  -87.13%  (p=0.008 n=5+5)

benclive · 2024-06-20T14:26:50Z

pkg/storage/wal/segment.go

-		streams: swiss.NewMap[streamID, *streamSegment](64),
-		buf1:    encoding.EncWith(make([]byte, 0, 4)),
+func NewWalSegmentWriter() (*SegmentWriter, error) {
+	idxWriter, err := index.NewWriter(context.TODO())


Do we have to pass a ctx to the index? I think we can remove it because we only ever pass context.TODO(). We also don't pass a context to the parent SegmentWriter so the utility is very limited.

Yes I'll look into that.

benclive · 2024-06-20T14:30:17Z

pkg/storage/wal/segment.go

+		s = streamSegmentPool.Get().(*streamSegment)
+		s.lbls = lbls
+		s.tenantID = tenantID
+		s.entries = s.entries[:0]


Should a streamSegment have a reset() method?
It looks like only one line to reset them but it would maintain consistency with our other Readers/Writers.

benclive

LGTM! Just one comment on the tests but the benchmark should already cover it so happy to approve regardless.

benclive · 2024-06-20T17:30:54Z

pkg/storage/wal/segment_test.go

+	t.Logf("Series sizes: [%s]\n", sizesString)
+}
+
+func TestReset(t *testing.T) {


Could you add/modify this test case so the re-use case adds more data than the initial case?
If we had re-use errors, we'd likely panic when re-using a small buffer buit we'd be less likely to panic when re-using a large buffer.

benclive · 2024-06-20T17:36:24Z

pkg/storage/wal/segment_test.go

+	for i := 0; i < b.N; i++ {
+		writer := pool.Get().(*SegmentWriter)
+
+		dst.Reset()


This benchmark isn't running in parallel (since dst is reused). You could remove the pool for writers from the benchmark, or add a pool for dst and run the whole thing in parallel :)

cyriltovena added 30 commits May 16, 2024 18:20

wip

0e5aa15

wip

2ef5c3c

wip

d68a08d

add some doc and vision

144bb9c

move compressed len to chunk

5f8cf08

work on the chunk encoding

f32c755

missing changes

19bbd76

working on fixes and tests

9e1d5b1

add more tests and found a bug with dod

c8b792f

fix(wal): Use varint encoding for ts_2_dod in WAL format

749acf7

refactor(wal): Remove unnecessary code in writeChunk function

7590f55

chore: Refactor ChunkReader to improve performance and memory usage

7991408

chore: Add more realistic tests and benchmarks

38fcad4

refactor: Update index writer to support in memory buffer.

bdf389f

pausing work I need a new index different than the current work

296daee

Add a special in memory index for the wal package

d1cfcae

Finalize writing and start reading index

37ea6d6

Add offset/start to chunk ref

d649646

wip

fd1dbd8

refactor(wal): Implement SeriesIter.

b49d2ba

fix(wal): Fixes snappy block offsets counting.

f575efb

chore: update format doc to reflect latest changes

071ee04

chore: lint

6227361

refactor: Removes changes not required.

f625252

chore: format

32b1d2c

feat(wal): Add sizing information to writer and reader.

d3f179e

Merge remote-tracking branch 'upstream/main' into wal-sizing

57ad53b

Merge remote-tracking branch 'upstream/main' into wal-sizing

d7dc2b1

lint

95c1015

ensure stable test

b6ba673

feat(wal): Benchmark and improve WAL writes using Reset.

3700f3e

cyriltovena requested a review from a team as a code owner June 20, 2024 13:43

pull-request-size bot added the size/L label Jun 20, 2024

lint

ad52803

benclive reviewed Jun 20, 2024

View reviewed changes

Review feedback

6f9489c

pull-request-size bot added size/XL and removed size/L labels Jun 20, 2024

benclive approved these changes Jun 20, 2024

View reviewed changes

Merge remote-tracking branch 'origin/main' into wal-write-benchmark

b6263ef

pull-request-size bot added size/L and removed size/XL labels Jun 24, 2024

benclive merged commit debb5f2 into grafana:main Jun 24, 2024
61 checks passed

loki-gh-app bot mentioned this pull request Jul 1, 2024

chore(k209): release 3.1.0 #13356

Closed

This was referenced Jul 8, 2024

chore(k210): release 3.1.0 #13435

Closed

chore(k210): release 3.1.0 #13462

Closed

loki-gh-app bot mentioned this pull request Jul 15, 2024

chore(k211): release 3.1.0 #13521

Closed

loki-gh-app bot mentioned this pull request Jul 22, 2024

chore(k212): release 3.1.0 #13595

Closed

This was referenced Aug 15, 2024

chore(k215): release 3.2.0 #13905

Open

chore(k216): release 3.2.0 #13929

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(wal): Benchmark and improve WAL writes using Reset. #13272

feat(wal): Benchmark and improve WAL writes using Reset. #13272

cyriltovena commented Jun 20, 2024

benclive Jun 20, 2024

cyriltovena Jun 20, 2024

benclive Jun 20, 2024

benclive left a comment

benclive Jun 20, 2024

benclive Jun 20, 2024

feat(wal): Benchmark and improve WAL writes using Reset. #13272

feat(wal): Benchmark and improve WAL writes using Reset. #13272

Conversation

cyriltovena commented Jun 20, 2024

benclive Jun 20, 2024

Choose a reason for hiding this comment

cyriltovena Jun 20, 2024

Choose a reason for hiding this comment

benclive Jun 20, 2024

Choose a reason for hiding this comment

benclive left a comment

Choose a reason for hiding this comment

benclive Jun 20, 2024

Choose a reason for hiding this comment

benclive Jun 20, 2024

Choose a reason for hiding this comment