Fix compaction index size overflow #7687

ztlpn · 2022-12-10T00:20:59Z

Since the default max compacted segment size is 5 GiB, the compacted index size for large segments can overflow 32-bit ints. This PR replaces the size and the number of keys in the footer with 64-bit ints. The old 32-bit fields are kept for backwards compatibility with code that doesn't check the version number.
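To make the overflow concrete, here is a minimal sketch, not the actual Redpanda footer definition. The struct name `footer_sketch`, its field names, and the helpers `overflows_u32` and `truncate_to_u32` are all hypothetical; 5 GiB is used only as an example value that does not fit in 32 bits.

```cpp
#include <cassert>
#include <cstdint>
#include <limits>

// Hypothetical footer layout for illustration only; names and field order
// are assumptions, not the real on-disk format.
struct footer_sketch {
    // Legacy 32-bit fields, kept so readers that ignore the version
    // number still find something at the old offsets.
    uint32_t size_deprecated{0};
    uint32_t keys_deprecated{0};
    // New 64-bit fields that can represent values above 4 GiB.
    uint64_t size{0};
    uint64_t keys{0};
};

constexpr uint64_t GiB = 1024ULL * 1024 * 1024;

// True if `value` cannot be represented in an unsigned 32-bit field.
constexpr bool overflows_u32(uint64_t value) {
    return value > std::numeric_limits<uint32_t>::max();
}

// What an unsigned 32-bit field ends up holding after a narrowing store
// (the value modulo 2^32).
constexpr uint32_t truncate_to_u32(uint64_t value) {
    return static_cast<uint32_t>(value);
}
```

With these helpers, `overflows_u32(5 * GiB)` is true and `truncate_to_u32(5 * GiB)` wraps around to exactly 1 GiB, which is the kind of silently wrong size a 32-bit footer field would record.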

Fixes #7647

Additionally, a more stringent footer version check is introduced that requires the version to be equal to the current one (it is better to rebuild even if the version is greater, as the index is probably incompatible anyway).
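A rough sketch of what such an equality-only check could look like. The names `current_footer_version`, `footer_check`, and `check_footer_version`, as well as the value `2`, are assumptions made for illustration and are not taken from the Redpanda source.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical current version; the real value lives in the storage code.
constexpr int8_t current_footer_version = 2;

enum class footer_check { ok, needs_rebuild };

// Accept only an exact version match. Both older and newer versions
// result in a rebuild request instead of an attempt to parse the footer.
constexpr footer_check check_footer_version(int8_t on_disk_version) {
    return on_disk_version == current_footer_version
             ? footer_check::ok
             : footer_check::needs_rebuild;
}
```

Under these assumptions, versions 1 and 3 both map to `footer_check::needs_rebuild`, while version 2 is accepted.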

Backports Required

v22.3.x
v22.2.x
v22.1.x

Release Notes

Bug Fixes

Fix integer overflow in compaction index footer that could lead to hangs when trying to compact large segments.

If the index is of different version than the current one, request a rebuild. It is easier than dealing with compatibility issues.

mmaslankaprv · 2022-12-13T11:15:04Z

src/v/storage/compacted_index_chunk_reader.cc

+    ss::temporary_buffer<char> tmp = co_await in.read_exactly(
+      compacted_index::footer_size);


nit: can we use read_iobuf_exactly ?

jcsp · 2022-12-16T15:25:31Z

src/v/storage/compacted_index_chunk_reader.cc


-    if (tmp.size() != compacted_index::footer_size) {
+    if (buf.size_bytes() != compacted_index::footer_size) {
        throw std::runtime_error(fmt::format(
          "could not read enough bytes to parse "


Is this the case we hit when loading an old cluster's compaction index? If so, maybe we should make the message a bit clearer to indicate that this isn't totally unexpected.

It is kind of a catch-all, but adding a note about a likely cause is a good idea.

src/v/storage/compacted_index.h

emaxerrno · 2022-12-16T16:23:01Z

@ztlpn thanks so much for this. i did a quick git blame on it ... and the original thinking was that segments would be 4G max, at some point along the way i forgot to add the assert. At the time of writing this, we were actually capping segments at 2G, so I figured why not 2X. it's all more bits now :) thanks for the fix.

jcsp · 2022-12-16T16:31:53Z

Just to check we're thinking the same thing -- the regeneration of v1 indices is expected to happen on-demand during compaction, right? So we won't be doing some kind of mass rewrite when we start up...

Since the default max compacted segment size is 5 GiB, the compacted index size for large segments can overflow 32-bit ints. This commit replaces the size and the number of keys in the footer with 64-bit ints. The old 32-bit fields are kept for backwards compatibility with code that doesn't check the version number.

Fixes redpanda-data#7647

ztlpn · 2022-12-16T17:42:10Z

Just to check we're thinking the same thing -- the regeneration of v1 indices is expected to happen on-demand during compaction, right? So we won't be doing some kind of mass rewrite when we start up...

Good point, I did a quick code audit, looks like we only read compaction index during actual compaction.

ztlpn · 2022-12-16T17:54:21Z

test failure is #7816

jcsp · 2022-12-16T19:12:14Z

@ztlpn before we merge a backport + release this to the field, perhaps it is worth adding a ducktape test in a followup PR that ensures this works end to end across the upgrade? (including the rewrites that we expect to happen via exceptions)

ztlpn · 2022-12-16T19:13:11Z

sure, will do

Clamp max compacted segment size to 1.5 GiB to avoid compaction index size overflow.

Fixes redpanda-data#7647

This is a workaround for older versions; the proper fix is redpanda-data#7687.
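The clamping workaround can be pictured with a short sketch. The function name `clamp_max_compacted_segment_size` and the constant `max_safe_compacted_segment_size` are hypothetical; only the 1.5 GiB limit comes from the commit message above.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>

constexpr uint64_t MiB = 1024ULL * 1024;

// 1.5 GiB, the limit named in the workaround commit message.
constexpr uint64_t max_safe_compacted_segment_size = 1536 * MiB;

// Cap the configured value; smaller configured values pass through unchanged.
constexpr uint64_t clamp_max_compacted_segment_size(uint64_t configured) {
    return std::min(configured, max_safe_compacted_segment_size);
}
```

With this sketch, a configured size of 5 GiB is reduced to 1.5 GiB, while a configured size of 1 GiB is left as is.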

github-actions bot added the area/redpanda label Dec 10, 2022

ztlpn force-pushed the fix-7647-compaction-index-overflow branch 2 times, most recently from e2af2cb to fd4bcc9, December 12, 2022 11:26

ztlpn added 4 commits December 12, 2022 16:11

s/compacted_index_chunk_reader: coroutinize load_footer

8861a09

storage: better warning message

e7fa937

storage: better error message

21b7337

s/compacted_index: more stringent footer version check

f526fbc

If the index is of different version than the current one, request a rebuild. It is easier than dealing with compatibility issues.

ztlpn force-pushed the fix-7647-compaction-index-overflow branch from fd4bcc9 to 3ad52b8, December 12, 2022 23:14

ztlpn requested review from mmaslankaprv and jcsp December 12, 2022 23:20

ztlpn marked this pull request as ready for review December 12, 2022 23:20

mmaslankaprv reviewed Dec 13, 2022


jcsp reviewed Dec 16, 2022


src/v/storage/compacted_index.h

jcsp previously approved these changes Dec 16, 2022


ztlpn added 3 commits December 16, 2022 19:35

s/compacted_index: index format compatibility tests

7b6df9f

s/compacted_index: use read_iobuf_exactly to read footer

3933d29

ztlpn dismissed jcsp’s stale review via 3933d29 December 16, 2022 16:35

ztlpn force-pushed the fix-7647-compaction-index-overflow branch from 9700e4d to 3933d29, December 16, 2022 16:35

jcsp mentioned this pull request Dec 16, 2022

tests: fix upgrade tests not to skip versions #7310

Closed

ztlpn requested review from mmaslankaprv and jcsp December 16, 2022 18:31

jcsp approved these changes Dec 16, 2022


ztlpn merged commit 4c6e2d6 into redpanda-data:dev Dec 16, 2022

ztlpn mentioned this pull request Feb 1, 2023

Add parsing of v1 compaction index footers #8555

Merged


ztlpn mentioned this pull request Feb 8, 2023

[v22.3.x] storage: clamp max_compacted_segment_size to 1.5 GiB #8708

Merged


ztlpn deleted the fix-7647-compaction-index-overflow branch November 27, 2023 13:25
