Track the amount of read data per row #2368

jacek-lewandowski · 2023-05-26T07:30:24Z

If an sstable is corrupted in a nasty way, we may read invalid cell sizes and try to read much more data for a row than we should. In rare scenarios this can lead even to OOMs.

This simple fix adds tracking and limiting the amount of data that is read per row. Row has its size stored in preamble which can be used as a limit. If the deserialization code tries to read more than that, it will simply fail with EOF which will prevent more serious problems later.

Thanks for sending a pull request! Here are some tips if you're new here:

Ensure you have added or run the appropriate tests for your PR.
Be sure to keep the PR description updated to reflect all changes.
Write your PR title to summarize what this PR proposes.
If possible, provide a concise example to reproduce the issue for a faster review.
Read our contributor guidelines
If you're making a documentation change, see our guide to documentation contribution

Commit messages should follow the following format:

<One sentence description, usually Jira title or CHANGES.txt summary>

<Optional lengthier description (context on patch)>

patch by <Authors>; reviewed by <Reviewers> for CASSANDRA-#####

Co-authored-by: Name1 <email1>
Co-authored-by: Name2 <email2>

The Cassandra Jira

bereng · 2023-05-26T07:39:17Z

src/java/org/apache/cassandra/io/util/TrackedDataInputPlus.java

Do you want to use TypeSizes instead here?

well, I don't think so, that would require also refactoring that in bytesRead updates

I think it is more clear to use the size number 1/2/4/8 ,which make the code more readable as the code below just add these numbers

Not following. Why would you need to refactor anything? You guys think 1 is more readable than TypeSizes.BOOL_SIZE and on top of that you're hardcoding a value? It's like magic numbers where we have a proper abstraction ready unless I am missing sthg.

checkCanRead(TypeSizes.BOOL_SIZE) reads good in my eyes 🤷

I agree with your point of view. From the perspective of code readability, TypeSizes.BOOL_SIZE is definitely much better than hard code 1.
What I said above means that if the existing code is already written as 1/4/8, our newly added code also fills in 1/4/8, then it just echoes back and forth, so the readability is not bad.
Of course, if Jacek is willing to change the existing numbers to TypeSizes, I definitely support your point of view.

(1)checkCanRead(TypeSizes.BOOL_SIZE); byte b = source.readByte(); bytesRead += TypeSizes.BOOL_SIZE; return b;
(2) checkCanRead(1); byte b = source.readByte(); bytesRead += 1; return b;
(3)checkCanRead(TypeSizes.BOOL_SIZE); byte b = source.readByte(); bytesRead += 1; return b;
I think the code itself should explain its behavior.
just from my point of view, this order is (1) > (2) > (3)
and it seems Jacek has already resolved this issue.

bereng · 2023-05-26T07:42:32Z

test/unit/org/apache/cassandra/db/rows/UnfilteredSerializerTest.java

Missing new line?

bereng · 2023-05-26T07:48:54Z

We need CI runs and will you port this to 4.0 and 4.1 also?

jacek-lewandowski · 2023-05-26T07:58:10Z

@bereng thanks for super quick response. I'll provide patches for older branches but probably it will be next week

Maxwell-Guo · 2023-05-26T09:32:41Z

src/java/org/apache/cassandra/io/util/TrackedDataInputPlus.java

will limit just be zero？

I just test some times,it seems limit can not be 0 ~~~

Yes, we always encode at least a flag

Maxwell-Guo · 2023-05-26T09:37:29Z

test/unit/org/apache/cassandra/db/rows/UnfilteredSerializerTest.java

I think a test for some corner case is needed such as what I said when limit is 0？

No need for this

jacek-lewandowski · 2023-06-09T06:11:30Z

@bereng is it ok now?

Maxwell-Guo · 2023-06-09T06:39:27Z

test/unit/org/apache/cassandra/db/rows/UnfilteredSerializerTest.java

No need for this

bereng · 2023-06-09T07:26:25Z

Approach looks good to me. CI and PRs for the rest of the versions are missing iiuc.

jacek-lewandowski · 2023-06-09T07:37:53Z

@bereng there was no point in running CI before code review, same for other PRs

bereng · 2023-06-09T08:07:12Z

Ha you're brave. I always run CI before...

If an sstable is corrupted in a nasty way, we may read invalid cell sizes and try to read much more data for a row than we should. In rare scenarios this can lead even to OOMs. This simple fix adds tracking and limiting the amount of data that is read per row. Row has its size stored in preamble which can be used as a limit. If the deserialization code tries to read more than that, it will simply fail with EOF which will prevent more serious problems later. Patch by Jacek Lewandowski; reviewed by Berenguer Blasi and Maxwell Guo for CASSANDRA-18513

jacek-lewandowski · 2023-06-09T09:06:21Z

https://app.circleci.com/pipelines/github/jacek-lewandowski/cassandra/720/workflows/7495bccf-c7f8-47db-a145-e0e5f6e508c2

bereng · 2023-06-12T04:52:45Z

^That is j11 only, j8 seems to not have been triggered?

jacek-lewandowski · 2023-06-12T06:13:21Z

https://app.circleci.com/pipelines/github/jacek-lewandowski/cassandra/720/workflows/f6fd1707-5cd1-4680-b234-0a79939fb4d2

bereng reviewed May 26, 2023

View reviewed changes

Maxwell-Guo reviewed May 26, 2023

View reviewed changes

Maxwell-Guo approved these changes Jun 9, 2023

View reviewed changes

test/unit/org/apache/cassandra/db/rows/UnfilteredSerializerTest.java Outdated

Copy link

Contributor

Maxwell-Guo Jun 9, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No need for this

jacek-lewandowski force-pushed the CASSANDRA-18513 branch from 7400fc5 to 02b188a Compare June 9, 2023 08:27

DO NOT MERGE - CircleCI config

7f6cd92

belliottsmith force-pushed the trunk branch 2 times, most recently from df3eb40 to 54e39a9 Compare July 23, 2025 11:19

Track the amount of read data per row #2368

Are you sure you want to change the base?

Track the amount of read data per row #2368

Uh oh!

Conversation

jacek-lewandowski commented May 26, 2023

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Maxwell-Guo May 26, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

bereng May 30, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Maxwell-Guo Jun 9, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

bereng commented May 26, 2023

Uh oh!

jacek-lewandowski commented May 26, 2023

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jacek-lewandowski commented Jun 9, 2023

Uh oh!

Choose a reason for hiding this comment

Uh oh!

bereng commented Jun 9, 2023

Uh oh!

jacek-lewandowski commented Jun 9, 2023

Uh oh!

bereng commented Jun 9, 2023

Uh oh!

jacek-lewandowski commented Jun 9, 2023

Uh oh!

bereng commented Jun 12, 2023

Uh oh!

jacek-lewandowski commented Jun 12, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Maxwell-Guo May 26, 2023 •

edited

Loading

bereng May 30, 2023 •

edited

Loading

Maxwell-Guo Jun 9, 2023 •

edited

Loading