Scavenge: Handle events that have been deleted from (now empty) chunks but not from the index #3813

shaan1337 · 2023-04-12T09:02:30Z

Fixed: Handle events that have been deleted from (now empty) chunks but not from the index

Previously this would result in the scavenge stopping with an error like "Could not get TimeStamp range for chunk "

But this is a possible state if the old scavenger had scavenged the chunks but not the index. This fix correctly handles this scenario so that the scavenge process can continue

shaan1337 · 2023-04-14T07:19:11Z

please hold on the review - i'm improving the test cases

src/EventStore.Core/TransactionLog/Scavenging/Data/DiscardDecision.cs

timothycoleman · 2023-04-18T10:49:31Z

great, nice work 👍 squash the changes and i'll approve

timothycoleman · 2023-04-18T10:51:17Z

for posterity, shaan and i identified that the scenario where the if (allDiscardedSoFar) check returns false is quite narrow:

we have an event E in the index that is not in the chunk
the logical chunk that it would have been in, in fact, has no prepares left in it at all
there is an earlier event for the same stream, the stream has a maxage, and we MaybeDiscard it in the calculator
but we do, in fact, want to keep that previous event D based on the current metadata (which means that E was removed in error or because of an earlier metadata)

the net effect is that in this scenario, if the allDiscardedSoFar check was not present, D would be removed, perhaps to the surprise of the user.

note that a stream in this state is hard to read. in order to read D the read must not also try to include E

if it wasn't for this corner case, then the allDiscardedSoFar variable and the AlreadyDiscarded discard decision would be unnecessary, we could just return Discard (although it would unnecessarily add weight to that chunk)

…from the index Previously this would result in the scavenge stopping with an error like "Could not get TimeStamp range for chunk <chunk number>" But this is a possible state if the old scavenge had scavenged the chunks but not the index. This fix correctly handles this scenario so that the scavenge process can continue

timothycoleman

👍

shaan1337 · 2023-04-19T08:23:45Z

great, nice work +1 squash the changes and i'll approve

thanks for the thorough review!

timothycoleman · 2023-04-21T08:02:42Z

great, nice work +1 squash the changes and i'll approve

thanks for the thorough review!

thanks for the careful code!

github-actions

@hayley-jean 👉 Created pull request targeting release/oss-v22.10: #3821

shaan1337 force-pushed the new-scavenger-fix branch 2 times, most recently from caab6f9 to 9e1b740 Compare April 13, 2023 11:06

shaan1337 marked this pull request as ready for review April 13, 2023 11:15

hayley-jean requested a review from timothycoleman April 13, 2023 11:17

hayley-jean added the cherry-pick:release/oss-v22.10 label Apr 13, 2023

hayley-jean self-requested a review April 13, 2023 11:19

shaan1337 marked this pull request as draft April 14, 2023 05:34

shaan1337 force-pushed the new-scavenger-fix branch 4 times, most recently from 6ca83d0 to 31a1fef Compare April 17, 2023 09:01

shaan1337 marked this pull request as ready for review April 17, 2023 09:42

timothycoleman reviewed Apr 17, 2023

View reviewed changes

src/EventStore.Core/TransactionLog/Scavenging/Data/DiscardDecision.cs Outdated Show resolved Hide resolved

shaan1337 force-pushed the new-scavenger-fix branch from 3ea99e7 to f257bcd Compare April 18, 2023 10:58

timothycoleman approved these changes Apr 18, 2023

View reviewed changes

timothycoleman changed the title ~~Handle events that have been deleted from (now empty) chunks but not from the index~~ Scavenge: Handle events that have been deleted from (now empty) chunks but not from the index Apr 19, 2023

hayley-jean approved these changes Apr 21, 2023

View reviewed changes

hayley-jean merged commit d7d8ea5 into master Apr 21, 2023

hayley-jean deleted the new-scavenger-fix branch April 21, 2023 13:07

github-actions bot mentioned this pull request Apr 21, 2023

[release/oss-v22.10] Scavenge: Handle events that have been deleted from (now empty) chunks but not from the index #3821

Merged

github-actions bot reviewed Apr 21, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Scavenge: Handle events that have been deleted from (now empty) chunks but not from the index #3813

Scavenge: Handle events that have been deleted from (now empty) chunks but not from the index #3813

shaan1337 commented Apr 12, 2023 •

edited

Loading

shaan1337 commented Apr 14, 2023

timothycoleman commented Apr 18, 2023

timothycoleman commented Apr 18, 2023 •

edited

Loading

timothycoleman left a comment

shaan1337 commented Apr 19, 2023

timothycoleman commented Apr 21, 2023

github-actions bot left a comment

Scavenge: Handle events that have been deleted from (now empty) chunks but not from the index #3813

Scavenge: Handle events that have been deleted from (now empty) chunks but not from the index #3813

Conversation

shaan1337 commented Apr 12, 2023 • edited Loading

shaan1337 commented Apr 14, 2023

timothycoleman commented Apr 18, 2023

timothycoleman commented Apr 18, 2023 • edited Loading

timothycoleman left a comment

Choose a reason for hiding this comment

shaan1337 commented Apr 19, 2023

timothycoleman commented Apr 21, 2023

github-actions bot left a comment

Choose a reason for hiding this comment

shaan1337 commented Apr 12, 2023 •

edited

Loading

timothycoleman commented Apr 18, 2023 •

edited

Loading