Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

cloud_storage: The remote partition reader can get stuck #17788

Closed
Lazin opened this issue Apr 11, 2024 · 0 comments · Fixed by #17805
Closed

cloud_storage: The remote partition reader can get stuck #17788

Lazin opened this issue Apr 11, 2024 · 0 comments · Fixed by #17805
Assignees
Labels
area/cloud-storage Shadow indexing subsystem kind/bug Something isn't working

Comments

@Lazin
Copy link
Contributor

Lazin commented Apr 11, 2024

Version & Environment

Redpanda version: (use rpk version): dev

What went wrong?

In case if LSO is low (inside the TS range) the read_some method can get stuck by returning an empty record batch set.
The internal read loop of the reader goes into an infinite loop in this case. In order for the problem to reproduce we need to have some transactional batches in the log. This batches has to be skipped to proceed but we fail to skip them correctly because our LSO does not allow this.

It's not clear why this could happen because when we're reading near the LSO we're supposed to read from the local storage. Probably, we're start reading from the offset which is only available in the cloud storage.

What should have happened instead?

The reader shouldn't get stuck.

How to reproduce the issue?

N/A

Additional information

N/A

JIRA Link: CORE-2321

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/cloud-storage Shadow indexing subsystem kind/bug Something isn't working
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant