Fix the bug that BaseMapPartition may not read data sequentially #79

Closed
Tracked by #78
TanYuxin-tyx opened this issue Aug 22, 2022 · 0 comments

Comments

@TanYuxin-tyx
Collaborator

When reading data from MapPartition files, it is common for some subpartitions to be requested before others, so their region indexes are ahead of the rest. If all region data of a subpartition can be read in one round, those subpartition readers will always stay ahead of the others, which causes random IO. This patch fixes the problem by polling one subpartition reader at a time.
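
To illustrate the idea, here is a minimal sketch in Python. It is not the actual BaseMapPartition code. It assumes a simplified file layout in which region `r` of subpartition `s` sits at offset `r * num_subpartitions + s`, and every function name in it is hypothetical. It compares draining each reader completely in turn against polling one reader at a time from a queue ordered by file offset.

```python
import heapq


def region_offset(region, subpartition, num_subpartitions):
    # Assumed layout: regions are stored one after another, and inside
    # each region the subpartitions are stored in index order.
    return region * num_subpartitions + subpartition


def read_greedy(start_regions, num_regions):
    # Each reader drains all of its remaining regions before the next
    # reader gets a turn, so the file position jumps back and forth.
    n = len(start_regions)
    offsets = []
    for sub, start in enumerate(start_regions):
        for region in range(start, num_regions):
            offsets.append(region_offset(region, sub, n))
    return offsets


def read_one_at_a_time(start_regions, num_regions):
    # Poll the reader with the smallest next offset, read one region,
    # then put the reader back in the queue at its new position.
    n = len(start_regions)
    heap = [
        (region_offset(region, sub, n), region, sub)
        for sub, region in enumerate(start_regions)
        if region < num_regions
    ]
    heapq.heapify(heap)
    offsets = []
    while heap:
        offset, region, sub = heapq.heappop(heap)
        offsets.append(offset)
        if region + 1 < num_regions:
            next_offset = region_offset(region + 1, sub, n)
            heapq.heappush(heap, (next_offset, region + 1, sub))
    return offsets


def count_backward_seeks(offsets):
    # A backward seek is any read that starts before the previous one.
    return sum(1 for prev, cur in zip(offsets, offsets[1:]) if cur < prev)
```

With three readers starting at regions `[0, 1, 0]` and three regions in total, the greedy order under this layout is `0, 3, 6, 4, 7, 2, 5, 8`, which contains two backward seeks. Polling one reader at a time yields `0, 2, 3, 4, 5, 6, 7, 8`, which is strictly increasing.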
