Open
Labels
enhancement: New feature or request
Description
- Move the lock from the WholeStageResultIterator constructor to the shuffle reader, so that threads can prepare the first batch in advance. That batch is currently small; for the GPU, the reader will produce one large batch of about 1 GB.
- Implement decompression, conversion of the buffer to a cudf table, and batch resizing on the GPU.
- The threads can then prepare the read buffer from the file.
- The next thread to be notified needs to fetch more data than one batch size and prepare batches for the GPU to process. It reserves the bytes from the pool; if the reservation succeeds, CPU threads fetch data in the background and save the batches in a vector.
- The waiting CPU threads may perform the decompression and batch resizing, then wait for the GPU to fetch the result.
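
The reserve-then-prefetch step in the list above can be sketched roughly as follows. This is only an illustration in Python, not Gluten or Velox code: `BytePool`, `Prefetcher`, and `consume_all` are hypothetical names, plain `bytes` objects stand in for shuffle blocks, and the "GPU" is simply the consuming thread. It assumes every batch is smaller than the pool capacity and that the consumer releases a batch before asking for the next one.

```python
import threading
from collections import deque


class BytePool:
    """Hypothetical fixed-capacity byte pool; try_reserve fails instead of blocking."""

    def __init__(self, capacity):
        self._capacity = capacity
        self._used = 0
        self._lock = threading.Lock()

    def try_reserve(self, nbytes):
        with self._lock:
            if self._used + nbytes > self._capacity:
                return False
            self._used += nbytes
            return True

    def release(self, nbytes):
        with self._lock:
            self._used -= nbytes

    @property
    def used(self):
        with self._lock:
            return self._used


class Prefetcher:
    """CPU-side producer: fetches batches in the background while the pool has room."""

    def __init__(self, source, pool):
        self._source = iter(source)  # yields bytes objects (stand-ins for shuffle blocks)
        self._pool = pool
        self._ready = deque()        # prepared batches waiting for the consumer
        self._cond = threading.Condition()
        self._done = False
        self._thread = threading.Thread(target=self._run, daemon=True)
        self._thread.start()

    def _run(self):
        try:
            for batch in self._source:
                with self._cond:
                    # Only keep the batch once its bytes are reserved;
                    # otherwise wait until the consumer releases memory.
                    while not self._pool.try_reserve(len(batch)):
                        self._cond.wait()
                    self._ready.append(batch)
                    self._cond.notify_all()
        finally:
            with self._cond:
                self._done = True
                self._cond.notify_all()

    def next_batch(self):
        """Block until a prepared batch is available; return None at end of input."""
        with self._cond:
            while not self._ready and not self._done:
                self._cond.wait()
            return self._ready.popleft() if self._ready else None

    def release(self, batch):
        """Return the batch's bytes to the pool and wake the producer."""
        self._pool.release(len(batch))
        with self._cond:
            self._cond.notify_all()


def consume_all(source, capacity):
    """Stand-in for the GPU side: take each prepared batch, then release it."""
    pool = BytePool(capacity)
    prefetcher = Prefetcher(source, pool)
    sizes = []
    while (batch := prefetcher.next_batch()) is not None:
        sizes.append(len(batch))  # placeholder for the real GPU work
        prefetcher.release(batch)
    return sizes, pool.used
```

In this sketch the producer thread never holds more prepared bytes than the pool allows, which mirrors the idea of reserving from the pool before fetching in the background. A real implementation would also need to handle batches larger than the pool and fetch errors.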
Gluten version
None