[GLUTEN-10933][VL] Introduce GPU ShuffleWriterType kGpuHashShuffle#10984
[GLUTEN-10933][VL] Introduce GPU ShuffleWriterType kGpuHashShuffle#10984jinchengchenghh merged 8 commits intoapache:mainfrom
Conversation
|
Run Gluten Clickhouse CI on x86 |
marin-ma
left a comment
There was a problem hiding this comment.
LGTM. Just one more question: how do we currently test gpu_hash shuffle?
|
Run Gluten Clickhouse CI on x86 |
|
Now it is tested on my local environment, because it relies on GPU, I test TPCDS Q95 for previous PR, this is just a refactor, I don't test it. I will add cpp test for GPU shuffle reader. |
|
Run Gluten Clickhouse CI on x86 |
|
Run Gluten Clickhouse CI on x86 |
|
BTW We have the same parameter |
|
@jinchengchenghh I think for shuffle, we still can stick with adding different writer type. Again the shuffle writer type is a match for different shuffle reader deserialisation, and using different writer type it's easy to distinguish them from avoiding introducing params like "isRssShuffle", "isGpuShuffle", etc. Moreover, |
|
Run Gluten Clickhouse CI on x86 |
|
I find I also need to update shuffle writer, but shuffleWriterType does not send to native, do you have any suggestion? @marin-ma |
|
@jinchengchenghh Do you have the code for the changes in shuffle writer? I don't see any modifications to the existing shuffle writer. |
|
In the followup PR, Velox bool is bit, but cudf is BOOL8, it is byte |
|
Run Gluten Clickhouse CI on x86 |
|
@jinchengchenghh Let's discuss and address that in your next pr. Based on the discussion offline seems like there are many discrepancies between cudf and the current hash shuffle writer implementation. |
|
Run Gluten Clickhouse CI on x86 |
|
Run Gluten Clickhouse CI on x86 |
Related issue: #10933