Skip to content

Conversation

@clementval
Copy link
Contributor

Set block size x and y to 1024 if the given value is higher. Set block z to 64 if the given value is higher.

@clementval clementval requested a review from wangzpgi October 20, 2025 21:10
@wangzpgi
Copy link
Contributor

Can we have test for this?

@clementval
Copy link
Contributor Author

Can we have test for this?

Not really. We cannot execute kernel in the current configuration

@clementval clementval merged commit 803883c into llvm:main Oct 20, 2025
11 checks passed
@clementval clementval deleted the cuf_bloxk_size branch October 20, 2025 21:20
clementval added a commit that referenced this pull request Oct 21, 2025
clementval added a commit that referenced this pull request Oct 21, 2025
Reverts #164321

Align behavior with other CUDA Compiler
llvm-sync bot pushed a commit to arm/arm-toolchain that referenced this pull request Oct 21, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants