dynamic shared memory trait #993

Open
psychocoderHPC opened this issue May 4, 2020 · 1 comment

Comments

@psychocoderHPC
Member

The trait BlockSharedMemDynSizeBytes is designed with a block-local view in mind: only the number of threads per block and the number of elements per thread are passed to the trait (ref).
If you want to implement something like a global prefix sum, it could be helpful to know how many blocks are involved.
The example does not run (syntax issues), but it shows a general way to implement it.

IMO there is no reason to limit the trait's knowledge to information about a single block, even if shared memory cannot be accessed by other blocks.

My suggestion: pass the workDiv to the trait instead of the thread extents and number of elements per thread.
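
A minimal sketch of what this suggestion could look like. It is not alpaka code: `WorkDiv1D`, `PrefixSumKernel`, the single-parameter trait template, and the sizing formula are all hypothetical stand-ins chosen for illustration. The only point is that a trait receiving the whole work division can also see the number of blocks in the grid.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical stand-in for a 1D work division (not alpaka's actual type).
struct WorkDiv1D
{
    std::size_t gridBlockExtent;   // number of blocks in the grid
    std::size_t blockThreadExtent; // number of threads per block
    std::size_t threadElemExtent;  // number of elements per thread
};

// Hypothetical kernel type, e.g. one pass of a global prefix sum.
struct PrefixSumKernel
{
};

// Simplified trait: the primary template is only declared, each kernel
// specializes it. The real trait may take further template parameters.
template<typename TKernel>
struct BlockSharedMemDynSizeBytes;

template<>
struct BlockSharedMemDynSizeBytes<PrefixSumKernel>
{
    // Receives the full work division instead of only the block thread
    // extent and the thread element extent.
    static constexpr std::size_t getBlockSharedMemDynSizeBytes(
        PrefixSumKernel const& /*kernel*/,
        WorkDiv1D const& workDiv)
    {
        // Illustrative sizing only: one int per element handled by the
        // block, plus one int per block in the grid.
        return (workDiv.blockThreadExtent * workDiv.threadElemExtent
                + workDiv.gridBlockExtent)
            * sizeof(int);
    }
};
```

With the block-local signature, the `gridBlockExtent` term above could not be computed inside the trait and would have to reach it some other way, for example through additional kernel arguments.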

@sbastrakov
Member

I like passing the workDiv.
