Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

spatialdateupdate kernel_*_integration requires too many registers #6

Closed
moritzaugustin opened this issue Oct 29, 2015 · 1 comment
Closed
Assignees

Comments

@moritzaugustin
Copy link
Contributor

Using the example compartmental/rall.py in cuda_standalone mode on my GT 610 GPU the kernel execution fails due to CUDA error (during integration): too many resources requested for launch (when calling the *_integration kernel).
The problems seems to be the kernel register usage and the resolution is to remove (at least two) unused pointers from the kernel argument list. However, as the latter is generated in cuda_standalone/device.py and we either have to reduce the items in uses_variables within the respective template (and explicitly declare variables in kernels) or otherwise we need to have a way of explicitly excluding variables from the list %DEVICE_PARAMETERS% (and for %KERNEL_VARIABLES% similarily).

related with iss #4

@denisalevi
Copy link
Member

See #60 for how to check the maximum number of threads that do not exceed a kernels resources.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants