Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[RFE] EC/CUDA: using cuStreamMemOps #672

Open
marsaev opened this issue Nov 2, 2022 · 0 comments
Open

[RFE] EC/CUDA: using cuStreamMemOps #672

marsaev opened this issue Nov 2, 2022 · 0 comments

Comments

@marsaev
Copy link

marsaev commented Nov 2, 2022

Since CUDA 11.7 new cudaStreamOps_v2 introduced which doesn't require kernel module parameter to be set (https://docs.nvidia.com/cuda/cuda-driver-api/group__CUDA__MEMOP.html#group__CUDA__MEMOP). This lower requirements on using these APIs.
As far as i understand this will allow lower number of kernels launched (i.e. executor_start or executor_shutdown_ack).

Also, as far as i understand from sources, STREAM_TASK_MODE=driver is not really fully supported, so it would be great to see those API used is some way or another.

Thanks,

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant