Skip to content

[SYSTEMDS-3362] CUDA code gen stream bugfix#1600

Closed
corepointer wants to merge 3 commits into
apache:mainfrom
corepointer:3362-cuda_stream_bugfix
Closed

[SYSTEMDS-3362] CUDA code gen stream bugfix#1600
corepointer wants to merge 3 commits into
apache:mainfrom
corepointer:3362-cuda_stream_bugfix

Conversation

@corepointer
Copy link
Copy Markdown
Contributor

No description provided.

@corepointer corepointer changed the title 3362 cuda stream bugfix [SYSTEMDS-3362] CUDA code gen stream bugfix Apr 28, 2022
This change pulls in the latest commits from upstream jitify. This includes a local fix of an ignored PR that was eventually fixed by someone else.
The CUDA code generation launcher handles streams per operator at the moment. This is wrong since a read before write can happen on a certain device allocation. Switching to a central stream object for now.

Closes apache#1600
Code gen native support compiled on Ubuntu 20 LTS/CUDA-10.2
@corepointer corepointer force-pushed the 3362-cuda_stream_bugfix branch from b0f6a40 to dc3f857 Compare May 9, 2022 14:15
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant