-
Notifications
You must be signed in to change notification settings - Fork 556
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[cuda] Port over allocator and buffer implementation #13985
Conversation
8d35942
to
6c237e7
Compare
@benvanik: The first commit is just copying code. My modification happens in the second commit. I figured this way it might make the review easiser. |
Nice! If you could, I'd recommend taking #13440 as the base instead - that has some non-trivial changes that will be harder to port on top of your changes here without just deleting these files. You could replace your base commit doing the copy with the files from the PR and then try rebasing your second commit on top of it (or just find/replace again). That PR is ready to land if we disabled the two tests suites testing graphs - the followup that hacks the memsets will happen tonight but neither should block you taking a copy of them. |
SG! Let me include the new changes you made there then. |
6c237e7
to
311fe45
Compare
This commit ports over existing CUDA driver allocator and buffer implementation. The main logic is kept as-is, with one noticeable changes--context wrapper is dropped and fields in it are directly put in various API calls. This is to make supporting multiple device and stream easier later. Other changes are just polishing on comments and errors.
311fe45
to
1148f63
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
style nits but otherwise lgtm!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
style nits but otherwise lgtm!
This commit ports over existing CUDA driver allocator and buffer implementation. The main logic is kept as-is, with one noticeable changes--context wrapper is dropped and fields in it are directly put in various API calls. This is to make supporting multiple device and stream easier later. Other changes are just polishing on comments and errors. Progress towards iree-org#13245
This commit ports over existing CUDA driver allocator and buffer implementation. The main logic is kept as-is, with one noticeable changes--context wrapper is dropped and fields in it are directly put in various API calls. This is to make supporting multiple device and stream easier later. Other changes are just polishing on comments and errors. Progress towards iree-org#13245
This commit ports over existing CUDA driver allocator and buffer
implementation. The main logic is kept as-is, with one noticeable
changes--context wrapper is dropped and fields in it are directly
put in various API calls. This is to make supporting multiple
device and stream easier later. Other changes are just polishing
on comments and errors.
Progress towards #13245