-
Notifications
You must be signed in to change notification settings - Fork 551
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
RFC: AMD ROCm support with plugin #1519
Closed
Closed
Changes from 1 commit
Commits
Show all changes
29 commits
Select commit
Hold shift + click to select a range
51efa5c
Revert "Allow systemcfg proc file to be dumped"
rajbhar fbfa556
criu/parse: Treat some unsupported VMAs as regular
rajbhar 7146313
criu/plugin: Initialize AMD KFD header
rajbhar 4028ddc
criu/files-reg: Add offset and file path plugin
rajbhar 311aee4
criu/plugin: Support AMD ROCm Checkpoint Restore with KFD
rajbhar 4c1b8f3
criu/plugin: Optimize the proto image size
rajbhar f55fe48
criu/plugin: optimization for large bar read
rajbhar ca99c15
criu/restore: Introduce restore late stage hook
rajbhar ffeb86b
criu/plugin: Implement restore late hook for kfd
rajbhar 0a6771f
criu/plugin: Add support for dumping and restoring queues
dayatsin-amd c08db4a
criu/plugin: dump debug logs selectively
rajbhar 24a6761
criu/plugin: Support larger memory footprints
dayatsin-amd 0c32304
criu/plugin: Dump and restore events
dayatsin-amd 9ff8973
criu/plugin: Add initial documentation for ROCm support.
rajbhar e4819aa
criu/plugin: Re-adjust doorbell offset for queues
dayatsin-amd 1199c20
criu/plugin: Pytorch container with criu
rajbhar 95c9258
criu/plugin: Dockerfile for AMD criu repo
rajbhar 8268b61
criu/files: *RFC* Don't cache fd for amdgpu devices
rajbhar d83ddd5
criu/plugin: Add whitepaper document
fxkamd 274aabd
criu/plugin: Add build options for amdgpu plugin
rajbhar 84135f4
criu/plugin: Implement system topology parsing
dayatsin-amd 16778cc
criu/plugin: Remap GPUs on checkpoint restore
dayatsin-amd 98eddc9
criu/plugin: Add parameters to override mapping
dayatsin-amd c87fdf5
criu/plugin: Add unit tests for GPU remapping
dayatsin-amd ee928e1
criu/plugin: Read and write BO contents in parallel
dayatsin-amd d838942
criu/plugin: Restore libhsakmt shared memory files
dayatsin-amd a5df3ad
criu/plugin: fix build warnings
rajbhar f81a453
script/builds: add build dependepncy for libdrm
rajbhar 4f864a1
Merge branch 'criu-dev' into criu-dev
rajbhar File filter
Filter by extension
Conversations
Failed to load comments.
Jump to
Jump to file
Failed to load files.
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Could you please apply these changes in the "criu/plugin: Support AMD ROCm Checkpoint Restore with KFD" commit, which introduced
amdgpu_plugin_topology.c
?This could be done, for example, with
git rebase -i 311aee4ff^
, then changepick
toedit
on the first line and save.After the changes have been applied you can use
git commit -a --ammend
andgit rebase --continue
.There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I am not sure If I understood you here. I made the change to the CI docker scripts but the tests were not triggered. I am not sure if maintainers need to manually trigger those.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Currently CRIU would fail to compile if
libdrm
is not installed. Instead, whenlibdrm
is not installed it should skip buildingamdgpu_plugin
The CI tests are triggered on git push, for example, you can see in the Alpine test logs that
libdrm-dev
has been installed: