Transparent suspend/resume runtime enabling preemptible GPU workloads via memory snapshotting, UVM paging, and execution state orchestration.
kubernetes hpc cuda slurm uvm checkpointing gpu-runtime gpu-scheduling unified-virtual-memory gpu-preemption
-
Updated
Feb 19, 2026 - Python