Skip to content

docs: clarify cuda async execution model#6611

Merged
0ax1 merged 3 commits intodevelopfrom
ad/cuda-async-docs
Feb 20, 2026
Merged

docs: clarify cuda async execution model#6611
0ax1 merged 3 commits intodevelopfrom
ad/cuda-async-docs

Conversation

@0ax1
Copy link
Contributor

@0ax1 0ax1 commented Feb 20, 2026

Summary

Besides adding docs for the execution model, this includes notes on pageable and pinned physical memory. Further, move_to_device is dropped in favor of copy_to_device.

Signed-off-by: Alexander Droste <alexander.droste@protonmail.com>
@0ax1 0ax1 requested a review from joseph-isaacs February 20, 2026 11:21
@0ax1 0ax1 enabled auto-merge (squash) February 20, 2026 11:22
Signed-off-by: Alexander Droste <alexander.droste@protonmail.com>
@0ax1 0ax1 added the changelog/docs A docs change label Feb 20, 2026
@codspeed-hq
Copy link

codspeed-hq bot commented Feb 20, 2026

Merging this PR will improve performance by 10.94%

⚡ 1 improved benchmark
✅ 985 untouched benchmarks
⏩ 1426 skipped benchmarks1

Performance Changes

Mode Benchmark BASE HEAD Efficiency
Simulation map_each[BufferMut<i32>, 128] 887.2 ns 799.7 ns +10.94%

Comparing ad/cuda-async-docs (64e9d97) with develop (ab55ce4)

Open in CodSpeed

Footnotes

  1. 1426 benchmarks were skipped, so the baseline results were used instead. If they were deleted from the codebase, click here and archive them to remove them from the performance reports.

@0ax1 0ax1 merged commit cc20e3d into develop Feb 20, 2026
50 checks passed
@0ax1 0ax1 deleted the ad/cuda-async-docs branch February 20, 2026 11:55
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

changelog/docs A docs change

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants