Skip to content

Conversation

@mag1c-h
Copy link
Contributor

@mag1c-h mag1c-h commented Nov 20, 2025

Description:

This PR presents a unified API for transferring data between device and host memory, offering a straightforward interface while encapsulating device-specific logic.

Implementation:

  • Native API leveraging CopyEngine for transfers.
  • Kernel operations utilizing StreamMultiprocessor.

Examples:

  • C++: ucm/shared/test/case/trans/trans_test.cc
  • Python: ucm/shared/test/example/trans/trans_on_cuda_example.py

@mag1c-h mag1c-h merged commit 3127481 into ModelEngine-Group:develop Nov 20, 2025
3 checks passed
@mag1c-h mag1c-h deleted the dev-ucmtrans branch November 20, 2025 09:27
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants