Skip to content

NvTensorRtRtx dependency on CUDA device name removed #1485

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged

Conversation

BLSharda
Copy link
Contributor

@BLSharda BLSharda commented May 15, 2025

GenAI EP NvTensorRtRtx dependency on using CUDA device removed. This is to align with latest changes in ORT and NV EP.

Also, still requires onnxruntime-genai-cuda.dll but device name is NvTensorRtRtx instead of CUDA. And the memory type is still "Cuda" for ORT API.

NvTensorRtRtx dependency on using ORT CUDA EP removed.
@BLSharda BLSharda changed the title GenAI EP NvTensorRtRtx dependency on ORT CUDA EP removed NvTensorRtRtx dependency on ORT CUDA EP removed May 15, 2025
- Fixing formatting for CICD failure
- Adding final inheritance for CUDA and NvTensorRtRtx EP interfaces impl structs.
- Removing Graph Capture for NvTensorRtRtx EP for now.
@BLSharda BLSharda marked this pull request as ready for review May 15, 2025 13:21
Fixing compilation error for linux builds
@BLSharda BLSharda requested a review from kunal-vaishnavi May 20, 2025 16:08
@BLSharda BLSharda changed the title NvTensorRtRtx dependency on ORT CUDA EP removed NvTensorRtRtx dependency on CUDA device name removed May 21, 2025
@BLSharda
Copy link
Contributor Author

@kunal-vaishnavi and @baijumeswani , can you help review and merge this change?

- Removing cuda graph knob for now until we test it for functional and perf aspect.
@anskumar01
Copy link

@baijumeswani, can you please help with review and merge this change?

@baijumeswani
Copy link
Collaborator

This is to align with latest changes in ORT and NV EP.

Could you elaborate or share the relevant onnxruntime pull-request so we can track the reason for this chance?

@BLSharda
Copy link
Contributor Author

This is to align with latest changes in ORT and NV EP.

Could you elaborate or share the relevant onnxruntime pull-request so we can track the reason for this chance?

In PR 24656 CUDA dependency is removed (USE_CUDA) for NvTensorRtRtx EP so we need to switch device name from "cuda" to "NvTensorRtRtx".

@BLSharda BLSharda requested a review from baijumeswani May 29, 2025 03:43
@baijumeswani baijumeswani merged commit b8904ac into microsoft:main May 29, 2025
14 checks passed
@baijumeswani baijumeswani added the 0.8.1 Patch Release for 0.8.0 label May 29, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
0.8.1 Patch Release for 0.8.0
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants