Skip to content

Conversation

@nv-kmcgill53
Copy link
Contributor

This PR fixes onnxruntime's autocomplete to check the instance(s) kind of the of the model before loading. If no GPU instance groups are found then autocomplete will set the kind to CPU before loading the model.

@nv-kmcgill53 nv-kmcgill53 requested review from CoderHam and rmccorm4 May 6, 2022 19:14
@CoderHam CoderHam requested a review from tanmayv25 May 6, 2022 19:19
@GuanLuo
Copy link
Contributor

GuanLuo commented May 6, 2022

@pranavsharma FYI, this is follow-up fix for #112. Before this fix the auto-complete will set all EPs as the model may contain EP-specific ops, but there is an edge case where model will fail to load if CUDA EP is set while there is no GPU on the system. With this change, the auto-complete will check GPU availability and set EPs conditionally

@nv-kmcgill53 nv-kmcgill53 merged commit 96e1d93 into main May 6, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants