Skip to content
Discussion options

You must be logged in to vote

It is likely due to credential environment propagation during the onboard flow. You can reference the recent fix in fix(inference) prs to address a case where the inference router was not correctly receiving the credential environment variables when the agent was invoked through certain shell paths. The router is intended to be in the call path unconditionally.

To verify whether the router is actually mediating your calls, check the router process logs directly rather than relying on onboard validation passing, since validation only confirms the router responds at startup and not that every subsequent call routes correctly. Make sure the NVIDIA_INFERENCE_CREDENTIAL environmenvt variable i…

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by grandblue-cell
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants