-
Notifications
You must be signed in to change notification settings - Fork 134
Pin TensorRT Version in Stable Diffusion Tutorial #103
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
rmccorm4
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the thorough description 🚀
|
This is a brittle change. If the goal of fixing the tutorial is to grab the latest That way we can pick up any other releases to the 9.2 branch which may be patched. |
|
That's a neat idea Kyle, I hadn't thought of this! I'm wondering though, in the case of a tutorial, is it better to lock it in to a specific version that we've actually tested and verified? For example, we now use |
|
Looking again at this PR, I think I would prefer my As a NIT which doesn't need to be addressed, I would prefer this tutorial to make use of the TRT 10 just because TRT 9 got rolled up into that and we could potentially have a more consistent tutorial base where users don't have to wonder why some are running on a lower major version than the rest. |
|
Hmm you bring up a good point with the deprecation.
We're using the same metric, I think we're just on different sides of the fence 😆. I guess the question is which is more likely:
I think you're right in that the former is more likely. Regarding not using TRT 10, I guess I'm less concerned because tutorials generally stop using the latest and greatest versions of things unless tediously maintained. I'll throw up PR soon and see if can bump to TRT 10: |
Pin TensorRT Version in Stable Diffusion Tutorial
SAs have reported issues with the current state of the the tutorial where, when trying to launch a Triton server with the models built in this tutorial, they encounter the following error:
tritonserver.InvalidArgumentError: load failed for model 'stable_diffusion_xl': version 1 is at UNAVAILABLE state: Internal: AttributeError: 'tensorrt_bindings.tensorrt.ICudaEngine' object has no attribute 'get_binding_dtype'Further investigation discovered that the version of TRT being installed in the generated image was
10.2instead of the intended9.2. There is no9.2.0version, so we select the latest version available in the9.2.Xseries.Confirmed both with the SA and through a tutorial walkthrough that this resolves the issue and enables the server to launch and perform inference successfully.