-
Notifications
You must be signed in to change notification settings - Fork 361
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Added flux demo #3418
base: main
Are you sure you want to change the base?
Added flux demo #3418
Conversation
b2eb297
to
6d36077
Compare
Can the app display the inference time, might be nice to have some stats rendered live as you generate |
48a7c94
to
5a528f1
Compare
361fb76
to
0aeea36
Compare
9964674
to
cfbc9ea
Compare
py/torch_tensorrt/dynamo/runtime/_MutableTorchTensorRTModule.py
Outdated
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Lets separate all the CPU offload API stuff into another PR
Also rebase |
if self.allow_complex_guards_as_runtime_asserts: | ||
return _export( | ||
self.original_model, | ||
self.arg_inputs, | ||
kwargs=self.kwarg_inputs, | ||
dynamic_shapes=self._get_total_dynamic_shapes(), | ||
strict=self.strict, | ||
allow_complex_guards_as_runtime_asserts=self.allow_complex_guards_as_runtime_asserts, | ||
) | ||
else: | ||
return torch.export.export( | ||
self.original_model, | ||
self.arg_inputs, | ||
kwargs=self.kwarg_inputs, | ||
dynamic_shapes=self._get_total_dynamic_shapes(), | ||
strict=self.strict, | ||
) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think let's do a try catch style here where we try torch.export.export first and then _trace.export. I found _export doesn't work with modelopt recently.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Catch style will make the process slower because it needs to try one and the other, right? My thoughts here is have the user decide whether they have to use allow_complex_guards_as_runtime_asserts, which only _export has. If that is absolute necessary, then we cannot use torch.export.export.
…daGraph and Weight streaming
2ad38e3
to
b254581
Compare
Description
Please include a summary of the change and which issue is fixed. Please also include relevant motivation and context. List any dependencies that are required for this change.
Fixes # (issue)
Type of change
Please delete options that are not relevant and/or add your own.
Checklist: