1 parent db53c7a commit b16f954
recipes_source/regional_aot.py
@@ -236,4 +236,6 @@ def aot_compile_load_model(regional=False) -> torch.nn.Module:
 #
 # This recipe shows how to control the cold start time when compiling your
 # model ahead-of-time. This becomes effective when your model has repeated
-# blocks, which is typically seen in large generative models.
+# blocks, which is typically seen in large generative models. We used this
+# recipe on various models to speed up real-time performance. Learn more
+# `here <https://huggingface.co/blog/zerogpu-aoti>`__.