diff --git a/docs/hub/spaces-zerogpu.md b/docs/hub/spaces-zerogpu.md index 5fb31291a..7bae8ac0a 100644 --- a/docs/hub/spaces-zerogpu.md +++ b/docs/hub/spaces-zerogpu.md @@ -88,6 +88,12 @@ This sets the maximum function runtime to 120 seconds. Specifying shorter durati By leveraging ZeroGPU, developers can create more efficient and scalable Spaces, maximizing GPU utilization while minimizing costs. +## Recommendations + +If your demo uses a large model, we recommend using optimizations like ahead-of-time compilation and flash-attention 3. You can learn how to leverage these with +ZeroGPU in [this post](https://huggingface.co/blog/zerogpu-aoti). These optimizations will help you to maximize the advantages of ZeroGPU hours and provide +a better user experience. + ## Feedback You can share your feedback on Spaces ZeroGPU directly on the HF Hub: https://huggingface.co/spaces/zero-gpu-explorers/README/discussions