FastCache: Fast Caching for Diffusion Transformer Through Learnable Linear Approximation [Efficient ML Model]
cache transformers diffusion linear-algorithms efficient-deep-learning efficientml diffusion-transformers
Updated Sep 6, 2025 - Python