Dynamic Resolution Compilation #94

chavinlo · 2023-02-09T01:41:30Z

Hello, is it possible to compile the model for dynamic resolution generation rather than static? Similar to TensorRT's?
I see in the code that the compilation call is made either if the model hasn't been compiled already OR the request is for a different resolution than the already compilated one.

strint · 2023-02-10T06:56:31Z

Compiling a dynamic shape graph is not supported right now. We use some optimization skills which are related to static input shape:

memory static allocation;
constant folding;

Have you met some problems with static shapes?

chavinlo · 2023-02-10T17:25:01Z

Have you met some problems with static shapes?

No, but we are building a service that plans to offer dynamic resolutions.
I was thinking of compiling it for multiple resolutions, but that would take up vram space

chavinlo · 2023-02-10T17:51:05Z

Please let me know if this feature is ever implemented. Thanks

strint · 2023-02-13T02:41:29Z

Please let me know if this feature is ever implemented. Thanks

We provide an offline-compile mode to reduce online-compile time for scenarios where the input shapes of online inference is limited. Hope this will help: https://github.com/Oneflow-Inc/diffusers/wiki/How-to-Run-OneFlow-Stable-Diffusion#optimization-for-multi-resolution-picture

@chavinlo

chavinlo · 2023-02-13T19:12:06Z

Please let me know if this feature is ever implemented. Thanks

We provide an offline-compile mode to reduce online-compile time for scenarios where the input shapes of online inference is limited. Hope this will help: https://github.com/Oneflow-Inc/diffusers/wiki/How-to-Run-OneFlow-Stable-Diffusion#optimization-for-multi-resolution-picture

@chavinlo

Thanks. First I call pipe.enable_graph_share_mem(), then run inference on the resolutions I want, and the graph should be ready for those resolutions right?

pipe.enable_graph_share_mem()

prompt = "a photo of an astronaut riding a horse on mars, red sky, (green sky:1.5)"
with torch.autocast("cuda"):
    images = pipe(prompt, height=1024).images
    images = pipe(prompt, height=768).images
    images = pipe(prompt, height=512).images
    images = pipe(prompt, height=256).images

sorting the input shape from large to small to trigger graph compilation

I see that it takes 1 second to load when it changes resolution:

    images = pipe(prompt, height=256).images
    images = pipe(prompt, height=256).images
    images = pipe(prompt, height=512).images
    images = pipe(prompt, height=512).images
    images = pipe(prompt, height=768).images
    images = pipe(prompt, height=768).images
    images = pipe(prompt, height=1024).images
    images = pipe(prompt, height=1024).images
    images = pipe(prompt, height=768).images
    images = pipe(prompt, height=768).images
    images = pipe(prompt, height=512).images
    images = pipe(prompt, height=512).images
    images = pipe(prompt, height=256).images
    images = pipe(prompt, height=256).images

Is there anyway to speed this up? or maybe mantain all of them active? I don't mind having to use more vram.

strint · 2023-02-14T02:55:24Z

You can run and read this test to be familiar with these two features: https://github.com/Oneflow-Inc/diffusers/blob/oneflow-fork/tests/test_pipelines_oneflow_graph_load.py

@chavinlo

chavinlo closed this as completed Feb 16, 2023

strint added the sig-compiler label Apr 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dynamic Resolution Compilation #94

Dynamic Resolution Compilation #94

chavinlo commented Feb 9, 2023

strint commented Feb 10, 2023

chavinlo commented Feb 10, 2023

chavinlo commented Feb 10, 2023

strint commented Feb 13, 2023 •

edited

Loading

chavinlo commented Feb 13, 2023

strint commented Feb 14, 2023

Dynamic Resolution Compilation #94

Dynamic Resolution Compilation #94

Comments

chavinlo commented Feb 9, 2023

strint commented Feb 10, 2023

chavinlo commented Feb 10, 2023

chavinlo commented Feb 10, 2023

strint commented Feb 13, 2023 • edited Loading

chavinlo commented Feb 13, 2023

strint commented Feb 14, 2023

strint commented Feb 13, 2023 •

edited

Loading