fix AnimateDiff creation with a unet loaded with IP Adapter #7791

fabiorigano · 2024-04-27T16:11:29Z

What does this PR do?

Makes it possible to load a pipeline with an IP Adapter into an AnimateDiff pipeline with from_pipe()

Fixes #7661

Pipelines: @sayakpaul @yiyixuxu @DN6

sayakpaul

Looks very clean!

Could we also see some results?

HuggingFaceDocBuilderDev · 2024-04-27T16:21:44Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

fabiorigano · 2024-04-27T16:27:41Z

hi @sayakpaul thank you

I used YiYi's code to test:

from diffusers import DiffusionPipeline, AnimateDiffPipeline, MotionAdapter, DDIMScheduler
from diffusers.utils import export_to_gif
import torch
from diffusers.utils import load_image

base_repo = "SG161222/Realistic_Vision_V6.0_B1_noVAE"
num_inference_steps = 20
image = load_image("https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/load_neg_embed.png")
prompt="bear eats pizza"
negative_prompt = "wrong white balance, dark, sketches,worst quality,low quality"

pipe_sd = DiffusionPipeline.from_pretrained(base_repo, torch_dtype=torch.float16)
pipe_sd.load_ip_adapter("h94/IP-Adapter", subfolder="models", weight_name="ip-adapter_sd15.bin")
pipe_sd.set_ip_adapter_scale(0.6)   
pipe_sd.to("cuda")

adapter = MotionAdapter.from_pretrained("guoyww/animatediff-motion-adapter-v1-5-2", torch_dtype=torch.float16)

pipe_animate = AnimateDiffPipeline.from_pipe(pipe_sd, motion_adapter=adapter)
pipe_animate.scheduler = DDIMScheduler.from_config(pipe_animate.scheduler.config, beta_schedule="linear")

pipe_animate.load_lora_weights("guoyww/animatediff-motion-lora-zoom-out", adapter_name="zoom-out")
pipe_animate.to("cuda")
pipe_animate.enable_vae_slicing()
pipe_animate.enable_model_cpu_offload()

generator = torch.Generator(device="cpu").manual_seed(33)
pipe_animate.set_adapters("zoom-out", adapter_weights=0.75)
out = pipe_animate(
    prompt= prompt,
    num_frames=8,
    num_inference_steps=num_inference_steps,
    ip_adapter_image = image,
    generator=generator,
).frames[0]
export_to_gif(out, "out_animate.gif")

Output is the same of

pipe_sd = DiffusionPipeline.from_pretrained(base_repo, torch_dtype=torch.float16)
pipe_sd.to("cuda")
adapter = MotionAdapter.from_pretrained("guoyww/animatediff-motion-adapter-v1-5-2", torch_dtype=torch.float16)
pipe_animate = AnimateDiffPipeline.from_pipe(pipe_sd, motion_adapter=adapter)
pipe_animate.scheduler = DDIMScheduler.from_config(pipe_animate.scheduler.config, beta_schedule="linear")

pipe_animate.load_ip_adapter("h94/IP-Adapter", subfolder="models", weight_name="ip-adapter_sd15.bin") 
pipe_animate.set_ip_adapter_scale(0.6)    
pipe_animate.load_lora_weights("guoyww/animatediff-motion-lora-zoom-out", adapter_name="zoom-out")

and code doesn't break during loading

DN6

LGTM 👍🏽

yiyixuxu · 2024-04-29T22:30:40Z

src/diffusers/models/unets/unet_motion_model.py

+            attn_procs = {}
+            for name, processor in unet.attn_processors.items():
+                if name.endswith("attn1.processor"):
+                    attn_processor_class = (
+                        AttnProcessor2_0 if hasattr(F, "scaled_dot_product_attention") else AttnProcessor
+                    )
+                    attn_procs[name] = attn_processor_class()
+                else:
+                    attn_processor_class = (
+                        IPAdapterAttnProcessor2_0
+                        if hasattr(F, "scaled_dot_product_attention")
+                        else IPAdapterAttnProcessor
+                    )
+                    attn_procs[name] = attn_processor_class(
+                        hidden_size=processor.hidden_size,
+                        cross_attention_dim=processor.cross_attention_dim,
+                        scale=processor.scale,
+                        num_tokens=processor.num_tokens,
+                    )
+            for name, processor in model.attn_processors.items():
+                if name not in attn_procs:
+                    attn_procs[name] = processor.__class__()
+            model.set_attn_processor(attn_procs)


Is it the same as this?

model.set_attn_processor(unet.attn_processors)

actually no, because the UNetMotion used in AnimateDiff has motion modules, that the original pipeline does not have

if you do something like this:

if any( isinstance(proc, (IPAdapterAttnProcessor, IPAdapterAttnProcessor2_0)) for proc in unet.attn_processors.values() ): model.set_attn_processor(unet.attn_processors) model.config.encoder_hid_dim_type = "ip_image_proj" model.encoder_hid_proj = unet.encoder_hid_proj

you will end up (in the particular case of the code snippet above) with a ValueError, because the number of attention processors does not match
ValueError: A dict of processors was passed, but the number of processors 32 does not match the number of attention layers: 74. Please make sure to pass 74 processor classes.

@yiyixuxu I don't know if you read it, I'm pinging you because the issue has gone stale
thank you

hey, yes, indeed, I missed this one!
thanks for pining!

yiyixuxu · 2024-05-13T18:16:26Z

merged! sorry for the delay!
thanks again @fabiorigano for the great work:)

* Fix loading from_pipe * Fix style --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

fabiorigano added 2 commits April 27, 2024 17:51

Fix loading from_pipe

0e93562

Fix style

c3fbb64

sayakpaul reviewed Apr 27, 2024

View reviewed changes

Merge branch 'main' into unetmotionloadsipadapter

0cd4d01

DN6 approved these changes Apr 29, 2024

View reviewed changes

yiyixuxu reviewed Apr 29, 2024

View reviewed changes

yiyixuxu merged commit 44aa9e5 into huggingface:main May 13, 2024

fabiorigano deleted the unetmotionloadsipadapter branch May 13, 2024 18:21

sayakpaul pushed a commit that referenced this pull request Dec 23, 2024

fix AnimateDiff creation with a unet loaded with IP Adapter (#7791)

19370c6

* Fix loading from_pipe * Fix style --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix AnimateDiff creation with a unet loaded with IP Adapter #7791

fix AnimateDiff creation with a unet loaded with IP Adapter #7791

Uh oh!

fabiorigano commented Apr 27, 2024

Uh oh!

sayakpaul left a comment

Uh oh!

HuggingFaceDocBuilderDev commented Apr 27, 2024

Uh oh!

fabiorigano commented Apr 27, 2024 •

edited

Loading

Uh oh!

DN6 left a comment

Uh oh!

yiyixuxu Apr 29, 2024

Uh oh!

fabiorigano Apr 30, 2024

Uh oh!

fabiorigano May 13, 2024

Uh oh!

yiyixuxu May 13, 2024

Uh oh!

yiyixuxu commented May 13, 2024

Uh oh!

Uh oh!

fix AnimateDiff creation with a unet loaded with IP Adapter #7791

fix AnimateDiff creation with a unet loaded with IP Adapter #7791

Uh oh!

Conversation

fabiorigano commented Apr 27, 2024

What does this PR do?

Uh oh!

sayakpaul left a comment

Choose a reason for hiding this comment

Uh oh!

HuggingFaceDocBuilderDev commented Apr 27, 2024

Uh oh!

fabiorigano commented Apr 27, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

DN6 left a comment

Choose a reason for hiding this comment

Uh oh!

yiyixuxu Apr 29, 2024

Choose a reason for hiding this comment

Uh oh!

fabiorigano Apr 30, 2024

Choose a reason for hiding this comment

Uh oh!

fabiorigano May 13, 2024

Choose a reason for hiding this comment

Uh oh!

yiyixuxu May 13, 2024

Choose a reason for hiding this comment

Uh oh!

yiyixuxu commented May 13, 2024

Uh oh!

Uh oh!

fabiorigano commented Apr 27, 2024 •

edited

Loading