[Versatile Diffusion] Add versatile diffusion model #1283

patrickvonplaten · 2022-11-14T19:45:01Z

Add model from https://github.com/SHI-Labs/Versatile-Diffusion

HuggingFaceDocBuilder · 2022-11-14T19:49:01Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

src/diffusers/pipelines/versatile_diffusion/pipeline_versatile_diffusion.py

scripts/convert_versatile_diffusion_to_diffusers.py

src/diffusers/pipelines/versatile_diffusion/pipeline_versatile_diffusion.py

patrickvonplaten

Super nice that everything works and that we need no changes to the UNet2DCondition! Not super happy about the context manager 😅 Could we maybe do a different design here it's quite difficult to understand

patrickvonplaten

Super nice progress here and great that you made it work! Think I understand the difficulties with the architecture a bit more now - should we maybe add all the existing functionality first and then discuss API / design a bit more?

Think we should also make multiple pipelines no?

All in one pipeline (this one will be heavy as it'll have both unets loaded in memory
To-Image pipeline (this one will only have image_unet + text_cross_att & image_cross_att)
To-Text pipeline (this one will only have text_unet + text_cross_att & image_cross_att
4 very light pipelines (text2img, img2img, img2text, text2text) ? (maybe we don't need to add those if memory is low enough in the "dual" pipelines

…add_versatile_diffusers

…d_versatile_diffusers

patrickvonplaten · 2022-11-21T13:47:42Z

Added the "GPT2 optimus"

It expects latent diffusion outputs and should work as follows:

#!/usr/bin/env python3
import torch
from diffusers.pipelines.versatile_diffusion import GPT2OptimusForLatentConnector
from transformers import GPT2Tokenizer

model = GPT2OptimusForLatentConnector.from_pretrained("fusing/gpt2_optimus")
tokenizer = GPT2Tokenizer.from_pretrained("fusing/gpt2_optimus")

latent_output_of_unet =  # get tensor from unet

output = model.generate(bos_token_id=tokenizer.bos_token_id, past=latent_output_of_unet)

Haven't tested it end to end as I think we first need to wait for the text unet, but more than happy to debug further when the text unet is ready #784

…d_versatile_diffusers

…ace/diffusers into add_versatile_diffusers

src/diffusers/models/attention.py

…ace/diffusers into add_versatile_diffusers

Ir1d · 2022-11-23T18:33:11Z

src/diffusers/pipelines/versatile_diffusion/pipeline_versatile_diffusion_image_variation.py

+    library implements for all the pipelines (such as downloading or saving, running on a particular device, etc.)
+
+    Parameters:
+        vqvae ([`VQModel`]):


HI @patrickvonplaten , it seems that these docs still need to be updated 🙏

Good point! Would you like to open a PR? :-)

* up * convert dual unet * revert dual attn * adapt for vd-official * test the full pipeline * mixed inference * mixed inference for text2img * add image prompting * fix clip norm * split text2img and img2img * fix format * refactor text2img * mega pipeline * add optimus * refactor image var * wip text_unet * text unet end to end * update tests * reshape * fix image to text * add some first docs * dual guided pipeline * fix token ratio * propose change * dual transformer as a native module * DualTransformer(nn.Module) * DualTransformer(nn.Module) * correct unconditional image * save-load with mega pipeline * remove image to text * up * uP * fix * up * final fix * remove_unused_weights * test updates * save progress * uP * fix dual prompts * some fixes * finish * style * finish renaming * up * fix * fix * fix * finish Co-authored-by: anton-l <anton@huggingface.co>

up

ed8c82c

anton-l added 9 commits November 15, 2022 14:14

convert dual unet

d1e8a50

revert dual attn

e00a9cf

adapt for vd-official

833cd1d

test the full pipeline

e455921

mixed inference

53f080f

mixed inference for text2img

b5778e0

merge main

ee84175

add image prompting

9a8114a

fix clip norm

b17475e

anton-l reviewed Nov 16, 2022

View reviewed changes

src/diffusers/pipelines/versatile_diffusion/pipeline_versatile_diffusion.py Outdated Show resolved Hide resolved

anton-l reviewed Nov 16, 2022

View reviewed changes

scripts/convert_versatile_diffusion_to_diffusers.py Outdated Show resolved Hide resolved

patrickvonplaten commented Nov 16, 2022

View reviewed changes

src/diffusers/pipelines/versatile_diffusion/pipeline_versatile_diffusion.py Show resolved Hide resolved

patrickvonplaten commented Nov 16, 2022

View reviewed changes

src/diffusers/pipelines/versatile_diffusion/pipeline_versatile_diffusion.py Outdated Show resolved Hide resolved

patrickvonplaten commented Nov 16, 2022

View reviewed changes

patrickvonplaten and others added 8 commits November 21, 2022 11:27

Merge branch 'main' of https://github.com/huggingface/diffusers into …

a758804

…add_versatile_diffusers

split text2img and img2img

74fde82

Merge remote-tracking branch 'origin/add_versatile_diffusers' into ad…

5785e27

…d_versatile_diffusers

fix format

22e6b54

refactor text2img

d36cf41

mega pipeline

303052d

add optimus

f2bc526

add gpt2

2a50c84

anton-l added 4 commits November 21, 2022 15:04

refactor image var

bc509b2

Merge remote-tracking branch 'origin/add_versatile_diffusers' into ad…

4d9ec98

…d_versatile_diffusers

wip text_unet

8c989eb

text unet end to end

f706729

patrickvonplaten and others added 14 commits November 23, 2022 13:41

Merge branch 'add_versatile_diffusers' of https://github.com/huggingf…

c91d0a4

…ace/diffusers into add_versatile_diffusers

up

ff8188a

uP

1bded5a

fix

af8a378

up

7bf2d4d

final fix

a32c942

remove_unused_weights

447780d

test updates

1b85e34

save progress

e950199

Merge branch 'add_versatile_diffusers' of https://github.com/huggingf…

1599b14

…ace/diffusers into add_versatile_diffusers

uP

2e2df18

fix dual prompts

dd9dce5

some fixes

6cbee51

finish

e9843fa

anton-l reviewed Nov 23, 2022

View reviewed changes

src/diffusers/models/attention.py Outdated Show resolved Hide resolved

anton-l and others added 10 commits November 23, 2022 18:16

style

cea10a0

finish renaming

59c2fef

merge main

eb02e1d

finish

5669d93

Merge branch 'add_versatile_diffusers' of https://github.com/huggingf…

5fc757e

…ace/diffusers into add_versatile_diffusers

up

2e5128d

fix

9f31d8a

fix

e742f16

fix

ace7123

finish

8a6f0c9

patrickvonplaten merged commit 2625fb5 into main Nov 23, 2022

patrickvonplaten deleted the add_versatile_diffusers branch November 23, 2022 18:03

Ir1d reviewed Nov 23, 2022

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Versatile Diffusion] Add versatile diffusion model #1283

[Versatile Diffusion] Add versatile diffusion model #1283

patrickvonplaten commented Nov 14, 2022

HuggingFaceDocBuilder commented Nov 14, 2022

patrickvonplaten left a comment

patrickvonplaten left a comment

patrickvonplaten commented Nov 21, 2022

Ir1d Nov 23, 2022

patrickvonplaten Nov 29, 2022

[Versatile Diffusion] Add versatile diffusion model #1283

[Versatile Diffusion] Add versatile diffusion model #1283

Conversation

patrickvonplaten commented Nov 14, 2022

HuggingFaceDocBuilder commented Nov 14, 2022

patrickvonplaten left a comment

Choose a reason for hiding this comment

patrickvonplaten left a comment

Choose a reason for hiding this comment

patrickvonplaten commented Nov 21, 2022

Ir1d Nov 23, 2022

Choose a reason for hiding this comment

patrickvonplaten Nov 29, 2022

Choose a reason for hiding this comment