Feat/pixart binsize #5708

lawrence-cj · 2023-11-08T16:01:15Z

What does this PR do?

Use heights and widths in the bin during inference, then resize and crop the image to the users required one. Controlable function added. Feel free to change the code style. @sayakpaul

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Did you read our philosophy doc (important for complex PRs)?
Was this discussed/approved via a Github issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

* init pixart alpha pipeline * fix: import * script * script * script * add: vae to the pipeline * add: vae_scale_factor * add: checkpoint_path * clean conversion script a bit. * size embeddings. * fix: size embedding * update scrip * support for interpolation of position embedding. * support for conditioning. * .. * .. * .. * final layer * final layer * align if encode_prompt * support for caption embedding * refactor * refactor * refactor * start cross attention * start cross attention * cross_attention_dim * cross * cross * support for resolution and aspect_ratio * support for caption projection * refactor patch embeddings * batch_size * up * commit * commit * commit. * squeeze * squeeze * squeeze * squeeze * squeeze * squeeze * squeeze * squeeze * squeeze * squeeze * squeeze * squeeze. * squeeze. * fix final block./ * fix final block./ * fix final block./ * clean * fix: interpolation scale. * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging * debugging * debugging * debugging * debugging * debugging * debugging * make --checkpoint_path non-required. * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * remove num_tokens * timesteps -> timestep * timesteps -> timestep * timesteps -> timestep * timesteps -> timestep * timesteps -> timestep * timesteps -> timestep * debug * debug * update conversion script. * update conversion script. * update conversion script. * debug * debug * debug * clean * debug * debug * debug * debug * debug * debug * debug * debug * deug * debug * debug * debug * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * clean * fix * fix * boom * boom * some changes * boom * save * up * remove i * fix more tests * DPMSolverMultistepScheduler * fix * offloading * fix conversion script * fix conversion script * remove print * remove support for negative prompt embeds. * typo. * remove extra kwargs * bring conversion script to where it was * fix * trying mu luck * trying my luck again * again * again * again * clean up * up * up * update example * support for 512 * remove spacing * finalize docs. * test debug * fix: assertion values. * debug * debug * debug * fix: repeat * remove prints. * Apply suggestions from code review * Apply suggestions from code review * Correct more * Apply suggestions from code review * Change all * Clean more * fix more * Fix more * Fix more * Correct more * address patrick's comments. * remove unneeded args * clean up pipeline. * sty;e * make the use of additional conditions better conditioned. * None better * dtype * height and width validation * add a note about size brackets. * fix * spit out slow test outputs. * fix? * fix optional test * fix more * remove unneeded comment * debug --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

…vision (huggingface#5659) fix custom pipelines

post release

update custom diffusion attn processor

* fix model xformers test * update

update free model hooks

* fix * Update src/diffusers/models/attention.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* explicit torch dependency check * update * update * update

… with a batch size > 1 (huggingface#5677) * fix embeds * remove todo * add: test * better name

…ingface#5668) * fix: import bug * fix * fix * fix import utils for lcm * fix: pixart alpha init * Fix --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* debug * support non-square images * add: test * fix: test --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Refactor LCMScheduler.step such that prev_sample == denoised at the last timestep in the schedule. * Make timestep scaling when calculating boundary conditions configurable. * Reparameterize timestep_scaling to be a multiplicative rather than division scaling. * make style * fix dtype conversion * make style --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

…ggingface#5611) * Fix typos, improve, update; kandinsky doesn't want fp16 due to deprecation; ogkalu and kohbanye don't have safetensor; add make_image_grid for better visualization * Update inpaint.md * Remove erronous Space * Update docs/source/en/using-diffusers/conditional_image_generation.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update img2img.md * load_image() already converts to RGB * Update depth2img.md * Update img2img.md * Update inpaint.md --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

…ion. (huggingface#5651) * I added a new doc string to the class. This is more flexible to understanding other developers what are doing and where it's using. * Update src/diffusers/models/unet_2d_blocks.py This changes suggest by maintener. Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update src/diffusers/models/unet_2d_blocks.py Add suggested text Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update unet_2d_blocks.py I changed the Parameter to Args text. * Update unet_2d_blocks.py proper indentation set in this file. * Update unet_2d_blocks.py a little bit of change in the act_fun argument line. * I run the black command to reformat style in the code * Update unet_2d_blocks.py similar doc-string add to have in the original diffusion repository. * I removed the dummy variable defined in both the encoder and decoder. * Now, I run black package to reformat my file * Remove the redundant line from the adapter.py file. * Black package using to reformated my file * Replacing the nn.Mish activation function with a get_activation function allows developers to more easily choose the right activation function for their task. Additionally, removing redundant variables can improve code readability and maintainability. * I try to fix this: Fast tests for PRs / Fast PyTorch Models & Schedulers CPU tests (pull_request) * Update src/diffusers/models/resnet.py Co-authored-by: YiYi Xu <yixu310@gmail.com> --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>

skip rendering Co-authored-by: yiyixuxu <yixu310@gmail,com>

…ace#5700) Fix the misaligned pipeline usage

…huggingface#5650) Closes huggingface#4665

* [LCM] Fix img2img * make fix-copies * make fix-copies * make fix-copies * up

* fix mask feature condition. * debug * remove identical test * set correct * Empty-Commit

* up * up * up * Empty-Commit * fix keyword argument call. --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* Add adapter fusing + PEFT to the docs * Update docs/source/en/tutorials/using_peft_for_inference.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update docs/source/en/tutorials/using_peft_for_inference.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update docs/source/en/tutorials/using_peft_for_inference.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/tutorials/using_peft_for_inference.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/tutorials/using_peft_for_inference.md * Update docs/source/en/tutorials/using_peft_for_inference.md --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* fix prompt bug * add test

* bugfix peft lor * Apply suggestions from code review --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

sayakpaul · 2023-11-10T03:48:48Z

Closed in favor of #5716

sayakpaul and others added 30 commits November 6, 2023 08:40

correct pipeline class name (huggingface#5652)

aec3de8

[Custom Pipelines] Make sure that community pipelines can use repo re…

f05d75c

…vision (huggingface#5659) fix custom pipelines

post release (v0.22.0) (huggingface#5658)

6460338

post release

Add Pixart to AUTO_TEXT2IMAGE_PIPELINES_MAPPING (huggingface#5664)

9bafef3

Update custom diffusion attn processor (huggingface#5663)

6a89a6c

update custom diffusion attn processor

Model tests xformers fixes (huggingface#5679)

71f56c7

* fix model xformers test * update

Update free model hooks (huggingface#5680)

8ca179a

update free model hooks

Fix Basic Transformer Block (huggingface#5683)

414d7c4

* fix * Update src/diffusers/models/attention.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

Explicit torch/flax dependency check (huggingface#5673)

97c8199

* explicit torch dependency check * update * update * update

[PixArt-Alpha] fix mask_feature so that precomputed embeddings work…

a8523bf

… with a batch size > 1 (huggingface#5677) * fix embeds * remove todo * add: test * better name

Make sure DDPM and diffusers can be used without Transformers (hugg…

84cd9e8

…ingface#5668) * fix: import bug * fix * fix * fix import utils for lcm * fix: pixart alpha init * Fix --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

[PixArt-Alpha] Support non-square images (huggingface#5672)

1dc231d

* debug * support non-square images * add: test * fix: test --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

fix mask feature condition.

a4cc2d2

debug

37a57d9

remove identical test

e458510

speed up Shap-E fast test (huggingface#5686)

6999693

skip rendering Co-authored-by: yiyixuxu <yixu310@gmail,com>

set correct

83bd1cd

Merge branch 'main' into fix/pixart-embeds-2

157308b

Fix the misaligned pipeline usage in dreamshaper docstrings (huggingf…

11c1256

…ace#5700) Fix the misaligned pipeline usage

Fixed is_safetensors_compatible() handling of windows path separators (…

d384265

…huggingface#5650) Closes huggingface#4665

[LCM] Fix img2img (huggingface#5698)

c803a8f

* [LCM] Fix img2img * make fix-copies * make fix-copies * make fix-copies * up

Merge branch 'main' into fix/pixart-embeds-2

be029a2

Empty-Commit

fde24bf

[PixArt-Alpha] fix mask feature condition. (huggingface#5695)

78be400

* fix mask feature condition. * debug * remove identical test * set correct * Empty-Commit

Fix styling issues (huggingface#5699)

17528af

* up * up * up * Empty-Commit * fix keyword argument call. --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

inference using height-widths in bin

da55b0b

apolinario and others added 4 commits November 8, 2023 18:26

Fix prompt bug in AnimateDiff (huggingface#5702)

65ef7a0

* fix prompt bug * add test

[Bugfix] fix error of peft lora when xformers enabled (huggingface#5697)

6110d7c

* bugfix peft lor * Apply suggestions from code review --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

Merge branch 'huggingface:main' into feat/pixart_binsize

ab52c75

lawrence-cj closed this Nov 9, 2023

lawrence-cj deleted the feat/pixart_binsize branch November 9, 2023 02:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feat/pixart binsize #5708

Feat/pixart binsize #5708

Uh oh!

lawrence-cj commented Nov 8, 2023

Uh oh!

sayakpaul commented Nov 10, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

13 participants

Feat/pixart binsize #5708

Feat/pixart binsize #5708

Uh oh!

Conversation

lawrence-cj commented Nov 8, 2023

What does this PR do?

Before submitting

Who can review?

Uh oh!

sayakpaul commented Nov 10, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

13 participants