Fix AutoencoderTiny encoder scaling convention #4682

madebyollin · 2023-08-19T18:28:05Z

What does this PR do?

Add [-1, 1] -> [0, 1] rescaling to EncoderTiny
Move [0, 1] -> [-1, 1] rescaling from AutoencoderTiny.decode to DecoderTiny (i.e. immediately after the final conv, as early as possible)
Fix missing [0, 255] -> [0, 1] rescaling in AutoencoderTiny.forward
Update AutoencoderTinyIntegrationTests to protect against scaling issues. The new test constructs a simple image, round-trips it through AutoencoderTiny, and confirms the decoded result is approximately equal to the source image. This test checks behavior with and without tiling enabled. This test will fail if new AutoencoderTiny scaling issues are introduced.

Motivation

Raw TAESD weights expect images in [0, 1], but diffusers' convention represents images with zero-centered values in [-1, 1], so AutoencoderTiny needs to scale / unscale images at the start of encoding and at the end of decoding in order to work with diffusers.

Fixes #4676

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Did you read our philosophy doc (important for complex PRs)?
Was this discussed/approved via a Github issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

* Add [-1, 1] -> [0, 1] rescaling to EncoderTiny * Move [0, 1] -> [-1, 1] rescaling from AutoencoderTiny.decode to DecoderTiny (i.e. immediately after the final conv, as early as possible) * Fix missing [0, 255] -> [0, 1] rescaling in AutoencoderTiny.forward * Update AutoencoderTinyIntegrationTests to protect against scaling issues. The new test constructs a simple image, round-trips it through AutoencoderTiny, and confirms the decoded result is approximately equal to the source image. This test checks behavior with and without tiling enabled. This test will fail if new AutoencoderTiny scaling issues are introduced. * Context: Raw TAESD weights expect images in [0, 1], but diffusers' convention represents images with zero-centered values in [-1, 1], so AutoencoderTiny needs to scale / unscale images at the start of encoding and at the end of decoding in order to work with diffusers.

tests/models/test_models_vae.py

sayakpaul · 2023-08-20T06:52:23Z

The failing tests seem to be irrelevant.

tests/models/test_models_vae.py

src/diffusers/models/autoencoder_tiny.py

HuggingFaceDocBuilderDev · 2023-08-21T03:41:09Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

sayakpaul

Thanks so much!

@yiyixuxu @DN6 could one of you give this a look as well?

yiyixuxu

thank you!

* Fix AutoencoderTiny encoder scaling convention * Add [-1, 1] -> [0, 1] rescaling to EncoderTiny * Move [0, 1] -> [-1, 1] rescaling from AutoencoderTiny.decode to DecoderTiny (i.e. immediately after the final conv, as early as possible) * Fix missing [0, 255] -> [0, 1] rescaling in AutoencoderTiny.forward * Update AutoencoderTinyIntegrationTests to protect against scaling issues. The new test constructs a simple image, round-trips it through AutoencoderTiny, and confirms the decoded result is approximately equal to the source image. This test checks behavior with and without tiling enabled. This test will fail if new AutoencoderTiny scaling issues are introduced. * Context: Raw TAESD weights expect images in [0, 1], but diffusers' convention represents images with zero-centered values in [-1, 1], so AutoencoderTiny needs to scale / unscale images at the start of encoding and at the end of decoding in order to work with diffusers. * Re-add existing AutoencoderTiny test, update golden values * Add comments to AutoencoderTiny.forward

madebyollin force-pushed the main branch from 7a9a7c2 to 15b30ba Compare August 19, 2023 18:40

madebyollin force-pushed the main branch from 15b30ba to 42fe9ec Compare August 19, 2023 18:47

keturn reviewed Aug 19, 2023

View reviewed changes

tests/models/test_models_vae.py Outdated Show resolved Hide resolved

madebyollin mentioned this pull request Aug 19, 2023

TAESD-encoded latents are too dark #4676

Closed

sayakpaul reviewed Aug 20, 2023

View reviewed changes

tests/models/test_models_vae.py Outdated Show resolved Hide resolved

sayakpaul reviewed Aug 20, 2023

View reviewed changes

tests/models/test_models_vae.py Outdated Show resolved Hide resolved

sayakpaul requested review from DN6 and yiyixuxu August 20, 2023 06:52

Re-add existing AutoencoderTiny test, update golden values

70ecb91

sayakpaul reviewed Aug 21, 2023

View reviewed changes

tests/models/test_models_vae.py Show resolved Hide resolved

sayakpaul reviewed Aug 21, 2023

View reviewed changes

src/diffusers/models/autoencoder_tiny.py Show resolved Hide resolved

Add comments to AutoencoderTiny.forward

a1bc873

sayakpaul approved these changes Aug 21, 2023

View reviewed changes

yiyixuxu approved these changes Aug 23, 2023

View reviewed changes

sayakpaul merged commit 052bf32 into huggingface:main Aug 23, 2023
9 checks passed

keturn mentioned this pull request Aug 25, 2023

feature: support TAESD - Tiny Autoencoder for Stable Diffusion invoke-ai/InvokeAI#4316

Merged

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix AutoencoderTiny encoder scaling convention #4682

Fix AutoencoderTiny encoder scaling convention #4682

madebyollin commented Aug 19, 2023

sayakpaul commented Aug 20, 2023

HuggingFaceDocBuilderDev commented Aug 21, 2023

sayakpaul left a comment

yiyixuxu left a comment

Fix AutoencoderTiny encoder scaling convention #4682

Fix AutoencoderTiny encoder scaling convention #4682

Conversation

madebyollin commented Aug 19, 2023

What does this PR do?

Motivation

Before submitting

Who can review?

sayakpaul commented Aug 20, 2023

HuggingFaceDocBuilderDev commented Aug 21, 2023

sayakpaul left a comment

Choose a reason for hiding this comment

yiyixuxu left a comment

Choose a reason for hiding this comment