
feature: support TAESD - Tiny Autoencoder for Stable Diffusion #4316

Merged
22 commits merged from feat/taesd into main on Sep 20, 2023

Conversation

keturn
Contributor

@keturn keturn commented Aug 18, 2023

TAESD - Tiny Autoencoder for Stable Diffusion - is a tiny VAE that provides significantly better results than my single-multiplication hack but is still very fast.

The entire TAESD model weights are under 10 MB!

This PR requires diffusers 0.20:

To Do

Test with

Have you discussed this change with the InvokeAI team?

Have you updated all relevant documentation?

  • No

Related Tickets & Documents

  • Related Issue #
  • Closes #

QA Instructions, Screenshots, Recordings

Should be able to import these models:

and use them as VAE.

Added/updated tests?

  • Some. There are new tests for VaeFolderProbe based on VAE configurations, but no tests that require the full model weights.
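A config-based probe check of this kind can be illustrated with a small standalone sketch (hypothetical helper; the real tests exercise `VaeFolderProbe` itself, and the heuristic values are the ones used by the probe code discussed in this PR):

```python
# Hypothetical standalone version of the config-based base-type heuristic,
# for illustration only; the real tests run against VaeFolderProbe.
def base_from_vae_config(config: dict) -> str:
    # SDXL VAEs ship with scaling_factor 0.13025; SD 1.x uses 0.18215.
    if config.get("scaling_factor", 0) == 0.13025 and config.get("sample_size") in [512, 1024]:
        return "sdxl"
    return "sd-1"

assert base_from_vae_config({"scaling_factor": 0.13025, "sample_size": 1024}) == "sdxl"
assert base_from_vae_config({"scaling_factor": 0.18215, "sample_size": 512}) == "sd-1"
```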

@blessedcoolant
Collaborator

I was testing out the Tiny AutoEncoder last night. It definitely is a massive speed improvement, but the quality of the output is a little inferior to the full encoder, as expected. This is especially noticeable in realistic models, where detail-heavy areas like the eyes just do not translate well.

Will give this PR a run in a bit.

Maybe we should also consider adding TAESD progress previews?

@keturn
Contributor Author

keturn commented Aug 18, 2023

Yep, I don't expect it to be a finishing step for any production work, but it'll be useful for previews and, I think, also certain kinds of intermediate steps that go on to be further mashed or re-noised.

And I guess I could imagine a workflow where, if the normal VAE is slow enough for some people, they might elect to do a quick batch generation and then only use the full VAE on the ones they want to keep?

@blessedcoolant
Collaborator

blessedcoolant commented Aug 18, 2023

Yeah, definitely handy to have, especially if we can do progress previews with TAESD too. That would cut down generation times, because each step-callback decode will take considerably less time.

I'm actually half of a mind to ship the TAESD models as core models, since they're pretty small, and give the user an option next to VAE to check "Tiny". In that case we use the Tiny AutoEncoder; otherwise we use the regular one.

Unless we anticipate more Tiny AE models, in which case having it locked to just TAESD might be an issue later.

@keturn
Contributor Author

keturn commented Aug 19, 2023

I think the encoder is buggy, and I think the bug is upstream: huggingface/diffusers#4676

Update: just waiting on a diffusers release > 0.20 for the fix (diffusers 0.20.2 is out but seems to be a narrowly focused release that doesn't include this; I assume that means we're waiting for 0.21):

@psychedelicious
Collaborator

@singledispatchmethod - cool!

I agree - we should make TAESD a core model. No reason not to.

Then the only question is how to manage preview images with it? The step_callback has access to the InvocationContext and therefore has access to the model manager, so it could easily decode the image in there. I'm just concerned about the overhead of doing this up to 40 times per second.

Anyways, that's for a followup PR.
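If the per-step decode overhead does turn out to matter, one option is simply throttling how often the preview refreshes. A minimal sketch of that idea (hypothetical, not part of this PR; the class and parameter names are made up):

```python
import time

# Hypothetical throttle (not part of this PR): cap how often the TAESD
# preview decode runs, so a fast sampler can't trigger it 40x per second.
class PreviewThrottle:
    def __init__(self, min_interval_s=0.25):
        self.min_interval_s = min_interval_s
        self._last = float("-inf")  # allow the first decode immediately

    def should_decode(self, now=None):
        """Return True if enough time has passed to decode another preview."""
        now = time.monotonic() if now is None else now
        if now - self._last >= self.min_interval_s:
            self._last = now
            return True
        return False
```

The step callback would call `should_decode()` and skip the TAESD decode when it returns False, bounding the preview cost regardless of sampler speed.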

# Conflicts:
#	invokeai/app/invocations/latent.py
@Millu
Contributor

Millu commented Aug 28, 2023

@lstein @StAlKeR7779 could you take a look at this? would be awesome if we could get this in for 3.1

@blessedcoolant
Collaborator

If we are adding this, then @lstein will need to add the model support during installation.

@hipsterusername
Member

@Millu happy to review, but I think this still needs model support in the installer, right?

cc: @lstein

@Millu
Contributor

Millu commented Sep 12, 2023

@Millu happy to review, but I think this still needs model support in the installer, right?

cc: @lstein

We can merge this PR and include TAESD as a core model in a follow-up. My understanding is that this PR supports the use of TAESD in Invoke; if a user wants it, they'll have to install it manually.

@keturn
Contributor Author

keturn commented Sep 13, 2023

Doing some manual testing after upgrading to diffusers 0.21.

TAESD encoding and decoding looks to be working okay, but the normal VAE is failing with

[2023-09-13 09:52:08,298]::[InvokeAI]::ERROR --> Error while invoking:
new_LoRACompatibleConv_forward() takes 2 positional arguments but 3 were given
File "…/diffusers/models/resnet.py", line 637, in forward
hidden_states = self.conv1(hidden_states, scale)

new_LoRACompatibleConv_forward seems to be something from Invoke's hotfixes 🤢 and I don't know what a LoRA-related hotfix is doing in the middle of a vae.decode call stack.

@keturn
Contributor Author

keturn commented Sep 13, 2023

But I've just confirmed that the same failure occurs on main, so this branch's changes to invocations.latents aren't to blame.

@RyanJDick
Collaborator

Doing some manual testing after upgrading to diffusers 0.21.

TAESD encoding and decoding looks to be working okay, but the normal VAE is failing with

[2023-09-13 09:52:08,298]::[InvokeAI]::ERROR --> Error while invoking:
new_LoRACompatibleConv_forward() takes 2 positional arguments but 3 were given
File "…/diffusers/models/resnet.py", line 637, in forward
hidden_states = self.conv1(hidden_states, scale)

new_LoRACompatibleConv_forward seems to be something from Invoke's hotfixes 🤢 and I don't know what a LoRA-related hotfix is doing in the middle of a vae.decode call stack.

I think this should be fixed by: #4534

@RyanJDick
Collaborator

I just tested installing both taesd models via the 'Import Models' UI. It seems like the model probe is not correctly detecting the base model.

[screenshot: Model Manager showing the imported TAESD models with the wrong base model]

@keturn
Contributor Author

keturn commented Sep 20, 2023

It seems like the model probe is not correctly detecting the base model.

Oh, in that taesdxl is showing up under SD and not SDXL? hmm.

# Conflicts:
#	invokeai/backend/model_management/model_probe.py
@keturn
Contributor Author

keturn commented Sep 20, 2023

The code that determines whether a VAE's base model is SD or SDXL is here:

class VaeFolderProbe(FolderProbeBase):
    def get_base_type(self) -> BaseModelType:
        config_file = self.folder_path / "config.json"
        if not config_file.exists():
            raise InvalidModelException(f"Cannot determine base type for {self.folder_path}")
        with open(config_file, "r") as file:
            config = json.load(file)
        return (
            BaseModelType.StableDiffusionXL
            if config.get("scaling_factor", 0) == 0.13025 and config.get("sample_size") in [512, 1024]
            else BaseModelType.StableDiffusion1
        )

it checks certain values inside the VAE's config.json.

However, for TAESD, the config.json files for taesd and taesdxl are identical. Indeed, there's no reason for them to have any different parameters, as the shapes of the two VAEs are the same.

The only heuristic I can think of is to check to see if the model name literally ends in XL.

Or to define some other metadata field and ask @madebyollin to add it to the model's config.
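The name-based fallback might look something like this (hypothetical sketch, not in this PR; the function name and return labels are made up):

```python
# Hypothetical fallback: infer the base model from the folder name alone,
# since taesd and taesdxl ship identical config.json files.
def guess_taesd_base(folder_name: str) -> str:
    name = folder_name.rstrip("/").lower()
    return "sdxl" if name.endswith("xl") else "sd-1"
```

This is obviously fragile, since it breaks for any rename of the model folder, which is why a dedicated metadata field would be the more robust option.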

@hipsterusername hipsterusername merged commit b64ade5 into main Sep 20, 2023
8 checks passed
@hipsterusername hipsterusername deleted the feat/taesd branch September 20, 2023 21:23
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[enhancement]: Support TAESD to display better progress image
6 participants