
Get depth programmatically #65

Merged — 9 commits into city96:main, Jun 23, 2024

Conversation

GavChap (Contributor) commented Jun 18, 2024

Get the depth of the model by counting its layers, so that models with more depth can be loaded. Example: https://huggingface.co/ptx0/pixart-reality-mix, which is a 900M model.
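A minimal sketch of the counting idea (not this PR's exact code): infer the depth from the largest block index present in the state dict. The `blocks.{i}.*` key layout is an assumption about the PixArt naming, not something taken from this repo.

```python
import re

def guess_depth(state_dict):
    """Infer the number of transformer blocks from a PixArt-style state dict."""
    indices = {
        int(m.group(1))
        for key in state_dict
        if (m := re.match(r"blocks\.(\d+)\.", key)) is not None
    }
    return max(indices) + 1 if indices else 0

# Toy check with 42 blocks, the depth of the 900M model mentioned above.
fake_sd = {f"blocks.{i}.scale_shift_table": None for i in range(42)}
print(guess_depth(fake_sd))  # -> 42
```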

city96 (Owner) commented Jun 18, 2024

Looks good. We'll want to add an auto checkpoint select node as well that auto detects/generates the correct config.

That way we can support any size model. In theory, a node like that isn't hard, but one issue I ran into was the pe_interpolation factor, which is not stored in the diffusers state dict.

I think it should be possible to completely get rid of that value by dynamically generating it from the image size similar to how it's done for HunYuanDiT.

I gave it a quick test and it seems to work. Should I try to make those changes I just mentioned on the base repo so we can use this PR for the auto config node as well or should I merge this as-is?

city96 (Owner) commented Jun 18, 2024

Actually, with that last commit it does seem to fail with the diffusers weights for me, since cross_attn.proj.weight is the comfy name and not the diffusers name.

…sed a layer with a 'final' layer giving me an out by one error"

This reverts commit e43ecbe.
GavChap (Contributor, PR author) commented Jun 18, 2024

> Actually, with that last commit it does seem to fail with the diffusers weights for me, since cross_attn.proj.weight is the comfy name and not the diffusers name.

Yes, I accidentally broke something, so I reverted it. I was trying to fix the "missing UNET" message, but that doesn't matter as long as the correct layers exist.

GavChap (Contributor, PR author) commented Jun 18, 2024

> Looks good. We'll want to add an auto checkpoint select node as well that auto detects/generates the correct config.
>
> That way we can support any size model. In theory, a node like that isn't hard, but one issue I ran into was the pe_interpolation factor, which is not stored in the diffusers state dict.
>
> I think it should be possible to completely get rid of that value by dynamically generating it from the image size similar to how it's done for HunYuanDiT.
>
> I gave it a quick test and it seems to work. Should I try to make those changes I just mentioned on the base repo so we can use this PR for the auto config node as well or should I merge this as-is?

I think merge it as-is; then we can work on a detection node. I've been trying to figure out pe_interpolation, since that should allow inference at any size. I had it working on square images: I could generate 2048x2048 from the 1024 model, but as soon as you selected a non-square aspect ratio it went off the wall.

GavChap closed this on Jun 18, 2024
GavChap (Contributor, PR author) commented Jun 18, 2024

I've closed it because of issues I just ran into. I'll reopen it once I've fixed them.

city96 (Owner) commented Jun 18, 2024

Fair lol, take your time. I'll check on the PE factor stuff and see how hard it is to guess. I assume just averaging width and height and then taking the ratio against the base (512?) didn't help?

GavChap (Contributor, PR author) commented Jun 18, 2024

> Fair lol, take your time. I'll check on the PE factor stuff and see how hard it is to guess. I assume just averaging width and height and then taking the ratio against the base (512?) didn't help?

Nope, didn't help at all. But give it a go and you'll see.

GavChap reopened this on Jun 18, 2024
city96 (Owner) commented Jun 18, 2024

> I could generate 2048x2048 from the 1024 model, but as soon as you selected a non-square aspect ratio it went off the wall.

Doing that seems like it shouldn't work, unless you were doing it the other way around. DiT is notoriously bad at resolutions it wasn't trained on.

Also, I'm able to guess the factor with the formula `(x.shape[-1] + x.shape[-2]) / 2.0 / (512 / 8.0)` (PE scale computed: 2.0625 [vs: 2]).

I'm thinking maybe a soft-rounding for values that are close to whole integers could work, then leave it up to luck for values outside that lol. Not like the model works outside those anyway.
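A quick, purely illustrative sketch of that formula plus the soft-rounding idea; the `guess_pe_interpolation` name and the 0.1 tolerance are assumptions, not values from the repo.

```python
import torch

def guess_pe_interpolation(latent, base_size=512, vae_scale=8, tol=0.1):
    """Estimate the PE interpolation factor from the latent's spatial size."""
    # Average the latent height/width and divide by the base latent size (512/8 = 64).
    raw = (latent.shape[-1] + latent.shape[-2]) / 2.0 / (base_size / vae_scale)
    nearest = round(raw)
    # Soft rounding: snap to the nearest integer when the estimate is close enough.
    return float(nearest) if abs(raw - nearest) <= tol else raw

latent = torch.zeros(1, 4, 132, 132)  # reproduces the "2.0625 [vs: 2]" case above
print(guess_pe_interpolation(latent))  # -> 2.0
```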


city96 (Owner) commented Jun 19, 2024

Pushed an auto checkpoint loader, but it needs better logic to get the right config for diffusers, which is missing a bunch of keys that the default one has. I can go into more detail if this is something you'd like to look into. (de52d3a)
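Purely as an illustration of the "default config with detected overrides" idea: start from a known-good default and only replace the values that can be read out of the state dict. The key names and defaults below follow the public PixArt-XL/2 architecture and are assumptions, not this repo's exact config format.

```python
# Assumed defaults roughly matching the base PixArt-XL/2 model.
DEFAULT_PIXART_CONFIG = {
    "depth": 28,
    "hidden_size": 1152,
    "patch_size": 2,
    "num_heads": 16,
}

def build_config(detected):
    config = dict(DEFAULT_PIXART_CONFIG)  # keys the diffusers state dict doesn't carry
    config.update(detected)               # e.g. depth detected from the weights
    return config

print(build_config({"depth": 42}))  # config for a 42-layer checkpoint
```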


PixArt/loader.py (Outdated)
@@ -76,6 +76,8 @@ def load_pixart(model_path, model_conf):
device=model_management.get_torch_device()
)

model_conf.unet_config['depth'] = sum(key.endswith('.scale_shift_table') for key in state_dict.keys())
city96 (Owner):

Overriding the blocks here is a bad idea, and using .scale_shift_table will always be off by one, because the final layer also has a scale_shift_table entry. I think we should just leave the loader as-is in this PR.
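To illustrate the off-by-one (a toy sketch; key names are assumptions based on this thread, with the output head assumed to live under `final_layer`):

```python
def count_scale_shift_tables(state_dict):
    # Naive count: includes the final layer's table, so it overshoots the depth by one.
    return sum(key.endswith(".scale_shift_table") for key in state_dict)

def count_block_scale_shift_tables(state_dict):
    # Only count tables that belong to numbered transformer blocks.
    return sum(
        key.startswith("blocks.") and key.endswith(".scale_shift_table")
        for key in state_dict
    )

fake_sd = {f"blocks.{i}.scale_shift_table": None for i in range(28)}
fake_sd["final_layer.scale_shift_table"] = None  # assumed name for the output head

print(count_scale_shift_tables(fake_sd))        # 29 -- off by one
print(count_block_scale_shift_tables(fake_sd))  # 28 -- the actual depth
```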

GavChap (Contributor, PR author):

The problem is that if you don't override it, it messes up generation for the larger models, since it expects 28 layers rather than 42.

GavChap (Contributor, PR author):

Fixed by using the simple loader and adding an additional config to the standard loader.

Resolved review threads: PixArt/diffusers_convert.py (outdated), PixArt/models/PixArtMS.py
GavChap (Contributor, PR author) commented Jun 19, 2024

I've made more changes. I'm not sure the autodetect node works with 900M models, so I'll do some more investigation. I kept getting problems with generations when using the autoconfig plus the new layer code for extra depth; the only combination I found that works is forcing the depth in loader.py. I will keep digging, since there is renewed interest in PixArt after SD3's launch.

GavChap (Contributor, PR author) commented Jun 23, 2024

In further testing, the simple loader is working fine; it was something to do with my setup. I've removed the model_conf override and added a config to the standard loader, so this should be good to merge?

city96 merged commit faf6979 into city96:main on Jun 23, 2024
city96 (Owner) commented Jun 23, 2024

Well, I can confirm that it "works", though it looks like it definitely needs more training lol. Still, good job on this! Thanks!

