
DoRA cross-compatibility #196

Closed
ntc-ai opened this issue Jul 1, 2024 · 11 comments

Comments

@ntc-ai

ntc-ai commented Jul 1, 2024

Hi,
I was referred here by comfyanonymous when trying to merge a PR [1].

As a DoRA model author, I want to be able to scale my DoRA over the range -1 to 1. I was using the 'alpha' parameter to accomplish this.

While this works in A1111 [2], it does not work in ComfyUI, whose author says he matches your implementation. There is no other way to adjust a DoRA's strength.

My goal is to create DoRAs that work in A1111 and in ComfyUI.

Do you have thoughts on which implementation is correct? I can submit a PR to your project to make this change if it will help.
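
For reference, the behavior I'm relying on is the usual LoRA alpha convention, where alpha rescales the low-rank update before it is added to the base weight. A minimal sketch (assuming the common alpha/rank scaling; not any particular repo's code):

import torch

def merge_lora(weight, A, B, alpha, strength=1.0):
    # Conventional LoRA merge: alpha/rank rescales the low-rank update B @ A,
    # so an author can bake a normalization factor into alpha.
    rank = A.shape[0]
    return weight + strength * (alpha / rank) * (B @ A)

Because alpha enters as a plain multiplier on BA, an author can pick alpha so the LoRA is calibrated for strength 1.0.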

Best,
-NTC

Links:

1: ComfyUI PR comfyanonymous/ComfyUI#3922
2: A1111 implementation https://github.com/AUTOMATIC1111/stable-diffusion-webui/blob/feee37d75f1b168768014e4634dcb156ee649c05/extensions-builtin/Lora/network.py#L210

@KohakuBlueleaf
Owner

I don't get it.
What do you want to do?

Also, the "A1111 implementation" was written by me.
The whole built-in LoRA extension is maintained by me.

@KohakuBlueleaf
Owner

OK, I think I got it.
I will update things on the A1111 side.

@KohakuBlueleaf
Owner

(But I can't be sure that's the point, since that is not an issue of this repo.)
Closed.

@ntc-ai
Author

ntc-ai commented Jul 5, 2024

Hi, your A1111 code was right before the patch; your new patch is wrong.

The code here and in ComfyUI needs changes. I believe alpha should be applied where A1111 applied it; otherwise it's useless for DoRA authors.

My use case is to normalize my DoRAs to the -1 to 1 range, and the only way to do that is with the alpha term as it was written in A1111.

Sorry for the confusion.

@KohakuBlueleaf
Owner

> Hi, your A1111 code was right before the patch; your new patch is wrong.
>
> The code here and in ComfyUI needs changes. I believe alpha should be applied where A1111 applied it; otherwise it's useless for DoRA authors.
>
> My use case is to normalize my DoRAs to the -1 to 1 range, and the only way to do that is with the alpha term as it was written in A1111.
>
> Sorry for the confusion.

Please read the source code of this repo.
Thanks.

The original paper never uses alpha, and I assume they fold alpha into BA directly.

So the modified version is correct.

"Can't achieve what you want" never equals "wrong".

KohakuBlueleaf closed this as not planned on Jul 5, 2024
@ntc-ai
Author

ntc-ai commented Jul 5, 2024

I read through the source code and the whitepaper. As you said, there is no alpha term in the whitepaper.

def apply_dora_scale(org_weight, rebuild, dora_scale, scale):

This code doesn't appear to be used, but it applies scale after the norm.

weight = self.apply_weight_decompose(weight)

This code does seem to be used, although applying scale before apply_weight_decompose changes the weights even at scale=0.

alpha is also most useful when applied after the norm; it is not useful when applied to BA directly.

Some background:
In LoRAs, alpha can be used by model authors to normalize the scale so that users can include the LoRA at strength 1.0.
In DoRAs, changing alpha breaks the model unless it is applied after the norm.
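
To make the scale=0 point concrete, here is a small standalone sketch (my own illustration, not this repo's code; dora_scale and the dim-1 norm are placeholder assumptions):

import torch

def decompose(weight, dora_scale):
    # DoRA-style weight decomposition: rescale `weight` row-wise to the
    # learned magnitude vector `dora_scale`.
    return weight * (dora_scale / weight.norm(dim=1, keepdim=True))

torch.manual_seed(0)
W0 = torch.randn(8, 16)         # base weight
BA = 0.01 * torch.randn(8, 16)  # low-rank update B @ A
dora_scale = W0.norm(dim=1, keepdim=True) + 0.1 * torch.randn(8, 1)  # learned magnitude != ||W0||
scale = 0.0                     # user-chosen strength

# scale applied BEFORE the decomposition: even at scale=0 the result differs
# from W0, because dora_scale no longer matches the norm of W0 after training.
before = decompose(W0 + scale * BA, dora_scale)

# scale applied AFTER the decomposition: scale=0 returns W0 exactly.
merged = decompose(W0 + BA, dora_scale)
after = W0 + scale * (merged - W0)

print(torch.allclose(before, W0))  # False
print(torch.allclose(after, W0))   # True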

Thanks for looking into this.

@KohakuBlueleaf
Owner

I found it.
I should ensure self.multiplier works after the weight decomposition.

But the formula for the A1111-side modification is still correct.

@KohakuBlueleaf
Owner

And the training side assumes multiplier is always 1, so currently this does not affect anything.

@KohakuBlueleaf
Owner

Fixed in dev.

@ntc-ai
Author

ntc-ai commented Jul 5, 2024

Sent some LTC https://nanswap.com/transaction-all/orbf4BBOn9pw

Thanks for the changes; this is tricky. I noticed an issue:

       merged = self.apply_weight_decompose(weight + diff, multiplier)

This needs to apply the scaled diff after the norm.
Here's the pseudocode as I understand it. Happy to send a patch if you'd like.

scale = self.multiplier * self.scale  # alpha in contention
original_weights = weights
weights = weights + BA
weights *= dora_scale / norm(weights)
return original_weights + (weights - original_weights) * scale
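
The same thing as a runnable sketch (the function name and the dim-1 norm are my own, not this repo's API):

import torch

def merge_dora(weight, BA, dora_scale, scale):
    # Merge the low-rank update, renormalize to the learned magnitude,
    # then blend the resulting diff so that scale=0 returns `weight`
    # unchanged and scale=1 gives the full DoRA.
    merged = weight + BA
    merged = merged * (dora_scale / merged.norm(dim=1, keepdim=True))
    return weight + scale * (merged - weight)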

This is how comfy does it:
https://github.com/comfyanonymous/ComfyUI/blob/master/comfy/model_patcher.py#L27

The placement of alpha (the scale term in the pseudocode above) is what is contentious. I believe the variable should behave like multiplier, so that alpha=0 means the diff is turned off. This is how LoRAs work.

This isn't in the whitepaper, so it's up to interpretation. You are right that multiplier is 1 during training, as is self.scale. But if alpha is not trained and can only ever be one specific value, I don't understand its function.
