[Bug]: When EMA is enabled it doesn't learn anything. Is this expected? SDXL Base model #892
Replies: 12 comments
-
How are you judging that it learned nothing: by external sampling of a checkpoint, or by OT sampling? If external:
-
@dxqbYD I am checking from saved checkpoints. I am using SwarmUI (ComfyUI backend), and I compared both the saved checkpoints and the final checkpoint. As you can see, the 150-epoch and 200-epoch results are literally the same, and the character looks nothing like my training dataset. Unless there is an additional option that I have to check besides these 2, this is the current status of a fresh OneTrainer installation. The only option that differs between the regular, working training and the EMA training is the setting below, nothing else: GPU vs. off.
-
@dxqbYD here is my EMA preset
-
As I said above, I recommend using OT sampling because you can see both EMA and non-EMA samples.
-
Well, if it doesn't work in the final generated product (the saved safetensors checkpoints), I don't see how I can count it as working, even if it works in the samples generated during training. Also, I had tested EMA previously on only a single subject and it was slightly improving results, but this time it didn't train at all, unlike before. It is not that it merely failed to improve or help.
-
It will not.
-
OK, doing a fresh training right now; will post here.
-
@dxqbYD yes, proven: it is not working. The 3 samples below are with EMA (the non-EMA samples are accurate and learning), at epochs 0, 20, and 30. Now the non-EMA samples, at epochs 0, 20, and 30.
-
Isn't your EMA decay of 0.999 just way too high for a dataset of just 28 images? Use 0.99 or even 0.9, or you will see results much later than after 30 epochs...
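To put a rough number on that suggestion, here is a minimal sketch that assumes a plain EMA update, `ema = decay * ema + (1 - decay) * weights`, applied once per training step with no warmup. Under that assumption the share of the EMA weights that is still the starting checkpoint after `n` updates is `decay ** n`. The function name and the step count (28 images, batch size 1, 30 epochs, no regularization images) are illustrative assumptions, not values confirmed in this thread, and OneTrainer's actual EMA schedule may differ.

```python
def initial_weight_fraction(decay: float, updates: int) -> float:
    """Share of the EMA weights that still comes from the starting
    checkpoint after `updates` plain EMA updates (no warmup assumed)."""
    if not 0.0 <= decay < 1.0:
        raise ValueError("decay must be in [0, 1)")
    if updates < 0:
        raise ValueError("updates must be non-negative")
    return decay ** updates


# Hypothetical step count: 28 images, batch size 1, 30 epochs.
updates = 28 * 30

for decay in (0.999, 0.99, 0.9):
    fraction = initial_weight_fraction(decay, updates)
    print(f"decay={decay}: {fraction:.4f} of the EMA weights is still the starting checkpoint")
```

A larger batch size means fewer updates per epoch, so the starting checkpoint would dominate the EMA weights for even longer at the same decay.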
-
I think you might have shown that it is working: the model learns, it's just not transferred to the EMA model.
I'll change this into a GitHub Discussion because this is likely a matter of parameter tuning.
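The distinction between the trained model and the EMA model can be illustrated with a toy sketch. It tracks a single number instead of real network weights, and the class name `EmaShadow`, the fake "training" rule, and the step count are all made up for illustration; it assumes a plain EMA update once per step and is not OneTrainer's code.

```python
class EmaShadow:
    """Keeps an exponential moving average of a single scalar 'weight'."""

    def __init__(self, initial: float, decay: float) -> None:
        self.value = initial
        self.decay = decay

    def update(self, live_value: float) -> None:
        # Plain EMA step: mostly keep the old average, blend in a little of the new value.
        self.value = self.decay * self.value + (1.0 - self.decay) * live_value


live = 0.0      # stands in for a weight being trained
target = 1.0    # value the toy "training" converges to
ema = EmaShadow(initial=live, decay=0.999)

for step in range(1, 301):
    live += 0.05 * (target - live)  # toy training step: move 5% closer to the target
    ema.update(live)
    if step % 100 == 0:
        print(f"step {step}: live={live:.3f}  ema={ema.value:.3f}")
```

In this toy run the live value reaches the target quickly while the EMA copy lags far behind, which is the kind of gap that would show up when non-EMA samples look trained but an EMA checkpoint does not.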
-
Just to let you know, the model was trained for 5600+5600 (subject + reg images) steps and it still didn't have any impact, but I am testing with EMA decay 0.9 and 0.99 right now.
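For comparing the decay values being tested, a rough sketch of how many updates a plain EMA would need before the starting checkpoint contributes no more than 5% of the averaged weights. It assumes one EMA update per training step (update interval 1) and no warmup or decay schedule; the function name and the 5% threshold are arbitrary illustrative choices, and OneTrainer's implementation may behave differently.

```python
import math


def updates_until_residual(decay: float, residual: float = 0.05) -> int:
    """Smallest number of plain EMA updates after which the starting
    weights contribute at most `residual` of the averaged weights."""
    if not 0.0 < decay < 1.0:
        raise ValueError("decay must be in (0, 1)")
    if not 0.0 < residual < 1.0:
        raise ValueError("residual must be in (0, 1)")
    # Solve decay ** n <= residual for the smallest integer n.
    return math.ceil(math.log(residual) / math.log(decay))


for decay in (0.9, 0.99, 0.999):
    print(f"decay={decay}: about {updates_until_residual(decay)} updates")
```

The resulting counts can be compared against the number of optimizer steps actually run, keeping in mind that a larger update interval would stretch them proportionally.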
-
@FurkanGozukara can you share your loss graphs, both smooth loss and loss?
-
What happened?
With EMA on, there is zero learning. The settings are exactly the same except for EMA being on or off.
EMA off:
150 epochs: working as expected
EMA on (GPU), decay 0.999, update interval 1:
150 epochs
200 epochs