Question about difference of training SDXL between kohya and diffusers #2534
SeungHwa92
started this conversation in
General
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
I am working on training SDXL Lora so I review the kohya code and diffusers code.
I was checking whether kohya and diffusers have same output of text_encoder_2(hidden_states2, pool2).
hidden_state2 values are same but pool2 have different values.
I found that the way of calculating pool2 are different.
the code for calculating pool2 from kohya and diffusers belows.
Difference start from this.
kohya uses "text_encoder_2_output['last_hidden_state']" or "text_encoder_2_output[1]" for pool2
but diffusers uses "text_encoder_2_output['text_embeds']" or "text_encoder_2_output[0]" for pool2
if you have any insights for implementing this code.
Can you share your insights ?
Beta Was this translation helpful? Give feedback.
All reactions