You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
should y_embedding in CaptionEmbedder class be fixed as the embedding of null text when training PixArt-alpha from scratch? instead of randomly initialized.
@lawrence-cj Got it. But why inference codes in pixart-alpha use model.y_embedder.y_embedding as the null text embedding instead of using real null text embedding produced by T5 encoder, which is adopted by PixArt-sigma inference code? I check the model.y_embedder.y_embedding and found it is not the same as the real null text embedding produced by T5 encoder.
It's a historical problem. Later in sigma this problem is solve, but it won't influence much. BTW, the inference code in alpha is kinda outdated. The app.py and so on can pass the negative prompt into T5.
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
should y_embedding in CaptionEmbedder class be fixed as the embedding of null text when training PixArt-alpha from scratch? instead of randomly initialized.
Beta Was this translation helpful? Give feedback.
All reactions