Hi there!
I am currently training a DDPM model on a custom image dataset following the cool unconditional_image_generation example script.
Since I don't have the compute for comprehensive hyperparameter tuning of my architecture, I was wondering whether there are any common intuitions for designing the UNet denoiser: the width and depth of the residual blocks, the number and positions of the attention blocks, etc., as a function of the number of samples in the training set and their resolution.
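To make the question concrete, here is a minimal sketch of the knobs I mean, written as keyword arguments for diffusers' `UNet2DModel`. The helper `unet_kwargs` and its parameter names (`base_channels`, `channel_mults`, `attn_levels`) are hypothetical, and the default values are only meant to resemble the example script's configuration, not to be recommendations.

```python
def unet_kwargs(sample_size, base_channels=128,
                channel_mults=(1, 1, 2, 2, 4, 4),
                attn_levels=(4,), layers_per_block=2):
    """Build keyword arguments for diffusers' UNet2DModel.

    Hypothetical helper: all defaults are assumptions for illustration.
    """
    n = len(channel_mults)
    # Width: output channels of each resolution level.
    block_out_channels = tuple(base_channels * m for m in channel_mults)
    # Attention placement: use an attention block at the chosen levels only.
    down = tuple("AttnDownBlock2D" if i in attn_levels else "DownBlock2D"
                 for i in range(n))
    # The up path mirrors the down path in reverse order.
    up = tuple("AttnUpBlock2D" if (n - 1 - i) in attn_levels else "UpBlock2D"
               for i in range(n))
    return dict(
        sample_size=sample_size,            # training image resolution
        in_channels=3,
        out_channels=3,
        layers_per_block=layers_per_block,  # depth: ResNet layers per level
        block_out_channels=block_out_channels,
        down_block_types=down,
        up_block_types=up,
    )


cfg = unet_kwargs(sample_size=64)
print(cfg["down_block_types"])
```

With diffusers installed, the result could be passed on as `UNet2DModel(**cfg)`. What I am unsure about is how `base_channels`, `channel_mults`, `layers_per_block` and `attn_levels` should change with dataset size and resolution.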
If anyone has extensive experience training diffusion models, it would be super cool to share insights here or in a dedicated blog post, like the one discussing hyperparameter choices when training Dreambooth.
Thank you! 🤗
louixp