New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Ideal settings for face PLUS body training on current version #1173
Comments
I join the question. There are images: face only; body only; body and face. In most "body and face" cases, the body is cropped due to the 512px square limit. In rare cases, a part of the face is cropped. |
The ideal settings is to stay below 15 for instance images, make sure they are diverse, you can reach results in less than 800 steps, that's less than 15 minutes, so you can comfortably try different settings until you get the desired result. when you want to resume training, try reducing the learning rate slightly to concentrate on the small details of the picture. |
I personally tried the new settings as I only use dreambooth for faces and the only way I get any good results are by using the previous settings. so 3000 steps for around 20+ photos, 2e-6 unet for both images and text. Trying with 10 / 15 / 20 images with 800 steps or under gives me questionable results, and although people say "just keep adding to the training" not many people can do that as colab has limits. Even though I am a premium user the 15 minute training suddenly takes an hour because of all the minor tweaks you have to do to get it looking anything good. So if I were you use 3000 steps for 20+ images, 2e-6 unet learning rate for both the text encoder and images. Takes around 45 mins on standard colab gpu or 20 mins on premium colab and it will look GREAT first time with no tweaking. |
yeah i had less than ideal results with latest settings and have had to lower the learning rate. i have premium colab. are you quoting total session time or just training time? i am cheap and trying to calculate total time for the job using a premium gpu vs not |
the fast_DreamBooth-Old-Method , always worked for me the first time and I haven't gotten that quality in the models anymore. |
@kozka set the learning rate to 2e-6 for both unet and text_enc and up the unet steps to 3000, this is exactly like before. |
Since they removed the OLD method, NONE of my face results are favorable, except styles, i have no problems with styles, but on faces, i have paid 2 months of Google PRO and NEVER had a good ckpt file no matter what i do and i have been using this since October , ALL WAS GOOD with PRIOR images and the old method...... then after introducing the renaming INSTANCE IMAGES, everything was OK if you followed instructions, but now, after the OLD page was removed, this NEW page only works for me, for styles.... NICE quality, But since 2 weeks from now, all models that i make from a person look ugly, and very hard to get settings correct.... |
send me 10 of your instance images and I will train the model for you to prove that it works |
Hello, |
mixing your face with other faces is a common issue with deep learning models called overfitting. if you want your face to by stylized as painting, you need to reduce the text encoder steps to 250 and its learning rate to 1e-6 |
@TheLastBen ty for your tip. I will do now and i will coment to you the results. |
Hello again @TheLastBen, Seems that the class person are in cclonflict with i am a male? or smth wrong on the prompt? I cant understand why will smith is considered as a man as a base and me i need to put my token as a man.. bla bla .. Thank you so much for this help! i am sooo happy to have some light on this ^^! |
did you used the new CAPTION OPTION and the regularization images section too? |
also how many INSTANCE images you used with this new results? |
Hello @LIQUIDMIND111 UNet_Training_Steps: 3000 |
thanks mate, i will try soon |
I was testing many models and many configurations etc,, |
Thank you @kozka, i will try your params and write here if i get a good results too |
hello again @kozka , with 1600 steps on train text i get a really bad results |
ok I guess the photos I used had something to do with it, |
@kozka i am really don't know... i am using the same photos that i used to create a sd 1.5 model with success but redimensioned to 768. any param works well and dunno why :S. With your OK model coudl you do a test for me? A beautiful portrait of Will Smith, award winning photography and thell me if the pic generated is will smith but similar to your tained model or is will smith 100% |
I think the best thing is to use the normal model 1.5 to generate an image of willsmith next to someone and then put your face to the other person using your personal trained model, using inpainting to put it next to him or something like that, when I tried to train 2 models at the same time sometimes the images come out well and many others don't, *i have tried way 1, take a picture of willsmith with someone and then use my model in inpainting to change the face only |
i didn't consider that -> person it will mix the faces. |
I'm not an expert |
Hello again @kozka !!! So if i am not wrong i need some more unet training steps to get a better and acurate model, right? |
you are right , |
Hello,
first of all thank you for what you are doing for free. This topic is not meant to be a complain and I hope it is clear.
I would like to open a thread and hopefully get good answers from people and from TheLastBen about what are the ideal settings in order to train the model on a single subject and having the best results possible:
Assuming we have access to unlimited images and unlimited time
Wanting to train on both subject face + and face body shots . This can be achieved by including various angles and distances from the subject (including more or less of the body in different images).
Thank you to everyone that will contribute.
The text was updated successfully, but these errors were encountered: