New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Dreambooth support #995
Comments
+1 Will also note there have been discussions of making it easy to generate (or import) new concepts from the WebUI. Should support both textual inversion & dreambooth, and plans include having a "library" of these for ongoing use. I think, given the purpose and intent of this repo, full integration should be the aim. |
Interesting comparison:
|
Even better, an actual implementation in 10G VRAM https://www.reddit.com/r/StableDiffusion/comments/xtc25y/dreambooth_stable_diffusion_training_in_10_gb/?context=3 |
@tildebyte I tried this Colab (the one in the reddit post you share), but in the free tier, you have to pass It was a few days ago, so I may try again, just to see if they've introduced any other improvement. |
Oh, nvm. There's still a free tier, but they've added a "pay-as-you-go" tier. TL;DR - I haven't done anything in Colab since SD dropped 😁 |
I think there is potential in TI, but I want to get Dreambooth to work, and compare them. @tildebyte Here are some TI results, for comparison with what you refer to in #995 (comment) Workflow 1Use
Workflow 2Obtain a concept with
(By the way, the training images in #517 (comment) were personally created by the author of the comment, and the character will appear on some project -which I find pretty cool- so you can use them to test TI but don't share them as training set) I've also managed to train using a couple other concepts (hamburger, dog cartoon and myself), but results weren't that good. For people, I wonder if using some 'beautify' filter on the training set wold help, to make sure the face looks closer to a 3D character -very smooth, so it doesn't have to learn patterns inside the face, e.g. cheek colors, smiles, wrinkles, etc. and can focus on learning more generally the face shape). |
All in all, I tend to prefer the " |
@tildebyte But yeah, the word out there seems to be Dreambooth is better or easier to use. For example, this is a quick attempt (using It may still be possible to do these things with Textual Inversion, but it may require a more complex workflow, while Dreambooth may be easier to use. Still, if someone has successfully used free Colab for Dreambooth and has had success, you are encouraged to share it here! The more info we have (what Colab, obviously, but also number of training images, lighting, closeness...), the better for us to implement it in the repo and document it. |
Dreambooth is INCREDIBLE. No contest. |
Update: I've tried https://colab.research.google.com/github/ShivamShrirao/diffusers/blob/main/examples/dreambooth/DreamBooth_Stable_Diffusion.ipynb a bunch of times and I can't get it to work well. It does run, and the result has some resemblance, but that's all. Last attempt was with 50 training images and 150 reg images. |
This issue is to discuss Dreambooth (whether fully integrating it in this repo -training and inference-, or training via 3rd party, for example Colab, and doing inference in this repo).
Discussion and comparison with regular Textual Inversion is also encouraged.
The text was updated successfully, but these errors were encountered: