image transforms #115

Merged 2 commits into cloneofsimo:develop on Jan 7, 2023

Conversation

@ethansmith2000 (Contributor)

Making a fork for my own usage, but thought I'd open a PR as well. Thank you for your work!

@brian6091 (Collaborator) commented Jan 6, 2023

Great, this looks a lot cleaner! One issue, though (which applies in general, not just to your PR): if both resize and center_crop are false, you can end up feeding the VAE images at different resolutions. It might be useful to include a check that forces a resize at the end of the pipeline, before converting to a tensor. That way the training script can also handle images of varying sizes.

I've done this here:
https://github.com/brian6091/Dreambooth/blob/mix/src/datasets.py#L112-132
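
For illustration, a minimal sketch of that end-of-pipeline resize, assuming torchvision transforms; the function and argument names here are illustrative rather than the ones used in the repo's training script:

```python
from torchvision import transforms

def build_transforms(size, resize=False, center_crop=False):
    ops = []
    if resize:
        # Resize the shorter side to `size` (keeps aspect ratio).
        ops.append(transforms.Resize(size, interpolation=transforms.InterpolationMode.BILINEAR))
    if center_crop:
        ops.append(transforms.CenterCrop(size))
    if not (resize and center_crop):
        # Safety net: force an exact size x size output so the VAE always sees
        # a consistent resolution, even when both options above are disabled.
        ops.append(transforms.Resize((size, size), interpolation=transforms.InterpolationMode.BILINEAR))
    ops.append(transforms.ToTensor())
    ops.append(transforms.Normalize([0.5], [0.5]))
    return transforms.Compose(ops)
```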

@ethansmith2000 (Contributor, Author)

Good point! I also noticed there isn't an option for caching latents; I'd be happy to make a PR for that as well if it would be useful.

@brian6091 (Collaborator) commented Jan 6, 2023

There is an issue for caching latents: #62

One point is that if you are relying on transformations to augment your data, like color jitter, caching latents won't help you since you need to run the transformations on each batch.

Maybe you could chime in on the discussion to see what others think. My general feeling is to keep the training scripts in the LoRA repo as simple as possible so that others can easily see what they need to do to adapt their own scripts. But if caching latents saves enough memory to let more people train with lower GPU requirements, then it may be worth it.
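
As an illustration of the caching idea, here is a rough sketch assuming a diffusers-style AutoencoderKL and a dataloader that yields a "pixel_values" key; the helper name is hypothetical, and 0.18215 is the latent scaling factor used by Stable Diffusion 1.x:

```python
import torch

@torch.no_grad()
def cache_latents(vae, dataloader, device="cuda"):
    # Encode every training image once and keep the latents on the CPU, so the
    # VAE never has to run inside the training loop. Only valid when the image
    # transforms are deterministic: cached latents cannot reflect per-batch
    # augmentations such as color jitter or random flips.
    cached = []
    for batch in dataloader:
        pixel_values = batch["pixel_values"].to(device, dtype=vae.dtype)
        latents = vae.encode(pixel_values).latent_dist.sample() * 0.18215
        cached.append(latents.cpu())
    return cached
```

One caveat: sampling from the latent distribution once fixes the VAE noise per image; caching the distribution's mean and std instead, and sampling at each step, would preserve that source of randomness at a small memory cost.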

@ethansmith2000 (Contributor, Author)

Ah, I hadn't thought of that; a good reason to keep it optional.
There probably isn't a reliable way to jitter the colors of the latents directly, but maybe you could use the linear decode trick: decode approximately, apply the jitter, and map it back.
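
Spelling the linear decode idea out as a very rough sketch: approximate the VAE decoder with a fitted 4-to-3 linear map, jitter in that approximate RGB space, then project back with the pseudo-inverse. The projection matrix below is only a placeholder; in practice it would have to be fitted (e.g. by least squares) against real decoder outputs.

```python
import torch

# Placeholder 4 -> 3 projection; in practice this would be fitted so that
# projecting latent channels approximates the VAE-decoded RGB image.
latent_to_rgb = torch.randn(4, 3)
rgb_to_latent = torch.linalg.pinv(latent_to_rgb)  # approximate inverse map

def jitter_latents(latents, brightness=0.1):
    """Apply a crude brightness jitter in approximate RGB space.

    latents: (B, 4, H, W) tensor of VAE latents.
    """
    proj = latent_to_rgb.to(latents.device, latents.dtype)
    inv = rgb_to_latent.to(latents.device, latents.dtype)
    rgb = torch.einsum("bchw,cd->bdhw", latents, proj)
    # Per-image brightness scale drawn from [1 - brightness, 1 + brightness].
    scale = 1.0 + brightness * (2 * torch.rand(rgb.shape[0], 1, 1, 1, device=rgb.device) - 1)
    rgb = rgb * scale
    return torch.einsum("bdhw,dc->bchw", rgb, inv)
```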

@brian6091 (Collaborator) commented Jan 6, 2023

Or you could augment in latent space, but I'm not sure anyone knows what that means!

@cloneofsimo cloneofsimo changed the base branch from master to develop January 7, 2023 02:20
@cloneofsimo cloneofsimo merged commit 04089c8 into cloneofsimo:develop Jan 7, 2023
@cloneofsimo (Owner)

Thanks!
