Finetune SDXL #35

dydxdt · 2024-05-17T03:17:36Z

Thanks for your good work. I used the provided SDXL weights to finetune on my own data, but the loss does not seem to converge. Were the provided weights trained at 1024 resolution? I tested the finetuned model and it has not learned the style of the training data. Do you have any advice? Thanks.

huiyang865 · 2024-06-03T03:15:38Z

I ran into a similar problem: when training the SDXL model at 1024 resolution, the loss does not seem to converge.

The model training configuration is as follows:

accelerate launch examples/brushnet/train_brushnet_sdxl.py \
--pretrained_model_name_or_path /disk1/BrushNet/data/ckpt/anything-xl \
--brushnet_model_name_or_path /disk1/BrushNet/data/ckpt/random_mask_brushnet_ckpt_sdxl_v0 \
--output_dir runs/logs/selfdata_brushnetsdxl_1024 \
--train_data_dir /disk1/data/self_developed_animate_data \
--resolution 1024 \
--max_train_steps 100000 \
--learning_rate 1e-5 \
--train_batch_size 1 \
--gradient_accumulation_steps 4 \
--tracker_project_name brushnet \
--report_to tensorboard \
--resume_from_checkpoint latest \
--validation_steps 1000 \
--checkpointing_steps 1000 \
--random_mask
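For reference, the effective batch size implied by these flags can be worked out with a few lines of arithmetic. This is only a minimal sketch under two assumptions that the command above does not confirm: that training runs on a single process (the number of GPUs is not shown), and that `--max_train_steps` counts optimizer updates rather than forward passes. The helper names `effective_batch_size` and `samples_seen` are hypothetical and are not part of the training script.

```python
# Hypothetical helpers for illustration; they are not part of train_brushnet_sdxl.py.

def effective_batch_size(train_batch_size, gradient_accumulation_steps, num_processes=1):
    """Number of samples that contribute to one optimizer update."""
    return train_batch_size * gradient_accumulation_steps * num_processes


def samples_seen(max_train_steps, batch):
    """Total samples processed, assuming each step is one optimizer update."""
    return max_train_steps * batch


# Values copied from the command above; num_processes=1 is an assumption.
batch = effective_batch_size(train_batch_size=1, gradient_accumulation_steps=4)
total = samples_seen(max_train_steps=100_000, batch=batch)

print(f"effective batch size: {batch}")
print(f"samples seen over the full run: {total}")
```

Under these assumptions each update averages gradients over only 4 samples, which may be worth keeping in mind when reading the loss curve.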

The training log shows the following (screenshot not preserved):

@dydxdt, did you manage to solve the problem later?

dydxdt · 2024-06-06T01:34:47Z

I haven't figured it out yet. It sucks. Hoping for some helpful advice -_- @huiyang865

huiyang865 · 2024-06-13T10:51:56Z

Thanks for your reply.

Does the BrushNet part have the same structure for SDXL and SD1.5? Looking at the code, I found that BrushNet features are not injected into the Refiner module of XL. Is that right?

Could these factors limit the convergence of the XL version? I look forward to your reply. Thank you very much. @dydxdt
