
Snapfusion seems to get better results? #25

Closed
jianyuheng opened this issue Aug 29, 2023 · 4 comments
@jianyuheng

Thanks for generously open-sourcing your work. There is a prior work similar to yours, called SnapFusion, which also aims at speeding up Stable Diffusion.

According to their paper, they achieved better results through an efficient U-Net and step distillation, but unfortunately that work is not open source.

Do you have any opinion on this work? https://snap-research.github.io/SnapFusion/

@bokyeong1015
Member

Hi, thanks for your interest :)
SnapFusion has attained impressive results and is concurrent with our work. We sincerely appreciate their research efforts.

Below are potential points of comparison. In short, we've highlighted the potential of classical architectural compression, which remains powerful even under limited resources; meanwhile, SnapFusion has nicely approached both architectural reduction and step distillation.

| | BK-SDM (Ours) | SnapFusion |
| --- | --- | --- |
| U-Net: architecture reduction | O (Block Removal + KD) | O (Architecture Evolving) |
| U-Net: # sampling steps reduction | X | O (Step Distillation) |
| Image Decoder: architecture reduction | X | O (Ch Reduction + KD) |
| Training Data | 0.22M LAION pairs | unclear (from LAION-5B + COYO-700M + internal dataset) |
| Training GPUs | 1 A100 GPU | 16 or 32 nodes (8 A100 GPUs each) for most of the training |

The following directions could be promising:

  • Applying step distillation in conjunction with architectural compression.
  • Extending compression beyond the U-Net to the other components (Image Decoder, Text Encoder).
  • Investigating the impact of training data volume and computational resources.
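For reference, the "Block Removal + KD" entry in the table pairs a pruned student U-Net with output-level and feature-level distillation from the original model. Below is a minimal, hedged sketch of that loss structure in plain PyTorch; the toy modules, shapes, and loss weights are illustrative stand-ins, not the actual BK-SDM training code.

```python
# Sketch of output- and feature-level knowledge distillation (KD),
# as paired with block removal. Toy modules stand in for the U-Nets.
import torch
import torch.nn as nn

class TinyUNet(nn.Module):
    """Stand-in for a (compressed) denoising U-Net."""
    def __init__(self, width: int):
        super().__init__()
        self.mid = nn.Conv2d(4, width, 3, padding=1)   # "feature" layer
        self.out = nn.Conv2d(width, 4, 3, padding=1)   # noise prediction

    def forward(self, x):
        feat = torch.relu(self.mid(x))
        return self.out(feat), feat

teacher = TinyUNet(width=32)   # pretrained model, kept frozen
student = TinyUNet(width=32)   # block-removed model being trained
for p in teacher.parameters():
    p.requires_grad_(False)

mse = nn.MSELoss()
x = torch.randn(2, 4, 8, 8)           # noisy latents
target_noise = torch.randn_like(x)    # ground-truth noise

t_pred, t_feat = teacher(x)
s_pred, s_feat = student(x)

# Total loss = task loss + output KD + feature KD (weights are placeholders).
loss = (mse(s_pred, target_noise)
        + 1.0 * mse(s_pred, t_pred.detach())
        + 1.0 * mse(s_feat, t_feat.detach()))
loss.backward()
```

In the actual method, the feature loss is applied at the outputs of each block, which works without projection layers because block removal preserves the channel widths.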

@jianyuheng
Author

Very detailed comparison, thanks.

@Bikesuffer

Bikesuffer commented Sep 1, 2023

That's a really interesting topic.
I actually tried both approaches for inpainting.
Since SnapFusion is not open source and the authors are not responding, I could only write the training code based on the description in their paper. After 300k training steps, the model still couldn't generate acceptable inpainting results.
Later I tried the BK-SDM approach for inpainting.
I tried SD_small_64, SD_base_64, SD_base_256, SD_small_256, and SD_tiny_64.
All of them could generate acceptable inpainting results after 50K steps.
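For context, adapting a text-to-image Stable Diffusion U-Net (such as the BK-SDM variants) for inpainting typically means widening its first convolution from 4 to 9 input channels, so it can take the noisy latents, the mask, and the masked-image latents concatenated together. A hedged sketch of that surgery on a toy layer (not Bikesuffer's actual code; the 320-channel width mirrors SD's `conv_in` but is otherwise illustrative):

```python
# Sketch: widening a U-Net input conv from 4 to 9 channels for inpainting
# (4-ch noisy latents + 1-ch mask + 4-ch masked-image latents).
import torch
import torch.nn as nn

conv_in = nn.Conv2d(4, 320, kernel_size=3, padding=1)  # pretrained t2i conv_in

new_conv = nn.Conv2d(9, 320, kernel_size=3, padding=1)
with torch.no_grad():
    new_conv.weight.zero_()                  # extra input channels start at zero
    new_conv.weight[:, :4] = conv_in.weight  # reuse pretrained weights
    new_conv.bias.copy_(conv_in.bias)

latents = torch.randn(1, 4, 8, 8)
mask = torch.ones(1, 1, 8, 8)
masked_latents = torch.randn(1, 4, 8, 8)
x = torch.cat([latents, mask, masked_latents], dim=1)  # shape (1, 9, 8, 8)
out = new_conv(x)                                      # shape (1, 320, 8, 8)
```

Zero-initializing the five new input channels means the widened layer initially reproduces the pretrained text-to-image behavior, which tends to make the subsequent inpainting fine-tuning stable.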

@abhigoku10

@Bikesuffer can you share the source for inpainting so that we can check it from our end? Thanks in advance.
