Why self-conditioning? #94

Silverster98 · 2022-09-24T11:37:10Z

Are there any papers or explanations demonstrating the effectiveness of the self-conditioning implementation? Or is it just an empirical trick?

denoising-diffusion-pytorch/denoising_diffusion_pytorch/denoising_diffusion_pytorch.py

Lines 656 to 668 in 6e8a0f2

    
    # if doing self-conditioning, 50% of the time, predict x_start from current set of times
    # and condition with unet with that
    # this technique will slow down training by 25%, but seems to lower FID significantly

    x_self_cond = None
    if self.self_condition and random() < 0.5:
        with torch.no_grad():
            x_self_cond = self.model_predictions(x, t).pred_x_start
            x_self_cond.detach_()

    # predict and take gradient step

    model_out = self.model(x, t, x_self_cond)
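
For context, the snippet above is only the training half. As I understand it, at sampling time the model is conditioned on the `x_start` estimate it produced at the previous step. Here is a rough, dependency-free sketch of that loop. The names `sample_with_self_cond`, `toy_model` and `toy_update` are hypothetical and are not the repo's API. Scalars stand in for image tensors, and the update rule is a placeholder rather than a real DDPM/DDIM step.

```python
def toy_model(x, t, x_self_cond=None):
    """Hypothetical stand-in for the U-Net: returns an estimate of x_start."""
    estimate = 0.5 * x
    if x_self_cond is not None:
        # Blend in the previous estimate, mimicking a network that uses it as extra input.
        estimate = 0.5 * (estimate + x_self_cond)
    return estimate


def toy_update(x, x_start, t, timesteps):
    """Placeholder for the x_t -> x_{t-1} step (NOT a real diffusion update)."""
    return x_start + (t / timesteps) * (x - x_start)


def sample_with_self_cond(model, x, timesteps, self_condition=True):
    """Reverse loop that feeds each step's x_start estimate into the next step."""
    x_start = None  # no estimate exists before the first step
    seen = []       # what the model was conditioned on at each step, for inspection
    for t in reversed(range(timesteps)):
        self_cond = x_start if self_condition else None
        seen.append(self_cond)
        x_start = model(x, t, self_cond)
        x = toy_update(x, x_start, t, timesteps)
    return x, seen


if __name__ == "__main__":
    final, seen = sample_with_self_cond(toy_model, 4.0, timesteps=5)
    print(final, seen)
```

The first step sees `None` because no estimate is available yet. That appears to be why training drops the conditioning half of the time: the network has to work both with and without it.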

yiyixuxu · 2022-09-26T05:51:19Z

See the paper here: https://arxiv.org/abs/2208.04202

tuttyfrutyee · 2022-10-18T10:38:04Z

Self-conditioning does not work for me, btw (the training loss refuses to decrease), but the data I work with might be too noisy compared to natural images.

1049451037 mentioned this issue Oct 11, 2022

Why first predict x_start then predict x_{t-1}? #103

Closed

murrellb mentioned this issue Feb 27, 2023

Roadmap MurrellGroup/Diffusions.jl#5

Open

12 tasks
