

eps for GroupNorm #5

Closed
Asthestarsfalll opened this issue Jul 22, 2022 · 5 comments

Comments

@Asthestarsfalll
Contributor

Asthestarsfalll commented Jul 22, 2022

Great work!
The parameter `eps` in GroupNorm is initialized to 1e-5 by default.
However, GroupNorm in TensorFlow differs slightly: it is initialized with 1e-6.
It may not have any influence on training results, but could you change this (for every GroupNorm in the code) so the two align?
Since I want to convert trained models from torch or tf to MegEngine, the smaller the error, the better.
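The effect of the eps mismatch can be seen directly in the normalization formula y = (x - mean) / sqrt(var + eps). Below is a minimal NumPy sketch (not the actual MegEngine/PyTorch code, and without the affine weight/bias) comparing the two defaults:

```python
import numpy as np

def group_norm(x, num_groups, eps):
    # x: (N, C, H, W); plain group normalization, no affine parameters
    n, c, h, w = x.shape
    g = x.reshape(n, num_groups, -1)
    mean = g.mean(axis=-1, keepdims=True)
    var = g.var(axis=-1, keepdims=True)
    y = (g - mean) / np.sqrt(var + eps)
    return y.reshape(n, c, h, w)

rng = np.random.default_rng(0)
x = rng.normal(size=(1, 4, 8, 8)).astype(np.float32)

# eps=1e-5 (PyTorch default) vs eps=1e-6 (TensorFlow DDPM code)
diff = np.abs(group_norm(x, 2, 1e-5) - group_norm(x, 2, 1e-6)).max()
print(diff)  # tiny per layer, but it accumulates across the whole network
```

The per-layer difference is small, which is why it only shows up when comparing converted checkpoints end to end.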

@ChaiByte
Contributor

ChaiByte commented Jul 22, 2022

Thanks for your attention! The DDPM model was initially based on a PyTorch implementation, and I'm glad to hear that you are willing to convert the original pre-trained model to MegEngine. Here is some information that might be helpful:

  • I only checked that the forward process is consistent with the PyTorch version I referenced; I'm not sure whether all details of the original TensorFlow version are implemented.
  • Other converted ckpts: https://github.com/pesser/pytorch_diffusion

In my opinion, conversion scripts are also important for users to understand where converted pre-trained models come from. So I suggest you upload them to this repo, which could encourage more users to join us.

Btw, I'm not sure yet how to develop this library in the future; I hope it will help more people understand the implementation of diffusion models. (OpenAI's improved/guided codebases are great but lack readability.)

@ChaiByte
Contributor

While developing this repo, I wrote some notes in Chinese to deepen my own understanding of diffusion models. Here is a post: https://meg.chai.ac.cn/ddpm-megengine/ You are welcome to read it and give me some advice.

@Asthestarsfalll
Contributor Author

Asthestarsfalll commented Jul 24, 2022

I'm willing to upload my conversion code, but it didn't work well after converting.
The error between the MegEngine and PyTorch implementations was high with the same input.
This is because the convolution padding in Downsample differs: the PyTorch implementation uses asymmetric padding.
After I modified the MegEngine implementation, here is the result:

class DownSample(M.Module):
    """"A downsampling layer with an optional convolution.

    Args:
        in_ch: channels in the inputs and outputs.
        use_conv: if ``True``, apply convolution to do downsampling; otherwise use pooling.
    """""

    def __init__(self, in_ch, with_conv=True):
        super().__init__()
        self.with_conv = with_conv
        if with_conv:
            self.main = M.Conv2d(in_ch, in_ch, 3, stride=2)
        else:
            self.main = M.AvgPool2d(2, stride=2)

    def _initialize(self):
        for module in self.modules():
            if isinstance(module, M.Conv2d):
                init.xavier_uniform_(module.weight)
                init.zeros_(module.bias)

    def forward(self, x, temb):  # unused temb param kept only for a consistent interface
        if self.with_conv:
            x = F.nn.pad(x, [*[(0, 0)
                         for i in range(x.ndim - 2)], (0, 1), (0, 1)])
        return self.main(x)

[screenshot: error comparison between MegEngine and PyTorch outputs after the fix]
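The padding mismatch above can be illustrated with a toy 1-D convolution (a NumPy sketch, not the actual model code): asymmetric (0, 1) padding and symmetric (1, 1) padding give the same output length here, but different border values, which is exactly why the converted weights disagreed.

```python
import numpy as np

def conv1d(x, k, stride):
    # naive valid-mode strided 1-D convolution (cross-correlation)
    return np.array([x[i:i + len(k)] @ k
                     for i in range(0, len(x) - len(k) + 1, stride)])

x = np.arange(1.0, 9.0)  # length-8 input
k = np.ones(3)           # 3-tap kernel, stride 2

asym = conv1d(np.pad(x, (0, 1)), k, stride=2)  # TF/DDPM-style (0, 1) padding
sym = conv1d(np.pad(x, (1, 1)), k, stride=2)   # symmetric padding=1

print(asym)  # [ 6. 12. 18. 15.]
print(sym)   # [ 3.  9. 15. 21.]
```

Both outputs have length 4, so a shape check alone would not catch the bug; only comparing values reveals it.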

Btw, I'm also a beginner in ddpm, your blog helps me a lot!

@ChaiByte
Contributor

Got it. I'm not available at the moment; I will check the padding mode and #6 after my day off.

@ChaiByte
Contributor

The initial eps value has been updated and I will close this issue now to keep tracking the same thing in one issue.

Feel free to reopen it if you have any questions or suggestions.
