Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Why is the number of output channels of the first convolution in the first layer 32? #3

Closed
niranjantdesai opened this issue Jun 18, 2018 · 2 comments

Comments

@niranjantdesai
Copy link

According to Appendix A in the paper, for the CIFAR datasets, the number of output channels of the three scales is set to 6, 12 and 24 respectively. However, num_channels is set to 32 in msdnet.py. This means that the number of output channels in the first layer for the three scales is 32, 64 and 128 respectively according to the default growth rate 1-2-4-4. Why is there a difference between the implementation details in the paper and the code?

@avirambh
Copy link
Owner

avirambh commented Jun 29, 2018

Hi @niranjantdesai, sorry for the late reply.
If I understand correctly, the original implementation initializes the first layer to initChannels, while growth rate channels (layer's output) are being concatenated to these.
Note that the number of initChannels has been changed in the original implementation 11 days ago:
gaohuang/MSDNet@fc14920.
I didn't find initChannels description in the original paper, so please let me know if you understand this differently.

@niranjantdesai
Copy link
Author

@avirambh You're right. In this discussion, the original author says that the first layer has a slightly different structure. It is usually set to be twice the width of the subsequent layers, following the design of DenseNet. This is not explicitly mentioned in the paper.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants