Typos in Convolutional Networks page #9

Open
anguyen8 opened this issue Feb 10, 2015 · 2 comments

@anguyen8

Hi Andrej,

Thanks for putting such a great resource about convnets and Deep Learning online. I believe it will be a great help to many people, including me.

I was reading the convnets page and found this minor issue that you might want to fix. The page says:

The Krizhevsky et al. architecture ...  Real-world example: Since (227 - 11)/4 + 1 = 55, we can see that ...

However, the paper actually uses 224x224 crops, so the formula would give (224 - 11)/4 + 1 = 54.25, which is not an integer. I guess that is still a valid configuration, right?

In the case of Zeiler et al. (Visualizing and Understanding Convolutional Networks), the formula also gives a non-integer result: (224 - 7)/2 + 1 = 109.5.
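To double-check the arithmetic for the three cases above, here is a minimal sketch of the output-size formula. The function name `conv_output_size` and the optional `padding` argument are illustrative assumptions, not something taken from the notes or from either paper; the result is kept as an exact fraction so a non-integer size is easy to spot.

```python
from fractions import Fraction


def conv_output_size(input_size, filter_size, stride, padding=0):
    """Spatial output size (W - F + 2P) / S + 1 as an exact fraction.

    `padding` is the amount added on EACH side (assumed symmetric).
    """
    return Fraction(input_size - filter_size + 2 * padding, stride) + 1


# (label, input width W, filter width F, stride S) as quoted in this thread
cases = [
    ("Krizhevsky et al., 227 input", 227, 11, 4),
    ("Krizhevsky et al., 224 input", 224, 11, 4),
    ("Zeiler et al., 224 input", 224, 7, 2),
]

for label, w, f, s in cases:
    size = conv_output_size(w, f, s)
    fits = size.denominator == 1  # an integer means the filter tiles the input evenly
    print(f"{label}: {float(size)} ({'fits' if fits else 'does not fit'})")
```

Only the 227 case gives a whole number; the two 224 cases leave a fractional remainder.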

Best,

Anh

@karpathy
Contributor

Hey Anh, thanks. I should maybe address this explicitly in the notes.

The Caffe model of AlexNet uses 227x227 images, as it should be. These dimensions make sense.

It's not clear to me how Alex made 224x224 images fit, because based on the numbers he gives in the paper the dimensions don't work out. He presumably padded the images in some way that the paper doesn't mention.
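As a rough illustration of how much padding would be needed, the sketch below computes the smallest total padding (both sides combined) that makes the filter tile the input evenly. It assumes the usual output-size formula with padding and a hypothetical helper name `min_total_padding`; it does not claim to reproduce what either paper actually did.

```python
def min_total_padding(input_size, filter_size, stride):
    """Smallest total padding (both sides combined) so that
    (input_size + padding - filter_size) is divisible by stride."""
    remainder = (input_size - filter_size) % stride
    return (stride - remainder) % stride


def padded_output_size(input_size, filter_size, stride):
    """Output size after applying the minimal total padding."""
    total_pad = min_total_padding(input_size, filter_size, stride)
    return (input_size + total_pad - filter_size) // stride + 1


# Assumed settings, taken from the numbers quoted in this thread.
alexnet_pad = min_total_padding(224, 11, 4)
alexnet_out = padded_output_size(224, 11, 4)
zeiler_pad = min_total_padding(224, 7, 2)
zeiler_out = padded_output_size(224, 7, 2)

print(f"224 input, 11x11 filter, stride 4: pad {alexnet_pad} in total -> {alexnet_out}")
print(f"224 input, 7x7 filter, stride 2: pad {zeiler_pad} in total -> {zeiler_out}")
```

Under these assumptions, 224 needs 3 extra pixels in total, which is the same as a 227-wide input. Since 3 is odd, the padding could not be split evenly between the two sides.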

@anguyen8
Author

@karpathy: Thanks for your response. Yes, I agree, it seems they must use padding somewhere.
