The vanilla CNN downsampling architecture cannot recover the spatial information of an image #4

zhiqwang · 2019-04-15T12:10:05Z

The convolutional part of the architecture acts as an encoder: it captures the image's contextual information. The architecture should be combined with a decoder (a deconvolution layer or an RNN layer) to recover the image's spatial information.
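To make the encoder/decoder idea concrete, here is a minimal PyTorch sketch, assuming the RNN variant of the decoder. It is not the model in this repository: the class name `ConvRecurrentSketch`, the layer sizes, the 37-class output and the height averaging are all hypothetical choices made for illustration.

```python
import torch
import torch.nn as nn


class ConvRecurrentSketch(nn.Module):
    """Hypothetical encoder/decoder pair; not the repository's model."""

    def __init__(self, num_classes: int = 37, hidden: int = 64):
        super().__init__()
        # Encoder: two 2x2 poolings, so height and width both shrink by 4.
        self.encoder = nn.Sequential(
            nn.Conv2d(1, 32, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.MaxPool2d(2),
            nn.Conv2d(32, 64, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.MaxPool2d(2),
        )
        # Decoder: a bidirectional LSTM that reads the feature columns
        # from left to right, one step per remaining width position.
        self.rnn = nn.LSTM(64, hidden, bidirectional=True, batch_first=True)
        self.classifier = nn.Linear(2 * hidden, num_classes)

    def forward(self, images: torch.Tensor) -> torch.Tensor:
        features = self.encoder(images)       # (N, 64, H/4, W/4)
        features = features.mean(dim=2)       # average over height: (N, 64, W/4)
        sequence = features.permute(0, 2, 1)  # (N, W/4, 64)
        outputs, _ = self.rnn(sequence)       # (N, W/4, 2 * hidden)
        return self.classifier(outputs)       # (N, W/4, num_classes)


model = ConvRecurrentSketch()
logits = model(torch.zeros(2, 1, 32, 100))  # two grayscale 32x100 images
print(logits.shape)
```

The point of the sketch is only that the recurrent layer keeps one prediction per horizontal position, so the left-to-right order of the features survives the downsampling.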

zhiqwang · 2019-04-16T02:45:48Z

The CNN architectures currently implemented here fall into two categories according to how much they downsample the image's width: densenet121 compresses the width to 1/8, and densenet_cifar compresses it to 1/4. So the current network architecture cannot handle the case where the text has different widths.
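A small arithmetic sketch of the two downsampling factors mentioned above. The helper `output_steps` is hypothetical, and it assumes the width is floored after division; the real layers may round differently depending on padding.

```python
# Width downsampling factors as stated in this issue.
DOWNSAMPLING = {"densenet121": 8, "densenet_cifar": 4}


def output_steps(width: int, factor: int) -> int:
    """Feature columns left after the width is divided by `factor` (floor assumed)."""
    return width // factor


for width in (100, 280, 560):
    for name, factor in DOWNSAMPLING.items():
        print(f"{name}: input width {width} -> {output_steps(width, factor)} columns")
```

With a fixed factor, the number of output columns is tied directly to the input width, so images of different widths yield feature maps of different lengths.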

zhiqwang mentioned this issue Apr 15, 2019

annotation file format for English data #5

Closed

zhiqwang mentioned this issue Apr 21, 2019

Not found recurrent layer in model files #6

Open

zhiqwang added a commit that referenced this issue Apr 24, 2019

Update readme (#4, #6)

bccb547

zhiqwang mentioned this issue May 22, 2019

dimensions in forward pass #16

Closed

zhiqwang added the enhancement label May 22, 2019

zhiqwang added this to To do in image captioning Jun 11, 2019

zhiqwang moved this from To do to In progress in image captioning Jun 11, 2019

zhiqwang moved this from In progress to Done in image captioning Jun 21, 2019

zhiqwang moved this from Done to In progress in image captioning Jun 21, 2019

zhiqwang moved this from In progress to To do in image captioning Jun 21, 2019
