Hi!
Thanks for kindly sharing your code! I ran into a problem when running it for CIFAR-10 classification: when I change the kernel size of the convolutional layers in each block from 3x3 to 1x1, the running time per epoch increases from about 3.05s to 4.11s on a Titan X. However, a 3x3 convolution should always consume more computational resources than a 1x1 convolution, so I am confused. Could you help analyze whether the problem is in your code or in TensorFlow's optimization?
Thanks again!
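To make the cost intuition in the question concrete, here is a rough multiply-accumulate (MAC) count for a single convolutional layer. The shapes are illustrative assumptions, not taken from the repository's actual network:

```python
# Rough multiply-accumulate (MAC) count for one conv layer. The layer
# shape (32x32 spatial, 64 in/out channels) is a hypothetical example.
def conv_macs(h, w, c_in, c_out, k):
    # Each of the h*w*c_out output elements needs k*k*c_in MACs
    return h * w * c_out * (k * k * c_in)

macs_3x3 = conv_macs(32, 32, 64, 64, 3)
macs_1x1 = conv_macs(32, 32, 64, 64, 1)
print(macs_3x3 // macs_1x1)  # → 9: a 3x3 conv does 9x the arithmetic of a 1x1
```

By this arithmetic alone, the 1x1 network should be much cheaper, which is why the measured slowdown is surprising; the gap has to come from something other than raw FLOPs (kernel launch overhead, cuDNN algorithm selection, memory layout, etc.).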
Hi!
That's really strange behaviour. I've examined the code and haven't found any mistakes. Of course, it can depend heavily on the CUDA convolution and parallelization implementation itself. You could print all the tensor shapes in the network with 3x3 kernels and with 1x1 kernels, then create dummy variables with TensorFlow and, with the help of Python's timeit module, measure the execution time of each component separately. Maybe this will point you in the right direction.
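A minimal sketch of the timeit methodology suggested above. To keep it self-contained it times a naive pure-Python convolution on a dummy feature map rather than a TensorFlow op; with TensorFlow installed, you would instead build dummy variables matching each printed layer shape and time the corresponding `sess.run(...)` calls the same way:

```python
import timeit

def conv2d(feature_map, k):
    """Naive 'valid' convolution of a 2-D map with a k x k kernel of ones."""
    h, w = len(feature_map), len(feature_map[0])
    out = []
    for i in range(h - k + 1):
        row = []
        for j in range(w - k + 1):
            # Sum the k x k window anchored at (i, j)
            row.append(sum(feature_map[i + di][j + dj]
                           for di in range(k) for dj in range(k)))
        out.append(row)
    return out

fmap = [[1.0] * 32 for _ in range(32)]  # dummy 32x32 feature map

# Time each "component" in isolation, as the reply suggests
t1 = timeit.timeit(lambda: conv2d(fmap, 1), number=20)
t3 = timeit.timeit(lambda: conv2d(fmap, 3), number=20)
print(f"1x1: {t1:.4f}s   3x3: {t3:.4f}s")
```

Timing each kernel size component-by-component like this separates per-call overhead from actual arithmetic cost, which is exactly the distinction needed to explain the observed slowdown.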
tjulyz changed the title "Problem about the runting time." to "Problem about the running time." on Jan 8, 2018