Why is original resnet50 faster than octave-resnet50? #4
Comments
I noticed the same thing during training. It actually uses less GPU memory, but training is slower. I'm not sure whether this comes from PyTorch (static vs. dynamic graphs) or from the fact that the authors use TVM to compile the model.
I trained a Mask R-CNN model using octave-resnet-50 and found that it is slower than resnet-50, and its performance is 3 points lower as well.
@BlueAnthony I've observed the same thing on my side; see facebookresearch/OctConv#4. My guess is that the OctConv graph is not well suited to GPUs (possibly due to the two-branch structure, upsampling layers, a larger number of convolutions, more kernel launches, and more memory copies). On CPU, octave conv performs better, as reported in the paper.
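To make the guess above concrete, here is a rough back-of-the-envelope sketch in plain Python. It compares the theoretical FLOPs of one octave convolution against a vanilla convolution with the same channel counts, and tallies the separate operations each one needs. The function name `octave_flops_ratio` and the `OCTAVE_OPS` table are made up for illustration, and the op tally is an assumption about a typical middle octave layer, not a measured kernel-launch count.

```python
def octave_flops_ratio(alpha: float) -> float:
    """FLOPs of one octave conv relative to a vanilla conv (same channels, same kernel).

    alpha is the fraction of channels assigned to the low-frequency branch.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be in [0, 1]")
    # High -> High path runs at full resolution.
    high_to_high = (1 - alpha) ** 2
    # High -> Low, Low -> High and Low -> Low convolutions all run at half
    # resolution, i.e. on a quarter of the spatial positions.
    low_res_paths = (2 * alpha * (1 - alpha) + alpha ** 2) / 4
    return high_to_high + low_res_paths


# Assumed tally of separate operations for one middle octave layer
# (illustrative only; real frameworks may fuse or split these differently).
OCTAVE_OPS = {"conv": 4, "avg_pool": 1, "upsample": 1, "add": 2}
VANILLA_OPS = {"conv": 1}

if __name__ == "__main__":
    alpha = 0.5
    print(f"FLOPs ratio at alpha={alpha}: {octave_flops_ratio(alpha):.4f}")
    print(f"ops per layer: octave={sum(OCTAVE_OPS.values())}, "
          f"vanilla={sum(VANILLA_OPS.values())}")
```

Under these assumptions, a layer with alpha = 0.5 does less than half the arithmetic but issues several times as many small operations, which is consistent with the idea that launch and memory overhead could dominate on a GPU.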
@peri044 Yeah, I also tried Octave Conv on CPU, and it really does show some improvement there.
Why is original resnet50 faster than octave-resnet50?
In inference on a GTX 1080, the original resnet50 runs at about 70 fps, while octave-resnet50 runs at around 42 fps.
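For anyone who wants to reproduce this kind of fps comparison, a minimal timing sketch follows. It is not the script used for the numbers above; `measure_fps` and its parameters are hypothetical names. The one point it illustrates is that GPU work is asynchronous, so a synchronization call is needed before reading the clock. With PyTorch one would typically pass `sync=torch.cuda.synchronize` and have `run_once` call the model under `torch.no_grad()`.

```python
import time


def measure_fps(run_once, n_warmup=10, n_iters=50, sync=None):
    """Return iterations per second for run_once().

    run_once: zero-argument callable performing one forward pass.
    sync: optional zero-argument callable that blocks until queued device
          work has finished (e.g. torch.cuda.synchronize on a GPU).
    """
    # Warm-up iterations let caches, autotuners and lazy initialisation settle.
    for _ in range(n_warmup):
        run_once()
    if sync is not None:
        sync()

    start = time.perf_counter()
    for _ in range(n_iters):
        run_once()
    # Wait for outstanding device work before stopping the clock.
    if sync is not None:
        sync()
    elapsed = time.perf_counter() - start
    return n_iters / elapsed


if __name__ == "__main__":
    # Stand-in workload so the sketch runs without a GPU or any framework.
    fps = measure_fps(lambda: sum(range(10_000)))
    print(f"{fps:.1f} iterations/s")
```

Running both models through the same harness, with the same batch size and input resolution, keeps the comparison like for like.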