Reproducibility of RFB speed #46

underfitting · 2018-10-11T12:40:47Z

Hi,

I think you should add torch.cuda.synchronize() inside timer(e.g. after net(x) ), because CUDA is asynchronous.
By adding this, I got ~0.12s/forward.

The text was updated successfully, but these errors were encountered:

GOATmessi8 · 2018-10-12T15:54:43Z

The current inference time measurement followed as faster rcnn. Moreover, each forward only process a single image at test, so we do not need synchronize to depress the power of CUDA.

GOATmessi8 closed this as completed Oct 12, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reproducibility of RFB speed #46

Reproducibility of RFB speed #46

underfitting commented Oct 11, 2018

GOATmessi8 commented Oct 12, 2018

Reproducibility of RFB speed #46

Reproducibility of RFB speed #46

Comments

underfitting commented Oct 11, 2018

GOATmessi8 commented Oct 12, 2018