Enable tensor core for cudnn conv #9623

kexinzhao · 2018-04-04T00:59:43Z

Enabling tensor core for float16 cudnn conv has been tested to provide significant speedup.

chengduoZH

LGTM!

kexinzhao added 2 commits April 3, 2018 17:50

enable tensor core for conv cudnn

187ba08

fix cpplint error

9ba3660

kexinzhao requested a review from chengduoZH April 4, 2018 01:40

chengduoZH approved these changes Apr 4, 2018

View reviewed changes

kexinzhao merged commit d904b3d into PaddlePaddle:develop Apr 4, 2018

kexinzhao deleted the enable_cudnn_tensor_core branch April 4, 2018 17:32

Xreki added the 预测原名Inference，包含Capi预测问题等 label Apr 11, 2018

Xreki added this to Support FP16 in Inference Framework Apr 12, 2018

Provide feedback