
Basic cuDNN v3 support (update) #3160

Merged: 1 commit into BVLC:master from the cudnnV3 branch on Oct 16, 2015

Conversation

shelhamer (Member)

This is the same as #2737 except for

  • Restoring the pooling layer fallback for max + argmax output when the layer is configured to have two tops (see the sketch after this list). The padding check is dropped since padding is now supported by cuDNN pooling.
  • Clearing the warnings for unused variables in CuDNNConvolutionLayer::Forward_gpu() now that the algorithms and workspace are determined in Reshape().
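
For readers unfamiliar with the fallback: it amounts to a check in the pooling layer factory, and the sketch below is only an illustration of that idea, loosely following the shape of Caffe's layer_factory.cpp rather than quoting the diff in this PR. cuDNN pooling cannot produce the argmax mask, so a layer asked for two tops stays on the native Caffe implementation.

```cpp
// Sketch of the pooling-layer fallback, loosely following Caffe's
// layer_factory.cpp conventions. Not the literal code from this PR.
template <typename Dtype>
shared_ptr<Layer<Dtype> > GetPoolingLayer(const LayerParameter& param) {
  PoolingParameter_Engine engine = param.pooling_param().engine();
  if (engine == PoolingParameter_Engine_DEFAULT) {
    engine = PoolingParameter_Engine_CAFFE;
#ifdef USE_CUDNN
    engine = PoolingParameter_Engine_CUDNN;
#endif
  }
  if (engine == PoolingParameter_Engine_CAFFE) {
    return shared_ptr<Layer<Dtype> >(new PoolingLayer<Dtype>(param));
#ifdef USE_CUDNN
  } else if (engine == PoolingParameter_Engine_CUDNN) {
    // cuDNN pooling cannot output the argmax mask, so a layer configured
    // with two tops (pooled output + mask) falls back to Caffe pooling.
    if (param.top_size() > 1) {
      LOG(INFO) << "cuDNN does not support multiple tops. "
                << "Using Caffe's own pooling layer.";
      return shared_ptr<Layer<Dtype> >(new PoolingLayer<Dtype>(param));
    }
    return shared_ptr<Layer<Dtype> >(new CuDNNPoolingLayer<Dtype>(param));
#endif
  } else {
    LOG(FATAL) << "Layer " << param.name() << " has unknown engine.";
  }
  return shared_ptr<Layer<Dtype> >();  // unreachable; silences the compiler
}
```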

Thanks @slayton58 for the integration.

@slayton58 (Contributor)

@shelhamer Regarding the test failure for groups: it seems this->weight_offset_ for the cuDNN routines is being set incorrectly or not set at all (either way, it's wrong!). This seems to have been introduced in 9d8206e.

Setting it back explicitly to

    this->weight_offset_ = (this->num_output_ / this->group_) *
        (this->channels_ / this->group_) * kernel_h * kernel_w;

in CuDNNConvolutionLayer::Setup seems to fix the issue. I'm not sure if there's a better way -- let me know and I'll update the PR.
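
For context, a rough sketch of why this offset matters. The member names below (bottom_offset_, top_offset_, fwd_algo_, workspace, and so on) follow Caffe's cudnn_conv_layer.cu but should be read as assumptions, not a quotation of this PR: each group's filters live at a fixed stride weight_offset_ inside the single weight blob, so a wrong offset makes every group after the first read the wrong filters.

```cpp
// Sketch of grouped convolution in a Forward_gpu-style loop. Shown only to
// illustrate how weight_offset_ indexes per-group filters in one weight blob.
const Dtype* weight = this->blobs_[0]->gpu_data();
for (int i = 0; i < bottom.size(); ++i) {
  const Dtype* bottom_data = bottom[i]->gpu_data();
  Dtype* top_data = top[i]->mutable_gpu_data();
  for (int g = 0; g < this->group_; g++) {
    // Filters for group g start at weight + weight_offset_ * g, so
    // weight_offset_ must equal
    //   (num_output_ / group_) * (channels_ / group_) * kernel_h * kernel_w.
    CUDNN_CHECK(cudnnConvolutionForward(handle_[g],
        cudnn::dataType<Dtype>::one,
        bottom_descs_[i], bottom_data + bottom_offset_ * g,
        filter_desc_, weight + this->weight_offset_ * g,
        conv_descs_[i],
        fwd_algo_[i], workspace[g], workspace_fwd_sizes_[i],
        cudnn::dataType<Dtype>::zero,
        top_descs_[i], top_data + top_offset_ * g));
  }
}
```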

shelhamer added a commit that referenced this pull request on Oct 16, 2015.
shelhamer merged commit 321720d into BVLC:master on Oct 16, 2015.
shelhamer deleted the cudnnV3 branch on October 16, 2015 at 03:17.
@ronghanghu (Member)

Great 👍

@shelhamer (Member, Author)

cuDNN v3 is not itself backward compatible with v2, so adopting v3 in this PR does deprecate v2. We plan to follow the latest cuDNN version in master but keep compatibility as far as the cuDNN interface itself allows.

This inline review was left on the following hunk of the cuDNN convolution backward pass:

      }
      Dtype* bias_diff = NULL;
      if (this->bias_term_ && this->param_propagate_down_[1]) {
        bias_diff = this->blobs_[1]->mutable_gpu_diff();
        caffe_gpu_set(this->blobs_[1]->count(), Dtype(0), bias_diff);
Member

@shelhamer @slayton58 I am a bit confused here: why do we need to zero out the diff? Parameter gradients are supposed to be accumulated, not reset.

@shelhamer (Member, Author)

This is definitely a bug -- thanks for the fix in #3254.
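
For readers following along: Caffe's convention is that layers accumulate parameter gradients into the diff blobs and the solver zeroes them between iterations, so the caffe_gpu_set call in the hunk above wipes out previously accumulated gradients. Below is a minimal sketch of the accumulation-friendly pattern, paraphrasing the idea behind the follow-up fix rather than quoting #3254; the descriptor and offset names are assumed to follow cudnn_conv_layer.cu.

```cpp
// Sketch only (not the literal #3254 patch): accumulate the bias gradient
// instead of resetting it. The solver, not the layer, zeroes parameter diffs.
Dtype* bias_diff = NULL;
if (this->bias_term_ && this->param_propagate_down_[1]) {
  bias_diff = this->blobs_[1]->mutable_gpu_diff();
  // Previously: caffe_gpu_set(this->blobs_[1]->count(), Dtype(0), bias_diff);
  // Dropping the zeroing lets gradients accumulate across calls, as the
  // rest of Caffe expects.
}
for (int i = 0; i < top.size(); ++i) {
  const Dtype* top_diff = top[i]->gpu_diff();
  for (int g = 0; g < this->group_; g++) {
    if (this->bias_term_ && this->param_propagate_down_[1]) {
      // beta = one, so cuDNN adds into the existing bias gradient.
      CUDNN_CHECK(cudnnConvolutionBackwardBias(handle_[0 * this->group_ + g],
          cudnn::dataType<Dtype>::one,
          top_descs_[i], top_diff + top_offset_ * g,
          cudnn::dataType<Dtype>::one,
          bias_desc_, bias_diff + bias_offset_ * g));
    }
  }
}
```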
