
Enabling a debug dll build under Windows (without CUDA, at least) #42307

Merged
5 commits merged into tensorflow:master on Aug 15, 2020

Conversation

MikhailStartsev
Contributor

Modified the SpaceToDepthOp and DepthToSpaceOp templated classes not to use the SpaceToDepthOpFunctor/DepthToSpaceOpFunctor structs with template parameter Device=GPUDevice in case the class itself is instantiated with Device=CPUDevice. Added a partial template specialization for Device=GPUDevice to preserve the existing behaviour in all cases.

This at least partially (i.e. when bazel is configured not to use CUDA) fixes issue #41118.

…se a SpaceToDepthOpFunctor/DepthToSpaceOpFunctor struct with a template parameter Device=GPUDevice in case the class itself is instantiated with Device=CPUDevice. Added a partial template specialization for Device=GPUDevice to preserve the behaviour in all cases.
@google-ml-butler google-ml-butler bot added the size:M CL Change Size: Medium label Aug 13, 2020
@@ -34,12 +33,14 @@ limitations under the License.
#include "tensorflow/core/platform/logging.h"
#include "tensorflow/core/platform/types.h"
#include "tensorflow/core/util/tensor_format.h"
#include "third_party/eigen3/unsupported/Eigen/CXX11/Tensor"
Contributor Author


The order of includes was changed according to the clang-format tool with -style=google.

@MikhailStartsev
Contributor Author

@rthadur Re-opening PR #42268 with master branch this time.

@gbaned gbaned self-assigned this Aug 13, 2020
@gbaned gbaned added comp:core issues related to core part of tensorflow type:build/install Build and install issues labels Aug 13, 2020
@gbaned gbaned added this to Assigned Reviewer in PR Queue via automation Aug 13, 2020
@gbaned gbaned requested a review from tatianashp August 13, 2020 11:13
@tatianashp tatianashp requested a review from sanjoy August 14, 2020 02:25
PR Queue automation moved this from Assigned Reviewer to Reviewer Requested Changes Aug 14, 2020
@@ -143,6 +127,92 @@ class DepthToSpaceOp : public OpKernel {
TensorFormat data_format_;
};

// Template specialization for GPUDevice, explicit referncing GPUDevice in code
Contributor


End the comment with a period; also there is a typo in "referncing".

Contributor Author


Done

return;
}
}

// NOTE: Assumes data_format_ == FORMAT_NHWC here, since we have rejected
// (CPU && data_format_ != FORMAT_NHWC) in the constructor.

Contributor


Can we specialize just this part that's different between GPU and CPU? Same comment for SpaceToDepthOp.

Contributor Author


That's what I tried to do initially - specializing only the Compute() function - but as far as I understand there is no way to specialize just part of a class definition: the template signature of the function has to match that of the class, so to partially specialize the function you need to partially specialize the class.

Moving the GPU-specialized code into a separate function would result in the same situation - the class would still need to be specialized.

One could probably get around this with some inheritance trickery, but IMO that makes this messier than it should be. Not that I like the code duplication much more...

Contributor


I mean something like:

template <class Device>
struct DepthToSpaceFunctorWrapper {
  void operator()(params) {
    // .. generic implementation
  }
};

template <>
struct DepthToSpaceFunctorWrapper<GPUDevice> {
  void operator()(params) {
    // .. GPU implementation
  }
};

void Compute(...) {
  // Common stuff ..
  auto Toutput = outputs_tensor->tensor<T, kDims>();
  DepthToSpaceFunctorWrapper<Device>{}(params as needed);
}

Contributor Author


Wait, isn't there a much simpler solution? Inside this if clause, it is guaranteed that Device is GPUDevice. So why not replace functor::DepthToSpaceOpFunctor<GPUDevice, T, FORMAT_NCHW> functor; with functor::DepthToSpaceOpFunctor<Device, T, FORMAT_NCHW> functor; inside the clause? This should be identical, no? If the condition is false, we never reach the code with the "wrong" device; if it is true, the code is literally the same.


…to not use a SpaceToDepthOpFunctor/DepthToSpaceOpFunctor struct with a template parameter Device=GPUDevice in case the class itself is instantiated with Device=CPUDevice. Added a partial template specialization for Device=GPUDevice to preserve the behaviour in all cases."

This reverts commit 132e8af.
…unctor::SpaceToDepthOpFunctor<GPUDevice, ...> in case functor::SpaceToDepthOp<CPUDevice, ...> is compiled
@MikhailStartsev
Copy link
Contributor Author

MikhailStartsev commented Aug 14, 2020

Please check out the much more concise change version :)

UPD: Hm. It's more concise alright, but the debug build does not work again. It used to complain about missing symbols for the () operator of SpaceToDepthOpFunctor<GPUDevice, ...>; now I get the same linker errors for SpaceToDepthOpFunctor<CPUDevice, ..., FORMAT_NCHW>, because the CPUDevice functor is only instantiated with FORMAT_NHWC as the last template argument...

PR Queue automation moved this from Reviewer Requested Changes to Approved by Reviewer Aug 14, 2020
@google-ml-butler google-ml-butler bot added kokoro:force-run Tests on submitted change ready to pull PR ready for merge process labels Aug 14, 2020
@kokoro-team kokoro-team removed the kokoro:force-run Tests on submitted change label Aug 14, 2020
@tensorflow-copybara tensorflow-copybara merged commit c33dc04 into tensorflow:master Aug 15, 2020
PR Queue automation moved this from Approved by Reviewer to Merged Aug 15, 2020