
FusedBatchNorm & Conv2D backwards doesn't support zero batch size #14657

Closed
ppwwyyxx opened this issue Nov 17, 2017 · 10 comments · Fixed by #15264

@ppwwyyxx
Contributor

ppwwyyxx commented Nov 17, 2017

Most ops in TF work well with tensors that have zero elements. However, convolution / FusedBatchNorm with cuDNN gives the following error:

2017-11-17 08:00:20.835113: F tensorflow/stream_executor/cuda/cuda_dnn.cc:444] could not convert BatchDescriptor {count: 0 feature_map_count: 1024 spatial: 28 28  value_min: 0.000000 value_max: 0.000000 layout: BatchDepthYX} to cudnn tensor descriptor: CUDNN_STATUS_BAD_PARAM 

I would expect it to check for this case and return a 4D tensor with zero batch size. Currently I have to work around it with tf.cond.
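
A tf.cond workaround can look roughly like this; a minimal sketch, with the helper name and the guarded op chosen for illustration rather than taken from the report:

import tensorflow as tf

def fused_bn_or_passthrough(x, scale, offset):
    # Build the cuDNN FusedBatchNorm only inside the tf.cond branch, so the
    # kernel runs only when the batch dimension is non-zero.
    def run_bn():
        y, _, _ = tf.nn.fused_batch_norm(x, scale, offset, data_format='NCHW')
        return y
    # When the batch is empty, return the (empty) input unchanged.
    return tf.cond(tf.shape(x)[0] > 0, run_bn, lambda: x)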

@cy89

cy89 commented Nov 17, 2017

@zheng-xq this seems like it falls between the boundaries of cuDNN and TF. Is this something we might fix, or should I set to "contributions welcome"?

cy89 added the "stat:awaiting tensorflower" (Status - Awaiting response from tensorflower) label on Nov 17, 2017
@facaiy
Member

facaiy commented Nov 18, 2017

Perhaps the documentation of cudnnSetTensorNdDescriptor is helpful for the discussion.

Return Value: CUDNN_STATUS_BAD_PARAM
Meaning: At least one of the elements of the array dimA was negative or zero, or dataType has an invalid enumerant value.

@hpnhxxwn

@ppwwyyxx
Hi, I'm facing the same issue using CUDA 8.0 with cuDNN 6. Could you show me how you used tf.cond? Which parameter did you put the condition on?

Thank you!

@hpnhxxwn

@ppwwyyxx Could you point me to how to use tf.cond with a Keras generator type of input?

@ppwwyyxx
Contributor Author

I'm not sure I know what you mean, and I think it's unrelated to this issue anyway. You can search for documentation/examples or ask on Stack Overflow for usage questions.

@ppwwyyxx
Contributor Author

It turned out that Conv2D actually does support empty tensors, but FusedBatchNorm does not. That is probably why I saw this error. To reproduce:

import tensorflow as tf

with tf.device('/gpu:0'):
    x = tf.random_normal([0, 16, 64, 64])   # batch size is zero
    scale = tf.random_normal([16])
    y = tf.nn.fused_batch_norm(x, scale, scale, data_format='NCHW')

with tf.Session() as sess:
    print(sess.run(y))

ppwwyyxx changed the title from "Convolution doesn't support zero batch size" to "FusedBatchNorm doesn't support zero batch size" on Dec 11, 2017
@ppwwyyxx
Copy link
Contributor Author

There is a problem with the Conv2D backward pass as well, though the forward pass works:

import tensorflow as tf

with tf.device('/gpu:0'):
    x = tf.random_normal([0, 16, 64, 64])   # batch size is zero
    W = tf.random_normal([3, 3, 16, 16])
    y = tf.nn.conv2d(x, W, [1, 1, 1, 1], padding='SAME', data_format='NCHW')
    grad = tf.gradients(y, W)   # triggers Conv2DBackpropFilter

with tf.Session() as sess:
    print(sess.run(grad))
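
For reference, nothing actually needs to be computed here: the forward output is an empty tensor, and since there are no examples to accumulate over, the mathematically expected filter gradient is all zeros with W's shape. A minimal sketch of that expectation (illustrative only, not a claim about TF's kernel behavior at the time):

import tensorflow as tf

# Expected well-defined results for a zero-size batch:
#   - forward output y: an empty tensor of shape [0, 16, 64, 64]
#   - filter gradient: all zeros with the filter's shape, since there are
#     no examples to accumulate gradients over
W = tf.random_normal([3, 3, 16, 16])
expected_grad = tf.zeros_like(W)   # shape [3, 3, 16, 16]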

ppwwyyxx changed the title from "FusedBatchNorm doesn't support zero batch size" to "FusedBatchNorm & Conv2D backwards doesn't support zero batch size" on Dec 11, 2017
ppwwyyxx added a commit to ppwwyyxx/tensorflow that referenced this issue Dec 11, 2017
@ppwwyyxx
Contributor Author

Can anyone take a look at my PR #15264?

@yzhwang

yzhwang commented Dec 19, 2017

Thanks for the PR @ppwwyyxx . I will take a look.

ppwwyyxx added a commit to ppwwyyxx/tensorflow that referenced this issue Dec 28, 2017
drpngx pushed a commit that referenced this issue Dec 29, 2017
* Support empty input tensor for FusedBatchNorm, FusedBatchNormGrad, Conv2DBackpropFilter (fix #14657)

* Also fix pooling ops

* Add some comments in ops

* Add tests for conv/pooling/bn.

* Return NaN mean/variance when input is empty

* update comments

* fix typo

* Move fill_functor implementations to :fill_functor
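
With the fix above merged, the reproduction cases should run instead of crashing. A minimal sketch of the expected observation, assuming TF 1.x session semantics (per the commit message, mean/variance come back as NaN for empty input):

import numpy as np
import tensorflow as tf

with tf.device('/gpu:0'):
    x = tf.random_normal([0, 16, 64, 64])   # batch size is zero
    scale = tf.random_normal([16])
    y, mean, var = tf.nn.fused_batch_norm(x, scale, scale, data_format='NCHW')

with tf.Session() as sess:
    y_val, mean_val, var_val = sess.run([y, mean, var])
    print(y_val.shape)               # (0, 16, 64, 64): empty output, no crash
    print(np.isnan(mean_val).all())  # True, per "Return NaN mean/variance when input is empty"
    print(np.isnan(var_val).all())   # True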