session crashes (CPU->GPU Memcpy failed) when running BatchNormalization

 
 ### Issue Type

Bug

### Source

binary

### Tensorflow Version

tfnightly

### Custom Code

Yes

### OS Platform and Distribution

Linux Ubuntu 20.04

### Mobile device

_No response_

### Python version

3.9

### Bazel version

_No response_

### GCC/Compiler version

_No response_

### CUDA/cuDNN version

_No response_

### GPU model and memory

_No response_

### Current Behaviour?

When I run the code snippet below (`relu` and `BatchNormalization`) for the first time, I encounter `InternalError`. Then I try to run it again and session crashes.


### Standalone code to reproduce the issue
1. Run for a single time: throws InternalError
```shell
import tensorflow as tf
import numpy as np
print(tf.__version__)
input_data = np.random.rand(1, 3, 3, 1).astype(np.float32)
output_data = tf.keras.activations.relu(
    tf.keras.layers.BatchNormalization(axis=-1)(input_data))
```
Log output:

```shell
2.11.0-dev20220919
InternalError: Exception encountered when calling layer 'batch_normalization' (type BatchNormalization).

{{function_node __wrapped__FusedBatchNormV3_device_/job:localhost/replica:0/task:0/device:GPU:0}} cuDNN launch failure : input shape ([1,3,3,1]) [Op:FusedBatchNormV3]

Call arguments received by layer 'batch_normalization' (type BatchNormalization):
  • inputs=tf.Tensor(shape=(1, 3, 3, 1), dtype=float32)
  • training=None
```

2. Run for a second time: crash
```
import tensorflow as tf
import numpy as np
try:
  input_data = np.random.rand(1, 3, 3, 1).astype(np.float32)
  output_data = tf.keras.activations.relu(
    tf.keras.layers.BatchNormalization(axis=-1)(input_data))
except:
  pass

input_data = np.random.rand(1, 3, 3, 1).astype(np.float32)
output_data = tf.keras.activations.relu(
    tf.keras.layers.BatchNormalization(axis=-1)(input_data)) # crash here
```
Relevant logs:
```
F tensorflow/core/common_runtime/gpu/gpu_util.cc:386] CPU->GPU Memcpy failed
```


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

session crashes (CPU->GPU Memcpy failed) when running BatchNormalization #57752

Issue Type

Source

Tensorflow Version

Custom Code

OS Platform and Distribution

Mobile device

Python version

Bazel version

GCC/Compiler version

CUDA/cuDNN version

GPU model and memory

Current Behaviour?

Standalone code to reproduce the issue

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

session crashes (CPU->GPU Memcpy failed) when running BatchNormalization #57752

Description

Issue Type

Source

Tensorflow Version

Custom Code

OS Platform and Distribution

Mobile device

Python version

Bazel version

GCC/Compiler version

CUDA/cuDNN version

GPU model and memory

Current Behaviour?

Standalone code to reproduce the issue

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions