Use static shape where available in TensorFlow backend #32

Merged · 1 commit merged into arogozhnikov:master on May 10, 2019

Conversation

stefanwayon (Contributor)

When running in graph mode, TensorFlow tensors have two types of shapes:

  - a static shape, known when the graph is built
  - a dynamic shape, only known when the graph is run

The static shape is especially important for the channels dimension, since the number of variables needed for a convolution depends on this shape.

In the current implementation, `einops` uses only the dynamic shape in graph execution mode, which means that static shape information is lost after calling one of the `einops` functions. I have updated the TensorFlow backend to use dynamic shape information only for dimensions where the static shape is unknown.
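For context, the idea can be sketched roughly as follows (a hypothetical `shape` helper illustrating the approach, not the exact code in this PR):

```python
import tensorflow as tf

def shape(x):
    # Prefer the static shape, which is known at graph-construction time,
    # and fall back to the dynamic shape tensor only for dimensions that
    # remain unknown until the graph is run.
    static = x.shape.as_list()   # e.g. [None, 32, 32, 3]
    dynamic = tf.shape(x)        # evaluated only when the graph runs
    return [dynamic[i] if dim is None else dim
            for i, dim in enumerate(static)]
```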

@arogozhnikov arogozhnikov merged commit f700314 into arogozhnikov:master May 10, 2019
@arogozhnikov (Owner)

LGTM, thanks! Does it only help with interpreting shapes while constructing the graph, or are there other important cases?

@stefanwayon (Contributor, Author)

Not only. Having an unknown shape for the channel dimension breaks TF Keras layers, since the number of weights becomes unknown and the variables cannot be allocated (e.g. the shape of the kernel in a convolutional layer depends on the number of input channels).
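A minimal illustration of that failure mode (hypothetical shapes; the exact error message may differ between TF versions):

```python
import tensorflow as tf

# A Conv2D kernel has shape (kernel_h, kernel_w, in_channels, out_channels),
# so the layer cannot allocate its weights unless in_channels is statically known.
known = tf.keras.Input(shape=(32, 32, 3))       # channel dimension is 3
tf.keras.layers.Conv2D(16, 3)(known)            # builds fine

unknown = tf.keras.Input(shape=(32, 32, None))  # channel dimension is unknown
tf.keras.layers.Conv2D(16, 3)(unknown)          # raises: the channel dimension
                                                # of the inputs must be defined
```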

@arogozhnikov (Owner)

@slimm a minimal example for tests would help
