
Feature Request: Add separable_conv2d_transpose operation #12001

Open
andreas-eberle opened this issue Aug 3, 2017 · 42 comments
Labels
type:feature Feature requests

Comments

@andreas-eberle
Contributor

Some recent papers (e.g.) have shown that transposed separable convolutions can be a great choice for decoders in encoder-decoder architectures.

Can you add a separable_conv2d_transpose operation comparable to the conv2d_transpose operation?

@poxvoculi added the stat:contribution welcome and type:feature labels Aug 3, 2017
@tjingrant
Contributor

tjingrant commented Aug 4, 2017

Hi, I will try to work on this one.

@andreas-eberle do you have specific examples in which a separable_conv2d_transpose operation is profitable? Do you have a link to your e.g.?

@yeshwanthv5

Please provide links to the reference papers.

@andreas-eberle
Contributor Author

Sorry, didn't notice that I forgot the link.

In The Devil is in the Decoder they compare several "deconvolution" strategies and show that separable transposed convolution has very good performance.

In section 2.1.1 they give a short explanation about separable transposed convolution.

Afaik, separable (not transposed) convolution was introduced in Xception: Deep Learning with Depthwise Separable Convolutions

@ezfn

ezfn commented Jan 2, 2018

+1 on that.
Besides the examples given by andreas-eberle, it is generally helpful to have operations that are not allowed to mix channels, so that channels from different layers can be trivially combined (i.e. summed without any learned parameters).

@titusnicolae-intel

Hi, is anyone still working on this?
I'd like to start work on it; if anyone can provide some supervision, that would be helpful.

@estathop

estathop commented Jul 4, 2018

This would be a useful feature to implement.

@dhaneshr

Any updates on this?

@notnot

notnot commented Aug 4, 2018

I'd like to try a GAN with separable_conv2d and separable_conv2d_transpose, and I was surprised to see that separable_conv2d_transpose isn't available yet. Some have stated they started working on an implementation; how is that work going?

@chris-boson

I would also be able to help with an implementation. @tjingrant, have you started on it?
It also seems like the authors of The Devil is in the Decoder must have access to one.

@HouseOfFinwe

HouseOfFinwe commented Oct 16, 2018

Any idea when there will be an implementation for separable_conv2d_transpose?

@brucechou1983

Any update on this feature?

@chris-boson

Easiest to just use tf.keras.layers.Conv2DTranspose followed by tf.keras.layers.DepthwiseConv2D.

@HouseOfFinwe

@chris-boson This is not equivalent. One of the major selling points of a DepthwiseConv2DTranspose (if it existed) is a reduction in parameters, which would not be achieved by a transpose followed by a depthwise conv.

@chris-boson

chris-boson commented Oct 22, 2018

@HouseOfFinwe It does in fact reduce the parameter count considerably, especially in the case of many output channels. Use filters of shape [stride, stride] instead of [1, 1] for the pointwise conv in separable convolution to avoid checkerboarding.
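For illustration, a minimal sketch of that combination (the layer arguments here are illustrative assumptions, not tested settings):

import tensorflow as tf

def separable_upsample(x, filters, stride=2):
    # Learned upsampling; kernel_size == stride means the output patches
    # don't overlap, which avoids checkerboard artifacts.
    x = tf.keras.layers.Conv2DTranspose(
        filters, kernel_size=stride, strides=stride, padding='same')(x)
    # The depthwise conv then filters each channel independently, adding
    # only k * k * channels parameters instead of k * k * in * out.
    x = tf.keras.layers.DepthwiseConv2D(kernel_size=3, padding='same')(x)
    return x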

@netanel-s

+1 on this, would be highly appreciated.

@veqtor

veqtor commented Oct 30, 2018

@chris-boson could you give a clearer example of using Conv2DTranspose followed by DepthwiseConv2D? Regardless, I still think a pure depthwise transpose would be even more efficient

@zhcm

zhcm commented Nov 29, 2018

Any update on this feature?

@shenyi0220

Any updates or progress on this?

@mbuckler

I would be interested in this

@ltrottier

That would be a nice addition indeed.

@ygoncharov

Seems like a feature that makes a lot of sense

@veqtor

veqtor commented Mar 9, 2019

Would this require a new op? Is it difficult because of a lack of hardware support?

@mjmjmtl-pony

Any update on this?

@voletiv

voletiv commented Oct 10, 2019

Any update on this?

@CoachRDeveloper

Would like to see this feature implemented

@edmondja

edmondja commented Jun 8, 2020

+1

@gurpreet-singh135

gurpreet-singh135 commented Jun 19, 2020

Hi, is anyone working on this currently?
I'd like to work on this feature. Also, can somebody please provide some resources to start from?

@edmondja

edmondja commented Jun 19, 2020

Is it very different from using upsampling + separable conv?

@gurpreet-singh135

@edmondja the problem with upsampling + separable conv is that it increases the number of computations compared to a sep_conv2d_transpose; see the sketch below.
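For comparison, a minimal sketch of the upsampling + separable-conv alternative (layer arguments are illustrative):

import tensorflow as tf

def upsample_separable(x, filters, stride=2):
    # Parameter-free upsampling first...
    x = tf.keras.layers.UpSampling2D(size=stride)(x)
    # ...then the separable conv runs over the enlarged feature map, i.e.
    # roughly stride**2 more positions than a native separable transposed
    # convolution would need.
    x = tf.keras.layers.SeparableConv2D(filters, kernel_size=3, padding='same')(x)
    return x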

@yaoshiang

Anyone find a workaround? I attempted one with a stack of Conv2DTranspose layers, each with filters=1, but it was not very efficient. No promise this works, but this was my attempt, FWIW.

import tensorflow as tf
from tensorflow.keras import layers

class DepthwiseConv2DTranspose(layers.Layer):
    def __init__(self, filters, **kwargs):
        super(DepthwiseConv2DTranspose, self).__init__(**kwargs)
        self._filters = filters
        # One single-filter Conv2DTranspose per input channel.
        self._t = []
        for _ in range(filters):
            self._t.append(layers.Conv2DTranspose(
                filters=1, kernel_size=5, strides=2, output_padding=1))

    def __call__(self, img):
        upsample = []
        for i in range(self._filters):
            # Slice out channel i and upsample it independently.
            t = self._t[i](img[:, :, :, i:i + 1])
            t = t[:, :, :, 0]
            upsample.append(t)
        return tf.stack(upsample, axis=-1)
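Hypothetical usage of the above (shapes are illustrative):

layer = DepthwiseConv2DTranspose(filters=3)
out = layer(tf.random.normal([1, 8, 8, 3]))
print(out.shape)  # (1, 20, 20, 3): (8 - 1) * 2 + 5 + 1 = 20 with 'valid' padding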

@junhyukso
Contributor

Any update?

@Orpheus23

Here are two sample implementations that do some of it. The first doesn't compile on TPU, but it does work on CPU:

import tensorflow as tf
from tensorflow.keras import layers

class Depthwise_Conv2D_Transpose(tf.keras.layers.Layer):
    def __init__(self, filters, kernel_size, strides, padding='same',
                 use_bias=False, kernel_initializer=None, name="", **kwargs):
        super(Depthwise_Conv2D_Transpose, self).__init__(**kwargs)
        self.kernel_size = kernel_size
        self.strides = strides[0]
        self.padding = padding
        self.use_bias = use_bias
        self.kernel_init = kernel_initializer
        self.filters = filters
        self.input_image_shape = 0
        self.nm = name
        # One single-filter Conv2DTranspose per input channel.
        self.lambdas = []
        for _ in range(filters):
            self.lambdas.append(layers.Conv2DTranspose(
                filters=1, kernel_size=kernel_size,
                strides=self.strides, padding=self.padding))

    def call(self, inputs):
        # Split into per-channel slices, transpose-convolve each slice with
        # its own layer, then concatenate back along the channel axis.
        inputs_channel_wise = tf.split(inputs, self.filters, -1)
        x_outputs = [c(x) for x, c in zip(inputs_channel_wise, self.lambdas)]
        return tf.concat(x_outputs, -1)

The second also compiles on TPU and works well on CPU, but doesn't train on TPU:

class Depthwise_Conv2D_Transpose(tf.keras.layers.Layer):
    def __init__(self, filters, kernel_size, strides, padding='same',
                 use_bias=False, kernel_initializer=None, name="", **kwargs):
        super(Depthwise_Conv2D_Transpose, self).__init__(**kwargs)
        self.kernel_size = kernel_size
        self.strides = strides[0]
        self.padding = padding
        self.use_bias = use_bias
        self.kernel_init = kernel_initializer
        self.lambdas = []
        self.filters = filters
        self.input_image_shape = 0
        self.nm = name

    def deconv_length(self, dim_size, stride_size, kernel_size, padding,
                      output_padding=None, dilation=1):
        # Spatial output length of a transposed convolution.
        assert padding in {'same', 'valid', 'full'}
        if dim_size is None:
            return None

        # Get the dilated kernel size.
        kernel_size = kernel_size + (kernel_size - 1) * (dilation - 1)

        # Infer the length if output_padding is None, else compute it exactly.
        if output_padding is None:
            if padding == 'valid':
                dim_size = dim_size * stride_size + max(kernel_size - stride_size, 0)
            elif padding == 'full':
                dim_size = dim_size * stride_size - (stride_size + kernel_size - 2)
            elif padding == 'same':
                dim_size = dim_size * stride_size
        else:
            if padding == 'same':
                pad = kernel_size // 2
            elif padding == 'valid':
                pad = 0
            elif padding == 'full':
                pad = kernel_size - 1
            dim_size = (dim_size - 1) * stride_size + kernel_size - 2 * pad + output_padding

        return dim_size

    def build(self, input_shape):
        # One [k, k, 1, 1] kernel per input channel.
        for i in range(input_shape[-1]):
            self.lambdas.append(self.add_weight(
                name=self.nm + "weights" + str(i),
                initializer=tf.keras.initializers.get(self.kernel_init),
                shape=(self.kernel_size, self.kernel_size, 1, 1),
                trainable=True))
        self.input_image_shape = input_shape[1]
        self.image_shape = input_shape[-1]
        super(Depthwise_Conv2D_Transpose, self).build(input_shape)

    @tf.function
    def call(self, inputs):
        # Split into per-channel slices and map a single-filter
        # conv2d_transpose over the (slice, kernel) pairs.
        inputs_channel_wise = tf.split(inputs, self.image_shape, -1)
        out_size = self.deconv_length(
            self.input_image_shape, self.strides, self.kernel_size, self.padding)
        channel_wise_conv = tf.map_fn(
            lambda x: tf.nn.conv2d_transpose(
                x[0], filters=x[1],
                output_shape=[tf.shape(inputs)[0], out_size, out_size, 1],
                strides=self.strides,
                padding=self.padding.upper()),
            (inputs_channel_wise, self.lambdas),
            fn_output_signature=tf.float32)
        # [channels, batch, h, w, 1] -> [batch, h, w, channels]
        channel_wise_conv = tf.transpose(
            tf.squeeze(channel_wise_conv, axis=-1), [1, 2, 3, 0])
        return channel_wise_conv

If any solutions are found, do update. Thanks!

@Ram-WD

Ram-WD commented Nov 14, 2021

any update ?

@glenn-jocher

@Orpheus23 is your TF Depthwise Conv2dTranspose implementation in #12001 (comment) still the best available today?

@Orpheus23

Idk about any recent changes, but as of early 2021 (when I was last checking) it was. If a better solution is needed, it is best to write the op in C++ and export it. The layers I had written didn't train properly on TPU (idk about GPU; if it worked for someone, do update) as they required a lot of memory, perhaps because of mapping conv2d_transpose over every channel and the space required by tf.transpose + tf.squeeze.

Either way, it would be best to define it in C++; the functions that TensorFlow provides in Python are far from ideal for defining a new type of layer. With C++ there could be a better alternative to mapping conv2d_transpose over every channel and then reshaping everything.

@glenn-jocher

@Orpheus23 hmm, yes that's what I was worried about. We've been running YOLOv5 experiments with these layers in PyTorch, and they seem to export well everywhere except TF. We build the TF models natively rather than go through ONNX, and currently there seems to be no efficient solution to build TF models with these layers. In PyTorch it's pretty simple as you can just set groups to equal the input/output channel counts to create depthwise conv2dtranspose layers.
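For reference, the PyTorch version is a one-liner (the channel count here is illustrative):

import torch.nn as nn

c = 64  # in_channels == out_channels == groups -> depthwise
dw_deconv = nn.ConvTranspose2d(c, c, kernel_size=4, stride=2, padding=1, groups=c)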

@Orpheus23

That's there; PyTorch works very well in these matters. The groups feature is not available here. Maybe with tf.gather and conv2d_transpose you could do it, but it would not be very clean. If you really want to go ahead with TF, then C++ is the best bet; integrating the C++ part with Python was not that much of a headache. Otherwise, the alternatives are to do bilinear interpolation followed by a depthwise conv2d (sketched below), or to hack it out in Python TF. In any case, best of luck with the implementation.
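A minimal sketch of the bilinear interpolation + depthwise conv alternative (arguments are illustrative):

import tensorflow as tf

def bilinear_depthwise(x, stride=2):
    # Parameter-free bilinear upsampling...
    shape = tf.shape(x)
    x = tf.image.resize(x, [shape[1] * stride, shape[2] * stride], method='bilinear')
    # ...followed by a learned per-channel filter.
    x = tf.keras.layers.DepthwiseConv2D(kernel_size=3, padding='same')(x)
    return x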

@glenn-jocher

@Orpheus23 got it, thanks! I'm linking @zldrobit here, who's been a lot of the brains behind the YOLOv5 TF models.

@glenn-jocher

@AyushExel @sergiossm this is the main issue I found regarding DW Conv2d Transpose layers in TF (C++ conversion proposed by @Orpheus23)

@saikrn112

Hey guys, any update on this feature?

@bayesian-mind

Any update on the feature request for depthwise conv2d transpose?

@github-actions

This issue is stale because it has been open for 180 days with no activity. It will be closed if no further activity occurs. Thank you.

@github-actions bot added the stale label Nov 14, 2023
@github-actions bot removed the stale and stat:contribution welcome labels Mar 13, 2024