Do you have any plans to export a pytorch version? #17

Closed

MultiPath opened this issue May 3, 2021 · 12 comments

Comments

@MultiPath

Hi, I am not too familiar with TensorFlow...
If there are no such plans currently, do you have quick pointers to:

  1. the GANsformer model, especially where and how you deal with the latents (based on your paper, you split the latents?);
  2. the optimizer you are using and how it is implemented (is it similar to what we do in NLP, with warmup, etc.?);
  3. whether you ever tried using a standard feedforward layer after your duplex attention layer instead of a 3x3 convolution, and whether it still worked?

Thanks again for your kind attention!
Best,

@ygjwd12345

PyTorch version, please.

@dorarad
Owner

dorarad commented May 4, 2021

Hi, thank you for the interest!
Yea, I think it would be a great idea to have a PyTorch version, and I hope to work on it after the NeurIPS deadline in a few weeks.

For the specific questions:

  1. All the GANsformer model details can be found in the network.py file: the parts that relate to the attention and transformer layers are there, and the latent initialization happens in the loss file, based on the model input format specified there.
  2. The optimization is specified in the network.py file and follows the StyleGAN2 code exactly (a rough sketch of an equivalent PyTorch setup follows below).
  3. You mean avoiding convolution completely? Yes, I did run such experiments. The quality was a bit lower, but it generally works OK. I think the advantage of the bipartite transformer structure we use here vs. standard transformers is that we have linear attention from a few global latent variables to the image, while a simple transformer has quadratic self-attention from all pixels to all pixels (or all spatial features to themselves), which also prevents standard transformers from scaling to high-res images. As a result, our transformer can model regional or global aspects of the image, but it is more limited in its ability to modulate direct interactions between specific pairs of pixels/features, which is why retaining the convolution here is useful to complement this aspect (see the attention sketch after this list).
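
To make point 2 concrete: the StyleGAN2 code trains both networks with plain Adam and no warmup schedule, quite unlike typical NLP transformer recipes. A minimal PyTorch sketch of an equivalent setup, assuming StyleGAN2's published defaults (lr = 0.002, betas = (0.0, 0.99), eps = 1e-8) rather than anything verified against this repository:

```python
import torch
import torch.nn as nn

# Placeholder modules standing in for the real networks.
G = nn.Linear(512, 512)  # hypothetical generator stub
D = nn.Linear(512, 1)    # hypothetical discriminator stub

# Adam with StyleGAN2's published defaults (assumed here, since the answer
# above says the optimization follows the StyleGAN2 code exactly).
opt_G = torch.optim.Adam(G.parameters(), lr=0.002, betas=(0.0, 0.99), eps=1e-8)
opt_D = torch.optim.Adam(D.parameters(), lr=0.002, betas=(0.0, 0.99), eps=1e-8)

# Unlike common NLP training recipes, there is no learning-rate warmup.
```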
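
And for point 3, here is a minimal, hypothetical PyTorch sketch of the bipartite attention described above: a single latent vector is split into k components, and N image features attend to those k latents, costing O(k·N) rather than the O(N²) of pixel-to-pixel self-attention. All names below are illustrative, not the repository's actual code:

```python
import torch
import torch.nn as nn

class BipartiteAttention(nn.Module):
    """Illustrative sketch: image features attend to a few global latents.
    Cost is O(k * N) for k latents and N spatial positions, unlike the
    O(N^2) pixel-to-pixel self-attention of a standard transformer."""
    def __init__(self, dim):
        super().__init__()
        self.to_q = nn.Linear(dim, dim)  # queries from image features
        self.to_k = nn.Linear(dim, dim)  # keys from latents
        self.to_v = nn.Linear(dim, dim)  # values from latents
        self.scale = dim ** -0.5

    def forward(self, features, latents):
        # features: [B, N, dim] flattened image grid; latents: [B, k, dim]
        q = self.to_q(features)                        # [B, N, dim]
        k = self.to_k(latents)                         # [B, k, dim]
        v = self.to_v(latents)                         # [B, k, dim]
        attn = (q @ k.transpose(-2, -1)) * self.scale  # [B, N, k]
        attn = attn.softmax(dim=-1)
        return features + attn @ v                     # residual update of the grid

# Usage: split one latent z into k components, then let the image attend to them.
B, k, dim, N = 2, 16, 32, 64 * 64
z = torch.randn(B, k * dim).view(B, k, dim)  # "splitting the latents" into k parts
grid = torch.randn(B, N, dim)
out = BipartiteAttention(dim)(grid, z)       # [B, N, dim]
```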

@MultiPath
Author

Thanks very much for your kind attention, and looking forward to the PyTorch version!

I will check the code you referred to!

Thanks for the explanation of the convolution parts! Good to know it also works.

One last question I forgot to ask: how is the inference speed of your model compared to a StyleGAN2 of similar size?
Does the transformer affect the speed? In my view, duplex attention still involves a lot of computation due to the image size.

Thanks again

@dorarad
Owner

dorarad commented May 12, 2021

Sure thing, happy to help!

StyleGAN2 takes 0.45s to produce a 4-image batch, while the Simplex and Duplex transformers take 0.48s and 0.53s per batch respectively, so not a significant overhead!

The EM-like process in Duplex attention is amortized over the generator's multiple layers as it gradually increases the feature resolution: each layer adds only one iteration, allowing for efficient computation and avoiding a large overhead compared to Simplex attention.
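
For intuition, a hedged sketch of what "one iteration per layer" could look like in PyTorch, extending the bipartite sketch earlier in this thread with a reverse update: each generator layer first refreshes the latents from the current image features (the E-step-like assignment), then updates the features from the refreshed latents, so iterations accumulate as resolution grows. This illustrates the idea only and is not the repository's implementation:

```python
import torch
import torch.nn as nn

def attend(q, k, v, scale):
    """Scaled dot-product attention over the last two dims."""
    w = (q @ k.transpose(-2, -1)) * scale
    return w.softmax(dim=-1) @ v

class DuplexStep(nn.Module):
    """Illustrative sketch of one amortized duplex iteration: latents are
    updated from the image features, then the features are updated from
    the refreshed latents. One such step runs per generator layer."""
    def __init__(self, dim):
        super().__init__()
        self.latent_qkv = nn.ModuleList(nn.Linear(dim, dim) for _ in range(3))
        self.feature_qkv = nn.ModuleList(nn.Linear(dim, dim) for _ in range(3))
        self.scale = dim ** -0.5

    def forward(self, features, latents):
        # features: [B, N, dim]; latents: [B, k, dim]
        lq, lk, lv = self.latent_qkv
        fq, fk, fv = self.feature_qkv
        # Step 1: latents attend to features (the E-step-like update);
        # one such update per layer, so iterations accumulate with depth.
        latents = latents + attend(lq(latents), lk(features), lv(features), self.scale)
        # Step 2: features attend to the updated latents (linear in N).
        features = features + attend(fq(features), fk(latents), fv(latents), self.scale)
        return features, latents

# A generator with L layers thus performs L duplex iterations overall,
# at a marginal cost of one extra attention pass per layer.
```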

@yzcv

yzcv commented Sep 6, 2021

Hi, @dorarad

May I ask whether you have developed the PyTorch version of gansformer yet? I look forward to the PyTorch version and the checkpoints.

@dorarad
Owner

dorarad commented Sep 6, 2021

Hi, yea it's in progress and I hope to release it soon (over the next couple of weeks)!

@yzcv

yzcv commented Sep 6, 2021

> Hi, yea it's in progress and I hope to release it soon (over the next couple of weeks)!

Thanks so much for your prompt reply. I will keep an eye on this repo, and thanks again for your work. @dorarad

@MultiPath
Author

Hi, any news on the PyTorch version? Thanks!

@MultiPath MultiPath reopened this Oct 9, 2021
@dorarad
Owner

dorarad commented Oct 9, 2021 via email

@sb-nw

sb-nw commented Oct 15, 2021

Hi! Is there an estimated release date for the PyTorch version? I know you mentioned recently that you are working on it, but it would be great to have an estimate! Thanks! - Stunning work, btw.

@dorarad
Owner

dorarad commented Oct 15, 2021 via email

@dorarad
Owner

dorarad commented Feb 2, 2022

Hi guys! Without further ado, I'm happy to introduce the new PyTorch implementation of the model!
It has a matching interface to the original TF version. I will add a readme shortly; please do let me know if you have any questions or suggestions!

@dorarad dorarad closed this as completed Feb 2, 2022