
what's the difference between pulse and stylegan encoder? #29

Open
danielkaifeng opened this issue Jun 23, 2020 · 4 comments

Comments

@danielkaifeng

  1. The StyleGAN encoder predicts a face similar to the input face, while PULSE predicts a face whose LR (low-resolution) version is close to the input face.
  2. Just like the StyleGAN encoder, PULSE can only predict faces covered by a pretrained GAN such as StyleGAN. If you want to predict something else, you must train a GAN first. In this respect, it is not a general-purpose SR model.

Thanks for correcting me if I missed anything.
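If it helps to write it down, here is a minimal sketch of the two objectives as I understand them (not code from either repository; `G` stands for any pretrained generator mapping a latent to a 1024x1024 image, and the targets are image tensors):

```python
import torch.nn.functional as F

def encoder_objective(G, z, hr_target):
    # StyleGAN-encoder style projection: compare the generated image
    # to the high-resolution input directly.
    return F.mse_loss(G(z), hr_target)

def pulse_objective(G, z, lr_target, lr_size=16):
    # PULSE: compare only the *downscaled* generated image to the
    # low-resolution input; the HR output itself is unconstrained
    # beyond staying on the generator's manifold.
    hr = G(z)
    down = F.interpolate(hr, size=(lr_size, lr_size),
                         mode='bicubic', align_corners=False)
    return F.mse_loss(down, lr_target)
```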

@woctezuma commented Jun 23, 2020

I assume that what you call a "StyleGAN encoder" is code like this one. Similarly to what was later suggested in the StyleGAN2 article, the aim is to project a real image into the latent space of the generative model. So what you get is one of the generated images closest to your high-resolution input. The images are close in high resolution.

The original paper is about training the StyleGAN2 generative model, and the ability to project real images is a nice side property. People have since built tools to do so because they wanted to have fun editing real images. Indeed, this makes it possible to edit images by moving from the projection along specified latent directions. You will get results like this one, where Mona Lisa's pose is changed.

[Image: StyleGAN2 encoder and editor]

In contrast, PULSE looks for plausible images in the latent space which would downscale correctly with respect to the low-resolution input, e.g. PULSE produces a plausible 1024x1024 image which would downscale to something close to the 16x16 input. There are many potential 1024x1024 images, so what you get is one among very many suitable candidates. The images are close in low resolution.

You can play with it on Colab.

[Image: Figure 3 in the PULSE paper]

Overall, I agree that there are broad similarities between the projects, in that both look for a plausible image in the latent space. However, as you pointed out, there are differences in terms of input (high-resolution vs. low-resolution) and objective (find the projection vs. find one of very many equally plausible candidates).

One cool experiment could be to play with these two Colab notebooks (projector vs. upsampling) and compare the kind of results you get by feeding a low-resolution image to the StyleGAN2 projector. I would not expect the projector to perform well at all, because it will try to produce a blurry high-resolution image that looks like the low-resolution input rendered at high resolution. It is quite easy to produce unrealistic images with the projector, while PULSE sticks to producing plausible high-resolution images.
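For what it's worth, something along these lines (paths and sizes are just placeholders) would prepare the two inputs for that comparison, since the projector expects a full-resolution image while PULSE takes the 16x16 image directly:

```python
from PIL import Image

img = Image.open("face.png").convert("RGB")           # placeholder path
lr = img.resize((16, 16), Image.BICUBIC)              # input for PULSE
lr_blown_up = lr.resize((1024, 1024), Image.BICUBIC)  # blurry input for the projector
lr.save("face_16x16.png")
lr_blown_up.save("face_16x16_upsampled.png")
```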

Anyway, let me know if I am mistaken, or if I rehashed information which you already knew. ;)

@danielkaifeng (Author)

With the pretrained projector's weights and some tricky fine-tuning, you can get a very similar HR image, like this one:
[Image: face1_01]

For a very blurry face, you can use a very small learning rate and increase the GAN loss weight to ensure it doesn't generate a blurry HR image.
[Image: 1_01]

I believe the stylegan-encoder projection method can do a good job at face SR, and PULSE improves its generalization by changing the optimization target from (input vs. prediction) to (input vs. the prediction's downsampled low-resolution image).
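Just to illustrate what I mean, a rough sketch of one optimization step (not the actual code of either project; `G` and `D` stand for a pretrained generator and discriminator, and the weights are made up):

```python
import torch
import torch.nn.functional as F

def finetune_step(G, D, w, lr_target, opt, gan_weight=5.0, lr_size=16):
    # Compare in LR space (the PULSE-style target) and add a discriminator
    # term, weighted up, to keep the HR output from going blurry.
    opt.zero_grad()
    hr = G(w)
    down = F.interpolate(hr, size=(lr_size, lr_size),
                         mode='bicubic', align_corners=False)
    loss = F.mse_loss(down, lr_target) - gan_weight * D(hr).mean()
    loss.backward()
    opt.step()
    return loss.item()

# e.g. with a very small learning rate on the latent being optimized:
# w = w_init.clone().requires_grad_(True)
# opt = torch.optim.Adam([w], lr=1e-4)
```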

@danielkaifeng (Author)

To take this a step further, I don't think either stylegan-encoder or PULSE can become a general SR application. You can't get full-body person SR unless you train a GAN for it, so the SR quality relies heavily on your pretrained GAN: train a good GAN and you get nice results, and vice versa.

Even for face generation, we must align the face first, which means the eyes, mouth, etc. must be in exactly the right positions. It would be much better to generalize the method so that the input images need less preprocessing.
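For reference, the alignment step both projects rely on boils down to something like this rough dlib-based sketch (the real FFHQ alignment script does more than this; the model file is the standard dlib 68-landmark predictor, and the crop margin is made up):

```python
import dlib
import numpy as np
from PIL import Image

detector = dlib.get_frontal_face_detector()
predictor = dlib.shape_predictor("shape_predictor_68_face_landmarks.dat")

def align_face(path, out_size=1024):
    # Locate the eyes, rotate the image so they lie on a horizontal line,
    # then crop around the face and resize to the generator's resolution.
    img = np.array(Image.open(path).convert("RGB"))
    face = detector(img, 1)[0]                              # assume one face
    pts = predictor(img, face)
    left_eye = np.mean([[pts.part(i).x, pts.part(i).y] for i in range(36, 42)], axis=0)
    right_eye = np.mean([[pts.part(i).x, pts.part(i).y] for i in range(42, 48)], axis=0)
    dy, dx = right_eye[1] - left_eye[1], right_eye[0] - left_eye[0]
    angle = np.degrees(np.arctan2(dy, dx))
    cx, cy = (left_eye[0] + right_eye[0]) / 2, (left_eye[1] + right_eye[1]) / 2
    rotated = Image.fromarray(img).rotate(angle, center=(cx, cy), resample=Image.BILINEAR)
    size = int(2.2 * face.width())                          # rough crop margin
    crop = rotated.crop((cx - size // 2, cy - size // 2,
                         cx + size // 2, cy + size // 2))
    return crop.resize((out_size, out_size), Image.BICUBIC)
```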

@yuqiu1233

Can you tell me how to fine-tune this project? Thanks
