
Output size discriminator #49
Closed
noob16 opened this issue Feb 10, 2017 · 10 comments

Comments

@noob16 commented Feb 10, 2017

Hi,

Why are you using an output size of 1x30x30 for the discriminator, and not just 1x1x1?

Thanks!

@phillipi (Owner)

This is because the (default) discriminator is a "PatchGAN" (Section 2.2.2 in the paper). This discriminator slides across the generated image, convolutionally, trying to classify if each overlapping 70x70 patch is real or fake. This results in a 30x30 grid of classifier outputs, each corresponding to a different patch in the generated image.
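
For intuition, here is a minimal sketch of such a discriminator in PyTorch (the repo itself is Torch/Lua, so this is an illustration rather than the repo's code). It assumes the C64-C128-C256-C512 stack of 4x4 convolutions described in the paper and the conditional setup where the input and target images are concatenated into 6 channels:

```python
import torch
import torch.nn as nn

# Approximate 70x70 PatchGAN: 4x4 convolutions with strides 2, 2, 2, 1, 1 and padding 1.
# Normalization layers are omitted here for brevity.
patch_d = nn.Sequential(
    nn.Conv2d(6, 64, 4, stride=2, padding=1), nn.LeakyReLU(0.2),
    nn.Conv2d(64, 128, 4, stride=2, padding=1), nn.LeakyReLU(0.2),
    nn.Conv2d(128, 256, 4, stride=2, padding=1), nn.LeakyReLU(0.2),
    nn.Conv2d(256, 512, 4, stride=1, padding=1), nn.LeakyReLU(0.2),
    nn.Conv2d(512, 1, 4, stride=1, padding=1),  # one real/fake score per patch
)

x = torch.randn(1, 6, 256, 256)   # concatenated input + output images
print(patch_d(x).shape)           # torch.Size([1, 1, 30, 30])
```

Each of the 30x30 outputs is scored as real or fake on its own, and the responses are averaged across the grid.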

@kenshinzh

@phillipi If I want to classify whether each overlapping smaller patch, e.g. a 16x16 patch, is real or fake, what size grid of classifier outputs should this produce? Is there a formula or any hints you can provide?

@phillipi (Owner)

You can use this script to determine the receptive field (e.g., 16 x 16) of a given architecture:
https://github.com/phillipi/pix2pix/blob/master/scripts/receptive_field_sizes.m

I'm not sure what the formula would be for calculating the output grid size given a desired receptive field. For a given architecture, you can always check the output size by running the discriminator on an image and calling out:size() on the output out.
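
If you prefer not to run the network, the same check can be done arithmetically with the standard convolution output-size relation, out = floor((in + 2*pad - kernel) / stride) + 1. A minimal Python sketch, assuming the default discriminator's 4x4 convolutions with padding 1 and strides 2, 2, 2, 1, 1:

```python
# Standard output-size relation for a single convolution layer:
#   out = floor((in + 2*pad - kernel) / stride) + 1
def conv_out(size, kernel=4, stride=2, pad=1):
    return (size + 2 * pad - kernel) // stride + 1

size = 256                       # input image width/height
for stride in (2, 2, 2, 1, 1):   # strides of the assumed default stack
    size = conv_out(size, stride=stride)
    print(size)                  # 128, 64, 32, 31, 30 -> a 30x30 output grid
```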

@kenshinzh

@phillipi Thank you for your quick response. I am not familiar with MATLAB. Could you please be more specific about how I can get the output with the script you provided? Thank you so much.

@phillipi (Owner) commented Jun 24, 2017

That script gives you the receptive field of a neuron. The equation to compute the input receptive field size from a given output receptive field size is (for a single convolutional layer):
input_size = (output_size - 1) * stride + kernel_size

You can call this recursively to compute the receptive field sizes across multiple layers.
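
A minimal Python sketch of that recursion, walking from a single output neuron back to the input. The (kernel, stride) pairs are assumptions matching the default 70x70 discriminator (five 4x4 convolutions with strides 2, 2, 2, 1, 1); a shorter stack with a single stride-2 layer is shown to recover the 16x16 field asked about above:

```python
# input_size = (output_size - 1) * stride + kernel_size, applied from the
# last layer back to the first.
def receptive_field(layers, output_size=1):
    for kernel, stride in reversed(layers):
        output_size = (output_size - 1) * stride + kernel
    return output_size

# (kernel, stride) per layer, first layer first -- assumed default 70x70 stack.
default_stack = [(4, 2), (4, 2), (4, 2), (4, 1), (4, 1)]
print(receptive_field(default_stack))              # 70

# A shorter stack with one stride-2 layer gives a 16x16 receptive field.
print(receptive_field([(4, 2), (4, 1), (4, 1)]))   # 16
```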

@kenshinzh commented Jun 25, 2017

@phillipi Thank you for your kind explanation. I am sorry that I didn't define my question well. The problem is that when I use the default 70x70 PatchGAN discriminator, the output is sometimes not as sharp as what is described in the paper.
[screenshot: sample result]
This sample looks good, but my results are not as sharp.

So I wonder whether the default 70x70 patch size is too large for the discriminator to tell real from fake. If I want to improve sharpness, should I increase or decrease the patch size? Or do you have any other suggestions?

@junyanz (Collaborator) commented Jun 25, 2017

@kenshinzh what's your input and output? To produce good colorization results, one needs to map L to ab. Look at Notes on Colorization for more details.

@kenshinzh

Actually, my experiments run in the B2A direction, so this is not a colorization task. I just used the B/W example for illustration. All of the input images are 256x256, as in the demo.

@phillipi (Owner)

In the colorizations in the paper, we concatenate the predicted ab map with the ground truth L. This has the effect of making the results look nice and crisp since most of the high frequencies are in the L channel. If you just look at the raw predicted ab it tends to look less sharp.
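
As an illustration of that display step, here is a minimal sketch (hypothetical array names, assuming scikit-image for the Lab conversions) that swaps the predicted ab channels into the ground-truth Lab image before converting back to RGB:

```python
import numpy as np
from skimage import color

def compose_colorization(gt_rgb, pred_ab):
    """Combine ground-truth lightness with predicted chroma for display.

    gt_rgb:  ground-truth image as an (H, W, 3) float array in [0, 1]
    pred_ab: predicted ab channels as an (H, W, 2) array in Lab units
    (both names are hypothetical, for illustration)
    """
    gt_lab = color.rgb2lab(gt_rgb)
    # Keep the ground-truth L channel -- it carries most of the high
    # frequencies -- and replace only the ab (chroma) channels.
    lab = np.concatenate([gt_lab[..., :1], pred_ab], axis=-1)
    return color.lab2rgb(lab)
```

This only affects visualization; as noted, the raw predicted ab on its own tends to look softer.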

I don't think there is a simple relationship between discriminator patch size and sharpness. In practice 70x70 usually works pretty well for me, but you could try a few variants to see what works in your application.

@phillipi (Owner) commented Dec 25, 2017 via email
