why you use layer_norm ? #10

taki0112 · 2018-04-20T02:48:16Z

Hi
code
In Decoder, you used the layer_norm

have you tried another normalization ? like instance, batch, group

The text was updated successfully, but these errors were encountered:

xunhuang1995 · 2018-04-21T00:12:53Z

Instance norm does not work well, since it removes the global feature mean and variance that capture the style information.

Batch norm is the same since we use batch size = 1.

The paper had been submitted before group normalization came out. So we haven't try it.

taki0112 · 2018-04-21T08:26:01Z

I have some question...

If you look at configs, there is no information about animal translation.
When will you upload it?
Is it ok to just run as hyper-parameter like edge2shoes?

xunhuang1995 closed this as completed Apr 21, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

why you use layer_norm ? #10

why you use layer_norm ? #10

taki0112 commented Apr 20, 2018

xunhuang1995 commented Apr 21, 2018

taki0112 commented Apr 21, 2018

why you use layer_norm ? #10

why you use layer_norm ? #10

Comments

taki0112 commented Apr 20, 2018

xunhuang1995 commented Apr 21, 2018

taki0112 commented Apr 21, 2018