
StyleLoss dividing by n twice #90

Closed
rrshaban opened this issue Dec 7, 2015 · 2 comments

rrshaban commented Dec 7, 2015

In StyleLoss:updateOutput (neural_style.lua:398):

function StyleLoss:updateOutput(input)
  -- Gram matrix of the layer's features, normalized by the element count
  self.G = self.gram:forward(input)
  self.G:div(input:nElement())
  -- MSECriterion averages over its input's elements again
  self.loss = self.crit:forward(self.G, self.target)
  ...
end

Our criterion (self.crit = nn.MSECriterion()) already divides by n after computing the squared error: nn.MSECriterion. So it seems to me that we are dividing by n twice: once before passing G to the criterion, and once again inside it.
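The effect is easy to check in a small NumPy sketch (Python here rather than the repo's Lua/Torch; the feature-map sizes are illustrative, not from the repo). Assuming the target Gram matrix is normalized the same way during the capture pass, the explicit division only rescales each layer's loss by the constant 1/n²:

```python
import numpy as np

# Hypothetical feature map for one layer: C channels, H x W spatial.
C, H, W = 4, 8, 8
n = C * H * W                      # input:nElement() in the Lua code

rng = np.random.default_rng(0)
feat = rng.standard_normal((C, H * W))
style_feat = rng.standard_normal((C, H * W))

def gram(f):
    # C x C Gram matrix of flattened features
    return f @ f.T

def mse(a, b):
    # nn.MSECriterion with sizeAverage: mean over all elements
    return np.mean((a - b) ** 2)

# As in StyleLoss: both G and the target are divided by n first.
G = gram(feat) / n
T = gram(style_feat) / n
loss = mse(G, T)

# Dividing both inputs by n just scales the loss by 1/n^2:
loss_unnormalized = mse(gram(feat), gram(style_feat))
assert np.isclose(loss, loss_unnormalized / n ** 2)
```

So per layer the division is a constant factor, which is why it only matters when comparing losses across layers with different n.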

Is this intentional, or am I missing something?

@jcjohnson
Owner

You're right: we do divide by the same thing twice. Per layer this is basically a no-op, since it just scales that layer's style loss by a constant.

However, since we use the same style loss weight for all layers, the normalization changes the relative contributions of the style losses from different layers, because n differs per layer. I don't really have a good theoretical justification for this particular normalization, but empirically it tends to give good results.
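To see how the 1/n² factor re-weights layers, here is a small sketch with hypothetical VGG-style layer shapes (the channel and spatial sizes below are illustrative, assuming roughly a 256×256 input; they are not taken from the repo):

```python
# Hypothetical (channels, height, width) per style layer.
layers = {
    "relu1_1": (64, 256, 256),
    "relu2_1": (128, 128, 128),
    "relu3_1": (256, 64, 64),
    "relu4_1": (512, 32, 32),
}

# Each layer's style loss picks up a constant factor of 1/n^2,
# where n = C * H * W, so deeper (smaller) layers are boosted.
scales = {name: 1.0 / (c * h * w) ** 2 for name, (c, h, w) in layers.items()}

# Relative weight of each layer compared to relu1_1:
rel = {name: scales[name] / scales["relu1_1"] for name in layers}
# relu2_1 is weighted 4x relu1_1, relu3_1 16x, relu4_1 64x.
```

With a single shared style weight, this implicit re-weighting is what the normalization actually changes.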

@rrshaban
Author

Thanks for the detailed reply, as well as the great code!
