
Questions about GaussianCriterion.lua #4

Closed
xcyan opened this issue Aug 29, 2015 · 1 comment
xcyan commented Aug 29, 2015

self.gradInput[2] = torch.exp(-input[2]):cmul(torch.add(target,-1,input[1]):pow(2)):add(-0.5)

It seems to me that the gradient update step should be:

self.gradInput[2] = torch.exp(-input[2]):cmul(torch.add(target,-1,input[1]):pow(2)):mul(0.5):add(-0.5)

@y0ast y0ast closed this as completed in c074ffa Aug 30, 2015

y0ast commented Aug 30, 2015

Hmm, there is definitely something wrong, but your solution is not correct either.

input[1] = mu
input[2] = log(sigma^2)

The forward is the Gaussian log-likelihood:
-0.5 * (log(sigma^2) + log(2pi)) - 0.5 * (x - mu)^2 / sigma^2

The backward for log(sigma^2) is:

-0.5 + 0.5 * (x - mu)^2 / sigma^2

The derivative of 1/sigma^2 with respect to log(sigma^2) is
-exp(-log(sigma^2)) (using 1/sigma^2 = exp(-log(sigma^2))).

That minus cancels the minus of the -0.5 * (x - mu)^2 term, and you reach the backward mentioned above.
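The derivation can be sanity-checked numerically. Here is a minimal plain-Python sketch (the function names are hypothetical, not part of the repo) that parameterizes the Gaussian log-likelihood by mu and logvar = log(sigma^2), and compares the analytic logvar gradient against a central finite difference:

```python
import math

def log_likelihood(x, mu, logvar):
    # -0.5 * (log(sigma^2) + log(2*pi)) - 0.5 * (x - mu)^2 / sigma^2,
    # with 1/sigma^2 written as exp(-logvar)
    return -0.5 * (logvar + math.log(2 * math.pi)) \
           - 0.5 * (x - mu) ** 2 * math.exp(-logvar)

def grad_logvar(x, mu, logvar):
    # analytic d/d(logvar): -0.5 + 0.5 * (x - mu)^2 / sigma^2
    return -0.5 + 0.5 * (x - mu) ** 2 * math.exp(-logvar)

x, mu, logvar = 1.3, 0.4, -0.7
eps = 1e-6
numeric = (log_likelihood(x, mu, logvar + eps)
           - log_likelihood(x, mu, logvar - eps)) / (2 * eps)
print(abs(numeric - grad_logvar(x, mu, logvar)) < 1e-6)  # prints True
```

Note that the finite difference confirms only the gradient of the log-likelihood as written; the sign of the criterion's gradInput still depends on whether the module treats the output as a likelihood to maximize or a loss to minimize.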

See the commit for further details :). Thanks for pointing it out!

y0ast pushed a commit that referenced this issue Dec 19, 2016