Fix Weibull loglikelihood issues #59

Open · wants to merge 1 commit into master
Conversation

michael-tsel

This is a fix for both the continuous and the discrete Weibull log-likelihood issues #58 and #56.

@ragulpr ragulpr (Owner) left a comment

Ping #56 #58

Comment on lines 168 to 170

loglikelihoods = u * \
    K.log(K.exp(hazard1 - hazard0) - (1.0 - epsilon)) - hazard1
    K.log((1.0 + epsilon) - K.exp(hazard0 - hazard1)) - hazard0
return loglikelihoods
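
For context (not part of the PR): the hunk above collapses two alternatives onto consecutive lines; the first K.log(...) line is the expression currently in master and the second is the replacement this PR proposes, as the discussion below makes clear. A minimal self-contained sketch of both forms, using plain numpy instead of the Keras backend K, and assuming hazard0 and hazard1 are the cumulative hazard Λ of a Weibull with scale a and shape b evaluated at y and y+1 (an assumption about the surrounding code, not taken from this diff):

    import numpy as np

    def loglik_discrete_current(y, u, a, b, epsilon=1e-10):
        # Current form: u * log(exp(Λ(y+1) - Λ(y)) - (1 - eps)) - Λ(y+1)
        hazard0 = ((y + epsilon) / a) ** b   # Λ(y),   assumed definition
        hazard1 = ((y + 1.0) / a) ** b       # Λ(y+1), assumed definition
        return u * np.log(np.exp(hazard1 - hazard0) - (1.0 - epsilon)) - hazard1

    def loglik_discrete_proposed(y, u, a, b, epsilon=1e-10):
        # Form proposed in this PR: u * log((1 + eps) - exp(Λ(y) - Λ(y+1))) - Λ(y)
        hazard0 = ((y + epsilon) / a) ** b
        hazard1 = ((y + 1.0) / a) ** b
        return u * np.log((1.0 + epsilon) - np.exp(hazard0 - hazard1)) - hazard0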
ragulpr (Owner)

In the discrete case it’s equivalent. It’s just a matter of where we put epsilon:

 K.log((1.0 + epsilon) - K.exp(hazard0 - hazard1)) - hazard0
=K.log((1.0 + epsilon) - K.exp(hazard0)/K.exp(hazard1)) - hazard0
=K.log(K.exp(hazard0)[(1.0 + epsilon)/K.exp(hazard0) - 1/K.exp(hazard1)]) - hazard0
=K.log([(1.0 + epsilon)*K.exp(-hazard0) - K.exp(-hazard1)])
=K.log(K.exp(-hazard1)[(1.0 + epsilon)*K.exp(hazard1-hazard0) - 1])
=K.log([(1.0 + epsilon)*K.exp(hazard1-hazard0) - 1])-hazard1
~K.log([(1.0)*K.exp(hazard1-hazard0) - 1])-hazard1
=K.log([K.exp(hazard1-hazard0) - 1])-hazard1

Which form is right is just a matter of style. My reason is that I like to emphasize that for positive distributions the discretized log-likelihood can always be written in the form u*log[exp(Λ(y+1)-Λ(y))-1]-Λ(y+1), a form I chose because empirically it seemed the most numerically stable/efficient and automatic differentiation seemed to unfold it efficiently. I may be wrong about this however, so please provide me with counterexamples if you have them.
See proposition 2.26.
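
A quick numerical sanity check of the rewrites above (not from the PR; the hazard0 and hazard1 values are made up), again with plain numpy in place of K:

    import numpy as np

    hazard0, hazard1, epsilon = 0.7, 1.3, 1e-10  # arbitrary example values

    start  = np.log((1.0 + epsilon) - np.exp(hazard0 - hazard1)) - hazard0
    step   = np.log((1.0 + epsilon) * np.exp(-hazard0) - np.exp(-hazard1))
    final  = np.log((1.0 + epsilon) * np.exp(hazard1 - hazard0) - 1.0) - hazard1
    approx = np.log(np.exp(hazard1 - hazard0) - 1.0) - hazard1  # epsilon dropped

    # start, step and final are algebraically identical; approx differs from
    # them only by a term of order epsilon.
    print(start, step, final, approx)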

michael-tsel (Author)

Yeah, the only true difference was in the case of u=0.

loglikelihoods = u * (K.log(b) + b * K.log(ya)) - K.pow(ya, b)
loglikelihoods = u * (K.log(b) - K.log(a) + (b-1) * K.log(ya)) - K.pow(ya, b)
@ragulpr ragulpr (Owner) commented on Oct 13, 2019

Considering the only terms that differ:

b * K.log(ya)
vs
- K.log(a) + (b-1) * K.log(ya)

Where the latter can be written

- K.log(a) + b*K.log(ya) - 1*K.log(ya)
= [b*K.log(ya)] - K.log(a) - 1*K.log(ya)
= [b*K.log(ya)] - K.log(a) - K.log(y+eps) + K.log(a)
= [b*K.log(ya)] - K.log(y+eps)

With regards to the parameters, this is proportional to

∝b * K.log(ya)

Since K.log(y+eps) has zero gradient with respect to the parameters, it's unnecessary to compute it. The standard in almost every statistical/ML package I've seen is, for computational reasons, to implement loss functions using only the terms that are proportional to, rather than equal to, the log-likelihood.

The upside is a very marginal computational benefit. The downside is that it can be confusing if one expects exp(-loss) = "probability of seeing this data with these parameters". Log-likelihoods are hence rarely interpretable or directly comparable across distributions and implementations except up to proportionality.

I'm leaning towards the upside not being worth the downside and would be open to a loss function that is equal rather than proportional. I'm a bit worried about touching these equations too much; they are battle-tested in numerous fights against NaN and have proven themselves very stable and performant. A form like

     loglikelihoods = u * (K.log(b) + b * K.log(ya) - K.log(y + eps)) - K.pow(ya, b)

seems like a pretty safe and cheap alternative, however.

What do you think?
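
For illustration only (not part of the PR; example values assumed), a plain-numpy sketch showing that the "proportional" and "equal" continuous forms differ exactly by u * K.log(y + eps), a term with zero gradient with respect to a and b:

    import numpy as np

    y, u, eps = 3.0, 1.0, 1e-10   # arbitrary example values
    a, b = 2.0, 1.5               # Weibull scale and shape
    ya = (y + eps) / a

    proportional = u * (np.log(b) + b * np.log(ya)) - ya ** b
    equal = u * (np.log(b) - np.log(a) + (b - 1) * np.log(ya)) - ya ** b

    print(equal - proportional)   # == -u * log(y + eps): constant in a and b
    print(-u * np.log(y + eps))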

michael-tsel (Author)

Now that I come to think of it again, you're right! There is no need to add a redundant constant term to the log-likelihood in the continuous case.
