ENH: stats: Use explicit MLE formulas in uniform.fit() #8554

WarrenWeckesser · 2018-03-14T15:36:27Z

uniform.fit() will now give the "exact" maximum likelihood parameter estimates.

In previous cases where I implemented the exact MLE formulas (#2519, #8358), I used decorators to tweak the fit docstring. I now think that approach is more trouble than it is worth. In this PR, I give uniform.fit its own docstring. This allows the docstring to be much simpler and clearer, by providing only the information relevant to uniform.fit. And the examples use uniform.fit instead of beta.fit and norm.fit.

(In another pull request--not yet created--I'll give the docstrings for norm.fit and expon.fit the same treatment.)

chrisb83 · 2018-03-15T19:53:49Z

Question: Does fit need to rely on MLE in that case? It is not the best estimator: if one uses the max of the observations to find the upper bound, one always understimates,it is better to use (1 + 1/n) * max, especially for small samples. (it is the minimum-variance unbiased estimator, see https://en.wikipedia.org/wiki/Uniform_distribution_(continuous))

WarrenWeckesser · 2018-03-15T20:09:46Z

@chrisb83 wrote:

Question: Does fit need to rely on MLE in that case?

Yes. Currently fit computes the MLE. That's what it is documented to do; it must not do something else.

Enhancing the fit function with other fitting methods would be great (and it is something I have experimented with), but it is not something we should do ad hoc to individual distributions. To implement a new fitting technique, we'd need something like a method argument in the fit function, or even a new function if it turns out that changing the API of the current fit function is too awkward to do in a backwards-compatible way.

(The to-do/wish list is long. The other big enhancement to fit that is needed is to return information about the goodness of the MLE fit.)

mirca · 2018-03-17T03:35:38Z

@WarrenWeckesser by goodness of the MLE fit, you mean the Cramér-Rao Lower Bound? (Asking as someone who is willing to contribute some code if that's the case)

WarrenWeckesser · 2018-03-22T02:45:09Z

@mirca: Yes. A standard calculation in MLE is to use the inverse of the Hessian of the log-likelihood function to estimate the asymptotic covariance matrix. If you are interested in this, it would be best to open up a new issue to discuss the appropriate calculations and the API for an implementation.

josef-pkt · 2018-03-22T03:23:36Z

I don't think Hessian will work for uniform and other distributions where the support bounds are a function of the parameters.
AFAIR, the problem is because the likelihood function is not continuous or not differentiable.

WarrenWeckesser · 2018-03-26T00:26:13Z

I added comments in uniform.fit() to explain how the maximum likelihood estimate is calculated for the uniform distribution.

ev-br · 2018-04-01T18:10:32Z

I cannot help noticing that a significant part of this PR might be in rv_continuous.fit. Ditto for some other distributions which define special-cased fit method (gamma, beta). Maybe it would be good to have a pair of a public fit and private _fit which does all actual heavy lifting. However, this is clearly separate from this PR, and this PR looks good. Thanks Warren

WarrenWeckesser added scipy.stats enhancement A new feature or improvement labels Mar 14, 2018

WarrenWeckesser force-pushed the uniform-fit branch 2 times, most recently from 13a73dd to c2cd5d1 Compare March 14, 2018 16:32

ENH: stats: Use explicit MLE formulas in uniform.fit()

dec2a82

WarrenWeckesser force-pushed the uniform-fit branch from c2cd5d1 to dec2a82 Compare March 25, 2018 22:56

ev-br merged commit d468c4e into scipy:master Apr 1, 2018

ev-br added this to the 1.1.0 milestone Apr 1, 2018

WarrenWeckesser deleted the uniform-fit branch April 1, 2018 20:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ENH: stats: Use explicit MLE formulas in uniform.fit() #8554

ENH: stats: Use explicit MLE formulas in uniform.fit() #8554

WarrenWeckesser commented Mar 14, 2018

chrisb83 commented Mar 15, 2018

WarrenWeckesser commented Mar 15, 2018

mirca commented Mar 17, 2018

WarrenWeckesser commented Mar 22, 2018

josef-pkt commented Mar 22, 2018

WarrenWeckesser commented Mar 26, 2018

ev-br commented Apr 1, 2018

ENH: stats: Use explicit MLE formulas in uniform.fit() #8554

ENH: stats: Use explicit MLE formulas in uniform.fit() #8554

Conversation

WarrenWeckesser commented Mar 14, 2018

chrisb83 commented Mar 15, 2018

WarrenWeckesser commented Mar 15, 2018

mirca commented Mar 17, 2018

WarrenWeckesser commented Mar 22, 2018

josef-pkt commented Mar 22, 2018

WarrenWeckesser commented Mar 26, 2018

ev-br commented Apr 1, 2018