False CDF values for skew normal distribution #7746

pphilippos · 2017-08-18T10:04:01Z

When x is sufficiently large, skewnorm.cdf outputs 0 instead of 1.

Reproducing code example:

A short code example that reproduces the problem:

from scipy.stats import skewnorm as sk2
import numpy as np
for x in np.linspace(-10,50,1000):
	print str(x) + " " + str(sk2.cdf(x,-1))

Error message:

No error messages. See the following graph for the issue:

Scipy/Numpy/Python version information:

Python 2.7.13
Scipy 0.19.1
Numpy 1.12.1

The text was updated successfully, but these errors were encountered:

pv · 2017-08-18T15:46:37Z

Correct. The technical problem is that numerical integration of the pdf starts to fail once the support becomes too narrow compared to the interval. Can probably be fixed by just returning 1.0 when deep in the tail of the cdf.

ev-br · 2017-08-18T22:13:53Z

ISTM the best fix would be to finish off gh-7120, which implements the Owen's T function, and add the explicit form of the _cdf. This seems to fix the loss of precision, cf https://github.com/ev-br/scipy/tree/pr/7120. The only relevant commit on top of gh-7120 is 86b43ba

The CDF is computed by integrating the PDF using scipy.integrate.quad. To ensure that quad "sees" the peak of the PDF, the integral is split at x=0. The calculation of the survival function is improved by using the symmetry it has with the CDF: sf(x, a) = cdf(-x, -a). Closes scipygh-7746.

ev-br · 2018-03-03T20:50:14Z

Per the discussion in gh-8473: the naive formula contains the difference 2\Phi(x) - T(x,a), which suffers from the loss of precision for some parameters. A proper fix likely involves coming up with the strategy of computing the full expression, instead of subtracting two terms. Meanwhile, gh-8501 has a workaround.

pphilippos changed the title ~~False CDF values of the skewnorm distribution~~ False CDF values for skew normal distribution Aug 18, 2017

ev-br added defect A clear bug or issue that prevents SciPy from being installed or used as expected scipy.stats labels Aug 18, 2017

ev-br mentioned this issue Feb 24, 2018

BUG: stats: fix skewnorm.cdf losing precision at large x #8473

Closed

WarrenWeckesser mentioned this issue Feb 28, 2018

BUG: stats: Split the integral used to compute skewnorm.cdf. #8501

Merged

rgommers closed this as completed in #8501 Mar 31, 2018

rgommers added this to the 1.1.0 milestone Mar 31, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

False CDF values for skew normal distribution #7746

False CDF values for skew normal distribution #7746

pphilippos commented Aug 18, 2017

pv commented Aug 18, 2017

ev-br commented Aug 18, 2017

ev-br commented Mar 3, 2018

False CDF values for skew normal distribution #7746

False CDF values for skew normal distribution #7746

Comments

pphilippos commented Aug 18, 2017

Reproducing code example:

Error message:

Scipy/Numpy/Python version information:

pv commented Aug 18, 2017

ev-br commented Aug 18, 2017

ev-br commented Mar 3, 2018