ENH: stats.trap - adding trapezoidal distribution closes #6028 #6030

andyfaff · 2016-04-04T03:54:10Z

Adds a trapezoidal distribution to scipy.stats. For those au-fait with the stats module please inform me of the relevant tests I need to add. The testing framework seems to be automatic, but the test_stats.py is mammoth, so I don't know where to start.
Possible work to be done (to satisfaction of stats guru):

figure out the relevant statistics for the _stats method.
check that the trapezoidal distribution reduces to the uniform and triangular distributions.
add other tests
add further documentation.

andyfaff · 2016-04-04T03:57:11Z

I noticed that _distr_params.py has all the continuous distributions listed. I can add trap there, but it's not clear what one is supposed to add. The file states "Sane parameters for stats.distributions.", but that comment is fairly cryptic.

ev-br · 2016-04-04T08:38:57Z

Parameters: adding an entry to _distr_params.py runs this battery of tests: https://github.com/scipy/scipy/blob/master/scipy/stats/tests/test_continuous_basic.py#L72
Re what to add: ['trap', (1, 2)] would mean c=1, d=2.
These values will also be used for constructing the docstring example (this is what fails on Travis now).
(If you have an idea of how to document it better, please go ahead)

Additional tests (+1 for limiting cases) go to test_distributions.py, which has roughly a test case class per distribution.

ev-br · 2016-04-04T08:44:19Z

scipy/stats/_continuous_distns.py

+    """
+    def _rvs(self, c, d):
+        rands = self._random_state.rand(self._size)
+        return self._ppf(rands, c, d)


This is inherited from superclass, not needed here https://github.com/scipy/scipy/blob/master/scipy/stats/_distn_infrastructure.py#L801

ev-br · 2016-04-04T09:02:32Z

figure out the relevant statistics for the _stats method.

You can just define _munp(self, n, c, d) to compute n-th central moment. Or leave it be --- this would be nice to have, but not absolutely necessary IMO.

ev-br · 2016-04-04T09:13:37Z

scipy/stats/_continuous_distns.py

+                      x**2 / c / (d - c + 1),
+                      (c + 2 * (x - c)) / (d - c + 1),
+                      1 - ((1 - x)**2 / (d - c + 1) / (1 - d))]
+        return np.select(condlist, choicelist)


np.select is cute --- does it handle x, c or d being arrays?

Best solution found so far. It handles x being array or scalar, but c and d have to be scalar.
I had a few mental gymnastics figuring out how to do this without examining if x was array or scalar.

One simple option is of course just two calls to np.where.

andyfaff · 2016-04-05T03:10:13Z

@ev-br I added sane values to the trap entry in _distr_params.py (so the doctests would pass) but now I'm getting a weird error, see below. Why on earth is it generating a shape tuple of (1.8589710689636982, 1.461789732784883) instead of (0.2, 0.8) like I specified in _distr_params.py?

ERROR: test_distributions.test_all_distributions('trap', (1.8589710689636982, 1.461789732784883), 0.01)

Traceback (most recent call last):
File "/Users/anz/miniconda3/envs/dev3/lib/python3.4/site-packages/nose/case.py", line 198, in runTest
self.test(_self.arg)
File "/Users/anz/Documents/Andy/programming/scipy/build/testenv/lib/python3.4/site-packages/scipy/stats/tests/test_distributions.py", line 61, in check_distribution
D, pval = stats.kstest(dist, '', args=args, N=1000)
File "/Users/anz/Documents/Andy/programming/scipy/build/testenv/lib/python3.4/site-packages/scipy/stats/stats.py", line 3904, in kstest
vals = np.sort(rvs(_args, **kwds))
File "/Users/anz/Documents/Andy/programming/scipy/build/testenv/lib/python3.4/site-packages/scipy/stats/_distn_infrastructure.py", line 855, in rvs
raise ValueError("Domain error in arguments.")
ValueError: Domain error in arguments.

ev-br · 2016-04-05T04:03:15Z

(As well hidden in the traceback), this is a different set of tests, which generate random sets of parameters:
https://github.com/scipy/scipy/blob/master/scipy/stats/tests/test_distributions.py#L70

also filter out a RuntimeWarning for division by zero in a neighbor test

Out-of-range input is handled automagically by the distributions framework.

ev-br · 2016-04-06T11:05:14Z

You seem to be hitting quite a few ugly corners of the distributions framework...

The last remaining failure can be taken care of by just listing trap as a failing fit, see the list in stats/tests/test_fit.py.

I've played with it a bit to check how it fares for array-valued shapes, and it seems OK -- there's an additional simple test in https://github.com/ev-br/scipy/tree/pr/6030, can you take it over?
In that branch, I also removed extraneous cases from the select: the out-of-bounds values are handled automatically by the framework, and _pdf does not even see x < 0.

From my POV, this is ready modulo blacklisting the test and renaming it to trapz.

Conflicts: scipy/stats/tests/test_distributions.py

codecov-io · 2016-04-07T06:11:57Z

@@            master   #6030   diff @@
======================================
  Files          238     238       
  Stmts        43831   43849    +18
  Branches      8215    8215       
  Methods          0       0       
======================================
+ Hit          34262   34280    +18
  Partial       2602    2602       
  Missed        6967    6967

Review entire Coverage Diff as of 0605410

Powered by Codecov. Updated on successful CI builds.

ev-br · 2016-04-09T11:05:13Z

Ok, I' m going to merge it. Edge cases, c=0 and d=1, might warrant additional care, but that can be taken care of in a sequel when this sees some usage. Thanks @andyfaff.

andyfaff added the scipy.stats label Apr 4, 2016

andyfaff changed the title ~~ENH: stats.trap - adding trapezoidal distribution~~ ENH: stats.trap - adding trapezoidal distribution closes #6028 Apr 4, 2016

ENH: stats.trap - adding trapezoidal distribution

dd61b6d

andyfaff force-pushed the trap branch from b1767ca to dd61b6d Compare April 4, 2016 03:59

MAINT: stats.trap.pdf, remove division by zero

72908f2

ev-br reviewed Apr 4, 2016
View reviewed changes

ev-br added the enhancement A new feature or improvement label Apr 4, 2016

ev-br reviewed Apr 4, 2016
View reviewed changes

TST: added tests for stats.trap

20cf66a

andyfaff and others added 6 commits April 5, 2016 14:09

TST: add trap to test_all_distributions

8620cee

PEP8: stats.trap space around operator

295c1a0

DOC: adding continuous_trap to stats tutorial

fa837e6

DOC/TST: fixed refguide tst stats.trap

04f0059

TST: stats: test vectorized computations in stats.trap.pdf

b867b63

also filter out a RuntimeWarning for division by zero in a neighbor test

MAINT: stats: remove extraneous cases from stats.trap

7440fff

Out-of-range input is handled automagically by the distributions framework.

andyfaff added 2 commits April 7, 2016 14:09

MAINT: rename stat.trapz

bf45af8

Merge branch 'pr/6030' of https://github.com/ev-br/scipy into trap

bc4bf6b

Conflicts: scipy/stats/tests/test_distributions.py

andyfaff force-pushed the trap branch from 57faddb to d5911e7 Compare April 7, 2016 05:36

TST: adding a few more stats.trapz tests

d5911e7

ev-br merged commit 543aca9 into scipy:master Apr 9, 2016

ev-br added this to the 0.18.0 milestone Apr 9, 2016

andyfaff deleted the trap branch April 11, 2016 02:34

ev-br mentioned this pull request Jul 27, 2016

Added Breit-Wigner distribution #6419

Closed

bashtage mentioned this pull request Aug 10, 2016

random.trapezoidal numpy/numpy#3770

Closed

andyfaff mentioned this pull request Dec 4, 2017

ENH added normal inverse gaussian distribution to scipy.stats #8171

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ENH: stats.trap - adding trapezoidal distribution closes #6028 #6030

ENH: stats.trap - adding trapezoidal distribution closes #6028 #6030

andyfaff commented Apr 4, 2016

andyfaff commented Apr 4, 2016

ev-br commented Apr 4, 2016

ev-br Apr 4, 2016

ev-br commented Apr 4, 2016

ev-br Apr 4, 2016

andyfaff Apr 4, 2016

ev-br Apr 4, 2016

andyfaff commented Apr 5, 2016

ERROR: test_distributions.test_all_distributions('trap', (1.8589710689636982, 1.461789732784883), 0.01)

ev-br commented Apr 5, 2016

ev-br commented Apr 6, 2016

codecov-io commented Apr 7, 2016

ev-br commented Apr 9, 2016

ENH: stats.trap - adding trapezoidal distribution closes #6028 #6030

ENH: stats.trap - adding trapezoidal distribution closes #6028 #6030

Conversation

andyfaff commented Apr 4, 2016

andyfaff commented Apr 4, 2016

ev-br commented Apr 4, 2016

ev-br Apr 4, 2016

Choose a reason for hiding this comment

ev-br commented Apr 4, 2016

ev-br Apr 4, 2016

Choose a reason for hiding this comment

andyfaff Apr 4, 2016

Choose a reason for hiding this comment

ev-br Apr 4, 2016

Choose a reason for hiding this comment

andyfaff commented Apr 5, 2016

ERROR: test_distributions.test_all_distributions('trap', (1.8589710689636982, 1.461789732784883), 0.01)

ev-br commented Apr 5, 2016

ev-br commented Apr 6, 2016

codecov-io commented Apr 7, 2016

ev-br commented Apr 9, 2016