ENH: stats: add nonparametric one-sample quantile test and CI #12680
Conversation
I just added a few drive-by comments--not a substitute for a stats guru review.
Thanks for the contribution. At the moment, there are no functions in SciPy that return a CI; #12609 adds a CI to a test. In the same spirit, one could implement a quantile test (see e.g. Conover, Practical Nonparametric Statistics) and, in addition to the p-value, return the CI. (we already have ...
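For reference, the test described in Conover is essentially a binomial test on counts relative to the hypothesized quantile. A minimal sketch of the idea (my own illustration, not code from this PR; the helper name is hypothetical):

```python
import numpy as np
from scipy import stats

def quantile_test_sketch(x, q, p=0.5):
    """Two-sided test of H0: the p-quantile of x's distribution equals q."""
    x = np.asarray(x)
    n = x.size
    t1 = np.count_nonzero(x <= q)  # observations at or below q
    t2 = np.count_nonzero(x < q)   # observations strictly below q
    # Under H0, both counts are Binomial(n, p).  A small t1 is evidence
    # that the true quantile lies above q; a large t2 that it lies below.
    p_greater = stats.binom.cdf(t1, n, p)       # one-sided p-value, 'greater'
    p_less = stats.binom.sf(t2 - 1, n, p)       # one-sided p-value, 'less'
    return min(2 * min(p_greater, p_less), 1.0)
```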
fixed minor documentation things
The only CI failure is in sparse.linalg and unrelated to this PR. Unless there are further comments, I believe this PR is ready to be merged.
@romain-jacob have you thought about the following suggestion?
Hi @chrisb83, yes, sorry I forgot to follow up on that one. The method I am proposing here is not related to hypothesis testing; it's an estimation method for quantiles. I do not know the quantile test you are referring to (I did not look it up), but if it is a test of practical relevance, then I guess it could certainly be added too. But I think that should be in a distinct PR, no?
In general, there is a very close relationship between CIs and hypothesis tests. E.g., if you estimate a mean and can construct confidence intervals, you can define a test, and vice versa. I am just wondering about the best way to integrate the functionality into SciPy. At the moment, just returning a CI without an estimate of the quantile looks somewhat odd. Are there other implementations, e.g. in R, that we can use as a benchmark?
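To illustrate the duality being described (a hypothetical sketch, not PR code): a (1 - alpha) confidence set for the p-quantile can be read off as the set of hypothesized values that a level-alpha two-sided test does not reject. Since distribution-free quantile CI endpoints are order statistics, it suffices to scan the sample values themselves:

```python
import numpy as np

def ci_by_test_inversion(x, p, alpha, two_sided_test):
    """Collect the sample values q0 that the test fails to reject."""
    accepted = [q0 for q0 in np.sort(np.asarray(x))
                if two_sided_test(x, q0, p) > alpha]
    return (accepted[0], accepted[-1]) if accepted else None
```

Plugging in something like the `quantile_test_sketch` above as `two_sided_test` recovers, up to discreteness conventions, the same interval as the direct order-statistic construction.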
I see your point, and indeed I am aware of the close connection between CIs and NHST, but I am not aware of any "test" that explicitly focuses on percentiles (except the median). That being said, I know very little about statistics in general, so such tests may well exist; I just could not find any. I browsed through the R packages and found QuantileNPCI, which seems to do about the same thing, although it appears to compute only two-sided CIs (not 100% sure yet, I just had a quick look). Unfortunately I am not proficient in R, so it's not easy for me to quickly compare the two implementations. But I can try computing the CI presented in the QuantileNPCI example; that will already do for a quick check.
Yes
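For the comparison, my understanding is that QuantileNPCI and similar tools build on the classic order-statistic construction: for i.i.d. data, the number of observations below the true p-quantile is Binomial(n, p), so a pair of order statistics brackets the quantile with known coverage. A rough sketch (my own, not a port of the R code; exact index conventions vary between implementations):

```python
import numpy as np
from scipy import stats

def quantile_ci_sketch(x, p, confidence=0.95):
    """Approximate two-sided distribution-free CI for the p-quantile."""
    x = np.sort(np.asarray(x))
    n = x.size
    alpha = 1 - confidence
    lo = int(stats.binom.ppf(alpha / 2, n, p))      # lower order statistic
    hi = int(stats.binom.ppf(1 - alpha / 2, n, p))  # upper order statistic
    if lo < 0 or hi >= n:
        return None  # not enough samples for the requested confidence
    return x[lo], x[hi]
```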
scipy/stats/_stats_py.py (Outdated)

```
calculate the p-value respectively. `1` corresponds to the "greater"
alternative hypothesis and `-1` to the "less". For the "two-sided"
alternative, a value of `+1` means, that given the data, there is
greater than equal likelihood that the population quantile is larger
```

Suggested change:

```diff
- greater than equal likelihood that the population quantile is larger
+ greater than or equal likelihood that the population quantile is larger
```

?
scipy/stats/_stats_py.py (Outdated)

```
or `T2`, the observed number of observations strictly less than the
hypothesized quantile. Two test statistics are required to handle the
possibility the data was generated from a discrete or mixed
distribution. `T1` is sensitive whether the population quantile is
```

Suggested change:

```diff
- distribution. `T1` is sensitive whether the population quantile is
+ distribution. `T1` is sensitive to whether the population quantile is
```
scipy/stats/_stats_py.py (Outdated)

```
greater than equal likelihood that the population quantile is larger
than the hypothesized value, while a value of `-1` means there is
strictly greater likelihood that it is less than the hypothesized
value.
```

I'm not familiar with this use of "likelihood". Also, what is the likelihood greater than?

Sorry. It’s nonsense. I wrote it on the plane earlier today while dozing. I’ll fix it up.
Naming things is so hard. I'm not happy with the following repr:

```
>>> rvs = stats.uniform.rvs(size=100, random_state=rng)
>>> stats.quantile_test(rvs, q=0.6, p=0.75, alternative='greater')
QuantileTestResult(statistic=64, statistic_sign=1, pvalue=0.00940696592998271)
```

My thought is that if we're encoding whether the statistic ... To me, I think the important thing that should be communicated is that ...

Part of the difficulty is that the statistic itself is always a count of values to the same side of the quantile, regardless of the alternative. Could this be refactored such that one of the statistics is changed to its complement? Then ...
I’ll think about it too. Sounds very promising.
Hello there! Thanks a lot for your help @steppi! I just wanted to say that it's been strange to see the work on this PR unfolding without me lifting a finger, but I'm really grateful to see more experienced contributors wrapping things up. So, many thanks :-)
Happy to help @romain-jacob! A group of us stats maintainers got together at SciPy 2023 and started working on wrapping up / reviewing outstanding PRs. Thanks for this valuable contribution!
@mdhaber I decided to go with ... I think T1 and T2 are nice because for the continuous case each approximates ...
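A small numerical illustration of why both statistics are needed (a hypothetical example, not from the PR): with a continuous distribution, T1 and T2 agree almost surely, while ties at the hypothesized quantile make them differ for discrete data.

```python
import numpy as np

rng = np.random.default_rng(1234)
q = 2.0  # hypothesized quantile value

x_cont = rng.normal(loc=2.0, size=1000)   # continuous: no ties at q
x_disc = rng.poisson(lam=2.0, size=1000)  # discrete: many ties at q

for x in (x_cont, x_disc):
    t1 = np.count_nonzero(x <= q)  # at or below q
    t2 = np.count_nonzero(x < q)   # strictly below q
    print(t1, t2, t1 - t2)         # difference = number of ties at q
```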
scipy/stats/_stats_py.py (Outdated)

```diff
@@ -10063,7 +10060,7 @@ def quantile_test(x, *, q=0, p=0.5, alternative='two-sided'):
     expect the null hypothesis to be rejected.

     >>> stats.quantile_test(rvs, q=0.5, p=0.5, alternative='greater')
-    QuantileTestResult(statistic=67, statistic_sign=1, pvalue=0.9997956114162866)
+    QuantileTestResult(statistic1=67, statistic_type=1, pvalue=0.9997956114162866)
```

@steppi isn't this supposed to be `statistic`, not `statistic1`? Here and elsewhere.
Yes
@romain-jacob Could you check the recent changes for accuracy? Once you approve, these corrections have been made, and @steppi has taken a final pass through the rendered documentation to look for inconsistencies like the ones I've highlighted, I think it's fine for @steppi to merge.
I just reviewed the current state and double-checked the code and the examples. The change to the ... So all good from my side :-)
Looks like most of the stylization issues are fixed. I'm re-reviewing so it's easier to keep track of what remains.
Looking pretty good @steppi. I think we should rebase, squashing consecutive commits by the same author. You want to do that, or want me to?
You can do it if you have time. Eating dinner now and won’t have a chance for a while.
I've rebased some pretty messy histories before, but this one seems really tough. Would you take a look? If it's too messy, LMK.
Sure. Will do.
- DOC: Sync docstring for test with actual implementation
- DOC: Indent hypotheses [skip actions]
- DOC: Fix docstring formatting [skip actions]
- DOC: Try to fix another sphinx issue [skip actions]
- TST: Fix hardcoded value in doctest [skip actions]
- TST: More fix for doctest [skip actions]
- DOC: More thrashing [skip actions]
- DOC: sphinx was confused by `p`th [skip actions]
- DOC: try to deal with more sphinx weirdness [skip actions]
- DOC: Change pth to latex [skip actions] [skip cirrus]
- DOC: Make p-value consistent in docs [skip actions] [skip cirrus]
- DOC: More docstring cleanup [skip actions] [skip cirrus]
- Update scipy/stats/_stats_py.py (Co-authored-by: Matt Haberland <mhaberla@calpoly.edu>)
- Update scipy/stats/_stats_py.py [skip actions] [skip cirrus] (Co-authored-by: Matt Haberland <mhaberla@calpoly.edu>)
- Remove blank line (squash)
- MAINT: Make result contain only one statistic + info
- MAINT: Bring some lines to 79 chars or less
- MAINT: Fix statistic not set for alternative two-side
- MAINT: Update statistic_type -> statistic_sign
- TST: Fix doctests [skip actions] [skip cirrus]
- MAINT: Change to statistic_type 1, 2
- MAINT: Fix formatting in tests
- Apply suggestions from code review [skip cirrus] [skip actions] (Co-authored-by: Matt Haberland <mhaberla@calpoly.edu>)
- DOC: Documentation fixes [skip actions] [skip cirrus]
- DOC: documentation fixes [skip actions] [skip cirrus]
- DOC: More doc fixes [skip actions] [skip cirrus]
- DOC: Adjust formatting for bullet list [skip actions] [skip cirrus]
- DOC: Adjust subscript to superscript in pth [skip cirrus] [skip actions]
- DOC: Remove extraneous blank line [skip actions] [skip cirrus]
- DOC: Unbold hypotheses [skip cirrus] [skip actions]
Congratulations and thanks for seeing it through!
Reference issue
Relates to statsmodels/statsmodels#6562.
Loosely relates to #10577 (additional statistical methods, including confidence intervals).
What does this implement?
This PR implements a non-parametric approach to compute confidence intervals for quantiles for a given confidence level and input `x`, which is either a set of samples (one-dimensional array_like) or the number of samples available. The confidence intervals are valid if and only if the samples are i.i.d.

Both one-sided and two-sided confidence intervals can be obtained (default is one-sided). The function returns two values: either the bounds for the two one-sided confidence intervals, or the lower and upper bounds of a two-sided confidence interval.

The return values are either the indexes of the bounds (if `x` is an integer) or sample values (if `x` is the set of samples). `None` is returned when there are not enough samples to compute the desired confidence interval.

This approach is well known and present in statistics textbooks, but it somehow got overlooked despite its simplicity and usefulness. An implementation in SciPy would help change that.
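For readers landing here later, a usage sketch of the functionality as it was eventually merged (attribute and method names reflect my reading of the final SciPy API; check the `scipy.stats.quantile_test` docs for your version):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(8675309)
rvs = stats.uniform.rvs(size=100, random_state=rng)

# Test H0: the 0.5-quantile (median) of the population equals 0.5
res = stats.quantile_test(rvs, q=0.5, p=0.5, alternative='two-sided')
print(res.statistic, res.statistic_type, res.pvalue)

# The result object also exposes the nonparametric CI from this PR
ci = res.confidence_interval(confidence_level=0.95)
print(ci.low, ci.high)
```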
Additional information