MAINT: stats.wilcoxon: improve documentation and tests #21867

mdhaber · 2024-11-11T19:07:20Z

Reference issue

What does this implement/fix?

gh-21567 made the following improvements to scipy.stats.wilcoxon:

Respect the method parameter if it is explicitly passed by the user. In particular, if method='exact' is specified, use it regardless of whether there are ties/zeros.
Improve 'auto' method by performing a permutation test if the sample size is small and there are ties or zeros.
Rename method='approx' to method='asymptotic' for consistency with closely related tests mannwhitneyu, kendalltau, and page_trend_test.

During the review process, I noticed some other items deserving of maintenance. This PR makes those adjustments; rationale is noted inline.

mdhaber

@chrisb83 Could you take a look at these follow-up suggestions?

mdhaber · 2024-11-11T19:08:36Z

scipy/stats/_morestats.py

-    - The default, ``method='auto'``, selects between the two: when
-      ``len(d) <= 50`` and there are no zeros and no ties, the exact method
-      is used; if the sample size is small and there are zeros or ties, the
-      p-value is computed using `permutation_test`;
-      otherwise, the approximate method is used. The p-value computed by
-      the permutation test is deterministic since it is only used if the
-      sample size is small enough to iterate over all possible outcomes.
+    - The default, ``method='auto'``, selects between the two:
+    ``method='exact'`` is used when ``len(d) <= 50``, and
+    ``method='asymptotic'`` is used otherwise.


We have not yet defined "ties" or "zeros". Above, it specifies

Assume that all elements of d are independent and identically distributed observations, and all are distinct and nonzero.

So let's document what happens under this assumption, then define ties and zeros and document how those affect the behavior.

mdhaber · 2024-11-11T19:18:59Z

scipy/stats/_wilcoxon.py

-    if n_zero > 0 and method == "exact":
-        warnings.warn("Exact p-value calculation does not work if there are "
-                      "zeros. Consider using method `asymptotic` or "
-                      "`stats.PermutationMethod`",
-                      stacklevel=2)


As noted in #21567 (comment):

'exact' is also not exact if there are ties, but there is no similar warning in that case.

Users have expressed confusion about what a "zero" is in the past.

The recommended normal approximation / method='asymptotic' is not exact, either, so this doesn't motivate why one would be better than the other. (A permuation test would also not be exact unless the sample size is small.)

It's tough to address all the subtleties of accurate hypothesis testing in a warning; that's why we've elaborated on this stuff in the documentation. And I'm concerned that incomplete warnings like this can do more harm than good: it may give users the impression that the code will help them avoid shooting themselves in the foot, but scipy.stats doesn't do that consistently. If we think a more consistent policy across the board is important, that is something we can consider, but for now, let's not give a false impression.

mdhaber · 2024-11-11T19:20:10Z

scipy/stats/_wilcoxon.py

-    if 0 < d.shape[-1] < 10 and method == "asymptotic":
-        warnings.warn("Sample size too small for normal approximation.", stacklevel=2)


Similarly here: does this imply that a sample size of 10 is large enough for a normal approximation?

Note that there wasn't actually a test for this warning before. It was always just filtered out, so presumably it was not deemed to be critical.

mdhaber · 2024-11-11T19:20:54Z