DOC: SciPy extensions for code style and docstring guidelines. #13955

WarrenWeckesser · 2021-04-28T21:49:42Z

I've been keeping notes on various requests for changes that come up now and then in PRs in NumPy and SciPy. In this PR, I've documented some of the common requests that I've seen in a new section of the docs called "Code and Documentation Style Guide - The Missing Bits". These are issues that are not covered in the usual coding and documentation PEPs and guides, and that can be expressed as a fairly simple rule.

The motivation for these guidelines is this: if a core developer makes a request to change a PR that is a blocking request, and if the change can be expressed as a fairly simple rule, and if there is consensus among the core devs that the requested change should block the PR, then the rule should be documented. That way it is clear that the requested change is not just the reviewer's personal preference, but in fact is the preferred style approved by the SciPy devs.

It is unlikely that everyone will agree that we need all these new guidelines. In fact, I expect that for some of these guidelines, a reaction will be "it is not important, we don't need it". That's fine! But then we have to be sure that in the future, if a reviewer requests a such a change in a PR, the request is just a suggestion (i.e. an expression of personal preference), and not a request that will block the PR.

For example, I personally don't care about the relatively trivial guidelines about where to put the space when a long string is broken at a word boundary, or about the suggestion to always include a blank line as the last line of a multiline docstring. If everyone else feels the same, then I'll take those out. But that means reviewers who do happen to have a preference should not block a PR that doesn't follow the guidelines. (There is nothing wrong with stating the preference in a review comment, but it shouldn't block a PR.)

doc/source/dev/missing-bits.rst

rgommers

Thanks for writing this up Warren. Most/all of this looks like good guidance, and useful to document. My one caution is that this should be documentation primarily for us (maintainers), you cannot expect new contributors to read and adhere to all that. And if they don't, keep in mind https://numpy.org/devdocs/dev/reviewer_guidelines.html#communication-guidelines. These are important to not make contributing to SciPy feel painful.

Related: pre-commit is now much more user-friendly than manual pre-commit hooks were in the past, so integrating that and making it run tools/lint_diff.py (same as in CI) would also be useful I think. I haven't used it much, but quite a few people have told me they like it.

doc/source/dev/missing-bits.rst

tupui · 2021-04-29T13:00:42Z

Related: pre-commit is now much more user-friendly than manual pre-commit hooks were in the past, so integrating that and making it run tools/lint_diff.py (same as in CI) would also be useful I think. I haven't used it much, but quite a few people have told me they like it.

Also these tools now offer a way to not kill git blame and such (see here). So having something like Black would remove lots of discussions. I set these up (black, pre-commit, isort) on a few projects at work and it's really convenient.

tupui

Nice, thanks for proposing that! (Do you have the feedback on skipping the CI?)

doc/source/dev/missing-bits.rst

tupui · 2021-04-29T15:00:08Z

doc/source/dev/missing-bits.rst

+* For a new argument added to an existing function,  two locations have been
+  used for the the 'versionadded' markup, [TBD: which is preferred?]:
+
+  * At the end of the description of the argument in the "Parameters" section


+1 on my side for this option.

It has been roughly 11 months and no one has indicated a different preference, so let's go with the first option.

tupui · 2021-04-29T16:11:35Z

doc/source/dev/missing-bits.rst

+statement makes the import *optional*; this guideline says explicitly
+that the import statement must not be included.
+
+


Suggested change

How to use Random Number Generator (RNG) and seed

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

In case random numbers are needed, `np.random.Generator` API must be used.

Use the helper function to validate the parameter `seed`, `random_state` [TBD: which is preferred?]::

from scipy._lib._util import check_random_state

def foo(seed=None):

rng = check_random_state(seed)

sample = rng.uniform(...) # for instance

If you want to write an example using random numbers in the documentation::

>>> rng = np.random.default_rng()

>>> sample = rng.uniform(...) # for instance

.. warning:: Do **not** specify the seed. This line will get overwritten and the same seed

will be used for conveniency for all examples. The value that you can use locally is:

`1638083107694713882823079058616272161`

@tupui, do you think you could split this into two parts, one for generating random data in the "Examples" section of a docstring, and one for using random data in unit tests? I hadn't followed very closely the discussions in #13863, but in #14182, @rkern pointed me to the important conclusions, including the part about a predefined seed being used when Sphinx builds the docs. This means a developer can test the code that will be used in an example, and be sure that the seed will match the one used in Sphinx. We need to get some version of this comment into these docs.

Back end of June. I will get to this as soon as I'm back if this is still a thing.

This has been updated and ready to include on my side.

Other than

In case random numbers are needed, np.random.Generator API must be used.

seeming to apply to test code and not just docstrings, this looks good to me. Pick any one of seed/random_state/rng for now. We can use the new decorator for backward compatible keyword renaming to change the name once we settle on something. We can use the new decorator for backward compatible keyword renaming to change the name once we settle on something.

That comment linked above looks like a good idea to me. It is good to include the seed here in the meantime, but it would also be nice to have something closer to English that we can import rather than having to refer to the docs every time. It would also be good if that thing could be automatically replaced in the user-facing docstring so that we don't need to remember to remove the seed before pushing (and inevitably add it back in and remove it again as a PR develops).

I opened #15852 for that.

ev-br · 2021-05-01T09:15:30Z

One other thing to add here might be not using asserts outside of test code. Errors need to be raised explicitly.

tupui · 2021-05-02T14:32:06Z

I am also seeing this in some PR as we add seeding, we should also decide about how to name this: seed, random_sate, something else?

tupui · 2021-05-10T12:37:09Z

Another point, when raising an exception: how to document parameters? There are many format used here. Sometimes we do ValueError("`param` is not good"), ValueError("``param`` is not good") or ValueError("param is not good").

tupui

I think we should prioritize this PR. Possibly backport it, but not necessary as we push people to look at the devdoc anyway. All these are quite important things that we always have to talk about during reviews (and we could add a link to this in the PR template).

tupui · 2021-05-27T11:05:46Z

doc/source/dev/missing-bits.rst

+            Parameters
+            ----------
+            x : float
+                x must be nonnegative.


Might be work talking about how to format the params in the description vs others things with double backticks.

Suggested change

x must be nonnegative.

`x` must be nonnegative.

[skip travis] [skip actions] [skip azp]

[skip actions] [skip travis] [skip azp]

WarrenWeckesser · 2021-06-06T22:20:36Z

Hmmm... why are the Azure tests running? I thought [skip azp] was working these days.

WarrenWeckesser · 2021-06-07T02:09:30Z

Hmmm... why are the Azure tests running? I thought [skip azp] was working these days.

Proposed fix for the handling of [skip azp] is in #14197.

Co-authored-by: Melissa Weber Mendonça <melissawm@gmail.com>

If necessary, this can be hashed out in a follow-up PR.

…section.

[skip azp] [skip actions]

WarrenWeckesser · 2022-03-22T09:38:43Z

I have updated the PR and moved it out of "draft" status.

These are unresolved items where conversations have been started here but would benefit from being split out as separate follow-up issues or PRs:

Use of backticks around variable names: reconciling the NumPy docstring guidelines with (1) desired
appearance of the rendered web pages, and (2) occasional incorrect links when single backticks are used.
Use of delimiters around parameter names in error and warning messages.
Using (and seeding) random numbers in docstrings and in unit tests. (@tupui has provided a starting point for these guidelines in the comments.)

I don't know what is happening with CircleCI; that's the one test that I wanted to run!

ilayn · 2022-03-22T09:52:19Z

We are using git+ssh instead of https and things changed on git side. https://stackoverflow.com/questions/70663523/the-unauthenticated-git-protocol-on-port-9418-is-no-longer-supported

tupui · 2022-03-22T09:54:15Z

I don't know what is happening with CircleCI; that's the one test that I wanted to run!

@WarrenWeckesser You need to merge main to this PR. Since you branched there has been a few changes which require that.

[skip azp] [skip actions]

tupui · 2022-03-22T09:57:38Z

These are unresolved items where conversations have been started here but would benefit from being split out as separate follow-up issues or PRs:

Use of backticks around variable names: reconciling the NumPy docstring guidelines with (1) desired
appearance of the rendered web pages, and (2) occasional incorrect links when single backticks are used.

We already are enforcing something. We should describe the current state at least. It does not prevent us for discussing future changes.

Using (and seeding) random numbers in docstrings and in unit tests. (@tupui has provided a starting point for these

I would not delay this. A lot of PR are wrong with respect to this and I always struggle to point out official doc for it.

rgommers

Everything that's here LGTM. +1 for merging as is, and leaving other topics for follow-ups.

[skip azp] [skip actions]

mdhaber

The rest LGTM, if we're holding other discussions for followups.

doc/source/dev/missing-bits.rst

There is ongoing discussion in scipygh-13049. We can add the appropriate guideline once that issue is officially resolved. Co-authored-by: Matt Haberland <mhaberla@calpoly.edu>

WarrenWeckesser · 2022-03-22T23:58:29Z

I forgot to add the skip notation in the last commit. The failed test is the Azure timeout that has been happening lately, and is not related to this PR.

ilayn · 2022-03-24T15:57:16Z

This is already a very nice set of guidelines and we can fix the rough edges later. Already approved by maintainers and +1 from me too. In it goes thanks @WarrenWeckesser and all reviewers

ilayn · 2022-03-24T15:59:31Z

@melissawm @tylerjereddy there is already a review request for you but I think working on separate PRs would be better to avoid the review bloat on this, hence the merge.

WarrenWeckesser requested a review from larsoner as a code owner April 28, 2021 21:57

WarrenWeckesser marked this pull request as draft April 28, 2021 22:10

tylerjereddy added the Documentation Issues related to the SciPy documentation. Also check https://github.com/scipy/scipy.org label Apr 29, 2021

tylerjereddy reviewed Apr 29, 2021

View reviewed changes

doc/source/dev/missing-bits.rst Show resolved Hide resolved

doc/source/dev/missing-bits.rst Outdated Show resolved Hide resolved

doc/source/dev/missing-bits.rst Outdated Show resolved Hide resolved

doc/source/dev/missing-bits.rst Outdated Show resolved Hide resolved

rgommers reviewed Apr 29, 2021

View reviewed changes

doc/source/dev/missing-bits.rst Outdated Show resolved Hide resolved

ev-br reviewed Apr 29, 2021

View reviewed changes

doc/source/dev/missing-bits.rst Show resolved Hide resolved

ev-br reviewed Apr 29, 2021

View reviewed changes

doc/source/dev/missing-bits.rst Outdated Show resolved Hide resolved

ev-br reviewed Apr 29, 2021

View reviewed changes

doc/source/dev/missing-bits.rst Outdated Show resolved Hide resolved

tupui reviewed Apr 29, 2021

View reviewed changes

doc/source/dev/missing-bits.rst Show resolved Hide resolved

doc/source/dev/missing-bits.rst Show resolved Hide resolved

doc/source/dev/missing-bits.rst Show resolved Hide resolved

doc/source/dev/missing-bits.rst Show resolved Hide resolved

tupui reviewed Apr 29, 2021

View reviewed changes

tupui mentioned this pull request Apr 30, 2021

ENH: Add Boschloo exact test to stats #13951

Merged

9 tasks

WarrenWeckesser mentioned this pull request May 1, 2021

ENH: stats: add bootstrap for estimating confidence interval and standard error of an n-sample statistic #13371

Merged

11 tasks

tupui mentioned this pull request May 2, 2021

ENH: cluster: add an optional argument seed for kmeans and kmeans2 to set random generator and random state. #13972

Merged

mdhaber mentioned this pull request May 3, 2021

ENH: stats: Studentized Range Distribution #13732

Merged

treverhines mentioned this pull request May 23, 2021

ENH: interpolate: add RBFInterpolator #13595

Merged

tupui mentioned this pull request May 27, 2021

DOC: Add better error message for unpacking issue #14142

Merged

tupui reviewed May 27, 2021

View reviewed changes

WarrenWeckesser added 4 commits June 6, 2021 17:14

DOC: SciPy extensions for code style and docstring guidelines.

98a4613

[skip travis] [skip actions] [skip azp]

Remove rule about where the space goes when a long string is split.

3ff0804

Remove rule about last line of docstring being a blank line.

8780082

Add links to the NumPy Testing Guidelines.

f0be9b3

[skip actions] [skip travis] [skip azp]

WarrenWeckesser force-pushed the guide-missing-bits branch from cd0a6fc to f0be9b3 Compare June 6, 2021 22:01

tupui mentioned this pull request Sep 22, 2021

API: keywords only arguments #14714

Closed

WarrenWeckesser mentioned this pull request Nov 30, 2021

ENH: Add Chirp Z-transform, zoom FFT #4607

Merged

59 tasks

tupui added this to the 1.9.0 milestone Mar 17, 2022

mdhaber mentioned this pull request Mar 20, 2022

DEP: add actual DeprecationWarning for sym_pos-keyword of scipy.linalg.solve #15821

Merged

WarrenWeckesser and others added 7 commits March 22, 2022 02:29

DOC: Update link to numpy's testing guidelines.

4eac26b

Co-authored-by: Melissa Weber Mendonça <melissawm@gmail.com>

DOC: Update link to numpy's testing guidelines.

a674d86

Co-authored-by: Melissa Weber Mendonça <melissawm@gmail.com>

Remove section about the use of LaTeX.

9b9b3b0

If necessary, this can be hashed out in a follow-up PR.

Remove section about the recommended Python name to use for 'lambda'.

134232f

If necessary, this can be hashed out in a follow-up PR.

In the 'must, not should' examples, use backticks around x.

22ddb43

Add recommendation to use DOIs in references.

43c5d6c

Decide that 'versionadded' for new parameters goes in the Parameters …

518be54

…section.

WarrenWeckesser marked this pull request as ready for review March 22, 2022 09:22

Trivial whitespace fix.

eefe045

[skip azp] [skip actions]

Merge branch 'main' into guide-missing-bits

a484c69

[skip azp] [skip actions]

rgommers approved these changes Mar 22, 2022

View reviewed changes

Fix Sphinx markup so the sample reference is a literal block.

1794b92

[skip azp] [skip actions]

mdhaber requested changes Mar 22, 2022

View reviewed changes

doc/source/dev/missing-bits.rst Outdated Show resolved Hide resolved

Remove the guidelines about the use of 'import numpy as np' in examples.

a2a2aeb

There is ongoing discussion in scipygh-13049. We can add the appropriate guideline once that issue is officially resolved. Co-authored-by: Matt Haberland <mhaberla@calpoly.edu>

mdhaber approved these changes Mar 22, 2022

View reviewed changes

mdhaber requested review from melissawm and tylerjereddy March 22, 2022 23:53

ilayn merged commit 1045161 into scipy:main Mar 24, 2022

WarrenWeckesser deleted the guide-missing-bits branch March 25, 2022 02:09

WarrenWeckesser mentioned this pull request Jun 4, 2022

DOC: add examples to signal.medfilt2d #16356

Merged

WarrenWeckesser mentioned this pull request Sep 23, 2022

DOC: Fix formatting in svds docstrings #17081

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DOC: SciPy extensions for code style and docstring guidelines. #13955

DOC: SciPy extensions for code style and docstring guidelines. #13955

WarrenWeckesser commented Apr 28, 2021 •

edited

rgommers left a comment

tupui commented Apr 29, 2021 •

edited

tupui left a comment

tupui Apr 29, 2021

WarrenWeckesser Mar 22, 2022

tupui Apr 29, 2021 •

edited

WarrenWeckesser Jun 6, 2021

tupui Jun 7, 2021

tupui Mar 22, 2022

mdhaber Mar 22, 2022 •

edited

tupui Mar 23, 2022

ev-br commented May 1, 2021 •

edited by WarrenWeckesser

tupui commented May 2, 2021

tupui commented May 10, 2021

tupui left a comment

tupui May 27, 2021

WarrenWeckesser commented Jun 6, 2021 •

edited

WarrenWeckesser commented Jun 7, 2021 •

edited

WarrenWeckesser commented Mar 22, 2022 •

edited

ilayn commented Mar 22, 2022 •

edited

tupui commented Mar 22, 2022

tupui commented Mar 22, 2022

rgommers left a comment

mdhaber left a comment

WarrenWeckesser commented Mar 22, 2022

ilayn commented Mar 24, 2022

ilayn commented Mar 24, 2022

		statement makes the import optional; this guideline says explicitly
		that the import statement must not be included.

+How to use Random Number Generator (RNG) and seed
+~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
+In case random numbers are needed, `np.random.Generator` API must be used.
+Use the helper function to validate the parameter `seed`, `random_state` [TBD: which is preferred?]::
+    from scipy._lib._util import check_random_state
+    def foo(seed=None):
+        rng = check_random_state(seed)
+        sample = rng.uniform(...)  # for instance
+If you want to write an example using random numbers in the documentation::
+    >>> rng = np.random.default_rng()
+    >>> sample = rng.uniform(...)  # for instance
+.. warning:: Do **not** specify the seed. This line will get overwritten and the same seed
+   will be used for conveniency for all examples. The value that you can use locally is:
+   `1638083107694713882823079058616272161`

DOC: SciPy extensions for code style and docstring guidelines. #13955

DOC: SciPy extensions for code style and docstring guidelines. #13955

Conversation

WarrenWeckesser commented Apr 28, 2021 • edited

rgommers left a comment

Choose a reason for hiding this comment

tupui commented Apr 29, 2021 • edited

tupui left a comment

Choose a reason for hiding this comment

tupui Apr 29, 2021

Choose a reason for hiding this comment

WarrenWeckesser Mar 22, 2022

Choose a reason for hiding this comment

tupui Apr 29, 2021 • edited

Choose a reason for hiding this comment

WarrenWeckesser Jun 6, 2021

Choose a reason for hiding this comment

tupui Jun 7, 2021

Choose a reason for hiding this comment

tupui Mar 22, 2022

Choose a reason for hiding this comment

mdhaber Mar 22, 2022 • edited

Choose a reason for hiding this comment

tupui Mar 23, 2022

Choose a reason for hiding this comment

ev-br commented May 1, 2021 • edited by WarrenWeckesser

tupui commented May 2, 2021

tupui commented May 10, 2021

tupui left a comment

Choose a reason for hiding this comment

tupui May 27, 2021

Choose a reason for hiding this comment

WarrenWeckesser commented Jun 6, 2021 • edited

WarrenWeckesser commented Jun 7, 2021 • edited

WarrenWeckesser commented Mar 22, 2022 • edited

ilayn commented Mar 22, 2022 • edited

tupui commented Mar 22, 2022

tupui commented Mar 22, 2022

rgommers left a comment

Choose a reason for hiding this comment

mdhaber left a comment

Choose a reason for hiding this comment

WarrenWeckesser commented Mar 22, 2022

ilayn commented Mar 24, 2022

ilayn commented Mar 24, 2022

WarrenWeckesser commented Apr 28, 2021 •

edited

tupui commented Apr 29, 2021 •

edited

tupui Apr 29, 2021 •

edited

mdhaber Mar 22, 2022 •

edited

ev-br commented May 1, 2021 •

edited by WarrenWeckesser

WarrenWeckesser commented Jun 6, 2021 •

edited

WarrenWeckesser commented Jun 7, 2021 •

edited

WarrenWeckesser commented Mar 22, 2022 •

edited

ilayn commented Mar 22, 2022 •

edited