DOC: Updating capitalization of doc/source/development #33121

cleconte987 · 2020-03-29T15:50:25Z

Regarding issue #32550. Changes to documentation folder doc/source/development to capitalise title strings and keep keyword exceptions as is

…es_to_pandas_remote

….sh L 330

…nt words with multiple capital letters

…t_modification Merging new changes from pandas-dev/pandas repository into script_modification to be up-to-date

Merge remote-tracking branch 'upstream_main_pandas/master' into script_modification

… add elements to exceptions's set in scripts/validate_rst_title_capitalization.py

Merge branch 'script_modification' into changes_to_pandas_remote

…t_modification

…' into changes_to_pandas_remote

Merge remote-tracking branch 'upstream_main_pandas/master' into changes_to_pandas_remote

…es_to_pandas_remote

…es_to_documentation

ShaharNaveh · 2020-03-29T16:39:29Z

doc/source/development/code_style.rst

@@ -18,7 +18,7 @@ consistent code format throughout the project. For details see the
 Patterns
 ========

-foo.__class__
+Foo.class


Actually this is on purpose, does validate_rst_title_capitalization.py showing this as an error?

It is showing this as an error, because foo is not inside exceptions.
But moreover, it is removing underscores from the title here:
with open(rst_file, "r") as fd: previous_line = "" for i, line in enumerate(fd): line = line[:-1] line_chars = set(line) if ( len(line_chars) == 1 and line_chars.pop() in symbols and len(line) == len(previous_line) ): **yield re.sub(r"[{backtick}\*_]", "", previous_line)**, i previous_line = line

@cleconte987 can you change the title to Using foo.__class__ instead? And below same for Using f-strings.

This will make the validation pass, and things look probably even better, without adding extra complexity.

Thanks!

Roger that!

ShaharNaveh · 2020-03-29T16:40:47Z

doc/source/development/extending.rst

@@ -95,7 +95,7 @@ on :ref:`ecosystem.extensions`.

 The interface consists of two classes.

-:class:`~pandas.api.extensions.ExtensionDtype`
+Class:~pandas.API.extensions.ExtensionDtype


:class: is something that we use for sphinx, I don't think that this is something we want

datapythonista · 2020-03-29T19:28:13Z

Thanks @cleconte987. Most of this is duplicated of #32944, we're going to merge that one, so you may want to revert those files. Also, try to coordinate with @themien who is also working on this.

For the titles that are like :class:... we can't apply your changes, that is a special encoding to automatically create links. What we'll have to do is to update the script to skip titles that start with :.

…es_to_documentation

cleconte987 · 2020-04-01T12:36:06Z

Thanks @cleconte987. Most of this is duplicated of #32944, we're going to merge that one, so you may want to revert those files. Also, try to coordinate with @themien who is also working on this.

For the titles that are like :class:... we can't apply your changes, that is a special encoding to automatically create links. What we'll have to do is to update the script to skip titles that start with :.

Ok I will follow what @themien is doing.
I modify the script as you said
And I update PR soon

…r doc/source/development Updating script validate_rst_title_capitalization.py to avoid taking into account titles beginning with ':'

…ITALIZATION_EXCEPTIONS' that are present in title with special encoding, as it is not necessary anymore

ShaharNaveh · 2020-04-01T14:24:53Z

scripts/validate_rst_title_capitalization.py

+            if ":class" in title:
+                print(title)


I think this should be:

if ":class:" in title: continue

Horrible... I forgot to remove it. Correcting it right away!

Hmm Im not so much used to git yet.. Apart from that mistake, I didn't roll back previous changes that I made, I thought it will be ok. I don't know if the changes i didn't remove will fail

Or if it conflicts with previous PR or something

…apitalization.py

datapythonista · 2020-04-01T15:42:20Z

scripts/validate_rst_title_capitalization.py

@@ -121,6 +175,7 @@ def find_titles(rst_file: str) -> Generator[Tuple[str, int], None, None]:
                len(line_chars) == 1
                and line_chars.pop() in symbols
                and len(line) == len(previous_line)
+                and previous_line[0] != ":"


Sorry, I think I suggested this approach myself, but I see a problem now. This will skip the whole line from validation, and I see we use a mix of these :class: directives and regular text in some titles.

Instead of skipping the line here, I think we should update the condition in line 139. So, we check that word is not one of the exceptions, and also, it doesn't start with :. We could use a regex probably, but let's start simple, and see if it's enough.

What do you think?

The problem is that:

First of all this part of code line 180

yield re.sub(r"[`*_]", "", previous_line), i

removes "`" from the title. So when function checks if title is correct, it will already be modified. So this is the main problem.

Secondly, line 129:

correct_title: str = re.sub(r"^\W*", "", title).capitalize()

removes all non word characters at the beginning from the desired correct title that we want to keep, so ":" is not kept as being part of the correct title

Thirdly, this line, number 136

word_list = re.split(r"\W", removed_https_title)

removes all non word characters from the list to analyse. So ":" don't appear anymore.

By the way, code in line 180 causes problems because it also removes "_" which is present in documentation like > code_style.rst line 21

Using foo.__class__

So what I suggest is to modify in script "validate_rst_title_capitalization.py" from line 180 on:

NB: I replace "`" by {backtick} as it causes trouble to display it as code

yield re.sub(r"[{backtick}\*_]", "", previous_line), i

to:

if previous_line[0] != ":": yield re.sub(r"[{backtick}\*_]", "", previous_line), i else: yield re.sub(r"[\*_]", "", previous_line), i

in order to keep the "`" if title begins by a ":"
And to add in "validate_rst_title_capitalization.py" line 126:

if title[0] == ":": return title

So that if title begins by ":" it considers title to be valid no matter what.

Or to simply add in "validate_rst_title_capitalization.py" line 126:

if title[0] == ":": return title

for the same reason.

Note that there is still a problem with:

yield re.sub(r"[`*_]", "", previous_line), i

That doesn't keep "_" in the titles

datapythonista

Ok, let's fix it this way then, and we can find a better solution in the future.

Thanks for the work on this @cleconte987

simonjayhawkins · 2020-04-02T12:00:01Z

@cleconte987 can you merge upstream master into your branch to resolve merge conflicts. see https://pandas.pydata.org/docs/development/contributing.html#updating-your-pull-request

…t tracked anymore? Add it to repository

…nge the way code excludes specific title syntax from being seen as an error

pep8speaks · 2020-04-02T15:47:11Z

Hello @cleconte987! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:

There are currently no PEP 8 issues detected in this Pull Request. Cheers! 🍻

Comment last updated at 2020-04-02 21:13:35 UTC

…es_to_documentation

datapythonista · 2020-04-03T00:12:05Z

ci/code_checks.sh

@@ -327,7 +327,7 @@ if [[ -z "$CHECK" || "$CHECK" == "docstrings" ]]; then
    RET=$(($RET + $?)) ; echo $MSG "DONE"

    MSG='Validate correct capitalization among titles in documentation' ; echo $MSG
-    $BASE_DIR/scripts/validate_rst_title_capitalization.py $BASE_DIR/doc/source/development/contributing.rst
+    $BASE_DIR/scripts/validate_rst_title_capitalization.py $BASE_DIR/doc/source/development/contributing.rst $BASE_DIR/doc/source/reference


If everything in development/ is fixed, can you change this in a new PR, so the whole directory is validated in the CI? Thanks!

Suggested change

$BASE_DIR/scripts/validate_rst_title_capitalization.py $BASE_DIR/doc/source/development/contributing.rst $BASE_DIR/doc/source/reference

$BASE_DIR/scripts/validate_rst_title_capitalization.py $BASE_DIR/doc/source/development $BASE_DIR/doc/source/reference

datapythonista · 2020-04-03T00:12:50Z

Thanks for working on this @cleconte987

cleconte987 · 2020-04-15T16:23:11Z

You're welcome!

cleconte987 added 26 commits March 19, 2020 10:51

Updating capitalization in folder doc/source/reference

194d5a7

Merge remote-tracking branch 'upstream_main_pandas/master' into chang…

3d1960c

…es_to_pandas_remote

Add folder doc/source/reference in ci code checking in ci/code_checks…

9da1e8a

….sh L 330

Modification of validate_rst_title_capitalization.py script

55587f7

Modify validate_rst_title_capitalization.py script to take into accou…

048fb78

…nt words with multiple capital letters

remove stylistic errors in comment section L:77,78,79

d1ae9ea

Modify validate_rst_title_capitalization.py script

83280b7

Merge remote-tracking branch 'upstream_main_pandas/master' into scrip…

4314d3c

…t_modification Merging new changes from pandas-dev/pandas repository into script_modification to be up-to-date

modify script following tests

937b5df

Modify second time following style checking

589b0fe

Update branch with pandas repository commits

f09689c

Merge remote-tracking branch 'upstream_main_pandas/master' into script_modification

Commit to update capitalization in folder doc/source/reference and to…

e71cffd

… add elements to exceptions's set in scripts/validate_rst_title_capitalization.py

Merge branches to update PR

dc839c5

Merge branch 'script_modification' into changes_to_pandas_remote

Merge remote-tracking branch 'upstream_main_pandas/master' into scrip…

f354797

…t_modification

Merge remote-tracking branch 'temporary_repo/changes_to_pandas_remote…

62d5164

…' into changes_to_pandas_remote

Remove wrong lines of code

f8b496c

Formatting

5d01189

Update branch

1685076

Merge remote-tracking branch 'upstream_main_pandas/master' into changes_to_pandas_remote

Remove exception "GroupBy" and lower "GroupBy" in doc

06dc1cd

modifying for PR

a11dc2b

Merge remote-tracking branch 'upstream_main_pandas/master' into chang…

e55064d

…es_to_pandas_remote

Update "groupby" to "GroupBy"

fc1009d

Merge remote-tracking branch 'upstream_main_pandas/master' into chang…

eba0d13

…es_to_pandas_remote

Merge remote-tracking branch 'upstream_main_pandas/master' into chang…

726bd4c

…es_to_documentation

Commit changes to folder doc/source/development

0beeda1

Add keywords to CAPITALIZATION_EXCEPTIONS

f48b2dd

ShaharNaveh reviewed Mar 29, 2020

View reviewed changes

ShaharNaveh added the Docs label Mar 29, 2020

datapythonista changed the title ~~Changes to documentation folder doc/source/development || issue #32550~~ DOC: Updating capitalization of doc/source/development Mar 29, 2020

Merge remote-tracking branch 'upstream_main_pandas/master' into chang…

5ba464c

…es_to_documentation

cleconte987 added 3 commits April 1, 2020 15:08

Commit and updating documentation syntax and capitalization for folde…

7c09610

…r doc/source/development Updating script validate_rst_title_capitalization.py to avoid taking into account titles beginning with ':'

Remove the 'ExtensionDtype' and 'ExtensionArray' exceptions from 'CAP…

940109d

…ITALIZATION_EXCEPTIONS' that are present in title with special encoding, as it is not necessary anymore

Style formatting

35b1a37

ShaharNaveh reviewed Apr 1, 2020

View reviewed changes

Correct error by removing wrong lines of code in validate_rst_title_c…

5f0512e

…apitalization.py

datapythonista reviewed Apr 1, 2020

View reviewed changes

datapythonista approved these changes Apr 1, 2020

View reviewed changes

cleconte987 added 3 commits April 2, 2020 13:26

Weird behaviour file 'pandas/tests/frame/indexing/test_setitem.py' no…

7767d3c

…t tracked anymore? Add it to repository

merging remote pandas-dev/pandas/master

888aa10

Commit modifying script 'validate_rst_title_capitalization.py' to cha…

5fc4dcc

…nge the way code excludes specific title syntax from being seen as an error

cleconte987 added 2 commits April 2, 2020 16:52

Correct style errors

0ad8143

Merge remote-tracking branch 'upstream_main_pandas/master' into chang…

3ea43a5

…es_to_documentation

datapythonista reviewed Apr 3, 2020

View reviewed changes

datapythonista merged commit e17467e into pandas-dev:master Apr 3, 2020

simonjayhawkins added this to the 1.1 milestone Apr 17, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DOC: Updating capitalization of doc/source/development #33121

DOC: Updating capitalization of doc/source/development #33121

cleconte987 commented Mar 29, 2020

ShaharNaveh Mar 29, 2020

cleconte987 Mar 30, 2020

datapythonista Mar 30, 2020

cleconte987 Apr 1, 2020

ShaharNaveh Mar 29, 2020

datapythonista commented Mar 29, 2020

cleconte987 commented Apr 1, 2020 •

edited

ShaharNaveh Apr 1, 2020 •

edited

cleconte987 Apr 1, 2020

cleconte987 Apr 1, 2020 •

edited

cleconte987 Apr 1, 2020

datapythonista Apr 1, 2020

cleconte987 Apr 1, 2020

datapythonista left a comment

simonjayhawkins commented Apr 2, 2020

pep8speaks commented Apr 2, 2020 •

edited

datapythonista Apr 3, 2020

cleconte987 Apr 3, 2020

datapythonista commented Apr 3, 2020

cleconte987 commented Apr 15, 2020

	$BASE_DIR/scripts/validate_rst_title_capitalization.py $BASE_DIR/doc/source/development/contributing.rst $BASE_DIR/doc/source/reference
	$BASE_DIR/scripts/validate_rst_title_capitalization.py $BASE_DIR/doc/source/development $BASE_DIR/doc/source/reference

DOC: Updating capitalization of doc/source/development #33121

DOC: Updating capitalization of doc/source/development #33121

Conversation

cleconte987 commented Mar 29, 2020

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

datapythonista commented Mar 29, 2020

cleconte987 commented Apr 1, 2020 • edited

ShaharNaveh Apr 1, 2020 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

cleconte987 Apr 1, 2020 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

datapythonista left a comment

Choose a reason for hiding this comment

simonjayhawkins commented Apr 2, 2020

pep8speaks commented Apr 2, 2020 • edited

Comment last updated at 2020-04-02 21:13:35 UTC

Choose a reason for hiding this comment

Choose a reason for hiding this comment

datapythonista commented Apr 3, 2020

cleconte987 commented Apr 15, 2020

cleconte987 commented Apr 1, 2020 •

edited

ShaharNaveh Apr 1, 2020 •

edited

cleconte987 Apr 1, 2020 •

edited

pep8speaks commented Apr 2, 2020 •

edited