DOC: prepare 2.3.1 whatsnew notes for release #61794

Merged

Conversation

jorisvandenbossche
Member

Prepping for doing a 2.3.1 release today, xref #61590

@simonjayhawkins
Member

@jorisvandenbossche for the "Comparisons between different string dtypes" is there an issue ref?

I don't get why:

> When pd.StringDtype("pyarrow", na_value=pd.NA) is compared against any other string dtype, the result will always be boolean[pyarrow].

since when did we start mixing the pandas nullable dtypes with the ArrowDtypes?

If this is now policy, when do the ArrowDtypes return the ArrowDtype version of the string array and not the new default string array (pd.NA variant)?
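
For readers following along, here is a minimal sketch of the comparison that the quoted whatsnew sentence describes. It assumes pandas 2.3.1 or later with pyarrow installed; the helper `comparison_result_dtype` and the sample values are hypothetical names chosen for illustration, not part of pandas or of this PR. Per the quoted text, the resulting dtype should be the pyarrow-backed boolean rather than the nullable `boolean` dtype.

```python
import pandas as pd

# pyarrow is an optional dependency of pandas, so skip the demo when absent.
try:
    import pyarrow  # noqa: F401
    HAVE_PYARROW = True
except ImportError:
    HAVE_PYARROW = False


def comparison_result_dtype(left_dtype, right_dtype):
    """Compare two small string Series and return the dtype of the result.

    Hypothetical helper for illustration only.
    """
    left = pd.Series(["a", "b", None], dtype=left_dtype)
    right = pd.Series(["a", "x", None], dtype=right_dtype)
    return (left == right).dtype


if HAVE_PYARROW:
    # NA-variant string dtypes with different storage backends
    # (the na_value keyword is assumed to be available, i.e. pandas 2.3+).
    na_pyarrow = pd.StringDtype("pyarrow", na_value=pd.NA)
    na_python = pd.StringDtype("python", na_value=pd.NA)
    result_dtype = comparison_result_dtype(na_pyarrow, na_python)
    print(result_dtype)
else:
    result_dtype = None
    print("pyarrow is not installed; skipping the comparison")
```

Swapping the two arguments is a quick way to check that the result does not depend on operand order, which is what "always" in the quoted sentence implies.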

@simonjayhawkins
Member

in "Index set operations ignore empty RangeIndex and object dtype Index"

the code block uses

pd.options.mode.infer_string = True

this should be

pd.options.future.infer_string = True

@jorisvandenbossche
Member Author

> for the "Comparisons between different string dtypes" is there an issue ref?

#61138 is the PR, #60639 the issue. Will add a link

> since when did we start mixing the pandas nullable dtypes with the ArrowDtypes?

Unfortunately, for some time now (and it is a change that I don't really agree with either). I was confused about it again when finalizing that PR (see #61138 (comment)). But it has been like this for several releases, so it is not something to change here in pandas 2.3 (if we want to change it, that is something for 3.0, I think).
I know there was some discussion about this in the past; I am looking it up.

@simonjayhawkins
Member

Well, I need to do some more research to be sure, but I'm not happy on two fronts: the change itself, and the fact that you were "required" to do PDEP-14 and maintain backwards compat with the "experimental" StringDtype because it had been available for so long. So it appears that others have changed the API without any deprecation or warning. Hopefully this will be clarified in the roadmap discussion.

@simonjayhawkins
Member

> > for the "Comparisons between different string dtypes" is there an issue ref?
>
> #61138 is the PR, #60639 the issue. Will add a link

Great.

Let's just do that for now. No need to block on the rest of my comment.

@jorisvandenbossche
Member Author

> and the fact that you were "required" to do PDEP-14

Just to clarify here: this behaviour predates PDEP-14, and it applies only to the NA-variant of the dtype, not to the future-default NaN-variant (so that's another reason that resolving this specific item is not a priority for 2.3).

@jorisvandenbossche
Member Author

Putting it here now just because I looked it up (it is otherwise unrelated to the content of this PR): the change to return bool[pyarrow] instead of BooleanDtype was made in 2.0 in #51643, triggered by a similar change for value_counts returning int64[pyarrow] instead of Int64 (#51542). This came up again at #59330 (comment), and then Will created #59346 to discuss it (but we haven't actually discussed it further).

@jorisvandenbossche
Member Author

/preview

Contributor

github-actions bot commented Jul 7, 2025

Website preview of this PR available at: https://pandas.pydata.org/preview/pandas-dev/pandas/61794/

@jorisvandenbossche jorisvandenbossche added the Build (Library building on various platforms) and Docs labels Jul 7, 2025
@jorisvandenbossche jorisvandenbossche added this to the 2.3.1 milestone Jul 7, 2025
@jorisvandenbossche jorisvandenbossche removed the Build (Library building on various platforms) label Jul 7, 2025
@jorisvandenbossche jorisvandenbossche merged commit ff8a607 into pandas-dev:main Jul 7, 2025
8 checks passed
meeseeksmachine pushed a commit to meeseeksmachine/pandas that referenced this pull request Jul 7, 2025
jorisvandenbossche added a commit that referenced this pull request Jul 7, 2025
… for release) (#61803)

Backport PR #61794: DOC: prepare 2.3.1 whatsnew notes for release

Co-authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>