ENH: add additional extras_require sections for optional dependencies #39164

Closed
person142 opened this issue Jan 14, 2021 · 20 comments · Fixed by #47336
Labels
Build Library building on various platforms Enhancement

Comments

@person142

Is your feature request related to a problem?

It would be nice to have more extras_require sections in the setup.py for optional dependencies.

Describe the solution you'd like

I'd like to see more sections added here:

https://github.com/pandas-dev/pandas/blob/master/setup.py#L743

for optional dependencies; e.g. pandas[s3] for Pandas + everything you need to write to S3 (just for the sake of example; this is probably not the right division). I found a similar request here: #35206. (Sorry if a previous issue exists and I missed it.)

API breaking implications

None

Describe alternatives you've considered

Pin all desired optional dependencies.

Additional context

The idea is that if you're using a tool like pip-compile, then currently you have to pin all of these requirements, even if they are only used transitively by Pandas. It would be nice to do say pandas[s3] in your requirements.in and have s3fs get added to the compiled requirements.

Happy to make a PR if folks don't object to the idea.
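To make the request concrete, here is a minimal sketch of how a requirement such as pandas[s3] could resolve to extra packages. The `EXTRAS_REQUIRE` table and the `expand_requirement` helper are hypothetical names used only for illustration; they are not pandas' actual packaging metadata, and a real resolver such as pip does much more than this.

```python
import re

# Hypothetical extras table: the group name and its contents are illustrative only.
EXTRAS_REQUIRE = {
    "s3": ["s3fs"],
}


def expand_requirement(requirement, extras_require):
    """Return the extra dependencies implied by a requirement like 'pandas[s3]'."""
    match = re.fullmatch(
        r"\s*([A-Za-z0-9_.-]+)\s*(?:\[([^\]]*)\])?\s*", requirement
    )
    if match is None:
        raise ValueError(f"cannot parse requirement: {requirement!r}")
    # Split the bracketed part on commas; a bare 'pandas' yields no extras.
    extras = [e.strip() for e in (match.group(2) or "").split(",") if e.strip()]
    deps = []
    for extra in extras:
        deps.extend(extras_require[extra])
    return deps


print(expand_requirement("pandas[s3]", EXTRAS_REQUIRE))
```

With a table like this, a single pandas[s3] line in requirements.in would be enough for a compile step to pull in s3fs without the user pinning it by hand.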

@person142 person142 added Enhancement Needs Triage Issue that has not been reviewed by a pandas team member labels Jan 14, 2021
@simonjayhawkins simonjayhawkins added Build Library building on various platforms and removed Needs Triage Issue that has not been reviewed by a pandas team member labels Jan 17, 2021
@JMBurley
Contributor

Idea is still relevant, but since initial request pandas has refactored to more modern dependency mgmt and the necessary extras_require is held in setup.cfg: https://github.com/pandas-dev/pandas/blob/master/setup.cfg#L48.

I'll fix to insert current recommended dependencies

@JMBurley
Contributor

@mroeschke PR #47336 resolves some of the issues discussed here.

It would also be trivial to repeat my code to provide functional-level extras like pandas s3, gcp (etc...) if package ranges are agreed upon. I can do that over the next few weeks.

(Well, pandas s3 uses s3fs, which brings a hideous set of dependency problems: s3fs>=0.5.0 depends on aiobotocore, which is typically incompatible with any other package that uses boto3. That's not for pandas to resolve, but I'd want to check with pandas leadership whether s3fs>=0.5 is needed for any reason, and I'd strongly suggest pinning s3fs~=0.4 until that dependency mess is resolved. Summary here)

@mroeschke
Member

Thanks for initiating this! I think there may need to be a broader discussion with the core team before we make any PR on organizing the optional dependencies. I think the broad steps will be:

  1. Decide on the correct logical grouping of optional dependencies
  2. Establish the groupings in 1 PR.

Part of 1) is also revisiting pandas optional dependencies. pandas has like 25+ optional dependencies and there has been discussion if we could also spin off some dependencies elsewhere (#45433).

A good first step, as you mention, would be to propose dependency groupings and gain some feedback.

@JMBurley
Contributor

JMBurley commented Jun 14, 2022

I agree with the overall path to solving all extras_require options, and that that would be a nice improvement to pandas.

However, I do want to highlight that this will be quite a large discussion that will inevitably reveal a series of packaging problems, so this issue is likely to drag on and take a while to resolve.


s3fs example

s3fs is required for S3 interaction, but because s3fs>=0.5.0 requires aiobotocore, it is mutually incompatible with moto[boto3] (some unresolved issues explain the aiobotocore/botocore problem).

It isn't actively documented in pandas, but right now the entire stability of pandas is pinned to two s3fs releases from March 2020: pandas needs moto for tests, moto cannot coexist with s3fs>=0.5.0, and pandas requires s3fs>=0.4.0.

But pandas has non-professional users who don't care about writing tests (i.e. don't care about moto) and who therefore might prefer pandas to allow a more recent version of s3fs.

So what versions of s3fs would we pin pandas to?
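As a rough illustration of why the window is so narrow, the sketch below intersects the two constraints, read here as s3fs>=0.4.0 (pandas) and s3fs<0.5.0 (moto). The candidate version list is illustrative rather than a verified list of s3fs releases, and the helpers are hypothetical; real tooling would use a proper version parser.

```python
def parse_version(text):
    """Turn '0.4.2' into (0, 4, 2) so versions compare numerically."""
    return tuple(int(part) for part in text.split("."))


def allowed(version, lower_inclusive, upper_exclusive):
    """True if lower_inclusive <= version < upper_exclusive."""
    parsed = parse_version(version)
    return parse_version(lower_inclusive) <= parsed < parse_version(upper_exclusive)


# Illustrative candidates only; not a verified list of s3fs releases.
CANDIDATES = ["0.3.5", "0.4.0", "0.4.2", "0.5.0", "0.5.1"]

# pandas floor (>=0.4.0) combined with the moto-imposed ceiling (<0.5.0).
usable = [v for v in CANDIDATES if allowed(v, "0.4.0", "0.5.0")]
print(usable)
```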


I'm diving deep into that one s3fs issue to flag that solving packaging in full is not going to be quick, and that the fundamental work to understand packaging and versioning is not complete (e.g. environment.yml has "required indirectly may be?" as a comment, and many dependencies have no stated versions, although other requirements/setup files and documentation may suggest versions). There will be multiple issues like s3fs.

I propose incremental improvement. Allowing the recommended installs to be managed dependencies is an easy improvement to pandas packaging + dependencies that we can make right now that will save developers some serious headaches. How we solve that will not change in future and it is independent of the optional dependencies that extend functionality. Perhaps we can bundle that under an optional extra performance or recommended?

We can then work towards the larger, laudable goal of allowing functionality groupings (eg. s3, gcp, excel, parquet, sql, html, plots, dask) as extras.

Thoughts?

Groupings

(Based off current functionality enhancements in setup & environment.yml)

| extras_require name | Enforces |
| --- | --- |
| recommended | numexpr>=2.7.1; bottleneck>=1.3.1 |
| s3 | TBC |
| gcp | TBC |
| excel | TBC |
| parquet | TBC |
| sql | TBC |
| html | TBC |
| plots | TBC |
| dask | TBC |
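For reference, a minimal sketch of how the `recommended` row might be written under `[options.extras_require]` in setup.cfg, and how that section reads back as a mapping. The file content is an illustration built from the row above, not the actual pandas setup.cfg, and `read_extras` is a hypothetical helper.

```python
import configparser

# Illustrative setup.cfg fragment: one extra listing its requirements, one per line.
SETUP_CFG = """
[options.extras_require]
recommended =
    numexpr>=2.7.1
    bottleneck>=1.3.1
"""


def read_extras(cfg_text):
    """Parse [options.extras_require] into {extra_name: [requirement, ...]}."""
    parser = configparser.ConfigParser()
    parser.read_string(cfg_text)
    section = parser["options.extras_require"]
    return {
        name: [line.strip() for line in value.splitlines() if line.strip()]
        for name, value in section.items()
    }


print(read_extras(SETUP_CFG))
```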

@bluss

bluss commented Jun 17, 2022

I would prefer that pandas not recommend bottleneck until there is a mitigation for #42878.

@JMBurley
Contributor

JMBurley commented Jul 7, 2022

@mroeschke pinging on the above. I would still advocate that the best path is to package what can be trivially packaged right now in order to improve pandas as-is, and then finalise the more difficult packaging decisions later.

Otherwise, I think the packaging below would be in line with current understanding.

Groupings

(Based off current functionality enhancements in setup & environment.yml, install_guide where available, or open-ended >=current_stable_version when no information available regarding pandas compatibility)

| extras_require name | Enforces |
| --- | --- |
| recommended | numexpr>=2.7.1; bottleneck>=1.3.1 |
| computation | SciPy>=1.4.1; numba>=0.50.1; xarray>=0.15.1 |
| s3 | s3fs>=0.4.0; boto3>=1.22.7 |
| gcp | gcsfs>=0.6.0; pandas-gbq>=0.14.0 |
| excel | xlrd>=2.0.1; xlwt>=1.3.0; xlsxwriter>=1.2.2; openpyxl>=3.0.3; pyxlsb>=1.0.6 |
| parquet | fastparquet>=0.4.0; pyarrow>=1.0.1 |
| feather | pyarrow>=1.0.1 |
| hdf5 | PyTables>=3.6.1; blosc>=1.20.1 |
| sql | SQLAlchemy>=1.4.0; psycopg2>=2.8.4; pymysql>=0.10.1 |
| html | BeautifulSoup4>=4.8.2; html5lib>=1.1; lxml>=4.5.0 |
| viz | matplotlib>=3.3.2; Jinja2>=2.11; tabulate>=0.8.7 |

Notes:

  • recommended: It seems that Numba should be in this list for performance boost on rolling operations?
  • s3: fsspec is explicit dependency of s3fs, pandas can ignore.
  • hdf5: zlib is not a pip package, so ignore it in setup. NB: the pandas install_guide should recommend >=1.1.4 due to security flaws in prior zlib versions.
  • clipboard: I am ignoring because OS-dependent packaging is outside of current scope, and such functionality very rarely relevant to dockerising production code (thus, low priority IMO)

I can incorporate all of the above in my PR to manage optional dependencies via [options.extras_require] if there is consensus that we should follow this path. Thoughts?

Note that there would be an open question on how to ensure that https://pandas.pydata.org/docs/getting_started/install.html is kept current with the actual packaging.
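One possible way to approach that open question is an automated check that compares the version floors in the extras with the documented minimums. The sketch below is only an illustration: `find_mismatches` is a hypothetical helper, the documented table is made-up sample data (the 3.0.7 value is deliberately invented to trigger a mismatch), and it only understands simple `name>=version` requirements.

```python
def find_mismatches(extras, documented_minimums):
    """Return (package, pinned, documented) where the two version floors disagree."""
    mismatches = []
    for requirements in extras.values():
        for requirement in requirements:
            # Only the simple 'name>=version' form is handled in this sketch.
            name, _, pinned = requirement.partition(">=")
            name, pinned = name.strip(), pinned.strip()
            documented = documented_minimums.get(name.lower())
            if documented is not None and documented != pinned:
                mismatches.append((name, pinned, documented))
    return mismatches


EXTRAS = {"excel": ["xlrd>=2.0.1", "openpyxl>=3.0.3"]}
# Hypothetical documented minimums; openpyxl is set differently on purpose.
DOCUMENTED = {"xlrd": "2.0.1", "openpyxl": "3.0.7"}

print(find_mismatches(EXTRAS, DOCUMENTED))
```

A check of this kind could run in CI so that the install page and the packaging metadata cannot silently drift apart.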

@mroeschke
Member

mroeschke commented Jul 7, 2022

Sorry for the delay @JMBurley.

On the surface these groupings seem reasonable. I would, however, be wary of making a final call here without more buy-in from the core team @pandas-dev/pandas-core

This type of enhancement may warrant going through the enhancement proposal process we are developing (#47444), as this would be a larger change for the library, so let's wait for those developments before moving forward here.

@bashtage
Contributor

bashtage commented Jul 8, 2022

Would it make sense to have a kitchen sink option, e.g., pip install pandas[all]?
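A kitchen-sink extra could be derived rather than maintained by hand. The sketch below builds `all` as the de-duplicated union of the other groups; the `EXTRAS` subset is taken from the groupings proposed earlier in this thread and `with_all` is a hypothetical helper, so treat it as an illustration rather than the eventual implementation.

```python
# Subset of the groupings proposed in this thread, for illustration only.
EXTRAS = {
    "recommended": ["numexpr>=2.7.1", "bottleneck>=1.3.1"],
    "parquet": ["fastparquet>=0.4.0", "pyarrow>=1.0.1"],
    "feather": ["pyarrow>=1.0.1"],
}


def with_all(extras):
    """Return a copy of extras with an 'all' group containing every requirement once."""
    merged = dict(extras)
    merged["all"] = sorted({req for reqs in extras.values() for req in reqs})
    return merged


print(with_all(EXTRAS)["all"])
```

Note that pyarrow appears in two groups but only once in `all`.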

@bashtage
Contributor

bashtage commented Jul 8, 2022

  • recommended: It seems that Numba should be in this list for performance boost on rolling operations?

I think numba can be a bit too fiddly on some OS/arch types to be "recommended".

@jreback
Contributor

jreback commented Jul 8, 2022

well numba is supported in the 3 main architectures so IMHO it's enough (it's still optional in any event)

@TomAugspurger
Contributor

A few notes:

  • I don't think parquet should include both fastparquet and pyarrow, users aren't likely to use both simultaneously. It should either be just arrow since it's the default or split into two extras
  • Likewise with sql: people probably aren't using both MySQL and PostgreSQL. Split these into a bunch of sql-* extras?
  • If you're adding an s3 and gcp you should also add an azure that depends on adlfs (and why stop there? fsspec has lots of other implementations)
  • I'm a bit surprised to see Jinja2 and tabulate under "viz". Viz makes me think .plot. I wouldn't necessarily consider .to_markdown() or .table "visualizations". So perhaps split those yet again into plot or plotting and something else for .table?

@JMBurley
Contributor

JMBurley commented Jul 8, 2022

Thanks everyone for comments.

Actions / discussion

[I think numba can be a bit too fiddly on some OS/arch types to be "recommended".] [Numba can be fiddly but supported on "3 main architectures"]

I'll leave Numba in Computation only for now. Also saves on rewriting the install.rst file with a detailed reason behind why it is in recommended.

  • I don't think parquet should include both fastparquet and pyarrow, users aren't likely to use both simultaneously. It should either be just arrow since it's the default or split into two extras

Good catch thanks. Happy to take just pyarrow as the default. This is probably also more convenient to most users as other major libraries are more likely to request pyarrow than fastparquet.

  • Likewise with sql: people probably aren't using both MySQL and PostgreSQL. Split these into a bunch of sql-* extras?

I'll keep a general sql that will ensure user can interact with any (common) SQL implementation, and also split out specific sql-* to facilitate tighter packaging.

  • If you're adding an s3 and gcp you should also add an azure that depends on adlfs

Good catch, currently there are no install notes for azure requirements in the install_guide so that might need an update as well. I'll add azure to [options.extras_require]. @TomAugspurger are there any other dependencies needed to make adlfs work (that it won't handle itself)? PyPi documentation looks very dask-focused.

Although adlfs is not imported anywhere in pandas so I'm not sure why/how the azure interaction works? Perhaps it just needs fsspec?

(and why stop there? fsspec has lots of other implementations)

The way I think about packaging is that pandas should be able to manage its dependencies for any meaningful function that an end-user might need. Therefore s3, gcp are options. Other capabilities of fsspec that are meaningful to the end-user can be added. That said, if fsspec underlies an overwhelming fraction of pandas file I/O then perhaps it should be a core dependency?

  • I'm a bit surprised to see Jinja2 and tabulate under "viz". Viz makes me think .plot. I wouldn't necessarily consider .to_markdown() or .table "visualizations". So perhaps split those yet again into plot or plotting and something else for .table?

Agreed. I followed the current install guide structure, will split into plot and table. I'm honestly not sure if Jinja2 is needed for any pandas functionality? (Jinja2 not imported anywhere in pandas).

Updated Groupings for [options.extras_require]

| extras_require name | Enforces |
| --- | --- |
| recommended | numexpr>=2.7.1; bottleneck>=1.3.1; numba>=0.50.1 |
| computation | SciPy>=1.4.1; numba>=0.50.1; xarray>=0.15.1 |
| s3 | s3fs>=0.4.0; boto3>=1.22.7 |
| gcp | gcsfs>=0.6.0; pandas-gbq>=0.14.0 |
| azure | adlfs>=0.6.0 (triage needed?) |
| excel | xlrd>=2.0.1; xlwt>=1.3.0; xlsxwriter>=1.2.2; openpyxl>=3.0.3; pyxlsb>=1.0.6 |
| parquet | pyarrow>=1.0.1 |
| feather | pyarrow>=1.0.1 |
| hdf5 | PyTables>=3.6.1; blosc>=1.20.1 |
| sql-postgresql | SQLAlchemy>=1.4.0; psycopg2>=2.8.4 |
| sql-mysql | SQLAlchemy>=1.4.0; pymysql>=0.10.1 |
| sql-other | SQLAlchemy>=1.4.0 |
| html | BeautifulSoup4>=4.8.2; html5lib>=1.1; lxml>=4.5.0 |
| plot | matplotlib>=3.3.2 |
| table | Jinja2>=2.11; tabulate>=0.8.7 |
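The sql-* rows share a common SQLAlchemy floor, so they could be generated from one base requirement plus a driver per database. This is a sketch under that assumption; `sql_extras` and the constant names are hypothetical, and the version floors are the ones proposed in the groupings above.

```python
# Shared base requirement and per-database drivers, as proposed in the groupings.
SQLALCHEMY = "SQLAlchemy>=1.4.0"
DRIVERS = {
    "postgresql": "psycopg2>=2.8.4",
    "mysql": "pymysql>=0.10.1",
}


def sql_extras(base, drivers):
    """Build one 'sql-<name>' extra per driver, plus a driverless 'sql-other'."""
    extras = {f"sql-{name}": [base, driver] for name, driver in drivers.items()}
    extras["sql-other"] = [base]
    return extras


for name, requirements in sql_extras(SQLALCHEMY, DRIVERS).items():
    print(name, requirements)
```

Keeping the base requirement in one place means a future SQLAlchemy bump only has to be made once.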

@TomAugspurger
Contributor

are there any other dependencies needed to make adlfs work (that it won't handle itself)? PyPi documentation looks very dask-focused.

Just adlfs.

Although adlfs is not imported anywhere in pandas so I'm not sure why/how the azure interaction works? Perhaps it just needs fsspec?

It's the same as s3fs / gcsfs. I don't think pandas imports those anywhere outside of tests.

It wouldn't be appropriate to add fsspec as a required dependency.
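Because these backends are selected by URL scheme rather than imported by pandas directly, an extra only needs to make the right package installable. The sketch below shows one way a caller could check for the backend before reading a remote path. The scheme-to-package mapping is an assumption made for illustration, and both helpers are hypothetical rather than part of pandas or fsspec.

```python
from importlib.util import find_spec
from urllib.parse import urlparse

# Assumed mapping from URL scheme to the package that provides the backend.
BACKEND_FOR_SCHEME = {
    "s3": "s3fs",
    "gcs": "gcsfs",
    "gs": "gcsfs",
    "abfs": "adlfs",
}


def backend_package(url):
    """Return the backend package a URL would need, or None for local paths."""
    return BACKEND_FOR_SCHEME.get(urlparse(url).scheme)


def is_installed(package):
    """True if the top-level package can be found without importing it."""
    return find_spec(package) is not None


needed = backend_package("s3://bucket/data.csv")
print(needed, "installed" if is_installed(needed) else "missing")
```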

@fangchenli
Member

Sorry, I'm late to the discussion. Before doing this, we had better update all the minimum supported versions (if needed).

@JMBurley
Contributor

JMBurley commented Jul 8, 2022

Before doing this, we had better update all the minimum supported versions (if needed).

Found the updated list in the 1.5.0 rst and applied to the PR

JMBurley added a commit to JMBurley/pandas that referenced this issue Jul 12, 2022
JMBurley added a commit to JMBurley/pandas that referenced this issue Jul 12, 2022
see pandas-dev#39164 for discussion.  0.6.0 might be an overly restrictive version, but it is compatible
@JMBurley
Contributor

JMBurley commented Jul 12, 2022

@mroeschke @bashtage @TomAugspurger @jreback @fangchenli :

Thanks, comments & feedback incorporated. I've updated the PR #47336.

Let me know if further changes needed and/or who to tag for the PR review.

@JMBurley
Contributor

@mroeschke @TomAugspurger @jreback

Sorry to ping again, want to try and get this into the 1.5 milestone if possible.

The issue-closing PR #47336, which manages optional dependencies via the extras_require argument, is up and passing tests. Are you guys happy to review as-is or is further discussion needed?

@JMBurley
Contributor

@mroeschke @TomAugspurger @jreback

Earlier discussion here suggested that this PR adding proper optional packaging to Pandas might be suited for PDEP when PDEP was finalised (#47444).

As PDEP is now done, I could make this the first PDEP?

Although:

  1. It is somewhat of a fait accompli, as the solution is already built, with a complete PR and documentation waiting to go in (BLD: add optional dependencies as extras_require in setup.py #47336).
  2. I'm not sure if the process for the core team reviewing PDEP is actually ready to go?

Let me know your thoughts.

@JMBurley
Contributor

JMBurley commented Sep 7, 2022

Bumping this issue. The problem is solved and we are just waiting for approval on the solution, #47336, hopefully in version 1.5.0.

Remaining issues & debates are best resolved by having the dependency options available for open-source users to change in minor version updates. I think it would be a significant loss for this to be delayed until pandas v1.6.0, although I'm not sure what else is competing for attention prior to that release.

mroeschke added a commit that referenced this issue Oct 21, 2022
* add-recommended-dependencies-as-extras_require-in-setup.cfg

See issue #47335.

#47335

recommended dependencies should have package mgmt facilitated through pandas.  This will make mgmt of pandas in production docker environments a lot simpler.

* Update v1.4.3.rst

* double backtick code in rst

* rebundle under extras_require `recommended`

* [options.extras_require] bundled dependencies by featureset

see #39164 for discussion.

* note: [options.extras_require] bundled dependencies by featureset

* Update setup.cfg

rollback numba from recommended.  This would necessitate an update to documentation that requires broad agreement from pd-dev-core team, which would slow down the overall PR

* add adlfs for azure to `access data in cloud`

see #39164 for discussion.  0.6.0 might be an overly restrictive version, but it is compatible

* fix extras_require: PyTables is actually `tables` on PyPi

* Update setup.cfg

* add `all` option to [options.extras_require]

* moved changelog to 1.4.4 as 1.4.3 released while this PR was stalled

* Updated to 1.5.0 compliance

* simplify sql option names

* extras rename: recommended -> performance

* remove azure

support is currently unofficial as of 1.5.0

* align with actions-38-minimum_versions.yaml

add  specific installs and, where required, missing install documentation for
 - odfpy
 - pyreadstat
 - compression options

* Pandas -> pandas in doc

Co-authored-by: Matthew Roeschke <emailformattr@gmail.com>

* extras rename: s3 -> aws

see #47336 (comment)

* extras rename: table -> output_formatting

to be more general in case of future changes

* bug: `>=` not `=`

* Apply suggestions from code review

Co-authored-by: Simon Hawkins <simonjayhawkins@gmail.com>

* align 1.5.0.rst to latest extras_require updates

* 1.5.0.rst example updated to use valid extras

* add optional dep mgmt instructions to install.rst

* lint scipy optional import

Co-authored-by: Matthew Roeschke <emailformattr@gmail.com>

* Apply suggestions from code review

* detailed extras guidance in install.rst

 - updated numba to a full recommended dependency with a promotional bullet point like bottleneck and numexpr
- clarified the extra to use for each set of optional dependencies
- made xml an optional extra, because it does have usage outside of read_html.

* _optional.py note to keep track of setup.cfg

* bug: indent after bullet in install.rst

* remove numba from computation extra.

* Backport PR #48197 on branch 1.5.x (DOC: Cleanup 1.5 whatsnew) (#48228)

Backport PR #48197: DOC: Cleanup 1.5 whatsnew

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR #48232 on branch 1.5.x (CI: Ensure jobs run on 1.5.x branch) (#48235)

Backport PR #48232: CI: Ensure jobs run on 1.5.x branch

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR #48180 on branch 1.5.x (CI: Switch to large for circleci) (#48251)

Backport PR #48180: CI: Switch to large for circleci

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR #48245 on branch 1.5.x (CI: Skip test_round_sanity tests due to failures) (#48257)

Backport PR #48245: CI: Skip test_round_sanity tests due to failures

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR #48240 on branch 1.5.x (Fix mypy erroring on backport branches) (#48259)

Backport PR #48240: Fix mypy erroring on backport branches

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR #48215 on branch 1.5.x (REGR: properly update DataFrame cache in Series.__setitem__) (#48268)

Backport PR #48215: REGR: properly update DataFrame cache in Series.__setitem__

Co-authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>

* Backport PR #48272 on branch 1.5.x (CI: Require s3fs greater than minumum version in builds) (#48276)

Backport PR #48272: CI: Require s3fs greater than minumum version in builds

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR #48299 on branch 1.5.x (Bump s3fs to 2021.08.00) (#48305)

Backport PR #48299: Bump s3fs to 2021.08.00

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR #48027 on branch 1.5.x (ENH: Support masks in groupby prod) (#48302)

Backport PR #48027: ENH: Support masks in groupby prod

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR #47762 on branch 1.5.x (REGR: preserve reindexed array object (instead of creating new array) for concat with all-NA array) (#48309)

Backport PR #47762: REGR: preserve reindexed array object (instead of creating new array) for concat with all-NA array

Co-authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>

* Backport PR #48246 on branch 1.5.x (REGR: iloc not possible for sparse DataFrame) (#48311)

Backport PR #48246: REGR: iloc not possible for sparse DataFrame

Co-authored-by: Simon Hawkins <simonjayhawkins@gmail.com>

* Backport PR #48314 on branch 1.5.x (DOC: v1.4.4 release date and tidy up release notes) (#48320)

Backport PR #48314: DOC: v1.4.4 release date and tidy up release notes

Co-authored-by: Simon Hawkins <simonjayhawkins@gmail.com>

* Backport PR #48301 on branch 1.5.x (DEPR: Deprecate positional arguments in pivot) (#48326)

Backport PR #48301: DEPR: Deprecate positional arguments in pivot

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR #48214 on branch 1.5.x (WEB: Removing links to pdf version of the docs from web and docs) (#48242)

Backport PR #48214: WEB: Removing links to pdf version of the docs from web and docs

* Backport PR #48159 on branch 1.5.x (TST: Fix interchange/plotting/groupby test warnings) (#48279)

Backport PR #48159: TST: Fix interchange/plotting/groupby test warnings

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR #48324 on branch 1.5.x (BUG: Add note in whatsnew for DataFrame.at behavior change) (#48345)

Backport PR #48324: BUG: Add note in whatsnew for DataFrame.at behavior change

Co-authored-by: Richard Shadrach <45562402+rhshadrach@users.noreply.github.com>

* Backport PR #48336 on branch 1.5.x (DOC: Add whatsnew note for #45404) (#48341)

Backport PR #48336: DOC: Add whatsnew note for #45404

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR #48254 on branch 1.5.x (REF: avoid FutureWarning about using deprecates loc.__setitem__ non-inplace usage) (#48353)

Backport PR #48254: REF: avoid FutureWarning about using deprecates loc.__setitem__ non-inplace usage

Co-authored-by: jbrockmendel <jbrockmendel@gmail.com>

* Backport PR #48334 on branch 1.5.x (BUG: read_html(extract_links=all) with no header) (#48350)

Backport PR #48334: BUG: read_html(extract_links=all) with no header

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR #48265 on branch 1.5.x (CI: Setting up ssh key to upload prod docs) (#48370)

Backport PR #48265: CI: Setting up ssh key to upload prod docs

Co-authored-by: Marc Garcia <garcia.marc@gmail.com>

* Backport PR #48381 on branch 1.5.x (CI: Pin mambaforge image) (#48401)

Backport PR #48381: CI: Pin mambaforge image

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR #48229 on branch 1.5.x (TST: Test Nullable int floordiv by 0) (#48413)

Backport PR #48229: TST: Test Nullable int floordiv by 0

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR #48414 on branch 1.5.x (DOC: Add deprecation to is_categorical) (#48418)

Backport PR #48414: DOC: Add deprecation to is_categorical

Co-authored-by: Kevin Sheppard <bashtage@users.noreply.github.com>

* Backport PR #48264 on branch 1.5.x (BUG: ArrowExtensionArray._from_* accepts pyarrow arrays) (#48422)

* Backport PR #48264: BUG: ArrowExtensionArray._from_* accepts pyarrow arrays

* Add missing import

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR #48411 on branch 1.5.x (REGR: get_loc for ExtensionEngine not returning bool indexer for na) (#48430)

Backport PR #48411: REGR: get_loc for ExtensionEngine not returning bool indexer for na

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR #48419 on branch 1.5.x (BUG: ensure to return writable buffer in __dataframe__ interchange for categorical column) (#48441)

Backport PR #48419: BUG: ensure to return writable buffer in __dataframe__ interchange for categorical column

Co-authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>

* Backport PR #48444 on branch 1.5.x (CI: Pin ipython version) (#48449)

Backport PR #48444: CI: Pin ipython version

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR #48380 on branch 1.5.x (DOC: Clarify that objects dtype takes precedence in where) (#48445)

* Backport PR #48380: DOC: Clarify that objects dtype takes precedence in where

* Update generic.py

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Manual Backport PR #48427 on branch 1.5.x (BLD: Refactor Dockerfile to not install dev enviornment on base) (#48450)

Backport PR #48427: BLD: Refactor Dockerfile to not install dev enviornment on base

* Backport PR #48426 on branch 1.5.x (BUG: Column.size should be a method) (#48465)

Backport PR #48426: BUG: Column.size should be a method

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR #48398 on branch 1.5.x (WARN: Avoid FutureWarnings in tests) (#48420)

* Backport PR #48398: WARN: Avoid FutureWarnings in tests

* Update

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>
Co-authored-by: Marc Garcia <garcia.marc@gmail.com>

* Backport PR #48416 on branch 1.5.x (REF: ensure to apply suffixes before concat step in merge code) (#48470)

Backport PR #48416: REF: ensure to apply suffixes before concat step in merge code

Co-authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>

* Backport PR #48354 on branch 1.5.x (CI: Bump timeout to 180 minutes) (#48474)

Backport PR #48354: CI: Bump timeout to 180 minutes

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR #48472 on branch 1.5.x (PERF: keep using ObjectEngine for ExtensionArrays for 1.5) (#48486)

Backport PR #48472: PERF: keep using ObjectEngine for ExtensionArrays for 1.5

Co-authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>

* Backport PR #48473 on branch 1.5.x (REGR: .describe on unsigned dtypes results in object) (#48501)

Backport PR #48473: REGR: .describe on unsigned dtypes results in object

Co-authored-by: Richard Shadrach <45562402+rhshadrach@users.noreply.github.com>

* Backport PR #48443 on branch 1.5.x (BUG: Fix pyarrow groupby tests)  (#48494)

* BUG: Fix pyarrow groupby tests (#48443)

# Conflicts:
#	pandas/tests/extension/test_arrow.py

* CI: Fix failing tests (#48493)

Co-authored-by: jbrockmendel <jbrockmendel@gmail.com>

* Backport PR #48490 on branch 1.5.x (CI: Use -j1 for python-dev build to avoid flaky build error) (#48517)

Backport PR #48490: CI: Use -j1 for python-dev build to avoid flaky build error

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Manual Backport PR #48428 on branch 1.5.x (BUG/TST: fix a bunch of arraymanager+pyarrow tests) (#48518)

Backport PR #48428: BUG/TST: fix a bunch of arraymanager+pyarrow tests

Co-authored-by: jbrockmendel <jbrockmendel@gmail.com>

* Backport PR #48525 on branch 1.5.x (CI: Fix py311 builds different exception message) (#48529)

Backport PR #48525: CI: Fix py311 builds different exception message

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR #48489 on branch 1.5.x (BUG: fix test_arrow.py tests) (#48532)

Backport PR #48489: BUG: fix test_arrow.py tests

Co-authored-by: jbrockmendel <jbrockmendel@gmail.com>

* Backport PR #48543 on branch 1.5.x (DOC: Update footer and include OVH) (#48548)

Backport PR #48543: DOC: Update footer and include OVH

Co-authored-by: Marc Garcia <garcia.marc@gmail.com>

* Manual Backport PR #48417 on branch 1.5.x (Revert set_index inplace and copy keyword changes) (#48552)

Backport PR #48417: Revert set_index inplace and copy keyword changes

Co-authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>

* Backport PR #48550 on branch 1.5.x (TST: remove 2D tests irrelevant for pyarrow) (#48554)

Backport PR #48550: TST: remove 2D tests irrelevant for pyarrow

Co-authored-by: jbrockmendel <jbrockmendel@gmail.com>

* Backport PR #48556 on branch 1.5.x (DOC: Fix docs footer) (#48558)

Backport PR #48556: DOC: Fix docs footer

Co-authored-by: Marc Garcia <garcia.marc@gmail.com>

* Backport PR #48562 on branch 1.5.x (TST: Testing that no warnings are emitted and that inplace fillna produces the correct result (GH48480)) (#48564)

Backport PR #48562: TST: Testing that no warnings are emitted and that inplace fillna produces the correct result (GH48480)

Co-authored-by: RaphSku <45042665+RaphSku@users.noreply.github.com>

* Backport PR #48563 on branch 1.5.x (DOC: Fix read_sas 1.5 release notes) (#48565)

Backport PR #48563: DOC: Fix read_sas 1.5 release notes

Co-authored-by: Jonas Haag <jonas@lophus.org>

* Backport PR #48539 on branch 1.5.x (REGR: groupby doesn't identify null values when sort=False) (#48568)

Backport PR #48539: REGR: groupby doesn't identify null values when sort=False

Co-authored-by: Richard Shadrach <45562402+rhshadrach@users.noreply.github.com>

* Backport PR #48555 on branch 1.5.x (DEPR: Series.astype(np.datetime64)) (#48569)

Backport PR #48555: DEPR: Series.astype(np.datetime64)

Co-authored-by: jbrockmendel <jbrockmendel@gmail.com>

* Backport PR #48557 on branch 1.5.x (WEB: Add new footer to web) (#48571)

Backport PR #48557: WEB: Add new footer to web

Co-authored-by: Marc Garcia <garcia.marc@gmail.com>

* Backport PR #48285 on branch 1.5.x (WEB: Unpin pydata sphinx theme) (#48585)

Backport PR #48285: WEB: Unpin pydata sphinx theme

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR #48572 on branch 1.5.x (DOC: Fixing styles for the dark theme) (#48584)

Backport PR #48572: DOC: Fixing styles for the dark theme

Co-authored-by: Marc Garcia <garcia.marc@gmail.com>

* Backport PR #48397 on branch 1.5.x (WARN: Remove false positive warning for iloc inplaceness) (#48583)

Backport PR #48397: WARN: Remove false positive warning for iloc inplaceness

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR #48587 on branch 1.5.x (Fix `series.str.startswith(tuple)`) (#48593)

Backport PR #48587: Fix `series.str.startswith(tuple)`

Co-authored-by: Janosh Riebesell <janosh.riebesell@gmail.com>

* Backport PR #48601 on branch 1.5.x (CI: Fix matplolib release issues) (#48617)

Backport PR #48601: CI: Fix matplolib release issues

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR #48623 on branch 1.5.x (REGR/DOC: Docs left navbar broke) (#48625)

Backport PR #48623: REGR/DOC: Docs left navbar broke

Co-authored-by: Richard Shadrach <45562402+rhshadrach@users.noreply.github.com>

* Backport PR #48619 on branch 1.5.x (REGR: Loc.setitem with enlargement raises for nested data) (#48629)

Backport PR #48619: REGR: Loc.setitem with enlargement raises for nested data

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR #48627 on branch 1.5.x (DOC: Last changes to release notes for 1.5.0 release) (#48630)

Backport PR #48627: DOC: Last changes to release notes for 1.5.0 release

Co-authored-by: Marc Garcia <garcia.marc@gmail.com>

* RLS: 1.5.0

* Backport PR #48642 on branch 1.5.x (DOC: Add release notes for 1.5.1) (#48647)

Backport PR #48642: DOC: Add release notes for 1.5.1

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR #48639 on branch 1.5.x (CI: Fix directory name for published prod docs) (#48648)

Backport PR #48639: CI: Fix directory name for published prod docs

Co-authored-by: Marc Garcia <garcia.marc@gmail.com>

* Backport PR #48651 on branch 1.5.x (REGR: TextIOWrapper raising an error in read_csv) (#48666)

Backport PR #48651: REGR: TextIOWrapper raising an error in read_csv

Co-authored-by: Torsten Wörtwein <twoertwein@users.noreply.github.com>

* Backport PR #48599 on branch 1.5.x (DOC: Add deprecation infos to deprecated functions) (#48690)

Backport PR #48599: DOC: Add deprecation infos to deprecated functions

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR #48620 on branch 1.5.x (REGR: Performance decrease in factorize) (#48710)

Backport PR #48620: REGR: Performance decrease in factorize

Co-authored-by: Richard Shadrach <45562402+rhshadrach@users.noreply.github.com>

* Backport PR #48711 on branch 1.5.x (REGR: Regression in DataFrame.loc when setting df with all True indexer) (#48717)

Backport PR #48711: REGR: Regression in DataFrame.loc when setting df with all True indexer

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR #48696 on branch 1.5.x (REGR: to_hdf raising AssertionError with boolean index) (#48716)

Backport PR #48696: REGR: to_hdf raising AssertionError with boolean index

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR #48734 on branch 1.5.x (REGR: Raise on invalid colormap for scatter plot) (#48744)

Backport PR #48734: REGR: Raise on invalid colormap for scatter plot

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR #48713 on branch 1.5.x (BUG: pivot_table raising Future Warning with datetime column as index) (#48742)

Backport PR #48713: BUG: pivot_table raising Future Warning with datetime column as index

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR #48703 on branch 1.5.x (Added theme_switcher ) (#48741)

Backport PR #48703: Added theme_switcher

Co-authored-by: Deepak Sirohiwal <38135521+deepaksirohiwal@users.noreply.github.com>

* Backport PR #48697 on branch 1.5.x (REGR: None converted to NaN when enlarging Series) (#48745)

Backport PR #48697: REGR: None converted to NaN when enlarging Series

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR #48702 on branch 1.5.x (REGR: dropna affects observed in groupby) (#48750)

Backport PR #48702: REGR: dropna affects observed in groupby

Co-authored-by: Richard Shadrach <45562402+rhshadrach@users.noreply.github.com>

* Backport PR #48782 on branch 1.5.x (REGR: describe raising when result contains NA) (#48793)

Backport PR #48782: REGR: describe raising when result contains NA

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR #48662 on branch 1.5.x (BUG: Series.getitem not falling back to positional for bool index) (#48799)

Backport PR #48662: BUG: Series.getitem not falling back to positional for bool index

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR #48751 on branch 1.5.x (WEB: Update link to datapythonista blog url) (#48798)

Backport PR #48751: WEB: Update link to datapythonista blog url

Co-authored-by: Marc Garcia <garcia.marc@gmail.com>

* Backport PR #48608 on branch 1.5.x (REGR: assert_index_equal raising with non matching pd.NA) (#48800)

* Backport PR #48608: REGR: assert_index_equal raising with non matching pd.NA

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR #48785 on branch 1.5.x (BUG: still emitting unnecessary FutureWarning in DataFrame.sort_values with sparse columns) (#48807)

Backport PR #48785: BUG: still emitting unnecessary FutureWarning in DataFrame.sort_values with sparse columns

Co-authored-by: Marco Edward Gorelli <33491632+MarcoGorelli@users.noreply.github.com>

* Backport PR #48693 on branch 1.5.x (ENH: Make deprecate_nonkeyword_arguments alter function signature) (#48795)

Backport PR #48693: ENH: Make deprecate_nonkeyword_arguments alter function signature

Co-authored-by: Shantanu <12621235+hauntsaninja@users.noreply.github.com>

* Backport PR #48579 on branch 1.5.x (BUG: Fix calling groupBy(...).apply(func) on an empty dataframe invokes func) (#48817)

BUG: Fix calling groupBy(...).apply(func) on an empty dataframe invokes func  (#48579)

(cherry picked from commit 8b0ad71)

Co-authored-by: Dennis Chukwunta <chuksmcdennis@yahoo.com>

* Backport PR #48760 on branch 1.5.x (REGR: groupby.size with axis=1 doesn't return a Series) (#48825)

* Backport PR #48820 on branch 1.5.x (BUG: to_datetime(format='...%f') parses nanoseconds) (#48860)

BUG: to_datetime(format='...%f') parses nanoseconds (#48820)

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR #48850 on branch 1.5.x (TYP: Fix typing errors caused by new numpy) (#48859)

* Backport PR #48790 on branch 1.5.x (Created icons for dark theme) (#48875)

Backport PR #48790: Created icons for dark theme

Co-authored-by: Lorenzo Vainigli <lorenzovngl@gmail.com>

* Backport PR #48805 on branch 1.5.x (Added padding and fixed columns for sponsor logos in mobile view) (#48874)

Backport PR #48805: Added padding and fixed columns for sponsor logos in mobile view

Co-authored-by: Amay Patel <92037532+amay-patel@users.noreply.github.com>

* Backport PR #48866 on branch 1.5.x (REGR: replace replacing wrong values with inplace and datetime) (#48872)

Backport PR #48866: REGR: replace replacing wrong values with inplace and datetime

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR #48853 on branch 1.5.x (REGR: Avoid unnecessary warning when setting empty dataframe) (#48873)

Backport PR #48853: REGR: Avoid unnecessary warning when setting empty dataframe

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR #48833 on branch 1.5.x (BUG: ArrowExtensionArray compared to invalid object not raising) (#48878)

Backport PR #48833: BUG: ArrowExtensionArray compared to invalid object not raising

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR #48686 on branch 1.5.x (BUG: to_datetime(tz_mix, utc=True) converts to UTC) (#48882)

Backport PR #48686: BUG: to_datetime(tz_mix, utc=True) converts to UTC

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR #48736 on branch 1.5.x (BUG: AttributeError: 'function' object has no attribute 'currentframe') (#48887)

* Backport PR #48797 on branch 1.5.x (REGR: fix df.apply with keyword non-zero axis) (#48886)

REGR: fix df.apply with keyword non-zero axis (#48797)

Co-authored-by: Shantanu <12621235+hauntsaninja@users.noreply.github.com>

* add pandas[foo] throughout to explain the optional extras process

Also rename `Recommended dependencies` to `Performance dependencies (recommended)`

* extend optional_extra `all` to cover tests and redundant packages

* add tzdata from pandas 1.5.0

* move summary to latest whatsnew doc

Code is now stable and updated.

Attempt a port from doc/source/whatsnew/v1.5.0.rst over to the latest document, doc/source/whatsnew/v1.5.1.rst.

* explicitly pair packages to optional_extra in installs

* fix sphinx errors in install.rst

extra padding on column + empty return before table

* add: pytest-asyncio>=0.19.0

pytest-asyncio>=0.19.0

Closes #48361.

No version guidance was available, so this pins the latest release. No one should be messing around with tests without modern installs.

* fsspec note

* repin pytest-asyncio>=0.17.0

matches pytest-asyncio>=0.17.0 in ci/deps/actions-38-minimum_versions.yaml

* move summary from whatsnew/v1.5.1 to v1.6.0

plus add (:issue:`48361`) resolution note

* linting

double-backtick ``test``
underline tilde match title length

* Add `clipboard` as optional extra

* Review comments

 - fss optional extra
 - better numba description

* remove unneeded comment

* fix: leave 1.5.1.rst unchanged by this PR

* Update doc/source/whatsnew/v1.5.1.rst

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* pyarrow bump 1.0.1 -> 6.0.0

Co-authored-by: Matthew Roeschke <emailformattr@gmail.com>
Co-authored-by: Simon Hawkins <simonjayhawkins@gmail.com>
Co-authored-by: MeeseeksMachine <39504233+meeseeksmachine@users.noreply.github.com>
Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>
Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>
Co-authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>
Co-authored-by: Richard Shadrach <45562402+rhshadrach@users.noreply.github.com>
Co-authored-by: jbrockmendel <jbrockmendel@gmail.com>
Co-authored-by: Marc Garcia <garcia.marc@gmail.com>
Co-authored-by: Kevin Sheppard <bashtage@users.noreply.github.com>
Co-authored-by: RaphSku <45042665+RaphSku@users.noreply.github.com>
Co-authored-by: Jonas Haag <jonas@lophus.org>
Co-authored-by: Janosh Riebesell <janosh.riebesell@gmail.com>
Co-authored-by: Pandas Development Team <pandas-dev@python.org>
Co-authored-by: Torsten Wörtwein <twoertwein@users.noreply.github.com>
Co-authored-by: Deepak Sirohiwal <38135521+deepaksirohiwal@users.noreply.github.com>
Co-authored-by: Marco Edward Gorelli <33491632+MarcoGorelli@users.noreply.github.com>
Co-authored-by: Shantanu <12621235+hauntsaninja@users.noreply.github.com>
Co-authored-by: Dennis Chukwunta <chuksmcdennis@yahoo.com>
Co-authored-by: Lorenzo Vainigli <lorenzovngl@gmail.com>
Co-authored-by: Amay Patel <92037532+amay-patel@users.noreply.github.com>
phofl added a commit to phofl/pandas that referenced this issue Oct 21, 2022
…dev#47336)

* add-recommended-dependencies-as-extras_require-in-setup.cfg

See issue pandas-dev#47335.

pandas-dev#47335

Recommended dependencies should have their package management facilitated through pandas. This will make managing pandas in production Docker environments a lot simpler.

* Update v1.4.3.rst

* double backtick code in rst

* rebundle under extras_require `recommended`

* [options.extras_require] bundled dependencies by featureset

see pandas-dev#39164 for discussion.

* note: [options.extras_require] bundled dependencies by featureset

* Update setup.cfg

Roll back numba from recommended. Including it would necessitate an update to the documentation that requires broad agreement from the pd-dev-core team, which would slow down the overall PR.

* add adlfs for azure to `access data in cloud`

see pandas-dev#39164 for discussion.  0.6.0 might be an overly restrictive version, but it is compatible

* fix extras_require: PyTables is actually `tables` on PyPI

* Update setup.cfg

* add `all` option to [options.extras_require]

* moved changelog to 1.4.4 as 1.4.3 released while this PR was stalled

* Updated to 1.5.0 compliance

* simplify sql option names

* extras rename: recommended -> performance

* remove azure

support is currently unofficial as of 1.5.0

* align with actions-38-minimum_versions.yaml

add specific installs and, where required, missing install documentation for
 - odfpy
 - pyreadstat
 - compression options

* Pandas -> pandas in doc

Co-authored-by: Matthew Roeschke <emailformattr@gmail.com>

* extras rename: s3 -> aws

see pandas-dev#47336 (comment)

* extras rename: table -> output_formatting

to be more general in case of future changes

* bug: `>=` not `=`

* Apply suggestions from code review

Co-authored-by: Simon Hawkins <simonjayhawkins@gmail.com>

* align 1.5.0.rst to latest extras_require updates

* 1.5.0.rst example updated to use valid extras

* add optional dep mgmt instructions to install.rst

* lint scipy optional import

Co-authored-by: Matthew Roeschke <emailformattr@gmail.com>

* Apply suggestions from code review

* detailed extras guidance in install.rst

 - updated numba to a full recommended dependency with a promotional bullet point like bottleneck and numexpr
 - clarified the extra to use for each set of optional dependencies
 - made xml an optional extra, because it does have usage outside of read_html.

* _optional.py note to keep track of setup.cfg

* bug: indent after bullet in install.rst

* remove numba from computation extra.

* Backport PR pandas-dev#48197 on branch 1.5.x (DOC: Cleanup 1.5 whatsnew) (pandas-dev#48228)

Backport PR pandas-dev#48197: DOC: Cleanup 1.5 whatsnew

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR pandas-dev#48232 on branch 1.5.x (CI: Ensure jobs run on 1.5.x branch) (pandas-dev#48235)

Backport PR pandas-dev#48232: CI: Ensure jobs run on 1.5.x branch

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR pandas-dev#48180 on branch 1.5.x (CI: Switch to large for circleci) (pandas-dev#48251)

Backport PR pandas-dev#48180: CI: Switch to large for circleci

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48245 on branch 1.5.x (CI: Skip test_round_sanity tests due to failures) (pandas-dev#48257)

Backport PR pandas-dev#48245: CI: Skip test_round_sanity tests due to failures

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48240 on branch 1.5.x (Fix mypy erroring on backport branches) (pandas-dev#48259)

Backport PR pandas-dev#48240: Fix mypy erroring on backport branches

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48215 on branch 1.5.x (REGR: properly update DataFrame cache in Series.__setitem__) (pandas-dev#48268)

Backport PR pandas-dev#48215: REGR: properly update DataFrame cache in Series.__setitem__

Co-authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>

* Backport PR pandas-dev#48272 on branch 1.5.x (CI: Require s3fs greater than minimum version in builds) (pandas-dev#48276)

Backport PR pandas-dev#48272: CI: Require s3fs greater than minimum version in builds

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48299 on branch 1.5.x (Bump s3fs to 2021.08.00) (pandas-dev#48305)

Backport PR pandas-dev#48299: Bump s3fs to 2021.08.00

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48027 on branch 1.5.x (ENH: Support masks in groupby prod) (pandas-dev#48302)

Backport PR pandas-dev#48027: ENH: Support masks in groupby prod

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#47762 on branch 1.5.x (REGR: preserve reindexed array object (instead of creating new array) for concat with all-NA array) (pandas-dev#48309)

Backport PR pandas-dev#47762: REGR: preserve reindexed array object (instead of creating new array) for concat with all-NA array

Co-authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>

* Backport PR pandas-dev#48246 on branch 1.5.x (REGR: iloc not possible for sparse DataFrame) (pandas-dev#48311)

Backport PR pandas-dev#48246: REGR: iloc not possible for sparse DataFrame

Co-authored-by: Simon Hawkins <simonjayhawkins@gmail.com>

* Backport PR pandas-dev#48314 on branch 1.5.x (DOC: v1.4.4 release date and tidy up release notes) (pandas-dev#48320)

Backport PR pandas-dev#48314: DOC: v1.4.4 release date and tidy up release notes

Co-authored-by: Simon Hawkins <simonjayhawkins@gmail.com>

* Backport PR pandas-dev#48301 on branch 1.5.x (DEPR: Deprecate positional arguments in pivot) (pandas-dev#48326)

Backport PR pandas-dev#48301: DEPR: Deprecate positional arguments in pivot

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48214 on branch 1.5.x (WEB: Removing links to pdf version of the docs from web and docs) (pandas-dev#48242)

Backport PR pandas-dev#48214: WEB: Removing links to pdf version of the docs from web and docs

* Backport PR pandas-dev#48159 on branch 1.5.x (TST: Fix interchange/plotting/groupby test warnings) (pandas-dev#48279)

Backport PR pandas-dev#48159: TST: Fix interchange/plotting/groupby test warnings

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR pandas-dev#48324 on branch 1.5.x (BUG: Add note in whatsnew for DataFrame.at behavior change) (pandas-dev#48345)

Backport PR pandas-dev#48324: BUG: Add note in whatsnew for DataFrame.at behavior change

Co-authored-by: Richard Shadrach <45562402+rhshadrach@users.noreply.github.com>

* Backport PR pandas-dev#48336 on branch 1.5.x (DOC: Add whatsnew note for pandas-dev#45404) (pandas-dev#48341)

Backport PR pandas-dev#48336: DOC: Add whatsnew note for pandas-dev#45404

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR pandas-dev#48254 on branch 1.5.x (REF: avoid FutureWarning about using deprecated loc.__setitem__ non-inplace usage) (pandas-dev#48353)

Backport PR pandas-dev#48254: REF: avoid FutureWarning about using deprecated loc.__setitem__ non-inplace usage

Co-authored-by: jbrockmendel <jbrockmendel@gmail.com>

* Backport PR pandas-dev#48334 on branch 1.5.x (BUG: read_html(extract_links=all) with no header) (pandas-dev#48350)

Backport PR pandas-dev#48334: BUG: read_html(extract_links=all) with no header

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR pandas-dev#48265 on branch 1.5.x (CI: Setting up ssh key to upload prod docs) (pandas-dev#48370)

Backport PR pandas-dev#48265: CI: Setting up ssh key to upload prod docs

Co-authored-by: Marc Garcia <garcia.marc@gmail.com>

* Backport PR pandas-dev#48381 on branch 1.5.x (CI: Pin mambaforge image) (pandas-dev#48401)

Backport PR pandas-dev#48381: CI: Pin mambaforge image

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48229 on branch 1.5.x (TST: Test Nullable int floordiv by 0) (pandas-dev#48413)

Backport PR pandas-dev#48229: TST: Test Nullable int floordiv by 0

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR pandas-dev#48414 on branch 1.5.x (DOC: Add deprecation to is_categorical) (pandas-dev#48418)

Backport PR pandas-dev#48414: DOC: Add deprecation to is_categorical

Co-authored-by: Kevin Sheppard <bashtage@users.noreply.github.com>

* Backport PR pandas-dev#48264 on branch 1.5.x (BUG: ArrowExtensionArray._from_* accepts pyarrow arrays) (pandas-dev#48422)

* Backport PR pandas-dev#48264: BUG: ArrowExtensionArray._from_* accepts pyarrow arrays

* Add missing import

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR pandas-dev#48411 on branch 1.5.x (REGR: get_loc for ExtensionEngine not returning bool indexer for na) (pandas-dev#48430)

Backport PR pandas-dev#48411: REGR: get_loc for ExtensionEngine not returning bool indexer for na

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48419 on branch 1.5.x (BUG: ensure to return writable buffer in __dataframe__ interchange for categorical column) (pandas-dev#48441)

Backport PR pandas-dev#48419: BUG: ensure to return writable buffer in __dataframe__ interchange for categorical column

Co-authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>

* Backport PR pandas-dev#48444 on branch 1.5.x (CI: Pin ipython version) (pandas-dev#48449)

Backport PR pandas-dev#48444: CI: Pin ipython version

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48380 on branch 1.5.x (DOC: Clarify that objects dtype takes precedence in where) (pandas-dev#48445)

* Backport PR pandas-dev#48380: DOC: Clarify that objects dtype takes precedence in where

* Update generic.py

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Manual Backport PR pandas-dev#48427 on branch 1.5.x (BLD: Refactor Dockerfile to not install dev environment on base) (pandas-dev#48450)

Backport PR pandas-dev#48427: BLD: Refactor Dockerfile to not install dev environment on base

* Backport PR pandas-dev#48426 on branch 1.5.x (BUG: Column.size should be a method) (pandas-dev#48465)

Backport PR pandas-dev#48426: BUG: Column.size should be a method

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR pandas-dev#48398 on branch 1.5.x (WARN: Avoid FutureWarnings in tests) (pandas-dev#48420)

* Backport PR pandas-dev#48398: WARN: Avoid FutureWarnings in tests

* Update

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>
Co-authored-by: Marc Garcia <garcia.marc@gmail.com>

* Backport PR pandas-dev#48416 on branch 1.5.x (REF: ensure to apply suffixes before concat step in merge code) (pandas-dev#48470)

Backport PR pandas-dev#48416: REF: ensure to apply suffixes before concat step in merge code

Co-authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>

* Backport PR pandas-dev#48354 on branch 1.5.x (CI: Bump timeout to 180 minutes) (pandas-dev#48474)

Backport PR pandas-dev#48354: CI: Bump timeout to 180 minutes

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR pandas-dev#48472 on branch 1.5.x (PERF: keep using ObjectEngine for ExtensionArrays for 1.5) (pandas-dev#48486)

Backport PR pandas-dev#48472: PERF: keep using ObjectEngine for ExtensionArrays for 1.5

Co-authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>

* Backport PR pandas-dev#48473 on branch 1.5.x (REGR: .describe on unsigned dtypes results in object) (pandas-dev#48501)

Backport PR pandas-dev#48473: REGR: .describe on unsigned dtypes results in object

Co-authored-by: Richard Shadrach <45562402+rhshadrach@users.noreply.github.com>

* Backport PR pandas-dev#48443 on branch 1.5.x (BUG: Fix pyarrow groupby tests)  (pandas-dev#48494)

* BUG: Fix pyarrow groupby tests (pandas-dev#48443)

# Conflicts:
#	pandas/tests/extension/test_arrow.py

* CI: Fix failing tests (pandas-dev#48493)

Co-authored-by: jbrockmendel <jbrockmendel@gmail.com>

* Backport PR pandas-dev#48490 on branch 1.5.x (CI: Use -j1 for python-dev build to avoid flaky build error) (pandas-dev#48517)

Backport PR pandas-dev#48490: CI: Use -j1 for python-dev build to avoid flaky build error

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Manual Backport PR pandas-dev#48428 on branch 1.5.x (BUG/TST: fix a bunch of arraymanager+pyarrow tests) (pandas-dev#48518)

Backport PR pandas-dev#48428: BUG/TST: fix a bunch of arraymanager+pyarrow tests

Co-authored-by: jbrockmendel <jbrockmendel@gmail.com>

* Backport PR pandas-dev#48525 on branch 1.5.x (CI: Fix py311 builds different exception message) (pandas-dev#48529)

Backport PR pandas-dev#48525: CI: Fix py311 builds different exception message

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48489 on branch 1.5.x (BUG: fix test_arrow.py tests) (pandas-dev#48532)

Backport PR pandas-dev#48489: BUG: fix test_arrow.py tests

Co-authored-by: jbrockmendel <jbrockmendel@gmail.com>

* Backport PR pandas-dev#48543 on branch 1.5.x (DOC: Update footer and include OVH) (pandas-dev#48548)

Backport PR pandas-dev#48543: DOC: Update footer and include OVH

Co-authored-by: Marc Garcia <garcia.marc@gmail.com>

* Manual Backport PR pandas-dev#48417 on branch 1.5.x (Revert set_index inplace and copy keyword changes) (pandas-dev#48552)

Backport PR pandas-dev#48417: Revert set_index inplace and copy keyword changes

Co-authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>

* Backport PR pandas-dev#48550 on branch 1.5.x (TST: remove 2D tests irrelevant for pyarrow) (pandas-dev#48554)

Backport PR pandas-dev#48550: TST: remove 2D tests irrelevant for pyarrow

Co-authored-by: jbrockmendel <jbrockmendel@gmail.com>

* Backport PR pandas-dev#48556 on branch 1.5.x (DOC: Fix docs footer) (pandas-dev#48558)

Backport PR pandas-dev#48556: DOC: Fix docs footer

Co-authored-by: Marc Garcia <garcia.marc@gmail.com>

* Backport PR pandas-dev#48562 on branch 1.5.x (TST: Testing that no warnings are emitted and that inplace fillna produces the correct result (GH48480)) (pandas-dev#48564)

Backport PR pandas-dev#48562: TST: Testing that no warnings are emitted and that inplace fillna produces the correct result (GH48480)

Co-authored-by: RaphSku <45042665+RaphSku@users.noreply.github.com>

* Backport PR pandas-dev#48563 on branch 1.5.x (DOC: Fix read_sas 1.5 release notes) (pandas-dev#48565)

Backport PR pandas-dev#48563: DOC: Fix read_sas 1.5 release notes

Co-authored-by: Jonas Haag <jonas@lophus.org>

* Backport PR pandas-dev#48539 on branch 1.5.x (REGR: groupby doesn't identify null values when sort=False) (pandas-dev#48568)

Backport PR pandas-dev#48539: REGR: groupby doesn't identify null values when sort=False

Co-authored-by: Richard Shadrach <45562402+rhshadrach@users.noreply.github.com>

* Backport PR pandas-dev#48555 on branch 1.5.x (DEPR: Series.astype(np.datetime64)) (pandas-dev#48569)

Backport PR pandas-dev#48555: DEPR: Series.astype(np.datetime64)

Co-authored-by: jbrockmendel <jbrockmendel@gmail.com>

* Backport PR pandas-dev#48557 on branch 1.5.x (WEB: Add new footer to web) (pandas-dev#48571)

Backport PR pandas-dev#48557: WEB: Add new footer to web

Co-authored-by: Marc Garcia <garcia.marc@gmail.com>

* Backport PR pandas-dev#48285 on branch 1.5.x (WEB: Unpin pydata sphinx theme) (pandas-dev#48585)

Backport PR pandas-dev#48285: WEB: Unpin pydata sphinx theme

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48572 on branch 1.5.x (DOC: Fixing styles for the dark theme) (pandas-dev#48584)

Backport PR pandas-dev#48572: DOC: Fixing styles for the dark theme

Co-authored-by: Marc Garcia <garcia.marc@gmail.com>

* Backport PR pandas-dev#48397 on branch 1.5.x (WARN: Remove false positive warning for iloc inplaceness) (pandas-dev#48583)

Backport PR pandas-dev#48397: WARN: Remove false positive warning for iloc inplaceness

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48587 on branch 1.5.x (Fix `series.str.startswith(tuple)`) (pandas-dev#48593)

Backport PR pandas-dev#48587: Fix `series.str.startswith(tuple)`

Co-authored-by: Janosh Riebesell <janosh.riebesell@gmail.com>

* Backport PR pandas-dev#48601 on branch 1.5.x (CI: Fix matplotlib release issues) (pandas-dev#48617)

Backport PR pandas-dev#48601: CI: Fix matplotlib release issues

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48623 on branch 1.5.x (REGR/DOC: Docs left navbar broke) (pandas-dev#48625)

Backport PR pandas-dev#48623: REGR/DOC: Docs left navbar broke

Co-authored-by: Richard Shadrach <45562402+rhshadrach@users.noreply.github.com>

* Backport PR pandas-dev#48619 on branch 1.5.x (REGR: Loc.setitem with enlargement raises for nested data) (pandas-dev#48629)

Backport PR pandas-dev#48619: REGR: Loc.setitem with enlargement raises for nested data

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48627 on branch 1.5.x (DOC: Last changes to release notes for 1.5.0 release) (pandas-dev#48630)

Backport PR pandas-dev#48627: DOC: Last changes to release notes for 1.5.0 release

Co-authored-by: Marc Garcia <garcia.marc@gmail.com>

* RLS: 1.5.0

* Backport PR pandas-dev#48642 on branch 1.5.x (DOC: Add release notes for 1.5.1) (pandas-dev#48647)

Backport PR pandas-dev#48642: DOC: Add release notes for 1.5.1

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48639 on branch 1.5.x (CI: Fix directory name for published prod docs) (pandas-dev#48648)

Backport PR pandas-dev#48639: CI: Fix directory name for published prod docs

Co-authored-by: Marc Garcia <garcia.marc@gmail.com>

* Backport PR pandas-dev#48651 on branch 1.5.x (REGR: TextIOWrapper raising an error in read_csv) (pandas-dev#48666)

Backport PR pandas-dev#48651: REGR: TextIOWrapper raising an error in read_csv

Co-authored-by: Torsten Wörtwein <twoertwein@users.noreply.github.com>

* Backport PR pandas-dev#48599 on branch 1.5.x (DOC: Add deprecation infos to deprecated functions) (pandas-dev#48690)

Backport PR pandas-dev#48599: DOC: Add deprecation infos to deprecated functions

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48620 on branch 1.5.x (REGR: Performance decrease in factorize) (pandas-dev#48710)

Backport PR pandas-dev#48620: REGR: Performance decrease in factorize

Co-authored-by: Richard Shadrach <45562402+rhshadrach@users.noreply.github.com>

* Backport PR pandas-dev#48711 on branch 1.5.x (REGR: Regression in DataFrame.loc when setting df with all True indexer) (pandas-dev#48717)

Backport PR pandas-dev#48711: REGR: Regression in DataFrame.loc when setting df with all True indexer

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48696 on branch 1.5.x (REGR: to_hdf raising AssertionError with boolean index) (pandas-dev#48716)

Backport PR pandas-dev#48696: REGR: to_hdf raising AssertionError with boolean index

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48734 on branch 1.5.x (REGR: Raise on invalid colormap for scatter plot) (pandas-dev#48744)

Backport PR pandas-dev#48734: REGR: Raise on invalid colormap for scatter plot

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48713 on branch 1.5.x (BUG: pivot_table raising Future Warning with datetime column as index) (pandas-dev#48742)

Backport PR pandas-dev#48713: BUG: pivot_table raising Future Warning with datetime column as index

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48703 on branch 1.5.x (Added theme_switcher ) (pandas-dev#48741)

Backport PR pandas-dev#48703: Added theme_switcher

Co-authored-by: Deepak Sirohiwal <38135521+deepaksirohiwal@users.noreply.github.com>

* Backport PR pandas-dev#48697 on branch 1.5.x (REGR: None converted to NaN when enlarging Series) (pandas-dev#48745)

Backport PR pandas-dev#48697: REGR: None converted to NaN when enlarging Series

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48702 on branch 1.5.x (REGR: dropna affects observed in groupby) (pandas-dev#48750)

Backport PR pandas-dev#48702: REGR: dropna affects observed in groupby

Co-authored-by: Richard Shadrach <45562402+rhshadrach@users.noreply.github.com>

* Backport PR pandas-dev#48782 on branch 1.5.x (REGR: describe raising when result contains NA) (pandas-dev#48793)

Backport PR pandas-dev#48782: REGR: describe raising when result contains NA

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48662 on branch 1.5.x (BUG: Series.getitem not falling back to positional for bool index) (pandas-dev#48799)

Backport PR pandas-dev#48662: BUG: Series.getitem not falling back to positional for bool index

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48751 on branch 1.5.x (WEB: Update link to datapythonista blog url) (pandas-dev#48798)

Backport PR pandas-dev#48751: WEB: Update link to datapythonista blog url

Co-authored-by: Marc Garcia <garcia.marc@gmail.com>

* Backport PR pandas-dev#48608 on branch 1.5.x (REGR: assert_index_equal raising with non matching pd.NA) (pandas-dev#48800)

* Backport PR pandas-dev#48608: REGR: assert_index_equal raising with non matching pd.NA

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48785 on branch 1.5.x (BUG: still emitting unnecessary FutureWarning in DataFrame.sort_values with sparse columns) (pandas-dev#48807)

Backport PR pandas-dev#48785: BUG: still emitting unnecessary FutureWarning in DataFrame.sort_values with sparse columns

Co-authored-by: Marco Edward Gorelli <33491632+MarcoGorelli@users.noreply.github.com>

* Backport PR pandas-dev#48693 on branch 1.5.x (ENH: Make deprecate_nonkeyword_arguments alter function signature) (pandas-dev#48795)

Backport PR pandas-dev#48693: ENH: Make deprecate_nonkeyword_arguments alter function signature

Co-authored-by: Shantanu <12621235+hauntsaninja@users.noreply.github.com>

* Backport PR pandas-dev#48579 on branch 1.5.x (BUG: Fix calling groupBy(...).apply(func) on an empty dataframe invokes func) (pandas-dev#48817)

BUG: Fix calling groupBy(...).apply(func) on an empty dataframe invokes func  (pandas-dev#48579)

(cherry picked from commit 8b0ad71)

Co-authored-by: Dennis Chukwunta <chuksmcdennis@yahoo.com>

* Backport PR pandas-dev#48760 on branch 1.5.x (REGR: groupby.size with axis=1 doesn't return a Series) (pandas-dev#48825)

* Backport PR pandas-dev#48820 on branch 1.5.x (BUG: to_datetime(format='...%f') parses nanoseconds) (pandas-dev#48860)

BUG: to_datetime(format='...%f') parses nanoseconds (pandas-dev#48820)

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR pandas-dev#48850 on branch 1.5.x (TYP: Fix typing errors caused by new numpy) (pandas-dev#48859)

* Backport PR pandas-dev#48790 on branch 1.5.x (Created icons for dark theme) (pandas-dev#48875)

Backport PR pandas-dev#48790: Created icons for dark theme

Co-authored-by: Lorenzo Vainigli <lorenzovngl@gmail.com>

* Backport PR pandas-dev#48805 on branch 1.5.x (Added padding and fixed columns for sponsor logos in mobile view) (pandas-dev#48874)

Backport PR pandas-dev#48805: Added padding and fixed columns for sponsor logos in mobile view

Co-authored-by: Amay Patel <92037532+amay-patel@users.noreply.github.com>

* Backport PR pandas-dev#48866 on branch 1.5.x (REGR: replace replacing wrong values with inplace and datetime) (pandas-dev#48872)

Backport PR pandas-dev#48866: REGR: replace replacing wrong values with inplace and datetime

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48853 on branch 1.5.x (REGR: Avoid unnecessary warning when setting empty dataframe) (pandas-dev#48873)

Backport PR pandas-dev#48853: REGR: Avoid unnecessary warning when setting empty dataframe

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48833 on branch 1.5.x (BUG: ArrowExtensionArray compared to invalid object not raising) (pandas-dev#48878)

Backport PR pandas-dev#48833: BUG: ArrowExtensionArray compared to invalid object not raising

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR pandas-dev#48686 on branch 1.5.x (BUG: to_datetime(tz_mix, utc=True) converts to UTC) (pandas-dev#48882)

Backport PR pandas-dev#48686: BUG: to_datetime(tz_mix, utc=True) converts to UTC

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR pandas-dev#48736 on branch 1.5.x ( BUG: AttributeError: 'function' object has no attribute 'currentframe') (pandas-dev#48887)

* Backport PR pandas-dev#48797 on branch 1.5.x (REGR: fix df.apply with keyword non-zero axis) (pandas-dev#48886)

REGR: fix df.apply with keyword non-zero axis (pandas-dev#48797)

Co-authored-by: Shantanu <12621235+hauntsaninja@users.noreply.github.com>

* add pandas[foo] throughout to explain the optional extras process

Also `Recommended dependencies` to `Performance dependencies (recommended)`

* extend optional_extra `all` to cover tests and redundant packages

* add tzdata from pandas 1.5.0

* move summary to latest whatsnew doc

Code is now stable and updated.

Attempt a port from doc/source/whatsnew/v1.5.0.rst over to latest document  doc/source/whatsnew/v1.5.1.rst.

* explicitly pair packages to optional_extra in installs

* fix sphinx errors in install.rst

extra padding on column + empty return before table

* add: pytest-asyncio>=0.19.0

pytest-asyncio>=0.19.0

Closes pandas-dev#48361.

No version guidance available so went for latest.  No-one should be messing around with tests without modern installs.

* fsspec note

* repin pytest-asyncio>=0.17.0

matches pytest-asyncio>=0.17.0 in ci/deps/actions-38-minimum_versions.yaml

* move summary from whatsnew/v1.5.1 to v1.6.0

plus add (:issue:`48361`) resolution note

* linting

double-backtick ``test``
underline tilde match title length

* Add `clipboard` as optional extra

* Review comments

 - fss optional extra
 - better numba description

* remove unneeded comment

* fix: leave 1.5.1.rst unchanged by this PR

* Update doc/source/whatsnew/v1.5.1.rst

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* pyarrow bump 1.0.1 -> 6.0.0

Co-authored-by: Matthew Roeschke <emailformattr@gmail.com>
Co-authored-by: Simon Hawkins <simonjayhawkins@gmail.com>
Co-authored-by: MeeseeksMachine <39504233+meeseeksmachine@users.noreply.github.com>
Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>
Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>
Co-authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>
Co-authored-by: Richard Shadrach <45562402+rhshadrach@users.noreply.github.com>
Co-authored-by: jbrockmendel <jbrockmendel@gmail.com>
Co-authored-by: Marc Garcia <garcia.marc@gmail.com>
Co-authored-by: Kevin Sheppard <bashtage@users.noreply.github.com>
Co-authored-by: RaphSku <45042665+RaphSku@users.noreply.github.com>
Co-authored-by: Jonas Haag <jonas@lophus.org>
Co-authored-by: Janosh Riebesell <janosh.riebesell@gmail.com>
Co-authored-by: Pandas Development Team <pandas-dev@python.org>
Co-authored-by: Torsten Wörtwein <twoertwein@users.noreply.github.com>
Co-authored-by: Deepak Sirohiwal <38135521+deepaksirohiwal@users.noreply.github.com>
Co-authored-by: Marco Edward Gorelli <33491632+MarcoGorelli@users.noreply.github.com>
Co-authored-by: Shantanu <12621235+hauntsaninja@users.noreply.github.com>
Co-authored-by: Dennis Chukwunta <chuksmcdennis@yahoo.com>
Co-authored-by: Lorenzo Vainigli <lorenzovngl@gmail.com>
Co-authored-by: Amay Patel <92037532+amay-patel@users.noreply.github.com>
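
The extras grouping described in the commit notes above (feature-named groups such as `performance`, `aws` and `test`, plus an `all` group that covers everything) can be sketched roughly as follows. This is a minimal illustration, not the contents of pandas' `setup.cfg`: the group names and the `s3fs` and `pytest-asyncio` pins come from the notes, while the exact package lists and the helper names `with_all` and `requirement` are assumptions made for the example.

```python
# Illustrative only: group names follow the commit notes; the package lists
# are assumptions and do not reproduce pandas' actual setup.cfg.
EXTRAS = {
    "performance": ["bottleneck", "numexpr", "numba"],
    "aws": ["s3fs>=2021.08.00"],
    "test": ["pytest", "pytest-asyncio>=0.17.0"],
}


def with_all(extras):
    """Return a copy of `extras` plus an `all` group listing every requirement once."""
    merged = sorted({req for reqs in extras.values() for req in reqs})
    return {**extras, "all": merged}


def requirement(package, *extras):
    """Build a pip requirement string, e.g. `pandas[aws,performance]`."""
    if not extras:
        return package
    return f"{package}[{','.join(sorted(extras))}]"


if __name__ == "__main__":
    print(with_all(EXTRAS)["all"])
    print(requirement("pandas", "performance", "aws"))  # pandas[aws,performance]
```

A line such as `pandas[aws]` in a `requirements.in` file is what lets a tool like pip-compile pull in `s3fs` without pinning it by hand, which is the use case raised at the top of this issue.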
noatamir pushed a commit to noatamir/pandas that referenced this issue Nov 9, 2022
…dev#47336)

* add-recommended-dependencies-as-extras_require-in-setup.cfg

See issue pandas-dev#47335.

pandas-dev#47335

Recommended dependencies should have their package management facilitated through pandas. This will make management of pandas in production Docker environments a lot simpler.

* Update v1.4.3.rst

* double backtick code in rst

* rebundle under extras_require `recommended`

* [options.extras_require] bundled dependencies by featureset

see pandas-dev#39164 for discussion.

* note: [options.extras_require] bundled dependencies by featureset

* Update setup.cfg

Roll back numba from recommended. Keeping it would necessitate an update to the documentation that requires broad agreement from the pd-dev-core team, which would slow down the overall PR.

* add adlfs for azure to `access data in cloud`

see pandas-dev#39164 for discussion.  0.6.0 might be an overly restrictive version, but it is compatible

* fix extras_require: PyTables is actually `tables` on PyPi

* Update setup.cfg

* add `all` option to [options.extras_require]

* moved changelog to 1.4.4 as 1.4.3 was released while this PR was stalled

* Updated to 1.5.0 compliance

* simplify sql option names

* extras rename: recommended -> performance

* remove azure

support is currently unofficial as of 1.5.0

* align with actions-38-minimum_versions.yaml

add  specific installs and, where required, missing install documentation for
 - odfpy
 - pyreadstat
 - compression options

* Pandas -> pandas in doc

Co-authored-by: Matthew Roeschke <emailformattr@gmail.com>

* extras rename: s3 -> aws

see pandas-dev#47336 (comment)

* extras rename: table -> output_formatting

to be more general in case of future changes

* bug: `>=` not `=`

* Apply suggestions from code review

Co-authored-by: Simon Hawkins <simonjayhawkins@gmail.com>

* align 1.5.0.rst to latest extras_require updates

* 1.5.0.rst example updated to use valid extras

* add optional dep mgmt instructions to install.rst

* lint scipy optional import

Co-authored-by: Matthew Roeschke <emailformattr@gmail.com>

* Apply suggestions from code review

* detailed extras guidance in install.rst

 - updated numba to a full recommended dependency with a promotional bullet point like bottleneck and numexpr
 - clarified the extra to use for each set of optional dependencies
 - made xml an optional extra, because it does have usage outside of read_html.

* _optional.py note to keep track of setup.cfg

* bug: indent after bullet in install.rst

* remove numba from computation extra.

* Backport PR pandas-dev#48197 on branch 1.5.x (DOC: Cleanup 1.5 whatsnew) (pandas-dev#48228)

Backport PR pandas-dev#48197: DOC: Cleanup 1.5 whatsnew

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR pandas-dev#48232 on branch 1.5.x (CI: Ensure jobs run on 1.5.x branch) (pandas-dev#48235)

Backport PR pandas-dev#48232: CI: Ensure jobs run on 1.5.x branch

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR pandas-dev#48180 on branch 1.5.x (CI: Switch to large for circleci) (pandas-dev#48251)

Backport PR pandas-dev#48180: CI: Switch to large for circleci

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48245 on branch 1.5.x (CI: Skip test_round_sanity tests due to failures) (pandas-dev#48257)

Backport PR pandas-dev#48245: CI: Skip test_round_sanity tests due to failures

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48240 on branch 1.5.x (Fix mypy erroring on backport branches) (pandas-dev#48259)

Backport PR pandas-dev#48240: Fix mypy erroring on backport branches

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48215 on branch 1.5.x (REGR: properly update DataFrame cache in Series.__setitem__) (pandas-dev#48268)

Backport PR pandas-dev#48215: REGR: properly update DataFrame cache in Series.__setitem__

Co-authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>

* Backport PR pandas-dev#48272 on branch 1.5.x (CI: Require s3fs greater than minumum version in builds) (pandas-dev#48276)

Backport PR pandas-dev#48272: CI: Require s3fs greater than minumum version in builds

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48299 on branch 1.5.x (Bump s3fs to 2021.08.00) (pandas-dev#48305)

Backport PR pandas-dev#48299: Bump s3fs to 2021.08.00

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48027 on branch 1.5.x (ENH: Support masks in groupby prod) (pandas-dev#48302)

Backport PR pandas-dev#48027: ENH: Support masks in groupby prod

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#47762 on branch 1.5.x (REGR: preserve reindexed array object (instead of creating new array) for concat with all-NA array) (pandas-dev#48309)

Backport PR pandas-dev#47762: REGR: preserve reindexed array object (instead of creating new array) for concat with all-NA array

Co-authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>

* Backport PR pandas-dev#48246 on branch 1.5.x (REGR: iloc not possible for sparse DataFrame) (pandas-dev#48311)

Backport PR pandas-dev#48246: REGR: iloc not possible for sparse DataFrame

Co-authored-by: Simon Hawkins <simonjayhawkins@gmail.com>

* Backport PR pandas-dev#48314 on branch 1.5.x (DOC: v1.4.4 release date and tidy up release notes) (pandas-dev#48320)

Backport PR pandas-dev#48314: DOC: v1.4.4 release date and tidy up release notes

Co-authored-by: Simon Hawkins <simonjayhawkins@gmail.com>

* Backport PR pandas-dev#48301 on branch 1.5.x (DEPR: Deprecate positional arguments in pivot) (pandas-dev#48326)

Backport PR pandas-dev#48301: DEPR: Deprecate positional arguments in pivot

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48214 on branch 1.5.x (WEB: Removing links to pdf version of the docs from web and docs) (pandas-dev#48242)

Backport PR pandas-dev#48214: WEB: Removing links to pdf version of the docs from web and docs

* Backport PR pandas-dev#48159 on branch 1.5.x (TST: Fix interchange/plotting/groupby test warnings) (pandas-dev#48279)

Backport PR pandas-dev#48159: TST: Fix interchange/plotting/groupby test warnings

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR pandas-dev#48324 on branch 1.5.x (BUG: Add note in whatsnew for DataFrame.at behavior change) (pandas-dev#48345)

Backport PR pandas-dev#48324: BUG: Add note in whatsnew for DataFrame.at behavior change

Co-authored-by: Richard Shadrach <45562402+rhshadrach@users.noreply.github.com>

* Backport PR pandas-dev#48336 on branch 1.5.x (DOC: Add whatsnew note for pandas-dev#45404) (pandas-dev#48341)

Backport PR pandas-dev#48336: DOC: Add whatsnew note for pandas-dev#45404

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR pandas-dev#48254 on branch 1.5.x (REF: avoid FutureWarning about using deprecates loc.__setitem__ non-inplace usage) (pandas-dev#48353)

Backport PR pandas-dev#48254: REF: avoid FutureWarning about using deprecates loc.__setitem__ non-inplace usage

Co-authored-by: jbrockmendel <jbrockmendel@gmail.com>

* Backport PR pandas-dev#48334 on branch 1.5.x (BUG: read_html(extract_links=all) with no header) (pandas-dev#48350)

Backport PR pandas-dev#48334: BUG: read_html(extract_links=all) with no header

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR pandas-dev#48265 on branch 1.5.x (CI: Setting up ssh key to upload prod docs) (pandas-dev#48370)

Backport PR pandas-dev#48265: CI: Setting up ssh key to upload prod docs

Co-authored-by: Marc Garcia <garcia.marc@gmail.com>

* Backport PR pandas-dev#48381 on branch 1.5.x (CI: Pin mambaforge image) (pandas-dev#48401)

Backport PR pandas-dev#48381: CI: Pin mambaforge image

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48229 on branch 1.5.x (TST: Test Nullable int floordiv by 0) (pandas-dev#48413)

Backport PR pandas-dev#48229: TST: Test Nullable int floordiv by 0

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR pandas-dev#48414 on branch 1.5.x (DOC: Add deprecation to is_categorical) (pandas-dev#48418)

Backport PR pandas-dev#48414: DOC: Add deprecation to is_categorical

Co-authored-by: Kevin Sheppard <bashtage@users.noreply.github.com>

* Backport PR pandas-dev#48264 on branch 1.5.x (BUG: ArrowExtensionArray._from_* accepts pyarrow arrays) (pandas-dev#48422)

* Backport PR pandas-dev#48264: BUG: ArrowExtensionArray._from_* accepts pyarrow arrays

* Add missing import

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR pandas-dev#48411 on branch 1.5.x (REGR: get_loc for ExtensionEngine not returning bool indexer for na) (pandas-dev#48430)

Backport PR pandas-dev#48411: REGR: get_loc for ExtensionEngine not returning bool indexer for na

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48419 on branch 1.5.x (BUG: ensure to return writable buffer in __dataframe__ interchange for categorical column) (pandas-dev#48441)

Backport PR pandas-dev#48419: BUG: ensure to return writable buffer in __dataframe__ interchange for categorical column

Co-authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>

* Backport PR pandas-dev#48444 on branch 1.5.x (CI: Pin ipython version) (pandas-dev#48449)

Backport PR pandas-dev#48444: CI: Pin ipython version

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48380 on branch 1.5.x (DOC: Clarify that objects dtype takes precedence in where) (pandas-dev#48445)

* Backport PR pandas-dev#48380: DOC: Clarify that objects dtype takes precedence in where

* Update generic.py

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Manual Backport PR pandas-dev#48427 on branch 1.5.x (BLD: Refactor Dockerfile to not install dev enviornment on base) (pandas-dev#48450)

Backport PR pandas-dev#48427: BLD: Refactor Dockerfile to not install dev enviornment on base

* Backport PR pandas-dev#48426 on branch 1.5.x (BUG: Column.size should be a method) (pandas-dev#48465)

Backport PR pandas-dev#48426: BUG: Column.size should be a method

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR pandas-dev#48398 on branch 1.5.x (WARN: Avoid FutureWarnings in tests) (pandas-dev#48420)

* Backport PR pandas-dev#48398: WARN: Avoid FutureWarnings in tests

* Update

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>
Co-authored-by: Marc Garcia <garcia.marc@gmail.com>

* Backport PR pandas-dev#48416 on branch 1.5.x (REF: ensure to apply suffixes before concat step in merge code) (pandas-dev#48470)

Backport PR pandas-dev#48416: REF: ensure to apply suffixes before concat step in merge code

Co-authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>

* Backport PR pandas-dev#48354 on branch 1.5.x (CI: Bump timeout to 180 minutes) (pandas-dev#48474)

Backport PR pandas-dev#48354: CI: Bump timeout to 180 minutes

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR pandas-dev#48472 on branch 1.5.x (PERF: keep using ObjectEngine for ExtensionArrays for 1.5) (pandas-dev#48486)

Backport PR pandas-dev#48472: PERF: keep using ObjectEngine for ExtensionArrays for 1.5

Co-authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>

* Backport PR pandas-dev#48473 on branch 1.5.x (REGR: .describe on unsigned dtypes results in object) (pandas-dev#48501)

Backport PR pandas-dev#48473: REGR: .describe on unsigned dtypes results in object

Co-authored-by: Richard Shadrach <45562402+rhshadrach@users.noreply.github.com>

* Backport PR pandas-dev#48443 on branch 1.5.x (BUG: Fix pyarrow groupby tests)  (pandas-dev#48494)

* BUG: Fix pyarrow groupby tests (pandas-dev#48443)

# Conflicts:
#	pandas/tests/extension/test_arrow.py

* CI: Fix failing tests (pandas-dev#48493)

Co-authored-by: jbrockmendel <jbrockmendel@gmail.com>

* Backport PR pandas-dev#48490 on branch 1.5.x (CI: Use -j1 for python-dev build to avoid flaky build error) (pandas-dev#48517)

Backport PR pandas-dev#48490: CI: Use -j1 for python-dev build to avoid flaky build error

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Manual Backport PR pandas-dev#48428 on branch 1.5.x (BUG/TST: fix a bunch of arraymanager+pyarrow tests) (pandas-dev#48518)

Backport PR pandas-dev#48428: BUG/TST: fix a bunch of arraymanager+pyarrow tests

Co-authored-by: jbrockmendel <jbrockmendel@gmail.com>

* Backport PR pandas-dev#48525 on branch 1.5.x (CI: Fix py311 builds different exception message) (pandas-dev#48529)

Backport PR pandas-dev#48525: CI: Fix py311 builds different exception message

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48489 on branch 1.5.x (BUG: fix test_arrow.py tests) (pandas-dev#48532)

Backport PR pandas-dev#48489: BUG: fix test_arrow.py tests

Co-authored-by: jbrockmendel <jbrockmendel@gmail.com>

* Backport PR pandas-dev#48543 on branch 1.5.x (DOC: Update footer and include OVH) (pandas-dev#48548)

Backport PR pandas-dev#48543: DOC: Update footer and include OVH

Co-authored-by: Marc Garcia <garcia.marc@gmail.com>

* Manual Backport PR pandas-dev#48417 on branch 1.5.x (Revert set_index inplace and copy keyword changes) (pandas-dev#48552)

Backport PR pandas-dev#48417: Revert set_index inplace and copy keyword changes

Co-authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>

* Backport PR pandas-dev#48550 on branch 1.5.x (TST: remove 2D tests irrelevant for pyarrow) (pandas-dev#48554)

Backport PR pandas-dev#48550: TST: remove 2D tests irrelevant for pyarrow

Co-authored-by: jbrockmendel <jbrockmendel@gmail.com>

* Backport PR pandas-dev#48556 on branch 1.5.x (DOC: Fix docs footer) (pandas-dev#48558)

Backport PR pandas-dev#48556: DOC: Fix docs footer

Co-authored-by: Marc Garcia <garcia.marc@gmail.com>

* Backport PR pandas-dev#48562 on branch 1.5.x (TST: Testing that no warnings are emitted and that inplace fillna produces the correct result (GH48480)) (pandas-dev#48564)

Backport PR pandas-dev#48562: TST: Testing that no warnings are emitted and that inplace fillna produces the correct result (GH48480)

Co-authored-by: RaphSku <45042665+RaphSku@users.noreply.github.com>

* Backport PR pandas-dev#48563 on branch 1.5.x (DOC: Fix read_sas 1.5 release notes) (pandas-dev#48565)

Backport PR pandas-dev#48563: DOC: Fix read_sas 1.5 release notes

Co-authored-by: Jonas Haag <jonas@lophus.org>

* Backport PR pandas-dev#48539 on branch 1.5.x (REGR: groupby doesn't identify null values when sort=False) (pandas-dev#48568)

Backport PR pandas-dev#48539: REGR: groupby doesn't identify null values when sort=False

Co-authored-by: Richard Shadrach <45562402+rhshadrach@users.noreply.github.com>

* Backport PR pandas-dev#48555 on branch 1.5.x (DEPR: Series.astype(np.datetime64)) (pandas-dev#48569)

Backport PR pandas-dev#48555: DEPR: Series.astype(np.datetime64)

Co-authored-by: jbrockmendel <jbrockmendel@gmail.com>

* Backport PR pandas-dev#48557 on branch 1.5.x (WEB: Add new footer to web) (pandas-dev#48571)

Backport PR pandas-dev#48557: WEB: Add new footer to web

Co-authored-by: Marc Garcia <garcia.marc@gmail.com>

* Backport PR pandas-dev#48285 on branch 1.5.x (WEB: Unpin pydata sphinx theme) (pandas-dev#48585)

Backport PR pandas-dev#48285: WEB: Unpin pydata sphinx theme

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48572 on branch 1.5.x (DOC: Fixing styles for the dark theme) (pandas-dev#48584)

Backport PR pandas-dev#48572: DOC: Fixing styles for the dark theme

Co-authored-by: Marc Garcia <garcia.marc@gmail.com>

* Backport PR pandas-dev#48397 on branch 1.5.x (WARN: Remove false positive warning for iloc inplaceness) (pandas-dev#48583)

Backport PR pandas-dev#48397: WARN: Remove false positive warning for iloc inplaceness

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48587 on branch 1.5.x (Fix `series.str.startswith(tuple)`) (pandas-dev#48593)

Backport PR pandas-dev#48587: Fix `series.str.startswith(tuple)`

Co-authored-by: Janosh Riebesell <janosh.riebesell@gmail.com>

* Backport PR pandas-dev#48601 on branch 1.5.x (CI: Fix matplolib release issues) (pandas-dev#48617)

Backport PR pandas-dev#48601: CI: Fix matplolib release issues

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48623 on branch 1.5.x (REGR/DOC: Docs left navbar broke) (pandas-dev#48625)

Backport PR pandas-dev#48623: REGR/DOC: Docs left navbar broke

Co-authored-by: Richard Shadrach <45562402+rhshadrach@users.noreply.github.com>

* Backport PR pandas-dev#48619 on branch 1.5.x (REGR: Loc.setitem with enlargement raises for nested data) (pandas-dev#48629)

Backport PR pandas-dev#48619: REGR: Loc.setitem with enlargement raises for nested data

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48627 on branch 1.5.x (DOC: Last changes to release notes for 1.5.0 release) (pandas-dev#48630)

Backport PR pandas-dev#48627: DOC: Last changes to release notes for 1.5.0 release

Co-authored-by: Marc Garcia <garcia.marc@gmail.com>

* RLS: 1.5.0

* Backport PR pandas-dev#48642 on branch 1.5.x (DOC: Add release notes for 1.5.1) (pandas-dev#48647)

Backport PR pandas-dev#48642: DOC: Add release notes for 1.5.1

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48639 on branch 1.5.x (CI: Fix directory name for published prod docs) (pandas-dev#48648)

Backport PR pandas-dev#48639: CI: Fix directory name for published prod docs

Co-authored-by: Marc Garcia <garcia.marc@gmail.com>

* Backport PR pandas-dev#48651 on branch 1.5.x (REGR: TextIOWrapper raising an error in read_csv) (pandas-dev#48666)

Backport PR pandas-dev#48651: REGR: TextIOWrapper raising an error in read_csv

Co-authored-by: Torsten Wörtwein <twoertwein@users.noreply.github.com>

* Backport PR pandas-dev#48599 on branch 1.5.x (DOC: Add deprecation infos to deprecated functions) (pandas-dev#48690)

Backport PR pandas-dev#48599: DOC: Add deprecation infos to deprecated functions

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48620 on branch 1.5.x (REGR: Performance decrease in factorize) (pandas-dev#48710)

Backport PR pandas-dev#48620: REGR: Performance decrease in factorize

Co-authored-by: Richard Shadrach <45562402+rhshadrach@users.noreply.github.com>

* Backport PR pandas-dev#48711 on branch 1.5.x (REGR: Regression in DataFrame.loc when setting df with all True indexer) (pandas-dev#48717)

Backport PR pandas-dev#48711: REGR: Regression in DataFrame.loc when setting df with all True indexer

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48696 on branch 1.5.x (REGR: to_hdf raising AssertionError with boolean index) (pandas-dev#48716)

Backport PR pandas-dev#48696: REGR: to_hdf raising AssertionError with boolean index

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48734 on branch 1.5.x (REGR: Raise on invalid colormap for scatter plot) (pandas-dev#48744)

Backport PR pandas-dev#48734: REGR: Raise on invalid colormap for scatter plot

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48713 on branch 1.5.x (BUG: pivot_table raising Future Warning with datetime column as index) (pandas-dev#48742)

Backport PR pandas-dev#48713: BUG: pivot_table raising Future Warning with datetime column as index

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48703 on branch 1.5.x (Added theme_switcher ) (pandas-dev#48741)

Backport PR pandas-dev#48703: Added theme_switcher

Co-authored-by: Deepak Sirohiwal <38135521+deepaksirohiwal@users.noreply.github.com>

* Backport PR pandas-dev#48697 on branch 1.5.x (REGR: None converted to NaN when enlarging Series) (pandas-dev#48745)

Backport PR pandas-dev#48697: REGR: None converted to NaN when enlarging Series

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48702 on branch 1.5.x (REGR: dropna affects observed in groupby) (pandas-dev#48750)

Backport PR pandas-dev#48702: REGR: dropna affects observed in groupby

Co-authored-by: Richard Shadrach <45562402+rhshadrach@users.noreply.github.com>

* Backport PR pandas-dev#48782 on branch 1.5.x (REGR: describe raising when result contains NA) (pandas-dev#48793)

Backport PR pandas-dev#48782: REGR: describe raising when result contains NA

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48662 on branch 1.5.x (BUG: Series.getitem not falling back to positional for bool index) (pandas-dev#48799)

Backport PR pandas-dev#48662: BUG: Series.getitem not falling back to positional for bool index

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48751 on branch 1.5.x (WEB: Update link to datapythonista blog url) (pandas-dev#48798)

Backport PR pandas-dev#48751: WEB: Update link to datapythonista blog url

Co-authored-by: Marc Garcia <garcia.marc@gmail.com>

* Backport PR pandas-dev#48608 on branch 1.5.x (REGR: assert_index_equal raising with non matching pd.NA) (pandas-dev#48800)

* Backport PR pandas-dev#48608: REGR: assert_index_equal raising with non matching pd.NA

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48785 on branch 1.5.x (BUG: still emitting unnecessary FutureWarning in DataFrame.sort_values with sparse columns) (pandas-dev#48807)

Backport PR pandas-dev#48785: BUG: still emitting unnecessary FutureWarning in DataFrame.sort_values with sparse columns

Co-authored-by: Marco Edward Gorelli <33491632+MarcoGorelli@users.noreply.github.com>

* Backport PR pandas-dev#48693 on branch 1.5.x (ENH: Make deprecate_nonkeyword_arguments alter function signature) (pandas-dev#48795)

Backport PR pandas-dev#48693: ENH: Make deprecate_nonkeyword_arguments alter function signature

Co-authored-by: Shantanu <12621235+hauntsaninja@users.noreply.github.com>

* Backport PR pandas-dev#48579 on branch 1.5.x (BUG: Fix calling groupBy(...).apply(func) on an empty dataframe invokes func) (pandas-dev#48817)

BUG: Fix calling groupBy(...).apply(func) on an empty dataframe invokes func  (pandas-dev#48579)

(cherry picked from commit 8b0ad71)

Co-authored-by: Dennis Chukwunta <chuksmcdennis@yahoo.com>

* Backport PR pandas-dev#48760 on branch 1.5.x (REGR: groupby.size with axis=1 doesn't return a Series) (pandas-dev#48825)

* Backport PR pandas-dev#48820 on branch 1.5.x (BUG: to_datetime(format='...%f') parses nanoseconds) (pandas-dev#48860)

BUG: to_datetime(format='...%f') parses nanoseconds (pandas-dev#48820)

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR pandas-dev#48850 on branch 1.5.x (TYP: Fix typing errors caused by new numpy) (pandas-dev#48859)

* Backport PR pandas-dev#48790 on branch 1.5.x (Created icons for dark theme) (pandas-dev#48875)

Backport PR pandas-dev#48790: Created icons for dark theme

Co-authored-by: Lorenzo Vainigli <lorenzovngl@gmail.com>

* Backport PR pandas-dev#48805 on branch 1.5.x (Added padding and fixed columns for sponsor logos in mobile view) (pandas-dev#48874)

Backport PR pandas-dev#48805: Added padding and fixed columns for sponsor logos in mobile view

Co-authored-by: Amay Patel <92037532+amay-patel@users.noreply.github.com>

* Backport PR pandas-dev#48866 on branch 1.5.x (REGR: replace replacing wrong values with inplace and datetime) (pandas-dev#48872)

Backport PR pandas-dev#48866: REGR: replace replacing wrong values with inplace and datetime

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48853 on branch 1.5.x (REGR: Avoid unnecessary warning when setting empty dataframe) (pandas-dev#48873)

Backport PR pandas-dev#48853: REGR: Avoid unnecessary warning when setting empty dataframe

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

* Backport PR pandas-dev#48833 on branch 1.5.x (BUG: ArrowExtensionArray compared to invalid object not raising) (pandas-dev#48878)

Backport PR pandas-dev#48833: BUG: ArrowExtensionArray compared to invalid object not raising

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR pandas-dev#48686 on branch 1.5.x (BUG: to_datetime(tz_mix, utc=True) converts to UTC) (pandas-dev#48882)

Backport PR pandas-dev#48686: BUG: to_datetime(tz_mix, utc=True) converts to UTC

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* Backport PR pandas-dev#48736 on branch 1.5.x ( BUG: AttributeError: 'function' object has no attribute 'currentframe') (pandas-dev#48887)

* Backport PR pandas-dev#48797 on branch 1.5.x (REGR: fix df.apply with keyword non-zero axis) (pandas-dev#48886)

REGR: fix df.apply with keyword non-zero axis (pandas-dev#48797)

Co-authored-by: Shantanu <12621235+hauntsaninja@users.noreply.github.com>

* add pandas[foo] throughout to explain the optional extras process

Also rename `Recommended dependencies` to `Performance dependencies (recommended)`

* extend optional_extra `all` to cover tests and redundant packages

* add tzdata from pandas 1.5.0

* move summary to latest whatsnew doc

Code is now stable and updated.

Attempt a port from doc/source/whatsnew/v1.5.0.rst over to latest document doc/source/whatsnew/v1.5.1.rst.

* explicitly pair packages to optional_extra in installs

* fix sphinx errors in install.rst

extra padding on column + empty return before table

* add: pytest-asyncio>=0.19.0

pytest-asyncio>=0.19.0

Closes pandas-dev#48361.

No version guidance available so went for latest.  No-one should be messing around with tests without modern installs.

* fsspec note

* repin pytest-asyncio>=0.17.0

matches pytest-asyncio>=0.17.0 in ci/deps/actions-38-minimum_versions.yaml

* move summary from whatsnew/v1.5.1 to v1.6.0

plus add (:issue:`48361`) resolution note

* linting

double-backtick ``test``
underline tilde match title length

* Add `clipboard` as optional extra

* Review comments

 - fss optional extra
 - better numba description

* remove unneeded comment

* fix: leave 1.5.1.rst unchanged by this PR

* Update doc/source/whatsnew/v1.5.1.rst

Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>

* pyarrow bump 1.0.1 -> 6.0.0

Co-authored-by: Matthew Roeschke <emailformattr@gmail.com>
Co-authored-by: Simon Hawkins <simonjayhawkins@gmail.com>
Co-authored-by: MeeseeksMachine <39504233+meeseeksmachine@users.noreply.github.com>
Co-authored-by: Matthew Roeschke <10647082+mroeschke@users.noreply.github.com>
Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>
Co-authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>
Co-authored-by: Richard Shadrach <45562402+rhshadrach@users.noreply.github.com>
Co-authored-by: jbrockmendel <jbrockmendel@gmail.com>
Co-authored-by: Marc Garcia <garcia.marc@gmail.com>
Co-authored-by: Kevin Sheppard <bashtage@users.noreply.github.com>
Co-authored-by: RaphSku <45042665+RaphSku@users.noreply.github.com>
Co-authored-by: Jonas Haag <jonas@lophus.org>
Co-authored-by: Janosh Riebesell <janosh.riebesell@gmail.com>
Co-authored-by: Pandas Development Team <pandas-dev@python.org>
Co-authored-by: Torsten Wörtwein <twoertwein@users.noreply.github.com>
Co-authored-by: Deepak Sirohiwal <38135521+deepaksirohiwal@users.noreply.github.com>
Co-authored-by: Marco Edward Gorelli <33491632+MarcoGorelli@users.noreply.github.com>
Co-authored-by: Shantanu <12621235+hauntsaninja@users.noreply.github.com>
Co-authored-by: Dennis Chukwunta <chuksmcdennis@yahoo.com>
Co-authored-by: Lorenzo Vainigli <lorenzovngl@gmail.com>
Co-authored-by: Amay Patel <92037532+amay-patel@users.noreply.github.com>
@rendner
Contributor

rendner commented Nov 30, 2022

@JMBurley

> I'm honestly not sure if Jinja2 is needed for any pandas functionality? (Jinja2 not imported anywhere in pandas).

jinja2 is required if you use the pandas Styler, which is why it does not show up as a top-level import. It is loaded lazily, as you can see here: `import_optional_dependency("jinja2", extra="DataFrame.style requires jinja2.")`
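For readers unfamiliar with the pattern, here is a minimal sketch of a lazy optional import. The helper `import_optional` and the package name `not_a_real_package_for_demo` are hypothetical and only for illustration. This is not pandas' actual `import_optional_dependency`, which does more (version checks, for example). The point is that the dependency is resolved only when the feature is used, so a plain search for a top-level `import jinja2` can miss it.

```python
import importlib


def import_optional(name: str, extra: str = ""):
    """Import an optional module, or raise ImportError with a helpful hint.

    Hypothetical helper for illustration; not pandas' own function.
    """
    try:
        return importlib.import_module(name)
    except ImportError as err:
        # Attach the caller-supplied hint so the user learns which feature needs it.
        message = f"Missing optional dependency '{name}'. {extra}".strip()
        raise ImportError(message) from err


# A module that is present imports normally.
json_module = import_optional("json")

# A module that is absent fails only at the point of use, with the hint attached.
try:
    import_optional(
        "not_a_real_package_for_demo",
        extra="Feature X requires not_a_real_package_for_demo.",
    )
except ImportError as err:
    error_text = str(err)
```

With this shape, installing the matching extra (or the package directly) is what makes the call succeed; nothing fails at `import` time of the main library.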


10 participants