ENH: Adding table name parameter to pandas.read_excel #58464

iangainey · 2024-04-28T17:02:20Z

Closes ENH: pd.read_excel with table parameter #38937
Tests added and passed if fixing a bug or adding a new feature
All code checks passed.
Added type annotations to new arguments/methods/functions.
Added an entry in the latest doc/source/whatsnew/vX.X.X.rst file if fixing a bug or adding a new feature.
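For context, here is a minimal sketch of what reading a named Excel table can look like by using openpyxl directly and handing the cell values to pandas. It is not the implementation in this PR: the helper name `read_excel_table`, the table name `Fruit`, and the sample data are all assumptions made for illustration, and the sketch assumes the table has a header row.

```python
from io import BytesIO

import pandas as pd
from openpyxl import Workbook, load_workbook
from openpyxl.worksheet.table import Table


def read_excel_table(source, table_name):
    """Hypothetical helper (not a pandas API): load a named Excel table."""
    wb = load_workbook(source, data_only=True)
    for ws in wb.worksheets:
        if table_name in ws.tables:
            ref = ws.tables[table_name].ref  # cell range, e.g. "B2:C4"
            rows = [[cell.value for cell in row] for row in ws[ref]]
            # Assumption: the first row of the range is the header row.
            return pd.DataFrame(rows[1:], columns=rows[0])
    raise ValueError(f"Table {table_name!r} not found")


# Build a small workbook in memory so the sketch is self-contained.
wb = Workbook()
ws = wb.active
data = [["name", "qty"], ["apple", 3], ["pear", 5]]
for r, row in enumerate(data, start=2):
    for c, value in enumerate(row, start=2):
        ws.cell(row=r, column=c, value=value)
ws.add_table(Table(displayName="Fruit", ref="B2:C4"))

buf = BytesIO()
wb.save(buf)
buf.seek(0)

df = read_excel_table(buf, "Fruit")
print(df)
```

Because the table is located by name, the caller does not need to know which sheet it lives on or where on the sheet it starts.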


iangainey · 2024-04-28T19:26:53Z

As far as I can tell, this should pass all of the tests. Some pre-commit hooks are failing in this PR, but several of the failures make no sense to me: they report that an attribute does not exist when it does, and there are many "not indexable" errors on objects that are indexable and were existing functionality before I opened this PR, so I'm not certain why they fail.
Any guidance on getting these to pass would be great.

rhshadrach

Thanks for the PR. This issue is tagged as Needs Discussion, and I am hesitant to add this functionality to pandas' already complicated Excel reading code. This is especially true because, as the implementation currently stands, it's only supported by openpyxl.

That said, it seems to me that moving the code in parse to its own method (currently your parse_multiindex) is a good cleanup anyway (without the reading tables feature), and it would simplify the diff here. Would you consider doing that refactor as a precursor?

I'm also curious why it is named parse_multiindex, since (I think) there isn't necessarily a MultiIndex. I would suggest parse_sheet instead.
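To illustrate the shape of the suggested refactor, here is a toy sketch of pulling per-sheet logic out of a `parse` method into its own helper. It is not pandas' actual code: the class `ToyExcelReader`, the in-memory sheet data, and the helper `_parse_sheet` are hypothetical stand-ins.

```python
class ToyExcelReader:
    """Toy stand-in for an Excel reader; not pandas' actual class."""

    def __init__(self, sheets):
        # sheets: mapping of sheet name -> list of rows (first row is the header)
        self._sheets = sheets

    def parse(self, sheet_names):
        # parse() only loops over the requested sheets and delegates the work.
        return {name: self._parse_sheet(self._sheets[name]) for name in sheet_names}

    def _parse_sheet(self, rows):
        # Per-sheet logic lives in one place, so a later feature (for example,
        # reading only a table's cell range) could reuse it without touching parse().
        header, *body = rows
        return [dict(zip(header, row)) for row in body]


reader = ToyExcelReader({"Sheet1": [["a", "b"], [1, 2], [3, 4]]})
result = reader.parse(["Sheet1"])
print(result)
```

With the per-sheet work isolated this way, a follow-up change would only need to touch the helper, which is what keeps the later diff small.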

mroeschke · 2024-04-30T16:52:04Z

Agreed with the hesitancy to implement this feature as of now. Going to close since I think more buy-in needs to happen in the issue first, but I'm happy to have a separate PR with the cleanups.

iangainey · 2024-04-30T17:08:05Z

I guess I'm a bit confused, as I had tried to get more of a discussion going on the issue but could not get any response from you (the development team), only from other contributors. The functionality only adds complexity when the user wants to read in tables, and there is plenty of other functionality in the Excel IO area of pandas that only works with one of the engines. How does getting more buy-in happen? @mroeschke @rhshadrach

rhshadrach · 2024-04-30T21:00:18Z

The functionality only adds complexity when the user wants to read in tables

Each feature adds a certain amount of technical debt: it increases the surface area of pandas that requires tests, documentation, and handling of new bug reports. It would be unmaintainable to accept all feature requests; therefore, we must carefully evaluate which features we do accept.

there is plenty of other functionality in the Excel IO area of pandas that only works with one of the engines.

This is something we would like to reduce.

That said - I would take a look at the diff here after #58497 is merged.

iangainey · 2024-05-01T01:13:07Z

Thank you for the explanations; those certainly make sense.
Regarding looking at the diff here after #58497 is merged: since I can't update this PR and wanted to improve it and resolve any errors, I opened a draft PR, #58500, that will have a more accurate diff, if you would be so kind as to look at that.
Apologies for the closed #58499. I had initially set up my work on this project incorrectly and was working on the main branch of my fork. I have now split the changes out into proper branches and linked them with the right PRs.

Please let me know any thoughts, issues, or improvements. I would like to understand the discussion about whether, and how, this functionality should look in pandas.

rhshadrach · 2024-05-01T21:02:58Z

For the future - we can reopen the PR. But let's continue this in #58500.

iangainey added 2 commits April 28, 2024 10:41

Initial commit for PR

c14b536

Added whats new enhancement

396d0a6

iangainey requested a review from rhshadrach as a code owner April 28, 2024 17:02

iangainey and others added 8 commits April 28, 2024 13:05

Merge branch 'main' into main

d8f8bdf

Resolving commit checks

ff5b722

Merge branch 'main' of https://github.com/iangainey/pandas

6c948cb

Removing local testing comments

182627e

Attempting to resolve ruff-format failure that is not occurring on my local pre-commit

6c17196

Resolving typing and docstring manual pre-commit errors

d036bb5

Removing type hints considering typing checks doesn't like it

ea15154

These errors make no sense and contradict what I am seeing

748d90f

rhshadrach requested changes Apr 29, 2024


mroeschke closed this Apr 30, 2024

This was referenced May 1, 2024

ENH: Adding table name parameter to pandas.read_excel #58499

Closed

ENH: Add table name parameter to pandas.read_excel #58500

Closed
