BUG: DataFrame arithmetic operators don't work with Series using fill_value #61828

eicchen · 2025-07-10T22:54:01Z

closes BUG: pd.DataFrame.mul has not support fill_value? #61581
Tests added and passed if fixing a bug or adding a new feature
All code checks passed.
Added type annotations to new arguments/methods/functions.
Added an entry in the latest doc/source/whatsnew/vX.X.X.rst file if fixing a bug or adding a new feature.

Removed a test that checked for an expected error, along with a corner case. Added a test case covering multiple operators for DataFrame x Series operations with fill_value.
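For readers unfamiliar with the intended semantics, here is a minimal sketch of what a DataFrame x Series multiplication with fill_value is expected to produce. It does not use the code from this PR: the helper name mul_series_with_fill is hypothetical, the sample data is made up, and the sketch sidesteps the Series path by tiling the Series into a DataFrame by hand, since DataFrame x DataFrame already accepts fill_value.

```python
import numpy as np
import pandas as pd


def mul_series_with_fill(df, ser, fill_value):
    """Hypothetical helper (not part of pandas or of this PR).

    Multiplies each row of ``df`` by ``ser`` (aligned on the columns),
    substituting ``fill_value`` where exactly one operand is missing.
    """
    # Align the Series to the frame's columns, then repeat it once per row.
    row = ser.reindex(df.columns).to_numpy(dtype="float64")
    ser_as_frame = pd.DataFrame(
        np.tile(row, (len(df), 1)), index=df.index, columns=df.columns
    )
    # DataFrame x DataFrame supports fill_value; positions where both
    # operands are NaN stay NaN.
    return df.mul(ser_as_frame, fill_value=fill_value)


# Made-up sample data for illustration only.
df = pd.DataFrame({"a": [1.0, np.nan, 3.0], "b": [np.nan, 5.0, 6.0]})
ser = pd.Series({"a": 10.0, "b": np.nan})

result = mul_series_with_fill(df, ser, fill_value=1.0)
print(result)
```

In column a the Series has a value, so the missing entry in df is filled before multiplying. In column b the Series entry is NaN, so rows where df has a value use fill_value on the Series side, while the row where both operands are NaN stays NaN.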


jbrockmendel · 2025-07-11T14:54:03Z

pandas/tests/frame/test_arithmetic.py

+
+
+@pytest.mark.parametrize("op", ["add", "sub", "mul", "div", "mod", "truediv", "pow"])
+def test_df_series_fill_value(op):


I'll need to take a closer look at this, just because I'm really skeptical that the fix is this easy.

OK, I think the trouble is that in _maybe_align_series_as_frame we broadcast the 1D object to 2D for numpy dtypes, but not for EA dtypes. So can you add a test for non-numpy dtypes and see how it goes?

You're correct about the EAs. I'll update the test cases and then change the function to work with other EA types (mainly the ones that work with operators, int and float I believe).
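The numpy-versus-EA distinction raised above can be illustrated with a small sketch. It only uses public pandas helpers and does not reproduce the internals of _maybe_align_series_as_frame; the variable names and sample values are made up for illustration.

```python
import pandas as pd
from pandas.api.types import is_extension_array_dtype

numpy_ser = pd.Series([1, 2, 3], dtype="int64")   # numpy-backed dtype
masked_ser = pd.Series([1, 2, 3], dtype="Int64")  # masked extension dtype

# Public check for whether a dtype is an extension-array dtype.
numpy_is_ea = is_extension_array_dtype(numpy_ser.dtype)
masked_is_ea = is_extension_array_dtype(masked_ser.dtype)

# A numpy-backed Series can be viewed as a 2D row directly,
# which is what makes broadcasting against a frame straightforward.
numpy_2d = numpy_ser.to_numpy().reshape(1, -1)

# As stored in a Series, the masked array is one-dimensional.
masked_ndim = masked_ser.array.ndim

print(numpy_is_ea, masked_is_ea, numpy_2d.shape, masked_ndim)
```

A test parametrized over both kinds of dtype, as requested in the review, would exercise both code paths.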

pandas/tests/frame/test_arithmetic.py

eicchen · 2025-07-15T18:49:30Z

I'm closing the PR for now until the additional fixes for EA are deployed.

eicchen · 2025-08-19T21:49:44Z

Reopened to talk about fixes for this specific issue before I get sidetracked by 1D operations again (ignore all the failed checks for now)

test.py

test2.py

jbrockmendel · 2025-08-20T16:59:35Z

The appropriate fix is going to be in _maybe_align_series_as_frame

eicchen · 2025-08-20T21:16:09Z

> The appropriate fix is going to be in _maybe_align_series_as_frame

So this was what I was working on locally and had questions about. I was able to reshape EAs in _maybe_align_series_as_frame and am still working through various places to get the operation smoothed out. But I feel this deviates from the original issue, which is only about fill_value. As far as I can tell the two are unrelated, so we should probably file this under a separate issue and mark the original closed for bookkeeping.

I can add another test case for the dtype test that wouldn't require 2D EA operations.

(There was originally a bunch of brain spew about issues I was currently having, but I'll organize it before reposting if needed.)

pandas/tests/frame/test_arithmetic.py

eicchen · 2025-08-20T23:40:58Z

Just making sure, do you agree with splitting the 1D part off?

…on-1D EAs

pandas/core/frame.py

eicchen · 2025-08-21T20:45:36Z

It looks like the change might have inadvertently changed some behavior that I don't know if I should keep or not.

It reverts the error message that is expected in the test_period_add_timestamp_raises test back to what it was pre-resolution-inference according to your comment from a year ago.

It also makes test_add_strings in test_string.py pass, rather than xfail as it was supposed to. Unfortunately test_add_frame still fails, so I don't know whether I should purposefully break it to keep the two behaviors in line with each other. I read the linked issue (#28527) but don't think there was a consensus.

jbrockmendel · 2025-08-21T21:07:56Z

What's the updated exception message for the Period one?

Fixing xfailed tests is a good thing.

eicchen · 2025-08-21T21:41:57Z

It is now "cannot add PeriodArray and DatetimeArray", which is in line with what it is for everything else.

Here's the code snippet I modified.

However, contrary to my earlier statement, add_to_frame doesn't consistently pass as xfail on the pipeline; some jobs fail while others don't. It works as expected locally, so I'm not sure how best to debug this properly. Do you have any advice?

jbrockmendel · 2025-08-22T03:39:02Z

Can you remove the xfail and let’s see how the CI does

eicchen · 2025-08-23T17:29:39Z

> Can you remove the xfail and let’s see how the CI does

So interestingly, it now seems to pass on the jobs where it previously failed and fail on the ones where it previously passed. Do you know if there is a significant difference between that subset of jobs (Freethreading, Numpy Dev, Linux-32-bit, Linux-Musl, Pyodide, and Without PyArrow) and the others? Alternatively, I can carve out StringArray for now and investigate it as a separate issue.

…t catch for invalid messages

jbrockmendel · 2025-08-25T18:28:56Z

pandas/core/arrays/arrow/array.py

-        other = self._box_pa(other)
+        other_NA = self._box_pa(other)
+        # pyarrow gets upset if you try to join a NullArray
+        other = other_NA.cast(pa_type)


is it obvious this is always right? e.g. what if self is pa.timestamp("us") and other is pa.int64()?

That's fair, I did try to only check for NullArrays, but that returned the error about how it couldn't concatenate the frame in the original add_to_frame testcase.

We could circumvent that by casting the initial df as an object but I didn't want to mess with the test case because I didn't know if that was something it was testing for.

Alternatively, I can just reimplement a check and check for dtypes we'd want to let go through

looks like this is the cause of a bunch of test failures:

FAILED pandas/tests/extension/test_arrow.py::test_arithmetic_temporal[pa_type11] - pyarrow.lib.ArrowNotImplementedError: Unsupported cast from duration[us] to timestamp using function cast_timestamp

are you running the tests locally before committing/pushing?

I only ran the array folder because the full suite takes a lot of time. I'll be sure to run the full thing going forward. That's on me.

I'll add official test cases once the build clears CI, given the weird tack-on nature of this bug fix. Just from some local testing, it looks like there is already a preexisting error message for trying to use the add operation on dtypes like Datetime and TimeDelta.

That being said, it looks like the CI is throwing errors on some of the builds but not others again, and what do you know, they're not replicated on my local machine. Would you know who I could talk to to figure out why that is?

if the exception message in a test needs to be updated thats fine as long as the new one makes sense.

Sounds good to me.

Any pointers for the CI, or should I ask about it during the meeting tomorrow?

I haven't looked too closely, but the CI failures all look like cases of "the test needs to be updated to check for the new exception message".

none of the edits to the ArrowEA are necessary, nor is the special-casing for Period.
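The "update the test to check for the new exception message" step mentioned above can be sketched as follows. pytest.raises(..., match=...) tests its pattern against the string form of the exception with re.search, so the sketch reproduces that check with the standard library only. The message string is the one quoted earlier in this thread; add_period_and_datetime and raises_with_message are hypothetical stand-ins, not pandas or pytest functions.

```python
import re

# New message reported earlier in this thread.
NEW_MESSAGE = "cannot add PeriodArray and DatetimeArray"


def add_period_and_datetime():
    """Hypothetical stand-in for the operation under test."""
    raise TypeError(NEW_MESSAGE)


def raises_with_message(func, exc_type, pattern):
    """Return True if ``func`` raises ``exc_type`` and ``pattern`` is found
    in the exception text (the same kind of regex search pytest applies)."""
    try:
        func()
    except exc_type as err:
        return re.search(pattern, str(err)) is not None
    return False


# The updated expectation matches; an unrelated pattern does not.
matches_new = raises_with_message(
    add_period_and_datetime, TypeError, "cannot add PeriodArray and DatetimeArray"
)
matches_other = raises_with_message(
    add_period_and_datetime, TypeError, "some unrelated message"
)
print(matches_new, matches_other)
```

Because the pattern is a regular expression, messages containing characters such as brackets or parentheses would need re.escape before being used as the match argument.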

The issue is that 2/3 of the unit tests succeed as-is, so it doesn't make sense that only 7 are failing, especially since the error is about a float being concatenated with a string, which all the other builds are able to do. My guess was that something is different about their setup process.

pandas/core/frame.py

…rray and timedelta catch

eicchen · 2025-08-27T17:43:51Z

I modified ArrowEA to address the xfail issue in add_to_frame across environments. With the changes, add_to_string passes, but add_to_frame behaves inconsistently on some CIs (occasionally passing despite xfail). If preferred, I can revert the edits, though that would leave the inconsistency still.

jbrockmendel · 2025-08-27T17:55:49Z

When I tried this locally I didn't need to modify ArrowEA at all. What breaks without that change?

eicchen · 2025-08-27T18:03:21Z

Before modifying ArrowEA, these were the failing tests:

pandas/tests/arrays/string_/test_string.py::test_add_frame[string=string[python]]
pandas/tests/arrays/string_/test_string.py::test_add_frame[string=str[python]]
(Job link: https://github.com/pandas-dev/pandas/actions/runs/17139673229/job/48624140109)

These tests were expected to fail but did not. I was unable to replicate the failures locally, and most CI runs did not encounter the issue; it appeared only in a small subset. I modified ArrowEA to reconcile these differences, but the same CI runs are still encountering issues.

eicchen added 5 commits July 10, 2025 15:42

Initial test case

a9c8d85

Updated test case to account for results of mul being NaN if both inputs are NaN

f303a04

Removed test cases which expect an error from fill_value

5ac26a4

Updated test case to include other operators which included fill_value

a60fbb0

Removed restriction on using fill_value with series

87ecfc4

Updated docs

jbrockmendel reviewed Jul 11, 2025

pandas/tests/frame/test_arithmetic.py (outdated, resolved)

eicchen closed this Jul 15, 2025

Included PR suggestions, added separate dtype test (WIP)

bc805fd

eicchen mentioned this pull request Jul 15, 2025

BUG: Operations not implemented for non-1D ExtensionArrays #61866

Closed

3 tasks

temp files

be09616

Nadav-Zilberberg mentioned this pull request Jul 16, 2025

BUG: Operations not implemented for non-1D ExtensionArrays Nadav-Zilberberg/pandas-test#20

Open

eicchen and others added 2 commits August 18, 2025 16:46

Added test case to test EA and NUMPY dtypes

1ebcf6e

Merge branch 'pandas-dev:main' into BUG-pandas-dev#61581-DataFrame.mul

98fb07f

eicchen reopened this Aug 19, 2025