BUG: fillna with DataFrame input should preserve dtype when possible #61742

iabhi4 · 2025-06-29T19:26:42Z

When filling a DataFrame with another DataFrame using fillna, columns with matching dtypes were being unnecessarily cast to object due to use of np.where.

This PR updates the logic to use pandas’ Series.where, which is dtype-safe and respects extension and datetime types.

Closes BUG: Inconsistent behavior surrounding pd.fillna #61568
Adds a regression test
Adds a whatsnew entry
pre-commit checks passed

…aFrame

iabhi4 · 2025-06-29T22:29:22Z

Since we now operate column-wise and use Series.where instead of np.where, so it keeps dtype safety as suggested by @jbrockmendel

This also preserves extension dtypes like string[pyarrow], which used to get cast to object. Because of that, test_fillna_dataframe_preserves_dtypes_mixed_columns is failing since it expects the downgraded dtype.

Let me know if this behavior change is fine, happy to update the test or tweak the logic based on what’s preferred!

jbrockmendel · 2025-06-30T14:46:57Z

pandas/core/generic.py

+                    # restore original dtype if fallback to object occurred
+                    if lhs.dtype == rhs.dtype and filled.dtype == object:
+                        try:
+                            filled = filled.astype(lhs.dtype)


id expect this to be handled by Series.where. is it not?

jbrockmendel · 2025-06-30T14:48:04Z

pandas/core/generic.py

@@ -7145,7 +7145,24 @@ def fillna(
            else:
                new_data = self._mgr.fillna(value=value, limit=limit, inplace=inplace)
        elif isinstance(value, ABCDataFrame) and self.ndim == 2:
-            new_data = self.where(self.notna(), value)._mgr
+            filled_columns = {}
+            for col in self.columns:


doing this column-by-column is going to mean a performance hit for non-object cases. i suspect we need to do this at the Block level in order to avoid that

BUG: Preserve dtype in DataFrame.fillna when filling from another Dat…

9c5e29a

…aFrame

simonjayhawkins added Bug Dtype Conversions Unexpected or buggy dtype conversions labels Jun 30, 2025

jbrockmendel reviewed Jun 30, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

BUG: fillna with DataFrame input should preserve dtype when possible #61742

BUG: fillna with DataFrame input should preserve dtype when possible #61742

Uh oh!

iabhi4 commented Jun 29, 2025

Uh oh!

iabhi4 commented Jun 29, 2025

Uh oh!

jbrockmendel Jun 30, 2025

Uh oh!

jbrockmendel Jun 30, 2025

Uh oh!

Uh oh!

Uh oh!

BUG: fillna with DataFrame input should preserve dtype when possible #61742

Are you sure you want to change the base?

BUG: fillna with DataFrame input should preserve dtype when possible #61742

Uh oh!

Conversation

iabhi4 commented Jun 29, 2025

Uh oh!

iabhi4 commented Jun 29, 2025

Uh oh!

jbrockmendel Jun 30, 2025

Choose a reason for hiding this comment

Uh oh!

jbrockmendel Jun 30, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!