BUG: Inconsistent conversion of missing column names #44878

johnzangwill · 2021-12-14T12:17:33Z

closes BUG: Inconsistent conversion of missing column names #44818
tests changed / passed
Ensure all linting tests pass, see here for how to run them
whatsnew entry

Factored out multiple occurrences of the missing-name logic into a common method.
DataFrame.to_records() changed to use the common method rather than the missing-value count when filling missing names.
Index.to_frame() left alone for the time being.

import pandas as pd
f = pd.DataFrame([1], index=pd.MultiIndex.from_arrays([[2],[3],[4]], names=[None, "a", None]))

Old behavior:

>>> pd.DataFrame(f.to_records())
   level_0  a  level_1  0
0        2  3        4  1

New behavior:

>>> pd.DataFrame(f.to_records())
   level_0  a  level_2  0
0        2  3        4  1
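
The difference between the two outputs above comes down to how `n` in `level_n` is chosen. The following is a minimal sketch, not the code from this PR: the helper names `fill_by_position` and `fill_by_missing_count` are hypothetical, and the second one only imitates the old "missing value count" behaviour described above.

```python
from typing import Hashable, List, Optional, Sequence


def fill_by_position(names: Sequence[Optional[Hashable]]) -> List[Hashable]:
    """Replace each missing (None) name by level_n, where n is the name's position.

    Hypothetical helper illustrating the new, position-based behaviour.
    """
    return [f"level_{i}" if name is None else name for i, name in enumerate(names)]


def fill_by_missing_count(names: Sequence[Optional[Hashable]]) -> List[Hashable]:
    """Replace each missing name by level_n, where n counts the missing names so far.

    Hypothetical helper imitating the old to_records() behaviour.
    """
    filled: List[Hashable] = []
    missing = 0
    for name in names:
        if name is None:
            filled.append(f"level_{missing}")
            missing += 1
        else:
            filled.append(name)
    return filled


names = [None, "a", None]
by_position = fill_by_position(names)    # third name becomes level_2
by_count = fill_by_missing_count(names)  # third name becomes level_1
print(by_position)
print(by_count)
```

With position-based filling, a generated name always says which level it came from, no matter how many of the other levels are named.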

phofl

small comments, otherwise lgtm

phofl · 2021-12-14T15:57:43Z

pandas/tests/frame/methods/test_to_records.py

        df.index.names = ["A", None]
        rs = df.to_records()
-        assert "level_0" in rs.dtype.fields
+        assert "level_1" in rs.dtype.fields


Could you maybe add a test checking the whole result?

Cruel! But done.
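
For illustration, a whole-result check along the lines requested above might look like the sketch below. It is not the test added in this PR; the frame contents are made up, and it assumes pandas 1.4 or later, where the fix is in place.

```python
import pandas as pd

# Small frame with a two-level index; the second level is left unnamed.
df = pd.DataFrame(
    [[1, 2], [3, 4], [5, 6]],
    index=pd.MultiIndex.from_tuples([("a", "x"), ("a", "y"), ("b", "z")]),
)
df.index.names = ["A", None]

rs = df.to_records()

# Check every field name, not only the one that was filled in.
field_names = rs.dtype.names
assert field_names == ("A", "level_1", "0", "1")

# Check the data carried by each field as well.
assert rs["A"].tolist() == ["a", "a", "b"]
assert rs["level_1"].tolist() == ["x", "y", "z"]
assert rs["0"].tolist() == [1, 3, 5]
assert rs["1"].tolist() == [2, 4, 6]
```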

phofl · 2021-12-14T15:57:58Z

doc/source/whatsnew/v1.4.0.rst

 - Bug in :meth:`DataFrame.shift` with ``axis=1`` and ``ExtensionDtype`` columns incorrectly raising when an incompatible ``fill_value`` is passed (:issue:`44564`)
 - Bug in :meth:`DataFrame.diff` when passing a NumPy integer object instead of an ``int`` object (:issue:`44572`)
 - Bug in :meth:`Series.replace` raising ``ValueError`` when using ``regex=True`` with a :class:`Series` containing ``np.nan`` values (:issue:`43344`)
+- Bug in :meth:`DataFrame.to_records` missing names filled incorrectly (:issue:`44818`)


Could you specify a bit more?

jreback

small comments, not 100% sold on the location (maybe @jbrockmendel can think of a better one), but ok. ping on green.

jreback · 2021-12-15T22:58:38Z

doc/source/whatsnew/v1.4.0.rst

 - Bug in :meth:`DataFrame.shift` with ``axis=1`` and ``ExtensionDtype`` columns incorrectly raising when an incompatible ``fill_value`` is passed (:issue:`44564`)
 - Bug in :meth:`DataFrame.diff` when passing a NumPy integer object instead of an ``int`` object (:issue:`44572`)
 - Bug in :meth:`Series.replace` raising ``ValueError`` when using ``regex=True`` with a :class:`Series` containing ``np.nan`` values (:issue:`43344`)
+- Bug in :meth:`DataFrame.to_records` where an incorrect n was used when missing names were replaced by level_n (:issue:`44818`)


can you add backticks around the n and level_n

Done

jreback · 2021-12-15T22:58:48Z

pandas/core/common.py

    return _builtin_table.get(arg, arg)
+
+
+def fill_missing_names(names):


can you type in and out here

Done, and added version and more docstring.

jbrockmendel · 2021-12-16T00:11:53Z

test location makes sense to me

pep8speaks · 2021-12-16T13:04:56Z

Hello @johnzangwill! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:

There are currently no PEP 8 issues detected in this Pull Request. Cheers! 🍻

Comment last updated at 2021-12-16 13:28:03 UTC

johnzangwill · 2021-12-16T13:29:06Z

not 100% sold on the location (maybe @jbrockmendel can think of a better one)

I suspect that @jreback was referring to the fact that I factored the logic into core/common.py.
It seemed a good place to me. My method is referenced in:

core/frame.py
core/indexes/multi.py
io/pytables.py
io/sql.py
io/json/_table_schema.py

and core/common is already imported as com into every one.
But if you have a better suggestion...

jreback · 2021-12-16T20:33:43Z

thanks @johnzangwill very nice!

johnzangwill added 5 commits December 12, 2021 10:41

Factor out existing occurrences

0959372

Don't change index.to_frame

276bdff

Merge branch 'pandas-dev:master' into fill-missing-names

8a4879a

Update v1.4.0.rst

10bcdc4

Add docstring

e2a1247

johnzangwill marked this pull request as draft December 14, 2021 13:39

johnzangwill added 2 commits December 14, 2021 13:50

Update multi.py

aa6374a

Merge branch 'pandas-dev:master' into fill-missing-names

b2c4b92

johnzangwill marked this pull request as ready for review December 14, 2021 15:48

phofl reviewed Dec 14, 2021

johnzangwill added 2 commits December 14, 2021 18:29

Improve whatsnew and test

a6c2109

Merge branch 'pandas-dev:master' into fill-missing-names

314b777

johnzangwill requested a review from phofl December 14, 2021 18:59

phofl approved these changes Dec 14, 2021

johnzangwill added 3 commits December 14, 2021 23:33

Trigger CI

c2d9668

Trigger CI

94a86a0

Merge branch 'pandas-dev:master' into fill-missing-names

4f569ad

jreback requested changes Dec 15, 2021

jreback added the Reshaping (Concat, Merge/Join, Stack/Unstack, Explode) label Dec 15, 2021

jreback added this to the 1.4 milestone Dec 15, 2021

johnzangwill added 2 commits December 16, 2021 13:04

Add types and improve docstring

f311e49

Merge branch 'pandas-dev:master' into fill-missing-names

122eecd

johnzangwill added 2 commits December 16, 2021 13:11

Update common.py

ac573b8

Update common.py

76e864b

johnzangwill requested a review from jreback December 16, 2021 15:21

jreback approved these changes Dec 16, 2021

jreback merged commit a769e38 into pandas-dev:master Dec 16, 2021

johnzangwill deleted the fill-missing-names branch December 16, 2021 21:25

This was referenced Jan 16, 2022

BUG: DataFrameGroupBy.value_counts() fails if as_index=False and there are duplicate column labels #45160

Merged

ENH: Add allow_duplicates to MultiIndex.to_frame #45318

Merged
