BUG: Fix MI repr with long names #21655

TomAugspurger · 2018-06-27T13:40:08Z

In [4]:         try:
   ...:             from unittest import mock
   ...:         except ImportError:
   ...:             mock = pytest.importorskip("mock")
   ...:
   ...:         terminal_size = os.terminal_size((118, 96))
   ...:         p1 = mock.patch('pandas.io.formats.console.get_terminal_size',
   ...:                         return_value=terminal_size)
   ...:         p2 = mock.patch('pandas.io.formats.format.get_terminal_size',
   ...:                         return_value=terminal_size)
   ...:
   ...:         index = range(5)
   ...:         columns = pd.MultiIndex.from_tuples([
   ...:             ('This is a long title with > 37 chars.', 'cat'),
   ...:             ('This is a loooooonger title with > 43 chars.', 'dog'),
   ...:         ])
   ...:         df = pd.DataFrame(1, index=index, columns=columns)
   ...:
   ...:

In [5]: with p1, p2:
   ...:     print('-' * 80)
   ...:     print(repr(df))
   ...:     print('-' * 80)
   ...:

output:

--------------------------------------------------------------------------------
  ...
  ...
0 ...
1 ...
2 ...
3 ...
4 ...

[5 rows x 2 columns]
--------------------------------------------------------------------------------

This matches the repr for non-hierarchical

In [6]: s = pd.DataFrame({"A" * 41: [1, 2], 'B' * 41: [1, 2]})

In [7]: with p1, p2:
   ...:     print('-' * 80)
   ...:     print(repr(s))
   ...:     print('-' * 80)
   ...:

output:

--------------------------------------------------------------------------------
  ...
0 ...
1 ...

[2 rows x 2 columns]
--------------------------------------------------------------------------------

These can certainly be improved, though I'm not sure we'll (I'll) get to it for 0.23.2.

TomAugspurger · 2018-06-27T13:42:13Z

from @jorisvandenbossche

Overflowing the line is what is happening in my console if I make it smaller

Yeah, agreed that would be best, but I can't tell whose doing the wrapping, if that makes sense.

TomAugspurger · 2018-06-27T13:42:53Z

Well, I guess it has to be the terminal... So we should just always print two columns? And let the terminal wrap if needed?

codecov · 2018-06-27T14:06:27Z

Codecov Report

❗ No coverage uploaded for pull request base (master@1cc5471). Click here to learn what that means.
The diff coverage is 100%.

@@            Coverage Diff            @@
##             master   #21655   +/-   ##
=========================================
  Coverage          ?    91.9%           
=========================================
  Files             ?      154           
  Lines             ?    49657           
  Branches          ?        0           
=========================================
  Hits              ?    45638           
  Misses            ?     4019           
  Partials          ?        0

Flag	Coverage Δ
#multiple	`90.28% <100%> (?)`
#single	`42.05% <75%> (?)`

Impacted Files	Coverage Δ
pandas/io/formats/format.py	`98.25% <100%> (ø)`

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 1cc5471...4391c24. Read the comment docs.

TomAugspurger · 2018-06-27T14:09:17Z

Running

import pandas as pd
import numpy as np

s = pd.DataFrame({"A" * 41: 1, "B" * 41: 1}, index=[0, 1, 2])
print("Regular Columns:")
print(repr(s))
print('\n\n')

index = range(5)
columns = pd.MultiIndex.from_tuples([
    ('This is a long title with > 37 chars.', 'cat'),
    ('This is a loooooonger title with > 43 chars.', 'dog'),
])
df = pd.DataFrame(1, index=index, columns=columns)

print("MultiIndex")
print(repr(df))

output

Regular Columns:
   AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA  BBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBB
0                                          1                                          1
1                                          1                                          1
2                                          1                                          1



MultiIndex
  This is a long title with > 37 chars. This is a loooooonger title with > 43 chars.
                                    cat                                          dog
0                                     1                                            1
1                                     1                                            1
2                                     1                                            1
3                                     1                                            1
4                                     1                                            1

jorisvandenbossche · 2018-06-27T16:17:43Z

The output in the top-post is not longer correct with the latest changes?

TomAugspurger · 2018-06-27T17:49:46Z

Correct, #21655 (comment) has the current output.

jreback · 2018-06-28T10:06:15Z

pandas/tests/io/formats/test_format.py

+        # TODO: use mock fixutre.
+        # This is being backported, so doing it directly here.
+        try:
+            from unittest import mock


should PR in 0.24.0 (to move this to test_decorators)

Forgot to lay out my plan.

I'd like to merge this with the try / except, backport to 0.23.2, and then make a PR removing this and using the mock fixture from #20729

#20729 isn't being backported, so that seems easiset.

jreback · 2018-06-28T10:10:45Z

pandas/io/formats/format.py

@@ -640,6 +640,8 @@ def to_string(self):
                    col_lens = col_lens.drop(mid_ix)
                    n_cols = len(col_lens)
                max_cols_adj = n_cols - self.index  # subtract index column


maybe make make the comments consistent here

# subtract index column max_cols_adj = ... # esnure we print at least two ....

jorisvandenbossche · 2018-06-29T09:49:26Z

I can confirm that this fixes #21327 (the one that I could reproduce in my local console without any hacks)

(cherry picked from commit ad76ffc)

BUG: quickfix MI repr

6f2aca6

TomAugspurger added this to the 0.23.2 milestone Jun 27, 2018

TomAugspurger added the Output-Formatting __repr__ of pandas objects, to_string label Jun 27, 2018

Better fallback

8151293

Py2 compat

4e11634

jreback approved these changes Jun 28, 2018

View reviewed changes

TomAugspurger and others added 3 commits June 29, 2018 05:55

Merge remote-tracking branch 'upstream/master' into repr-mi-tostring

cebb741

update

2878358

Merge branch 'master' into repr-mi-tostring

4391c24

jorisvandenbossche added the Needs Backport label Jul 2, 2018

jorisvandenbossche merged commit ad76ffc into pandas-dev:master Jul 2, 2018

jorisvandenbossche removed the Needs Backport label Jul 2, 2018

jorisvandenbossche pushed a commit to jorisvandenbossche/pandas that referenced this pull request Jul 2, 2018

BUG: Fix MI repr with long names (pandas-dev#21655)

cb94eb2

(cherry picked from commit ad76ffc)

jorisvandenbossche pushed a commit that referenced this pull request Jul 5, 2018

BUG: Fix MI repr with long names (#21655)

a74ee54

(cherry picked from commit ad76ffc)

Sup3rGeo pushed a commit to Sup3rGeo/pandas that referenced this pull request Oct 1, 2018

BUG: Fix MI repr with long names (pandas-dev#21655)

55463df

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG: Fix MI repr with long names #21655

BUG: Fix MI repr with long names #21655

TomAugspurger commented Jun 27, 2018

TomAugspurger commented Jun 27, 2018

TomAugspurger commented Jun 27, 2018

codecov bot commented Jun 27, 2018 •

edited

Loading

TomAugspurger commented Jun 27, 2018

jorisvandenbossche commented Jun 27, 2018

TomAugspurger commented Jun 27, 2018

jreback Jun 28, 2018

TomAugspurger Jun 28, 2018

jreback Jun 28, 2018

jorisvandenbossche commented Jun 29, 2018

BUG: Fix MI repr with long names #21655

BUG: Fix MI repr with long names #21655

Conversation

TomAugspurger commented Jun 27, 2018

TomAugspurger commented Jun 27, 2018

TomAugspurger commented Jun 27, 2018

codecov bot commented Jun 27, 2018 • edited Loading

Codecov Report

TomAugspurger commented Jun 27, 2018

jorisvandenbossche commented Jun 27, 2018

TomAugspurger commented Jun 27, 2018

jreback Jun 28, 2018

Choose a reason for hiding this comment

TomAugspurger Jun 28, 2018

Choose a reason for hiding this comment

jreback Jun 28, 2018

Choose a reason for hiding this comment

jorisvandenbossche commented Jun 29, 2018

codecov bot commented Jun 27, 2018 •

edited

Loading