BUG: describe started rounding reported percentile 99.999% to 100% #55765

AlbertoEAF · 2023-10-30T15:15:55Z

Pandas version checks

I have checked that this issue has not already been reported.
I have confirmed this bug exists on the latest version of pandas.
I have confirmed this bug exists on the main branch of pandas.

Reproducible Example

import pandas as pd

D = pd.DataFrame({"a":[23, 51]})
D.a.describe([0.9, 0.99, 0.999, 0.9999, 0.99999])

Issue Description

In the pandas 1.x version I was using (don't know exactly which after I bumped) describe() used to compute the requested percentiles. However, I just bumped to 2.1.2 and now my percentile 99.999% is reported as percentile 100%, even though it is not, by the following example:

Observed issues:

Row 99.999% is now reported as 100%; <-- This is the one that affects me the most
Clearly requesting percentiles 99.999% and 100% is still different, by looking at the outputs of the two 100% rows;
max doesn't match 100%.

Expected Behavior

The percentile with 5 9's should still report as percentile 99.999% on the output table.

That problem affects me the most, however, the other 2 could also hypothetically be considered unexpected.

Installed Versions

INSTALLED VERSIONS

commit : a60ad39
python : 3.11.5.final.0
python-bits : 64
OS : Linux
OS-release : 5.15.0-87-generic
Version : #97~20.04.1-Ubuntu SMP Thu Oct 5 08:25:28 UTC 2023
machine : x86_64
processor : x86_64
byteorder : little
LC_ALL : None
LANG : en_US.UTF-8
LOCALE : en_US.UTF-8

pandas : 2.1.2
numpy : 1.26.1
pytz : 2023.3.post1
dateutil : 2.8.2
setuptools : 68.0.0
pip : 23.3
Cython : None
pytest : None
hypothesis : None
sphinx : None
blosc : None
feather : None
xlsxwriter : None
lxml.etree : None
html5lib : None
pymysql : None
psycopg2 : None
jinja2 : 3.1.2
IPython : 8.16.1
pandas_datareader : None
bs4 : 4.12.2
bottleneck : None
dataframe-api-compat: None
fastparquet : None
fsspec : None
gcsfs : None
matplotlib : 3.8.0
numba : None
numexpr : None
odfpy : None
openpyxl : None
pandas_gbq : None
pyarrow : None
pyreadstat : None
pyxlsb : None
s3fs : None
scipy : None
sqlalchemy : None
tables : None
tabulate : None
xarray : None
xlrd : None
zstandard : None
tzdata : 2023.3
qtpy : None
pyqt5 : None

The text was updated successfully, but these errors were encountered:

paulreece · 2023-10-30T20:37:53Z

Confirmed on main

I think the change was implemented in #48298 where the rounding logic was updated.

It is because in this line it is giving back a precision of 4 and therefore rounding up 99.999:

pandas/pandas/io/formats/format.py

Lines 1607 to 1610 in 2d2d67d

    
           prec = -np.floor( 
        
               np.log10(np.min(np.ediff1d(unique_pcts, to_begin=to_begin, to_end=to_end))) 
        
           ).astype(int) 
        
           prec = max(1, prec)

parthi-siva · 2023-11-06T05:36:21Z

take

AlbertoEAF added Bug Needs Triage Issue that has not been reviewed by a pandas team member labels Oct 30, 2023

paulreece added IO Data IO issues that don't fit into a more specific label and removed Needs Triage Issue that has not been reviewed by a pandas team member labels Oct 30, 2023

github-actions bot assigned parthi-siva Nov 6, 2023

parthi-siva mentioned this issue Nov 6, 2023

BUG: Fix rounding of percentile 99.999% to 100% #55841

Merged

5 tasks

mroeschke closed this as completed in #55841 Nov 7, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG: describe started rounding reported percentile 99.999% to 100% #55765

BUG: describe started rounding reported percentile 99.999% to 100% #55765

AlbertoEAF commented Oct 30, 2023 •

edited

INSTALLED VERSIONS

paulreece commented Oct 30, 2023 •

edited

parthi-siva commented Nov 6, 2023

BUG: describe started rounding reported percentile 99.999% to 100% #55765

BUG: describe started rounding reported percentile 99.999% to 100% #55765

Comments

AlbertoEAF commented Oct 30, 2023 • edited

Pandas version checks

Reproducible Example

Issue Description

Expected Behavior

Installed Versions

INSTALLED VERSIONS

paulreece commented Oct 30, 2023 • edited

parthi-siva commented Nov 6, 2023

AlbertoEAF commented Oct 30, 2023 •

edited

paulreece commented Oct 30, 2023 •

edited