Skip to content

Commit

Permalink
fix ordering of quantiles in describe.
Browse files Browse the repository at this point in the history
Previously dataframe.describe was using the built-in set to sort quantiles which sorts by hash rather than by value. Now the built-in sorted is used instead.
  • Loading branch information
gregrf committed Mar 29, 2019
1 parent 2391040 commit 95d62cc
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion dask/dataframe/core.py
Expand Up @@ -1543,7 +1543,7 @@ def describe(self, split_every=False, percentiles=None):
if percentiles is None:
percentiles = [0.25, 0.5, 0.75]
else:
percentiles = list(set(sorted(percentiles + [0.5])))
percentiles = sorted(set(list(percentiles) + [0.5]))
stats = [num.count(split_every=split_every),
num.mean(split_every=split_every),
num.std(split_every=split_every),
Expand Down

0 comments on commit 95d62cc

Please sign in to comment.