Fix issue17 #19

pjz · 2016-06-15T18:04:02Z

Potential fix for #17. Since there's only one value present, I assume a normal distribution with a standard deviation that gets lower the more samples there are of that value.

CamDavidsonPilon · 2016-06-16T00:05:10Z

tests/test_tdigest.py

+    def test_quantile_with_single_centroid_at_zero(self, empty_tdigest):
+        td = empty_tdigest
+        td.update(0)
+        assert td.quantile(0) == 0.5


should this be 0.5? The quantile is the value p s.t. F(q) = p. If F is equal to 0 before 0, and 1 at and after 0, then I would expect this to be 1. (That's kind of confusing because of all the 0 and 1...)

CamDavidsonPilon · 2016-06-16T00:06:33Z

I was thinking of a solution like this:

def ...

        if len(self) == 1:                                 
            return int(q >= self.C.min_key())

        for i, key in enumerate(self.C.keys()):

The CDF is a degenerate distribution, so the values should return 0 or 1, depending on if q is above what we have seen thus far. What do you think?

pjz · 2016-06-22T14:05:22Z

You're probably correct ; my thought process was more like "there's one centroid, they asked for the mean, so clearly they should get 0.5". Not only is your reasoning better, but also since quantile and percentile are inverses, it should be true that x == td.quantile(td.percentile(x)) and x = td.percentile(td.quantile(x)), which fails if it's 0.5 as I suggested, but works if it's 1, as you suggested.

pjz · 2016-07-21T17:35:11Z

The failure here is a bit inexplicable, not even in the code I changed, I think, and only in py3.3: 2.7 and 3.5 pass fine. Can you prod it to re-run the tests?

CamDavidsonPilon · 2016-07-21T17:39:06Z

Rerunning - apologies, I didn't see you update the code

CamDavidsonPilon reviewed Jun 16, 2016
View reviewed changes

Tests for CamDavidsonPilon#17

6fbb771

pjz force-pushed the fix_issue17 branch from b8e1641 to aa7c90d Compare July 20, 2016 18:03

Potential fix for CamDavidsonPilon#17

6f78656

pjz force-pushed the fix_issue17 branch from aa7c90d to 6f78656 Compare July 20, 2016 18:05

CamDavidsonPilon merged commit 7770eeb into CamDavidsonPilon:master Jul 21, 2016

This was referenced Jul 21, 2016

Tests for #17 #18

Closed

Quantile broken if only one centroid #17

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix issue17 #19

Fix issue17 #19

pjz commented Jun 15, 2016

CamDavidsonPilon Jun 16, 2016

CamDavidsonPilon commented Jun 16, 2016

pjz commented Jun 22, 2016

pjz commented Jul 21, 2016

CamDavidsonPilon commented Jul 21, 2016

Fix issue17 #19

Fix issue17 #19

Conversation

pjz commented Jun 15, 2016

CamDavidsonPilon Jun 16, 2016

Choose a reason for hiding this comment

CamDavidsonPilon commented Jun 16, 2016

pjz commented Jun 22, 2016

pjz commented Jul 21, 2016

CamDavidsonPilon commented Jul 21, 2016