PERF: Implement RangeIndex min/max using RangeIndex properties #17611

jschendel · 2017-09-21T05:42:19Z

closes PERF: Use RangeIndex properties to compute max/min #17607
tests added / passed
passes git diff upstream/master -u -- "*.py" | flake8 --diff
whatsnew entry

Benchmarks are essentially the same as what's referenced in the issue. Maybe a few ns slower, since I combined the logic into a helper function instead of having identical code (other than the elif) in both min and max, which is how the code was written when I ran the benchmarks posted in the issue. Updated benchmarks:

      before           after         ratio
     [fedf9228]       [4cb0de8d]
-        25.4±0ms      1.77±0.06μs     0.00  index_object.Range.time_min
-      26.0±0.9ms      1.71±0.06μs     0.00  index_object.Range.time_max
-        25.4±0ms         1.51±0μs     0.00  index_object.Range.time_max_trivial
-        25.4±0ms         1.30±0μs     0.00  index_object.Range.time_min_trivial

SOME BENCHMARKS HAVE CHANGED SIGNIFICANTLY.
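
For reference, the idea behind the speedup can be sketched in plain Python: the extremes of an arithmetic progression follow from its start, step, and length, so no array needs to be materialized. This is a minimal illustration, not the code merged in this PR; the function name `range_minmax` and its `meth` argument are assumptions made for the example, and an empty range is assumed to yield NaN.

```python
def range_minmax(start, stop, step, meth):
    """Return the min or max of range(start, stop, step) in O(1).

    Hypothetical helper for illustration; not the pandas implementation.
    """
    if meth not in ("min", "max"):
        raise ValueError("meth must be 'min' or 'max'")
    n = len(range(start, stop, step))  # element count, computed arithmetically
    if n == 0:
        return float("nan")  # assumption: an empty range has no min/max
    last = start + (n - 1) * step  # final element actually reached
    # With a positive step the first element is the smallest;
    # with a negative step it is the largest.
    if (meth == "min") == (step > 0):
        return start
    return last
```

For instance, `range_minmax(500, 0, -6, "min")` returns 2, the last value reached when counting down from 500 in steps of 6.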

codecov · 2017-09-21T06:15:29Z

Codecov Report

Merging #17611 into master will decrease coverage by 0.01%.
The diff coverage is 100%.

@@            Coverage Diff             @@
##           master   #17611      +/-   ##
==========================================
- Coverage    91.2%   91.18%   -0.02%     
==========================================
  Files         163      163              
  Lines       49637    49648      +11     
==========================================
+ Hits        45269    45271       +2     
- Misses       4368     4377       +9

Flag	Coverage Δ
#multiple	`88.97% <100%> (ø)`	⬆️
#single	`40.18% <27.27%> (-0.07%)`	⬇️

Impacted Files	Coverage Δ
pandas/core/indexes/range.py	`92.83% <100%> (+0.24%)`	⬆️
pandas/io/gbq.py	`25% <0%> (-58.34%)`	⬇️
pandas/core/frame.py	`97.77% <0%> (-0.1%)`	⬇️

Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 8276a42...5379fc2.

topper-123 · 2017-09-21T10:00:23Z

I'm wondering if something similar, but more general, couldn't be done that works with both indices and series using is_monotonic_*.

E.g.:

def max(self):
    if self.is_monotonic_increasing:
        return self[-1]
    elif self.is_monotonic_decreasing:
        return self[0]
    else:
        return nanops.nanmax(self.values)

Seems to me this should work. What do you think, have I missed something nonobvious?
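
A rough pure-Python sketch of this idea, for illustration only: the helper names below are hypothetical, a plain list stands in for an index or series, missing values are not handled, and the monotonicity checks are computed with an explicit scan. In this naive form the checks themselves cost a pass over the data, so the shortcut only pays off when the monotonicity result is already known or cached.

```python
def is_monotonic_increasing(values):
    """True if each element is >= the previous one (single O(n) pass)."""
    return all(a <= b for a, b in zip(values, values[1:]))


def is_monotonic_decreasing(values):
    """True if each element is <= the previous one (single O(n) pass)."""
    return all(a >= b for a, b in zip(values, values[1:]))


def shortcut_max(values):
    """Return the maximum, using the ends of the sequence when it is sorted."""
    if not values:
        raise ValueError("max of empty sequence")
    if is_monotonic_increasing(values):
        return values[-1]  # sorted ascending: last element is the largest
    if is_monotonic_decreasing(values):
        return values[0]  # sorted descending: first element is the largest
    return max(values)  # unsorted: fall back to a full scan
```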

jreback

doc comment otherwise lgtm

jreback · 2017-09-21T13:17:07Z

doc/source/whatsnew/v0.21.0.txt

@@ -472,6 +472,7 @@ Performance Improvements
 - Improved performance of :meth:`Categorical.set_categories` by not materializing the values (:issue:`17508`)
 - :attr:`Timestamp.microsecond` no longer re-computes on attribute access (:issue:`17331`)
 - Improved performance of the :class:`CategoricalIndex` for data that is already categorical dtype (:issue:`17513`)
+- Improved performance of ``RangeIndex.min`` and ``RangeIndex.max`` by using ``RangeIndex`` properties to perform the computations (:issue:`17607`)


can u use :meth: here

Will using :meth: work in this case? It doesn't look like RangeIndex is in api.rst, and I can't find any RangeIndex related items in the existing HTML API reference. Not super well versed on how the docs work though, so could be mistaken.

so should prob add RangeIndex to api.rst :>

(and prob any other Indexes that are missing). We don't need to repeat the methods though (they are inherited).

gfyoung · 2017-09-21T18:16:37Z

pandas/tests/indexes/test_range.py

@@ -994,3 +994,22 @@ def test_append(self):
                # Append single item rather than list
                result2 = indices[0].append(indices[1])
                tm.assert_index_equal(result2, expected, exact=True)
+
+    def test_max_min(self):
+        params = [(0, 400, 3), (500, 0, -6), (-10**6, 10**6, 4),


pytest.mark.parametrize this

Reference issue number above this

jschendel · 2017-09-21T19:12:35Z

@topper-123 : I think that will work in general, but I wonder if there are cases where the additional overhead of computing is_monotonic_* could outweigh performance gains. Would be interesting to look into.

Implemented RangeIndex min/max in terms of RangeIndex properties.

jorisvandenbossche · 2017-09-22T07:28:53Z

doc/source/api.rst

+
+.. autosummary::
+   :toctree: generated/
+   :template: autosummary/class_without_autosummary.rst


Can you just list them all in a single autosummary directive?

jreback · 2017-09-22T13:15:18Z

thanks @jschendel

jorisvandenbossche · 2017-09-22T15:17:16Z

doc/source/whatsnew/v0.21.0.txt

@@ -473,6 +473,7 @@ Performance Improvements
 - Improved performance of :meth:`Categorical.set_categories` by not materializing the values (:issue:`17508`)
 - :attr:`Timestamp.microsecond` no longer re-computes on attribute access (:issue:`17331`)
 - Improved performance of the :class:`CategoricalIndex` for data that is already categorical dtype (:issue:`17513`)
+- Improved performance of :meth:`RangeIndex.min` and :meth:`RangeIndex.max` by using ``RangeIndex`` properties to perform the computations (:issue:`17607`)


Those links won't work because the pages are not generated (only the class docstring is included in the API docs).
To solve this, we would need to find a way to specify which of the methods to include and which not (or list them manually in api.rst, which is probably easier).

jreback approved these changes Sep 21, 2017

gfyoung reviewed Sep 21, 2017

gfyoung added the Indexing and Performance labels and removed the Indexing label Sep 21, 2017

jreback added this to the 0.21.0 milestone Sep 21, 2017

PERF: RangeIndex min/max

6f27b9a

Implemented RangeIndex min/max in terms of RangeIndex properties.

jschendel force-pushed the range-min-max branch from 4cb0de8 to f7e0bfa Compare September 22, 2017 05:39

jorisvandenbossche reviewed Sep 22, 2017

parametrize test and update docs

8aa59c3

jschendel force-pushed the range-min-max branch from f7e0bfa to 8aa59c3 Compare September 22, 2017 08:20

consolidate docs

5379fc2

jreback merged commit 26681db into pandas-dev:master Sep 22, 2017

jorisvandenbossche reviewed Sep 22, 2017

jorisvandenbossche mentioned this pull request Sep 23, 2017

DOC: fix no autosummary for numerical index api pages #17642

Merged

jschendel deleted the range-min-max branch October 22, 2017 05:47

alanbato pushed a commit to alanbato/pandas that referenced this pull request Nov 10, 2017

PERF: Implement RangeIndex min/max using RangeIndex properties (pandas-dev#17611)

442f5b9

No-Stream pushed a commit to No-Stream/pandas that referenced this pull request Nov 28, 2017

PERF: Implement RangeIndex min/max using RangeIndex properties (pandas-dev#17611)

af3d4ab
