
PERF: Possible performance regression for indexing from 0.12 to 0.13.1 #6882

Closed
pmorissette opened this issue Apr 14, 2014 · 11 comments
Labels
Performance Memory or execution speed performance

Comments

@pmorissette

Hey all,

Just upgraded my pandas version from 0.12 to 0.13.1 and noticed a significant performance regression for indexing operations (get, set, and windowing).

Here is my test setup code:

import pandas as pd

ts1 = pd.TimeSeries(data=100.0, index=pd.date_range('2000-01-01', periods=1000))
ts2 = pd.TimeSeries(data=200.0, index=pd.date_range('2000-01-01', periods=1000))
ts3 = pd.TimeSeries(data=300.0, index=pd.date_range('2000-01-01', periods=1000))
df = pd.DataFrame({'ts1': ts1, 'ts2': ts2, 'ts3': ts3})

dt = ts1.index[500]

Here is a table showing the results of IPython's %timeit magic (times in microseconds):

test              0.12    0.13.1
ts1[dt]           3.78    8.5
ts1.ix[dt]        11.8    30.7
ts1.loc[dt]       12.7    37.7
ts1[dt] = 1       1.86    4.32
ts1.ix[dt] = 1    12.5    65.9
ts1.loc[dt] = 1   36.2    65.7
ts1[:dt]          78.2    101
ts1.ix[:dt]       53.1    106
ts1.loc[:dt]      59.5    101
df.ix[dt]         45.3    77.9
df.ix[:dt]        63.3    85.9

I did not see up-to-date data on http://pandas.pydata.org/pandas-docs/vbench/vb_indexing.html - most charts end in June 2012. Am I looking at the right benchmark data?

Can someone confirm this slowdown?

I am using numpy 1.8.1 by the way - let me know if you need any other version numbers.

Thanks in advance!

@immerrr
Contributor

immerrr commented Apr 15, 2014

Are those numbers microseconds? There was some microsecond-level overhead added in 0.13.1, which I've seen and tried to address, but it was agreed that shaving off several dozen (or hundred?) additional function calls might not be worth it, because that overhead didn't scale with container size. For example, on my 3.3GHz i3, one microsecond is about 6 function calls:

In [1]: def foo(x): return x

In [2]: timeit foo(1)
10000000 loops, best of 3: 149 ns per loop

FTR, there's a pull request with a lot of big-container indexing benchmarks, most likely including the ones shown here. I remember it showing some unexpected slowdowns for datetime indices, but I haven't yet looked into them.

@pmorissette
Author

Hey @immerrr thanks for the quick reply!

Yes, these numbers are in microseconds. The reason I noticed is that I have a program that updates a large number of pre-allocated time series and data frames date after date, so the increase in time was noticeable. I understand this is not a huge issue, but I thought I'd bring it up since I saw no mention of it elsewhere. For my application it did lead to a significant slowdown (~1.7 times slower overall).

@jreback
Contributor

jreback commented Apr 15, 2014

@pmorissette you need to make sure that you are vectorizing. Accessing/setting single elements via most indexers is not that fast, as they handle lots of cases. You can try using iat/at in those cases, but it behooves you to vectorize as much as possible.
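As a rough illustration of this advice, here is a minimal sketch contrasting the scalar .at/.iat indexers with a single vectorized assignment (using pd.Series, the modern equivalent of the pd.TimeSeries in the original snippet; the values are illustrative):

```python
import pandas as pd

ts = pd.Series(100.0, index=pd.date_range('2000-01-01', periods=1000))
dt = ts.index[500]

# Scalar indexers: .at is label-based, .iat is positional; both skip
# much of the general-purpose machinery behind .loc/.ix.
ts.at[dt] = 1.0
ts.iat[501] = 2.0

# Vectorized equivalent: one call updates many elements at once,
# amortizing the per-call overhead across the whole slice.
ts.iloc[500:510] = 3.0
```

The vectorized form pays the indexing overhead once per call instead of once per element, which is why it scales better for bulk updates.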

@pmorissette
Author

Hey @jreback, yeah, vectorizing would indeed be the way to go, but for my application this is not possible. The values I am updating are only known at time t, and I must loop through all the dates one at a time. It is convenient to have the data in a pandas TimeSeries for my application, but perhaps a quicker storage solution could work, and I could create a TimeSeries on demand when necessary. Some testing will be in order.

Also, I will look at iat/at to see if I can get a speed improvement. Thanks for the help!
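A minimal sketch of the alternative-storage idea mentioned above: keep the values in a pre-allocated NumPy buffer, update it positionally in the per-date loop, and only wrap it in a Series when one is actually needed (the loop body here is just a placeholder for the real per-date computation):

```python
import numpy as np
import pandas as pd

idx = pd.date_range('2000-01-01', periods=1000)
buf = np.full(len(idx), np.nan)  # pre-allocated storage

# Update one position per date with plain ndarray indexing,
# which avoids pandas' per-call indexing overhead entirely.
for i in range(len(idx)):
    buf[i] = float(i)  # placeholder for the value known at time t

# Build the Series only when it is actually needed downstream.
ts = pd.Series(buf, index=idx)
```

Constructing the Series once at the end costs a single allocation instead of thousands of indexed writes through pandas.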

@pmorissette
Author

@jreback

I just ran my benchmark using .iat and .at, and they too are slower in 0.13.1 vs 0.12. These two methods are also slower than basic bracket indexing. Again, these are microseconds - not a big deal individually, but it adds up in my use case.

test           0.12    0.13.1
ts1[dt]        3.78    8.5
ts1.iat[500]   15.5    26.8
ts1.at[dt]     7       15

@jreback
Contributor

jreback commented Apr 15, 2014

you realize that substantial changes took place in 0.13 - see the whatsnew

if these microseconds matter to you, then you need to do the indexing some other way

@pmorissette
Author

@jreback sounds good - I just wanted to bring it up since I didn't see this issue mentioned elsewhere. Pandas is great, and I appreciate all the hard work that goes into this library. Thanks again.

@immerrr
Contributor

immerrr commented Apr 15, 2014

@pmorissette, sometimes i = ts1.index.get_loc(dt); ts1.iloc[i] was faster for me than simply ts1.loc[dt] - maybe that could help...
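A short sketch of this suggestion (using pd.Series in place of the older pd.TimeSeries): resolve the label to a position once with Index.get_loc, then use the positional .iloc indexer, which skips the label-resolution work on each access.

```python
import pandas as pd

ts1 = pd.Series(100.0, index=pd.date_range('2000-01-01', periods=1000))
dt = ts1.index[500]

# Resolve the datetime label to an integer position once...
i = ts1.index.get_loc(dt)

# ...then index positionally, bypassing label lookup.
value = ts1.iloc[i]
```

This split is most useful when the same label is accessed repeatedly, since the get_loc cost is paid only once.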

@pmorissette
Author

@immerrr ok cool I'll take a look!

@jreback
Contributor

jreback commented Apr 15, 2014

my point before is that iat/at are faster than iloc/loc

they are all probably a bit slower than in 0.12

we normally don't optimize to the microsecond; if that actually matters, you are generally going about the problem in the wrong way

@pmorissette
Author

@jreback ok understood. Thanks for the heads up.
