Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

PERF: Index.get_loc #43705

Merged
merged 1 commit into from
Sep 23, 2021
Merged

PERF: Index.get_loc #43705

merged 1 commit into from
Sep 23, 2021

Conversation

jbrockmendel
Copy link
Member

Makes a big difference for Float64Index with integer key.

import numpy as np
import pandas as pd

idx = pd.Index(np.arange(10**7))
fidx = idx.astype('f8')


key = idx[len(idx) // 2]
key2 = idx[10]

%timeit idx.get_loc(key)
2.97 µs ± 36.2 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)  # <- master
2.3 µs ± 36.3 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)  # <- PR

%timeit idx.get_loc(key2)
3.2 µs ± 151 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)  # <- master
2.1 µs ± 140 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)  # <- PR

%timeit fidx.get_loc(key)
56.4 µs ± 1.22 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)  # <- master
2.11 µs ± 11.1 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)  # <- PR

%timeit fidx.get_loc(key2)
54.7 µs ± 1.12 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)  # <- master
2 µs ± 21.7 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)  # <- PR

@jbrockmendel jbrockmendel added Indexing Related to indexing on series/frames, not to indexes themselves Performance Memory or execution speed performance labels Sep 22, 2021
@jreback jreback added this to the 1.4 milestone Sep 22, 2021
@jreback jreback merged commit f3d4817 into pandas-dev:master Sep 23, 2021
@jbrockmendel jbrockmendel deleted the cln-bin_search branch September 23, 2021 16:17
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Indexing Related to indexing on series/frames, not to indexes themselves Performance Memory or execution speed performance
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants