Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

PERF: improve to_timedelta perf for string-like #6755

Closed
jreback opened this issue Apr 1, 2014 · 0 comments · Fixed by #10396
Closed

PERF: improve to_timedelta perf for string-like #6755

jreback opened this issue Apr 1, 2014 · 0 comments · Fixed by #10396
Labels
Performance Memory or execution speed performance Timedelta Timedelta data type

Comments

@jreback
Copy link
Contributor

jreback commented Apr 1, 2014

see #6754
related #3009, #8190
from SO

convering string like timedeltas, e.g.

'1 days', '3 days, 00:00:03', '3 days 2 hours'

is slow because it requires a regex match for each element. Can put cythonized (like format parsing for datetimes), or since only very limited formats supported can simply test them (in cython) and convert.

@jreback jreback added this to the 0.15.0 milestone Apr 1, 2014
@jreback jreback modified the milestones: 0.16.0, Next Major Release Mar 6, 2015
jreback added a commit to jreback/pandas that referenced this issue Jun 20, 2015
jreback added a commit that referenced this issue Jun 22, 2015
PERF: parse and timedelta ops improvements, #6755
yarikoptic added a commit to neurodebian/pandas that referenced this issue Jul 2, 2015
* commit 'v0.16.2-42-g383865f': (72 commits)
  BUG: provide categorical concat always on axis 0, pandas-dev#10430     numpy 1.10 makes this an error for 1-d on axis != 0
  DOC: update missing.rst with ref to groupby.rst
  BUG: Timedeltas with no specified units (and frac) should raise, pandas-dev#10426
  BUG: using .loc[:,column] fails when the object is a multi-index, pandas-dev#10408
  Removed scikit-timeseries migration docs from FAQ
  BUG: GH10395 bug in DataFrame.interpolate with axis=1 and inplace=True
  BUG: GH10392 bug where Table.select_column does not preserve column name
  TST: Use unicode literals in string test
  PERF: fix _get_level_indexer to accept an intermediate indexer result
  PERF: bench for pandas-dev#10287
  BUG: drop_duplicates drops name(s).
  ENH: Enable ExcelWriter to construct in-memory sheets
  BLD: remove support for 3.2, pandas-dev#9118
  PERF: timedelta and datetime64 ops improvements
  PERF: parse timedelta strings in cython pandas-dev#6755
  closes bug in reset_index when index contains NaT
  Check for size=0 before setting item Fixes pandas-dev#10193
  closes bug in apply when function returns categorical
  BUG: frequencies.get_freq_code raises an error against offset with n != 1
  CI: run doc-tests always
  ...
gambogi pushed a commit to gambogi/pandas that referenced this issue Jul 13, 2015
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Performance Memory or execution speed performance Timedelta Timedelta data type
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant