Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

PERF: infer_datetime_format fails without zero-padding #11142

Closed
chris-b1 opened this issue Sep 18, 2015 · 1 comment
Closed

PERF: infer_datetime_format fails without zero-padding #11142

chris-b1 opened this issue Sep 18, 2015 · 1 comment
Labels
Datetime Datetime data dtype Performance Memory or execution speed performance
Milestone

Comments

@chris-b1
Copy link
Contributor

I think this can be fixed by just padding tokens in the inference step. The dates do parse, just always fall back to the slow path, even with infer_datetime_format=True

In [22]: from pandas.tseries.tools import _guess_datetime_format

In [23]: _guess_datetime_format('02/02/2014')
Out[23]: '%m/%d/%Y'

In [24]: _guess_datetime_format('2/2/2014')
@jreback
Copy link
Contributor

jreback commented Sep 18, 2015

I think you can add additional formats here.

@jreback jreback added Datetime Datetime data dtype Performance Memory or execution speed performance Difficulty Novice labels Sep 18, 2015
@jreback jreback added this to the Next Major Release milestone Sep 18, 2015
chris-b1 added a commit to chris-b1/pandas that referenced this issue Sep 19, 2015
@jreback jreback modified the milestones: 0.17.0, Next Major Release Sep 19, 2015
jreback added a commit that referenced this issue Sep 20, 2015
PERF: infer_datetime_format without padding #11142
yarikoptic added a commit to neurodebian/pandas that referenced this issue Oct 11, 2015
* commit 'v0.17.0rc1-92-gc6bcc99': (29 commits)
  CI: tests latest versions of openpyxl
  COMPAT: openpyxl >= 2.2 support, pandas-dev#10125
  Tests demonstrating how to use sqlalchemy.text() objects in read_sql()
  TST: Capture warnings in _check_plot_works
  COMPAT/BUG: color handling in scatter
  COMPAT: Support for matplotlib 1.5
  ERR/API: Raise NotImplementedError when Panel operator function is not implemented, pandas-dev#7692
  DOC: minor doc formatting fixes
  PERF: nested dict DataFrame construction
  DEPR: deprecate SparsePanel
  BLD: dateutil->python-dateutil in conda recipe
  BUG/API: GH11086 where freq is not inferred if both freq is None
  ENH: add merge indicator to DataFrame.merge
  PERF: improves performance in groupby.size
  BUG: DatetimeTZBlock.fillna raises TypeError
  PERF: infer_datetime_format without padding pandas-dev#11142
  PERF: improves performance in SeriesGroupBy.transform
  TST: Verify fix for buffer overflow in read_csv with engine='c' (GH pandas-dev#9735)
  DEPR: Series.is_timeseries
  BUG: nested construction with timedelta pandas-dev#11129
  ...
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Datetime Datetime data dtype Performance Memory or execution speed performance
Projects
None yet
Development

No branches or pull requests

2 participants