Add option to create indexes in HDFStore if user is using PyTables Pro / PyTables 2.3+ #698

Closed
wesm opened this issue Jan 26, 2012 · 3 comments

@wesm
Member

wesm commented Jan 26, 2012

This should fail silently if the PyTables version in use does not have the indexing functionality. It should be fairly simple to add, and it only affects Table-formatted objects.
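
One way to get the fail-silently behaviour is to gate index creation on a version check. The sketch below is illustrative only and is not pandas code: `parse_version`, `supports_indexing`, and `maybe_create_index` are hypothetical names, and the minimum version is an assumption (this thread mentions both 2.3 and 2.3.1). In real use the version string would presumably come from `tables.__version__`.

```python
import re

# Assumed minimum; the thread mentions both 2.3 and 2.3.1.
MIN_INDEXING_VERSION = (2, 3)


def parse_version(version):
    """Turn '2.3.1' or '2.4.0b1' into a tuple of leading integers."""
    parts = []
    for piece in version.split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break
        parts.append(int(match.group()))
    return tuple(parts)


def supports_indexing(version, minimum=MIN_INDEXING_VERSION):
    """True if the given version string is at least `minimum`."""
    return parse_version(version) >= minimum


def maybe_create_index(version, create_index):
    """Call `create_index()` only when supported; otherwise do nothing."""
    if not supports_indexing(version):
        return False  # silently skip, as proposed above
    create_index()
    return True
```

Returning a boolean rather than raising keeps the caller free to ignore the result, which matches the "silently fail" request while still letting tests observe whether an index was attempted.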

@jreback
Contributor

jreback commented Jan 27, 2012

This requires PyTables 2.3.1 at a minimum (the 'Pro' version, which is now open source).
There is currently a bug in the PyTables indexing routines that prevents an index on a Time64Col from working - see PyTables/PyTables#119

pandas should try to index the 'index' column in a table (and only in a table), probably by default.
How should we deal with a PyTables version that is not supported?

@jreback
Contributor

jreback commented Nov 12, 2012

This is actually pretty easy since 0.8.1, as the indices are now Int64 and thus indexing works great.
I index the 'index' column by default in all of my panels, but this could be provided as an option. I will take a crack at this after 0.9.1 comes out.

We will have to disable auto-indexing for any Time64Cols, though.
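
As a rough illustration of that exclusion, the sketch below picks which columns to auto-index while skipping any Time64Col. It is a hypothetical helper, not code from pandas: `columns_to_index`, `UNINDEXABLE_KINDS`, and the name-to-kind mapping are all assumptions made for the example; only the column class names (`Int64Col`, `Float64Col`, `Time64Col`) come from PyTables.

```python
# Column kinds that should not be auto-indexed (per the Time64Col bug above).
UNINDEXABLE_KINDS = {"Time64Col"}


def columns_to_index(column_kinds, requested=("index",)):
    """Return the requested columns whose kind is safe to index.

    `column_kinds` maps column name -> PyTables column class name.
    """
    return [
        name
        for name in requested
        if name in column_kinds and column_kinds[name] not in UNINDEXABLE_KINDS
    ]
```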

@jreback
Contributor

jreback commented Nov 15, 2012

pytables_docs closes this PR

jreback added a commit to jreback/pandas that referenced this issue Nov 24, 2012
      1. added __str__ (to do __repr__)
      2. added __delitem__ to support store deletion syntactic sugar
      3. row removal in tables is much faster if rows are consecutive
      4. added Term class, refactored Selection (this is backwards compatible)
         Term is a concise way of specifying conditions for queries, e.g.

            Term(dict(field = 'index', op = '>', value = '20121114'))
            Term('index', '20121114')
            Term('index', '>', '20121114')
            Term('index', ['20121114','20121114'])
            Term('index', datetime(2012,11,14))
            Term('index>20121114')

            added alias to the Term class; you can specify the nominal indexers (e.g. index in DataFrame, major_axis/minor_axis or alias in Panel)

	 this should close GH pandas-dev#1996

      5. added Col class to manage the column conversions
      6. added min_itemsize parameter and checks in pytables to allow setting of indexer columns minimum size
      7. added indexing support via method create_table_index (requires PyTables 2.3)
         (btw this now works quite well, as Int64 indices are used as opposed to Time64Col, which has a bug); includes a check on the pytables version requirement

	 this should close GH pandas-dev#698

      8. significantly updated docs for pytables to reflect all changes; added docs for Table sections
      9. BUG: a store would fail if appending when a put had not been done before (see test_append)
         this was the result of incompatibility testing on the index_kind
     10. BUG: minor change to select and remove: require a table ONLY if where is also provided (and not None)

      all tests pass; tests added for new features
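
The string form in item 4, e.g. `Term('index>20121114')`, implies splitting an expression into a field, an operator, and a value. The sketch below shows one way that split could be done. It is not the parsing code from the commit: `parse_term`, `ParsedTerm`, and the set of supported operators are assumptions chosen for illustration.

```python
import re
from collections import namedtuple

# Hypothetical container for the three parts of a condition.
ParsedTerm = namedtuple("ParsedTerm", ["field", "op", "value"])

# Two-character operators come first so '>=' is not read as '>'.
_TERM_RE = re.compile(r"^\s*(\w+)\s*(>=|<=|!=|>|<|=)\s*(.+?)\s*$")


def parse_term(expr):
    """Split 'field<op>value' into its parts; raise ValueError if malformed."""
    match = _TERM_RE.match(expr)
    if match is None:
        raise ValueError("cannot parse term: %r" % (expr,))
    return ParsedTerm(*match.groups())
```

The value is left as a string here; converting it to a date or number would depend on the kind of the indexed column.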
@wesm wesm closed this as completed Dec 28, 2012
dan-nadler pushed a commit to dan-nadler/pandas that referenced this issue Sep 23, 2019
2 participants