BUG: workaround PyTables 319, but not setting expected rows (GH8265, GH9676) #9681

jreback · 2015-03-19T11:53:16Z

closes #8265
closes #9676

jreback · 2015-03-19T11:54:38Z

give this a try and see if it fixed the issues you guys reported. lmk and can put this in 0.16.0

…GH9676) seems that setting expected rows casuses odd indexing issues in some cases

rockg · 2015-03-19T14:37:46Z

That did not work for me and actually returned fewer records than previously. With this change, 2804 records were returned versus 2892 previously (the actual number should be 2972 records).

rockg · 2015-03-19T14:41:10Z

Maybe important to note that I tested this on 0.14.0 and putting your change in as that is what I have at work. I don't know if there have been other substantive changes that might impact my test.

alexfields · 2015-03-19T17:07:26Z

My test agrees with @rockg - resaving my "problem" hdf via this commit and then reading with where statements gives slightly fewer lines than before (192334 vs 193757 previously, full file is 202836). Sorry @jreback! BTW in my case I am not using chunks or start/stop, I am selecting on the full file, and it still fails.

In the meantime I've just been calling ptrepack every time I save an HDF. I don't know if this always solves the problem but it has solved it every time I've tested, and gives an added bonus of speeding up on-disk selects anyway. As long as the tables saved by ptrepack are OK, this doesn't seem like such a bad workaround for now.

jreback · 2015-03-19T18:39:22Z

hmm ok
can u guys test with master on your dataset when u have a chance and lmk?

alexfields · 2015-03-19T19:44:25Z

Master 026a122 is giving me the same 193757 number as in 0.15.2 (different from the 7480a4b commit you sent earlier).

jreback · 2015-03-19T20:39:15Z

ok, seems that even though MY test worked, something else is going on. Ok will close this and can bug the PyTables guys to see if they can fix.

jreback added Bug IO HDF5 read_hdf, HDFStore labels Mar 19, 2015

jreback added this to the 0.16.0 milestone Mar 19, 2015

BUG: workaround PyTables 319, but not setting expected rows (GH8265, …

7480a4b

…GH9676) seems that setting expected rows casuses odd indexing issues in some cases

jreback force-pushed the pt branch from f447874 to 7480a4b Compare March 19, 2015 12:53

jreback closed this Mar 19, 2015

This pull request was closed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG: workaround PyTables 319, but not setting expected rows (GH8265, GH9676) #9681

BUG: workaround PyTables 319, but not setting expected rows (GH8265, GH9676) #9681

jreback commented Mar 19, 2015

jreback commented Mar 19, 2015

rockg commented Mar 19, 2015

rockg commented Mar 19, 2015

alexfields commented Mar 19, 2015

jreback commented Mar 19, 2015

alexfields commented Mar 19, 2015

jreback commented Mar 19, 2015

BUG: workaround PyTables 319, but not setting expected rows (GH8265, GH9676) #9681

BUG: workaround PyTables 319, but not setting expected rows (GH8265, GH9676) #9681

Conversation

jreback commented Mar 19, 2015

jreback commented Mar 19, 2015

rockg commented Mar 19, 2015

rockg commented Mar 19, 2015

alexfields commented Mar 19, 2015

jreback commented Mar 19, 2015

alexfields commented Mar 19, 2015

jreback commented Mar 19, 2015