You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The following results in a KeyError (full stack trace on pastebin.com):
dask_df=dd.read_csv('test.csv', index_col='id1')
However, setting the index column using the set_index method works (e.g. as follows) for single-columned indexes (MultiIndexes are not yet supported in Dask).
Indexes have to be a bit more strange when working in parallel. I think that, for the moment we should just raise an informative error pointing people to use set_index explicitly.
Sample data:
The following results in a
KeyError
(full stack trace on pastebin.com):However, setting the index column using the
set_index
method works (e.g. as follows) for single-columned indexes (MultiIndexes are not yet supported in Dask).Issue detected when using Dask 0.7.1 and pandas 0.16.2.
The text was updated successfully, but these errors were encountered: