Data('file.csv') no longer infers fields with string values #1254
This worked on 0.8.0, but since I updated, instantiating Data from a .csv file is broken in cases like the following:
Actual vs expected output:
If the values are numbers, then it is inferred correctly:
If the column name is 'y', then it is also inferred as expected:
So the inference of column names is very sensitive to the column names and values in the file.
I will solve it for now by passing an argument to force the first row to be interpreted as column names.
http://blaze.pydata.org/en/latest/csv.html#correcting-csv-dialects states: "In the first case of incorrect guessing of CSV dialect (e.g. delimiter) Blaze respects and passes through all keyword arguments to pandas.read_csv."
Moreover, pandas infers the fields as expected:
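A minimal sketch of that pandas behaviour, using made-up data rather than the file from this report: the column names `x` and `z`, the values, and the helper `read_sample` are all hypothetical. With the default `header='infer'`, `pandas.read_csv` takes the first row as column names even when every value is a string. Passing `header=0` requests the same thing explicitly, and `header=None` treats the first row as data.

```python
import io

import pandas as pd

# Hypothetical sample: two columns whose values are all strings.
CSV_TEXT = "x,z\na,b\nc,d\n"


def read_sample(**kwargs):
    """Parse the sample text with pandas.read_csv, forwarding keyword arguments."""
    return pd.read_csv(io.StringIO(CSV_TEXT), **kwargs)


# Default behaviour (header='infer'): the first row becomes the column names.
inferred = read_sample()

# Explicitly force the first row to be used as column names.
forced = read_sample(header=0)

# For contrast: no header row at all, so the first row is kept as data.
no_header = read_sample(header=None)

print(list(inferred.columns))
print(list(forced.columns))
print(no_header.iloc[0].tolist())
```

Whether a given Blaze version forwards `header=0` unchanged to `pandas.read_csv` is not something this sketch checks; it only shows what pandas itself does with the keyword.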
So for now the workaround is