-
Notifications
You must be signed in to change notification settings - Fork 3.5k
-
Notifications
You must be signed in to change notification settings - Fork 3.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
read_csv behavior with blank lines differs between CSV deliimters #2883
Comments
@ultrabug this part of the project is very Alpha so expect to find such issues. Thanks for reporting, can you open a JIRA issue? |
Hi Wes, thanks for your answer. I'll open a JIRA issue then! I will reference the JIRA issue here and close the issue right after |
JIRA ARROW-3700 created https://issues.apache.org/jira/browse/ARROW-3700 Cheers! |
Hi, has this issue really been fixed? I'm getting the same error:
I'm using version 0.11.1 |
Please update to 0.12.0 and let us know if it still doesn't work |
Hi,
I was playing with
pyarrow.csv
read_csv
and found a rather strange behavior that I'm not sure is normal.Parsing will fail if the delimiter of the CSV file is a comma and there's a blank line after the header (see
basic_with_blank.csv
example)Example output:
If I change the CSV delimiter to semicolon, the error disappears and everything is fine!
I'm providing python code and CSV samples which compares with pandas (which does not suffer from this).
Hope this helps, thanks
csv_parse_error.zip
The text was updated successfully, but these errors were encountered: