-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Tablite throws IndexError when reading a complex CSV file #33
Comments
Hey @ypanagis The file you use has claims to have 31 headers, but there are only 30 columns. This will work for you:
PS> Note that your file is cut mid-row in your file |
Hi @root-11 and thank you for your reply. First of all, yes this CSV is rather ill-structured and is missing values at some columns. One of those is the column "court" as you very correctly noticed. I didn't know of the I played a bit in the example script that you gave, with setting |
That's python multiprocessing module crashing. |
I run the script on MacOS, can this also be an issue with multiprocessing? |
I'm not sure. Can you try to run the test multiprocessing test suite in this script: If that doesn't work I'll have to do a deeper dive to why MacOS behaves differently. |
I changed to Python 3.9 as you suggested but gives me now the error in the attached file. My PC has also mamba installed the environment is now a mamba one but I hope this is not a problem. Note that I saw the same error when I removed the last two columns that had some emtpy values, in case that caused issues.
|
Thanks for that Yannis! I'll look into that immediately. |
So the error says that Could you run this on your mac for me:
|
Thanks Bjorn, I just ran it and gives this
|
@ypanagis - you think we can close this ticket now? |
Yes @root-11 makes total sense to me. Will try the package some more, but this part is definitely over now. |
Neat. Just FYI: I've released a new version today with slightly better memory management. |
I am trying tablite with a CSV file with many fields and some of which are long, more specifically full texts. When reading with
Table.import_file
, I get the following exceptionWhen the same dataset is converted to xlsx and then opened, no errors occur. I am attaching an example file that causes the error.
test.csv
The text was updated successfully, but these errors were encountered: