Python hangs after writing a few Parquet tables #17324
Comments
Wes McKinney / @wesm: @xhochy I think we need to get patched builds out ASAP. Let me know how you want to proceed |
Wes McKinney / @wesm: |
Keith Curtis: |
Keith Curtis: def to_parquet(output_file, csv_file): When Python seemed hung (after 3 minutes with no progress), I captured a stack trace with gdb and attached the file. I'm running on Ubuntu 14.04.3. I installed into a conda virtual environment using pip. |
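For anyone trying to reproduce a hang like this without attaching gdb, Python's standard `faulthandler` module can dump the Python-level stack of every thread once a call has run longer than a timeout. The following is only a minimal sketch: the wrapper name `run_with_hang_dump` and the 180-second default (taken from the "3 minutes" above) are illustrative assumptions, not part of pyarrow. It shows Python frames only, so native frames such as the jemalloc ones in the gdb trace would still need gdb.

```python
import faulthandler
import sys


def run_with_hang_dump(func, *args, timeout=180, file=sys.stderr, **kwargs):
    """Call func(*args, **kwargs); dump all thread stacks if it runs too long.

    Hypothetical helper for illustration. `file` must be a real file object
    (it needs a file descriptor), because faulthandler writes to it directly.
    """
    # Arm a watchdog: after `timeout` seconds, write the traceback of every
    # Python thread to `file`. exit=False leaves the process running.
    faulthandler.dump_traceback_later(timeout, exit=False, file=file)
    try:
        return func(*args, **kwargs)
    finally:
        # Disarm the watchdog once the call returns or raises.
        faulthandler.cancel_dump_traceback_later()
```

A call such as `run_with_hang_dump(to_parquet, output_file, csv_file)` would leave the conversion untouched and only add the timed dump around it.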
Wes McKinney / @wesm: If you are using pip, can you try |
Wes McKinney / @wesm: |
Wes McKinney / @wesm: |
Wes McKinney / @wesm: |
Keith Curtis: I re-ran my script, but Python appeared to hang again, and the stack trace looks similar: #0 je_spin_adaptive (spin=) at include/jemalloc/internal/spin.h:40 |
Keith Curtis: |
Keith Curtis: |
Wes McKinney / @wesm: |
Uwe Korn / @xhochy: |
Keith Curtis: I see there are a lot of threads in there (64?), more than I expected. I ran it from the ipython qtconsole, so maybe that has something to do with it. Hope that helps. |
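One way to check a thread count like this from inside the process: `threading.active_count()` only sees threads started through Python, while threads created by native libraries appear in the kernel's per-process status. The sketch below assumes Linux (the report mentions Ubuntu) and reads the `Threads:` field from `/proc/<pid>/status`. The function name `native_thread_count` is an illustrative assumption.

```python
import os
import threading


def native_thread_count(pid=None):
    """Return the kernel's thread count for a process, or None if unavailable.

    Hypothetical helper; relies on the Linux /proc filesystem.
    """
    pid = os.getpid() if pid is None else pid
    try:
        with open("/proc/{}/status".format(pid)) as status:
            for line in status:
                # The line looks like "Threads:\t64".
                if line.startswith("Threads:"):
                    return int(line.split()[1])
    except OSError:
        # No /proc (non-Linux), no such process, or no permission.
        pass
    return None


if __name__ == "__main__":
    print("Python-level threads:", threading.active_count())
    print("OS-level threads:", native_thread_count())
```

Comparing the two numbers before and after importing a native library gives a rough idea of how many threads come from outside Python.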
Todd Farmer / @toddfarmer: |
I had a program that read some CSV files (a few million rows each, 9 columns) and converted them with:
The first CSV file would always complete, but Python would hang on the second or third file, and sometimes on a much later file.
Environment: Python 3.5.2, pyarrow 0.5.0
Reporter: Keith Curtis
Assignee: Wes McKinney / @wesm
Original Issue Attachments:
Note: This issue was originally created as ARROW-1311. Please see the migration documentation for further details.