Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

read_csv crashes when run on multiple threads #2608

Closed
mkleehammer opened this issue Dec 28, 2012 · 0 comments
Closed

read_csv crashes when run on multiple threads #2608

mkleehammer opened this issue Dec 28, 2012 · 0 comments
Assignees
Labels
Bug IO Data IO issues that don't fit into a more specific label Multithreading Parallelism in pandas
Milestone

Comments

@mkleehammer
Copy link

Running multiple threads each calling read_csv crashes on OS/X. I've seen two traps, which I'll put into an attachment. Sometimes it says "Fatal Python error: GC object already tracked\nAbort trap: 6"" and sometimes "Segmentation fault: 11".

I've put together a small example to reproduce it: https://gist.github.com/4401461

I've also added some of the OS/X crash report in case it isn't crashing on your install.

I'm using the built-in Python 2.7.2 on OS/X. Pandas 0.10.0 was built locally using pip install -U pandas. I don't know if this is new to 0.10.0 since I wasn't threading 0.9 yet.

This might be relevant for the GC already tracked: http://pyrit.wordpress.com/2010/02/18/385/

@ghost ghost assigned wesm Jan 21, 2013
@wesm wesm closed this as completed in a05ee0e Jan 21, 2013
yarikoptic added a commit to neurodebian/pandas that referenced this issue Jan 23, 2013
Version 0.10.1

* tag 'v0.10.1': (195 commits)
  RLS: set released to true
  RLS: Version 0.10.1
  TST: skip problematic xlrd test
  Merging in MySQL support pandas-dev#2482
  Revert "Merging in MySQL support pandas-dev#2482"
  BUG: don't let np.prod overflow int64
  RLS: note changed return type in DatetimeIndex.unique
  RLS: more what's new for 0.10.1
  RLS: some what's new for 0.10.1
  API: restore inplace=TRue returns self, add FutureWarnings. re pandas-dev#1893
  Merging in MySQL support pandas-dev#2482
  BUG: fix python 3 dtype issue
  DOC: fix what's new 0.10 doc bug re pandas-dev#2651
  BUG: fix C parser thread safety. verify gil release close pandas-dev#2608
  BUG: usecols bug with implicit first index column. close pandas-dev#2654
  BUG: plotting bug when base is nonzero pandas-dev#2571
  BUG: period resampling bug when all values fall into a single bin. close pandas-dev#2070
  BUG: fix memory error in sortlevel when many multiindex levels. close pandas-dev#2684
  STY: CRLF
  BUG: perf_HEAD reports wrong vbench name when an exception is raised
  ...
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Bug IO Data IO issues that don't fit into a more specific label Multithreading Parallelism in pandas
Projects
None yet
Development

No branches or pull requests

2 participants