Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve memory usage of read_csv, Python objects take up too much #407

Closed
wesm opened this issue Nov 23, 2011 · 8 comments
Closed

Improve memory usage of read_csv, Python objects take up too much #407

wesm opened this issue Nov 23, 2011 · 8 comments
Milestone

Comments

@wesm
Copy link
Member

wesm commented Nov 23, 2011

No description provided.

@wesm wesm closed this as completed Nov 2, 2012
@sanand0
Copy link

sanand0 commented Nov 6, 2012

Wes, was this closed because it's fixed? Does read_csv now work with larger datasets?

@wesm
Copy link
Member Author

wesm commented Nov 6, 2012

It will in 0.10, re: http://wesmckinney.com/blog/?p=543

@fivejjs
Copy link

fivejjs commented Jan 22, 2015

Seem not.
When I do df = pd.read_csv('test.gz', compression='gzip'), it some times fails.

@TomAugspurger
Copy link
Contributor

Hi @fivejjs, this is an old issue.

If you have a specific example that we can reproduce, could you create a new issue with the steps the cause the failure?

@cpcloud
Copy link
Member

cpcloud commented Jan 22, 2015

Also @fivejjs make sure that your issue is related to memory usage. There are all sorts of reasons that your example above could fail. If it's failing at all, please create a new issue.

@snetts
Copy link

snetts commented Jul 25, 2016

I am also having the same problem. I am trying to read a CSV file which is approximately 1gb in size and it fails all the time

@kvallinatest
Copy link

Is it fixed ?

@jorisvandenbossche
Copy link
Member

There is nothing to be 'fixed' here, as this issue is not about a very specific problem. If you have a reproducible use case where you run into troubles with read_csv, please open a new issue.

dan-nadler pushed a commit to dan-nadler/pandas that referenced this issue Sep 23, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

8 participants