Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

paper.csv is too large to save in my computer #20

Open
rebecca312 opened this issue May 9, 2018 · 1 comment
Open

paper.csv is too large to save in my computer #20

rebecca312 opened this issue May 9, 2018 · 1 comment

Comments

@rebecca312
Copy link

When I tried to run the pipeline, paper.csv was generated from Miner-Papertxt (about 2.2G). And the paper.csv file was too large (exceeded 1.7T) but my computer has only about 2T storage space. So it failed each time I run the project. Do you know how to fix this?

@macks22
Copy link
Owner

macks22 commented Jan 31, 2023

I'm surprised it's so big. I didn't catalog file sizes, but I don't remember anything being even close to 1T in size. IIRC, I was able to store everything on a machine with only 500G. It's been a while though, so I may be misremembering.

You could try modifying the code that writes the file to compress it first. I think pandas supports writing in compressed formats via extra kwargs.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants