Optimize memory usage. #4

yunhailuo · 2019-02-20T17:21:07Z

In general, I pass file path between functions rather and pandas DataFrame so that there won't be a lot of dfs stay in the memory for more time than needed.

yunhailuo · 2019-02-21T05:40:38Z

These are codes facilitating:

ETL large dataset
Compare current Xena and GDC datasets, find differences so that only extra datasets need to be loaded

I definitely used them before. But I never pushed them. I'll merge them after some code reviews. @maryjgoldman

yunhailuo · 2019-03-01T06:32:59Z

I think the code is fine. Will merge after #6 .

yunhailuo · 2019-03-12T04:44:45Z

Merge all these before doing structural changes

yunhailuo self-assigned this Feb 20, 2019

Optimize memory usage.

bc67c3c

yunhailuo force-pushed the develop branch from e7c4e26 to a0ae7c7 Compare February 21, 2019 05:29

Update and add scripts facilitating ETL

524bd3d

yunhailuo force-pushed the develop branch from a0ae7c7 to 524bd3d Compare March 1, 2019 06:31

yunhailuo merged commit 23c5dcb into master Mar 12, 2019

yunhailuo deleted the develop branch March 12, 2019 04:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize memory usage. #4

Optimize memory usage. #4

yunhailuo commented Feb 20, 2019

yunhailuo commented Feb 21, 2019

yunhailuo commented Mar 1, 2019

yunhailuo commented Mar 12, 2019

Optimize memory usage. #4

Optimize memory usage. #4

Conversation

yunhailuo commented Feb 20, 2019

yunhailuo commented Feb 21, 2019

yunhailuo commented Mar 1, 2019

yunhailuo commented Mar 12, 2019