Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Optimize memory usage. #4

Merged
merged 2 commits into from
Mar 12, 2019
Merged

Optimize memory usage. #4

merged 2 commits into from
Mar 12, 2019

Conversation

yunhailuo
Copy link
Collaborator

In general, I pass file path between functions rather and pandas DataFrame so that there won't be a lot of dfs stay in the memory for more time than needed.

@yunhailuo yunhailuo self-assigned this Feb 20, 2019
@yunhailuo
Copy link
Collaborator Author

These are codes facilitating:

  1. ETL large dataset
  2. Compare current Xena and GDC datasets, find differences so that only extra datasets need to be loaded

I definitely used them before. But I never pushed them. I'll merge them after some code reviews. @maryjgoldman

@yunhailuo
Copy link
Collaborator Author

I think the code is fine. Will merge after #6 .

@yunhailuo
Copy link
Collaborator Author

Merge all these before doing structural changes

@yunhailuo yunhailuo merged commit 23c5dcb into master Mar 12, 2019
@yunhailuo yunhailuo deleted the develop branch March 12, 2019 04:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant