-
Notifications
You must be signed in to change notification settings - Fork 24
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Gb/misc features #72
Gb/misc features #72
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- create a log_memory function in loggers.py to replace the memory logging throughout reV:
log_memory(logger, level='DEBUG') - Update DatasetCollector to no longer use SmartParallelJob and instead transfer file by file, dataset by dataset with smart logic to check to see if the entire file/dataset can fit in memory, if not then transfer as many chunks as will fit in memory at a time until file/dataset has been transfered
…ods including smartparalleljob. Tests pass
…nks). Added test to make sure it works.
@MRossol, added those two features. Cleaned up collection a lot. Made a low memory test, although I realize now that this will be kind of hardware dependent. Still, works on my machine! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The architecture looks great, much cleaner.
A few questions comments on method names
""" | ||
Add results from SmartParallelJob to out | ||
@staticmethod | ||
def _get_site_mem_req(shape, dtype, n=100): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm confused is this the memory for one site, one chunk, or the whole dataset?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just for one site. I realize i forgot to update some of the docstrings. Fixed.
reV/handlers/collection.py
Outdated
.format(os.path.basename(fp_source), e)) | ||
raise e | ||
|
||
def _low_mem_collect(self): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
if this is the standard collection method I think we should rename it to just _collect
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Agreed!
Rebase and merge away |
Addition of four features:
Pytests added for all except 2 (which i tested manually on eagle).