Experimental/funcx #712
…user funcs with funcx, pass in endpoint ids
Pull Request Test Coverage Report for Build 1491745447
…, creates new remote-test user funcs
I'd remove the changes in the alloc_fs and make it so the manager doesn't error when receiving a None persis_info.
…e how worker.py detects of funcx enabled
Thoughts on loss of coverage in openmpi runners?
This was never the problem in the first place, since the simple routines were only running with local comms anyway! And all the runners are covered in the simple coverage jobs: https://coveralls.io/jobs/90108856/source_files/14509258687. So I'm still trying to figure out what's up.
…pps dont collide when helloworld imported
I've completely rewritten the funcX test to be a scaling test. Try this out on Theta:
Configure
Then finally on Theta:
On your local machine (or any machine with internet access where you want to run libEnsemble), replace Then on your machine where you'll run libEnsemble, run the calling script as usual. As long as the local Using
libensemble/manager.py
Outdated
    if 'persis_info' in D_recv and len(D_recv['persis_info']):
        persis_info[w].update(D_recv['persis_info'])
    if D_recv.get('persis_info') is not None:
Remind me - why not just if D_recv.get('persis_info') instead of 436/437?
That's probably valid. The original work was to address how persis_info is handled by the manager if the user didn't specify one but the alloc tried packing it up anyway.
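To make the difference between the checks discussed above concrete, here is a minimal sketch. The helper names `old_check` and `merge_persis_info` are hypothetical and are not part of libensemble/manager.py; `D_recv`, `persis_info`, and `w` follow the snippet under review. It assumes `persis_info[w]` already exists as a dict.

```python
def old_check(D_recv):
    # Original condition: calls len() on the value, so a None value raises TypeError.
    return 'persis_info' in D_recv and len(D_recv['persis_info'])


def merge_persis_info(persis_info, w, D_recv):
    # Hypothetical helper: the truthiness test skips a missing key,
    # a None value, and an empty dict in one step.
    received = D_recv.get('persis_info')
    if received:
        persis_info[w].update(received)
    return persis_info


persis_info = {1: {'rand_seed': 7}}
merge_persis_info(persis_info, 1, {'persis_info': None})           # ignored
merge_persis_info(persis_info, 1, {})                              # ignored
merge_persis_info(persis_info, 1, {'persis_info': {'count': 3}})   # merged
```

The `is not None` form behaves the same for these inputs, except that it still calls `update` with an empty dict, which is a no-op.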
Wondering if we can avoid some of the duplication of forces stuff. Also looks like there is a binary there (forces.x).
…oves redundant forces material
Addressed.
Addresses #713
Initial attempts to draft an optional funcX interface so workers can easily launch non-persistent user functions on remote resources:
https://funcx.org/
Some notes:
To try this out locally:
- Install funcX and the endpoint package:
  pip install funcx funcx-endpoint
- Configure an endpoint:
  funcx-endpoint configure my-endpoint
- Start the endpoint:
  funcx-endpoint start my-endpoint
  A message will confirm startup and print the endpoint's uuid.
- Set sim_specs['funcx_endpoint'] to this uuid.
- Run the test:
  python test_funcx.py --comms local --nworkers 4
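As a rough illustration of the `sim_specs['funcx_endpoint']` step, the sketch below attaches an endpoint uuid to a `sim_specs` dict after checking that it parses as a uuid. The helper `with_funcx_endpoint`, the all-zero placeholder uuid, and the other `sim_specs` entries are assumptions for illustration only; substitute the uuid printed when the endpoint starts.

```python
import uuid


def with_funcx_endpoint(sim_specs, endpoint_id):
    # Hypothetical helper: uuid.UUID raises ValueError on a malformed string,
    # which catches copy/paste mistakes before any work is submitted.
    parsed = uuid.UUID(endpoint_id)
    updated = dict(sim_specs)  # shallow copy; the caller's dict is left unchanged
    updated['funcx_endpoint'] = str(parsed)
    return updated


# Placeholder uuid, not a real endpoint.
sim_specs = with_funcx_endpoint(
    {'in': ['x'], 'out': [('f', float)]},
    '00000000-0000-0000-0000-000000000000',
)
```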
Documentation and/or other improvements coming soon?
TODO:
- [ ] Test cancellations. Comm is necessary for manager.poll, so this doesn't work. Function cancellation is a work-in-progress with the funcX team: "Support for cancelling tasks in HighThroughputExecutor" (globus/globus-compute#606)