At this stage, Parman is an experimental project, so expect a rocky road ahead.
The goal of Parman is to extend concurrent.futures
(and compatible implementations)
with features that facilitate a transparent implementation of workflows.
WaitFuture
: a Future subclass that is "finished" after its dependencies have finished. (To be created withWaitGraph.submit
, which never blocks.)ScheduledFuture
: a Future subclass that submits a Future after its dependencies have finished. (To be created withScheduler.submit
, which never blocks.)- Various
Runner
classes, similar to Executors, which dispatch function calls elsewhere. The main differences with conventional executors being:- Closures are submitted for (remote) execution, which contain more metadata, e.g. about (keyword) arguments and return values, than ordinary functions The extra metadata offer several advantages...
- A dry run can be carried out to quickly validate the connectivity of steps in the workflow before launching a full scale calculation.
- Closure arguments may contain futures.
If
schedule=True
is set, closures are scheduled for later execution when not all dependency futures have finished yet. (Dependencies are inferred from the arguments and keyword arguments.) Otherwise, the runner will block until all required futures have completed. - Closure return values are instantiated as much as possible, instead of just returning a single future. They may contain futures more deeply nested for parts of the return value, This makes it easier to submit more closures further down the workflow.
As a result, workflows can be implemented efficiently with relatively simple Python scripts, mostly hiding the interaction with Future objects.
Other useful features:
- Compatible with Python's built-in Concurrent package and Parsl. (Parls is an optional dependency.)
- Simplicity:
- Template jobs, for a straightforward migration of existing job scripts.
- Minimal Python package dependencies.
- Minimal API.
python -m pip install parman
At this stage, there is no documentation as such. If you want to learn how to use Parman, check out the demos. If you want to understand the internals, read the source and the docstrings.
- Support for Dask, because:
- The Dask
Future
does not subclass fromconcurrent.futures.Future
. Supporting dask would imply a lot of extra boilerplate code in Parman. - The Dask
Future
implements only a subset ofconcurrent.futures.Future
. - Dask Distributed has a large memory and time overhead.
- The Dask
- Simplify usage.
- Add more examples.
- Tutorial.