Scheduling, Execution, Storage and Dependency Manager for Data Jobs
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
distiller
doc
docker
.gitignore
LICENSE
README.md
cli.py
docker-build.sh
docker-run.py
requirements.txt
setup.cfg

README.md

The Distiller

The distiller is a dependency management, data management, job scheduling and job distribution system for data scientists. Its aim is to automate and manage the execution of data-focused batch jobs and keep its data up-to-date while keeping the batch execution to a minimum with lazy execution.

Documentation

To see the documentation install Sphinx (pip install sphinx), run (cd doc && make html), then go to doc/build/html/index.html.