GitHub - fatelei/ploomber: The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️

Ploomber is the fastest way to build data pipelines ⚡️. Use your favorite editor (Jupyter, VSCode, PyCharm) to develop interactively and deploy ☁️ without code changes (Kubernetes, Airflow, AWS Batch, and SLURM). Do you have legacy notebooks? Refactor them into modular pipelines with a single command.

Installation

Compatible with Python 3.6 and higher.

Install with pip:

pip install ploomber

Or with conda:

conda install ploomber -c conda-forge

Getting started

Open a hosted JupyterLab instance:

Run an example locally:

# ML pipeline example
ploomber examples -n templates/ml-basic -o ml-basic
cd ml-basic

# install dependencies
pip install -r requirements.txt

# run pipeline
ploomber build

You just ran a Ploomber pipeline! 🎉

Check out the output folder: you'll find an HTML report with the model results!

The pipeline.yaml file contains the pipeline declaration. Feel free to modify any of the tasks, then run ploomber build again to update the results. (Note: if you use VSCode or PyCharm, run ploomber nb -i before editing the files.)

What's next?

Ready to migrate your project? Click here.

Do you want to learn more? Check out the introductory tutorial.

Run more examples.

Community

Main Features

⚡️ Get started quickly

A simple YAML API to get started quickly, a powerful Python API for total flexibility.

⏱ Shorter development cycles

Automatically cache your pipeline’s previous results and only re-compute tasks that have changed since your last execution.
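The idea behind incremental builds can be illustrated with a minimal sketch. This is not Ploomber's implementation: the `source_hash` and `build` helpers, the task tuples, and the JSON metadata file are all hypothetical names chosen for illustration. The sketch assumes a task is outdated when its source code changed since the last run or when any of its upstream tasks was re-run.

```python
import hashlib
import json
from pathlib import Path


def source_hash(source):
    """Return a stable fingerprint of a task's source code."""
    return hashlib.sha256(source.encode("utf-8")).hexdigest()


def build(tasks, metadata_path):
    """Run tasks in order, skipping those that are still up to date.

    tasks: list of (name, source, upstream_names), already in execution order.
    metadata_path: JSON file storing the fingerprint from the previous run.
    Returns the names of the tasks that were (re)run.
    """
    path = Path(metadata_path)
    stored = json.loads(path.read_text()) if path.exists() else {}
    executed = []

    for name, source, upstream in tasks:
        fingerprint = source_hash(source)
        outdated = (
            stored.get(name) != fingerprint              # source changed
            or any(dep in executed for dep in upstream)  # upstream re-ran
        )
        if outdated:
            # A real runner would execute the task here (hypothetical step).
            executed.append(name)
            stored[name] = fingerprint

    path.write_text(json.dumps(stored))
    return executed
```

With three chained tasks, the first call runs all of them, an immediate second call runs none, and editing the middle task's source re-runs only that task and the one downstream of it.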

☁️ Deploy anywhere

Run as a shell script on a single machine, or distributed across Kubernetes, Airflow, AWS Batch, or SLURM.

📙 Automated migration from legacy notebooks

Bring your old monolithic notebooks, and we’ll automatically convert them into maintainable, modular pipelines.

I want to migrate my notebook.

Show me a demo.
