Skip to content

edublancas/ploomber-workshop

Repository files navigation

Show your support with a star ⭐️ on this repository!


Ploomber Workshop Material

Authors: Eduardo Blancas and Ido Michael

This workshop demonstrates how to develop reproducible pipelines using Ploomber.

To start, click here or on the button below:

Start Workshop

Note: It may take a few seconds for the notebook to load.

Scroll down to the Running it locally section if you prefer to run things locally.

Workshop level: intermediate

Background knowledge

Familiarity with JupyterLab, and a basic knowledge of pandas and scikit-learn.

Workshop content

  1. Introduction
  2. Refactoring a legacy notebook
  3. The pipeline.yaml file.
  4. Building the pipeline
  5. Declaring dependencies
  6. Adding a new task
  7. Incremental builds
  8. Execution in the cloud

Documentation

Running it locally (with conda)

You can also follow this workshop locally, but it requires a bit more setup:

Pre-requisites:

  1. miniconda
  2. git
# clone the repository
git clone https://github.com/edublancas/ploomber-workshop
cd ploomber-workshop

# install dependencies (requires conda)
pip install invoke
invoke setup --from-lock

# activate environment
conda activate ploomber-workshop

# start jupyter
jupyter lab

Then open index.ipynb.

Running it locally (with pip)

# install dependencies
pip install --upgrade pip
pip install -r requirements.dev.txt

# start jupyter
jupyter lab

Then open index.ipynb.

Support us

If you like our project, please give us a ⭐️ on GitHub.

Contact