dagster-integration

Integration of lakeFS with Dagster

Prerequisites

  • Docker installed on your local machine

Setup

  1. Clone this repository and change into the example directory:

    git clone https://github.com/treeverse/lakeFS-samples && cd lakeFS-samples/01_standalone_examples/dagster-integration
  2. Provision the stack, which includes Python, Spark, Jupyter Notebook, the JDK, Hadoop binaries, and the lakeFS Python client:

    docker compose up

    Alternatively, to also provision a lakeFS server and MinIO as the object store alongside Jupyter, bring up the full stack:

    docker compose --profile local-lakefs up
  3. Open the JupyterLab UI at http://127.0.0.1:28888/ in your web browser.
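
If the page does not load right away, the containers may still be starting. The following is a minimal sketch, not part of the repository, for checking from the host whether the JupyterLab address from step 3 answers HTTP requests. The helper name `is_reachable` and the 2-second timeout are assumptions made for illustration; it uses only the Python standard library.

```python
import urllib.error
import urllib.request


def is_reachable(url: str, timeout: float = 2.0) -> bool:
    """Return True if an HTTP request to `url` gets any HTTP response."""
    try:
        with urllib.request.urlopen(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server answered, just not with a 2xx status; it is still up.
        return True
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, DNS failure, and similar.
        return False


if __name__ == "__main__":
    url = "http://127.0.0.1:28888/"  # JupyterLab address from step 3
    print("reachable" if is_reachable(url) else "not reachable yet")
```

Run it again after a few seconds if it reports that the address is not reachable yet; if it never becomes reachable, check the `docker compose` output for errors.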

Demo Instructions

Once setup is complete, open one of the provided notebooks from the JupyterLab UI:

  • Dagster Demo Existing DAG
  • Dagster Demo New DAG