Skip to content

datamindedbe/conveyor-samples

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

98 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Open in Gitpod

Conveyor Samples

This repository contains a number of sample projects for Conveyor

Getting started

  1. Clone this repository
  2. Verify you have the Conveyor CLI installed by executing conveyor doctor
  3. Authenticate the Conveyor CLI conveyor auth login
  4. Create a new conveyor environment conveyor environment create --name samples
  5. Execute the additional steps listed in the prerequisites folder

Samples

Basic

  • pi_spark: use Apache Spark to calculate pi.
  • first_project_dbt: use dbt and DuckDB for the first time. Using this project is described in the Conveyor getting started guide.
  • coffee_shop_dbt: use dbt and DuckDB for cleaning and transforming the coffee shop input data and writing the results to S3.

MLOps

  • titanic: use the dataset from Kaggle to build a ML pipeline that will produce a model to predict survival.
  • housing: use the dataset from Kaggle to build a ML pipeline that will produce a model to predict the housing prices.

Various

  • alerting: use native Apache Airflow features to send notifications to Slack.

Releases

No releases published

Packages

No packages published