sudohainguyen/mini-lakehouse

Mini Data Lakehouse

Build your own data lakehouse at home from open-source components, to power a machine learning feature store and data visualization.

General Architecture

Architecture Diagram

Guides

For docker-based setup

Go to docker/README.md

For k8s-based setup

(TBD)

Roadmap

  • Deploy Hive metastore
  • Deploy Trino cluster with 2 workers
  • Connect Trino to BigQuery (won't do)
  • Replace GCS with MinIO
  • Connect Trino to Feast (demo)
  • Deploy Superset dashboard to serve data
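
The Hive metastore, Trino, and MinIO items above typically come together in a Trino catalog file. The sketch below is only an illustration, not this repository's actual configuration: the helper name `render_hive_catalog`, the host names, ports, and credentials are all assumptions, and the property names follow Trino's legacy `hive.s3.*` settings, which newer Trino releases have replaced. Check the Trino documentation for the version you deploy.

```python
# Minimal sketch: render a Trino Hive catalog file that points at a
# Hive metastore and an S3-compatible store such as MinIO.
# All hosts, ports, and credentials are placeholders (assumptions).


def render_hive_catalog(metastore_uri, s3_endpoint, access_key, secret_key):
    """Return the text of a hive.properties catalog file."""
    props = {
        "connector.name": "hive",
        "hive.metastore.uri": metastore_uri,
        # Legacy S3 settings; newer Trino versions use different names.
        "hive.s3.endpoint": s3_endpoint,
        "hive.s3.aws-access-key": access_key,
        "hive.s3.aws-secret-key": secret_key,
        # MinIO is usually addressed path-style rather than by bucket subdomain.
        "hive.s3.path-style-access": "true",
        "hive.s3.ssl.enabled": "false",
    }
    return "".join(f"{key}={value}\n" for key, value in props.items())


if __name__ == "__main__":
    print(
        render_hive_catalog(
            metastore_uri="thrift://hive-metastore:9083",  # assumed service name
            s3_endpoint="http://minio:9000",  # assumed service name
            access_key="minio-access-key",  # placeholder
            secret_key="minio-secret-key",  # placeholder
        )
    )
```

The printed text could be saved as a catalog file such as `etc/catalog/hive.properties` on the Trino coordinator and workers; the exact path depends on how the containers are laid out.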