lakeFS - Data version control for your data lake | Git for data
-
Updated
May 27, 2024 - Go
lakeFS - Data version control for your data lake | Git for data
OpenSource data platform to build event-driven systems. It's like Deebezium for golang :)
The open source high performance ELT framework powered by Apache Arrow
Fancy stream processing made operationally mundane
Compute over Data framework for public, transparent, and optionally verifiable computation
Apache DevLake is an open-source dev data platform to ingest, analyze, and visualize the fragmented data from DevOps tools, extracting insights for engineering excellence, developer experience, and community growth.
Memphis.dev is a highly scalable and effortless data streaming platform
Conduit streams data between data stores. Kafka Connect replacement. No JVM required.
Workflow Engine for Kubernetes
Substation is a toolkit for routing, normalizing, and enriching security event and audit logs.
Service for bulk-loading data to databases with automatic schema management (Redshift, Snowflake, BigQuery, ClickHouse, Postgres, MySQL)
A lightweight CLI tool for versioning data alongside source code and building data pipelines.
A Kubernetes Operator to orchestrate Benthos pipelines
Go library to create and manage data pipelines on your machine
Transform your pythonic research to an artifact that engineers can deploy easily.
exercises from scaler academy course on DSA | Data Engineering | System Design
CloudQuery Provider for Scaleway
Spotify-inspired Change Data Capture with Kafka, Cassandra and Kubernetes
Add a description, image, and links to the data-engineering topic page so that developers can more easily learn about it.
To associate your repository with the data-engineering topic, visit your repo's landing page and select "manage topics."