This proposes a project structure to implement multiple-layers ETL in Spark context
-
Updated
Jul 26, 2024 - Scala
This proposes a project structure to implement multiple-layers ETL in Spark context
A simplified, lightweight ETL Framework based on Apache Spark
Write ETL using your favorite SQL dialects
EtlFlow is an ecosystem of functional libraries in Scala based on ZIO for running complex Auditable workflows which can interact with Google Cloud Platform, AWS, Kubernetes, Databases, SFTP servers, On-Prem Systems and more.
spark library to construct ETL pipeline with monads
Yet Another SPark Framework
seatunnel plugin developing examples.
Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple categories of transformation rules.
Data Tweak is a simplified, lightweight ETL framework based on Apache Spark.
Repository for playing with spark
📁 Extract, Transform, Load (ETL) 👷 refers to a process in database usage and especially in data warehousing. This repository contains a starter kit featuring ETL related work.
Broadway is a distributed actor-based processing server optimized for high-speed data/file ingestion
Add a description, image, and links to the etl-framework topic page so that developers can more easily learn about it.
To associate your repository with the etl-framework topic, visit your repo's landing page and select "manage topics."