Repository for playing with Spark
Updated Oct 13, 2020 - Scala
Data Tweak is a simplified, lightweight ETL framework based on Apache Spark.
Spark library for constructing ETL pipelines with monads
Broadway is a distributed, actor-based processing server optimized for high-speed data/file ingestion.
Yet Another SPark Framework
Write ETL using your favorite SQL dialects
📁 Extract, Transform, Load (ETL) 👷 refers to a process in database usage, especially in data warehousing. This repository contains a starter kit featuring ETL-related work.
EtlFlow is an ecosystem of functional libraries in Scala, based on ZIO, for running complex, auditable workflows that can interact with Google Cloud Platform, AWS, Kubernetes, databases, SFTP servers, on-prem systems, and more.
Apache Spark-based data flow (ETL) framework that supports multiple read and write destinations of different types, as well as multiple categories of transformation rules.
Examples of SeaTunnel plugin development.
A simplified, lightweight ETL Framework based on Apache Spark