etl-pipeline
Here are 16 public repositories matching this topic...
Repository for playing with Spark
Updated Oct 13, 2020 - Scala
Data Tweak is a simplified, lightweight ETL framework based on Apache Spark.
Updated Jan 26, 2021 - Scala
Apache Spark-based data flow (ETL) framework that supports multiple read and write destinations of different types, as well as multiple categories of transformation rules.
Updated Jun 7, 2021 - Scala
STM data enrichment: Extract, Transform, Load (i.e., ETL)
Updated Jun 11, 2021 - Scala
Scala data pipeline for processing Amazon movie reviews data using Kafka and Spark Streaming
Updated Jul 25, 2021 - Scala
Data monitoring tool that monitors the result, not the run
Updated Dec 16, 2021 - Scala
Big data processing (real-time ETL data pipeline) using Avro Schema Registry, Spark, Kafka, HDFS, Hive, Scala, Docker, and Spark Streaming
Updated Dec 20, 2021 - Scala
Batch ETL data pipeline built on HDP 3.0 to process daily sales and business data and produce Power BI reports. The pipelines are automated using Airflow.
Updated Dec 29, 2021 - Scala
SeaTunnel plugin development examples.
Updated Jan 3, 2022 - Scala
This project is a template for performing ETL using Kafka, Spark, and Hive.
Updated Feb 20, 2022 - Scala
Arrival delay time prediction of commercial flights (UPM's Master in Data Science project for Big Data subject)
Updated Dec 24, 2022 - Scala
Yet Another SPark Framework
Updated Feb 5, 2023 - Scala
EtlFlow is an ecosystem of functional libraries in Scala based on ZIO for running complex Auditable workflows which can interact with Google Cloud Platform, AWS, Kubernetes, Databases, SFTP servers, On-Prem Systems and more.
Updated Jul 30, 2023 - Scala
A simple Spark-powered ETL framework that just works 🍺
Updated Dec 7, 2023 - Scala
A simplified, lightweight ETL Framework based on Apache Spark
Updated Jan 24, 2024 - Scala