#

etl-framework

Here are 14 public repositories matching this topic...

YotpoLtd / metorikku

A simplified, lightweight ETL Framework based on Apache Spark

scala sql big-data spark etl distributed-computing etl-framework etl-pipeline

Updated Jan 24, 2024
Scala

InterestingLab / seatunnel-example

seatunnel plugin developing examples.

spark spark-streaming flink sql-engine etl-framework waterdrop etl-pipeline

Updated Jan 3, 2022
Scala

DaFlow

sparsecode / DaFlow

Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple categories of transformation rules.

json scala csv apache-spark hive hadoop avro etl parquet transformation-rules etl-framework etl-pipeline join-data

Updated Jun 7, 2021
Scala

tharwaninitin / etlflow

EtlFlow is an ecosystem of functional libraries in Scala based on ZIO for running complex Auditable workflows which can interact with Google Cloud Platform, AWS, Kubernetes, Databases, SFTP servers, On-Prem Systems and more.

redis bigquery aws scala spark etl s3 gcp gcs zio etl-framework dataproc etl-pipeline

Updated Jul 30, 2023
Scala

Renien / ETL-Starter-Kit

📁 Extract, Transform, Load (ETL) 👷 refers to a process in database usage and especially in data warehousing. This repository contains a starter kit featuring ETL related work.

scala hive gradle bigdata datascience pig scalding azkaban datamining starter-project etl-framework mapreduce-jobs

Updated Mar 20, 2017
Scala

SharpData / SharpETL

Write ETL using your favorite SQL dialects

scala sql spark hive etl bigdata data-warehouse flink datawarehouse spark-sql etl-framework flink-sql paimon

Updated Jan 7, 2024
Scala

Mambo

2298-Software / Mambo

A simple in-memory, configuration driven, data processing pipeline for Apache Spark.

spark stream hadoop pipeline turbine etl-framework etl-job

Updated Dec 20, 2022
Scala

ldaniels528 / broadway-v1

Broadway is a distributed actor-based processing server optimized for high-speed data/file ingestion

nodejs json node scala csv kafka broadway actors avro etl narrative flow-control ingestion etl-framework avro-format

Updated Mar 29, 2016
Scala

giucris / yasp

Yet Another SPark Framework

framework scala big-data spark etl sparksql elt etl-framework etl-pipeline big-data-processing

Updated Feb 5, 2023
Scala

kklimexk / spark-playground

Repository for playing with spark

cats scala big-data spark etl functional-programming etl-framework tagless-final higher-kinded-types etl-pipeline cats-free etl-jobs delta-io

Updated Oct 13, 2020
Scala

eSolutionsGrup / datatweak

Data Tweak is a simplified, lightweight ETL framework based on Apache Spark.

processing scala big-data spark etl distributed-computing etl-framework etl-pipeline data-tweak

Updated Jan 26, 2021
Scala

wjyheropk / DBFly

一款基于scala的轻量级、基于配置的Mysql数据库的数据同步、增量更新框架

Updated Sep 18, 2017
Scala

hotienvu / spipe

spark library to construct ETL pipeline with monads

scala spark etl-framework

Updated Apr 21, 2023
Scala

ldaniels528 / transgress

A distributed processing/orchestration server and ETL for NodeJS

nodejs tsv json node scala csv etl scalajs ingestion etl-framework scalajs-io

Updated Apr 22, 2017
Scala

Improve this page

Add a description, image, and links to the etl-framework topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the etl-framework topic, visit your repo's landing page and select "manage topics."