data-pipeline

Star

Here are 24 public repositories matching this topic...

sushovankarmakar / kafka-spark-streaming

Star

An end to end data pipeline with Kafka Spark Streaming Integration

java kafka spark spark-streaming java-8 data-pipeline kafka-spark kafka-spark-streaming

Updated Jun 16, 2022
Java

Ashfaqbs / Microservices-Based-Wikimedia-Data-Processing-with-Kafka

Star

Efficiently captures real-time Wikimedia data, like a newsroom for Wikipedia changes. Uses microservices, Kafka, and Spring Boot for reliability and scalability. Ideal for research and analysis.

kafka spring-boot microservice jpa java-8 data-pipeline

Updated Oct 12, 2023
Java

rashmishrm / serverhealth

Star

This is Kafka-Elastic Search pipeline for storing and analyzing server health logs

java kafka data-analysis elastic-search data-pipeline

Updated Jul 18, 2017
Java

ghowkay / realtime-metrics-calculation

Star

Realtime metrics calculation pipeline using kafka, elasticsearch and kibana.

docker elasticsearch kibana docker-compose data-engineering data-pipeline kakfa

Updated Feb 16, 2024
Java

ProsperChuks / airbyte

Star

Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.

data-engineering data-pipeline

Updated Oct 28, 2021
Java

yosra270 / store-data-pipeline

Star

Data pipeline using Apache Kafka, Apache Spark and HDFS

kafka big-data spark hdfs data-pipeline

Updated May 13, 2022
Java

mbrtargeting / camus

Star

LinkedIn's previous generation Kafka to HDFS pipeline.

kafka hadoop data-engineering hdfs data-pipeline

Updated Mar 12, 2019
Java

kwangjong / coinbase-real-time-data-pipeline

Star

A real-time cryptocurrency data streaming pipeline.

java docker kubernetes scala apache-spark grafana hdfs k8s apache-kafka apache-cassandra data-pipeline

Updated Jun 25, 2024
Java

cjannun / kafka-based-data-pipeline

Star

Cloud server data pipeline built with Apache Kafka and Java

java kafka apache-kafka kafka-streams data-pipeline

Updated Nov 5, 2022
Java

iShiBin / CS502Capstone

Star

CS502Capstone

scala spark cassandra prediction recommender-systems data-pipeline kafak

Updated Feb 18, 2018
Java

illuin-tech / data-pipeline

Star

Toolkit for describing data transformation pipelines by compositing simple reusable components.

java etl data-pipeline

Updated Sep 2, 2024
Java

colechristini / dataset-lib

Star

Data-processing and common libraries used in main project, all available under Apache 2.0

java data big-data java-8 data-processing data-pipeline

Updated Feb 27, 2019
Java

BrahianVT / Data-Pipeline

Star

Data-pipeline

mysql database restapi data-pipeline

Updated Jun 21, 2022
Java

sanogotech / spring-boot-with-kafkalighttest

Star

KAFKA par la Pratique

kafka spring-boot data-pipeline

Updated May 18, 2022
Java

JinsYin / datalink

Star

⚡ 数据集成 | DataLink is a lightweight data integration framework build on top of DataX, Spark and Flink

data streaming framework big-data spark integration pipeline etl bigdata batch data-integration data-collection flink cdc data-exchange data-synchronization data-pipeline datalink flink-cdc

Updated Jun 19, 2024
Java

mujahidniaz / iot_device_streaming_pipeline_cloudera-kakfa-spark-hbase

Star

Real Time Data Streaming Pipeline

kafka spark impala cloudera hbase data-pipeline streaming-data data-ingestion streaming-pipeline iots

Updated Jan 9, 2020
Java

GetFeedback / kahpp-oss

Star

Kafka Streams made easy with a YAML file

yaml automation kafka pipeline tool stream-processing kafka-streams data-processing data-pipeline stream-processor stream-processing-software

Updated Aug 4, 2023
Java

apache / seatunnel-datasource-sdk

Star

SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).

real-time offline high-performance apache data-integration sql-engine data-pipeline etl-framework seatunnel

Updated Jun 14, 2023
Java

cognitree / kronos

Star

cron replacement to schedule complex data workflows

scheduler task-scheduler cronjob-scheduler quartz-scheduler data-pipeline java-scheduler workflow-scheduler

Updated Nov 16, 2022
Java

DataSQRL / sqrl

Star

Flexible development framework for building streaming data applications in SQL with Kafka, Flink, Postgres, GraphQL, and more.

api streaming database event-driven-microservices event-driven data-pipeline

Updated Sep 21, 2024
Java

Improve this page

Add a description, image, and links to the data-pipeline topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data-pipeline topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data-pipeline

Here are 24 public repositories matching this topic...

sushovankarmakar / kafka-spark-streaming

Ashfaqbs / Microservices-Based-Wikimedia-Data-Processing-with-Kafka

rashmishrm / serverhealth

ghowkay / realtime-metrics-calculation

ProsperChuks / airbyte

yosra270 / store-data-pipeline

mbrtargeting / camus

kwangjong / coinbase-real-time-data-pipeline

cjannun / kafka-based-data-pipeline

iShiBin / CS502Capstone

illuin-tech / data-pipeline

colechristini / dataset-lib

BrahianVT / Data-Pipeline

sanogotech / spring-boot-with-kafkalighttest

JinsYin / datalink

mujahidniaz / iot_device_streaming_pipeline_cloudera-kakfa-spark-hbase

GetFeedback / kahpp-oss

apache / seatunnel-datasource-sdk

cognitree / kronos

DataSQRL / sqrl

Improve this page

Add this topic to your repo