This is Kafka-Elastic Search pipeline for storing and analyzing server health logs
-
Updated
Jul 18, 2017 - Java
This is Kafka-Elastic Search pipeline for storing and analyzing server health logs
CS502Capstone
Data-processing and common libraries used in main project, all available under Apache 2.0
LinkedIn's previous generation Kafka to HDFS pipeline.
Real Time Data Streaming Pipeline
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Data pipeline using Apache Kafka, Apache Spark and HDFS
An end to end data pipeline with Kafka Spark Streaming Integration
Cloud server data pipeline built with Apache Kafka and Java
cron replacement to schedule complex data workflows
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
Kafka Streams made easy with a YAML file
Efficiently captures real-time Wikimedia data, like a newsroom for Wikipedia changes. Uses microservices, Kafka, and Spring Boot for reliability and scalability. Ideal for research and analysis.
BitSail is a distributed high-performance data integration engine which supports batch, streaming and incremental scenarios. BitSail is widely used to synchronize hundreds of trillions of data every day.
Realtime metrics calculation pipeline using kafka, elasticsearch and kibana.
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
Compiler for streaming data pipelines and data microservices with configurable engines.
⚡ 数据集成 | DataLink is a lightweight data integration framework build on top of DataX, Spark and Flink
Add a description, image, and links to the data-pipeline topic page so that developers can more easily learn about it.
To associate your repository with the data-pipeline topic, visit your repo's landing page and select "manage topics."