# apache-kafka

Here are 159 public repositories matching this topic...

This project focuses on building a real-time streaming pipeline using Apache Flink and Apache Kafka. The goal is to enrich checkout data with user information, identify the first click leading to a checkout, and log the attributed checkouts into a Postgres sink table. The project implements concepts like state management, time attributes, watermark

  • Updated Sep 9, 2023
  • Python

An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache ZooKeeper, Apache Spark, and Cassandra. All components are containerized with Docker for easy deployment and scalability.

  • Updated May 23, 2024
  • Python
