iceberg

Here are 11 public repositories matching this topic...

apache / iceberg-python

Apache PyIceberg

apache hacktoberfest iceberg pyiceberg

Updated Jul 25, 2024
Python

dbt-athena / dbt-athena

The athena adapter plugin for dbt (https://getdbt.com)

athena s3 dbt iceberg glue-catalog dbt-athena dbt-athena-community

Updated Jul 22, 2024
Python

sotiriskar / netflix-big-data-stack

Implement a big data stack with Apache Kafka for data ingestion, Apache Flink for stream processing, Druid for OLAP querying, Iceberg for data versioning, and Apache Superset for visualization

streaming kafka pipeline s3 superset apache pipelines data-visualization minio netflix druid flink iceberg

Updated Jul 18, 2024
Python

Sanjay-dev-ds / aws-serverless-data-pipeline

This project creates a serverless data pipeline to extract data from the Colombo Stock Market ASI Index API using AWS Lambda, Kinesis Firehose, and S3. An AWS Glue workflow processes and transforms the data, storing it in an Apache Iceberg table via Athena and Glue ETL jobs.

aws aws-lambda athena apache s3-bucket iceberg glue-job

Updated Jul 2, 2024
Python

waiyan1612 / postgres-kafka-iceberg-pipeline

Example pipeline that streams data changes from an RDBMS to Apache Iceberg tables

spark-streaming kafka-connect change-data-capture iceberg debezium-connector

Updated Apr 16, 2024
Python

WesleyJw / modern-data-stack

Creating a modern data stack in Kubernetes with open-source products, both on-premises and cloud-agnostic, is an increasingly popular approach. By leveraging Kubernetes for container orchestration, you can deploy and manage data processing, storage, and analysis tools more efficiently.