iceberg
Here are 11 public repositories matching this topic...
The athena adapter plugin for dbt (https://getdbt.com)
-
Updated
Jul 22, 2024 - Python
Implement a big data stack with Apache Kafka for data ingestion, Apache Flink for stream processing, Druid for OLAP querying, Iceberg for data versioning, and Apache Superset for visualization
-
Updated
Jul 18, 2024 - Python
This project creates a serverless data pipeline to extract data from the Colombo Stock Market ASI Index API using AWS Lambda, Kinesis Firehose, and S3. An AWS Glue workflow processes and transforms the data, storing it in an Apache Iceberg table via Athena and Glue ETL jobs.
-
Updated
Jul 2, 2024 - Python
Example pipeline to stream the data changes from RDBMS to Apache Iceberg tables
-
Updated
Apr 16, 2024 - Python
Creating a modern data stack in Kubernetes with open-source products, both on-premises and cloud-agnostic, is an increasingly popular approach. By leveraging Kubernetes for container orchestration, you can deploy and manage data processing, storage, and analysis tools more efficiently.
-
Updated
Mar 10, 2024 - Python
Example code for running Spark and Hive jobs on EMR Serverless.
-
Updated
Mar 9, 2024 - Python
Kafka streaming job from iomete. This streaming job copies data from Kafka to Iceberg.
-
Updated
Oct 27, 2023 - Python
Current Antarctic large iceberg positions derived from ASCAT and OSCAT-2
-
Updated
Sep 2, 2023 - Python
Dagster + DBT + Spark + Iceberg
-
Updated
Jul 13, 2023 - Python
Improve this page
Add a description, image, and links to the iceberg topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the iceberg topic, visit your repo's landing page and select "manage topics."