datalake
Here are 18 public repositories matching this topic...
-
Updated
Mar 6, 2018 - Java
Debezium server batch consumers
-
Updated
Jul 20, 2022 - Java
Streaming application development and management system, based on Linkis and DSS, planning to provide the workflow-like graphical drag-and-drop development capability.
-
Updated
Apr 15, 2024 - Java
Open Control Plane for Tables in Data Lakehouse
-
Updated
May 22, 2024 - Java
World's most powerful data catalog service with providing a high-performance, geo-distributed and federated metadata lake.
-
Updated
May 24, 2024 - Java
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
-
Updated
May 23, 2024 - Java
LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.
-
Updated
May 23, 2024 - Java
Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.
-
Updated
May 24, 2024 - Java
Upserts, Deletes And Incremental Processing on Big Data.
-
Updated
May 24, 2024 - Java
StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries. InfoWorld’s 2023 BOSSIE Award for best open source software.
-
Updated
May 24, 2024 - Java
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
-
Updated
May 24, 2024 - Java
Improve this page
Add a description, image, and links to the datalake topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the datalake topic, visit your repo's landing page and select "manage topics."