Updated Jun 10, 2020 - Java
data-catalog
Here are 11 public repositories matching this topic...
Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabytes of data and make it accessible for data processing and analytical purposes by any cloud compute platform.
Updated Jun 8, 2020 - Java
Updated Jun 25, 2023 - Java
A system for managing files and file replicas across many diverse sites
Updated Mar 23, 2023 - Java
The historical metadata of your data warehouse is a treasure trove for discovering not just insights about changing data patterns, but also about data quality and user behaviour. This solution records the history of Data Catalog tags in BigQuery, because Data Catalog keeps only the latest version of metadata for fast searchability.
Updated Jul 21, 2021 - Java
Herd-MDL, a turnkey managed data lake in the cloud. See https://finraos.github.io/herd-mdl/ for more information.
Updated Jul 17, 2024 - Java
Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.
Updated Jun 3, 2024 - Java
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
Updated Jul 20, 2024 - Java
First open-source data discovery and observability platform. We make life easy for data practitioners so you can focus on your business.
Updated Jul 15, 2024 - Java
The Metadata Platform for your Data Stack
Updated Jul 20, 2024 - Java