A lightweight tool for indexing, cataloging, and browsing data.
-
Updated
Oct 30, 2020 - Python
A lightweight tool for indexing, cataloging, and browsing data.
This is the project for Karnataka Police Hackathon
LangSphere is an interactive AI playground specially developed to demonstrate the capabilities of large language models (LLMs) within the realm of Augmented Analytics.
Dataset Mention Extraction and Classification Baselines
Spider join dataset for our paper - WarpGate: A Semantic Join Discovery System for Cloud Data Warehouses (CIDR 2023).
Open-source GCP metadata collector based on ODD Specification
A fast and accurate index for distribution-aware dataset search.
An analytics engineering sandbox focusing on real estates prices in Cook County, IL
Valentine scalable deployment for VLDB demo
Scan directories, exports, and backups for sensitive data (like PII and API keys) with Nightfall's data loss prevention (DLP) APIs. Discover what lives at-rest in your data silos.
Data Catalogs Made Easy
Toolkit for discovering and aggregating data for whole-cell modeling
articat: data artifact catalog
A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable front end that's just HTML.
A data lineage tool detects table dependencies from rendered SQL statements.
WG3 Metadata Specification
Registry of data portals, catalogs, data repositories including data catalogs dataset and catalog description standard
Open-source metadata collector based on ODD Specification
Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.
Add a description, image, and links to the data-discovery topic page so that developers can more easily learn about it.
To associate your repository with the data-discovery topic, visit your repo's landing page and select "manage topics."