😎 Cocoon uses LLMs to augment data engineering and data science.
-
👉 Python Package: Check out the notebook that cleans tables in Snowflake/DuckDB
-
👉 Check out the 1 min demo
-
Interactively cast the column type and normalize cases
- The output is DBT staging sql/yml
- 👉 Online Service: Drop your CSV, and the profile will be ready in <10 min
- 👉 Python Package: Check out the notebook to interactively profile your table in python
- (Both run the same code; Python package requires LLM API, but is interactive and no size/#col limit)
Check out more profiles
Dataset Title | Profile Link |
---|---|
AQI and Latitude/Longitude of Countries | View Profile |
2020 Property Sales Data | View Profile |
AAC Shelter Cat Outcome | View Profile |
Books | View Profile |
Cancer | View Profile |
Divorces 2000-2015 | View Profile |
German Credit Data | View Profile |
K-Drama | View Profile |
Patients | View Profile |
Used Car Data | View Profile |
Cite Cocoon Profile
@article{huang2024cocoon,
title={Cocoon: Semantic Table Profiling Using Large Language Models},
author={Huang, Zezhou and Wu, Eugene},
journal={arXiv preprint arXiv:2404.12552},
year={2024}
}
We plan to support Data Cleaning, Data Integration, and Data Modeling.