The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
-
Updated
May 27, 2024 - Python
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
the portable Python dataframe library
Compare tables within or across databases
⚡ Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
🎣 List of `pre-commit` hooks to ensure the quality of your `dbt` projects.
Snowflake Snowpark Python API
Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub
One framework to develop, deploy and operate data workflows with Python and SQL.
Snowflake CLI is an open-source command-line tool explicitly designed for developer-centric workloads in addition to SQL operations.
ingestr is a CLI tool to copy data between any databases with a single command seamlessly.
Simple DDL Parser to parse SQL (HQL, TSQL, AWS Redshift, BigQuery, Snowflake and other dialects) ddl files to json/python dict with full information about columns: types, defaults, primary keys, etc. & table properties, types, domains, etc.
LiDAR snowfall simulation
Code and tutorial to develop a real time data pipeline and a Streamlit app that consumes this data.
Add a description, image, and links to the snowflake topic page so that developers can more easily learn about it.
To associate your repository with the snowflake topic, visit your repo's landing page and select "manage topics."