The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
-
Updated
Jun 13, 2024 - Python
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
the portable Python dataframe library
Compare tables within or across databases
ingestr is a CLI tool to copy data between any databases with a single command seamlessly.
⚡ Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
🎣 List of `pre-commit` hooks to ensure the quality of your `dbt` projects.
One framework to develop, deploy and operate data workflows with Python and SQL.
Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for the Snowflake data warehouse.
Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub
Snowflake Snowpark Python API
Timeseries Anomaly detection and Root Cause Analysis on data in SQL data warehouses and databases
A SQL port of python's scikit-learn preprocessing module, provided as cross-database dbt macros.
Simple DDL Parser to parse SQL (HQL, TSQL, AWS Redshift, BigQuery, Snowflake and other dialects) ddl files to json/python dict with full information about columns: types, defaults, primary keys, etc. & table properties, types, domains, etc.
LiDAR snowfall simulation
Add a description, image, and links to the snowflake topic page so that developers can more easily learn about it.
To associate your repository with the snowflake topic, visit your repo's landing page and select "manage topics."