dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
-
Updated
Aug 8, 2024 - Python
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
MetricFlow allows you to define, build, and maintain metrics in code.
do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning models.
Python library and web service for Open Source Software Health and Sustainability metrics & data collection. You can find our documentation and new contributor information easily here: https://oss-augur.readthedocs.io/en/main/ and learn more about Augur at our website https://augurlabs.io
Linked Open Data Modeling Language
Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).
Define, govern, and model event data for warehouse-first product analytics.
Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
Amora Data Build Tool enables analysts and engineers to transform data on the data warehouse (BigQuery) by writing Amora Models that describe the data schema using Python's "PEP484 - Type Hints" and select statements with SQLAlchemy. Amora is able to transform Python code into SQL data transformation jobs that run inside the warehouse.
Link Modeling Language (LinkML) model
An end-to-end data pipeline which extracts divvy bikeshare data from web loads it into data lake and datawarehouse transforms it using dbt and finally , a dashboard to visualize the data using looker studio, the pipeline is orchestrated using prefect
Automated assistance for the schema development lifecycle
JobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters.
The dbt adapter for Firebolt
WG3 Metadata Specification
Development of the Gellish Communicator reference application and tools for universal data exchange and data integration supporting Formal English and other Gellish formalized natural languages.
This repository serves as a comprehensive guide to effective data modeling and robust data quality assurance using popular open-source tools
This project demonstrates how to build and automate an ETL pipeline using DAGs in Airflow and load the transformed data to Bigquery. There are different tools that have been used in this project such as Astro, DBT, GCP, Airflow, Metabase.
Add a description, image, and links to the data-modeling topic page so that developers can more easily learn about it.
To associate your repository with the data-modeling topic, visit your repo's landing page and select "manage topics."