🌀 𝗧𝗵𝗲 𝗙𝘂𝗹𝗹 𝗦𝘁𝗮𝗰𝗸 𝟳-𝗦𝘁𝗲𝗽𝘀 𝗠𝗟𝗢𝗽𝘀 𝗙𝗿𝗮𝗺𝗲𝘄𝗼𝗿𝗸 | 𝗟𝗲𝗮𝗿𝗻 𝗠𝗟𝗘 & 𝗠𝗟𝗢𝗽𝘀 for free by designing, building and deploying an end-to-end ML batch system ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 2.5 𝘩𝘰𝘶𝘳𝘴 𝘰𝘧 𝘳𝘦𝘢𝘥𝘪𝘯𝘨 & 𝘷𝘪𝘥𝘦𝘰 𝘮𝘢𝘵𝘦𝘳𝘪𝘢𝘭𝘴
Updated Apr 3, 2024 - Python
The Lakehouse Engine is a configuration-driven Spark framework, written in Python, that serves as a scalable, distributed engine for lakehouse algorithms, data flows, and utilities for Data Products.
Sample project to demonstrate data engineering best practices
Data Quality Gate based on AWS
Prefect integrations for interacting with Great Expectations
A Covid-19 data pipeline on AWS featuring PySpark/Glue, Docker, Great Expectations, Airflow, and Redshift, templated in CloudFormation and CDK, deployable via GitHub Actions.
A project for exploring how Great Expectations can be used to ensure data quality and validate batches within a data pipeline defined in Airflow.
This repository serves as a comprehensive guide to effective data modeling and robust data quality assurance using popular open-source tools
Code to demonstrate data engineering metadata & logging best practices
Run greatexpectations.io on any SQL engine via a REST API, built with FastAPI, Pydantic, and SQLAlchemy.
This library is inspired by Great Expectations: it makes the various expectations found in Great Expectations available through Python's built-in unittest assertions.
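A minimal sketch of that idea, assuming nothing about the library's actual API: hypothetical expectation helpers (names modeled on Great Expectations) implemented on top of unittest assertions, checked against a small list of row dicts.

```python
import unittest


class ExpectationsMixin(unittest.TestCase):
    """Hypothetical helpers in the style of Great Expectations,
    expressed as built-in unittest assertions over rows (list of dicts)."""

    def expect_column_values_to_not_be_null(self, rows, column):
        # Fail if any row is missing a value for the column.
        for i, row in enumerate(rows):
            self.assertIsNotNone(row.get(column), f"row {i}: {column!r} is null")

    def expect_column_values_to_be_between(self, rows, column, min_value, max_value):
        # Fail if any value falls outside the inclusive range.
        for i, row in enumerate(rows):
            self.assertGreaterEqual(row[column], min_value, f"row {i}: {column!r} too small")
            self.assertLessEqual(row[column], max_value, f"row {i}: {column!r} too large")


class TestOrders(ExpectationsMixin):
    ROWS = [
        {"order_id": 1, "amount": 10.0},
        {"order_id": 2, "amount": 25.5},
    ]

    def test_amounts_are_valid(self):
        self.expect_column_values_to_not_be_null(self.ROWS, "amount")
        self.expect_column_values_to_be_between(self.ROWS, "amount", 0, 100)


if __name__ == "__main__":
    unittest.main()
```

Because the expectations are plain assertions, a failed check surfaces as an ordinary failing unit test with a row-level message.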
Using Great Expectations and Notion's API, this repo aims to provide data quality for our databases in Notion.
An ML pipeline to flip NFTs that makes use of the cloud and containers.
Create data pipeline using Lambda architecture with Spark, Kafka, Airflow and Snowflake
A pipeline to forecast the direction of stock prices using data from eodhistoricaldata.com
Personal data engineering project whose objective is to create a data lakehouse for a B2B e-commerce business that must store its transactional and analytical data. The final system delivers structured, clean data for generating reports and finding opportunities.
Kafka-Spark jobs orchestrated with Airflow
Tutorials for DevOps tools such as Google Codelabs, Apache Airflow, Streamlit, FastAPI, Great Expectations, etc.