data-quality
Here are 11 public repositories matching this topic...
Simple Spark wrapper for validating data
Updated Oct 17, 2020 - Scala
An extensible and configurable ETL tool built on top of Apache Spark
Updated Aug 28, 2021 - Scala
Automated data quality suggestions and analysis with Deequ on AWS Glue
Updated Dec 29, 2022 - Scala
A library for Spark that helps standardize any input data (DataFrame) so that it adheres to the provided schema.
Updated Sep 6, 2023 - Scala
A Quality Spark DQ Library
Updated May 27, 2024 - Scala
Data generation and validation tool for any data source
Updated Feb 20, 2024 - Scala
Data quality control tool built on Spark and Deequ
Updated Mar 30, 2024 - Scala
Feathr – A scalable, unified data and AI engineering platform for enterprise
Updated Apr 4, 2024 - Scala
Example API implementation for Data Caterer
Updated Jun 6, 2024 - Scala
Test data management tool for any data source, batch or real-time
Updated Jun 10, 2024 - Scala