A tool to validate data, built around Apache Spark.
Updated May 29, 2024 - Scala
Test data management tool for any data source, batch or real-time
A micro framework for data binding and validation, very easy to use and hack
Data generation and validation tool for any data source
Example API implementation for Data Caterer
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Scala library to validate tabular data loaded from CSV files