Tutorial and examples of Data Quality in Big Data System
Updated Apr 25, 2017
This program runs daily to check the quality of sensor and probe data.
CSV Data Validator is a tool for validating CSV files. It parses each CSV file and validates the data against its .hdr file (CSV metadata) before ingestion into the data lake. It checks that the data file is available for each daily load and validates the data against the corresponding metadata, such as file size, checksum, delimiter, and record count. It ensures landed data conformity before giving the go-ahead …
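The checks named in this description (file size, checksum, delimiter, record count) can be sketched in a few lines of Python. This is only an illustrative sketch, not the tool's actual implementation: the `validate_csv` function, the `expected` dictionary and its keys, the use of SHA-256 as the checksum, and the assumption that the first row is a header are all hypothetical, since the .hdr format is not described here.

```python
import csv
import hashlib
import os


def validate_csv(path, expected):
    """Return a list of problems found; an empty list means the file passed.

    `expected` is a hypothetical stand-in for the parsed .hdr metadata.
    """
    problems = []

    # File size in bytes, as reported by the file system.
    size = os.path.getsize(path)
    if size != expected["size_bytes"]:
        problems.append(f"file size: expected {expected['size_bytes']}, got {size}")

    # Checksum over the raw bytes (SHA-256 is an assumption).
    sha = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(65536), b""):
            sha.update(chunk)
    if sha.hexdigest() != expected["sha256"]:
        problems.append("checksum mismatch")

    # Parse with the expected delimiter; a wrong delimiter usually shows up
    # as an unexpected number of columns in the header row.
    with open(path, newline="", encoding="utf-8") as fh:
        rows = list(csv.reader(fh, delimiter=expected["delimiter"]))
    if not rows:
        problems.append("file is empty")
        return problems

    header, records = rows[0], rows[1:]
    if len(header) != expected["column_count"]:
        problems.append(
            f"column count: expected {expected['column_count']}, got {len(header)}"
        )

    # Record count excludes the header row.
    if len(records) != expected["record_count"]:
        problems.append(
            f"record count: expected {expected['record_count']}, got {len(records)}"
        )

    return problems
```

A caller would give the go-ahead for ingestion only when the returned list is empty, and log or alert on the listed problems otherwise.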
NOW-QUAL: Vaccine coverage survey Near-time Data Monitoring and Cleaning standard development template
This repository provides our generic test protocol for the integration test of ASS.
A Practical Approach for Population Data Quality Assessment
Data Stream Quality Control with Apache Kafka
Data as a service, data concerns, and data contracts
Explored transactional data and customer demographics to determine customer trends and behavior in order to highlight new potential high-value customers.
Simple Spark wrapper for validating data
[R package] Tools for data quality testing
Udacity Data Engineering Capstone Project. The purpose of this project is to establish a single-source-of-truth database for I94 US immigration data, covering basic immigration profiles, purpose of travel, visa status, and weather impacts.
Azure Data Lake Gen2 storage connectors for Data Culpa - monitor data quality automatically with Data Culpa Validator
MongoDB connector for Data Culpa - monitor data quality automatically with Data Culpa Validator
Snowflake connectors for Data Culpa - monitor data quality automatically with Data Culpa Validator