Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Updated May 9, 2024 - Java
The premier open source Data Quality solution
Know your data better! Datavines is a next-gen data observability platform that supports metadata management and data quality.
CSV Data Validator is a tool for validating CSV files. It parses each CSV file and validates the data against its .hdr file (the CSV metadata) before ingestion into the data lake. It checks that the data file is available for each daily load and validates the data against the corresponding metadata, such as file size, checksum, delimiter, and record count. It ensures that landed data conforms before giving the go-ahead …
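The checks described above (file size, checksum, delimiter, record count) can be sketched roughly as follows. This is a minimal illustration in Python, not the tool's actual implementation: the function name `validate_csv`, the metadata keys (`size`, `sha256`, `delimiter`, `column_count`, `record_count`), the choice of SHA-256, and the assumption that the first row is a header are all hypothetical, since the real .hdr layout is not described here.

```python
import csv
import hashlib
import io


def validate_csv(data: bytes, meta: dict) -> list:
    """Return a list of problems; an empty list means the file conforms.

    `meta` stands in for the parsed .hdr file. Its keys are assumptions
    made for this sketch, not the tool's real metadata format.
    """
    problems = []

    # File size check, in bytes.
    if len(data) != meta["size"]:
        problems.append(f"size: expected {meta['size']}, got {len(data)}")

    # Checksum check (SHA-256 is an assumption; the tool's algorithm is not stated).
    digest = hashlib.sha256(data).hexdigest()
    if digest != meta["sha256"]:
        problems.append(f"checksum: expected {meta['sha256']}, got {digest}")

    # Parse with the expected delimiter.
    text = data.decode("utf-8")
    reader = csv.reader(io.StringIO(text, newline=""), delimiter=meta["delimiter"])
    rows = [row for row in reader if row]  # skip blank lines

    # Delimiter check: every row should split into the expected column count.
    if any(len(row) != meta["column_count"] for row in rows):
        problems.append("delimiter: rows do not split into the expected column count")

    # Record count check, assuming the first row is a header.
    record_count = max(len(rows) - 1, 0)
    if record_count != meta["record_count"]:
        problems.append(
            f"record count: expected {meta['record_count']}, got {record_count}"
        )

    return problems
```

A caller would read the landed file as bytes, build `meta` from the accompanying .hdr file, and allow ingestion only when the returned list is empty.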
data as a service, data concerns and data contracts