The Goal of this project is to provide documentation for the Lakehouse Engine framework.
-
Updated
Oct 28, 2024 - HTML
The Goal of this project is to provide documentation for the Lakehouse Engine framework.
Data file examples and user guides for VerityPy and VerityDotNet libraries
R package for delineating temporal dataset shifts in Eletronic Health Records
re_data - fix data issues before your users & CEO would discover them 😊
Collection of R scripts to test packages in conducting data quality assessments
Metrics Observability & Troubleshooting
A comprehensive repository housing a collection of insightful blog posts, in-depth documentation, and resources exploring various facets of data engineering. From ETL processes and database management to orchestration tools, data quality, monitoring, and deployment strategies
This GitHub repository provides a comprehensive set of tools and algorithms for detecting fraud anomalies in various data sources. Fraudulent activities can have severe consequences, impacting businesses and individuals alike. With this repository, we aim to empower researchers with effective techniques to identify and prevent fraudulent behavior.
FIMUS imputes numerical and categorical missing values by using a data set’s existing patterns including co-appearances of attribute values, correlations among the attributes and similarity of values belonging to an attribute.
LEILA - Librería de calidad de datos
To describe age-gender unbiased COVID-19 subphenotypes regarding severity patterns through a two-stage clustering approach using patient phenotypes and demographic features. Additional source and temporal variability assessments are included as part of data quality analyses.
R code for the discovery of COVID-19 subgroups by symptoms and comorbidities.
TellMeQuality is a tool for measuring Data Quality according to ISO/IEC 25024.
Ergebnisse der Datenanalyse vom Feinstaub Hackathon 2018 der Stuttgarter Zeitung
Add a description, image, and links to the data-quality topic page so that developers can more easily learn about it.
To associate your repository with the data-quality topic, visit your repo's landing page and select "manage topics."