GitHub is home to over 50 million developers working together. Join them to grow your own development teams, manage permissions, and collaborate on projects.
The premier open source Data Quality solution
Vagrantfiles for various development environments
Documentation for the DataCleaner project
Data Profiling for Pentaho Data Integration (PDI) with DataCleaner
Extra pluggable modules for Apache MetaModel (but licensed with LGPL)
DataCleaner extension for ElasticSearch
Extension to send JMS messages to an Active MQ queue.
A DataCleaner extension for XML, XPath, XSLT like transformations
An emailing extension for DataCleaner - use it to send newsletters or other batch emails
An extension for matching product data with the POD database (product-open-data.com)
Extension for working with vehicle/car data
An extensible and high-performance data processing engine
(New) Web Service Proxy project for EasyDQ.com, based on the Netty I/O library
Lucene search extension for DataCleaner
DataCleaner extension that provides a DC monitor repository based on Amazon S3