A Generalized Data Cleaning System

NADEEF

What is NADEEF?

NADEEF (نظيف, Arabic for "clean") is a generalized data cleaning system developed by the data analytics group at the Qatar Computing Research Institute.

See it in Action

Launch NADEEF with the nadeef.sh script:

    Usage: nadeef.sh [OPTIONS]
    Options are:
        console : start the NADEEF console.
        dashboard : start the NADEEF dashboard.

A demo page can be accessed here.

More detailed instructions can be found in the User Guide.

Goals of NADEEF

As a commodity data cleaning system, NADEEF aims to be extensible, generic, and easy to deploy.

Most existing data cleaning methods and systems, in industry and academia alike, apply different types of data quality rules in isolation. Each time a new data cleaning problem arises or a new type of rule is considered, practitioners either build a new system from scratch or go through a painful process of customizing an existing tool. To achieve generality and extensibility, the NADEEF team designed a data cleaning system that separates a programming interface, in which rules are written, from a core that executes them.
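
The separation between a rule-writing interface and a generic core can be illustrated with a minimal sketch. This is not NADEEF's actual API: every name here (RuleSketch, Rule, Violation, detect, functionalDependency) is hypothetical and chosen only for illustration. The idea is that a rule author implements a small interface, while the detection loop stays the same for every rule.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Hypothetical sketch only: none of these names come from the NADEEF codebase.
public class RuleSketch {

    /** Programming interface: the only thing a rule author implements. */
    public interface Rule {
        String name();

        /** Returns true when the two rows together violate the rule. */
        boolean violates(Map<String, String> left, Map<String, String> right);
    }

    /** One detected violation: the rule name and the indices of the two rows. */
    public static final class Violation {
        public final String rule;
        public final int leftRow;
        public final int rightRow;

        Violation(String rule, int leftRow, int rightRow) {
            this.rule = rule;
            this.leftRow = leftRow;
            this.rightRow = rightRow;
        }

        @Override
        public String toString() {
            return rule + " violated by rows " + leftRow + " and " + rightRow;
        }
    }

    /** Core: a generic pairwise detection loop that knows nothing about specific rules. */
    public static List<Violation> detect(List<Map<String, String>> rows, List<Rule> rules) {
        List<Violation> found = new ArrayList<>();
        for (Rule rule : rules) {
            for (int i = 0; i < rows.size(); i++) {
                for (int j = i + 1; j < rows.size(); j++) {
                    if (rule.violates(rows.get(i), rows.get(j))) {
                        found.add(new Violation(rule.name(), i, j));
                    }
                }
            }
        }
        return found;
    }

    /** Example rule: a functional dependency lhs -> rhs (equal lhs implies equal rhs). */
    public static Rule functionalDependency(String lhs, String rhs) {
        return new Rule() {
            @Override
            public String name() {
                return lhs + " -> " + rhs;
            }

            @Override
            public boolean violates(Map<String, String> a, Map<String, String> b) {
                return a.get(lhs).equals(b.get(lhs)) && !a.get(rhs).equals(b.get(rhs));
            }
        };
    }

    public static void main(String[] args) {
        // Made-up sample rows; the first two share a zip but disagree on city.
        List<Map<String, String>> rows = List.of(
            Map.of("zip", "10001", "city", "New York"),
            Map.of("zip", "10001", "city", "Newark"),
            Map.of("zip", "60601", "city", "Chicago"));
        List<Rule> rules = List.of(functionalDependency("zip", "city"));
        detect(rows, rules).forEach(System.out::println);
    }
}
```

In this sketch, supporting a new kind of rule means writing another Rule implementation; the detect loop is left untouched. A real system would also need repair logic and a more efficient way to pair tuples than the quadratic scan used here.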

Publications

http://da.qcri.org/

License

NADEEF is released under the terms of the MIT License.

Contact

For issues or enhancement requests, please use the issue pages on GitHub, or contact us. We will do our best to help you sort it out.

Acknowledgement

We would like to thank JetBrains for supporting us with their wonderful IntelliJ IDEA product, which we have used throughout the development of NADEEF.
