This is a project for the Introduction to Data Mining course at International University - Vietnam National University Ho Chi Minh City, introduced by Dr. NGUYEN Thi Thanh Sang.
The project is written in Java and uses the Weka API.
It applies several data mining techniques to the Autism Screening Adult Data Set. The techniques applied are:
- Pre-processing Data
  - Handle missing data
  - Detect outliers and extreme values
  - Analyze correlation
  - Discretize continuous attributes
- Classify the dataset using
  - Decision tree (C4.5)
  - Naive Bayes
  - ZeroR
- Evaluate the performance using 10-fold cross validation
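
To make the evaluation step concrete, here is a minimal plain-Java sketch of how a ZeroR baseline can be scored with k-fold cross-validation. It is not the project's code and does not call Weka. The class name `ZeroRCrossValidation`, the `NO`/`YES` labels, the 70/30 split, and the seed are all hypothetical and chosen only for illustration. The sketch also assigns instances to folds by simple round-robin after shuffling, whereas Weka's own cross-validation additionally stratifies the folds for nominal classes.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Random;

/** Plain-Java sketch: ZeroR baseline scored with k-fold cross-validation. */
final class ZeroRCrossValidation {

    /**
     * ZeroR "training": ignore every attribute and return the most frequent
     * class label. On a tie, the label that reaches the top count first wins.
     */
    static String majorityClass(List<String> labels) {
        Map<String, Integer> counts = new HashMap<>();
        String best = null;
        int bestCount = 0;
        for (String label : labels) {
            int count = counts.merge(label, 1, Integer::sum);
            if (count > bestCount) {
                bestCount = count;
                best = label;
            }
        }
        return best;
    }

    /** Returns the accuracy of ZeroR under k-fold cross-validation, in [0, 1]. */
    static double crossValidate(List<String> labels, int k, long seed) {
        if (k < 2 || k > labels.size()) {
            throw new IllegalArgumentException("k must be between 2 and the number of instances");
        }
        // Shuffle a copy so the caller's list is left untouched.
        List<String> shuffled = new ArrayList<>(labels);
        Collections.shuffle(shuffled, new Random(seed));

        int correct = 0;
        for (int fold = 0; fold < k; fold++) {
            List<String> train = new ArrayList<>();
            List<String> test = new ArrayList<>();
            for (int i = 0; i < shuffled.size(); i++) {
                // Instance i is tested in fold (i mod k) and used for training in all others.
                (i % k == fold ? test : train).add(shuffled.get(i));
            }
            // Fit on the training part only, then score on the held-out part.
            String prediction = majorityClass(train);
            for (String actual : test) {
                if (actual.equals(prediction)) {
                    correct++;
                }
            }
        }
        // Every instance is tested exactly once, so divide by the full size.
        return (double) correct / shuffled.size();
    }

    public static void main(String[] args) {
        // Hypothetical class column: 70 "NO" and 30 "YES" labels.
        List<String> labels = new ArrayList<>();
        for (int i = 0; i < 70; i++) labels.add("NO");
        for (int i = 0; i < 30; i++) labels.add("YES");

        double accuracy = crossValidate(labels, 10, 1L);
        System.out.println("ZeroR 10-fold accuracy: " + accuracy);
    }
}
```

Because ZeroR always predicts the majority class of its training data, its cross-validated accuracy is close to the share of the majority class in the dataset. That makes it a useful floor when comparing the decision tree and Naive Bayes results.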