Introduction to Data Mining project at HCMIU.
Hepatitis C Prediction Dataset
- Pre-process the data
  - Remove useless data
  - Handle missing data
  - Detect outliers and extreme values
  - Analyze correlation
  - Discretize continuous attributes
- Classify the dataset using:
  - OneR
  - Naive Bayes
  - Decision tree (J48)
- Evaluate the performance using 10-fold cross-validation
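
The steps above can be sketched in a few lines of plain Python. This is only an illustrative sketch, not the project's actual implementation: the helper names (`impute_mean`, `iqr_outliers`, `equal_width_bins`, `one_r_fit`, `one_r_predict`, `cross_val_accuracy`) are hypothetical, the choice of mean imputation, the 1.5 × IQR rule, and equal-width binning are assumptions, the cross-validation is plain rather than stratified, and the toy data is synthetic rather than taken from the Hepatitis C dataset.

```python
import random
import statistics
from collections import Counter, defaultdict


def impute_mean(values):
    """Replace None entries with the mean of the observed values."""
    observed = [v for v in values if v is not None]
    mean = statistics.fmean(observed)
    return [mean if v is None else v for v in values]


def iqr_outliers(values, factor=1.5):
    """Return indices of values outside [Q1 - factor*IQR, Q3 + factor*IQR]."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - factor * iqr, q3 + factor * iqr
    return [i for i, v in enumerate(values) if v < low or v > high]


def equal_width_bins(values, n_bins=3):
    """Map each value to a bin index in 0..n_bins-1 using equal-width intervals."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_bins or 1.0  # fall back to 1.0 if all values are equal
    return [min(int((v - lo) / width), n_bins - 1) for v in values]


def one_r_fit(rows, labels):
    """OneR: keep the single attribute whose value -> majority-class rule
    makes the fewest errors on the training data."""
    best = None
    for attr in range(len(rows[0])):
        counts = defaultdict(Counter)
        for row, label in zip(rows, labels):
            counts[row[attr]][label] += 1
        rule = {value: c.most_common(1)[0][0] for value, c in counts.items()}
        errors = sum(rule[row[attr]] != label for row, label in zip(rows, labels))
        if best is None or errors < best[0]:
            best = (errors, attr, rule)
    default = Counter(labels).most_common(1)[0][0]  # used for unseen values
    _, attr, rule = best
    return attr, rule, default


def one_r_predict(model, row):
    """Look up the row's value for the chosen attribute in the learned rule."""
    attr, rule, default = model
    return rule.get(row[attr], default)


def cross_val_accuracy(rows, labels, k=10, seed=0):
    """Mean accuracy over k folds (requires at least k rows, no stratification)."""
    indices = list(range(len(rows)))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]
    scores = []
    for fold in folds:
        held_out = set(fold)
        train = [i for i in indices if i not in held_out]
        model = one_r_fit([rows[i] for i in train], [labels[i] for i in train])
        hits = sum(one_r_predict(model, rows[i]) == labels[i] for i in fold)
        scores.append(hits / len(fold))
    return sum(scores) / k


# Synthetic toy data: one continuous attribute with a missing value, plus a class label.
raw = [1.0, 1.2, None, 0.9, 1.1, 5.0, 5.2, 4.8, 5.1, 4.9] * 2
labels = (["neg"] * 5 + ["pos"] * 5) * 2

filled = impute_mean(raw)                    # handle missing data
outliers = iqr_outliers(filled)              # detect outliers
binned = equal_width_bins(filled, n_bins=2)  # discretize the attribute
rows = [(b,) for b in binned]
accuracy = cross_val_accuracy(rows, labels, k=10)
print(f"outlier indices: {outliers}, 10-fold accuracy: {accuracy:.2f}")
```

Naive Bayes and J48 would plug into the same evaluation loop by swapping the fit and predict functions; OneR is shown here only because it is the shortest of the three to write out.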