Skip to content

mp1610/Data_Mining

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Data_Mining

This project uses WEKA (data mining software) to apply several machine learning algorithms onto a dataset containing indian liver patient data. The coursework brief is attached in this repository, as well as the final pdf file which presents the implementation of data cleaning (such as normalisation, discretisation, handling missing data and outliers) and machine learning algorithms such as Naive Bayes, Multilayer Perceptron, and Clustering.

This data set contains 416 liver patient records and 167 non liver patient records. The dataset was collected from northeast of Andhra Pradesh, India. Selector is a class label used to divide into groups (liver patient or not). This data set contains 441 male patient records and 142 female patient records. Any patient whose age exceeded 89 is listed as being of age "90".

Attribute Information:

  1. Age - Age of the patient
  2. Gender - Gender of the patient
  3. TB - Total Bilirubin
  4. DB - Direct Bilirubin
  5. Alkphos - Alkaline Phosphatase
  6. Sgpt - Alamine Aminotransferase
  7. Sgot - Aspartate Aminotransferase
  8. TP - Total Proteins
  9. ALB - Albumin
  10. A/G - Ratio Albumin and Globulin Ratio
  11. Class - Selector field used to split the data into two sets (labelled by the experts) liver disease and No liver disease

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors