Data mining project that uses logistic regression in R to predict whether a person is diabetic.
All patients in the dataset are females at least 21 years old of Pima Indian heritage.
Number of Instances: 768
Number of Attributes: 8 plus class
- Number of times pregnant
- Plasma glucose concentration at 2 hours in an oral glucose tolerance test
- Diastolic blood pressure (mm Hg)
- Triceps skin fold thickness (mm)
- 2-Hour serum insulin (mu U/ml)
- Body mass index (weight in kg/(height in m)^2)
- Diabetes pedigree function
- Age (years)
- Class variable (0 or 1)
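
As a rough illustration of how the eight attributes above could feed a logistic regression in R, here is a minimal sketch using base R's `glm` with `family = binomial`. The column names (`pregnant`, `glucose`, and so on), the helper functions `fit_diabetes_model` and `predict_class`, and the 0.5 threshold are assumptions made for this example, not names taken from the project. The data frame `demo` is randomly generated stand-in data so the sketch runs on its own; it is not the real dataset.

```r
# Fit a logistic regression of the 0/1 class on all other columns.
# (Hypothetical helper; column names below are assumed, not from the source.)
fit_diabetes_model <- function(df) {
  glm(class ~ ., data = df, family = binomial)
}

# Turn predicted probabilities into 0/1 labels using an assumed threshold.
predict_class <- function(model, newdata, threshold = 0.5) {
  p <- predict(model, newdata = newdata, type = "response")
  as.integer(p >= threshold)
}

# Synthetic stand-in data, only so the sketch runs end to end.
# These values are NOT the real patient records.
set.seed(1)
n <- 200
demo <- data.frame(
  pregnant = rpois(n, 3),
  glucose  = rnorm(n, 120, 30),
  pressure = rnorm(n, 70, 12),
  triceps  = rnorm(n, 20, 8),
  insulin  = rnorm(n, 80, 40),
  bmi      = rnorm(n, 32, 6),
  pedigree = runif(n, 0.1, 1.5),
  age      = sample(21:70, n, replace = TRUE)
)
# Simulated outcome that depends on glucose only
demo$class <- rbinom(n, 1, plogis((demo$glucose - 120) / 30))

model <- fit_diabetes_model(demo)
pred  <- predict_class(model, demo)
print(summary(model)$coefficients)
```

With the real data, `demo` would be replaced by the 768-row data frame read from the dataset file, with the class column coded as 0 or 1.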
Class Distribution: (class value 1 is interpreted as "tested positive for diabetes")
Class Value | Number of instances |
---|---|
0 | 500 |
1 | 268 |
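
The class counts above imply an imbalanced target, which is worth keeping in mind when judging a classifier's accuracy. The following small sketch computes the class proportions and the accuracy of a trivial model that always predicts the majority class. The variable names are chosen for this example only.

```r
# Class counts taken from the table above
class_counts <- c(`0` = 500, `1` = 268)

total <- sum(class_counts)             # 768 instances in total
proportions <- class_counts / total    # share of each class

# Accuracy of always predicting the majority class (class 0)
baseline_accuracy <- max(proportions)

print(round(proportions, 3))
print(round(baseline_accuracy, 3))
```

Any fitted model should beat this majority-class baseline (roughly 65%) to be considered useful.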