pima-indians-classification

This dataset is originally from the National Institute of Diabetes and Digestive and Kidney Diseases. The objective of the dataset is to diagnostically predict whether or not a patient has diabetes, based on certain diagnostic measurements included in the dataset. Several constraints were placed on the selection of these instances from a larger database. In particular, all patients here are females at least 21 years old of Pima Indian heritage.

Tools

scipy
numpy
matplotlib
pandas
sklearn

and make sure you have Python 2.7 or 3.6+

Run

python3 main.py

Notes

The dataset was split into training set (80%) and test set (20%). The dataset is shuffled before splitting in order to eliminate the possibility of any initial structure in the dataset.
For every observation in the test set, its distance is calculated from all observations in the training set. For this priject, 3 different distances were used in this step, to account for and report on the sensitivity of each distance metric:
- Euclidean distance
- Manhattan distance
- Jaccard distance of sample sets
5-fold Cross Validation was implemented to obtain the percentage of correct classification as a function of the number of nearest neighbours, for the different k values and distance metrics. The method for cross validation is generalized so as to accept a user-defined parameter of the number of groups that a given data sample is to be split into; for this project, 5 was chosen.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
datasets		datasets
source		source
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

pima-indians-classification

Tools

Run

Notes

About

Releases

Packages

Languages

License

GeoRouv/pima-indians-classification

Folders and files

Latest commit

History

Repository files navigation

pima-indians-classification

Tools

Run

Notes

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages