Prediction-Model

Cervical cancer screening prediction using Python

Built a classifier for cervical cancer screening prediction using the Kaggle dataset (https://www.kaggle.com/datasets/loveall/cervical-cancer-risk-classification).

Performed BINARY CLASSIFICATION for each of the 4 target variables individually namely Hinselmann, Schiller, Cytology and Biopsy using two classifiers: SVM and KNN.

Steps Performed:

Data Preprocessing

Dealt with missing values
Identified and removed the outliers
Normalized the data

Data Balancing

Used SMOTE to balance the classes

Feature Extraction

Identified useful features and eliminated redundant features
Reduced the dimensionality of data
Used PCA to extract the principal components

Classifier

Used two classifiers for this task: SVM and KNN
Tested the results by tuning the hyperparameters for each classifier, e.g., regularizer weight for soft-margin in SVM and the value of k in KNN

Evaluation

Three evaluation metrics are used for each classifier: Accuracy, Precision and Recall
Confusion matrix for each target variable is plotted

Data Visualization

Visualized the normalized data distribution using boxplot
Identified correlated features using correlation heatmap
Plotted the confusion matrix
Used seaborn and matplotlib libraries for visualization.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
190117.ipynb		190117.ipynb
CS253 - Python Assignment.pdf		CS253 - Python Assignment.pdf
README.md		README.md
kag_risk_factors_cervical_cancer.csv		kag_risk_factors_cervical_cancer.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Prediction-Model

Cervical cancer screening prediction using Python

Steps Performed:

Data Preprocessing

Data Balancing

Feature Extraction

Classifier

Evaluation

Data Visualization

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Prediction-Model

Cervical cancer screening prediction using Python

Steps Performed:

Data Preprocessing

Data Balancing

Feature Extraction

Classifier

Evaluation

Data Visualization

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages