Breast Cancer Wisconsin Diagnosis

Overview

This project focuses on using logistic regression to predict breast cancer diagnosis based on various features extracted from digitized images of breast mass. The dataset used in this project is the Breast Cancer Wisconsin (Diagnostic) Data Set (https://www.kaggle.com/uciml/breast-cancer-wisconsin-data) from Kaggle.

Data Description

The dataset contains 569 instances and 30 features.
Features include mean, standard error, and worst values for various attributes such as radius, texture, perimeter, area, smoothness, compactness, concavity, concave points, symmetry, and fractal dimension.
The target variable is the diagnosis, which is classified as malignant (M) or benign (B).

Tools and Technologies

Python: Data preprocessing, model training, and evaluation
pandas, NumPy: Data manipulation and numerical operations
scikit-learn: Implementation of logistic regression model
Matplotlib, Seaborn: Data visualization

Workflow

Data Preprocessing: The dataset is loaded and preprocessed to handle missing values and encode categorical variables if any.
Data Exploration: Exploratory Data Analysis (EDA) is performed to understand the distribution of features and their relationships with the target variable.
Model Training: A logistic regression model is trained on the preprocessed data to predict the diagnosis.
Model Evaluation: The model's performance is evaluated using metrics such as accuracy, precision, recall, and F1-score.
Results: The results of the model are presented, including any insights gained from the analysis.

Usage

Clone the repository:

git clone https://github.com/abckhush/breast-cancer-diagnosis.git
cd breast-cancer-diagnosis

Install the required dependencies
```
pip install -r requirements.txt
```

Run the Jupyter notebook

jupyter notebook breast_cancer_diagnosis.ipynb

Conclusion

The logistic regression model achieved an accuracy of 95% on the test set, demonstrating its effectiveness in predicting breast cancer diagnosis based on the given features.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
Breast_Cancer_Wisconsin_Diagnosis.ipynb		Breast_Cancer_Wisconsin_Diagnosis.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Breast Cancer Wisconsin Diagnosis

Overview

Data Description

Tools and Technologies

Workflow

Usage

Conclusion

About

Releases

Packages

Languages

abckhush/Breast-Cancer-Wisconsin-Diagnosis

Folders and files

Latest commit

History

Repository files navigation

Breast Cancer Wisconsin Diagnosis

Overview

Data Description

Tools and Technologies

Workflow

Usage

Conclusion

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages