Skip to content

#Project Overview #Objective: Develop a predictive model for diagnosing breast cancer based on provided features. #Data Source: Breast Cancer Wisconsin (Diagnostic) Data Set. #Key Features: Includes measurements like radius, texture, perimeter, area, smoothness, etc., of the cell nuclei.

Notifications You must be signed in to change notification settings

A1fred00-datascience/Breast_Cancer_ML

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 

Repository files navigation

#Breast Cancer Diagnostic Analysis Using Machine Learning

##Project Overview

This project focuses on developing a machine learning model to accurately diagnose breast cancer using a dataset of breast cancer patient records. The project involves data preprocessing, exploratory data analysis, feature engineering, model building, evaluation, and optimization.

##Dataset Description

The dataset used in this project consists of several features computed from digitized images of fine needle aspirate (FNA) of breast masses. Features include characteristics of cell nuclei present in the images. The objective is to classify the diagnosis as either benign or malignant.

##Technologies Used Python - Jupyter Notebook Libraries: Pandas, NumPy, Matplotlib, Seaborn, Scikit-Learn Key Steps in the Project

Data Preprocessing: Cleaning data, handling missing values, and standardizing dataset. Exploratory Data Analysis: Visualizing data distributions and understanding correlations using Matplotlib and Seaborn. Feature Engineering: Standardizing features for model training and isolating influential features. Model Development and Evaluation: Implementing various machine learning models and evaluating them based on performance metrics. Model Optimization: Utilizing GridSearchCV for hyperparameter tuning to enhance model performance. Documentation and Reporting: Comprehensive documentation of methodologies and insights in the Jupyter Notebook. Results and Conclusion

The optimized Random Forest model achieved an accuracy of 96%, indicating a high level of precision in breast cancer diagnosis. The project demonstrates the potential of machine learning in augmenting medical diagnostic processes. How to Run the Project

##Future Scope

Incorporating more diverse data to improve model robustness. Exploring advanced machine learning algorithms and deep learning for enhanced accuracy. Developing a web-based application for easy access and usage by healthcare professionals.

About

#Project Overview #Objective: Develop a predictive model for diagnosing breast cancer based on provided features. #Data Source: Breast Cancer Wisconsin (Diagnostic) Data Set. #Key Features: Includes measurements like radius, texture, perimeter, area, smoothness, etc., of the cell nuclei.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published