This project builds a machine learning model to classify breast tumors as malignant or benign based on medical features.
It demonstrates the complete ML pipeline from data preprocessing to model evaluation.
To develop a reliable classification model that can assist in early detection of breast cancer.
- Python
- NumPy, Pandas
- Scikit-learn
- Matplotlib / Seaborn
- Breast Cancer dataset (from sklearn / UCI)
- Features include tumor radius, texture, perimeter, area, etc.
- Checked for missing values
- Feature scaling / normalization
- Train-test split
- Logistic Regression
- Random Forest Classifier
- (Optional: Neural Network if used)
- Accuracy score
- Confusion Matrix
- Performance comparison
- Achieved accuracy: 85% (replace with your actual result)
- Model successfully classifies tumors with high reliability
The trained model is saved using pickle as:
breast_cancer_model.pkl
This allows the model to be reused without retraining.