Skip to content

dsckiet/breast-cancer-app-ml

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

BREAST CANCER PREDICTION MODEL

This model is being trained to make predictions about cancer the patient is most likely suffering. This can be used both by medical practitioner and the patient provided data is provided from a proper medical report. The data project predicts whether the patient actually has breast cancer or not from the data provided by the User.


DOCS UI

Functionalities

  • Predict if patient's data predicts between 'Benign' and 'Maligant'.
    Benign
    'Benign' cancer is the cancer which doesn't spread and are non cancerous. In most cases, a doctor diagnosing a tumor as benign will most likely be left alone. Benign tumors are not generally aggressive around the surrounding tissue and in some cases, may continue to grow. If the tumor continues to grow and cause discomfort by pressing against surrounding organs and causing pain, the tumor would be removed.
    Malignant
    'Malignant' cancer cells spread across the body making it very dangerous. Malignant tumors are aggressive and cancerous because damage the surrounding tissue and may be removed depending on the cancerous and aggressive on the severity or aggressiveness of the tumor.
  • Helps in visualization of the dataset using Maps and graphs and provide a basic idea of the dataset. Eg: Heatmap


    PLOT3



DataSet Column/Input required for Predictions:

ID number(unique per record)Not needed for trained, just for reference Diagnosis (M = malignant, B = benign) Testing field
Ten real-valued features are computed for each cell nucleus:
  1. Radius(mean of distances from center to points on the perimeter)
  2. Texture (standard deviation of gray-scale values)
  3. Perimeter
  4. Area
  5. Smoothness (local variation in radius lengths)
  6. Compactness (perimeter² / area — 1.0)
  7. Concavity (severity of concave portions of the contour)
  8. Concave points (number of concave portions of the contour)
  9. Symmetry
  10. Fractal dimension
Each of the 10 values are further given as 3 values in a total of 30 columns, which include mean, standard error and worst values from images taken of the cell.

Link for Kaggle Database:

You may refer this link, this will also provide you with DataSet. 'breast-cancer-prediction' (You may have to clean dataset to get it in ready to use state as few records contain 0s which may effect the prediction)

Instructions to run

  • Pre-requisites:

    • Python Language and its libraries including Pandas, Numpy, Matplotlib etc.
    • Machine Learning Algorithms like....(still working on improving efficiency of model using different algorithms). Few under consideration are mentioned below:
      • Decision Trees Algorithm
      • Gradient Boosting Algorithm
      • k-Nearest Neighbors Algorithm
      • Logistic Regression Algorithm
      • Random Forests Algorithm
      • Support Vector Machine Algorithm
  • Directions to setup/install

< insert code >
  • Directions to execute
< insert code >

Contributors



Made during 🌙 by DSC KIET

About

No description or website provided.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published