Skip to content

officialpm/Breast-Cancer-Wisconsin-Diagnostic

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Prediction of Benign or Malignant Cancer Tumors visitor badge License

Breast-Cancer-Wisconsin-Diagnostic

Dataset Information

Link - http://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+%28Diagnostic%29

Attribute Information

1)ID number

2-31) Ten synthetic-valued features are computed for each cell nucleus:

  • radius (mean of distances from the center to points on the perimeter)
  • texture (standard deviation of gray-scale values)
  • perimeter
  • area
  • smoothness (local variation in radius lengths)
  • compactness (perimeter^2 / area - 1.0)
  • concavity (severity of concave portions of the contour)
  • concave points (number of concave portions of the contour)
  • symmetry
  • fractal dimension ("coastline approximation" - 1)

The mean, standard error and "worst" or largest (mean of the three largest values) of these features were computed for each image, resulting in 30 features.

  1. Diagnosis (M = malignant, B = benign)

Notebooks

PCA RandomForestClassifier gives good accuracy as compare to normal RandomForestClassifier model this is because Principal Component Analysis (PCA) takes only those features, which are explaining high varience, so because of that we are getting good accuracy for PCA as compare to normal models.


Linkedin Badge Gmail Badge GitHub Badge