Skip to content

VelenL/Anomaly_Detection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

3 Commits
ย 
ย 
ย 
ย 

Repository files navigation

๐Ÿฉบ Breast Cancer Anomaly Detection

A lightweight unsupervised anomaly detection project using the Multivariate Gaussian Model to identify abnormal breast cancer cases. The model learns the normal data distribution and flags low-probability observations as potential anomalies.


๐Ÿ” Overview

  • Applies density estimation to detect rare malignant patterns
  • Uses multivariate Gaussian parameters (mean + covariance)
  • Threshold (ฮต) selected by maximizing F1-score
  • Visualizes outliers that deviate from the main cluster

This approach demonstrates how simple unsupervised methods can reveal meaningful medical anomalies without relying on labels.


๐Ÿ“Š Dataset

  • Wisconsin Breast Cancer Dataset (WBC)
  • 569 samples, 30 standardized numeric features
  • Labels used only for evaluation (not for training)

๐Ÿง  Method Highlights

  • Feature preprocessing & normalization
  • Gaussian probability density calculation
  • Automatic threshold tuning
  • Outlier detection & visualization

๐Ÿ› ๏ธ Technologies

Python, NumPy, Pandas, Matplotlib, Seaborn

About

In this study we will proceed to the elaboration of an anomaly detection. For this, we retrieved a database on breast cancer. This dataset consists of 32 columns. Our analysis will be based mainly on the "diagnosis" column allowing us to know the condition of the cancer.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors