Skip to content

dbojado/clustering-exercises

Repository files navigation

Clustering Exercises

Summary

  • Clustering is an unsupervised machine learning methodology
  • It is used to group and identify similar observations when we do not have labels that identify the groups
  • It is often a preprocessing or an exploratory step in the data science pipeline
  • What groupings exist in the data already? (Clustering)

Clustering Use Cases

  • Text: Document classification, summarization, topic modeling, recommendations
  • Geographic: Crime zones, housing prices
  • Marketing: Customer segmentation, market research
  • Anomaly Detection: Account takeover, security risk, fraud
  • Image Processing: Radiology, security

About

Clustering Machine Learning Exercises

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published