Skip to content

Latest commit

 

History

History
156 lines (92 loc) · 5.62 KB

2022-08-31-clustering.md

File metadata and controls

156 lines (92 loc) · 5.62 KB

DIY Clustering algorithms and models

For more information about the "Project of the Week" initiative at DataTalks.Club, see README.md.

If you want to receive reminders about this event, sign up here

Technologies

  • Scikit-Learn
  • Jupyter notebooks

Note: this is a suggested list of technologies, you can chose alternatives instead

Plan

This is a proposed plan only, you don’t have to follow it day-by-day

Day 1 (31 August, Wednesday)

  • Come up with a project idea
  • Select the dataset for your project
  • Create a github repository
  • Share your progress in Slack and in social media

Day 2 (1 September, Thursday)

  • Learn about K-means clustering
  • Push your changes to github
  • Share your progress in Slack and in social media

Suggested materials:

Found good materials? Create a PR with links!

Day 3 (2 September, Friday)

  • Learn about Mean-shift Clustering
  • Push your changes to github
  • Share your progress in Slack and in social media

Suggested materials:

Found good materials? Create a PR with links!

Day 4 (3 September, Saturday)

  • Learn about Agglomerative Hierarchical Clustering
  • Push your changes to github
  • Share your progress in Slack and in social media

Suggested materials:

Found good materials? Create a PR with links!

Day 5 (4 September, Sunday)

  • Learn about DBSCAN (Density-Based Spatial Clustering of Applications with Noise)
  • Push your changes to github
  • Share your progress in Slack and in social media

Suggested materials:

Found good materials? Create a PR with links!

Day 6 (5 September, Monday)

  • Learn about Expectation–Maximization (EM) Clustering using Gaussian Mixture Models (GMM)

Suggested materials:

Found good materials? Create a PR with links!

Day 7 (6 September, Tuesday)

  • Continue exploring more about this topic
  • Polish the documentation for your project
  • Push your changes to github
  • Share your progress in Slack and in social media
  • Give us feedback
  • Add the link to your project to this project of the week github page

Materials

Notebooks:

Datasets

Note: If you know other good resources about clustering, send a PR

Legend

  • 🏫 Course
  • 💾 Dataset
  • 🗒️ Article
  • 📺 Video tutorial
  • 💻 Code

Projects

List of projects from our participants: