Skip to content

Perform clustering techniques (K-Means and Hierarchical Clustering) on a food dataset to determine which types of food are more likely to be grouped together.

Notifications You must be signed in to change notification settings

phuongdtrn/Clustering-Text-With-Python

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 

Repository files navigation

Text Clustering with NLP and clustering techniques

Perform clustering techniques (K-Means and Hierarchical Clustering) on a food dataset to determine which types of food are more likely to be grouped together.

Step 1: Preprocess the data

  • Remove punctuations, numbers, stop words
  • Word stemming: words are reduced to their root form (plural to single)

Step 2: Create bag of words: Create a word matrix

Step 3: Apply TF-IDF on bag of words to evaluate how important a word is to a document in a collection or corpus.

K-Means Clustering

Screen Shot 2022-04-03 at 9 56 36 AM

Hierarchical Clustering

image

About

Perform clustering techniques (K-Means and Hierarchical Clustering) on a food dataset to determine which types of food are more likely to be grouped together.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published