Skip to content

singh-l/Clustering_Repo

Repository files navigation

Clustering_Repo

Website

Clustering Text : a comparison between available text vectorization techniques*            
Author: Lovedeep Singh







Abstract. The concept of clustering is of primitive importance in the field of unsupervised learning. We have always required the need to categorize data with respect to some parameters. More or less, this can become quite challenging with the increasing amount of jargon, which requires expert domain knowledge, and with the increasing amount of data. Sometimes, we even do not possess enough knowledge about the data to divide it into categories. We simply do not possess past experiences to train a classification model for categorizing data. This paper present a comparative study on the techniques available for clustering text data using only text vectorization methods.
Proceedings published in AISC-Springer