Skip to content

tomasojea/LDA-Topic-Modeling

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 

Repository files navigation

Topic Modeling

General info

On the basis of a medical news I did the text cleaning (clean the stop words, lematized). Then calculate the frequency of words and graph it with Matplotlib. Below I did Topic Modeling with Gensim. Afterwards the calculation of efficiency from the model with c_v measure. It is a simple code. I made it short so it wouldn't be so heavy to read. If anything let me know if you would like any modification or something to be added.

Technologies

Project is created with:

  • Gensim: 3.8.1
  • Spacy: 2.0.16
  • Matplotlib: 3.0.1
  • BeautifulSoup: 4.8.2

Releases

No releases published

Packages

No packages published