On the basis of a medical news I did the text cleaning (clean the stop words, lematized). Then calculate the frequency of words and graph it with Matplotlib. Below I did Topic Modeling with Gensim. Afterwards the calculation of efficiency from the model with c_v measure. It is a simple code. I made it short so it wouldn't be so heavy to read. If anything let me know if you would like any modification or something to be added.
Project is created with:
- Gensim: 3.8.1
- Spacy: 2.0.16
- Matplotlib: 3.0.1
- BeautifulSoup: 4.8.2