maximtrp/topic-modeling-pipelines

Topic Modeling Pipelines

Two LDA pipelines for topic modeling in Python, built on the pymystem3 (a Python wrapper for the Yandex mystem morphological analyzer), gensim, and tomotopy packages. The pipelines cover the crucial steps of topic modeling: lemmatization, stop-word filtering, model training, and saving the phi (topic-word) and theta (document-topic) matrices.
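To make the listed steps concrete, here is a minimal, dependency-free sketch of the same flow: preprocessing, stop-word filtering, LDA training, and writing out phi and theta. It is not the code from this repository. It assumes a tiny English toy corpus, uses lowercasing as a stand-in for mystem lemmatization, and swaps gensim/tomotopy for a small hand-written collapsed Gibbs sampler. All names (`preprocess`, `train_lda`, `save_matrix`, `STOP_WORDS`) and the matrix orientation (phi as topics × words, theta as documents × topics) are assumptions made for illustration.

```python
import csv
import io
import random

# Hypothetical stop-word list; a real pipeline would use a fuller one.
STOP_WORDS = {"the", "a", "of", "and", "in", "is", "are"}


def preprocess(text, lemmatize=str.lower):
    """Split on whitespace, apply a lemmatizer stand-in, drop stop words."""
    tokens = (lemmatize(tok) for tok in text.split())
    return [t for t in tokens if t.isalpha() and t not in STOP_WORDS]


def train_lda(docs, n_topics, n_iter=200, alpha=0.1, beta=0.01, seed=0):
    """Collapsed Gibbs sampling for LDA; returns (vocab, phi, theta)."""
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    V, K, D = len(vocab), n_topics, len(docs)
    n_dk = [[0] * K for _ in range(D)]  # topic counts per document
    n_kw = [[0] * V for _ in range(K)]  # word counts per topic
    n_k = [0] * K                       # total tokens per topic
    z = []                              # topic assignment of every token

    # Random initialization of topic assignments.
    for d, doc in enumerate(docs):
        z.append([])
        for w in doc:
            k = rng.randrange(K)
            z[d].append(k)
            n_dk[d][k] += 1
            n_kw[k][word_id[w]] += 1
            n_k[k] += 1

    for _ in range(n_iter):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                v, k = word_id[w], z[d][i]
                # Remove the token, resample its topic, add it back.
                n_dk[d][k] -= 1
                n_kw[k][v] -= 1
                n_k[k] -= 1
                weights = [
                    (n_dk[d][t] + alpha) * (n_kw[t][v] + beta) / (n_k[t] + V * beta)
                    for t in range(K)
                ]
                k = rng.choices(range(K), weights=weights)[0]
                z[d][i] = k
                n_dk[d][k] += 1
                n_kw[k][v] += 1
                n_k[k] += 1

    # Smoothed point estimates: each row is a probability distribution.
    phi = [[(n_kw[k][v] + beta) / (n_k[k] + V * beta) for v in range(V)]
           for k in range(K)]
    theta = [[(n_dk[d][k] + alpha) / (len(docs[d]) + K * alpha) for k in range(K)]
             for d in range(D)]
    return vocab, phi, theta


def save_matrix(fh, col_names, row_names, matrix):
    """Write a labeled matrix as CSV to an open text file object."""
    writer = csv.writer(fh)
    writer.writerow([""] + list(col_names))
    for name, row in zip(row_names, matrix):
        writer.writerow([name] + [f"{x:.4f}" for x in row])


texts = [
    "Cats and dogs are pets",
    "Dogs chase cats in the yard",
    "Stocks and bonds are assets",
    "The market prices stocks and bonds",
]
docs = [preprocess(t) for t in texts]
vocab, phi, theta = train_lda(docs, n_topics=2)

topic_names = [f"topic_{k}" for k in range(len(phi))]
doc_names = [f"doc_{d}" for d in range(len(theta))]

# In-memory buffers keep the sketch side-effect free.
phi_csv, theta_csv = io.StringIO(), io.StringIO()
save_matrix(phi_csv, vocab, topic_names, phi)
save_matrix(theta_csv, topic_names, doc_names, theta)
```

To write real files, the `io.StringIO()` buffers can be replaced with `open("phi.csv", "w", newline="")` and `open("theta.csv", "w", newline="")`. With gensim or tomotopy, only the `train_lda` step would change; the surrounding preprocessing and saving steps keep the same shape.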