Topic Modeling on Reddit with spaCy, Latent Dirichlet Allocation and Gensim
This project aims to model and discover the main topics discussed within the data science community on Reddit (a social network similar to Twitter). Reddit offers a free API for retrieving content from its subreddits. A subreddit is a community dedicated to a particular subject, and each one contains many posts.
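As a rough illustration of what retrieving posts from a subreddit can look like, the sketch below uses only the Python standard library and Reddit's public JSON listing endpoint rather than an authenticated API client. It is not necessarily how this project collects its data: the function names, the `User-Agent` string, and the sample payload are all assumptions made for the example, and the live endpoint is rate limited and may require authentication.

```python
import json
import urllib.request


def listing_url(subreddit, limit=25):
    """Build the URL of a subreddit's public JSON listing of newest posts."""
    return f"https://www.reddit.com/r/{subreddit}/new.json?limit={limit}"


def extract_posts(listing):
    """Pull the title and body text out of a Reddit 'Listing' payload."""
    children = listing.get("data", {}).get("children", [])
    return [
        {
            "title": child["data"].get("title", ""),
            "text": child["data"].get("selftext", ""),
        }
        for child in children
    ]


def fetch_posts(subreddit, limit=25):
    """Download and parse one page of posts (needs network access)."""
    request = urllib.request.Request(
        listing_url(subreddit, limit),
        # Hypothetical agent string; Reddit expects a descriptive one.
        headers={"User-Agent": "topic-modeling-sketch/0.1"},
    )
    with urllib.request.urlopen(request) as response:
        return extract_posts(json.load(response))


# Offline illustration: a hand-made payload shaped like a Reddit listing.
sample = {
    "kind": "Listing",
    "data": {
        "children": [
            {"kind": "t3", "data": {"title": "Best way to learn LDA?", "selftext": "Any tips?"}},
            {"kind": "t3", "data": {"title": "Weekly thread"}},
        ]
    },
}
posts = extract_posts(sample)
```

Calling `fetch_posts("datascience")` would return a list of dictionaries in the same shape as `posts`, which can then be passed on to text preprocessing.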