Topic-Modeling-in-R

Visualizing topic models with the LDAvis and topicmodels packages in R

This project builds a word cloud and visualizes the topics found in the abstracts of academic publications. It uses the tm package in R to build a corpus and remove stopwords, then creates a document-term matrix from that corpus. A word cloud is generated from the most frequent words. Latent Dirichlet allocation (LDA) is a generative statistical model in which sets of observations are explained by unobserved groups that account for why some parts of the data are similar; for text, each document is modeled as a mixture of topics and each topic as a distribution over words. LDA is used to fit a topic model with k = 10 topics. The LDAvis package then reduces the multi-dimensional topics to two dimensions using a principal-components projection of the distances between topics, plotted on the PC1 and PC2 axes.
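The steps above can be sketched roughly as follows. This is a minimal illustration, not the contents of Abstract_topic_modeling.R: the `abstracts` vector is made-up stand-in text (the column layout of NAP_abstracts.csv is not documented here), `k` is set to 3 only because the toy corpus is tiny (the project uses k = 10), and the use of `topicmodels::LDA` with its default settings is an assumption.

```r
library(tm)
library(topicmodels)

# Hypothetical stand-in for the abstracts column of NAP_abstracts.csv.
abstracts <- c(
  "The study measures ocean temperature and the effect on coral reefs.",
  "Coral reefs decline as ocean temperature and acidity increase.",
  "The vaccine trial measures immune response in adult patients.",
  "Patients in the clinical trial showed a strong immune response.",
  "The bridge design improves highway safety and traffic flow.",
  "Highway traffic models inform bridge and road safety design."
)

# Build the corpus and clean it: lower-case, strip punctuation, drop stopwords.
corpus <- VCorpus(VectorSource(abstracts))
corpus <- tm_map(corpus, content_transformer(tolower))
corpus <- tm_map(corpus, removePunctuation)
corpus <- tm_map(corpus, removeWords, stopwords("english"))
corpus <- tm_map(corpus, stripWhitespace)

# Document-term matrix: one row per abstract, one column per term.
dtm <- DocumentTermMatrix(corpus)

# Term frequencies across the whole corpus, most frequent first.
freq <- sort(slam::col_sums(dtm), decreasing = TRUE)

# Draw the word cloud only in an interactive session with wordcloud installed.
if (interactive() && requireNamespace("wordcloud", quietly = TRUE)) {
  wordcloud::wordcloud(names(freq), freq, min.freq = 1, max.words = 50)
}

# Fit LDA. k = 3 here for the toy data; the project description uses k = 10.
k <- 3
fit <- LDA(dtm, k = k, control = list(seed = 42))

post  <- posterior(fit)
phi   <- post$terms   # topics x terms: word distribution of each topic
theta <- post$topics  # documents x topics: topic mixture of each document

# Hand the fitted model to LDAvis, if it is installed.
if (requireNamespace("LDAvis", quietly = TRUE)) {
  json <- LDAvis::createJSON(
    phi            = phi,
    theta          = theta,
    doc.length     = slam::row_sums(dtm),
    vocab          = colnames(phi),
    term.frequency = slam::col_sums(dtm)
  )
  # Open the interactive view only when a browser is available.
  if (interactive()) LDAvis::serVis(json)
}
```

Each row of `phi` and of `theta` sums to 1, which is the form `LDAvis::createJSON` expects. With real data, any abstract left empty after stopword removal needs to be dropped from the document-term matrix before fitting, since `LDA` rejects all-zero rows.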

Files in this repository:

NAP_10_topics
Abstract_topic_modeling.R
NAP_abstracts.csv
README.md

anandg112/Visualizing-Topic-Models

Folders and files

Latest commit

History

Repository files navigation

Topic-Modeling-in-R

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages