Topic Modelling in python

This repository contains:

topic_model.py file which contains code for a topic modelling object.
The pickled files directory contains all the files needed by topic_model.py
Topic_modelling.ipynb is a notebook with the details behind the creation of the model in topic_model.py

Note: The user needs to have the spacy and gensim modules installed for this code to work.

Usage

Class: topic_model

Make a new topic model object by instatiating the topic_model

import topic_model as t
model = t.topicMod()

The 'model' object contains a trained LDA model that will find topics for a given document.

The object also has pre-loaded documents in the model.test_documents list. The list consists of 2042 documets from the 20newsgroups dataset.

The model.get_document() method gets a random document from the test documents.

The model.lda_description method takes a text document and finds the topics relevant to it.

Some examples of how to get topics for a document:

model.lda_description(model.get_document())

model.lda_description(model.test_documents[1230])

model.lda_description("Some text I want to get topics from. This text will be preferably at least 100-150 words long")

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.ipynb_checkpoints		.ipynb_checkpoints
__pycache__		__pycache__
pickled_files		pickled_files
README.md		README.md
Topic_modelling.ipynb		Topic_modelling.ipynb
topic_model.py		topic_model.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Topic Modelling in python

Usage

Class: topic_model

About

Releases

Packages

Languages

venciso/topicModel

Folders and files

Latest commit

History

Repository files navigation

Topic Modelling in python

Usage

Class: topic_model

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages