TopicBlob
is a Python 3 library for processing textual data. It provides a simple API for diving into common natural language processing (NLP) tasks around topic modeling such as finding similar documents and provide a list of topics given input text.
Here is a live demo of TopicBlob working on wikipedia pages
https://share.streamlit.io/banjtheman/topicblob/main/topicblob_st.py
TopicBlob leverages NLTK
, pandas
, and gensim
, for the heavy lifting
- Topic Extraction
- Similarity Search
- BM25 search ( word ranking search)
- Topic Search
$ pip install topicblob
- Python >= 3.5
- Ensure you have (docker)[https://www.docker.com/] installed locally.
- Build local Docker Image
docker build -t topicblob:local .
- Run App (Simply runs the
example.py
module)docker run topicblob:local