
Does GPU help? #31

Closed
lppier opened this issue Dec 28, 2020 · 3 comments
lppier commented Dec 28, 2020

Hi, firstly, thank you so much for this library. I've tried it, and it does take some time to get the topics.
Just wondering, will having a GPU help speed-wise? Is the speed bottlenecked at the sentence-transformers embedding portion?

lppier closed this as completed Dec 28, 2020

lppier (Author) commented Dec 28, 2020

Apologies, I managed to try it on a GPU-enabled cloud server and it was significantly faster.

MaartenGr (Owner) commented Dec 28, 2020

Yes! Using a GPU is highly recommended to speed up inference at the sentence-transformers stage.

However, if you do not have a GPU available, you can use TF-IDF instead, since BERTopic allows custom embeddings to be passed in:

```python
from bertopic import BERTopic
from sklearn.datasets import fetch_20newsgroups
from sklearn.feature_extraction.text import TfidfVectorizer

# Create a TF-IDF sparse matrix
docs = fetch_20newsgroups(subset='all', remove=('headers', 'footers', 'quotes'))['data']
vectorizer = TfidfVectorizer(min_df=5)
embeddings = vectorizer.fit_transform(docs)

# Run BERTopic with the precomputed embeddings
model = BERTopic(allow_st_model=True)
topics, probabilities = model.fit_transform(docs, embeddings)
```

Note that I used the parameter allow_st_model, which uses a sentence-transformer model to fine-tune the topic representation. This should be efficient even without a GPU, since you would only need to embed a few hundred words. However, you can set it to False if you do not want to use a sentence-transformer model at all.
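To make the "any embeddings work" point concrete, here is a small self-contained sketch (the toy corpus is illustrative) showing that what gets passed to fit_transform is just a document-by-feature matrix, which TF-IDF produces without any GPU or transformer:

```python
# Sketch: the "embeddings" handed to BERTopic can be any matrix with one row
# per document, e.g. a sparse TF-IDF matrix from scikit-learn.
from sklearn.feature_extraction.text import TfidfVectorizer

docs = [
    "the gpu speeds up the embedding step",
    "tf-idf needs no gpu at all",
    "topic models group similar documents together",
    "sparse matrices keep memory usage low",
]
vectorizer = TfidfVectorizer()
embeddings = vectorizer.fit_transform(docs)

# One row per document, one column per vocabulary term.
print(embeddings.shape)
```

Because the matrix is sparse, this scales to large corpora on a CPU, at the cost of losing the semantic similarity that transformer embeddings capture.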

EDIT: I did not see your response before posting, but I will leave this up for those interested in other embedding methods.

lppier (Author) commented Dec 29, 2020

Thanks @MaartenGr ! This was very useful.
