Topic Coherence Measurements #1489
#90 is a very nice thread about this specific topic and a worthwhile read. To save you from a long read: there are currently no plans to add these types of measurements to the library, because measuring coherence is exceedingly difficult and the metrics themselves are sometimes flawed. Coherence carries a degree of subjectivity, which makes it difficult to treat such an evaluation metric as a ground truth. Whatever evaluation metric is implemented in BERTopic, some users are likely to optimize for it simply because it is built in. The issue is that the resulting performance cannot be guaranteed, given the somewhat subjective nature of these metrics.

Instead, each use case requires a different set of evaluation metrics. Although we like to evaluate topic modeling techniques with coherence, it only shows a very small piece of what a topic model actually is. What about the quality of the document-topic assignment? The diversity of topics? What if we have labels instead of keywords? There simply is no "best" evaluation metric.

There is a nice package called OCTIS that implements a number of interesting evaluation metrics. You can use it to evaluate BERTopic on your use case of interest.
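To make the "diversity of topics" point concrete, here is a minimal sketch of one common diversity measure: the fraction of unique words among the top words of all topics. It is plain Python and not part of BERTopic or OCTIS; the function name `topic_diversity`, the `top_k` parameter, and the toy topics are assumptions made up for illustration.

```python
def topic_diversity(topics, top_k=10):
    """Fraction of unique words among the top_k words of every topic.

    `topics` is a list of word lists, one per topic, ordered by importance.
    1.0 means no word is shared between topics; values near 0 mean the
    topics largely repeat the same words.
    """
    top_words = [word for topic in topics for word in topic[:top_k]]
    if not top_words:
        return 0.0
    return len(set(top_words)) / len(top_words)


# Hypothetical toy topics: "car" and "speed" appear in both.
topics = [
    ["car", "driver", "wheel", "speed"],
    ["car", "engine", "fuel", "speed"],
]
print(topic_diversity(topics, top_k=4))  # 6 unique words / 8 total = 0.75
```

A score like this says nothing about whether the words inside a topic belong together, which is one reason a single metric rarely tells the whole story.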
Thanks for the reply. Obviously adding them to BERTopic would be redundant.
Evaluating the quality of the discovered topics is challenging. This paper and this one introduce a set of nice measurements for topic coherence and diversity, such as c_v coherence: Given topic1={car,driver,wheel,speed}
Do you have any plans to add these types of measurements to the library? If not, what do you suggest as the best measurement of the coherency in topics?
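For readers who want a feel for what a coherence score computes, below is a rough sketch using the topic1 words from above. Note that this is not c_v itself (c_v additionally uses a sliding window and an indirect cosine comparison); it is the simpler average NPMI over word pairs, estimated from document-level co-occurrence. The function name `npmi_coherence`, the handling of edge cases, and the toy documents are all assumptions for illustration.

```python
import math
from itertools import combinations


def npmi_coherence(topic_words, documents):
    """Average NPMI over all word pairs in a topic.

    `documents` is a list of tokenized documents (lists of words).
    Probabilities are estimated as the share of documents containing
    a word (or both words of a pair). The result lies in [-1, 1].
    """
    doc_sets = [set(doc) for doc in documents]
    n_docs = len(doc_sets)
    pairs = list(combinations(topic_words, 2))
    if n_docs == 0 or not pairs:
        return 0.0

    def prob(*words):
        return sum(all(w in d for w in words) for d in doc_sets) / n_docs

    scores = []
    for w1, w2 in pairs:
        p1, p2, p12 = prob(w1), prob(w2), prob(w1, w2)
        if p12 == 0:
            scores.append(-1.0)  # the pair never co-occurs
        elif p12 == 1.0:
            scores.append(1.0)   # assumption: treat "always together" as 1
        else:
            pmi = math.log(p12 / (p1 * p2))
            scores.append(pmi / -math.log(p12))
    return sum(scores) / len(scores)


# Hypothetical toy corpus, already tokenized.
docs = [
    ["car", "driver", "wheel", "speed"],
    ["car", "driver", "speed"],
    ["car", "wheel"],
    ["banana", "apple"],
]
topic1 = ["car", "driver", "wheel", "speed"]
print(npmi_coherence(topic1, docs))             # positive: words co-occur
print(npmi_coherence(["car", "banana"], docs))  # -1.0: never co-occur
```

On a real corpus the score depends heavily on tokenization and on which reference documents are used, so treat numbers from a sketch like this as relative rather than absolute.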