Machine Reading Tea Leaves #10

amritbhanu · 2016-04-11T15:06:58Z

Machine Reading Tea Leaves: Automatically Evaluating Topic Coherence and Topic Model Quality

[bibtex](@inproceedings{lau2014machine,
title={Machine Reading Tea Leaves: Automatically Evaluating Topic Coherence and Topic Model Quality.},
author={Lau, Jey Han and Newman, David and Baldwin, Timothy},
booktitle={EACL},
pages={530--539},
year={2014}
})

General:

A good paper that gives a rationale for topic instability.

Measures:

Notion of topic “coherence”, with an automatic method for estimating it based on pairwise pointwise mutual information (PMI) between the topic words.
Direct approach: ask people about the topics. Indirect approach: evaluate PMI, CP.
To create gold-standard coherence judgements, they used Amazon Mechanical Turk.
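
A minimal sketch of the pairwise PMI coherence idea noted above, to make the measure concrete. It is not the paper's implementation. The function name `pmi_coherence`, the toy corpus, document-level co-occurrence counting, and skipping pairs that never co-occur are all assumptions made for illustration.

```python
import math
from itertools import combinations


def pmi_coherence(top_words, documents):
    """Mean pairwise PMI over a topic's top words.

    Probabilities are estimated from document-level co-occurrence
    (an assumption; a sliding window over a reference corpus is
    another common choice).
    """
    doc_sets = [set(doc) for doc in documents]
    n_docs = len(doc_sets)

    def prob(*words):
        # Fraction of documents containing all the given words.
        hits = sum(1 for d in doc_sets if all(w in d for w in words))
        return hits / n_docs

    scores = []
    for w1, w2 in combinations(top_words, 2):
        p1, p2, p12 = prob(w1), prob(w2), prob(w1, w2)
        if p1 == 0 or p2 == 0 or p12 == 0:
            # Assumption: pairs that never co-occur are skipped
            # rather than smoothed, to avoid log(0).
            continue
        scores.append(math.log(p12 / (p1 * p2)))
    return sum(scores) / len(scores) if scores else 0.0


# Hypothetical toy corpus: each document is a list of tokens.
docs = [
    ["tea", "leaf", "cup"],
    ["tea", "leaf"],
    ["car", "road"],
    ["car", "road", "tea"],
]
coherent = pmi_coherence(["tea", "leaf"], docs)
mixed = pmi_coherence(["tea", "car"], docs)
```

On this toy corpus the pair that co-occurs often ("tea", "leaf") scores higher than the pair that rarely does ("tea", "car"), which is the behaviour a coherence measure is expected to show.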

Problems:

perplexity correlates negatively with topic interpretability

Research Question:

word intrusion measures topic interpretability differently to observed coherence

Terminologies:

topic coherence, the semantic interpretability of the top terms usually used to describe discovered topics
“intruder word”, which has low probability in the topic of interest, but high probability in other topics
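
The intruder-word definition above can be sketched as a small selection routine. This is an illustrative sketch only: the function name `pick_intruder`, the dictionary layout for topics, and the deterministic tie-breaking are assumptions, and a real word-intrusion setup would typically sample the intruder at random from the candidates.

```python
def pick_intruder(topics, target, top_n=5):
    """Pick an intruder word for the `target` topic.

    topics: dict mapping topic id -> dict mapping word -> probability
    (a hypothetical layout chosen for this sketch).
    The intruder is a top word of some other topic that has the
    lowest probability in the target topic.
    """
    def top_words(topic_id):
        dist = topics[topic_id]
        return sorted(dist, key=dist.get, reverse=True)[:top_n]

    target_top = set(top_words(target))
    # Words with high probability in other topics...
    candidates = {
        w for tid in topics if tid != target for w in top_words(tid)
    } - target_top
    if not candidates:
        return None
    # ...and low probability in the topic of interest.
    # Ties are broken alphabetically (an assumption, for determinism).
    return min(candidates, key=lambda w: (topics[target].get(w, 0.0), w))


# Hypothetical topic-word distributions.
topics = {
    "drinks": {"tea": 0.4, "cup": 0.3, "leaf": 0.2, "road": 0.06, "car": 0.04},
    "transport": {"car": 0.5, "road": 0.3, "tea": 0.1, "cup": 0.05, "leaf": 0.05},
}
intruder = pick_intruder(topics, "drinks", top_n=2)
```

Here the top words of "drinks" are "tea" and "cup", and "car" is chosen because it is a top word of "transport" with the lowest probability under "drinks".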

amritbhanu added the Papers label Apr 11, 2016

This was referenced Apr 11, 2016

Read this #4

Closed

Coherence of Descriptors #11

Closed

amritbhanu closed this as completed Sep 6, 2016
