How to infer the subject #1

lhy2749 · 2019-11-29T07:20:26Z

sorry,i just read your paper.But there are so many questions,for example,after Eq.2,3,we can get the log-likelihood of a document,however,how to get the topic-word distribution and the topic-document distribution?

yaof20 · 2020-07-23T06:03:14Z

Hi, I was confused by this issue as well. Here is my understanding:
The optimization object of the model is to maximize the log likelihood of the whole document P(v). Once you have finished the training process, you get the weight matrix W, which is a H by K matrix (where H and K represent the numbers of topic and vocabulary respectively).

Then you could apply this W matrix to equation (1) in the iDocNADE paper which is to compute the hidden state h. To be more specific, h is an H-dimensional vector which could be interpreted as the topic distribution over the topics.

To determine the topic of a new document, you have to input the collection of document words into the model and the final hidden state can be used as the representation of the whole document. Notice that this representation is actually the H-dimensional topic distribution as mentioned above. The topic of this new document can be obtained through this topic distribution.

I hope it helps you. If there is anything wrong, please point it out.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to infer the subject #1

How to infer the subject #1

lhy2749 commented Nov 29, 2019

yaof20 commented Jul 23, 2020

How to infer the subject #1

How to infer the subject #1

Comments

lhy2749 commented Nov 29, 2019

yaof20 commented Jul 23, 2020