Mr.Blei, pls tell me how to use it. #4

9crk · 2017-03-02T03:11:04Z

I'm a beginer of LDA. pls tell me how to use this lda command.
usage : lda est [initial alpha] [k] [settings] [data] [random/seeded/*] [directory]
lda inf [settings] [model] [data] [name]

in my mind. this is a tool to get the topic words of an article.
so if I have handreds of articles by hand(like 0000.txt-1000.txt). how can I use lda to get the topic words of an article?

chanansh · 2017-08-03T06:01:16Z

the data format is explained in the readme.txt file. Each line should have a count of number of tokens followed by a term index and it counts. The term index should correspond to a vocabulary file.

See https://github.com/blei-lab/lda-c/blob/master/readme.txt for more details:

Data format

Under LDA, the words of each document are assumed exchangeable. Thus,
each document is succinctly represented as a sparse vector of word
counts. The data is a file where each line is of the form:
 [M] [term_1]:[count] [term_2]:[count] ...  [term_N]:[count]
where [M] is the number of unique terms in the document, and the
[count] associated with each term is how many times that term appeared
in the document. Note that [term_1] is an integer which indexes the
term; it is not a string.

kitescat · 2020-04-25T09:11:22Z

So how can i transform my data into this format,is there any script useful？
pls tell me if anyone seeing this.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mr.Blei, pls tell me how to use it. #4

Mr.Blei, pls tell me how to use it. #4

9crk commented Mar 2, 2017

chanansh commented Aug 3, 2017

kitescat commented Apr 25, 2020

Mr.Blei, pls tell me how to use it. #4

Mr.Blei, pls tell me how to use it. #4

Comments

9crk commented Mar 2, 2017

chanansh commented Aug 3, 2017

kitescat commented Apr 25, 2020