how to predict on new data? #9

erdnaxel · 2020-12-17T00:18:58Z

hello:

love the package!!

i’m wondering how to apply the model to new data?

koheiw · 2020-12-23T08:19:57Z

In the original GibbsLDA++, topics of unseen documents are inferred in another round of Gibbs sampling. I haven't implemented this function because I didn't think many people would separate the fitting and prediction steps with LDA.
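For illustration, that extra round of Gibbs sampling can be sketched roughly as follows. This is a minimal sketch, not code from GibbsLDA++ or from this package: the function name `infer_theta`, the prior `alpha`, the iteration count, and the toy `phi` are all assumptions made for the example. The topic-word distribution is held fixed and only the topic assignments of the new document's tokens are resampled.

```r
# Hypothetical helper: infer the topic proportions of ONE unseen document
# by Gibbs sampling while keeping the fitted topic-word matrix phi fixed.
# phi: topics x words matrix with column names; words: character vector of tokens.
infer_theta <- function(words, phi, alpha = 0.1, n_iter = 200) {
  k <- nrow(phi)
  # random initial topic assignment for every token
  z <- sample.int(k, length(words), replace = TRUE)
  n_k <- tabulate(z, nbins = k)  # tokens per topic in this document
  for (iter in seq_len(n_iter)) {
    for (i in seq_along(words)) {
      # remove the current token from the counts
      n_k[z[i]] <- n_k[z[i]] - 1
      # conditional probability of each topic for this token (unnormalised)
      p <- (n_k + alpha) * phi[, words[i]]
      z[i] <- sample.int(k, 1, prob = p)
      n_k[z[i]] <- n_k[z[i]] + 1
    }
  }
  # smoothed topic proportions for the document
  setNames((n_k + alpha) / (length(words) + k * alpha), rownames(phi))
}

# Toy phi with two well-separated topics (assumed values)
phi <- rbind(
  A = c(apple = 0.45, pear = 0.45, car = 0.05, bus = 0.05),
  B = c(apple = 0.05, pear = 0.05, car = 0.45, bus = 0.45)
)
set.seed(1)
theta <- infer_theta(c("apple", "pear", "apple", "apple", "pear"), phi)
```

The result is a vector of topic proportions that sums to one. Because it is sampled, it can vary slightly between runs unless the seed is fixed.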

With the current version, you can still predict topics of unseen documents using the topic-word distribution (phi). Here, x should be a fitted LDA object and newdata a DFM (document-feature matrix).

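library(quanteda)  # provides dfm_match()

predict <- function(x, newdata = NULL) {
    # use the new DFM if one is given, otherwise the data the model was fitted on
    if (!is.null(newdata)) {
        data <- newdata
    } else {
        data <- x$data
    }
    # align the features with the columns of phi (topics x words)
    data <- dfm_match(data, colnames(x$phi))
    # score every document against every topic and pick the highest score
    temp <- data %*% t(x$phi)
    result <- factor(max.col(temp), labels = rownames(x$phi),
                     levels = seq_len(nrow(x$phi)))
    # documents with no matching features cannot be assigned a topic
    result[rowSums(data) == 0] <- NA
    return(result)
}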

Please be aware that the result of predict() can differ from that of topics(), because the two are computed by different algorithms: predict() relies only on phi rather than on another round of sampling.
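To make the scoring step concrete, here is a small worked example in base R. It is only a sketch: the `phi` values and the count matrix are made up, and a plain matrix stands in for a quanteda DFM, so `dfm_match()` is not needed. It follows the same steps as the function above: multiply the counts by the transposed phi, take the highest-scoring topic per document, and set empty documents to NA.

```r
# Hypothetical topic-word distribution: rows are topics, columns are words
phi <- rbind(
  sports   = c(goal = 0.50, match = 0.40, vote = 0.05, party = 0.05),
  politics = c(goal = 0.05, match = 0.05, vote = 0.50, party = 0.40)
)

# Toy document-feature counts (a plain matrix standing in for a DFM)
newdata <- rbind(
  doc1 = c(goal = 3, match = 1, vote = 0, party = 0),
  doc2 = c(goal = 0, match = 1, vote = 2, party = 2),
  doc3 = c(goal = 0, match = 0, vote = 0, party = 0)
)

# documents x topics score matrix
scores <- newdata %*% t(phi)

# most likely topic per document; ties resolved deterministically here
pred <- factor(max.col(scores, ties.method = "first"),
               levels = seq_len(nrow(phi)), labels = rownames(phi))

# a document without any known words gets no topic
pred[rowSums(newdata) == 0] <- NA
names(pred) <- rownames(newdata)
print(pred)
```

Note that `max.col()` breaks ties at random by default, so a document that scores equally on two topics may be assigned differently between runs unless `ties.method` is set.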

tomseinen · 2020-12-23T12:46:42Z

Came here for the same question as @erdnaxel.
I think implementing the predict function would be much appreciated.

Great work!

erdnaxel · 2020-12-23T23:52:23Z

thank you, i really appreciate the response! i will try it out as soon as i can.

koheiw · 2020-12-30T09:35:45Z

Guys, I created predict() in the issue-9 branch. Please give it a try.

koheiw · 2020-12-30T12:16:31Z

I am closing this because the branch has been merged, so please open a new issue if there are problems.

koheiw mentioned this issue Dec 30, 2020

Add predict #12

Merged

koheiw closed this as completed Dec 30, 2020

koheiw mentioned this issue Dec 31, 2021

Add model argument to pass existing model #23

Merged
