tidy.STM
with content variable in stm
#209
Merged
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
I realized that when then document-level
content
covariate is set instm
, the beta-list contains also the interaction coefficients besides the topic-term probabilities and provides a beta-matrix for each covariate separately, in which casetidy.STM
returns a tibble where some topics/terms are NAs, and multiple values for topic-term combinations (exactly the number of levels in the covariate) - reprex below.So I restricted it to only return the beta-values and added a column to the beta tibble which indicates the covariate.
Created on 2022-04-20 by the reprex package (v2.0.1)