perplexity()
computes the perplexity score to help users to chose the
optimal values of hyper-parameters for LDA.
Arguments
- x
a LDA model fitted by
textmodel_seededlda()
ortextmodel_lda()
.- newdata
if provided,
theta
andphi
are estimated through fresh Gibbs sampling.- ...
additional arguments passed to textmodel_lda.
Details
perplexity()
predicts the distribution of words in the dfm based
on x$alpha
and x$gamma
and then compute the sum of disparity between their
predicted and observed frequencies. The perplexity score minimizes when the
chosen values of hyper-parameters such as k
, alpha
and gamma
are
optimal.