Changelog • seededlda

Changes in v1.3.0

CRAN release: 2024-04-11

CRAN release: 2024-04-10

The RcppParallel package is no longer required as the TBB library in the operating system (Linux and MacOS) or Rtools (Windows) is used.
Linux and MacOS must have the TBB library to enable parallel computing before installing this package from the source.

CRAN release: 2023-07-01

CRAN release: 2023-06-12

CRAN release: 2023-05-31

Add auto_iter to textmodel_seededlda() and textmodel_lda() to stop Gibbs sampling automatically before max_iter is reached.
Add batch_size to textmodel_seededlda() and textmodel_lda() to enable the distributed LDA algorithm for parallel computing.

CRAN release: 2023-04-30

Add the gamma parameter to textmodel_seededlda() and textmodel_lda() for sequential classification.
Add textmodel_seqlda() as as short cut for textmodel_lda(gamma = 0.5).
Improve the calculation of weights for seed words.
Add the regularize argument to divergence() for the regularized topic divergence measure.

CRAN release: 2023-03-23

CRAN release: 2023-03-17

CRAN release: 2022-10-09

Add min_prob and select to topics() for greater flexibility
Change the divergence measure from Kullback-Leibler to Jensen-Shannon.
Add weighted, min_size, select to divergence() for regularized topic divergence scores.

CRAN release: 2022-03-28

Change textmodel_seededlda() to set positive integer values to residual.
Fix a bug in textmodel_seededlda() that ignores n-grams when concatenator is not “_“.
Change topics() to return document names.
Add divergence() to optimize the number of topics or the seed words (#26).

CRAN release: 2022-01-07

Change the textmodel_seededlda object to save dictionary and related settings (#18)

CRAN release: 2021-04-08

Add predict() to identify topics of unseen documents (#9)
Allow selecting seed words based on their frequencies using dfm_trim() in textmodel_seededlda() via ... (#8)

CRAN release: 2020-12-17

CRAN release: 2020-09-10