Tools for Statistical Content Analysis

A framework for statistical analysis in content analysis. In addition to a pipeline for preprocessing text corpora and linking to the latent Dirichlet allocation from the 'lda' package, plots are offered for the descriptive analysis of text corpora and topic models. In addition, an implementation of Chang's intruder words and intruder topics is provided. Sample data for the vignette is included in the toscaData package, which is available on gitHub: <>.


Reference manual

It appears you don't have a PDF plugin for this browser. You can click here to download the reference manual.


0.3-2 by Lars Koppers, 3 months ago,

Browse source code at

Authors: Lars Koppers [aut, cre] , Jonas Rieger [aut] , Karin Boczek [ctb] , Gerret von Nordheim [ctb]

Documentation:   PDF Manual  

GPL (>= 2) license

Imports tm, lda, quanteda, lubridate, htmltools, RColorBrewer, stringr, WikipediR, data.table

Suggests toscaData, testthat, knitr, devtools, rmarkdown

Imported by rollinglda.

Suggested by ldaPrototype.

See at CRAN