Textual Statistics for the Quantitative Analysis of Textual Data

Textual statistics functions, formerly part of the 'quanteda' package, for characterizing and comparing textual data. Includes functions for measuring term and document frequency, the co-occurrence of words, similarity and distance between features and documents, feature entropy, keyword occurrence, readability, and lexical diversity. These functions extend the 'quanteda' package and are specially designed for sparse textual data.




0.94.1 by Kenneth Benoit, 5 months ago


Report a bug at https://github.com/quanteda/quanteda.textstats/issues

Browse source code at https://github.com/cran/quanteda.textstats

Authors: Kenneth Benoit [cre, aut, cph], Kohei Watanabe [aut], Haiyan Wang [aut], Jiong Wei Lua [aut], Jouni Kuha [aut], European Research Council [fnd] (ERC-2011-StG 283794-QUANTESS)

Documentation: PDF Manual

GPL-3 license

Imports quanteda, Matrix, methods, nsyllable, proxyC, Rcpp, RcppParallel, stringi

Suggests entropy, ExPosition, proxy, rmarkdown, spelling, svs, testthat, knitr

Linking to Rcpp, RcppParallel, RcppArmadillo, quanteda

System requirements: C++11

Imported by LSX, docreview, newsmap, rainette.

Suggested by quanteda, quanteda.textplots.
