Analyses of Text using Natural Language Processing and Machine Learning

Transforms text variables to word embeddings; where the word embeddings are used to statistically test the mean difference between set of texts, compute semantic similarity scores between texts, predict numerical variables, and visual statistically significant words according to various dimensions etc. For more information see <>.


Reference manual

It appears you don't have a PDF plugin for this browser. You can click here to download the reference manual.


0.9.10 by Oscar Kjell, 10 months ago,

Report a bug at

Browse source code at

Authors: Oscar Kjell [aut, cre] , Salvatore Giorgi [aut] , Andrew Schwartz [aut]

Documentation:   PDF Manual  

GPL-3 license

Imports dplyr, tokenizers, tibble, stringr, tidyr, ggplot2, ggrepel, cowplot, rlang, purrr, magrittr, parsnip, recipes, rsample, reticulate, tune, workflows, yardstick, future, furrr

Suggests knitr, rmarkdown, testthat, rio, glmnet, randomForest, covr, xml2, ranger

System requirements: Python (>= 3.6.0)

See at CRAN