Distributed Representations of Sentences and Documents

Learn vector representations of sentences, paragraphs or documents by using the 'Paragraph Vector' algorithms, namely the distributed bag of words ('PV-DBOW') and the distributed memory ('PV-DM') model. The techniques in the package are detailed in the paper "Distributed Representations of Sentences and Documents" by Mikolov et al. (2014), available at .


Reference manual

It appears you don't have a PDF plugin for this browser. You can click here to download the reference manual.


0.1.1 by Jan Wijffels, a month ago


Browse source code at https://github.com/cran/doc2vec

Authors: Jan Wijffels [aut, cre, cph] (R wrapper) , BNOSAC [cph] (R wrapper) , hiyijian [ctb, cph] (Code in src/doc2vec)

Documentation:   PDF Manual  

MIT + file LICENSE license

Imports Rcpp, stats

Suggests tokenizers.bpe

Linking to Rcpp

See at CRAN