Distributed Representations of Sentences and Documents

Learn vector representations of sentences, paragraphs or documents by using the 'Paragraph Vector' algorithms, namely the distributed bag of words ('PV-DBOW') and the distributed memory ('PV-DM') model. The techniques in the package are detailed in the paper "Distributed Representations of Sentences and Documents" by Mikolov et al. (2014), available at .


News

Reference manual

It appears you don't have a PDF plugin for this browser. You can click here to download the reference manual.

install.packages("doc2vec")

0.1.1 by Jan Wijffels, a month ago


https://github.com/bnosac/doc2vec


Browse source code at https://github.com/cran/doc2vec


Authors: Jan Wijffels [aut, cre, cph] (R wrapper) , BNOSAC [cph] (R wrapper) , hiyijian [ctb, cph] (Code in src/doc2vec)


Documentation:   PDF Manual  


MIT + file LICENSE license


Imports Rcpp, stats

Suggests tokenizers.bpe

Linking to Rcpp


See at CRAN