Retrieve Structured, Textual Data from Various Web Sources

Facilitate text retrieval from feed formats like XML (RSS, ATOM) and JSON. Also direct retrieval from HTML is supported. As most (news) feeds only incorporate small fractions of the original text tm.plugin.webmining even retrieves and extracts the text of the original text source.


News

Reference manual

It appears you don't have a PDF plugin for this browser. You can click here to download the reference manual.

install.packages("tm.plugin.webmining")

1.3 by Mario Annau, 4 years ago


https://github.com/mannau/tm.plugin.webmining


Report a bug at https://github.com/mannau/tm.plugin.webmining/issues


Browse source code at https://github.com/cran/tm.plugin.webmining


Authors: Mario Annau [aut, cre]


Documentation:   PDF Manual  


Task views: Natural Language Processing, Web Technologies and Services


GPL-3 license


Imports NLP, tm, boilerpipeR, RCurl, XML, RJSONIO

Suggests testthat


See at CRAN