Retrieve Structured, Textual Data from Various Web Sources

Facilitate text retrieval from feed formats like XML (RSS, ATOM) and JSON. Also direct retrieval from HTML is supported. As most (news) feeds only incorporate small fractions of the original text tm.plugin.webmining even retrieves and extracts the text of the original text source.


Reference manual

It appears you don't have a PDF plugin for this browser. You can click here to download the reference manual.


1.3 by Mario Annau, 7 years ago

Report a bug at

Browse source code at

Authors: Mario Annau [aut, cre]

Documentation:   PDF Manual  

Task views: Natural Language Processing, Web Technologies and Services

GPL-3 license

Imports NLP, tm, boilerpipeR, RCurl, XML, RJSONIO

Suggests testthat

See at CRAN