Download Public Domain Works from Wikisource

Download public domain works from Wikisource < https://wikisource.org/>, a free library from the Wikimedia Foundation project.


The wikisourcer R package helps you download public domain works from the free library Wikisource.

It includes two functions for downloading books and pages by url.

  • wikisource_book() to download a book.
  • wikisource_page() to download a page.

Installation

install.packages("wikisourcer") # or devtools::install_github("lgnbhl/wikisourcer")

Minimal examples

Download Voltaire’s philosophical novel Candide.

library(wikisourcer)
 
wikisource_book(url = "https://en.wikisource.org/wiki/Candide")

Download Chapter 1 of Candide.

wikisource_page(wikiurl = "https://en.wikisource.org/wiki/Candide/Chapter_1", 
                page = "Chapter 1")

Download Candide in French, Spanish and Italian.

library(purrr)
 
fr <- "https://fr.wikisource.org/wiki/Candide,_ou_l%E2%80%99Optimisme/Garnier_1877"
es <- "https://es.wikisource.org/wiki/C%C3%A1ndido,_o_el_optimismo"
it <- "https://it.wikisource.org/wiki/Candido"
 
purrr::map_df(c(fr, es, it), wikisource_book)

For more information on how to use wikisourcer, please read the vignette.

News

wikisourcer 0.1.3

  • Added a NEWS.md file to track changes to the package.
  • Fixed and improved vignette.
  • Removed dplyr and stringrdependencies.
  • Added the boolean cleaned variables to give the possibility to download all the metadata of the wikipages.

wikisourcer 0.1.4

  • Fixed vignette links.

Reference manual

It appears you don't have a PDF plugin for this browser. You can click here to download the reference manual.

install.packages("wikisourcer")

0.1.4 by Félix Luginbuhl, 3 months ago


https://github.com/lgnbhl/wikisourcer


Report a bug at https://github.com/lgnbhl/wikisourcer/issues


Browse source code at https://github.com/cran/wikisourcer


Authors: Félix Luginbuhl [aut, cre]


Documentation:   PDF Manual  


MIT + file LICENSE license


Imports tibble, magrittr, rvest, purrr, xml2, urltools

Suggests dplyr, stringr, knitr, rmarkdown, ggplot2, tidyr, tidytext, widyr, SnowballC, ggraph, igraph


See at CRAN