Tools to Work with the 'Splash' 'JavaScript' Rendering and Scraping Service

'Splash' < https://github.com/scrapinghub/splash> is a 'JavaScript' rendering service. It is a lightweight web browser with an 'HTTP' API, implemented in 'Python' using 'Twisted' and 'QT' and provides some of the core functionality of the 'RSelenium' or 'seleniumPipes' R packages in a lightweight footprint. Some of 'Splash' features include the ability to process multiple web pages in parallel; retrieving 'HTML' results and/or take screen shots; disabling images or use 'Adblock Plus' rules to make rendering faster; executing custom 'JavaScript' in page context; getting detailed rendering info in 'HAR' format.


News

0.6.0

  • Switch Docker orchestration to the stevedore package
  • Fixed minor check on as.data.frame functions
  • Cleaned up splashr object printing
  • Added Android, Kindle, Apple TV & Chromecast user agents and updated other user agents
  • Updated Travis config to not use old docker pkg components

0.5.0

  • support Splash API basic auth
  • as_data_frame/as.data.frame methods for HAR objects

0.4.1

  • removed clipr usage due to CRAN

0.4.0

  • moved to 'docker' pacakge since it's on CRAN
  • temporarily removed render_file() support
  • added code coverage
  • CRAN release

0.3.0

  • added basic pkg tests
  • added mini-DSL to avoid needing to write lua scripts for some common operations
  • added many tests for many types of objects
  • added HAR support
  • added as_req()
  • added as_request()
  • added wait value range check for render_ functions (min 0, max 10)

0.2.0

  • added execute()
  • modified splash_active()
  • added splash_local global variable to avoid typing splash("localhost")

0.1.0

  • Initial release

Reference manual

It appears you don't have a PDF plugin for this browser. You can click here to download the reference manual.

install.packages("splashr")

0.6.0 by Bob Rudis, 4 months ago


http://gitlab.com/hrbrmstr/splashr


Report a bug at https://gitlab.com/hrbrmstr/splashr/issues


Browse source code at https://github.com/cran/splashr


Authors: Bob Rudis [aut, cre]


Documentation:   PDF Manual  


Task views: Web Technologies and Services


MIT + file LICENSE license


Imports xml2, curl, httr, dplyr, purrr, stats, utils, stevedore, magick, scales, formatR, openssl, stringi, HARtools, jsonlite, lubridate

Suggests testthat, tibble, jpeg, png, covr, knitr, rmarkdown


See at CRAN