Tools to Work with the 'Splash' 'JavaScript' Rendering and Scraping Service

'Splash' < https://github.com/scrapinghub/splash> is a 'JavaScript' rendering service. It is a lightweight web browser with an 'HTTP' API, implemented in 'Python' using 'Twisted' and 'QT' and provides some of the core functionality of the 'RSelenium' or 'seleniumPipes' R packages in a lightweight footprint. Some of 'Splash' features include the ability to process multiple web pages in parallel; retrieving 'HTML' results and/or take screen shots; disabling images or use 'Adblock Plus' rules to make rendering faster; executing custom 'JavaScript' in page context; getting detailed rendering info in 'HAR' format.


News

0.4.1

  • removed clipr usage due to CRAN

0.4.0

  • moved to 'docker' pacakge since it's on CRAN
  • temporarily removed render_file() support
  • added code coverage
  • CRAN release

0.3.0

  • added basic pkg tests
  • added mini-DSL to avoid needing to write lua scripts for some common operations
  • added many tests for many types of objects
  • added HAR support
  • added as_req()
  • added as_request()
  • added wait value range check for render_ functions (min 0, max 10)

0.2.0

  • added execute()
  • modified splash_active()
  • added splash_local global variable to avoid typing splash("localhost")

0.1.0

  • Initial release

Reference manual

It appears you don't have a PDF plugin for this browser. You can click here to download the reference manual.