XML2R is a framework that reduces the effort required to transform XML content into number of tables while preserving parent to child relationships.
EasieR XML data collection. For an introduction to the package, see here
To install the CRAN version use:
To install the github version use:
library(devtools); install_github("XML2R", "cpsievert"); library(XML2R)
(1) Fix naming of observations when no children exist
(2) Add some tests using testthat
CHANGES IN XML2R VERSION 0.0.6
In version 0.5, the url column does not contain the file name (instead, it contains integers). This has consequences for pitchRx, but is now fixed.
CHANGES IN XML2R VERSION 0.0.5
A dependency to RCurl was added so asynchronous downloads can be done via getURL. This dependency also has the benefit of more robust HTTP downloads.
CHANGES IN XML2R VERSION 0.0.4
The collapse() function was changed to collapse_obs() to avoid namespace clashing with dplyr's collapse().
The child argument was removed from add_key(). The recycle argument was added to add_key() which allows one to use an existing value in the parent node as a key to connect the parent observation to its descendents (thanks Carlos Scheidegger)
Better garbage collection
CHANGES IN XML2R VERSION 0.0.3
url.map option was added to XML2Obs.
CHANGES IN XML2R VERSION 0.0.2
quiet option added to several functions
CHANGES IN XML2R VERSION 0.0.1
First version of XML2R. See the package tutorial
This package was born out of an effort to abstract the functionality of pitchRx::urlsToDataFrame so that other projects can be built on top of this framework.