A Fast, Easy-to-Use Tool for Manipulating Tables in Databases and a Wrapper of MADlib

Provides an R interface for the Pivotal Data stack running on 'PostgreSQL', 'Greenplum' or 'Apache HAWQ (incubating)' databases with parallel and distributed computation ability for big data processing. 'PivotalR' provides an R interface to various database operations on tables or views. These operations are almost the same as the corresponding native R operations. Thus users of R do not need to learn 'SQL' when they operate on objects in the database. It also provides a wrapper for 'Apache MADlib (incubating)', which is an open- source library for parallel and scalable in-database analytics.


PivotalR is a package that enables users of R, the most popular open source statistical programming language and environment to interact with (Greenplum) Database as well as Apache HAWQ (incubating) and the open-source database PostgreSQL for Big Data analytics. It does so by providing an interface to the operations on tables/views in the database. These operations are almost the same as those of data.frame. Minimal amount of data is transfered between R and the database system. Thus the users of R do not need to learn SQL when they operate on the objects in the database. PivotalR also lets the user to run the functions of the open-source big-data machine learning package Apache MADlib (incubating) directly from R.

  1. An Introduction to PivotalR

     vignette("pivotalr") # execute in R console to view the PDF file
    
  2. To install PivotalR:

    • Get the latest stable version from CRAN by running install.packages("PivotalR")

    • Or try out the latest development version from github by running the following code (Need R >= 3.0.2):

      devtools::install_github("PivotalR", "pivotalsoftware")
      
    • Or download the source tarball directly from here, and then install the tarball

      install.packages("pivotalsoftware-PivotalR-xxxx.tar.gz", repos = NULL, type = "source")
      

    where "pivotalsoftware-PivotalR-xxxx.tar.gz" is the name of the package that you have downloaded.

  3. To get started:

News

PivotalR 0.1.16.1

  • Many bug fixes
  • Support for date, time, time stamp and interval in database

PivotalR 0.1.15.1

  • Various bug fixes
  • Better error handling
  • Full support for HAWQ 1.2
  • A complete testing framework

Reference manual

It appears you don't have a PDF plugin for this browser. You can click here to download the reference manual.

install.packages("PivotalR")

0.1.18.3.1 by Rahul Iyer, 7 months ago


Browse source code at https://github.com/cran/PivotalR


Authors: Predictive Analytics Team at Pivotal Inc. <[email protected]> , with contributions from Data Science Team at Pivotal Inc.


Documentation:   PDF Manual  


GPL (>= 2) license


Depends on methods, Matrix

Suggests DBI, RPostgreSQL, shiny, testthat, tools, rpart, randomForest, topicmodels


Suggested by vinereg.


See at CRAN