Open Machine Learning and Open Data Platform

We provide an R interface to '' which is an online machine learning platform where researchers can access open data, download and upload data sets, share their machine learning tasks and experiments and organize them online to work and collaborate with other researchers. The R interface allows to query for data sets with specific properties, and allows the downloading and uploading of data sets, tasks, flows and runs. See <> for more information.


OpenML v1.7 (Release date: 2017-XX-XX):

  • listOMLTask, listOMLFlow, listOMLRuns do not return the tag field anymore.
  • New function listOMLSetup added, which enables extracting hyperparameter configurations of specific setups/flows.
  • New function chunkOMLlist added in order to automatically do chunked listing requests

OpenML v1.6 (Release date: 2017-08-14):

  • Fixes some config issues

OpenML v1.5 (Release date: 2017-08-11):

  • New functions to list and get all information w.r.t. studies: getOMLStudy, listOMLStudies.
  • API key not needed anymore for getOML* and listOML* functions

OpenML v1.4 (Release date: 2017-06-20):

  • Bugfix: Error message "Start tag expected, '<' not found" for getOMLTask and getOMLDataSet was fixed.
  • methods for OMLRunParList, OMLDataSet and OMLTasks objects were added.
  • listOMLRunEvaluations now allows to get results w.r.t. a single evaluation measure ('evaluation.measure' arg) which speeds up the request.
  • listOMLRuns and listOMLRunEvaluations now also supports a vector of ids for, and
  • getOMLRun(, only.xml = TRUE) can now be used to get the run without the predictions arff file (which is faster, especially when you are only interested in, e.g. getting the hyperparameters of a run.)

OpenML v1.3 (Release date: 2017-04-01):

  • Bugfixes
  • Updated citation
  • Add html vignette and update its content.
  • listOMLTasks and listOMLDataSets now additionally show a message when the limit of results is reached.
  • listOML* functions return an empty data frame when no results are available.
  • listOMLRunEvaluations now returns additional columns for the flow (flow version, flow source and learner name).
  • runTask now allows to set the 'models' option to FALSE so that resulting objects will be smaller.

OpenML v1.2 (Release date: 2017-02-07):

  • Add support for multilabel datasets and tasks.
  • Replace download.file with httr::GET.
  • Add mlr 2.10 dependency (we internally use mlr::mergeBenchmarkResults and mlr::makePrediction now).

OpenML v1.1 (Release date: 2016-11-22):

  • Setting default cache directory on package loading (fixes winbuilder).
  • Replace internal regexps with stringi functions.

OpenML v1.0 (Release date: 2016-11-12):

  • First submission to CRAN.

Reference manual

It appears you don't have a PDF plugin for this browser. You can click here to download the reference manual.


1.8 by Giuseppe Casalicchio, 4 months ago

Report a bug at

Browse source code at

Authors: Giuseppe Casalicchio <[email protected]>, Bernd Bischl <[email protected]>, Dominik Kirchhoff <[email protected]>, Michel Lang <[email protected]>, Benjamin Hofner <[email protected]>, Jakob Bossek <[email protected]>, Pascal Kerschke <[email protected]>, Joaquin Vanschoren <[email protected]>

Documentation:   PDF Manual  

BSD_3_clause + file LICENSE license

Imports backports, BBmisc, checkmate, ParamHelpers, data.table, digest, httr, stringi, XML, jsonlite, memoise, stats, curl

Depends on mlr

Suggests testthat, xml2, randomForest, rpart, RWeka, farff, knitr, rmarkdown, R.rsp, lintr

Suggested by farff.

See at CRAN