A General Multivariate Imputation Framework

Multivariate Expectation-Maximization (EM) based imputation framework that offers several different algorithms. These include regularisation methods like Lasso and Ridge regression, tree-based models and dimensionality reduction methods like PCA and PLS.

Project Status: Active The project has reached a stable, usable state and is being actively developed. Build Status AppVeyor Build Status CRAN Version License CRAN Downloads

imputeR is an R package that provides a general framework for missing values imputation based on automated variable selection.

The main function impute inputs a matrix containing missing values and returns a complete data matrix using the variable selection functions provided as part of the package, or written by the user.

The package also offers many useful tools for imputation research based on impute. For example, the Detect function can be used to detect the variables' type in a given data matrix. guess can be used for naive imputation such as mean imputation, median imputation, majority imputation (for categorical variables only) and random imputation. SimIm function stands for "simulation for imputation". It accepts a complete matrix and randomly introduce some percentage of missing values into the matrix so imputation methods can be employed subsequently to impute this artificial missing data matrix. Because the true values are actually know so imputation accuracy can be easily calculated. This calls for the SimEval function that extends SimIm function, simulates a number of missing data matrices, applies a imputation method to these missing matrices and evaluate its performance. This enables the uncertainty of the imputation method to be obtained.


You can cite imputeR the following: General Multivariate Imputation Framework_. R package version 2.1, <URL: https://CRAN.R-project.org/package=imputeR>.






Changes in Version 2.1

  • Changes to remain on CRAN and comply with CRAN policy

  • Changed NEWS file to markup document

  • Bugfix for glmboostR

  • Improved Examples for most of the functions

  • Moved several packages from depends to suggests

  • Removed some suggests e.g. cutoffR

  • Changed NEWS file to markup document

  • Replaced dontrun with donttest for long running examples

  • Description file: Improved package name and description

  • Minor Readme updates

Changes in Version 2.0

  • Changed version numbering from 2.0.0 to the simpler 2.0

  • Added a NEWS file

  • Maintainer change

  • Added citation file - citation("imputeR") returns now information on how to cite imputeR

  • Activated LazyData in Description file

  • Added automated testing via testthat

  • License change form GPL-2 to GPL-3

  • Internal code base changes

  • Readme updates

  • Added first testthat testing case

Changes in Version 1.0.0

  • Initial CRAN version

Reference manual

It appears you don't have a PDF plugin for this browser. You can click here to download the reference manual.


2.2 by Steffen Moritz, 2 years ago


Report a bug at https://github.com/SteffenMoritz/imputeR/issues

Browse source code at https://github.com/cran/imputeR

Authors: Steffen Moritz [aut, cre] , Lingbing Feng [aut] , Gen Nowak [ctb] , Alan. H. Welsh [ctb] , Terry. J. O'Neill [ctb]

Documentation:   PDF Manual  

Task views: Missing Data

GPL-3 license

Imports stats, utils, graphics, reshape2

Suggests testthat, caret, glmnet, pls, Cubist, ridge, gbm, mboost, rpart, earth

Suggested by miceadds.

See at CRAN