Task view: Missing Data

Last updated on 2021-12-16 by Julie Josse, Nicholas Tierney, Nathalie Vialaneix (r-miss-tastic team)

Missing data are very frequently found in datasets. Base R provides a few options to handle them using computations that involve only observed data (na.rm = TRUE in functions mean, var, ... or use = complete.obs|na.or.complete|pairwise.complete.obs in functions cov, cor, ...). The base package stats also contains the generic function na.action that extracts information of the NA action used to create an object.

These basic options are complemented by many packages on CRAN, which we structure into main topics:

In addition to the present task view, this reference website on missing data might also be helpful.

If you think that we missed some important packages in this list, please contact the maintainer.

Exploration of missing data

  • Manipulation of missing data is implemented in the packages sjmisc and sjlabelled. memisc also provides defineable missing values, along with infrastruture for the management of survey data and variable labels.
  • Missing data patterns can be identified and explored using the packages mi, wrangle, DescTools, dlookr and naniar.
  • Graphics that describe distributions and patterns of missing data are implemented in VIM (which has a Graphical User Interface, VIMGUI, currently archived on CRAN) and naniar (which abides by tidyverse principles).
  • Tests of the MAR assumption (versus the MCAR assumption): RBtest proposes a regression based approach to test the missing data mechanism and samon performs sensitivity analysis in clinical trials to check the relevance of the MAR assumption.
  • Evaluation of the quality of imputation can be performed using the function ampute of mice through with simulations or with the Iscores with a KL-based scoring rule.

Likelihood based approaches

  • Methods based on the Expectation Maximization (EM) algorithm are implemented in norm (using the function em.norm for multivariate Gaussian data), norm2 (using the function emNorm), in cat (function em.cat for multivariate categorical data), in mix (function em.mix for multivariate mixed categorical and continuous data). These packages also implement Bayesian approaches (with Imputation and Posterior steps) for the same models (functions da.XXX for norm, cat and mix) and can be used to obtain imputed complete datasets or multiple imputations (functions imp.XXX for norm, cat and mix), once the model parameters have been estimated. imputeR is a Multivariate Expectation-Maximization (EM) based imputation framework that offers several different algorithms, including Lasso, tree-based models or PCA. In addition, TestDataImputation implements imputation based on EM estimation (and other simpler imputation methods) that are well suited for dichotomous and polytomous tests with item responses.
  • Full Information Maximum Likelihood (also known as "direct maximum likelihood" or "raw maximum likelihood") is available in lavaan (and in its extension semTools), OpenMx and rsem, for handling missing data in structural equation modeling.
  • Bayesian approaches for handling missing values in model based clustering with variable selection is available in VarSelLCM. The package also provides imputation using the posterior mean.
  • Missing values in mixed-effect models and generalized linear models are supported in the packages mdmb, icdGLM and JointAI, the last one being based on a Bayesian approach. brlrmr also handles MNAR values in response variable for logistic regression using an EM approach. ui implements uncertainty intervals for linear and probit regressions when the outcome is missing not at random.
  • Missing data in item response models is implemented in TAM, mirt and ltm.
  • Robust covariance estimation is implemented in the package GSE. Robust location and scatter estimation and robust multivariate analysis with missing data are implemented in rrcovNA.

Single imputation

  • The simplest method for missing data imputation is imputation by mean (or median, mode, ...). This approach is available in many packages among which ForImp and Hmisc that contain various proposals for imputing with the same value all missing instances of a variable.
  • k-nearest neighbors is a popular method for missing data imputation that is available in many packages including impute, VIM, GenForImp and yaImpute (with many different methods for kNN imputation, including a CCA based imputation). wNNSel implements a kNN based method for imputation in large dimensional datasets. isotree uses a similar approach that is based on similarities between samples to impute missing data with isolation forests.
  • hot-deck imputation is implemented in hot.deck, FHDI and VIM (function hotdeck). StatMatch uses hot-deck imputation to impute surveys from an external dataset. impimp also uses the notion of "donor" to impute a set of possible values, termed "imprecise imputation".
  • Other regression based imputations are implemented in VIM (linear regression based imputation in the function regressionImp). In addition, simputation is a general package for imputation by any prediction method that can be combined with various regression methods, and works well with the tidyverse. WaverR imputes data using a weighted average of several regressions. iai tunes optimal imputation based on knn, tree or SVM.
  • Based on random forest in missForest with a faster version in missRanger.
  • Based on copula in CoImp, in mdgc (mixed type data), and in sbgcop (semi-parametric Bayesian copula imputation). The latter supports multiple imputation.
  • PCA/Singular Value Decomposition/matrix completion is implemented in the package missMDA for numerical, categorical and mixed data. Heterogeneous missingness in a high-dimensional PCA is also addressed in primePCA. softImpute contains several methods for iterative matrix completion, as well as filling, rsparse and denoiseR for numerical variables, mimi that uses low rank assumptions to impute mixed datasets, and ECLRMC performs ensemble correlation based low rank matrix completion that accounts for correlation among samples. The package pcaMethods offers some Bayesian implementation of PCA with missing data. NIPALS (based on SVD computation) is implemented in the packages mixOmics (for PCA and PLS), ade4, nipals and plsRglm (for generalized model PLS). As a generalization, tensorBF implements imputation in 3-way tensor data. ROptSpace proposes a matrix completion method under low-rank assumption and collective matrix factorization for imputation using Bayesian matrix completion for groups of variables (binary, quantitative, Poisson). Similarly, cmfrec proposes different version of matrix imputation based on matrix factorization. Finally, imputation for groups is also available in missMDA in the function imputeMFA.
  • Imputation for non-parametric regression by wavelet shrinkage is implemented in CVThresh using solely maximization of the h-likelihood.
  • mi and VIM also provide diagnostic plots to evaluate the quality of imputation.

Multiple imputation

Some of the above mentioned packages can also handle multiple imputations.

  • Amelia implements Bootstrap multiple imputation using EM to estimate the parameters, for quantitative data it imputes assuming a Multivariate Gaussian distribution. In addition, AmeliaView is a GUI for Amelia, available from the Amelia web page. NPBayesImputeCat also implements multiple imputation by joint modelling for categorical variables with a Bayesian approach.
  • mi, mice and smcfcs implement multiple imputation by Chained Equations. smcfcs extends the models covered by the two previous packages. miceFast provides an alternative implementation of mice imputation methods using object oriented style programming and C++. bootImpute performs bootstrap based imputations and analyses of these imputations to use with mice or smcfcs. miceRanger performs multiple imputation by chained equations using random forests.
  • missMDA implements multiple imputation based on SVD methods.
  • hot.deck implements hot-deck based multiple imputation.
  • Multilevel imputation: Multilevel multiple imputation is implemented in hmi, jomo, mice, miceadds, micemd, mitml, and pan.
  • Qtools and miWQS implement multiple imputation based on quantile regression.
  • lodi implements the imputation of observed values below the limit of detection (LOD) via censored likelihood multiple imputation (CLMI).
  • BaBooN implements a Bayesian bootstrap approach for discrete data imputation that is based on Predictive Mean Matching (PMM).
  • accelmissing provides multiple imputation with the zero-inflated Poisson lognormal model for missing count values in accelerometer data.

In addition, mitools provides a generic approach to handle multiple imputation in combination with any imputation method. And NADIA provides a uniform interface to compare the performances of several imputation algorithms.

Weighting methods

  • Computation of weights for observed data to account for unobserved data by Inverse Probability Weighting (IPW) is implemented in ipw. IPW is also used for quantile estimations and boxplots in IPWboxplot.
  • Doubly Robust Inverse Probability Weighted Augmented GEE Estimator with missing outcome is implemented in CRTgeeDR.

Specific types of data

  • Longitudinal data / time series and censored data: Imputation for time series is implemented in imputeTS and imputePSF. Other packages, such as forecast, spacetime, timeSeries, xts, prophet, stlplus or zoo, are dedicated to time series but also contain some (often basic) methods to handle missing data (see also TimeSeries). To help fill down missing values for time series, the padr and tsibble packages provide methods for imputing implicit missing values. Imputation of time series based on Dynamic Time Warping is implemented in DTWBI for univariate time series and in DTWUMI or in FSMUMI for multivariate ones. naniar also imputes data below the range for exploratory graphical analysis with the function impute_below. TAR implements an estimation of the autoregressive threshold models with Gaussian noise and of positive-valued time series with a Bayesian approach in the presence of missing data and BMTAR has a similar approach for multivariate time series. swgee implements a probability weighted generalized estimating equations method for longitudinal data with missing observations and measurement error in covariates based on SIMEX. icenReg performs imputation for censored responses for interval data. imputeTestbench proposes tools to benchmark missing data imputation in univariate time series. On a related topic, imputeFin handles imputation of missing values in financial time series using AR models or random walk.
  • Spatial data: Imputation for spatial data is implemented in phylin using interpolation with spatial distance weights or kriging. gapfill is dedicated to satellite data. Geostatistical interpolation of data with irregular spatial support is implemented in rtop and in areal that estimates values for overlapping but incongruent polygon features. Estimation and prediction for spatio-temporal data with missing values is implemented in StempCens with a SAEM approach that approximates EM when the E-step does not have an analytic form.
  • Spatio-temporal data: Imputation for spatio-temporal data is implemented in the package cutoffR using different methods as knn and SVD and in CircSpaceTime for circular data using kriging. Similarly, reddPrec imputes missing values in daily precipitation time series accross different locations.
  • Graphs/networks: Imputation for graphs/networks is implemented in the package dils to impute missing edges. PST provides a framework for analyzing Probabilistic Suffix Trees, including functions for learning and optimizing VLMC (variable length Markov chains) models from sets of individual sequences possibly containing missing values. missSBM imputes missing edges in Stochastic Block models and cassandRa predicts possible missing links with different stochastic network models. cglasso implements an extension of the Graphical Lasso with censored and missing values and mgm implements estimation of time-varying k-order mixed graphical models.
  • Imputation for contingency tables is implemented in lori that can also be used for the analysis of contingency tables with missing data.
  • Imputation for compositional data (CODA) is implemented in in zCompositions (various imputation methods for zeros, left-censored and missing data).
  • Imputation for diffusion processes is implemented in DiffusionRimp by imputing missing sample paths with Brownian bridges.
  • Imputation for meta-analyses of binary outcomes is provided in metasens.
  • experiment handles missing values in experimental design such as randomized experiments with missing covariate and outcome data, matched-pairs design with missing outcome.
  • cdparcoord provides tools to handle missing values in parallel coordinates settings.
  • dejaVu performs multiple imputation of recurrent event data based on a negative binomial regression model.

Specific application fields

  • Genetics: Analyses of Case-Parent Triad and/or Case-Control Data with SNP haplotypes is implemented in Haplin, where missing genotypic data are handled with an EM algorithm. FamEvent and snpStats implement imputation of missing genotypes, respectively with an EM algorithm and a nearest neighbor approach. Imputation for genotype and haplotype is implemented in alleHap using solely deterministic techniques on pedigree databases; imputation of missing genotypes is also implemented in QTLRel that contains tools for QTL analyses. Tools for Hardy-Weinberg equilibrium for bi- and multi-allelic genetic marker data are implemented in HardyWeinberg, where genotypes are imputed with a multinomial logit model. StAMPP computes genomic relationship when SNP genotype datasets contain missing data and PSIMEX computes inbreeding depression or heritability on pedigree structures affected by missing paternities with a variant of the SIMEX algorithm.
  • Genomics: Imputation for dropout events (i.e., under-sampling of mRNA molecules) in single-cell RNA-Sequencing data is implemented in DrImpute and Rmagic. RNAseqNet uses hot-deck imputation to improve RNA-seq network inference with an auxiliary dataset.
  • Epidemiology: idem implements a procedure for comparing treatments in clinical trials with missed visits or premature withdrawal. InformativeCensoring implements multiple imputation for informative censoring. pseval evaluates principal surrogates in a single clinical trial in the presence of missing counterfactual surrogate responses. sievePH implements continuous, possibly multivariate, mark-specific hazard ratio with missing values in multivariate marks using an IPW approach.
  • Causal inference: Causal inference with interactive fixed-effect models is available in gsynth with missing values handled by matrix completion. MatchThem matches multiply imputed datasets using several matching methods, and provides users with the tools to estimate causal effects in each imputed datasets. grf offers treatment effect estimation with incomplete confounders and covariates under modified unconfoundedness assumptions.
  • Scoring: Basic methods (mean, median, mode, ...) for imputing missing data in scoring datasets are proposed in scorecardModelUtils.
  • Preference models: Missing data in preference models are handled with a Composite Link approach that allows for MCAR and MNAR patterns to be taken into account in prefmod.
  • Health economy: missingHE implements models for health economic evaluations with missing outcome data.
  • Administrative records / Surveys: fastLink provides a Fellegi-Sunter probabilistic record linkage that allows for missing data and the inclusion of auxiliary information.
  • Regression and classification: eigenmodel handles missing values in regression models for symmetric relational data. randomForest, grf and StratifiedRF handle missing values in predictors in various random forest based methods. misaem handles missing data in linear and logistic regression and allows for model selection. psfmi provides a framework for model selection for various linear models in multiply imputed datasets. naivebayes provides an efficient implementation of the naive Bayes classifier in the presence of missing data. plsRbeta implements PLS for beta regression models with missing data in the predictors. lqr provides quantile regression estimates based on various distributions in the presence of missing values and censored data.
  • Clustering biclustermd handles missing data in biclustering. RMixtComp, MGMM and mixture fit various mixture models in the presence of missing data.
  • Tests for two-sample paired missing data are implemented in robustrank.
  • robustrao computes the Rao-Stirling diversity index (a well-established bibliometric indicator to measure the interdisciplinarity of scientific publications) with data containing uncategorized references.


accelmissing — 1.4

Missing Value Imputation for Accelerometer Data

ade4 — 1.7-18

Analysis of Ecological Data: Exploratory and Euclidean Methods in Environmental Sciences

alleHap — 0.9.9

Allele Imputation and Haplotype Reconstruction from Pedigree Databases

Amelia — 1.8.0

A Program for Missing Data

areal — 0.1.7

Areal Weighted Interpolation

BaBooN — 0.2-0

Bayesian Bootstrap Predictive Mean Matching - Multiple and Single Imputation for Discrete Data

BMTAR — 0.1.1

Bayesian Approach for MTAR Models with Missing Data

brlrmr — 0.1.7

Bias Reduction with Missing Binary Response

biclustermd — 0.2.3

Biclustering with Missing Data

bootImpute — 1.2.0

Bootstrap Inference for Multiple Imputation

cassandRa — 0.1.0

Finds Missing Links and Metric Confidence Intervals in Ecological Bipartite Networks

cat — 0.0-6.5

Analysis of categorical-variable datasets with missing values

cdparcoord — 1.0.1

Top Frequency-Based Parallel Coordinates

cglasso — 2.0.4

Conditional Graphical LASSO for Gaussian Graphical Models with Censored and Missing Values

CircSpaceTime — 0.9.0

Spatial and Spatio-Temporal Bayesian Model for Circular Data

cmfrec — 3.3.1

Collective Matrix Factorization for Recommender Systems

CoImp — 1.0

Copula Based Imputation Method

CRTgeeDR — 2.0

Doubly Robust Inverse Probability Weighted Augmented GEE Estimator

CVThresh — 1.1.1

Level-Dependent Cross-Validation Thresholding

cutoffR — 1.0

CUTOFF: A Spatio-temporal Imputation Method

dejaVu — 0.3.0

Multiple Imputation for Recurrent Events

denoiseR — 1.0.2

Regularized Low Rank Matrix Estimation

DescTools — 0.99.44

Tools for Descriptive Statistics

DiffusionRimp — 0.1.2

Inference and Analysis for Diffusion Processes via Data Imputation and Method of Lines

dils — 0.8.1

Data-Informed Link Strength. Combine multiple-relationship networks into a single weighted network. Impute (fill-in) missing network links.

dlookr — 0.5.4

Tools for Data Diagnosis, Exploration, Transformation

DrImpute — 1.0

Imputing Dropout Events in Single-Cell RNA-Sequencing Data

DTWBI — 1.1

Imputation of Time Series Based on Dynamic Time Warping

DTWUMI — 1.0

Imputation of Multivariate Time Series Based on Dynamic Time Warping

ECLRMC — 1.0

Ensemble Correlation-Based Low-Rank Matrix Completion

eigenmodel — 1.11

Semiparametric Factor and Regression Models for Symmetric Relational Data

experiment — 1.2.0

R Package for Designing and Analyzing Randomized Experiments

FamEvent — 2.1

Family Age-at-Onset Data Simulation and Penetrance Estimation

fastLink — 0.6.0

Fast Probabilistic Record Linkage with Missing Data

FHDI — 1.4.1

Fractional Hot Deck and Fully Efficient Fractional Imputation

filling — 0.2.3

Matrix Completion, Imputation, and Inpainting Methods

forecast — 8.16

Forecasting Functions for Time Series and Linear Models

ForImp — 1.0.3

Imputation of Missing Values Through a Forward Imputation Algorithm

FSMUMI — 1.0

Imputation of Time Series Based on Fuzzy Logic

gapfill — 0.9.6-1

Fill Missing Values in Satellite Data

GenForImp — 1.0

The Forward Imputation: A Sequential Distance-Based Approach for Imputing Missing Data

grf — 2.0.2

Generalized Random Forests

GSE — 4.2

Robust Estimation in the Presence of Cellwise and Casewise Contamination and Missing Data

gsynth — 1.2.1

Generalized Synthetic Control Method

Haplin — 7.2.3

Analyzing Case-Parent Triad and/or Case-Control Data with SNP Haplotypes

HardyWeinberg — 1.7.4

Statistical Tests and Graphics for Hardy-Weinberg Equilibrium

hmi — 1.0.0

Hierarchical Multiple Imputation

Hmisc — 4.6-0

Harrell Miscellaneous

hot.deck — 1.2

Multiple Hot Deck Imputation

iai — 1.7.0

Interface to 'Interpretable AI' Modules

icdGLM — 1.0.0

EM by the Method of Weights for Incomplete Categorical Data in Generlized Linear Models

icenReg — 2.0.15

Regression Models for Interval Censored Data

idem — 5.1

Inference in Randomized Controlled Trials with Death and Missingness

impimp — 0.3.1

Imprecise Imputation for Statistical Matching

imputeFin — 0.1.2

Imputation of Financial Time Series with Missing Values and/or Outliers

imputePSF — 0.1.0

Impute Missing Data in Time Series Data with PSF Based Method

imputeR — 2.2

A General Multivariate Imputation Framework

imputeTestbench — 3.0.3

Test Bench for the Comparison of Imputation Methods

Iscores — 1.1.0

Proper Scoring Rules for Missing Value Imputation

imputeTS — 3.2

Time Series Missing Value Imputation

InformativeCensoring — 0.3.5

Multiple Imputation for Informative Censoring

ipw — 1.0-11

Estimate Inverse Probability Weights

IPWboxplot — 0.1.0

Adapted Boxplot to Missing Observations

isotree — 0.5.5

Isolation-Based Outlier Detection

JointAI — 1.0.3

Joint Analysis and Imputation of Incomplete Data

jomo — 2.7-2

Multilevel Joint Modelling Multiple Imputation

lavaan — 0.6-10

Latent Variable Analysis

lodi — 0.9.2

Limit of Detection Imputation for Single-Pollutant Models

lori — 2.2.2

Imputation of High-Dimensional Count Data using Side Information

lqr — 4.0

Robust Linear Quantile Regression

ltm — 1.1-1

Latent Trait Models under IRT

MatchThem — 1.0.1

Matching and Weighting Multiply Imputed Datasets

mdgc — 0.1.5

Missing Data Imputation Using Gaussian Copulas

mdmb — 1.5-8

Model Based Treatment of Missing Data

memisc —

Management of Survey Data and Presentation of Analysis Results

metasens — 1.0-1

Statistical Methods for Sensitivity Analysis in Meta-Analysis

mgm — 1.2-12

Estimating Time-Varying k-Order Mixed Graphical Models

MGMM — 1.0.0

Missingness Aware Gaussian Mixture Models

mi — 1.0

Missing Data Imputation and Model Checking

mice — 3.14.0

Multivariate Imputation by Chained Equations

miceadds — 3.11-6

Some Additional Multiple Imputation Functions, Especially for 'mice'

miceFast — 0.7.1

Fast Imputations Using 'Rcpp' and 'Armadillo'

micemd — 1.8.0

Multiple Imputation by Chained Equations with Multilevel Data

miceRanger — 1.5.0

Multiple Imputation by Chained Equations with Random Forests

mimi — 0.2.0

Main Effects and Interactions in Mixed and Incomplete Data

misaem — 1.0.1

Linear Regression and Logistic Regression with Missing Covariates

missForest — 1.4

Nonparametric Missing Value Imputation using Random Forest

missingHE — 1.4.1

Missing Outcome Data in Health Economic Evaluation

missMDA — 1.18

Handling Missing Values with Multivariate Data Analysis

missRanger — 2.1.3

Fast Imputation of Missing Values

mirt — 1.35.1

Multidimensional Item Response Theory

missSBM — 1.0.1

Handling Missing Data in Stochastic Block Models

mitml — 0.4-3

Tools for Multiple Imputation in Multilevel Modeling

mitools — 2.4

Tools for Multiple Imputation of Missing Data

miWQS — 0.4.4

Multiple Imputation Using Weighted Quantile Sum Regression

mix — 1.0-10

Estimation/Multiple Imputation for Mixed Categorical and Continuous Data

mixture — 2.0.4

Mixture Models for Clustering and Classification

NADIA — 0.4.1

NA Data Imputation Algorithms

naivebayes — 0.9.7

High Performance Implementation of the Naive Bayes Algorithm

naniar — 0.6.1

Data Structures, Summaries, and Visualisations for Missing Data

nipals — 0.8

Principal Components Analysis using NIPALS or Weighted EMPCA, with Gram-Schmidt Orthogonalization

norm — 1.0-9.5

Analysis of multivariate normal datasets with missing values

norm2 — 2.0.4

Analysis of Incomplete Multivariate Data under a Normal Model

NPBayesImputeCat — 0.4

Non-Parametric Bayesian Multiple Imputation for Categorical Data

OpenMx — 2.20.0

Extended Structural Equation Modelling

padr — 0.6.0

Quickly Get Datetime Data Ready for Analysis

pan — 1.6

Multiple Imputation for Multivariate Panel or Clustered Data

phylin — 2.0.2

Spatial Interpolation of Genetic Data

plsRbeta — 0.2.6

Partial Least Squares Regression for Beta Regression Models

plsRglm — 1.3.0

Partial Least Squares Regression for Generalized Linear Models

prefmod — 0.8-34

Utilities to Fit Paired Comparison Models for Preferences

primePCA — 1.2

Projected Refinement for Imputation of Missing Entries in PCA

prophet — 1.0

Automatic Forecasting Procedure

pseval — 1.3.1

Methods for Evaluating Principal Surrogates of Treatment Response

psfmi — 1.0.0

Prediction Model Pooling, Selection and Performance Evaluation Across Multiply Imputed Datasets

PSIMEX — 1.1

SIMEX Algorithm on Pedigree Structures

PST — 0.94

Probabilistic Suffix Trees and Variable Length Markov Chains

QTLRel — 1.7

Tools for Mapping of Quantitative Traits of Genetically Related Individuals and Calculating Identity Coefficients from Pedigrees

Qtools — 1.5.5

Utilities for Quantiles

randomForest — 4.6-14

Breiman and Cutler's Random Forests for Classification and Regression

RBtest — 1.1

Regression-Based Approach for Testing the Type of Missing Data

reddPrec — 0.4.0

Reconstruction of Daily Data - Precipitation

Rmagic — 2.0.3

MAGIC - Markov Affinity-Based Graph Imputation of Cells

RMixtComp — 4.1.3

Mixture Models with Heterogeneous and (Partially) Missing Data

ROptSpace — 0.2.3

Matrix Reconstruction from a Few Entries

RNAseqNet — 0.1.4

Log-Linear Poisson Graphical Model with Hot-Deck Multiple Imputation

robustrank — 2019.9-10

Robust Rank-Based Tests

robustrao — 1.0-5

An Extended Rao-Stirling Diversity Index to Handle Missing Data

rrcovNA — 0.4-15

Scalable Robust Estimators with High Breakdown Point for Incomplete Data

rsem — 0.5.0

Robust Structural Equation Modeling with Missing Data and Auxiliary Variables

rsparse — 0.5.0

Statistical Learning on Sparse Matrices

rtop — 0.5-14

Interpolation of Data with Variable Spatial Support

samon — 4.0.1

Sensitivity Analysis for Missing Data

sbgcop — 0.980

Semiparametric Bayesian Gaussian Copula Estimation and Imputation

scorecardModelUtils —

Credit Scorecard Modelling Utils

semTools — 0.5-5

Useful Tools for Structural Equation Modeling

sievePH — 1.0.1

Sieve Analysis Methods for Proportional Hazards Models

simputation — 0.2.7

Simple Imputation

sjlabelled — 1.1.8

Labelled Data Utility Functions

sjmisc — 2.8.9

Data and Variable Transformation Functions

smcfcs — 1.6.1

Multiple Imputation of Covariates by Substantive Model Compatible Fully Conditional Specification

softImpute — 1.4-1

Matrix Completion via Iterative Soft-Thresholded SVD

spacetime — 1.2-6

Classes and Methods for Spatio-Temporal Data

StAMPP — 1.6.3

Statistical Analysis of Mixed Ploidy Populations

StatMatch — 1.4.0

Statistical Matching or Data Fusion

stlplus — 0.5.1

Enhanced Seasonal Decomposition of Time Series by Loess

StratifiedRF — 0.2.2

Builds Trees by Sampling Variables in Groups

StempCens — 1.1.0

Spatio-Temporal Estimation and Prediction for Censored/Missing Responses

swgee — 1.4

Simulation Extrapolation Inverse Probability Weighted Generalized Estimating Equations

TAM — 3.7-16

Test Analysis Modules

TAR — 1.0

Bayesian Modeling of Autoregressive Threshold Time Series Models

tensorBF — 1.0.2

Bayesian Tensor Factorization

TestDataImputation — 2.3

Missing Item Responses Imputation for Test and Assessment Data

timeSeries — 3062.100

Financial Time Series Objects (Rmetrics)

tsibble — 1.1.1

Tidy Temporal Data Frames and Tools

ui — 0.1.1

Uncertainty Intervals and Sensitivity Analysis for Missing Data

VarSelLCM —

Variable Selection for Model-Based Clustering of Mixed-Type Data Set with Missing Values

VIM — 6.1.1

Visualization and Imputation of Missing Values

WaverR — 1.0

Data Estimation using Weighted Averages of Multiple Regressions

wNNSel — 0.1

Weighted Nearest Neighbor Imputation of Missing Values using Selected Variables

wrangle — 0.5.7

A Systematic Data Wrangling Idiom

xts — 0.12.1

eXtensible Time Series

yaImpute — 1.0-32

Nearest Neighbor Observation Imputation and Evaluation Tools

zCompositions — 1.4.0

Treatment of Zeros, Left-Censored and Missing Values in Compositional Data Sets

zoo — 1.8-9

S3 Infrastructure for Regular and Irregular Time Series (Z's Ordered Observations)

Task view list