METACRAN search results

Visualization and Imputation of Missing Values

Provides methods for imputation and visualization of missing values. It includes graphical tools to explore the amount, structure and patterns of missing and/or imputed values, supporting exploratory data analysis and helping to investigate potential missingness mechanisms (details in Alfons, Templ and Filzmoser, . The quality of imputations can be assessed visually using a wide range of univariate, bivariate and multivariate plots. The package further provides several imputation methods, including efficient implementations of k-nearest neighbour and hot-deck imputation (Kowarik and Templ 2013, , iterative robust model-based multiple imputation (Templ 2011, ; Templ 2023, ), and machine learning–based approaches such as robust GAM-based multiple imputation (Templ 2024, ) as well as gradient boosting (XGBoost) and transformer-based methods (Niederhametner et al., ). General background and practical guidance on imputation are provided in the Springer book by Templ (2023) .

https://github.com/statistikat/VIM

RUnit — by Roman Zenka, a year ago

R Unit Test Framework

R functions implementing a standard Unit Testing framework, with additional code inspection and report generation tools.

laeken — by Andreas Alfons, 2 years ago

Estimation of Indicators on Social Exclusion and Poverty

Estimation of indicators on social exclusion and poverty, as well as Pareto tail modeling for empirical income distributions.

pROC — by Xavier Robin, 10 months ago

Display and Analyze ROC Curves

Tools for visualizing, smoothing and comparing receiver operating characteristic (ROC curves). (Partial) area under the curve (AUC) can be compared with statistical tests based on U-statistics or bootstrap. Confidence intervals can be computed for (p)AUC or ROC curves.

https://xrobin.github.io/pROC/

pep725 — by Matthias Templ, 2 months ago

Pan-European Phenological Data Analysis

Provides a framework for quality-aware analysis of ground-based phenological data from the PEP725 Pan-European Phenology Database (Templ et al. (2018) ; Templ et al. (2026) ) and similar observation networks. Implements station-level data quality grading, outlier detection, phenological normals (climate baselines), anomaly detection, elevation and latitude gradient estimation with robust regression, spatial synchrony quantification, partial least squares (PLS) regression for identifying temperature-sensitive periods, and sequential Mann-Kendall trend analysis. Supports data import from PEP725 files, conversion of user-supplied data, and downloadable synthetic datasets for teaching without barriers of registration. All analysis outputs provide 'print', 'summary', and 'plot' methods. Interactive spatial visualization is available via 'leaflet'.

https://github.com/matthias-da/pep725

cluster — by Martin Maechler, 4 months ago

"Finding Groups in Data": Cluster Analysis Extended Rousseeuw et al.

Methods for Cluster analysis. Much extended the original from Peter Rousseeuw, Anja Struyf and Mia Hubert, based on Kaufman and Rousseeuw (1990) "Finding Groups in Data".

https://svn.r-project.org/R-packages/trunk/cluster/

distrEx — by Matthias Kohl, a year ago

Extensions of Package 'distr'

Extends package 'distr' by functionals, distances, and conditional distributions.

http://distr.r-forge.r-project.org/

heaping — by Matthias Templ, 4 months ago

Correction of Heaping on Individual Level

Provides methods for correcting heaping (digit preference) in survey data at the individual record level. Age heaping, where respondents disproportionately report ages ending in 0 or 5, is a common phenomenon that can distort demographic analyses. Unlike traditional smoothing methods that only correct aggregated statistics, this package corrects individual values by replacing a calculated proportion of heaped observations with draws from fitted truncated distributions (log-normal, normal, or uniform). Supports 5-year and 10-year heaping patterns, single heap correction, and optional model-based adjustment to preserve covariate relationships.

https://github.com/matthias-da/heaping

robCompositions — by Matthias Templ, 10 months ago

Compositional Data Analysis

Methods for analysis of compositional data including robust methods (), imputation of missing values (), methods to replace rounded zeros (, , ), count zeros (), methods to deal with essential zeros (), (robust) outlier detection for compositional data, (robust) principal component analysis for compositional data, (robust) factor analysis for compositional data, (robust) discriminant analysis for compositional data (Fisher rule), robust regression with compositional predictors, functional data analysis () and p-splines (), contingency () and compositional tables (, , ) and (robust) Anderson-Darling normality tests for compositional data as well as popular log-ratio transformations (addLR, cenLR, isomLR, and their inverse transformations). In addition, visualisation and diagnostic tools are implemented as well as high and low-level plot functions for the ternary diagram.

deepImp — by Matthias Templ, 3 days ago

Imputation with Deep Learning Methods

Imputation of mixed-type and compositional data with neural networks. The architecture (number and size of hidden layers, dropout, activation, optimiser) is user-configurable. See Templ (2021) .

Search results

R links

R homepage

Download R

Mailing lists

R documentation

R manuals

R FAQs

The R Journal

CRAN links

CRAN homepage

CRAN repository policy

Submit a package

METACRAN stuff

About METACRAN

At github

Report a bug