Examples: visualization, C++, networks, data cleaning, html widgets, ropensci.

Found 101 packages in 0.03 seconds

edmdata — by James Joseph Balamuta, 2 months ago

Data Sets for Psychometric Modeling

Collection of data sets from various assessments that can be used to evaluate psychometric models. These data sets have been analyzed in the following papers that introduced new methodology as part of the application section: Jimenez, A., Balamuta, J. J., & Culpepper, S. A. (2023) , Culpepper, S. A., & Balamuta, J. J. (2021) , Yinghan Chen et al. (2021) , Yinyin Chen et al. (2020) , Culpepper, S. A. (2019a) , Culpepper, S. A. (2019b) , Culpepper, S. A., & Chen, Y. (2019) , Culpepper, S. A., & Balamuta, J. J. (2017) , and Culpepper, S. A. (2015) .

BayesGOF — by Doug Fletcher, 6 years ago

Bayesian Modeling via Frequentist Goodness-of-Fit

A Bayesian data modeling scheme that performs four interconnected tasks: (i) characterizes the uncertainty of the elicited parametric prior; (ii) provides exploratory diagnostic for checking prior-data conflict; (iii) computes the final statistical prior density estimate; and (iv) executes macro- and micro-inference. Primary reference is Mukhopadhyay, S. and Fletcher, D. 2018 paper "Generalized Empirical Bayes via Frequentist Goodness of Fit" (< https://www.nature.com/articles/s41598-018-28130-5 >).

CHsharp — by John Braun, 9 years ago

Choi and Hall Style Data Sharpening

Functions for use in perturbing data prior to use of nonparametric smoothers and clustering.

galamm — by Øystein Sørensen, 3 months ago

Generalized Additive Latent and Mixed Models

Estimates generalized additive latent and mixed models using maximum marginal likelihood, as defined in Sorensen et al. (2023) , which is an extension of Rabe-Hesketh and Skrondal (2004)'s unifying framework for multilevel latent variable modeling . Efficient computation is done using sparse matrix methods, Laplace approximation, and automatic differentiation. The framework includes generalized multilevel models with heteroscedastic residuals, mixed response types, factor loadings, smoothing splines, crossed random effects, and combinations thereof. Syntax for model formulation is close to 'lme4' (Bates et al. (2015) ) and 'PLmixed' (Rockwood and Jeon (2019) ).

SASmixed — by Steven Walker, 11 years ago

Data sets from "SAS System for Mixed Models"

Data sets and sample lmer analyses corresponding to the examples in Littell, Milliken, Stroup and Wolfinger (1996), "SAS System for Mixed Models", SAS Institute.

tram — by Torsten Hothorn, 2 days ago

Transformation Models

Formula-based user-interfaces to specific transformation models implemented in package 'mlt' (, ). Available models include Cox models, some parametric survival models (Weibull, etc.), models for ordered categorical variables, normal and non-normal (Box-Cox type) linear models, and continuous outcome logistic regression (Lohse et al., 2017, ). The underlying theory is described in Hothorn et al. (2018) . An extension to transformation models for clustered data is provided (Barbanti and Hothorn, 2022, ). Multivariate conditional transformation models (Klein et al, 2022, ) and shift-scale transformation models (Siegfried et al, 2023, ) can be fitted as well.

gatoRs — by Natalie N. Patten, 6 months ago

Geographic and Taxonomic Occurrence R-Based Scrubbing

Streamlines downloading and cleaning biodiversity data from Integrated Digitized Biocollections (iDigBio) and the Global Biodiversity Information Facility (GBIF).

CUSUMdesign — by Boxiang Wang, 5 years ago

Compute Decision Interval and Average Run Length for CUSUM Charts

Computation of decision intervals (H) and average run lengths (ARL) for CUSUM charts. Details of the method are seen in Hawkins and Olwell (2012): Cumulative sum charts and charting for quality improvement, Springer Science & Business Media.

segmenTier — by Rainer Machne, 6 years ago

Similarity-Based Segmentation of Multidimensional Signals

A dynamic programming solution to segmentation based on maximization of arbitrary similarity measures within segments. The general idea, theory and this implementation are described in Machne, Murray & Stadler (2017) . In addition to the core algorithm, the package provides time-series processing and clustering functions as described in the publication. These are generally applicable where a `k-means` clustering yields meaningful results, and have been specifically developed for clustering of the Discrete Fourier Transform of periodic gene expression data (`circadian' or `yeast metabolic oscillations'). This clustering approach is outlined in the supplemental material of Machne & Murray (2012) ), and here is used as a basis of segment similarity measures. Notably, the time-series processing and clustering functions can also be used as stand-alone tools, independent of segmentation, e.g., for transcriptome data already mapped to genes.

ACTCD — by Wenchao Ma, a year ago

Asymptotic Classification Theory for Cognitive Diagnosis

Cluster analysis for cognitive diagnosis based on the Asymptotic Classification Theory (Chiu, Douglas & Li, 2009; ). Given the sample statistic of sum-scores, cluster analysis techniques can be used to classify examinees into latent classes based on their attribute patterns. In addition to the algorithms used to classify data, three labeling approaches are proposed to label clusters so that examinees' attribute profiles can be obtained.