Found 57 packages in 0.04 seconds

modelStudio — by Hubert Baniecki, 2 years ago

Interactive Studio for Explanatory Model Analysis

Automate the explanatory analysis of machine learning predictive models. Generate advanced interactive model explanations in the form of a serverless HTML site with only one line of code. This tool is model-agnostic and therefore compatible with most black-box predictive models and frameworks. The main function computes various (instance- and model-level) explanations and produces a customisable dashboard, which consists of multiple panels for plots with their short descriptions. The dashboard can easily be saved and shared with others. modelStudio facilitates the process of Interactive Explanatory Model Analysis introduced in Baniecki et al. (2023).
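
A minimal sketch of the advertised one-line workflow, assuming the usual 'DALEX' + 'modelStudio' pairing; the GLM and the built-in titanic_imputed data are illustrative choices, not part of the package description:

    library(DALEX)
    library(modelStudio)

    # Fit any predictive model, e.g. a logistic regression on titanic_imputed
    model <- glm(survived ~ ., data = titanic_imputed, family = "binomial")

    # Wrap it in a model-agnostic explainer
    explainer <- explain(model,
                         data = titanic_imputed[, -8],
                         y    = titanic_imputed$survived)

    # The advertised one-liner: compute explanations and open the HTML dashboard
    modelStudio(explainer)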

kernelshap — by Michael Mayer, 3 months ago

Kernel SHAP

Efficient implementation of Kernel SHAP, see Lundberg and Lee (2017), and Covert and Lee (2021) <http://proceedings.mlr.press/v130/covert21a>. Furthermore, for up to 14 features, exact permutation SHAP values can be calculated. The package plays well with meta-learning packages like 'tidymodels', 'caret', or 'mlr3'. Visualizations can be done using the R package 'shapviz'.
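
A hedged sketch of a typical call, assuming the kernelshap(object, X, bg_X) interface; the linear model on iris is only an illustration:

    library(kernelshap)

    fit <- lm(Sepal.Length ~ ., data = iris)

    # SHAP values for 20 rows, with the full data as background sample
    s <- kernelshap(fit, X = iris[1:20, -1], bg_X = iris[, -1])
    s

    # Optional visualization via 'shapviz' (assumed to accept kernelshap objects)
    library(shapviz)
    sv_importance(shapviz(s))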

PvSTATEM — by Tymoteusz Kwiecinski, 12 days ago

Reading, Quality Control and Preprocessing of MBA (Multiplex Bead Assay) Data

Speeds up the process of loading raw data from MBA (Multiplex Bead Assay) examinations, performs quality control checks, and automatically normalises the data, preparing it for more advanced downstream tasks. The main objective of the package is to create a simple environment for a user who does not necessarily have experience with the R language. The package is developed within the project of the same name, 'PvSTATEM', an international project aiming at malaria elimination.

survex — by Mikołaj Spytek, a year ago

Explainable Machine Learning in Survival Analysis

Survival analysis models are commonly used in medicine and other areas. Many of them are too complex to be interpreted by humans. Exploration and explanation are needed, but standard methods do not give a broad enough picture. 'survex' provides easy-to-apply methods for explaining survival models, both complex black-boxes and simpler statistical models. They include methods specific to survival analysis such as SurvSHAP(t) introduced in Krzyzinski et al. (2023) and SurvLIME described in Kovalev et al. (2020), as well as extensions of existing methods described in Biecek et al. (2021).
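
A rough sketch of the DALEX-style workflow, assuming survex's explain() and predict_parts() interface; the Cox model on the veteran data is illustrative:

    library(survex)
    library(survival)

    # A classical Cox model on the built-in veteran data
    cph <- coxph(Surv(time, status) ~ ., data = veteran,
                 model = TRUE, x = TRUE)

    # Wrap the model in an explainer, then compute SurvSHAP(t) for one patient
    exp_cph <- explain(cph)
    predict_parts(exp_cph, veteran[1, -c(3, 4)], type = "survshap")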

treeshap — by Mateusz Krzyzinski, 10 months ago

Compute SHAP Values for Your Tree-Based Models Using the 'TreeSHAP' Algorithm

An efficient implementation of the 'TreeSHAP' algorithm introduced by Lundberg et al. (2020). It is capable of calculating SHAP (SHapley Additive exPlanations) values for tree-based models in polynomial time. Currently supported models include 'gbm', 'randomForest', 'ranger', 'xgboost', and 'lightgbm'.
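
A hedged sketch of the unify-then-explain pattern, assuming treeshap's ranger.unify() and treeshap() functions; the regression forest is illustrative:

    library(treeshap)
    library(ranger)

    data <- iris[, 1:4]
    model <- ranger(Sepal.Length ~ ., data = data)

    # Translate the fitted forest into treeshap's unified representation
    unified <- ranger.unify(model, data[, -1])

    # SHAP values for the first three observations
    shaps <- treeshap(unified, data[1:3, -1])
    shaps$shaps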

gips — by Adam Przemysław Chojecki, a year ago

Gaussian Model Invariant by Permutation Symmetry

Find the permutation symmetry group such that the covariance matrix of the given data is approximately invariant under it. Discovering such a permutation decreases the number of observations needed to fit a Gaussian model, which is of great use when the number of observations is smaller than the number of variables. Even if that is not the case, the covariance matrix found with 'gips' approximates the actual covariance with less statistical error. The methods implemented in this package are described in Graczyk et al. (2022).
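
A speculative sketch, assuming gips' two main calls, gips() and find_MAP(); the mtcars subset is illustrative and small enough for a brute-force search:

    library(gips)

    Z <- as.matrix(mtcars[, 1:4])   # n = 32 observations of p = 4 variables
    S <- cov(Z)

    g <- gips(S, number_of_observations = nrow(Z))

    # Search for the a posteriori most likely permutation symmetry group
    g_map <- find_MAP(g, optimizer = "brute_force")
    summary(g_map)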

FuzzyResampling — by Maciej Romaniuk, 2 months ago

Resampling Methods for Triangular and Trapezoidal Fuzzy Numbers

The classical (i.e. Efron's, see Efron and Tibshirani (1994, ISBN:978-0412042317), "An Introduction to the Bootstrap") bootstrap is widely used for both real (i.e. "crisp") and fuzzy data. The main aim of the algorithms implemented in this package is to overcome the problem of repeating only a few distinct values and to create fuzzy numbers that are "similar" (but not identical) to the values from the initial sample. To do this, different characteristics of triangular/trapezoidal numbers are preserved (like the value, the ambiguity, etc.; see Grzegorzewski et al., Grzegorzewski et al. (2020), Grzegorzewski et al. (2020), Grzegorzewski and Romaniuk (2022), Romaniuk and Hryniewicz (2019)). Some additional procedures related to these resampling methods are also provided, like calculation of Bertoluzza et al.'s distance (aka the mid/spread distance; see Bertoluzza et al. (1995), "On a new class of distances between fuzzy numbers") and estimation of the p-value of the one- and two-sample bootstrapped tests for the mean (see Lubiano et al. (2016)). Additionally, there are procedures which randomly generate trapezoidal fuzzy numbers using some well-known statistical distributions.
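
A tentative sketch; the function names (VAMethod, BertoluzzaDistance) and the row layout of trapezoidal numbers are recalled assumptions, so check them against the package documentation:

    library(FuzzyResampling)

    # Two trapezoidal fuzzy numbers, one per row: left end of support,
    # left end of core, right end of core, right end of support (assumed layout)
    initial <- matrix(c(0.0, 0.5, 1.0, 1.5,
                        1.0, 1.5, 2.0, 2.5), nrow = 2, byrow = TRUE)

    # Resampling that preserves the value and ambiguity (the VA method)
    secondary <- VAMethod(initial)

    # Mid/spread (Bertoluzza) distance between the two original numbers
    BertoluzzaDistance(initial[1, ], initial[2, ])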

fstcore — by Mark Klik, a year ago

R Bindings to the 'Fstlib' Library

The 'fstlib' library provides multithreaded serialization of compressed data frames using the 'fst' format. The 'fst' format allows for random access of stored data and compression with the 'LZ4' and 'ZSTD' compressors.
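
fstcore itself only ships the bindings; typical reading and writing goes through the companion 'fst' package. A minimal sketch, with file name and data invented for illustration:

    library(fst)

    df <- data.frame(x = runif(1e6),
                     y = sample(letters, 1e6, replace = TRUE))

    # Multithreaded, compressed serialization to the 'fst' format
    write_fst(df, "df.fst", compress = 50)

    # Random access: read only rows 1000-1999 without scanning the rest
    part <- read_fst("df.fst", from = 1000, to = 1999)

    # Thread count used by the underlying 'fstlib' library
    fstcore::threads_fstlib()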

CEC — by Simon Garnier, a month ago

Cross-Entropy Clustering

Splits data into Gaussian-type clusters using the Cross-Entropy Clustering ('CEC') method. This method allows for the simultaneous use of various types of Gaussian mixture models, for the reduction of unnecessary clusters, and for the discovery of new clusters by splitting them. 'CEC' is based on the work of Spurek, P. and Tabor, J. (2014).
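
A hedged sketch, assuming cec(x, centers) as the main entry point; starting with deliberately too many clusters illustrates the reduction behaviour:

    library(CEC)

    x <- as.matrix(iris[, 1:2])

    # Ask for more clusters than likely needed; CEC can drop unnecessary ones
    res <- cec(x, centers = 5)

    table(res$cluster)   # cluster sizes after any reduction
    plot(res)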

WienR — by Raphael Hartmann, a year ago

Derivatives of the First-Passage Time Density and Cumulative Distribution Function, and Random Sampling from the (Truncated) First-Passage Time Distribution

First, we provide functions to calculate the partial derivatives of the first-passage time diffusion probability density function (PDF) and cumulative distribution function (CDF) with respect to the first-passage time t (only for the PDF), the upper barrier a, the drift rate v, the relative starting point w, the non-decision time t0, the inter-trial variability of the drift rate sv, the inter-trial variability of the relative starting point sw, and the inter-trial variability of the non-decision time st0. In addition, the PDF and CDF themselves are provided. Most calculations are done on the logarithmic scale for numerical stability. Since the PDF, CDF, and their derivatives are represented as infinite series, the user can control the approximation errors with the argument 'precision'. For the numerical integration we used the C library cubature by Johnson, S. G. (2005-2013) <https://github.com/stevengj/cubature>. Numerical integration is required whenever sv, sw, and/or st0 is not zero. Note that numerical integration slows the computation, and the precision can no longer be guaranteed; therefore, whenever numerical integration is used, an estimate of the approximation error is provided in the output list. Note: the large number of contributors (ctb) is due to the many C/C++ code chunks copied from the GNU Scientific Library (GSL).

Second, we provide methods to sample from the first-passage time distribution, with or without user-defined truncation from above. The first method is a new adaptive rejection sampler building on the works of Gilks and Wild (1992) and Hartmann and Klauer (in press). The second method is a rejection sampler provided by Drugowitsch (2016). The third method is an inverse transformation sampler. The fourth method is a "pseudo" adaptive rejection sampler that builds on the first method. For more details see the corresponding help files.
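
A hedged sketch of the two advertised tasks, assuming WienR's WienerPDF() and sampWiener() interfaces; parameter values are arbitrary:

    library(WienR)

    # Density of the first-passage time at t = 0.8 for the upper boundary,
    # with boundary separation a, drift rate v, and relative starting point w
    WienerPDF(t = 0.8, response = "upper", a = 1, v = 0.5, w = 0.5)

    # Draw 1000 first-passage times and responses with the same parameters
    x <- sampWiener(N = 1000, a = 1, v = 0.5, w = 0.5)
    hist(x$q)   # assumed: sampled times are returned in component q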