Examples: visualization, C++, networks, data cleaning, html widgets, ropensci.

Found 1771 packages in 0.12 seconds

corrRF — by Elliot H. Young, 7 months ago

Clustered Random Forests for Optimal Prediction and Inference of Clustered Data

A clustered random forest algorithm for fitting random forests for data of independent clusters, that exhibit within cluster dependence. Details of the method can be found in Young and Buehlmann (2025) .

Rforestry — by Theo Saarinen, 7 months ago

Random Forests, Linear Trees, and Gradient Boosting for Inference and Interpretability

Provides fast implementations of Random Forests, Gradient Boosting, and Linear Random Forests, with an emphasis on inference and interpretability. Additionally contains methods for variable importance, out-of-bag prediction, regression monotonicity, and several methods for missing data imputation.

obliqueRSF — by Byron Jaeger, 3 years ago

Oblique Random Forests for Right-Censored Time-to-Event Data

Oblique random survival forests incorporate linear combinations of input variables into random survival forests (Ishwaran, 2008 ). Regularized Cox proportional hazard models (Simon, 2016 ) are used to identify optimal linear combinations of input variables.

rfvimptest — by Roman Hornung, 5 months ago

Sequential Permutation Testing of Random Forest Variable Importance Measures

Sequential permutation testing for statistical significance of predictors in random forests and other prediction methods. The main function of the package is rfvimptest(), which allows to test for the statistical significance of predictors in random forests using different (sequential) permutation test strategies [1]. The advantage of sequential over conventional permutation tests is that they are computationally considerably less intensive, as the sequential procedure is stopped as soon as there is sufficient evidence for either the null or the alternative hypothesis. Reference: [1] Hapfelmeier, A., Hornung, R. & Haller, B. (2023) Efficient permutation testing of variable importance measures by the example of random forests. Computational Statistics & Data Analysis 181:107689, .

IntegratedMRF — by Raziur Rahman, 7 years ago

Integrated Prediction using Uni-Variate and Multivariate Random Forests

An implementation of a framework for drug sensitivity prediction from various genetic characterizations using ensemble approaches. Random Forests or Multivariate Random Forest predictive models can be generated from each genetic characterization that are then combined using a Least Square Regression approach. It also provides options for the use of different error estimation approaches of Leave-one-out, Bootstrap, N-fold cross validation and 0.632+Bootstrap along with generation of prediction confidence interval using Jackknife-after-Bootstrap approach.

optRF — by Thomas Martin Lange, 5 months ago

Optimising Random Forest Stability by Determining the Optimal Number of Trees

Calculating the stability of random forest with certain numbers of trees. The non-linear relationship between stability and numbers of trees is described using a logistic regression model and used to estimate the optimal number of trees.

MixRF — by Jiebiao Wang, 10 years ago

A Random-Forest-Based Approach for Imputing Clustered Incomplete Data

It offers random-forest-based functions to impute clustered incomplete data. The package is tailored for but not limited to imputing multitissue expression data, in which a gene's expression is measured on the collected tissues of an individual but missing on the uncollected tissues.

rfPermute — by Eric Archer, 4 months ago

Estimate Permutation p-Values for Random Forest Importance Metrics

Estimate significance of importance metrics for a Random Forest model by permuting the response variable. Produces null distribution of importance metrics for each predictor variable and p-value of observed. Provides summary and visualization functions for 'randomForest' results.

randomForestVIP — by Kelvyn Bladen, 2 years ago

Tune Random Forests Based on Variable Importance & Plot Results

Functions for assessing variable relations and associations prior to modeling with a Random Forest algorithm (although these are relevant for any predictive model). Metrics such as partial correlations and variance inflation factors are tabulated as well as plotted for the user. A function is available for tuning the main Random Forest hyper-parameter based on model performance and variable importance metrics. This grid-search technique provides tables and plots showing the effect of the main hyper-parameter on each of the assessment metrics. It also returns each of the evaluated models to the user. The package also provides superior variable importance plots for individual models. All of the plots are developed so that the user has the ability to edit and improve further upon the plots. Derivations and methodology are described in Bladen (2022) < https://digitalcommons.usu.edu/etd/8587/>.

rQSAR — by Oche Ambrose George, 2 years ago

QSAR Modeling with Multiple Algorithms: MLR, PLS, and Random Forest

Quantitative Structure-Activity Relationship (QSAR) modeling is a valuable tool in computational chemistry and drug design, where it aims to predict the activity or property of chemical compounds based on their molecular structure. In this vignette, we present the 'rQSAR' package, which provides functions for variable selection and QSAR modeling using Multiple Linear Regression (MLR), Partial Least Squares (PLS), and Random Forest algorithms.