Environment Based Clustering for Interpretable Predictive Models in High Dimensional Data

Companion package to the paper: An analytic approach for interpretable predictive models in high dimensional data, in the presence of interactions with exposures. Bhatnagar, Yang, Khundrakpam, Evans, Blanchette, Bouchard, Greenwood (2017) . This package includes an algorithm for clustering high dimensional data that can be affected by an environmental factor.

Travis-CI Build Status

This package is under active development

The eclust package implements the methods developped in the paper An analytic approach for interpretable predictive models in high dimensional data, in the presence of interactions with exposures (2017+) Preprint. Breifly, eclust is a two-step procedure: 1a) a clustering stage where variables are clustered based on some measure of similarity, 1b) a dimension reduction stage where a summary measure is created for each of the clusters, and 2) a simultaneous variable selection and regression stage on the summarized cluster measures.


You can install the development version of eclust from GitHub with:



See the online vignette for example usage of the functions.


This package is makes use of several existing packages including:

  • glmnet for lasso and elasticnet regression
  • earth for MARS models
  • WGCNA for topological overlap matrices

Related Work

  1. Park, M. Y., Hastie, T., & Tibshirani, R. (2007). Averaged gene expressions for regression. Biostatistics, 8(2), 212-227.
  2. Bühlmann, P., Rütimann, P., van de Geer, S., & Zhang, C. H. (2013). Correlated variables in regression: clustering and sparse estimation. Journal of Statistical Planning and Inference, 143(11), 1835-1858.


Latest news

You can see the most recent changes to the package in the NEWS.md file

Code of Conduct

Please note that this project is released with a Contributor Code of Conduct. By participating in this project you agree to abide by its terms.


eclust 0.1.0

  • Added a NEWS.md file to track changes to the package.
  • First release of the eclust package

Reference manual

It appears you don't have a PDF plugin for this browser. You can click here to download the reference manual.


0.1.0 by Sahir Rai Bhatnagar, a year ago

https://github.com/sahirbhatnagar/eclust/, http://sahirbhatnagar.com/eclust/

Report a bug at https://github.com/sahirbhatnagar/eclust/issues

Browse source code at https://github.com/cran/eclust

Authors: Sahir Rai Bhatnagar [aut, cre] (http://sahirbhatnagar.com/)

Documentation:   PDF Manual  

MIT + file LICENSE license

Imports caret, data.table, dynamicTreeCut, magrittr, pacman, WGCNA, stringr, pander, stats

Suggests cluster, earth, ncvreg, knitr, rmarkdown, protoclust, factoextra, ComplexHeatmap, circlize, pheatmap, viridis, pROC, glmnet

See at CRAN