Methods for Clustering Mixed-Type Data

Implements methods for clustering mixed-type data, specifically combinations of continuous and nominal variables. Special attention is paid to the often-overlooked problem of equitably balancing the contributions of the continuous and categorical variables. The package implements KAMILA clustering, a novel method for clustering mixed-type data in the spirit of k-means clustering. KAMILA does not require dummy coding of variables and is efficient enough to scale to large data sets. Also implemented is Modha-Spangler clustering, which uses a brute-force strategy to maximize cluster separation simultaneously in the continuous and categorical variables. For more information, see Foss, Markatou, Ray, & Heching (2016) and Foss & Markatou (2018).


install.packages("kamila")
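
Once the package is installed, a KAMILA run might look like the sketch below. It is an illustration rather than a recommended workflow: the simulated data, the seed, the choice of two clusters, and the variable names (dat, conDf, catDf, kamRes) are all assumptions made for this example. The continuous variables are passed as a data frame of scaled numeric columns and the categorical variables as a data frame of factors, so no dummy coding is involved.

```r
library(kamila)

set.seed(1)  # arbitrary seed, only so the sketch is repeatable

# Simulate 200 observations from two clusters with 2 continuous and
# 2 categorical variables (4 levels each). All settings are illustrative.
dat <- genMixedData(
  sampSize = 200, nConVar = 2, nCatVar = 2, nCatLevels = 4,
  nConWithErr = 2, nCatWithErr = 2, popProportions = c(0.3, 0.7),
  conErrLev = 0.3, catErrLev = 0.8
)

# Continuous variables: scaled numeric columns in a data frame.
conDf <- data.frame(scale(dat$conVars))

# Categorical variables: a data frame of factors, not dummy-coded.
catDf <- data.frame(apply(dat$catVars, 2, factor), stringsAsFactors = TRUE)

# KAMILA with 2 clusters and 10 random initializations.
kamRes <- kamila(conDf, catDf, numClust = 2, numInit = 10)

# Cross-tabulate the estimated memberships against the simulated labels.
table(kamRes$finalMemb, dat$trueID)
```

The cross-tabulation compares the estimated cluster memberships with the labels used to simulate the data. With real data no such labels exist, and the number of clusters would need to be chosen by other means.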

Version 0.1.1.3 by Alexander Foss, 2 months ago


https://github.com/ahfoss/kamila


Report a bug at https://github.com/ahfoss/kamila/issues


Browse source code at https://github.com/cran/kamila


Authors: Alexander Foss [aut, cre], Marianthi Markatou [aut]


Documentation: PDF Manual


License: GPL-3 | file LICENSE


Imports: stats, abind, KernSmooth, gtools, Rcpp, plyr

Suggests: testthat, clustMD, ggplot2, Hmisc

LinkingTo: Rcpp

