Given a log-transformed expression matrix and list of informative genes:
subsample informative genes, cluster samples using shared nearest neighbors clustering,
estimate missing expression values with the distribution mean of means extrapolated
from these cell clusterings, and return an imputed expression matrix. See Tracy, S.,
Yuan, G.C. and Dries, R. (2019)