The Data Defect Index for Samples that May not be IID

Implements Meng's data defect index (ddi), which represents the degree of sample bias relative to an iid sample. The data defect correlation (ddc) represents the correlation between the outcome of interest and the selection into the sample; when the sample selection is independent across the population, the ddc is zero. Details are in Meng (2018), "Statistical Paradises and Paradoxes in Big Data (I): Law of Large Populations, Big Data Paradox, and the 2016 US Presidential Election." Survey estimates from the Cooperative Congressional Election Study (CCES) are included to replicate the article's results.
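As a rough illustration of the quantity described above, the ddc can be backed out of Meng's (2018) error identity, muhat - mu = rho * sqrt((N - n) / n) * sigma, where sigma is the population standard deviation of the outcome. The Python sketch below is not the package's own interface: the function name `data_defect_correlation`, its arguments, and the numbers in the usage lines are hypothetical, and it assumes a binary outcome so that sigma = sqrt(mu * (1 - mu)).

```python
import math


def data_defect_correlation(mu, muhat, N, n):
    """Solve Meng's identity for rho, assuming a binary outcome.

    mu    : true population proportion (hypothetical input)
    muhat : sample estimate of that proportion
    N     : population size
    n     : sample size
    """
    if not 0 < n < N:
        raise ValueError("need 0 < n < N")
    if not 0 < mu < 1:
        raise ValueError("need 0 < mu < 1 for a binary outcome")
    sigma = math.sqrt(mu * (1 - mu))   # population sd of a binary outcome
    scale = math.sqrt((N - n) / n)     # sqrt((1 - f) / f), with f = n / N
    return (muhat - mu) / (scale * sigma)


# Made-up numbers: a 2-point overestimate from 10,000 of 1,000,000 units.
rho = data_defect_correlation(mu=0.50, muhat=0.52, N=1_000_000, n=10_000)
# An estimate that matches the truth implies a ddc of zero.
rho_unbiased = data_defect_correlation(mu=0.50, muhat=0.50, N=1_000_000, n=10_000)
```

Under this sketch, a small ddc can still correspond to a noticeable bias when N is much larger than n, which is the point of the "Law of Large Populations" in the article's title.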



0.1.0 by Shiro Kuriwaki, a year ago

Authors: Shiro Kuriwaki [aut, cre]

Documentation:   PDF Manual  

GPL (>= 2) license

Suggests testthat, dplyr, tibble

See at CRAN