Computes compatibility (confidence) distributions along with their corresponding P-values,
S-values, and likelihoods. The intervals can be plotted to form the distributions themselves.
Functions can be compared to one another to see how much they overlap. Results can be
exported to Microsoft Word, Powerpoint, and TeX documents. The package currently supports
resampling methods, computing differences, generalized linear models, mixed-effects models,
survival analysis, and meta-analysis. These methods are discussed
by Schweder T, Hjort NL. (2016, ISBN:9781316445051) and
Rafi Z, Greenland S. (2020)
Interval estimates such as confidence compatibility/consonance
intervals are now widely reported in many journals alongside the exact
P-value of a statistical test and point estimate.
While this is a large improvement over what constituted statistical reporting in the past two decades, it is still largely inadequate.
Take for example, the 95% compatibility interval. As many have stated before, there is nothing special about 95%, yet we rarely see intervals of any other level. Choosing to compute a 95% interval is as mindless as choosing a 5% alpha level for hypothesis testing. A single compatibility interval is only a slice of a wide range of compatibility intervals at different levels. Reporting 95% intervals only promotes cargo-cult statistics since there is not much thought behind the choice. (1)
rather than conscientious practice*.” - Stark & Saltelli, 2018
Thus, we propose that instead of only calculating one interval estimate, every interval associated with a compatibility level be calculated, along with its corresponding P-value and S-value, and plotted to form a function. (2-8)
This can be accomplished using the concurve package in R.
install.packages("concurve")
library(devtools)install_github("zadchow/concurve")
"Statistical software enables and promotes cargo-cult statistics. Marketing and adoption of statistical software are driven by ease of use and the range of statistical routines the software implements. Offering complex and “modern” methods provides a competitive advantage. And some disciplines have in effect standardised on particular statistical software, often proprietary software.
Statistical software does not help you know what to compute, nor how to interpret the result. It does not offer to explain the assumptions behind methods, nor does it flag delicate or dubious assumptions. It does not warn you about multiplicity or p-hacking. It does not check whether you picked the hypothesis or analysis after looking at the data, nor track the number of analyses you tried before arriving at the one you sought to publish – another form of multiplicity. The more “powerful” and “user-friendly” the software is, the more it invites cargo-cult statistics." - Stark & Saltelli, 2018