# 'ggplot2' Based Plots with Statistical Details

Extension of 'ggplot2', 'ggstatsplot' creates graphics with details from statistical tests included in the plots themselves. It provides an easier API to generate information-rich plots for statistical analysis of continuous (violin plots, scatterplots, histograms, dot plots, dot-and-whisker plots) or categorical (pie and bar charts) data. Currently, it supports the most common types of statistical approaches and tests: parametric, nonparametric, robust, and Bayesian versions of t-test/ANOVA, correlation analyses, contingency table analysis, meta-analysis, and regression analyses.

Package Status Usage GitHub References

# Overview

ggstatsplot is an extension of ggplot2 package for creating graphics with details from statistical tests included in the plots themselves and targeted primarily at behavioral sciences community to provide a one-line code to produce information-rich plots. In a typical exploratory data analysis workflow, data visualization and statistical modeling are two different phases: visualization informs modeling, and modeling in its turn can suggest a different visualization method, and so on and so forth. The central idea of ggstatsplot is simple: combine these two phases into one in the form of graphics with statistical details, which makes data exploration simpler and faster.

Currently, it supports only the most common types of statistical tests: parametric, nonparametric, robust, and bayesian versions of t-test/anova, correlation analyses, contingency table analysis, and regression analyses.

It, therefore, produces a limited kinds of plots for the supported analyses:

• violin plots (for comparisons between groups or conditions),
• pie charts and bar charts (for categorical data),
• scatterplots (for correlations between two variables),
• correlation matrices (for correlations between multiple variables),
• histograms and dot plots/charts (for hypothesis about distributions),
• dot-and-whisker plots (for regression models).

In addition to these basic plots, ggstatsplot also provides grouped_ versions for most functions that makes it easy to repeat the same analysis for any grouping variable.

Future versions will include other types of statistical analyses and plots as well.

# Statistical reporting

For all statistical tests reported in the plots, the default template abides by the APA gold standard for statistical reporting. For example, here are results from Yuen’s test for trimmed means (robust t-test):

# Summary of supported statistical analyses

The table below summarizes all the different types of analyses currently supported in this package-

Functions Description Parametric Non-parametric Robust Bayes Factor
ggbetweenstats Between group/condition comparisons Yes Yes Yes Yes
gghistostats, ggdotplotstats Distribution of a numeric variable Yes Yes Yes Yes
ggcorrmat Correlation matrix Yes Yes Yes No
ggscatterstats Correlation between two variables Yes Yes Yes Yes
ggpiestats, ggbarstats Association between categorical variables Yes No No Yes
ggpiestats Proportion test No No No No
ggcoefstats Regression model coefficients Yes No Yes Yes

# Effect sizes and confidence intervals available

ggstatsplot provides a wide range of effect sizes and their confidence intervals.

Test Parametric Non-parametric Robust Bayes
one-sample t-test Yes Yes Yes No
two-sample t-test (between) Yes Yes Yes No
two-sample t-test (within) Yes Yes Yes No
One-way ANOVA (between) Yes Yes Yes No
One-way ANOVA (within) Yes No No No
correlations Yes Yes Yes No
contingency table Yes NA NA No
goodness of fit Yes NA NA No
regression Yes Yes Yes No

# Installation

To get the latest, stable CRAN release (0.0.10):

Note: If you are on a linux machine, you will need to have OpenGL libraries installed (specifically, libx11, mesa and Mesa OpenGL Utility library - glu) for the dependency package rgl to work.

You can get the development version of the package from GitHub (0.0.10.9000). To see what new changes (and bug fixes) have been made to the package since the last release on CRAN, you can check the detailed log of changes here: https://indrajeetpatil.github.io/ggstatsplot/news/index.html

If you are in hurry and want to reduce the time of installation, prefer-

If time is not a constraint-

If you are not using the RStudio IDE and you get an error related to “pandoc” you will either need to remove the argument build_vignettes = TRUE (to avoid building the vignettes) or install pandoc. If you have the rmarkdown R package installed then you can check if you have pandoc by running the following in R:

# Citation

If you want to cite this package in a scientific journal or in any other context, run the following code in your R console:

There is currently a publication in preparation corresponding to this package and the citation will be updated once it’s published.

# Documentation and Examples

To see the detailed documentation for each function in the stable CRAN version of the package, see:

To see the documentation relevant for the development version of the package, see the dedicated website for ggstatplot, which is updated after every new commit: https://indrajeetpatil.github.io/ggstatsplot/.

## Help

In R, documentation for any function can be accessed with the standard help command (e.g., ?ggbetweenstats).

Another handy tool to see arguments to any of the functions is args. For example-

In case you want to look at the function body for any of the functions, just type the name of the function without the parentheses:

If you are not familiar either with what the namespace :: does or how to use pipe operator %>%, something this package and its documentation relies a lot on, you can check out these links-

ggstatsplot relies on non-standard evaluation (NSE), i.e., rather than looking at the values of arguments (x, y), it instead looks at their expressions. This means that you shouldn’t enter arguments with the $ operator and setting data = NULL: data = NULL, x = data$x, y = data$y. You must always specify the data argument for all functions. On the plus side, you can enter arguments either as a string (x = "x", y = "y") or as a bare expression (x = x, y = y) and it wouldn’t matter. To read more about NSE, see- http://adv-r.had.co.nz/Computing-on-the-language.html ggstatsplot is a very chatty package and will by default print helpful notes on assumptions about linear models, warnings, etc. If you don’t want your console to be cluttered with such messages, they can be turned off by setting argument messages = FALSE in the function call. Here are examples of the main functions currently supported in ggstatsplot. Note: If you are reading this on GitHub repository, the documentation below is for the development version of the package. So you may see some features available here that are not currently present in the stable version of this package on CRAN. For documentation relevant for the CRAN version, see: ## ggbetweenstats This function creates either a violin plot, a box plot, or a mix of two for between-group or between-condition comparisons with results from statistical tests in the subtitle. The simplest function call looks like this- Note that this function returns a ggplot2 object and thus any of the graphics layers can be further modified. The type (of test) argument also accepts the following abbreviations: "p" (for parametric) or "np" (for nonparametric) or "r" (for robust) or "bf" (for Bayes Factor). Additionally, the type of plot to be displayed can also be modified ("box", "violin", or "boxviolin"). A number of other arguments can be specified to make this plot even more informative or change some of the default options. In case of a parametric t-test, setting bf.message = TRUE will also attach results from Bayesian Student’s t-test. That way, if the null hypothesis can’t be rejected with the NHST approach, the Bayesian approach can help index evidence in favor of the null hypothesis (i.e., BF01). By default, Bayes Factor quantifies the support for the alternative hypothesis (H1) over the null hypothesis (H0) (i.e., BF10 is displayed). Natural logarithms are shown because BF values can be pretty large. This also makes it easy to compare evidence in favor alternative (BF10) versus null (BF01) hypotheses (since log(BF10) = - log(BF01)). Additionally, there is also a grouped_ variant of this function that makes it easy to repeat the same operation across a single grouping variable: Here is a summary of pairwise comparison tests supported in ggbetweenstats- Type Design Equal variance? Test p-value adjustment? Parametric between No Games-Howell test Yes Parametric between Yes Student’s t-test Yes Parametric within NA Student’s t-test Yes Non-parametric between No Dwass-Steel-Crichtlow-Fligner test Yes Non-parametric within No Durbin-Conover test Yes Robust between No Yuen’s trimmed means test Yes Robust within NA Yuen’s trimmed means test Yes Bayes Factor between No No No Bayes Factor between Yes No No Bayes Factor within NA No No For more, see the ggbetweenstats vignette: https://indrajeetpatil.github.io/ggstatsplot/articles/web_only/ggbetweenstats.html This function is not appropriate for within-subjects designs. Variant of this function ggwithinstats is currently under work. You can still use this function just to prepare the plot for exploratory data analysis, but the statistical details displayed in the subtitle will be incorrect. You can remove them by adding + ggplot2::labs(subtitle = NULL) to your function call. As a temporary solution, you can use the helper function from ggstatsplot to display results from within-subjects version of the test in question. Here is an example- ## ggscatterstats This function creates a scatterplot with marginal histograms/boxplots/density/violin/densigram plots from ggExtra::ggMarginal and results from statistical tests in the subtitle: Number of other arguments can be specified to modify this basic plot- Additionally, there is also a grouped_ variant of this function that makes it easy to repeat the same operation across a single grouping variable: For more, see the ggscatterstats vignette: https://indrajeetpatil.github.io/ggstatsplot/articles/web_only/ggscatterstats.html ## ggpiestats This function creates a pie chart for categorical or nominal variables with results from contingency table analysis (Pearson’s chi-squared test for between-subjects design and McNemar’s test for within-subjects design) included in the subtitle of the plot. If only one categorical variable is entered, results from one-sample proportion test will be displayed as a subtitle. This function can also be used to study an interaction between two categorical variables. Additionally, this basic plot can further be modified with additional arguments and the function returns a ggplot2 object that can further be modified with ggplot2 syntax: In case of within-subjects designs, setting paired = TRUE will produce results from McNemar test- Additionally, there is also a grouped_ variant of this function that makes it easy to repeat the same operation across a single grouping variable: For more, including information about the variant of this function grouped_ggpiestats, see the ggpiestats vignette: https://indrajeetpatil.github.io/ggstatsplot/articles/web_only/ggpiestats.html ## ggbarstats In case you are not a fan of pie charts (for very good reasons), you can alternatively use ggbarstats function- And, needless to say, there is also a grouped_ variant of this function- ## gghistostats In case you would like to see the distribution of one variable and check if it is significantly different from a specified value with a one sample test, this function will let you do that. The type (of test) argument also accepts the following abbreviations: "p" (for parametric) or "np" (for nonparametric) or "r" (for robust) or "bf" (for Bayes Factor). The aesthetic defaults can be easily modified- As can be seen from the plot, bayes factor can be attached (bf.message = TRUE) to assess evidence in favor of the null hypothesis. Additionally, there is also a grouped_ variant of this function that makes it easy to repeat the same operation across a single grouping variable: For more, including information about the variant of this function grouped_gghistostats, see the gghistostats vignette: https://indrajeetpatil.github.io/ggstatsplot/articles/web_only/gghistostats.html ## ggdotplotstats This function is similar to gghistostats, but is intended to be used when numeric variable also has a label. As with the rest of the functions in this package, there is also a grouped_ variant of this function to facilitateto repeat the same operation across a grouping variable. ## ggcorrmat ggcorrmat makes a correlalogram (a matrix of correlation coefficients) with minimal amount of code. Just sticking to the defaults itself produces publication-ready correlation matrices. But, for the sake of exploring the available options, let’s change some of the defaults. For example, multiple aesthetics-related arguments can be modified to change the appearance of the correlation matrix. Note that if there are NAs present in the selected dataframe, the legend will display minimum, median, and maximum number of pairs used for correlation matrices. Alternatively, you can use it just to get the correlation matrices and their corresponding p-values (in a tibble format). Also, note that if cor.vars are not specified, all numeric variables will be used. Additionally, there is also a grouped_ variant of this function that makes it easy to repeat the same operation across a single grouping variable: For examples and more information, see the ggcorrmat vignette: https://indrajeetpatil.github.io/ggstatsplot/articles/web_only/ggcorrmat.html ## ggcoefstats ggcoefstats creates a lot with the regression coefficients’ point estimates as dots with confidence interval whiskers. The basic plot can be further modified to one’s liking with additional arguments (also, let’s use a robust linear model instead of a simple linear model now): Most of the regression models that are supported in the broom and broom.mixed packages with tidy and glance methods are also supported by ggcoefstats. For example- aareg, anova, aov, aovlist, Arima, biglm, brmsfit, btergm, cch, clm, clmm, confusionMatrix, coxph, ergm, felm, fitdistr, glmerMod, glmmTMB, gls, gam, Gam, gamlss, garch, glm, glmmadmb, glmmTMB, glmrob, gmm, ivreg, lm, lm.beta, lmerMod, lmodel2, lmrob, mcmc, MCMCglmm, mediate, mjoint, mle2, multinom, nlmerMod, nlrq, nls, orcutt, plm, polr, ridgelm, rjags, rlm, rlmerMod, rq, speedglm, speedlm, stanreg, survreg, svyglm, svyolr, svyglm, etc. For an exhaustive list of all regression models supported by ggcoefstats and what to do in case the regression model you are interested in is not supported, see the associated vignette- https://indrajeetpatil.github.io/ggstatsplot/articles/web_only/ggcoefstats.html ## combine_plots The full power of ggstatsplot can be leveraged with a functional programming package like purrr that replaces for loops with code that is both more succinct and easier to read and, therefore, purrr should be preferrred 😻. (Another old school option to do this effectively is using the plyr package.) In such cases, ggstatsplot contains a helper function combine_plots to combine multiple plots, which can be useful for combining a list of plots produced with purrr. This is a wrapper around cowplot::plot_grid and lets you combine multiple plots and add a combination of title, caption, and annotation texts with suitable defaults. For examples (both with plyr and purrr), see the associated vignette- https://indrajeetpatil.github.io/ggstatsplot/articles/web_only/combine_plots.html ## theme_ggstatsplot All plots from ggstatsplot have a default theme: theme_ggstatsplot. You can change this theme by using the argument ggtheme for all functions. It is important to note that irrespective of which ggplot theme you choose, ggstatsplot in the backdrop adds a new layer with its idiosyncratic theme settings, chosen to make the graphs more readable or aesthetically pleasing. Let’s see an example with gghistostats and see how a certain theme from hrbrthemes package looks with and without the ggstatsplot layer. For more on how to modify it, see the associated vignette- https://indrajeetpatil.github.io/ggstatsplot/articles/web_only/theme_ggstatsplot.html ## Using ggstatsplot helpers to display text results Sometimes you may not like the default plot produced by ggstatsplot. In such cases, you can use other custom plots (from ggplot2 or other plotting packages) and still use ggstatsplot (subtitle) helper functions to display results from relevant statistical test. For example, in the following chunk, we will use pirateplot from yarrr package and use ggstatsplot helper function to display the results. # Code coverage As the code stands right now, here is the code coverage for all primary functions involved: https://codecov.io/gh/IndrajeetPatil/ggstatsplot/tree/master/R # Contributing I’m happy to receive bug reports, suggestions, questions, and (most of all) contributions to fix problems and add features. I personally prefer using the Github issues system over trying to reach out to me in other ways (personal e-mail, Twitter, etc.). Pull requests for contributions are encouraged. Here are some simple ways in which you can contribute: • Read and correct any inconsistencies in the documentation • Raise issues about bugs or wanted features • Review code • Add new functionality (in the form of new plotting functions or helpers for preparing subtitles) Please note that this project is released with a Contributor Code of Conduct. By participating in this project you agree to abide by its terms. # Session Information For details about the session information in which this README file was rendered, see- https://indrajeetpatil.github.io/ggstatsplot/articles/web_only/session_info.html # News # ggstatsplot 0.0.9.9000 NEW FEATURES • ggcoefstats can support following new model objects: rjags. • New VR_dilemma dataset for toying around with within-subjects design. • subtitle_t_onesample supports both Cohen's d and Hedge's g as effect sizes and also produces their confidence intervals. Additionally, non-central variants of these effect sizes are also supported. Thus, gghistostats and its grouped_ variant gets two new arguments: effsize.type, effsize.noncentral. • ggpiestats used to display odds ratio as effect size for paired designs (McNemar test). But this was only working when the analysis was a 2 x 2 contingency table. It now instead displays Cohen's G as effect size, which generalizes to any kind of design. MINOR CHANGES • The internal function outlier_df to add a column specifying outlier status of any given data point is now exported. • ggstatsplot previously relied on an internal function chisq_v_ci to compute confidence intervals for Cramer's V using bootstrapping but it was pretty slow. It now instead relies on rcompanion package to compute confidence intervals for V. ggstatsplot, therefore, gains a new dependency. • subtitle_mann_nonparametric and subtitle_t_onesample now computes effect size r and its confidence intervals as$Z/\sqrt{N}$(with the help of rcompanion package), instead of using Spearman correlation. # ggstatsplot 0.0.9 BREAKING CHANGES • subtitle_t_onesample no longer has data as the optional argument. This was done to be consistent with other subtitle helper functions. NEW FEATURES • New function ggbarstats (and its grouped_ variant) introduced for making bar charts (thanks to #78). • ggcoefstats also displays a caption with model summary when meta-analysis is required. • gghistostats and its grouped_ variant has a new argument normal.curve to superpose a normal distribution curve on top of the histogram (#138). • ggcoefstats can support following new regression model objects: brmsfit, gam, Gam, gamlss, mcmc, mjoint, stanreg. • New function to convert plots which are not of gg/ggplot class to ggplot class objects. • Instead of using effsize to compute Cohen's d and Hedge's g, ggstatsplot now relies on a new (#159) internal function effect_t_parametric to compute them. This removes effsize from dependencies. • To be consistent with other functions in this package, both ggbarstats and ggpiestats gain results.subtitle which can be set to FALSE if statistical analysis is not required, in which case subtitle argument can be used to provide alternative subtitle. MAJOR CHANGES • ggbetweenstats now defaults to using noncentral-t distribution for computing Cohen's d and Hedge's g. To get variants with central-t distribution, use effsize.noncentral = FALSE. MINOR CHANGES • All grouped_ functions had argument title.prefix that defaulted to "Group". It now instead defaults to NULL, in which case the prefix will variable name for grouping.var argument. • To accommodate non-parametric tests, subtitle_template function can now work with parameter = NULL. • For ggbetweenstats, details contained in the subtitle for non-parametric test are modified. It now uses Spearman's rho-based effect size estimates. This removes coin from dependencies. • ggbetweenstats and its grouped_ variant gain a new argument axes.range.restrict (which defaults to FALSE). This restricts y-axes limits to minimum and maximum of y variable. This is what these functions were doing by default in the past versions, which created issues for additional ggplot components using the ggplot.component argument. • All bayes factor related subtitle and captions replace prior.width with r_{Cauchy}. • ggcoefstats passes dots (...) to augment method from broom. BUG FIXES • The helper function bf_extractor no longer provides option to extract information about posterior distribution because these details were incorrect. The posterior = TRUE details were not used anywhere in the package so nothing about the results changes. • ggcorrmat didn't output pair names when output == "ci" was used. This is fixed. # ggstatsplot 0.0.8 NEW FEATURES • ggcoefstats gains meta.analytic.effect that can be used to carry out meta-analysis on regression estimates. This especially useful when a dataframe with regression estimates and standard error is available from prior analyses. The subtitle is prepared with the new function subtitle_meta_ggcoefstats which is also exported. • ggbetweenstats, ggscatterstats, gghistostats, and ggdotplotstats (and their grouped_ variants) all gain a new ggplot.component argument. This argument will primarily be helpful to change the individual plots in a grouped_ plot. • ggcoefstats can support following new regression model objects: polr, survreg, cch, Arima, biglm, glmmTMB, coxph, ridgelm, aareg, plm, nlrq, ivreg, ergm, btergm, garch, gmm, lmodel2, svyolr, confusionMatrix, multinom, nlmerMod, svyglm, MCMCglmm, lm.beta, speedlm, fitdistr, mle2, orcutt, glmmadmb. BUG FIXES • ggcoefstats didn't work when statistic argument was set to NULL. This was not expected behavior. This has been fixed. Now, if statistic is not specified, only the dot-and-whiskers will be shown without any labels. • subtitle_t_parametric was producing incorrect sample size information when paired = TRUE and the data contained NAs. This has been fixed. MAJOR CHANGES • ggscatterstats and its grouped_ variant accept both character and bare exressions as input to arguments label.var and labe.expression (#110). • To be consistent with rest of the functions in the package, both Pearson's r, Spearman's rho, and robust percentage bend correlations also display information about statistic associated with these tests. • ggscatterstats, by default, showed jittered data points (because it relied on position_jitter defaults). This could be visually inaccurate and, therefore, ggscatterstats now displays points without any jitter. The user can introduce jitter if they wish to using point.width.jitter and point.height.jitter arguments. For similar reasons, for ggbetweenstats and its grouped_ variant, point.jitter.height default has been changed from 0.1 to 0 (no vertical jitter, i.e.). MINOR CHANGES • Confidence interval for Kendall's W is now computed using stats::kruskal.test. As a result, PMCMRplus removed from dependencies. • ggcoefstats gains a caption argument. If caption.summary is set to TRUE, the specified caption will be added on top of the caption.summary. # ggstatsplot 0.0.7 BUG FIXES • ggcoefstats was showing wrong confidence intervals for merMod class objects due to a bug in the broom.mixed package (https://github.com/bbolker/broom.mixed/issues/30#issuecomment-428385005). This was fixed in broom.mixed and so ggcoefstats should no longer have any issues. • specify_decimal_p has been modified because it produced incorrect results when k < 3 and p.value = TRUE (e.g., 0.002 was printed as < 0.001). • ggpiestats produced incorrect results if some levels of the factor had been filtered out prior to using this function. It now drops unused levels and produces correct results. • gghistostats wasn't filtering out NAs properly. This has been fixed. MAJOR CHANGES • New function ggdotplotstats for creating a dot plot/chart for labelled numeric data. • All primary functions gain conf.level argument to control confidence level for effect size measures. • As per APA guidelines, all results show results with two decimal places. That is, the default value for k argument for all functions has been changed from 3 to 2. • All helper functions for the ggbetweenstats subtitles have been renamed to remove _ggbetween_ from their names as this was becoming confusing for the user. Some of these functions work both with the between- and within-subjects designs, so having _ggbetween_ in their names made users suspect if they could use these functions for within-subjects designs. • ggstatsplot now depends on R 3.5.0. This is because some of its dependencies require 3.5.0 to work (e.g., broom.mixed). • All theme_ functions are now exported (theme_pie(), theme_corrmat()). • ggbetweenstats now supports multiple pairwise comparison tests (parametric, nonparametric, and robust variants). It gains a new dependency ggsignif. • ggbetweenstats now supports eta-squared and omega-squared effect sizes for anova models. This function gains a new argument partial. • Following functions are now reexported from the groupedstats package to avoid repeating the same code in two packages: specify_decimal_p, signif_column, lm_effsize_ci, and set_cwd. Therefore, groupedstats is now added as a dependency. • gghistostats can now show both counts and proportions information on the same plot when bar.measure argument is set to "mix". • ggcoefstats works with tidy dataframes. • The helper function untable has been deprecated in light of tidyr::uncount, which does exactly what untable was doing. The author wasn't aware of this function when untable was written. • All vignettes have been removed from CRAN to reduce the size of the package. They are now available on the package website: https://indrajeetpatil.github.io/ggstatsplot/articles/. • subtitle_t_robust function can now handle dependent samples and gains paired argument. • A number of tidyverse operators are now reexported by ggstatsplot: %>%, %<>%, %$%.

MINOR CHANGES

• ggscatterstats, ggpiestats, and their grouped_ variant support bayes factor tests and gain new arguments relevant to this test.
• Effect size and their confidence intervals now available for Kruskal-Wallis test.
• Minor stylistic changes to how symbols for partial-eta-/omega-squared were being displayed in subtitles.
• ggbetweenstats supports bayes factor tests for anova designs.
• ggpiestats (and its grouped_ version) gain slice.label argument that decides what information needs to be displayed as a label on the slices of the pie chart: "percentage" (which has been the default thus far), "counts", or "both".
• ggcorrmat can work with cor.vars = NULL. In such case, all numeric variables from the provided dataframe will be used for computing the correlation matrix.
• Given the constant changes to the default behavior of functions, the lifecycle badge has been changed from stable to maturing.
• When the number of colors needed by a function exceeds the number of colors contained in a given palette, informative message is displayed to the user (with the new internal function palette_message()).
• Several users had requested an easier way to turn off subtitles with results from tests (which was already implemented in ggscatterstats and gghistostats with the argument results.subtitle), so ggbetweenstats also gains two new arguments to do this: results.subtitle and subtitle.
• New dataset added: iris_long.
• More tests added and the code coverage has now jumped to over 75%.
• To avoid code repetition, there is a now a function that produces a generic message any time confidence intervals for effect size estimate are computed using bootstrapping.

# ggstatsplot 0.0.6

MAJOR CHANGES

• The package now exports all functions used to create text expressions with results. This makes it easy for people to use these results in their own plots at any location they want (and not just in subtitle, the current default for ggstatsplot).
• ggcorrmat gains p.adjust.method argument which allows p-values for correlations to be corrected for multiple comparisons.
• ggscatterstats gains label.var and label.expression arguments to attach labels to points.
• gghistostats now defaults to not showing (redundant) color gradient (fill.gradient = FALSE) and shows both "count" and "proportion" data. It also gains a new argument bar.fill that can be used to fill bars with a uniform color.
• ggbetweenstats, ggcoefstats, ggcorrmat, ggscatterstats, and ggpiestats now support all palettes contained in the paletteer package. This helps avoid situations where people had large number of groups (> 12) and there were not enough colors in any of the RColorBrewer palettes.
• ggbetweenstats gains bf.message argument to display bayes factors in favor of the null (currently works only for parametric t-test).
• gghistostats function no longer has line.labeller.y argument; this position is automatically determined now.

BREAKING CHANGES

• legend.title.margin function has been deprecated since ggplot2 3.0.0 has improved on the margin issues from previous versions. All functions that wrapped around this function now lose the relevant arguments (legend.title.margin, t.margin, b.margin).
• The argument ggstatsplot.theme has been changed to ggstatsplot.layer for ggcorrmat function to be consistent across functions.
• For consistency, conf.level and conf.type arguments for ggbetweenstats have been deprecated. No other function in the package allowed changing confidence interval or their type for effect size estimation. These arguments were relevant only for robust tests anyway.
• ggocorrmat argument type has been changed to matrix.type because for all other functions type argument specifies the type of the test, while for this function it specified the display of the visualization matrix. This will make the syntax more consistent across functions.
• ggscatterstats gains new arguments to specify aesthetics for geom point (point.color, point.size, point.alpha). To be consistent with this naming schema, the width.jitter and height.jitter arguments have been renamed to point.width.jitter and point.height.jitter, resp.

MINOR CHANGES

• gghistostats: To be compatible with JASP, natural logarithm of Bayes Factors is displayed, and not base 10 logarithm.
• ggscatterstats gains method and formula arguments to modify smoothing functions.
• ggcorrmat can now show robust correlation coefficients in the matrix plot.
• For gghistostats, binwidth value, if not specified, is computed with (max-min)/sqrt(n). This is basically to get rid of the warnings ggplot2 produces. Thanks to Chuck Powell's PR (#43).
• ggcoefstats gains a new argument partial and can display eta-squared and omega-squared effect sizes for anovas, in addition to the prior partial variants of these effect sizes.
• ggpiestats gains perc.k argument to show desired number of decimal places in percentage labels.

BUG FIXES

• grouped_ggpiestats wasn't working when only main variable was provided with counts data. Fixed that.

# ggstatsplot 0.0.5

MAJOR CHANGES

• For the sake of consistency, theme_mprl is now called theme_ggstatsplot. The theme_mprl function will still be around and will not be deprecated, so feel free to use either or both of them since they are identical.
• ggcoefstats no longer has arguments effects and ran_params because only fixed effects are shown for mixed-effects models.
• ggpiestats can now handle within-subjects designs (McNemar test results will be displayed).

BUG FIXES

• ggbetweenstats was producing wrong axes labels when sample.size.label was set to TRUE and user had reordered factor levels before using this function. The new version fixes this.
• ggcoefstats wasn't producing partial omega-squared for aovlist objects. Fixed that with new version of sjstats.

MINOR CHANGES

• Removed the trailing comma from the robust correlation analyses.
• gghistostats has a new argument to remove color fill gradient.
• ggbetweenstats takes new argument mean.ci to show confidence intervals for the mean values.
• For lmer models, p-values are now computed using sjstats::p_value. This removes lmerTest package from dependencies.
• sjstats no longer suggests apaTables package to compute confidence intervals for partial eta- and omega-squared. Therefore, apaTables and MBESS are removed from dependencies.
• ggscatterstats supports densigram with the development version of ggExtra. It additionally gains few extra arguments to change aesthetics of marginals (alpha, size, etc.).

# ggstatsplot 0.0.4

MAJOR CHANGES

• New function: ggcoefstats for displaying model coefficients.
• All functions now have ggtheme argument that can be used to change the default theme, which has now been changed from theme_grey() to theme_bw().
• The robust correlation is no longer MASS::rlm, but percentage bend correlation, as implemented in WRS2::pbcor. This was done to be consistent across different functions. ggcorrmat also uses percentage bend correlation as the robust correlation measure. This also means that ggstatsplot no longer imports MASS and sfsmisc.
• The data argument is no longer NULL for all functions, except gghistostats. In other words, the user must provide a dataframe from which variables or formulas should be selected.
• All subtitles containing results now also show sample size information (n). To adjust for the inflated length of the subtitle, the default subtitle text size has been changed from 12 to 11.

MINOR CHANGES

• Switched back to Shapiro-Wilk test of normality to remove nortest from imports.
• ggpiestats can now handle dataframes with
• ggbetweenstats and ggpiestats now display sample sizes for each level of the groping factor by default. This behavior can be turned off by setting sample.size.label to FALSE.
• Three new datasets added: Titanic_full, movies_wide, movies_long.
• Added confidence interval for effect size for robust ANOVA.
• The 95% CI for Cramer'V computed using boot::boot. Therefore, the package no longer imports DescTools.
• To be consistent across correlations covered, all correlations now show estimates for correlation coefficients, confidence intervals for the estimate, and p-values. Therefore, t-values and regression coefficients are no longer displayed for Pearson's r.
• The legend.title.margin arguments for gghistostats and ggcorrmat now default to FALSE, since ggplot2 3.0.0 has better legend title margins.
• ggpiestats now sorts the summary dataframes not by percentages but by the levels of main variable. This was done to have the same legends across different levels of a grouping variable in grouped_ggpiestats.
• To remove cluttered display of results in the subtitle, ggpiestats no longer shows titles for the tests run (these were "Proportion test" and "Chi-Square test"). From the pie charts, it should be obvious to the user or reader what test was run.
• gghistostats also allows running robust version of one-sample test now (One-sample percentile bootstrap).

# ggstatsplot 0.0.3

NEW FEATURES

• The ggbetweenstats function can now show notched box plots. Two new arguments notch and notchwidth control its behavior. The defaults are still standard box plots.
• Removed warnings that were appearing when outlier.label argument was of character type.
• The default color palette used for all plots is colorblind friendly.
• gghistostats supports proportion and density as a value measure for bar heights to show proportions and density. New argument bar.measure controls this behavior.
• grouped_ variants of functions ggcorrmat, ggscatterstats, ggbetweenstats, and ggpiestats introduced to create multiple plots for different levels of a grouping variable.

MAJOR CHANGES

• To be internally consistent, all functions in ggstatsplot use the spelling color, rather than colour in some functions, while color in others.
• Removed the redundant argument binwidth.adjust from gghistostats function. This argument was relevant for the first avatar of this function, but is no longer playing any role.
• To be internally consistent, the argument lab_col and lab_size in ggcorrmat have been changed to lab.col and lab.size, respectively.

MINOR CHANGES

• Added a new argument to ggstatsplot.theme function to control if ggstatsplot::theme_mprl is to be overlaid on top of the selected ggtheme (ggplot2 theme, i.e.).
• Two new arguments added to gghistostats to allow user to change colorbar gradient. Defaults are colorblind friendly.
• Both gghistostats and ggcorrmat have a new argument legend.title.margin to control margin adjustment between the title and the colorbar.
• The vertical lines denoting test values and centrality parameters can be tagged with text labels with a new argument line.labeller in gghistostats function.

BUG FIXES

• The centrality.para argument for ggscatterstats was not working properly. Choosing "median" didn't show median, but the mean. This is fixed now.

# ggstatsplot 0.0.2

NEW FEATURES

• Bayesian test added to gghistostats and two new arguments to also display a vertical line for test.value argument.
• Vignette added for gghistostats.
• Added new function grouped_gghistostats to facilitate applying gghistostats for multiple levels of a grouping factor.
• ggbetweenstats has a new argument outlier.coef to adjust threshold used to detect outliers. Removed bug from the same function when outlier.label argument is of factor/character type.

MAJOR CHANGES

• Functions signif_column and grouped_proptest are now deprecated. They were exported in the first release by mistake.
• Function gghistostats no longer displays both density and count since the density information was redundant. The density.plot argument has also been deprecated.
• ggscatterstats argument intercept has now been changed to centrality.para. This was due to possible confusion about interpretation of these lines; they show central tendency measures and not intercept for the linear model. Thus the change.
• The default for effsize.type = "biased" effect size for ggbetweenstats in case of ANOVA is partial omega-squared, and not omega-squared. Additionally, both partial eta- and omega-squared are not computed using bootstrapping with (default) 100 bootstrap samples.

MINOR CHANGES

• More examples added to the README document.
• 95% confidence intervals for Spearman's rho are now computed using broom package. RVAideMemoire package is thus removed from dependencies.
• 95% confidence intervals for partial eta- and omega-squared for ggbetweenstats function are now computed using sjstats package, which allows bootstrapping. apaTables and userfriendlyscience packages are thus removed from dependencies.

# ggstatsplot 0.0.1

• First release of the package.

# Reference manual

install.packages("ggstatsplot")

0.7.0 by Indrajeet Patil, 12 days ago

https://indrajeetpatil.github.io/ggstatsplot/, https://github.com/IndrajeetPatil/ggstatsplot

Report a bug at https://github.com/IndrajeetPatil/ggstatsplot/issues

Browse source code at https://github.com/cran/ggstatsplot

Authors: Indrajeet Patil [cre, aut, cph] , @patilindrajeets)

Documentation:   PDF Manual