Implementation of the metalog distribution in R.
The metalog distribution is a modern, highly flexible, data-driven distribution.
Metalogs were developed by Keelin (2016).
Isaac J. Faber
This repo is a working project for an R package that generates functions for the metalog distribution. The metalog distribution is a highly flexible probability distribution that can be used to model data without traditional parameters.
In economics, business, engineering, science, and other fields, continuous uncertainties frequently arise that are not easily or well characterized by previously named continuous probability distributions. Frequently, data is available from measurements, assessments, derivations, simulations, or other sources that characterize the range of an uncertainty. But the underlying process that generated this data is either unknown or does not lend itself to convenient derivation of equations that appropriately characterize the probability density function (PDF), cumulative distribution function (CDF), or quantile function.
The metalog distributions are a family of continuous univariate probability distributions that directly address this need. They can be used in almost any situation in which CDF data is known and a flexible, simple, and easy-to-use continuous probability distribution is needed to represent that data. Consider their uses and benefits, as well as their applications over a wide range of fields and data sources.
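To make "representing CDF data" concrete, here is a minimal base R sketch of the idea behind the metalog: the quantile function is a linear combination of simple basis terms in the cumulative probability y, so its coefficients can be estimated by ordinary least squares. The basis below follows Keelin (2016) for up to four terms. The function names (metalog_basis, fit_metalog_sketch, quantile_metalog_sketch) are hypothetical and are not part of the rmetalog package.

```r
# Basis matrix for an n-term metalog quantile function (2 to 4 terms here).
# Columns: 1, logit(y), (y - 0.5) * logit(y), (y - 0.5).
metalog_basis <- function(y, n_terms = 4) {
  stopifnot(n_terms >= 2, n_terms <= 4, all(y > 0 & y < 1))
  logit <- log(y / (1 - y))
  cols <- cbind(1, logit, (y - 0.5) * logit, y - 0.5)
  cols[, seq_len(n_terms), drop = FALSE]
}

# Least-squares estimate of the coefficients from (probability, quantile) pairs.
fit_metalog_sketch <- function(x, y, n_terms = 4) {
  Y <- metalog_basis(y, n_terms)
  as.vector(solve(crossprod(Y), crossprod(Y, x)))
}

# Evaluate the fitted quantile function M(y).
quantile_metalog_sketch <- function(a, y) {
  as.vector(metalog_basis(y, length(a)) %*% a)
}

# A logistic distribution is exactly a 2-term metalog, so fitting its
# quantiles should recover location 10 and scale 2, with the remaining
# coefficients close to zero.
y <- seq(0.01, 0.99, by = 0.01)
x <- qlogis(y, location = 10, scale = 2)
a <- fit_metalog_sketch(x, y, n_terms = 4)
round(a, 6)
```

With real data, x would be the sorted observations and y their empirical cumulative probabilities. The package handles that step, along with validity checks, for you.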
To install the package from this repository use the following:
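The install command itself is not shown here, so the lines below are a hedged sketch rather than the official instructions. They assume the development version can be installed with the remotes package, and that a released version is available on CRAN under the name rmetalog. The `<owner>` placeholder is deliberate: replace it with the account that hosts this repository.

```r
# Assumption: development version installed from GitHub with remotes.
# Replace <owner> with the account hosting this repository before running.
# remotes::install_github("<owner>/rmetalog")

# Assumption: a released version is available on CRAN as "rmetalog".
install.packages("rmetalog")
```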
Once the package is loaded, you start with a data set of continuous observations. For this repository, we will load the library and use an example of fish size measurements from the Pacific Northwest. This data set is bimodal, which makes it a good illustration of the flexibility of the metalog distribution. The data is installed with the package.
library(rmetalog)
data("fishSize")
summary(fishSize)
#>     FishSize
#>  Min.   : 3.00
#>  1st Qu.: 7.00
#>  Median :10.00
#>  Mean   :10.18
#>  3rd Qu.:12.00
#>  Max.   :33.00
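The base function for the package to create distributions is metalog().
This function takes several inputs, including the data vector and the arguments term_limit, term_lower_bound, bounds, boundedness, and step_len used in the example below.
Here is an example of building a bounded distribution, with a lower bound of 0 and an upper bound of 60.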
my_metalog <- metalog(
  fishSize$FishSize,
  term_limit = 9,
  term_lower_bound = 2,
  bounds = c(0, 60),
  boundedness = 'b',
  step_len = 0.01
)
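For intuition about what boundedness = 'b' with bounds = c(0, 60) implies, here is a small base R sketch of the logit transform that Keelin (2016) uses for bounded metalogs: the data is mapped to the real line, an unbounded metalog is fit there, and quantiles are mapped back into the bounds. That the package does exactly this internally is an assumption. The helper names to_unbounded and to_bounded are hypothetical.

```r
# Map bounded data on (lower, upper) to the whole real line.
to_unbounded <- function(x, lower, upper) {
  stopifnot(all(x > lower & x < upper))
  log((x - lower) / (upper - x))
}

# Map values on the real line back into (lower, upper).
to_bounded <- function(z, lower, upper) {
  lower + (upper - lower) * plogis(z)
}

# Five values taken from the fishSize summary (min, quartiles, max).
x <- c(3, 7, 10, 12, 33)
z <- to_unbounded(x, lower = 0, upper = 60)
x_back <- to_bounded(z, lower = 0, upper = 60)
all.equal(x, x_back)
```

Because the back-transform can never leave the interval, every quantile of the fitted distribution stays between 0 and 60, whatever the coefficients are.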
The function returns an object of class list. You can get a summary of the distributions using summary():
summary(my_metalog)
#>  -----------------------------------------------
#>  Summary of Metalog Distribution Object
#>  -----------------------------------------------
#>
#> Parameters
#>  Term Limit: 9
#>  Term Lower Bound: 2
#>  Boundedness: b
#>  Bounds (only used based on boundedness): 0 60
#>  Step Length for Distribution Summary: 0.01
#>  Method Use for Fitting: any
#>
#>
#>  Validation and Fit Method
#>  term valid method
#>     2   yes    OLS
#>     3   yes    OLS
#>     4   yes    OLS
#>     5   yes    OLS
#>     6   yes    OLS
#>     7   yes    OLS
#>     8   yes    OLS
#>     9   yes    OLS
You can also plot a quick visual comparison of the distributions by term.
[Plot output: $cdf panel comparing the fitted distributions by term]
Once the distributions are built, you can create n samples by selecting a term.
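The sampling call is not shown here. Assuming the sampler follows the same naming pattern as qmetalog, pmetalog, and dmetalog, it would look like rmetalog(my_metalog, n = 1000, term = 9), but treat that signature as an assumption. What sampling from a quantile-parameterized distribution amounts to is inverse-transform sampling, sketched below in base R. The function metalog_quantile and the coefficients in a are hypothetical and are not fitted to fishSize.

```r
set.seed(42)

# Quantile function of a 4-term metalog with coefficients a.
metalog_quantile <- function(y, a) {
  logit <- log(y / (1 - y))
  a[1] + a[2] * logit + a[3] * (y - 0.5) * logit + a[4] * (y - 0.5)
}

a <- c(10, 2, 0.5, 1)               # illustrative values only
u <- runif(1000)                    # uniform draws on (0, 1)
samples <- metalog_quantile(u, a)   # inverse-transform sampling
summary(samples)
```

Because the metalog is defined by its quantile function, sampling needs no numerical inversion: each uniform draw is simply pushed through M(y).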
You can also retrieve quantile, density, and probability values similar to other R distributions.
qmetalog(my_metalog, y = c(0.25, 0.5, 0.75), term = 9)
#> [1]  7.240623  9.840139 12.063061
Probabilities from a quantile:
pmetalog(my_metalog, q = c(3, 10, 25), term = 9)
#> [1] 0.00195673 0.52005826 0.99226703
Density from a quantile:
dmetalog(my_metalog, q = c(3, 10, 25), term = 9)
#> [1] 0.004489508 0.126724357 0.002264396
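The three kinds of values are tied together by the quantile function M(y): a probability is the y that solves M(y) = q, and the density at q is the reciprocal of the derivative M'(y) at that y. The base R sketch below checks this for a 2-term metalog, which coincides with the logistic distribution, so plogis and dlogis serve as references. The names q_sketch, p_sketch, and d_sketch and the coefficients are hypothetical. Whether the package uses the same root-finding approach internally is an assumption.

```r
# Quantile function M(y) and its derivative M'(y) for a 2-term metalog
# with illustrative coefficients (location a1, scale a2).
a1 <- 10
a2 <- 2
q_sketch <- function(y) a1 + a2 * log(y / (1 - y))
dq_dy    <- function(y) a2 / (y * (1 - y))

# Probability from a quantile: solve M(y) = q numerically for y.
p_sketch <- function(q) {
  sapply(q, function(qi) {
    uniroot(function(y) q_sketch(y) - qi,
            interval = c(1e-12, 1 - 1e-12), tol = 1e-12)$root
  })
}

# Density from a quantile: the reciprocal of M'(y) evaluated at y = p(q).
d_sketch <- function(q) 1 / dq_dy(p_sketch(q))

q <- c(3, 10, 25)
cbind(q,
      p = p_sketch(q), p_ref = plogis(q, location = 10, scale = 2),
      d = d_sketch(q), d_ref = dlogis(q, location = 10, scale = 2))
```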
As this package is under development, any feedback is appreciated! Please submit a pull request or issue if you find anything that needs to be addressed.
The first release of the rmetalog package. Functionality demonstrated in the README and Vignette.