The 'HistData' package provides a collection of small data sets that are interesting and important in the history of statistics and data visualization. The goal of the package is to make these available, both for instructional use and for historical research. Some of these present interesting challenges for graphics or analysis in R.
Data Sets from the History of Statistics and Data Visualization
The HistData
package provides a collection of small data sets
that are interesting and important in the history of statistics and data
visualization. The goal of the package is to make these available, both for
instructional use and for historical research. Some of these present interesting
challenges for graphics or analysis in R.
Some of the data sets have examples which reproduce an historical graph or analysis. These are meant mainly as starters for more extensive re-analysis or graphical elaboration.
They are part of a program of research called statistical historiography (Friendly, 2007; Friendly & Denis, 2001; Friendly et-al, 2016) meaning the use of statistical methods to study problems and questions in the history of statistics and graphics. They are also used in a forthcoming book on the history of data visualization (Friendly & Wainer, 2018).
Friendly, M. (2007). A Brief History of Data Visualization.
In Chen, C., Hardle, W. & Unwin, A. (eds.)
Handbook of Computational Statistics: Data Visualization, Springer-Verlag, III, Ch. 1, 1-34.
preprint
Friendly, M. & Denis, D. (2001). Milestones in the history of thematic cartography, statistical graphics, and data visualization. http://datavis.ca/milestones/
Friendly, M. & Sigal, M. & Harnanansingh, D. (2016).
"The Milestones Project: A Database for the History of Data Visualization,"
In Kostelnick, C. & Kimball, M. (ed.), Visible Numbers: The History of Data Visualization, Ashgate Press, Chapter 10.
Friendly, M. & Wainer, H. (2018). The Origin of Graphical Species. Harvard University Press.
Version 0.8-4 (2018-04-04)
Version 0.8-3 (2017-10-01)
Version 0.8-2 (2017-02-10)
Version 0.8-1 (2016-12-22)
Version 0.8-0 (2016-12-07)
Version 0.7-8 (2016-04-13)
Fix web links again
Version 0.7-7 (2016-04-09)
Added vignettes/Snow_deaths and data/Snow.deaths2 [Thx: Peter Li] Fixed web links to make curl happy
Version 0.7-6 (2015-10-13)
Added Armada data [thx: Antoine de Falguerolles]
Version 0.7-4 (2013-10-29)
Added reference to agridat package
Fixed Nightingale.Rd examples to avoid obscuring some segments by overplotting [thx: Monique Graf].
Version 0.7-3 (2013-08-20)
Cosmetic: fix long lines in .Rd files
Version 0.7-2
Extended Nightingale examples
Version 0.7-1
CushnyPebbles renamed to proper spelling, CushnyPeebles (thx: Ian E. Holliday)
Removed Suggests: ReadImages; for now, retain old code examples as \dontrun{}
Extended GaltonFamilies examples
Version 0.7-0
Revised Langren.Rd examples to replace ReadImage with jpeg
Version 0.6-14 (2012-12-01)
Extended examples for Playfair Wheat
Replaced original data(Wheat) on 26 intervals with a new version at 53 time points
and a separate data.frame, Wheat.monarchs giving the reigns. Updated examples to
more closely approximate Playfair's original and a revision.
Version 0.6-13 (2011-11-03)
Added GaltonFamilies data
Fixed deprecated warnings from ggplot2
Version 0.6-12 (2011-10-12)
Added CushnyPebbles data
Extended examples for Michelson and CushnyPebbles
Fixed Suggests: for all packages used in examples
Added ZeaMays data from Darwin, used by Fisher for an early non-parametric test
Illustrated permutation and non-parametric tests for ZeaMays data
Added examples for Macdonell data showing contour plots and simulations [thx: Jim Hanley]
Added frequency-expanded version, MacdonellRD
Now Suggest: lattice
Added plotting examples to PearsonLee
Version 0.6-11 (2011-10-05)
Added ChestSizes data - Quetelet's data on chest measurements of 5738 Scottish Militiamen
Added PolioTrials data - two experiments on the Salk polio vaccine
Added Cavendish data - density of the earth
Added Michelson data - speed of light experiments
Version 0.6-10 (2010-12-22)
Additional variables added to Langren1644
Version 0.6-9 (2010-07-27)
\dontrun{} portion of example(Snow) using sp, due to error from sp::Lines()
Minor documentation fix for car_2
Version 0.6-8 (2010-03-19)
Added links to data sets on main package page
Version 0.6-7 (2010-02-28)
Switched from inst/ChangeLog to NEWS
Corrected typo in Yeast.Rd
Version 0.6-6 (2010-01-26) Added Quarrels data
Version 0.6-5 (2009-11-13) Added Dactyl data Added Pyx data Fixed one-off error creating Night1, Night2 Added simple Dactyl examples; minor updates to other examples Extended example(Jevons) Added Fingerprints data
Version 0.6-4 (2009-11-11) Initial release to CRAN
Version 0.6-3 (2009-11-08) Nightingale$Month made ordered factor Added Nightingale examples
Version 0.6-1 (2009-11-06) Added Yeast example Fixed latin1 characters in Langren1644 Extended Langren example
Version 0.6 (2009-11-05) Added Yeast data
Version 0.5 (2009-11-03) Added Macdonell data Added Langren/ggplot2 example Added OldMaps data
Version 0.4 (2009-11-02) Added Nightingale data
Version 0.3 (2009-10-27) Added Snow data Added Langren data Added inst/images/google-toledo-rome3.jpg
Version 0.2 (2009-10-23) Added Minard data Added Bowley data
Version 0.1 (2009-10-20)
Initial version uploaded to R-Forge