R is used by a vast array of people for a vast array of purposes - including web analytics. This package contains functions for consuming and munging various common forms of request log, including the Common and Combined Web Log formats and various Amazon access logs.
Author: Oliver Keyes
webreadr provides utilities for reading access log data in R. It is capable of handling logs in Squid, Apache, Varnish, NGINX or AWS's usual formats, and is based around Hadley Wickham's readr package for maximum speed. See the vignette for more information, and if you have requests for additional features, open an issue.
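As a rough illustration of reading a log, the sketch below writes one sample request in the Combined Log Format to a temporary file and parses it with read_combined. The log line is illustrative sample data, not taken from this package, and the file name is a throwaway temporary path.

```r
library(webreadr)

# One illustrative request in the Combined Log Format (sample data only).
log_line <- paste0(
  '127.0.0.1 - frank [10/Oct/2000:13:55:36 -0700] ',
  '"GET /apache_pb.gif HTTP/1.0" 200 2326 ',
  '"http://www.example.com/start.html" "Mozilla/4.08 [en] (Win98; I ;Nav)"'
)

# Write the sample line to a temporary file so there is something to read.
log_file <- tempfile(fileext = ".log")
writeLines(log_line, log_file)

# Parse the file into a data frame with one row per request.
logs <- read_combined(log_file)
str(logs)
```

Logs in the other supported formats would be read the same way with the matching reader, for example read_aws for AWS logs.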
Please note that this project is released with a Contributor Code of Conduct. By participating in this project you agree to abide by its terms.
For the released version:
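Assuming the release is published on CRAN under the package name webreadr, installation would normally look like this:

```r
# Install the released version from CRAN (assumes the package is available there).
install.packages("webreadr")
```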
For the development version:
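A sketch, assuming the development sources are hosted in the author's GitHub repository at ironholds/webreadr (a path not stated in this text) and that the devtools package is installed:

```r
# Assumed repository path; adjust it if the sources are hosted elsewhere.
devtools::install_github("ironholds/webreadr")
```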
* Internal tweaks matching the updated readr package.

* IP normalisation and conversion removed; use iptools.
* URL decoding removed; use urltools.
* R user agent parsing removed; use uaparser.

* Add an AWS log file reader, read_aws, at Hadley's suggestion.
* Convert log readers to use readr's read_log, eliminating some complexity and R-side parsing.
* Documentation around split_ improved to make clear what happens with non-intact requests.
* Add a prototype R user agent parser.
* Add ip_to_numeric to convert human-readable IPv4s to numeric values.