Tools for Reading Formatted Access Log Files

R is used by a vast array of people for a vast array of purposes - including web analytics. This package contains functions for consuming and munging various common forms of request log, including the Common and Combined Web Log formats and various Amazon access logs.


Author: Oliver Keyes
License: MIT
Status: Stable

downloads

webreadr provides utilities for reading access log data in R. It is capable of handling logs in Squid, Apache, Varnish, NGINX or AWS's usual formats, and is based around Hadley Wickham's readr package for maximum speed. See the vignette for more information, and if you have requests for additional features, open an issue.

Please note that this project is released with a Contributor Code of Conduct. By participating in this project you agree to abide by its terms.

Installation

For the released version:

install.packages("webreadr")

For the development version:

library(devtools)
install_github("ironholds/webreadr")

Dependencies

News

Version 0.4.0

NEW FEATURES

  • Amazon S3 bucket access logs are now supported with read_s3(). Thanks to Hadley Wickham for the feature request.

BUG FIXES

  • For consistency purposes, date-times will be POSIXct rather than occasionally POSIXlt

DEVELOPMENT

  • Splitters are now approximately 40-60% faster and can better handle corrupted or strange fields.

Version 0.3.1

*Internal tweaks matching the updated readr package.

Version 0.3.0

*IP normalisation and conversion removed; use iptools *URL decoding removed; use urltools. *R user agent parsing removed; use uaparser

Version 0.2.0

*Add an AWS log file reader, read_aws, at Hadley's suggestion. *Convert log readers to use readr's read_log, eliminating some complexity and R-side parsing Documentation around split_ improved to make clear what happens with non-intact requests. *Add a prototype R user agent parser. *Add ip_to_numeric to convert human-readable IPv4s to numeric values.

Version 0.1.0

*Initial release

Reference manual

It appears you don't have a PDF plugin for this browser. You can click here to download the reference manual.

install.packages("webreadr")

0.4.0 by Oliver Keyes, 3 years ago


https://github.com/Ironholds/webreadr


Report a bug at https://github.com/Ironholds/webreadr/issues


Browse source code at https://github.com/cran/webreadr


Authors: Oliver Keyes


Documentation:   PDF Manual  


Task views: Web Technologies and Services


MIT + file LICENSE license


Imports Rcpp, readr

Suggests iptools, urltools, rgeolocate, knitr, testthat

Linking to Rcpp


See at CRAN