weppos edited this page Jan 5, 2011 · 4 revisions
Clone this wiki locally

Apache Log Regex is designed to be a simple class to parse Apache log files. It provides functionalities to parse a line from an Apache log file into a hash.

This library is a Ruby port of Peter Hickman’s Apache::LogRegex 1.4 Perl module, available at http://cpan.uwinnipeg.ca/~peterhi/Apache-LogRegex.


  • Ruby >= 1.8.6 (not tested with previous versions)

ApacheLogRegex is compatible with Ruby 1.9.1.


Apache Log Regex is designed to be a simple class to parse Apache log files.

It takes an Apache logging format and generates a regular expression which is used to parse a line from a log file and returns a Hash with keys corresponding to the fields defined in the log format.
The log format should match the one defined in your Apache configuration file with the LogFormat directive.

Example Usage

The following one is the most simple example usage.
It tries to parse the `access.log` file and echoes each parsed line.

format = '%h %l %u %t \"%r\" %>s %b \"%{Referer}i\" \"%{User-Agent}i\"'
parser = ApacheLogRegex.new(format)

File.foreach('/var/apache/access.log') do |line|
  # {"%r"=>"GET /blog/index.xml HTTP/1.1", "%h"=>"", ... }

More often, you might want to collect parsed lines and use them later in your program.
The following example iterates all log lines, parses them and returns an array of Hash with the results.

format = '%h %l %u %t \"%r\" %>s %b \"%{Referer}i\" \"%{User-Agent}i\"'
parser = ApacheLogRegex.new(format)

File.readlines('/var/apache/access.log').collect do |line|
  # {"%r"=>"GET /blog/index.xml HTTP/1.1", "%h"=>"", ... }

If you want more control over the parser you can use the parse! method.
It raises a ParseError if given line doesn’t match the log format.

common_log_format = '%h %l %u %t \"%r\" %>s %b'
parser = ApacheLogRegex.new(common_log_format)

# No exception
parser.parse(line) # => nil

# Raises an exception
parser.parse!(line) # => ParseError


This library is essentially a Ruby port of Apache::LogRegex Perl library.
A big thanks goes to Peter Hickman, the author of the original package. This Ruby GEM would probably not exists without your contribution!

I also want to thank you Harry Fuecks and Hamish Morgan, the authors of the Python and PHP ports. First of all they gave me the inspiration to translate the original Perl code to Ruby, as soon as I discovered nobody did it before me. Last but not least, their library have been extremely useful to help me understand some obscure Perl statements from the original package.




See the CHANGELOG.rdoc file for details.


Copyright © 2008-2009 Simone Carletti, ApacheLogRegex is released under the MIT license.