Skip to content
This repository


Subversion checkout URL

You can clone with HTTPS or Subversion.

Download ZIP

A collection of XML/HTML parser benchmarks for Ruby

branch: master

Fetching latest commit…


Cannot retrieve the latest commit at this time

Octocat-spinner-32 bin
Octocat-spinner-32 lib
Octocat-spinner-32 test
Octocat-spinner-32 .autotest
Octocat-spinner-32 .gitignore
Octocat-spinner-32 CHANGELOG.rdoc
Octocat-spinner-32 Manifest.txt
Octocat-spinner-32 README.rdoc
Octocat-spinner-32 Rakefile
Octocat-spinner-32 document_parsing.rdoc

XML Truth

This project is a repository of benchmarks comparing features and speed of the current XML/HTML parsing players in the Ruby world.


This test is for measuring the difference between parse times in different XML parsers. The test was conducted with two XML files, one small xml file at 18k, and one large XML file at 7.0M. N is adjusted in the small document parse runs to make sure that the amount of xml run through each system was approximately the same.

These tests were conducted with:

* stock ruby 1.8.6 on OS X 10.5
* libxml2 version 2.7.3
* Hpricot version 0.7 (hpricot does not have a VERSION constant)
* N being the number of iterations in each test
Something went wrong with that request. Please try again.