tenderlove / xml_truth

A collection of XML/HTML parser benchmarks for Ruby

This URL has Read+Write access

tenderlove (author)
Mon Mar 23 14:21:59 -0700 2009
commit  1898b07c93a2f573047f1970408e2539e3fd0bb5
tree    4d8871f7a4f9a26cda7a77112b0cb6fb6e118cbb
parent  af844a2c997e8d28118a85c91544ba67c2e4ec82
name age message
file .autotest Fri Mar 13 16:47:46 -0700 2009 initial checkin [tenderlove]
file .gitignore Fri Mar 13 16:47:46 -0700 2009 initial checkin [tenderlove]
file CHANGELOG.rdoc Fri Mar 13 16:47:46 -0700 2009 initial checkin [tenderlove]
file Manifest.txt Fri Mar 13 16:47:46 -0700 2009 initial checkin [tenderlove]
file README.rdoc Tue Mar 17 22:10:31 -0700 2009 updating document benchmarks [tenderlove]
file Rakefile Fri Mar 13 16:47:46 -0700 2009 initial checkin [tenderlove]
directory bin/ Fri Mar 13 16:47:46 -0700 2009 initial checkin [tenderlove]
file document_parsing.rdoc Tue Mar 17 22:10:31 -0700 2009 updating document benchmarks [tenderlove]
directory lib/ Fri Mar 13 16:47:46 -0700 2009 initial checkin [tenderlove]
directory test/ Loading commit data...
README.rdoc

XML Truth

This project is a repository of benchmarks comparing features and speed of the current XML/HTML parsing players in the Ruby world.

Results

This test is for measuring the difference between parse times in different XML parsers. The test was conducted with two XML files, one small xml file at 18k, and one large XML file at 7.0M. N is adjusted in the small document parse runs to make sure that the amount of xml run through each system was approximately the same.

These tests were conducted with:

 * stock ruby 1.8.6 on OS X 10.5
 * libxml2 version 2.7.3
 * Hpricot version 0.7 (hpricot does not have a VERSION constant)
 * N being the number of iterations in each test