Description: A collection of XML/HTML parser benchmarks for Ruby
Clone URL: git://github.com/tenderlove/xml_truth.git
tenderlove (author)
Thu Mar 26 14:43:48 -0700 2009
commit  11fdb16c633695eb5a4f37095d930ff4f1f4951a
tree    4743b0e83643993ce7dafe7efde320dbde4b77a3
parent  1898b07c93a2f573047f1970408e2539e3fd0bb5
name age message
file .autotest Fri Mar 13 16:47:46 -0700 2009 initial checkin [tenderlove]
file .gitignore Fri Mar 13 16:47:46 -0700 2009 initial checkin [tenderlove]
file CHANGELOG.rdoc Fri Mar 13 16:47:46 -0700 2009 initial checkin [tenderlove]
file Manifest.txt Fri Mar 13 16:47:46 -0700 2009 initial checkin [tenderlove]
file README.rdoc Tue Mar 17 22:10:31 -0700 2009 updating document benchmarks [tenderlove]
file Rakefile Fri Mar 13 16:47:46 -0700 2009 initial checkin [tenderlove]
directory bin/ Fri Mar 13 16:47:46 -0700 2009 initial checkin [tenderlove]
file document_parsing.rdoc Tue Mar 17 22:10:31 -0700 2009 updating document benchmarks [tenderlove]
directory lib/ Fri Mar 13 16:47:46 -0700 2009 initial checkin [tenderlove]
directory test/ Thu Mar 26 14:43:48 -0700 2009 adding lots of html [tenderlove]
README.rdoc

XML Truth

This project is a repository of benchmarks comparing features and speed of the current XML/HTML parsing players in the Ruby world.

Results

This test measures the difference in parse times between XML parsers. It was conducted with two XML files: a small one at 18k and a large one at 7.0M. In the small-document runs, N is adjusted so that approximately the same amount of XML is run through each system.

These tests were conducted with:

 * stock ruby 1.8.6 on OS X 10.5
 * libxml2 version 2.7.3
 * Hpricot version 0.7 (Hpricot does not have a VERSION constant)
 * N is the number of iterations in each test
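
The adjustment of N described above can be sketched roughly as follows. This is only an illustration, not the project's actual benchmark code: the helper names iterations_for and time_parse are hypothetical, "18k" and "7.0M" are read as KiB and MiB (an assumption), and the tag-counting lambda is a placeholder standing in for a real parser call.

```ruby
require 'benchmark'

# Hypothetical helper: how many times to parse the small document so the
# total bytes parsed roughly match one pass over the large document.
def iterations_for(small_bytes, large_bytes)
  [(large_bytes.to_f / small_bytes).round, 1].max
end

# Hypothetical helper: time n calls of the given block on the same input.
def time_parse(xml, n)
  Benchmark.realtime { n.times { yield xml } }
end

small_bytes = 18 * 1024        # "18k" read as KiB (assumption)
large_bytes = 7 * 1024 * 1024  # "7.0M" read as MiB (assumption)
n = iterations_for(small_bytes, large_bytes)

# Placeholder input and "parser"; swap in a real document and a real
# parser call to get meaningful numbers.
small_xml  = "<root>" + "<item>x</item>" * 100 + "</root>"
count_tags = lambda { |xml| xml.scan(/<[^>]+>/).size }

elapsed = time_parse(small_xml, n) { |xml| count_tags.call(xml) }
puts "N=#{n}, elapsed=#{'%.4f' % elapsed}s"
```

Under these assumptions N comes out at about 398 for the small document, against a single pass over the large one.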