Skip to content
Benchmarking Python XML parsing libraries.
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Type Name Latest commit message Commit time
Failed to load latest commit information.

This program benchmarks a few XML parsing libraries available to Python programmers. In particular, it compares lxml with some of the modules in the xml package in Python standard library.

You need to pass at least one XML file as test input:

$ ./ 1.xml 2.xml 3.xml

Sample final results:

   xml.dom.minidom: 7.49 MBps
   lxml.etree: 89.63 MBps
   xml.etree.ElementTree.iterparse: 31.77 MBps
   xml.etree.ElementTree: 58.43 MBps
   xml.sax: 25.68 MBps

From what I've seen so far, lxml is superior to all other options by a wide margin, will minidom is the slowest option by far.

This is not a very scientific benchmark, so take the results with a grain of salt.

You can’t perform that action at this time.
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session.