Improve performance for large files #56

HansWeltar · 2014-03-27T09:22:27Z

xmltodict becomes slow when you have an XML file with large texts.
Enabling the parser.buffer_text option dramatically increases performance.
Benchmark below. Unittests still work.

Code used to benchmark:

import xmltodict
import time
xml = "<root>" + ("a"*70+"\n")*10000 + "</root>"
s=time.time()
x=xmltodict.parse(xml)
print(time.time() - s)

#19.9860811234 seconds without buffer_text
#0.059289932251 seconds with buffer_text

So 300 times faster

xmltodict becomes slow when you have an XML file with large texts. Enabling the parser.buffer_text option dramatically increases performance. Code used to benchmark: import xmltodict import time xml = "<root>" + ("a"*70+"\n")*10000 + "</root>" s=time.time() x=xmltodict.parse(xml) print(time.time() - s) # 19.9860811234 seconds without buffer_text # 0.059289932251 seconds with buffer_text # So 300 times faster

martinblech · 2014-03-27T11:17:56Z

That's awesome, thank you for this contribution!

Improve performance for large files

- Add TEST_DEPENDS in preparation of automatic test infrastructure Changelog: * Improve performance for large files [1] [1] martinblech/xmltodict#56 PR: 188252 Submitted by: myself Approved by: koobs@ (mentor)

martinblech added a commit that referenced this pull request Mar 27, 2014

Merge pull request #56 from HansWeltar/master

861e50c

Improve performance for large files

martinblech merged commit 861e50c into martinblech:master Mar 27, 2014

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve performance for large files #56

Improve performance for large files #56

HansWeltar commented Mar 27, 2014

martinblech commented Mar 27, 2014

Improve performance for large files #56

Improve performance for large files #56

Conversation

HansWeltar commented Mar 27, 2014

So 300 times faster

martinblech commented Mar 27, 2014