
Performance regression #7

Closed · jacobat opened this issue May 13, 2012 · 58 comments


jacobat commented May 13, 2012

Parsing https://www.e-conomic.com/secure/api1/EconomicWebservice.asmx?WSDL with Wasabi 1.0 takes ~0.3 sec:

require 'rubygems'
gem 'wasabi', '1.0.0'
require 'wasabi'
require 'benchmark'

wsdl = Wasabi.document File.read('economic.wsdl'); nil
Benchmark.measure do wsdl.endpoint end
 => #<Benchmark::Tms:0x10123ef90 @total=0.33, @cutime=0.0, @label="", @stime=0.01, @real=0.327524900436401, @utime=0.32, @cstime=0.0>

The same with 2.1.0 takes ~7sec:

require 'rubygems'
gem 'wasabi', '2.1.0'
require 'wasabi'
require 'benchmark'

wsdl = Wasabi.document File.read('economic.wsdl'); nil
Benchmark.measure do wsdl.endpoint end
 => #<Benchmark::Tms:0x1046a2a98 @label="", @stime=0.03, @real=6.99103879928589, @utime=6.97, @cstime=0.0, @total=7.0, @cutime=0.0>
jacobat commented May 13, 2012

The regression was introduced with b1e7691

jacobat commented May 13, 2012

It seems the issue is with Nokogiri's collect_namespaces method. Apparently using this method is a bad idea, per the Nokogiri documentation at http://nokogiri.org/Nokogiri/XML/Document.html#method-i-collect_namespaces :

Note this is a very expensive operation in current implementation, as it traverses the entire graph, and also has to bring each node across the libxml bridge into a ruby object.

Any ideas how to improve the situation?

/cc @jkingdon

jacobat commented May 13, 2012

A simple patch is https://github.com/jacobat/wasabi/commit/90c0339124dfb4a1759294fa748986688f4fca96 - not sure if it is acceptable though.

@jkingdon

I don't have any particular ability to answer for all WSDL files out there whether the namespaces must be on the root node, nor did I notice such a restriction in a quick glance at http://www.w3.org/TR/wsdl, but I would think this patch is worth a try. My guess would be that namespaces on the root node would be the most common practice. But perhaps someone knows better than I on such matters.

jacobat commented May 16, 2012

I'm not sure what the best approach is. Perhaps it would be better if Wasabi was somehow able to cache the results of its parsing. I have no idea how much data is actually extracted from the WSDL by Wasabi, but it seems wasteful to parse the whole WSDL, like Savon does, every time a new client instance is created.
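One shape such caching could take, sketched in plain stdlib Ruby (everything here is hypothetical: `expensive_parse` stands in for Wasabi's real parser, and the cache layout is invented for illustration):

```ruby
require 'digest'
require 'tmpdir'

# Stand-in for Wasabi's actual parser; only here so the sketch runs.
def expensive_parse(raw)
  { endpoint: "https://example.com/service", size: raw.bytesize }
end

# Cache the parsed result on disk, keyed by a digest of the raw WSDL,
# so creating another client for the same document skips the parse.
def cached_parse(raw_wsdl, cache_dir: Dir.tmpdir)
  path = File.join(cache_dir, "wsdl-#{Digest::SHA256.hexdigest(raw_wsdl)}.marshal")

  if File.exist?(path)
    Marshal.load(File.binread(path))
  else
    result = expensive_parse(raw_wsdl)
    File.binwrite(path, Marshal.dump(result))
    result
  end
end

wsdl   = "<definitions/>"
first  = cached_parse(wsdl) # parses and writes the cache file
second = cached_parse(wsdl) # loaded from the marshalled cache
```

This only works if the parser's output is plain data (hashes, arrays, strings) rather than live Nokogiri nodes, which is one argument for separating parsing from interpretation.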

rubiii commented May 18, 2012

interesting find. thanks for reporting! wasabi should cache every expensive parse-operation,
but collect_namespaces just doesn't seem to be the right solution here.

rubiii added a commit that referenced this issue May 18, 2012
this solved issue #7 and should probably work for most services.
rubiii commented May 18, 2012

published v2.1.1 to fix this problem.

@rubiii rubiii closed this as completed May 18, 2012
koppen commented Jun 10, 2012

This seems to be an issue still (actually, it seems to have gotten worse). Here are the numbers I get from running the above code in Ruby 1.8.7 on different versions of Wasabi:

1.0.0: #<Benchmark::Tms:0x103ae96e8 @real=0.494114875793457, @utime=0.46, @cstime=0.0, @total=0.49, @cutime=0.0, @label="", @stime=0.03>
2.1.0: #<Benchmark::Tms:0x104882110 @real=8.68305802345276, @cutime=0.0, @label="", @total=8.69, @stime=0.05, @utime=8.64, @cstime=0.0>
2.4.0: #<Benchmark::Tms:0x101872740 @real=33.4192819595337, @stime=0.07, @utime=33.35, @total=33.42, @cstime=0.0, @cutime=0.0, @label="">

Numbers from Ruby 1.9.3 are almost as bad:

1.0.0:  0.430000   0.030000   0.460000 (  0.462021)
2.1.0:  7.070000   0.090000   7.160000 (  7.166233)
2.4.0: 28.290000   0.140000  28.430000 ( 28.432920)

koppen commented Jun 10, 2012

Looks like 583cf65 introduced the 30ish-second parse time, and it doesn't use collect_namespaces, so this seems to be a new performance regression, not the one originally fixed.

@rubiii rubiii reopened this Jun 10, 2012
rubiii commented Jun 10, 2012

@hoverlover would you mind looking into this?

@hoverlover

Sure, I can take a look.

@ghost ghost assigned hoverlover Jun 12, 2012
@trevorhiley

Any update on this one?

@hoverlover

Sorry, I haven't had a moment to dig into this yet. Will try to look at it this weekend, but I'm going on vacation next week so it might be a little while.

rubiii commented Jul 2, 2012

i looked into it. it's damn slow, but it's doing actual work. so i'm not sure how to optimize it right now.
but it's a pretty significant regression, so we need to do something about it.

rubiii commented Jul 2, 2012

since this change parses almost the entire document, it might make sense to switch back to a sax parser.

@hoverlover

I'm sure a SAX parser would help, since it doesn't hold the entire document in memory. Not sure how it would affect speed. It would be worth a shot to create a test branch.

rubiii commented Jul 2, 2012

looking into it ...

jkingdon commented Jul 3, 2012

Didn't do any benchmarking when I switched us from SAX to Nokogiri, but Nokogiri is pretty fast so my first instinct is that it is usually going to be our code which is the bottleneck rather than parsing.

If we do want SAX, I think we want something like saxophone ( http://jkingdon2000.blogspot.com/2008/05/parse-xml-with-saxophone.html ). Trying to hack up the previous SAX parser to know about namespaces (and various other features of the WSDL which were needed at that time) was seriously painful.

rubiii commented Jul 3, 2012

@jkingdon i think you're right. i played with it for a couple hours and can see this getting messy real quick.
i'll take another look at the code and try to find the exact problem.

koppen commented Jul 4, 2012

I can't say I entirely understand what the change actually does, but as far as I can tell, I haven't had a need for it in my projects - thus I would be perfectly happy not having to incur the parse time penalty (admittedly, I only use Savon for one endpoint/WSDL).

If it turns out to be too cumbersome/impossible to optimize this, we could perhaps consider making it an option?

rubiii commented Jul 19, 2012

not every change to the parser affects every wsdl. reverting this change doesn't seem to be an option.

i started working on a sax parser which is actually pretty readable: fc27d4a

keeping state is kind of ugly, but i'm sure this can be improved somehow.
the impl. has only been tested against one wsdl and it currently ignores xs schema types.

the idea here is to parse the wsdl into a format that can be interpreted by some other class.
the benefit of this approach, is that the parser stays as slim as possible and the output can
be dumped to a yaml file, marshalled and stored in redis or what have you. so apart from
only reading the wsdl once, the parsed data can actually be cached as well.

my next step would be to test against other wsdl files and then build something that
interprets the data to make this pass the existing specs.

rubiii commented Jul 19, 2012

one other feature i was thinking about to improve performance, is to somehow remove conditionals from the
giant case statement when they're not needed anymore. i don't know if that would make a huge difference
and maybe the implementation for this would be way too complicated (procs?), but it might be worth a try.

rubiii commented Jul 19, 2012

fyi: parsing the original wsdl takes about 5.4 sec on my mba. i expect that number to double since the current impl.
doesn't parse wsdl:message nodes and wsdl:types yet.

@jkingdon

What you've built on SAX (the stack, Wasabi::Matcher, etc.) strikes me as the kind of thing that lets one use SAX without turning the code into spaghetti (not drastically different from what I did in saxophone, which I think I mentioned to you). It reads nicely.

As for how to make all that matching performant, the holy grail I suppose is compiling the patterns to some kind of state machine or the like. But perhaps something easier, like a fast-path expression which recognized many of the nodes which weren't going to match anything, would get much of the benefit for less work. Even just keeping the code as it is and reordering the match statements to put the ones which match most often first might make a significant difference.


rubiii commented Jul 20, 2012

saxophone looks nice and simple, but i would like to avoid pulling in another dependency for something this easy.
i simplified and memoized the matchers and got the time down from 5.4 sec to 1.3 sec.

jacobat commented Jul 20, 2012

If you need another big wsdl to test against you can try ExactTargets at: https://webservice.exacttarget.com/etframework.wsdl

rubiii commented Apr 29, 2013

ok, so ... i have lots of things to tell you about this whole parser-thing, but right now, i can only give you a quick update.
i changed the current parser to benefit from caching and replaced a lot of very expensive xpath searches with walking the dom tree. sounds simple and in fact it just took some benchmarking and finding out how to avoid the slow parts of nokogiri.

when i started this, parsing the economic wsdl took about 34 seconds on my machine. just ridiculous.
after these changes, it's down to 0.78 seconds!

here's the output from method_profiler before and after the refactoring:
https://gist.github.com/rubiii/5484236

i'm quite satisfied with this, but i'm sure it could be even better. sadly, the code just sucks!

rubiii commented Apr 29, 2013

if you could maybe point your gemfile to github master or otherwise test these changes, i'd really appreciate it!

henrik commented Apr 29, 2013

Ooh, this sounds awesome. Will give it a go tomorrow at work!


henrik commented Apr 30, 2013

@rubiii We're currently tied to Savon 0.9.5 because of another library that depends on that.

Seems we can't update just Wasabi on its own, because Savon 0.9.5 wants httpi (~> 0.9) and latest Wasabi wants httpi (~> 2.0).

Do you know off the top of your head if httpi 0.9 to 2.0 has any major breaking changes?

rubiii commented Apr 30, 2013

@henrik not quite sure if the new httpi works with the old savon. it could be that it depends on something that changed, but i would give it a try. is anything blocking you from updating to savon 2.0?

rubiii commented Apr 30, 2013

@henrik here's the changelog.

henrik commented Apr 30, 2013

Thanks! Oh yeah, the cookie stuff.

We could and should upgrade, but I don't think it'd be fast or painless. I could try this out on another smaller project, though.

koppen commented Apr 30, 2013

@rubiii this sounds awesome, will check it out as soon as possible. Hopefully this means rconomic can finally upgrade to a newer savonrb.

rubiii commented Apr 30, 2013

@koppen i would hope so as well. please let me know if there's anything i can do to help.

henrik commented Apr 30, 2013

Alright, tested it with my tiny economic_api script where I'm experimenting with a low-level API wrapper.

On my Air:

With latest Wasabi (3.1.0 / 9d190ed) and Savon (2.2.0 / 6a82e9b988f962f68e22979afbf41c31455867c6) and a test script that logs in and runs two requests, the best time I get is around 7 seconds all in all.

With my marshalling patch (which is definitely not production ready), the best times I get are just over 2 seconds.

Without either fix, the total run time is almost 40 seconds.

So it seems to work fine with the requests I tried, and it's a huge speedup. Looking forward to getting to a point where we can use it.

rubiii commented Apr 30, 2013

@henrik thanks, cool. are you doing anything fancy in your test script?
because for me the time for parser.operations is down to 0.7s (and counting).

henrik commented Apr 30, 2013

Nothing very fancy: https://gist.github.com/henrik/5488207

Not runnable without an e-conomic account, of course.

henrik commented Apr 30, 2013

@rubiii Oh yeah, the retrieving from URL bit might be it. The marshalled version will of course only do that once, whereas it will probably do it every time with your patch, but parse it fast after. Let me check that…

henrik commented Apr 30, 2013

With a local WSDL, the best times I get now are around 3–3.5 seconds. Just a few runs, no proper benchmark.

With the marshalling instead, it's just under 2.5 seconds. So the difference is a lot smaller. And considering that the marshalling probably has a ton of issues, that's very promising.

rubiii commented Apr 30, 2013

@henrik i'm working on cleaning up the type parsing right now, so that may or may not bring some performance improvements as well. there's a lot of code to improve 😉 i'm not against caching, but it comes with a few new problems and i think we can actually get this thing even faster.

henrik commented Apr 30, 2013

What you have is plenty fast enough for me not to bother with the fragile marshalling stuff (and good riddance).

Shaving off even more time would of course be great, but I'm very happy with what you have now :)


rubiii commented Apr 30, 2013

@henrik cool 😄

koppen commented May 1, 2013

This is looking really good, thanks so much @rubiii. A preliminary test with Savon 2.2, running a spec from R-conomic:

  • Wasabi from gems: Finished in 2 minutes 37.12 seconds
  • Wasabi from master: Finished in 7.27 seconds

This brings us into usable territory, let the upgrading commence.

rubiii commented May 1, 2013

started to clean up schema type parsing and pushed the first step to master.
the accidental benefit of this is that now every single schema element is parsed lazily.
if you're going to test this, please let me know how this affects your test scenarios!

the code still merges elements and complex types, and it ignores simple types and various other element types like group and choice, which needs to be fixed, but it should be no different from before. it also exposes the same interface through the document class, so that it still works with savon.

next step: improve the interface (document class), because it's hard to consume and the code that uses it is the worst code inside savon right now. it's also pretty slow for documents with many elements, because we're iterating over all the types at least 4 times until we have the final xml. i think i can cut that down to one iteration total.

rubiii commented May 1, 2013

just noticed, that the "lazy parsing" probably doesn't speed up the overall time to request just yet,
because the code around it doesn't take advantage of this. it could even slow down your tests a bit.

rubiii commented May 11, 2013

summed up everything i learned about this and pushed a lot of new code to master.
it's basically a complete rewrite, but i finally managed to do it incrementally.

i updated the changelog and the readme to reflect the changes. i had to change the public interface, because it did not match the real world, so i need to adjust savon and throw out a lot of bad code.

not done here, but the new code already supports wsdl imports and xml schema imports are next on my todo list.
i would like to see as many people as possible try this with their wsdl documents, so that i can fix any major problems as early as possible.

this needs a lot more documentation and proper specs, but i'm very happy about it!

ps. codeclimate is awesome :)

@rubiii rubiii mentioned this issue May 11, 2013
This was referenced Jul 20, 2013
@tjarratt tjarratt mentioned this issue Apr 22, 2014
@jacobat jacobat closed this as completed Apr 13, 2020