(PUP-3436) Optimize CPU intensitive Compiler methods #3155

thallgren · 2014-10-03T15:26:22Z

This commit contains some performance optimizations in methods
that showed up as very time consuming in the profiler when running
the rake following rake tests:

benchmark:many_modules:run
benchmark:defined_types4:run
benchmark:defined_types:run

CPU consumption with this patch in place is between 70% - 75% of
what it used to be.

Profiling was performed using stackprof
https://github.com/tmm1/stackprof

hlindberg · 2014-10-03T15:39:09Z

lib/puppet/util/tagging.rb

@@ -1,7 +1,7 @@
 require 'puppet/util/tag_set'

 module Puppet::Util::Tagging
-  ValidTagRegex = /^\w[-\w:.]*$/
+  ValidTagRegex = /^[0-9A-Za-z_][0-9A-Za-z_:.-]*$/


Why was this change made? Is it faster? The two regexps looks equivalent to me...

It's significantly faster.

Wow, that is surprising! I think \w is used in the parser too. Will see if this optimization can be applies there as well. Thanks.

hlindberg · 2014-10-03T15:54:49Z

Did you measure for Ruby 1.9 as well?

thallgren · 2014-10-03T16:31:30Z

No, not possible since the profiler relies on features introduced in 2.1

kylog · 2014-10-03T17:58:15Z

With 1.9 going EOL in Feb 2015, targeting 2.1 makes sense to me.

puppetcla · 2014-10-03T18:00:18Z

CLA signed by all contributors.

jpartlow · 2014-10-03T18:14:30Z

lib/puppet/graph/simple_graph.rb

-    @in_to[   e.target][e.source] ||= []; @in_to[   e.target][e.source] |= [e]
-    @out_from[e.source][e.target] ||= []; @out_from[e.source][e.target] |= [e]
+    # Avoid multiple lookups here. This code is performance critical
+    (@in_to[e.target][e.source] ||= []) << e


These aren't equivalent, the first using |= will prevent duplicate instances of 'e' from appearing in the array.

ok, so three alternatives to maintain uniqueness:

Go back to using the |= operator and force an unnecessary creation of an extra array instance (well, using a variable it would be once instead of twice).

Use a Set instead of an Array.

Test using array.include? before adding.

My preference is 3. Thoughts?

Is it the second lookup of @foo[e.target][e.source] that is expensive? The creation of an [e] array or the additional calls to e.target and e.source that is expensive?

Trying 3. sounds reasonable, but I don't know if an include? is faster, would be interesting to see.

How expensive is something like:
(@in_to[e.target][e.source] ||= []) |= [e]
compared to
into = @in_to[e.target][e.source] ||= []
into << e if !into.include?(e)

I'll measure it. But in my mind the former is more expensive since it must do what the latter does anyway. In addition it must:

create an array

add an element to the array

iterate over the array using an index

That makes sense.

thallgren · 2014-10-04T12:54:31Z

Added a new optimization to avoid a lot of unnecessary execution when creating new resources as copies of existing ones. With that in place, the three aforementioned tests execute almost twice as fast as they do without this PR.

kylog · 2014-10-09T21:09:13Z

@thallgren bookkeeping: can you file a ticket and reference it in the commits?

thallgren · 2014-10-10T15:40:00Z

@kylog commits updates as requested.

hlindberg · 2014-11-06T00:32:59Z

@thallgren can you update the PR, it has gone stale

thallgren · 2014-11-06T11:37:11Z

PR updated. All builds pass except one that fails when running ruby 1.8.7. I have verified that the failure is unrelated to this PR (it fails on master too). The failing test was introduced 8 days ago in commit 49c5dab and the error is:

unsatisfied expectations:
- expected exactly once, not yet invoked: Puppet.warning('pkg warning: [\'Certificate '/var/pkg/ssl/871b4ed0ade09926e6adf95f86bf17535f987684' for publisher 'solarisstudio', needed to access 'https://pkg.oracle.com/solarisstudio/release/', will expire in '29' days.\']')

The code incorrectly assumes that Array.to_s adds brackets. It doesn't in Ruby < 1.9

joshcooper · 2014-11-06T16:23:27Z

@thallgren that was me actually last night. I will take a look, as I don't want to break compatibility with 1.8 for no good reason.

joshcooper · 2014-11-06T17:24:49Z

@thallgren should be resolved now

This commit contains some performance optimizations in methods that showed up as very time consuming in the profiler when running the rake following rake tests: benchmark:many_modules:run benchmark:defined_types4:run benchmark:defined_types:run CPU consumption with this patch in place is between 70% - 75% of what it used to be. Profiling was performed using stackprof https://github.com/tmm1/stackprof

The Resource::copy_as_resource first created a new Resource instance with all bells and whistles which means extracting title, munching, and tagging. This commit ensures that these fields are more effiently initialized when the source is known to be a Resource.

This commit removes an unnecessary cloning of a tag set in the method Puppet::Parser::Resource#add_scope_tags. It also removes unnecessary processing of the tags since they have undergone this processing already.

thallgren · 2014-11-07T13:34:18Z

@joshcooper looks good now. Thanks.

hlindberg · 2014-11-26T22:39:40Z

sigh - this again needs a rebase - I am doing that locally and will merge this in.

hlindberg · 2014-11-26T22:43:38Z

Manually merged at 2d0e0a6 - closing

hlindberg reviewed Oct 3, 2014
View reviewed changes

jpartlow reviewed Oct 3, 2014
View reviewed changes

thallgren force-pushed the compiler-optimizations branch 2 times, most recently from 9ad9224 to 44f29d6 Compare October 4, 2014 07:10

thallgren force-pushed the compiler-optimizations branch from b280915 to 95f15c4 Compare October 5, 2014 11:57

thallgren force-pushed the compiler-optimizations branch from 95f15c4 to 95e07cf Compare October 10, 2014 15:39

thallgren changed the title ~~Optimize CPU intensitive Compiler methods~~ (PUP-3436) Optimize CPU intensitive Compiler methods Oct 10, 2014

hlindberg added the PL label Nov 6, 2014

thallgren force-pushed the compiler-optimizations branch from 95e07cf to 5acbd39 Compare November 6, 2014 10:52

thallgren added 3 commits November 7, 2014 14:24

(PUP-3436) Prevent unnecessary copying of tag sets.

78b354b

This commit removes an unnecessary cloning of a tag set in the method Puppet::Parser::Resource#add_scope_tags. It also removes unnecessary processing of the tags since they have undergone this processing already.

thallgren force-pushed the compiler-optimizations branch from 7bf685f to 78b354b Compare November 7, 2014 13:24

hlindberg closed this Nov 26, 2014

thallgren deleted the compiler-optimizations branch January 15, 2015 11:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

(PUP-3436) Optimize CPU intensitive Compiler methods #3155

(PUP-3436) Optimize CPU intensitive Compiler methods #3155

thallgren commented Oct 3, 2014

hlindberg Oct 3, 2014

thallgren Oct 3, 2014

hlindberg Oct 5, 2014

hlindberg commented Oct 3, 2014

thallgren commented Oct 3, 2014

kylog commented Oct 3, 2014

puppetcla commented Oct 3, 2014

jpartlow Oct 3, 2014

thallgren Oct 3, 2014

jpartlow Oct 3, 2014

thallgren Oct 3, 2014

jpartlow Oct 3, 2014

thallgren commented Oct 4, 2014

kylog commented Oct 9, 2014

thallgren commented Oct 10, 2014

hlindberg commented Nov 6, 2014

thallgren commented Nov 6, 2014

joshcooper commented Nov 6, 2014

joshcooper commented Nov 6, 2014

thallgren commented Nov 7, 2014

hlindberg commented Nov 26, 2014

hlindberg commented Nov 26, 2014

(PUP-3436) Optimize CPU intensitive Compiler methods #3155

(PUP-3436) Optimize CPU intensitive Compiler methods #3155

Conversation

thallgren commented Oct 3, 2014

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

hlindberg commented Oct 3, 2014

thallgren commented Oct 3, 2014

kylog commented Oct 3, 2014

puppetcla commented Oct 3, 2014

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

thallgren commented Oct 4, 2014

kylog commented Oct 9, 2014

thallgren commented Oct 10, 2014

hlindberg commented Nov 6, 2014

thallgren commented Nov 6, 2014

joshcooper commented Nov 6, 2014

joshcooper commented Nov 6, 2014

thallgren commented Nov 7, 2014

hlindberg commented Nov 26, 2014

hlindberg commented Nov 26, 2014