reduce cache mishits #2583

shyouhei · 2019-10-21T09:10:45Z

While running the Discourse benchmark under Linux perf, I noticed that one of the most frequent operations Ruby performs is method lookup.

That was kind of known, but something didn't smell right. I pushed 3ffd98c and ran the benchmark again, to get this output:

So 65,039,079 out of 94,613,117 inline cache mishits (roughly 69%) are spurious: they trigger a method lookup that ends up at exactly the same method entry the cache already stored. This is not buggy behaviour, and your program works as expected (apart from unnecessarily consuming our precious environmental resources to generate electricity for the computation). However, there is definitely room for improvement.

Let's use the cache more efficiently. Several classes often share the identical method entry for a given method name, for instance through inheritance or module inclusion. We can exploit this: a single call cache can be valid for multiple classes. To express that, this changeset expands struct rb_call_cache from 44-ish bytes to the width of a cache line. The added space stores the second and later class serials.
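To make the idea concrete, here is a minimal sketch of a call cache that keeps several class serials in one cache line. It is not the actual patch: `struct call_cache`, `cache_hit`, `N_SERIALS`, the field names, and the 64-byte `CACHE_LINE` are assumptions made for illustration, and a serial of 0 is assumed to mark an empty slot.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

#define CACHE_LINE 64                 /* assumed cache line width, in bytes */

typedef uint64_t serial_t;            /* stand-in for a class serial; 0 = empty slot */

/* How many serial slots fit once the other fields are accounted for. */
#define N_SERIALS \
    ((CACHE_LINE - sizeof(serial_t) - 2 * sizeof(void *) \
      - sizeof(void (*)(void)) - sizeof(unsigned int)) / sizeof(serial_t))

/* Hypothetical call cache: one method entry, several classes that share it. */
struct call_cache {
    serial_t method_state;            /* key: global method state */
    serial_t class_serial[N_SERIALS]; /* keys: classes known to resolve to `me` */
    const void *me;                   /* value: cached method entry */
    const void *def;                  /* value: cached method definition */
    void (*call)(void);               /* value: call handler */
    unsigned int aux;                 /* value: auxiliary data */
};

_Static_assert(sizeof(struct call_cache) <= CACHE_LINE,
               "call cache must fit in one cache line");

/* The cache is valid if the method state matches and ANY stored serial
 * equals the receiver's class serial. */
static bool cache_hit(const struct call_cache *cc,
                      serial_t method_state, serial_t klass)
{
    if (cc->method_state != method_state) return false;
    for (size_t i = 0; i < N_SERIALS; i++) {
        if (cc->class_serial[i] == 0) break;      /* no more entries */
        if (cc->class_serial[i] == klass) return true;
    }
    return false;
}
```

With these assumed field sizes on a 64-bit machine, three serial slots fit, so one call site can stay cached for up to three classes that share the method entry.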

With this changeset, the debug counter output for the same benchmark now looks like this:

The mc_miss_spurious counter dropped down to 23,344,738.

methodmissing · 2019-10-21T23:38:01Z

Excellent write up 🤤

shyouhei · 2019-10-23T01:08:09Z

@ko1 any idea?

shyouhei · 2019-10-25T04:10:32Z

Conflicts with #2564, going to resolve...

shyouhei · 2019-11-04T07:19:41Z

Can I merge this?

Prior to this changeset, the majority of inline cache mishits resolved to the same method entry when rb_callable_method_entry() performed the method search. Let's not call the function in the first place in such situations.

In doing so we extend struct rb_call_cache from 44 bytes (on a 64-bit machine) to 64 bytes, and fill the gap with secondary class serial(s). The call cache's class serials now behave as an LRU cache.

Calculating -------------------------------------
                           ours         2.7         2.6
vm2_poly_same_method     2.339M      1.744M      1.369M i/s -      6.000M times in 2.565086s 3.441329s 4.381386s

Comparison:
             vm2_poly_same_method
                ours:   2339103.0 i/s
                 2.7:   1743512.3 i/s - 1.34x slower
                 2.6:   1369429.8 i/s - 1.71x slower
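The LRU behaviour of the class serials mentioned in the commit message could look roughly like the sketch below. It is only an illustration under stated assumptions: `lru_lookup`, `lru_insert`, and the fixed `N_SERIALS` of 3 are hypothetical names and values, and 0 is assumed to mark an empty slot. The real code in the changeset may differ.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

#define N_SERIALS 3                   /* assumed number of serial slots */

typedef uint64_t serial_t;            /* stand-in for a class serial; 0 = empty slot */

/* Search the slots for `klass`. On a hit, shift the entries in front of it
 * back by one and move `klass` to slot 0, so the most recently used serial
 * is compared first next time. */
static bool lru_lookup(serial_t slots[N_SERIALS], serial_t klass)
{
    for (size_t i = 0; i < N_SERIALS && slots[i] != 0; i++) {
        if (slots[i] != klass) continue;
        for (; i > 0; i--) slots[i] = slots[i - 1];
        slots[0] = klass;
        return true;
    }
    return false;
}

/* After a miss whose full lookup found the same method entry, remember
 * `klass` at the front. The last slot (least recently used) is dropped. */
static void lru_insert(serial_t slots[N_SERIALS], serial_t klass)
{
    for (size_t i = N_SERIALS - 1; i > 0; i--) slots[i] = slots[i - 1];
    slots[0] = klass;
}
```

Keeping the most recent serial in slot 0 means a monomorphic call site still hits on the first comparison, so the extra slots should cost little when they are not needed.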

shyouhei force-pushed the n-way-cc branch 2 times, most recently from 21b977c to 4a05e3a Compare October 21, 2019 12:06

shyouhei force-pushed the n-way-cc branch from 4a05e3a to 9f323ac Compare October 23, 2019 01:07

shyouhei marked this pull request as ready for review October 23, 2019 01:07

shyouhei force-pushed the n-way-cc branch from 9f323ac to 241f0de Compare October 26, 2019 11:20

shyouhei force-pushed the n-way-cc branch 5 times, most recently from 3833358 to 1de890a Compare November 7, 2019 05:31

shyouhei force-pushed the n-way-cc branch from 0ed795b to 99b15b7 Compare November 7, 2019 07:06

shyouhei merged commit d45a013 into ruby:master Nov 7, 2019

shyouhei deleted the n-way-cc branch November 7, 2019 08:41

eregon mentioned this pull request Dec 30, 2019

Write specs for new Ruby 2.7 features and changes ruby/spec#745

Open

70 tasks

georgie84 mentioned this pull request Nov 13, 2020

Ruby 2.7 Support jruby/jruby#6464

Closed
