Introduce CallEngine assigned to api.Function implementation. #761

mathetake · 2022-08-24T02:33:01Z

This introduces wasm.CallEngine internal type, and assign it to the api.Function
implementations. api.Function.Call now uses that CallEngine assigned to to it
to make function calls.

Internally, when creating CallEngine implementation, the compiler engine allocates
call frames and values stack. Previously, we allocate these stacks for each function calls,
which was a severe overhead as we can recognize in the benchmarks. As a result,
this reduces the memory usage (== reduces the GC jobs) as long as we reuse
the same api.Function multiple times.

As a side effect, now api.Function.Call is not goroutine-safe. So this adds the comment
about it on that method.

Benchmark result

before

### amd64 

goos: linux
goarch: amd64
pkg: github.com/tetratelabs/wazero/internal/integration_test/vs/jit
cpu: AMD Ryzen 9 3950X 16-Core Processor
BenchmarkAllocation/Call-32         	  105142	     11371 ns/op	    5000 B/op	      21 allocs/op


### arm64

goos: darwin
goarch: arm64
pkg: github.com/tetratelabs/wazero/internal/integration_test/vs/jit
BenchmarkAllocation/Call-10         	  200076	      5730 ns/op	    5048 B/op	      21 allocs/op

after

### amd64

goos: linux
goarch: amd64
pkg: github.com/tetratelabs/wazero/internal/integration_test/vs/jit
cpu: AMD Ryzen 9 3950X 16-Core Processor
BenchmarkAllocation/Call-32         	  168476	      8470 ns/op	     200 B/op	      11 allocs/op


### arm64

goos: darwin
goarch: arm64
pkg: github.com/tetratelabs/wazero/internal/integration_test/vs/jit
BenchmarkAllocation/Call-10         	  246650	      4895 ns/op	     200 B/op	      11 allocs/op

pprof

before

      flat  flat%   sum%        cum   cum%
     0.69s 52.67% 52.67%      0.69s 52.67%  runtime.madvise // <--- meaning majority is spent on allocation
     0.19s 14.50% 67.18%      0.19s 14.50%  runtime._ExternalCode
     0.16s 12.21% 79.39%      0.16s 12.21%  runtime.usleep
     0.07s  5.34% 84.73%      0.07s  5.34%  runtime.pthread_cond_wait
     0.05s  3.82% 88.55%      0.05s  3.82%  runtime.pthread_cond_signal
     0.03s  2.29% 90.84%      0.03s  2.29%  runtime.kevent
     0.03s  2.29% 93.13%      0.03s  2.29%  runtime.pthread_kill
     0.01s  0.76% 93.89%      0.01s  0.76%  github.com/tetratelabs/wazero/internal/engine/compiler.(*callEngine).execWasmFunction
     0.01s  0.76% 94.66%      0.01s  0.76%  runtime.asmcgocall
     0.01s  0.76% 95.42%      0.01s  0.76%  runtime.funcInfo.entry

after

      flat  flat%   sum%        cum   cum%
     320ms 42.67% 42.67%      320ms 42.67%  runtime._ExternalCode // <--- now native code execution is dominant
     270ms 36.00% 78.67%      270ms 36.00%  runtime.madvise
      20ms  2.67% 81.33%       80ms 10.67%  reflect.Value.call
      20ms  2.67% 84.00%       20ms  2.67%  reflect.directlyAssignable
      20ms  2.67% 86.67%       40ms  5.33%  reflect.funcLayout
      10ms  1.33% 88.00%       10ms  1.33%  github.com/tetratelabs/wazero/internal/integration_test/vs.(*wazeroRuntime).log
      10ms  1.33% 89.33%      130ms 17.33%  github.com/tetratelabs/wazero/internal/integration_test/vs.allocationCall
      10ms  1.33% 90.67%       10ms  1.33%  reflect.Value.Elem
      10ms  1.33% 92.00%       10ms  1.33%  runtime.getitab
      10ms  1.33% 93.33%       10ms  1.33%  runtime.pthread_kill

Signed-off-by: Takeshi Yoneda takeshi@tetrate.io

Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>

codefromthecrypt

just a few cleanups. Glad the perf is so much better and before our beta!

api/wasm.go

internal/engine/compiler/engine.go

internal/engine/interpreter/interpreter.go

internal/integration_test/vs/bench.go

internal/integration_test/vs/interpreter/interpreter_test.go

internal/wasm/call_context.go

internal/wasm/module.go

internal/wasm/gofunc.go

Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>

mathetake · 2022-08-24T07:08:31Z

ok now in good shape! Thanks @codefromthecrypt

mathetake added 5 commits August 24, 2022 11:32

Introduce CallEngine assigned to api.Function implementation.

d9b740a

Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>

more

dc98f07

Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>

more

6058645

Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>

more

c2aaed7

Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>

delete useless

9bbdad3

Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>

mathetake marked this pull request as ready for review August 24, 2022 05:23

mathetake requested a review from codefromthecrypt as a code owner August 24, 2022 05:23

codefromthecrypt approved these changes Aug 24, 2022

View reviewed changes

mathetake added 5 commits August 24, 2022 15:44

fuzz

0a5be7a

Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>

feedback

666ec95

Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>

fix

3c6233d

Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>

ok

2099c58

Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>

comment

bb0b07d

Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>

mathetake merged commit 0bd2bee into main Aug 24, 2022

mathetake deleted the callengineperapifunction branch August 24, 2022 07:11

inkeliz mentioned this pull request Aug 30, 2022

benchmark: Updates wazero to 1.0.0-beta.2 inkeliz/karmem#85

Merged

mathetake mentioned this pull request Aug 31, 2022

compiler: zero-initializes moduleInstanceAddress of call engine #783

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Introduce CallEngine assigned to api.Function implementation. #761

Introduce CallEngine assigned to api.Function implementation. #761

mathetake commented Aug 24, 2022 •

edited

Loading

codefromthecrypt left a comment

mathetake commented Aug 24, 2022

Introduce CallEngine assigned to api.Function implementation. #761

Introduce CallEngine assigned to api.Function implementation. #761

Conversation

mathetake commented Aug 24, 2022 • edited Loading

Benchmark result

before

after

pprof

before

after

codefromthecrypt left a comment

Choose a reason for hiding this comment

mathetake commented Aug 24, 2022

mathetake commented Aug 24, 2022 •

edited

Loading