cmd/compile: optimize type switch #18492

bradfitz · 2017-01-02T18:40:25Z

In code review https://go-review.googlesource.com/c/34773/, John Doe points out that a type switch in an inner loop can be optimized by pulling it out of the loop, setting an integer, and switching on the integer value instead.

John Doe provided this benchmark: https://play.golang.org/p/6LXF82e6U4

With tip, I get:

$ go test -v -bench=. -benchtime=2s
BenchmarkTypeSwitchInside-4         2000           1298710 ns/op
BenchmarkTypeSwitchOutside-4       10000            475899 ns/op

These seem like they could be identical. In both cases, it's just a switch on an integer. In the "Inside" case, that integer just happens to be in the first word of an interface value.

/cc @randall77 @mdempsky @josharian

The text was updated successfully, but these errors were encountered:

josharian · 2017-01-02T18:57:57Z

cc @dr2chase who likes hoisting expensive operations out of loops

mvdan · 2017-01-02T19:52:29Z

Just to clarify, is this about optimizing all type switches or just about those that can be moved outside of a loop?

josharian · 2017-01-03T18:56:52Z

As I understand it, moving expensive things out of loops. But maybe I'm missing something.

minux · 2017-01-03T20:45:06Z

I think the first goal should be to lower the cost of type switches to that of integer switches and then we can talk about how to hoist expensive part out of loops. The first goal clearly benefits more than the 2nd because the programmer always have a way to do that themselves.

mvdan · 2017-01-03T21:15:53Z

I agree that the former is more interesting. It's also what I would understand given the issue title.

randall77 · 2017-01-03T21:52:50Z

In go 1.8, the type switch compiles fairly small. The non-empty-interface-to-concrete-type switch compiles to just a few instructions.

func f(t I) int {
	switch t.(type) {
	case *T:
		return 1
	}
	return 0
}

generates for the switch

	0x0000 00000 (tmp1.go:14)	MOVQ	"".t+8(FP), AX    // AX = itab of interface
	0x0005 00005 (tmp1.go:14)	TESTQ	AX, AX
	0x0008 00008 (tmp1.go:14)	JEQ	52                // nil interface
	0x000a 00010 (tmp1.go:14)	MOVQ	8(AX), CX         // CX = type
	0x000e 00014 (tmp1.go:14)	MOVL	16(CX), DX        // DX = hash of type
	0x0011 00017 (tmp1.go:14)	CMPL	DX, $432690315
	0x0017 00023 (tmp1.go:14)	JNE	52
	0x0019 00025 (tmp1.go:14)	TESTQ	AX, AX
	0x001c 00028 (tmp1.go:14)	JEQ	$0, 62
	0x001e 00030 (tmp1.go:18)	LEAQ	type.*"".T(SB), AX
	0x0025 00037 (tmp1.go:14)	CMPQ	AX, CX
	0x0028 00040 (tmp1.go:14)	JNE	$0, 52
	0x002a 00042 (tmp1.go:16)	MOVQ	$1, "".~r1+24(FP)
	0x0033 00051 (tmp1.go:16)	RET
	0x0034 00052 (tmp1.go:18)	MOVQ	$0, "".~r1+24(FP)
	0x003d 00061 (tmp1.go:18)	RET
	0x003e 00062 (tmp1.go:14)	MOVQ	AX, CX
	0x0041 00065 (tmp1.go:18)	JMP	30

It's not optimal yet (the redundant TESTQ), but it's pretty small. The code is even slightly smaller when switching on empty interfaces.

Possibly for small switches we could get rid of the hash comparison and check the type pointers directly. (The hash enables binary search. We'd have to do linear search if we just used the type pointers.)

For nonempty interfaces, maybe we could even compare the itab with the address of the now-statically-known I/*T itab struct? Then we wouldn't even need to load the type out of the itab.

The more general question, how to pull this stuff out of the loop altogether, is harder. It involves memory operations so it isn't clear lifting is legal just from looking at the SSA form. We'd have to understand that some of the loads are from never-changing memory locations.

gopherbot · 2017-01-04T00:45:15Z

CL https://golang.org/cl/34810 mentions this issue.

rasky · 2017-01-04T01:30:10Z

For larger switches, given that we already have a compile-time known hash per type, it looks like it would be quite easy to use something along the lines of MRST. In other words, a small sequence of bits in those hashes could be used as perfect hashing to index a small jump table.

randall77 · 2017-01-04T01:34:25Z

Jump tables: #5496, #10870

josharian · 2017-01-04T01:36:17Z

Jump tables see also #15780 (comment)

When doing i.(T) for non-empty-interface i and concrete type T, there's no need to read the type out of the itab. Just compare the itab to the itab we expect for that interface/type pair. Also optimize type switches by putting the type hash of the concrete type in the itab. That way we don't need to load the type pointer out of the itab. Update #18492 Change-Id: I49e280a21e5687e771db5b8a56b685291ac168ce Reviewed-on: https://go-review.googlesource.com/34810 Run-TryBot: Keith Randall <khr@golang.org> TryBot-Result: Gobot Gobot <gobot@golang.org> Reviewed-by: Josh Bleecher Snyder <josharian@gmail.com> Reviewed-by: David Chase <drchase@google.com>

ghost · 2017-06-30T02:15:18Z

With go1.9beta2, I get

BenchmarkTypeSwitchInside-4         2000            687881 ns/op
BenchmarkTypeSwitchOutside-4        3000            443001 ns/op

ghost · 2018-11-19T06:09:59Z

Go 1.11.2:

BenchmarkTypeSwitchInside-4         3000            513823 ns/op
BenchmarkTypeSwitchOutside-4        3000            444997 ns/op

odeke-em · 2020-05-29T09:12:38Z

For Go1.15, here are the results

$ benchstat before.txt after.txt 
name          old time/op  new time/op  delta
TypeSwitch-8   576µs ± 4%   579µs ± 4%   ~     (p=0.548 n=5+5)

after

$ go test -run=^$ -bench=. -count=5 
goos: darwin
goarch: amd64
pkg: github.com/odeke-em/bugs/golang/18492
BenchmarkTypeSwitchInside-8    	    2011	    572145 ns/op
BenchmarkTypeSwitchInside-8    	    2148	    555198 ns/op
BenchmarkTypeSwitchInside-8    	    2050	    579787 ns/op
BenchmarkTypeSwitchInside-8    	    2040	    586727 ns/op
BenchmarkTypeSwitchInside-8    	    1983	    583754 ns/op
BenchmarkTypeSwitchOutside-8   	    2128	    576922 ns/op
BenchmarkTypeSwitchOutside-8   	    2080	    557053 ns/op
BenchmarkTypeSwitchOutside-8   	    2116	    588772 ns/op
BenchmarkTypeSwitchOutside-8   	    2005	    582071 ns/op
BenchmarkTypeSwitchOutside-8   	    2013	    592535 ns/op
PASS
ok  	github.com/odeke-em/bugs/golang/18492	13.956s

Perhaps we can close this issue, as the tertiary improvements have their own issues.

josharian · 2020-05-29T13:21:19Z

I wonder whether the thing that "fixed" this was https://go-review.googlesource.com/c/go/+/228106

josharian · 2020-05-29T13:21:50Z

Either way, yes, let's call this good. Thanks for checking, @odeke-em.

bradfitz added the Performance label Jan 2, 2017

bradfitz added this to the Go1.9Maybe milestone Jan 2, 2017

bradfitz modified the milestones: Go1.10, Go1.9Maybe Jun 30, 2017

bradfitz added the NeedsFix The path to resolution is known, but the work has not been done. label Jun 30, 2017

bradfitz modified the milestones: Go1.10, Go1.11 Nov 28, 2017

bradfitz modified the milestones: Go1.11, Unplanned May 18, 2018

josharian closed this as completed May 29, 2020

golang locked and limited conversation to collaborators May 29, 2021

gopherbot added the FrozenDueToAge label May 29, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cmd/compile: optimize type switch #18492

cmd/compile: optimize type switch #18492

bradfitz commented Jan 2, 2017 •

edited by josharian

Loading

josharian commented Jan 2, 2017

mvdan commented Jan 2, 2017

josharian commented Jan 3, 2017

minux commented Jan 3, 2017 via email

mvdan commented Jan 3, 2017

randall77 commented Jan 3, 2017 •

edited

Loading

gopherbot commented Jan 4, 2017

rasky commented Jan 4, 2017

randall77 commented Jan 4, 2017

josharian commented Jan 4, 2017

ghost commented Jun 30, 2017

ghost commented Nov 19, 2018

odeke-em commented May 29, 2020

josharian commented May 29, 2020

josharian commented May 29, 2020

cmd/compile: optimize type switch #18492

cmd/compile: optimize type switch #18492

Comments

bradfitz commented Jan 2, 2017 • edited by josharian Loading

josharian commented Jan 2, 2017

mvdan commented Jan 2, 2017

josharian commented Jan 3, 2017

minux commented Jan 3, 2017 via email

mvdan commented Jan 3, 2017

randall77 commented Jan 3, 2017 • edited Loading

gopherbot commented Jan 4, 2017

rasky commented Jan 4, 2017

randall77 commented Jan 4, 2017

josharian commented Jan 4, 2017

ghost commented Jun 30, 2017

ghost commented Nov 19, 2018

odeke-em commented May 29, 2020

josharian commented May 29, 2020

josharian commented May 29, 2020

bradfitz commented Jan 2, 2017 •

edited by josharian

Loading

randall77 commented Jan 3, 2017 •

edited

Loading