-
Notifications
You must be signed in to change notification settings - Fork 17.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
cmd/compile: reduce binary bloat for slice literals of interfaces #29573
Comments
This is a case where removing the write barrier could help a lot. To remove the write barrier we need to know 2 things:
The latter is easy, the addresses of globals and nil pointers are not heap pointers. The former is harder. We are writing to a newly allocated object, so it starts out zeroed, but in order to be sure each write is to a zeroed location we need to understand that all the writes are to distinct parts of the newly allocated object. This is the kind of thing that https://go-review.googlesource.com/c/go/+/151498 was supposed to help with, but it can't quite prove it's safe to remove the write barrier in this case. It shouldn't be hard to get this right for the stylized initializations that are done for slice literals and friends. It just needs a bit more analysis in the write barrier insertion pass. |
Another possibility to solve this issue is to make it so we initialize a statictmp with the desired contents and do just one
|
Note that this does work currently if the slice literal is at global scope:
|
Generates just:
|
Change https://golang.org/cl/156363 mentions this issue: |
Change https://golang.org/cl/157799 mentions this issue: |
When initializing a new object, we're often writing 1) to a location that doesn't have a pointer to a heap object 2) a pointer that doesn't point to a heap object When both those conditions are true, we can avoid the write barrier. This CL detects case 1 by looking for writes to known-zeroed locations. The results of runtime.newobject are zeroed, and we perform a simple tracking of which parts of that object are written so we can determine what part remains zero at each write. This CL detects case 2 by looking for addresses of globals (including the types and itabs which are used in interfaces) and for nil pointers. Makes cmd/go 0.3% smaller. Some particular cases, like the slice literal in #29573, can get much smaller. TODO: we can remove actual zero writes also with this mechanism. Update #29573 Change-Id: Ie74a3533775ea88da0495ba02458391e5db26cb9 Reviewed-on: https://go-review.googlesource.com/c/go/+/156363 Run-TryBot: Keith Randall <khr@golang.org> TryBot-Result: Gobot Gobot <gobot@golang.org> Reviewed-by: Cherry Zhang <cherryyz@google.com>
Drive-by comment: I note that when compiling with tip, the generated code looks better (no longer includes write barriers): "".makeLiteral STEXT size=304 args=0x18 locals=0x18 0x0000 00000 (issue29573.go:8) TEXT "".MakeLiteral(SB), ABIInternal, $24-24 0x0000 00000 (issue29573.go:8) MOVQ (TLS), CX 0x0009 00009 (issue29573.go:8) CMPQ SP, 16(CX) 0x000d 00013 (issue29573.go:8) JLS 294 0x0013 00019 (issue29573.go:8) SUBQ $24, SP 0x0017 00023 (issue29573.go:8) MOVQ BP, 16(SP) 0x001c 00028 (issue29573.go:8) LEAQ 16(SP), BP 0x0021 00033 (issue29573.go:8) FUNCDATA $0, gclocals·9fb7f0986f647f17cb53dda1484e0f7a(SB) 0x0021 00033 (issue29573.go:8) FUNCDATA $1, gclocals·69c1753bd5f81501d95132d08af04464(SB) 0x0021 00033 (issue29573.go:8) FUNCDATA $2, gclocals·bfec7e55b3f043d1941c093912808913(SB) 0x0021 00033 (issue29573.go:9) PCDATA $0, $1 0x0021 00033 (issue29573.go:9) PCDATA $1, $0 0x0021 00033 (issue29573.go:9) LEAQ type.[10]interface {}(SB), AX 0x0028 00040 (issue29573.go:9) PCDATA $0, $0 0x0028 00040 (issue29573.go:9) MOVQ AX, (SP) 0x002c 00044 (issue29573.go:9) CALL runtime.newobject(SB) 0x0031 00049 (issue29573.go:9) PCDATA $0, $1 0x0031 00049 (issue29573.go:9) MOVQ 8(SP), AX 0x0036 00054 (issue29573.go:10) PCDATA $0, $2 0x0036 00054 (issue29573.go:10) LEAQ type.uint32(SB), CX 0x003d 00061 (issue29573.go:10) PCDATA $0, $1 0x003d 00061 (issue29573.go:10) MOVQ CX, (AX) 0x0040 00064 (issue29573.go:10) PCDATA $0, $2 0x0040 00064 (issue29573.go:10) LEAQ ""..stmp_0(SB), CX 0x0047 00071 (issue29573.go:10) PCDATA $0, $1 0x0047 00071 (issue29573.go:10) MOVQ CX, 8(AX) 0x004b 00075 (issue29573.go:11) PCDATA $0, $2 0x004b 00075 (issue29573.go:11) LEAQ type.archive/tar.Format(SB), CX 0x0052 00082 (issue29573.go:11) PCDATA $0, $1 0x0052 00082 (issue29573.go:11) MOVQ CX, 16(AX) 0x0056 00086 (issue29573.go:11) PCDATA $0, $2 0x0056 00086 (issue29573.go:11) LEAQ ""..stmp_1(SB), CX 0x005d 00093 (issue29573.go:11) PCDATA $0, $1 0x005d 00093 (issue29573.go:11) MOVQ CX, 24(AX) 0x0061 00097 (issue29573.go:12) PCDATA $0, $2 0x0061 00097 (issue29573.go:12) LEAQ type.*archive/tar.Header(SB), CX 0x0068 00104 (issue29573.go:12) PCDATA $0, $1 0x0068 00104 (issue29573.go:12) MOVQ CX, 32(AX) 0x006c 00108 (issue29573.go:12) MOVQ $0, 40(AX) 0x0074 00116 (issue29573.go:13) PCDATA $0, $2 0x0074 00116 (issue29573.go:13) LEAQ type.*archive/tar.Reader(SB), CX 0x007b 00123 (issue29573.go:13) PCDATA $0, $1 0x007b 00123 (issue29573.go:13) MOVQ CX, 48(AX) 0x007f 00127 (issue29573.go:13) MOVQ $0, 56(AX) 0x0087 00135 (issue29573.go:14) PCDATA $0, $2 0x0087 00135 (issue29573.go:14) LEAQ type.*archive/tar.Writer(SB), CX 0x008e 00142 (issue29573.go:14) PCDATA $0, $1 0x008e 00142 (issue29573.go:14) MOVQ CX, 64(AX) 0x0092 00146 (issue29573.go:14) MOVQ $0, 72(AX) 0x009a 00154 (issue29573.go:15) PCDATA $0, $2 0x009a 00154 (issue29573.go:15) LEAQ type.*archive/zip.File(SB), CX 0x00a1 00161 (issue29573.go:15) PCDATA $0, $1 0x00a1 00161 (issue29573.go:15) MOVQ CX, 80(AX) 0x00a5 00165 (issue29573.go:15) MOVQ $0, 88(AX) 0x00ad 00173 (issue29573.go:16) PCDATA $0, $2 0x00ad 00173 (issue29573.go:16) LEAQ type.*archive/zip.FileHeader(SB), CX 0x00b4 00180 (issue29573.go:16) PCDATA $0, $1 0x00b4 00180 (issue29573.go:16) MOVQ CX, 96(AX) 0x00b8 00184 (issue29573.go:16) MOVQ $0, 104(AX) 0x00c0 00192 (issue29573.go:17) PCDATA $0, $2 0x00c0 00192 (issue29573.go:17) LEAQ type.*archive/zip.ReadCloser(SB), CX 0x00c7 00199 (issue29573.go:17) PCDATA $0, $1 0x00c7 00199 (issue29573.go:17) MOVQ CX, 112(AX) 0x00cb 00203 (issue29573.go:17) MOVQ $0, 120(AX) 0x00d3 00211 (issue29573.go:18) PCDATA $0, $2 0x00d3 00211 (issue29573.go:18) LEAQ type.*archive/zip.Reader(SB), CX 0x00da 00218 (issue29573.go:18) PCDATA $0, $1 0x00da 00218 (issue29573.go:18) MOVQ CX, 128(AX) 0x00e1 00225 (issue29573.go:18) MOVQ $0, 136(AX) 0x00ec 00236 (issue29573.go:19) PCDATA $0, $2 0x00ec 00236 (issue29573.go:19) LEAQ type.*archive/zip.Writer(SB), CX 0x00f3 00243 (issue29573.go:19) PCDATA $0, $1 0x00f3 00243 (issue29573.go:19) MOVQ CX, 144(AX) 0x00fa 00250 (issue29573.go:19) MOVQ $0, 152(AX) 0x0105 00261 (issue29573.go:9) PCDATA $0, $0 0x0105 00261 (issue29573.go:9) PCDATA $1, $1 0x0105 00261 (issue29573.go:9) MOVQ AX, "".~r0+32(SP) 0x010a 00266 (issue29573.go:9) MOVQ $10, "".~r0+40(SP) 0x0113 00275 (issue29573.go:9) MOVQ $10, "".~r0+48(SP) 0x011c 00284 (issue29573.go:9) MOVQ 16(SP), BP 0x0121 00289 (issue29573.go:9) ADDQ $24, SP 0x0125 00293 (issue29573.go:9) RET 0x0126 00294 (issue29573.go:9) NOP 0x0126 00294 (issue29573.go:8) PCDATA $1, $-1 0x0126 00294 (issue29573.go:8) PCDATA $0, $-1 0x0126 00294 (issue29573.go:8) CALL runtime.morestack_noctxt(SB) 0x012b 00299 (issue29573.go:8) JMP 0 Can this bug be closed out as a result? |
Yep. Can confirm this is much better. |
Using
go1.12
Consider the following snippet:
In my program I have many of these slice literals, some with thousands of entries. The generated code is ~60B/entry, when I expect that this could theoretically be closer to 8B/entry for the
*rtype
pointer.\cc @randall77
The text was updated successfully, but these errors were encountered: