cmd/compile: assembly generated is bigger than previous versions #30229

mariecurried · 2019-02-14T15:14:00Z

What version of Go are you using (`go version`)?

$ go version
go version go1.11.4 windows/amd64

Does this issue reproduce with the latest release?

Yes. I tried on tip and the same happens.

What did you do?

I compiled the following code and inspected the assembly instructions generated by the compiler:

func test(slc [][]int) (int, int) {
	var lentotal, lenslc int
	for _, x := range slc {
		lentotal += len(x)
		lenslc++
	}
	return lentotal, lenslc
}

Assembly code generated:

Version 1.10.1: https://godbolt.org/z/yWzkup
Version 1.11 and tip: https://godbolt.org/z/zdJKsA

What did you expect to see?

I expected the compiler to generate code similar to the 1.10.1 version, because on tip it generates unnecessary jumps and an extra block of XOR's.

What did you see instead?

Instead, the compiler generated more code than what is necessary.
In the 1.10.1 version, the only thing that I think could be different is that, on line 5, the slice address is moved to CX, but it might not be necessary in the case that len(slc) is 0, which is well handled on tip.

Summing up, I believe the code should look something like:

        pcdata  $2, $0
        pcdata  $0, $0
        xorl    DX, DX
        xorl    BX, BX
        movq    "".slc+16(SP), AX
        testq   AX, AX
        jle     test_pc37
        pcdata  $2, $1
        pcdata  $0, $1
        movq    "".slc+8(SP), CX
        jmp     test_pc25
test_pc21:
        addq    $24, CX
test_pc25:
        addq    8(CX), BX
        incq    DX
        cmpq    DX, AX
        jlt     test_pc21
test_pc37:
        pcdata  $2, $0
        movq    BX, "".~r1+32(SP)
        movq    DX, "".~r2+40(SP)
        ret

Regarding the test_pc21 block, it could disappear, as is done in the 1.10.1 version.

The text was updated successfully, but these errors were encountered:

mvdan · 2019-02-14T15:30:17Z

Is this about performance, binary size, or just correctness?

mariecurried · 2019-02-14T15:34:44Z

In terms of correctness, I believe both versions are correct in the sense that they produce the desired behavior.
In this issue, my point is more related towards better performance and smaller binary size, by eliminating redundant JUMP instructions and unneeded assembly code blocks.

mvdan · 2019-02-14T15:38:47Z

Fair enough - was just wondering to appropriately label the issue :)

If you'd like to help get this issue fixed faster, you could try bisecting which Go commit introduced the regression. For example, you could bisect between the go1.10 and go1.11 tags, running make.bash at each step, building a tiny program, and checking its size or number of assembly instructions.

mariecurried · 2019-02-14T16:45:24Z

After following your advice, I found that the commit that started generating this code was 837ed98, which makes sense.
According to the commit message, I believe the test_pc21 block makes sense to exist to prevent the past-the-end pointer. Now, what I think could be made better is the duplicated XOR's, which could be put together as the first instructions in the function, as shown in the assembly I wrote above.

mvdan · 2019-02-14T16:48:45Z

/cc @aclements @randall77 @dr2chase; please see the comments above.

randall77 · 2019-02-14T17:38:03Z

Looks like the phi tighten pass introduces the duplicate xors. If I turn that pass off, then the register allocator also introduces duplicate xors. I can't turn off the register allocator :(

The reason for both of those behaviors is that we're trying to avoid excess register pressure by loading constants into registers as late as possible. For an example reason why, see #16407. It sounds like we're being a bit too aggressive in that regard, but I'm not sure how to design a knob to adjust it, or how to decide where to set it.

mariecurried · 2024-08-07T20:25:42Z

Fixed in Go 1.20

mvdan added the NeedsInvestigation label Feb 14, 2019

mvdan added the Performance label Feb 14, 2019

gopherbot added the compiler/runtime label Jul 13, 2022

mknyszek added this to Go Compiler / Runtime Jul 13, 2022

mknyszek moved this to Triage Backlog in Go Compiler / Runtime Jul 15, 2022

seankhliao added this to the Unplanned milestone Aug 20, 2022

mariecurried closed this as completed Aug 7, 2024

github-project-automation bot moved this from Triage Backlog to Done in Go Compiler / Runtime Aug 7, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cmd/compile: assembly generated is bigger than previous versions #30229

cmd/compile: assembly generated is bigger than previous versions #30229

mariecurried commented Feb 14, 2019 •

edited

Loading

mvdan commented Feb 14, 2019

mariecurried commented Feb 14, 2019

mvdan commented Feb 14, 2019

mariecurried commented Feb 14, 2019

mvdan commented Feb 14, 2019

randall77 commented Feb 14, 2019

mariecurried commented Aug 7, 2024

cmd/compile: assembly generated is bigger than previous versions #30229

cmd/compile: assembly generated is bigger than previous versions #30229

Comments

mariecurried commented Feb 14, 2019 • edited Loading

What version of Go are you using (go version)?

Does this issue reproduce with the latest release?

What did you do?

What did you expect to see?

What did you see instead?

mvdan commented Feb 14, 2019

mariecurried commented Feb 14, 2019

mvdan commented Feb 14, 2019

mariecurried commented Feb 14, 2019

mvdan commented Feb 14, 2019

randall77 commented Feb 14, 2019

mariecurried commented Aug 7, 2024

mariecurried commented Feb 14, 2019 •

edited

Loading

What version of Go are you using (`go version`)?