cmd/compile: assigning large values does not use memmove #10362

davecheney · 2015-04-07T13:29:54Z

Consider this piece of code:

package main

import "fmt"

func main() {
        f()
}

func f() {
        var a [200]int
        var b [200]int

        a = b
        b = a

        fmt.Println(&a, &b)
}

Assigning a = b or b = a, where the size of a or b is above the DUFFCOPY limit of 128 words, produces some very simplistic code:

        a = b
   10c60:       e1a01004        mov     r1, r4
   10c64:       e1a00005        mov     r0, r5
   10c68:       e2843e32        add     r3, r4, #800    ; 0x320
   10c6c:       e4912004        ldr     r2, [r1], #4
   10c70:       e4802004        str     r2, [r0], #4
   10c74:       e1530001        cmp     r3, r1
   10c78:       1afffffb        bne     10c6c <main.f+0x4c>
        b = a
   10c7c:       e1a01005        mov     r1, r5
   10c80:       e1a00004        mov     r0, r4
   10c84:       e2853e32        add     r3, r5, #800    ; 0x320
   10c88:       e4912004        ldr     r2, [r1], #4
   10c8c:       e4802004        str     r2, [r0], #4
   10c90:       e1530001        cmp     r3, r1
   10c94:       1afffffb        bne     10c88 <main.f+0x68>

Should sgen/stackcopy take the opportunity to set up a call to runtime.memmove for values larger than 128 words?


davecheney · 2015-04-07T13:37:40Z

This benchmark shows the cliff when values pass the upper limit of DUFFCOPY:

http://paste.ubuntu.com/10762232/

root@labs-782e8a:~/src/duffbench# go test -bench=.
testing: warning: no tests to run
PASS
BenchmarkCopy1          300000000                3.89 ns/op
BenchmarkCopy4          50000000                40.0 ns/op
BenchmarkCopy16         20000000                77.5 ns/op
BenchmarkCopy32         20000000               139 ns/op
BenchmarkCopy64          5000000               255 ns/op
BenchmarkCopy128         3000000               538 ns/op
BenchmarkCopy129         3000000               745 ns/op   <<<<
BenchmarkCopy256         2000000              1088 ns/op
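
The benchmark source lives only behind the paste link above and is not reproduced here. As a rough sketch of what a benchmark of this shape could look like (not the original code), the program below times array assignments on either side of the 128-word threshold. The type names, variable names, and the use of testing.Benchmark from a plain main program are all assumptions made for illustration.

```go
package main

import (
	"fmt"
	"testing"
)

// Hypothetical array types on either side of the 128-word threshold.
// uintptr is one machine word wide, so the lengths are word counts.
type (
	words128 [128]uintptr
	words129 [129]uintptr
)

// Package-level variables, so the copies have an observable effect.
var (
	src128, dst128 words128
	src129, dst129 words129
)

// benchCopy128 times assignment of a 128-word value.
func benchCopy128(b *testing.B) {
	for i := 0; i < b.N; i++ {
		dst128 = src128
	}
}

// benchCopy129 times assignment of a 129-word value.
func benchCopy129(b *testing.B) {
	for i := 0; i < b.N; i++ {
		dst129 = src129
	}
}

func main() {
	// testing.Benchmark runs a benchmark function outside of "go test".
	fmt.Println("Copy128", testing.Benchmark(benchCopy128))
	fmt.Println("Copy129", testing.Benchmark(benchCopy129))
}
```

Timings depend on the architecture and compiler version, so the numbers this prints are not expected to match the table above.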

josharian · 2015-04-07T15:17:13Z

6g and 8g use REP with MOVSL/MOVSQ, which I believe @randall77 determined to be faster around that threshold. I would believe that the other architectures could benefit from a call to memmove or something similar. (This is a place where NEON should shine.)

rsc · 2015-04-10T03:59:54Z

[Please don't use { } syntax in bug headings. It doesn't sort well.]

This may apply to some subset of the non-x86 systems.
The x86 systems are doing the right thing.

minux · 2015-04-10T04:05:09Z

We need a memmove that takes its arguments in registers rather than from the stack (i.e. duffcopy style), otherwise the optimization won't work.

randall77 · 2015-04-15T01:35:57Z

538->745 is hardly a "cliff". I'm surprised it is so close given the lack of anyone tuning this mechanism on arm. (Or did I miss someone doing that?)

minux is right, the moves generated here are sometimes used to marshal arguments to a function, so we can't call a function to do the marshaling. For other situations like your a=b example you could call memmove. It might take some work to distinguish those two cases, however. At the move generation point the marshaling has already been turned into a=b assignments.
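
For the plain a = b case, the two source-level ways of copying a large array can be put side by side. This is a minimal sketch, not a proposed compiler change: copyByAssign and copyBySlice are hypothetical names, and it assumes the built-in copy on slices of pointer-free elements such as int is carried out by the runtime's memmove. It shows only that both forms produce the same result, not which is faster.

```go
package main

import "fmt"

// copyByAssign copies with a plain array assignment; the compiler picks
// the code sequence used for the move.
func copyByAssign(dst, src *[200]int) {
	*dst = *src
}

// copyBySlice copies through the built-in copy on slices of the arrays
// (assumed here to end up in the runtime's memmove).
func copyBySlice(dst, src *[200]int) {
	copy(dst[:], src[:])
}

func main() {
	var a, b, c [200]int
	for i := range a {
		a[i] = i
	}
	copyByAssign(&b, &a)
	copyBySlice(&c, &a)
	fmt.Println(b == a, c == a) // prints: true true
}
```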

rsc changed the title ~~cmd/{5,6,7,8,9g}: assigning large values does not use memmove~~ cmd/gc: assigning large values does not use memmove Apr 10, 2015

rsc added this to the Unplanned milestone Apr 10, 2015

rsc changed the title ~~cmd/gc: assigning large values does not use memmove~~ cmd/compile: assigning large values does not use memmove Jun 8, 2015

gopherbot added the compiler/runtime Issues related to the Go compiler and/or runtime. label Jul 13, 2022
