
math/big: optimize amd64 asm shlVU and shrVU for shift==0 case #31097

Open
josharian opened this issue Mar 28, 2019 · 3 comments
Labels: help wanted, NeedsFix (The path to resolution is known, but the work has not been done), Performance
Milestone: Backlog

Comments

@josharian
Contributor

When shift == 0, shlVU and shrVU reduce to a memcopy. When z.ptr == x.ptr, it further reduces to a no-op. The pure Go implementation has these optimizations, as of https://go-review.googlesource.com/c/go/+/164967. The arm64 implementation has one of them (see #31084 (comment)). We should add both to the amd64 implementation.
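For reference, a minimal pure Go sketch of those two fast paths (illustrative only, not the actual math/big source; `Word` and `_W` stand in for the package's internal definitions, and the aliasing check is spelled out explicitly):

```go
// Sketch of the shift==0 fast path that the pure Go shlVU gained in CL 164967.
type Word uintptr

const _W = 64 // word size in bits; assume a 64-bit platform for this sketch

func shlVU(z, x []Word, s uint) (c Word) {
	if s == 0 {
		// Shift by zero: the result is just x.
		if len(z) > 0 && &z[0] == &x[0] {
			return // z and x share a backing array; nothing to do
		}
		copy(z, x)
		return
	}
	if len(z) == 0 {
		return
	}
	// General case, assuming 0 < s < _W as math/big guarantees.
	ŝ := _W - s
	c = x[len(z)-1] >> ŝ
	for i := len(z) - 1; i > 0; i-- {
		z[i] = x[i]<<s | x[i-1]>>ŝ
	}
	z[0] = x[0] << s
	return
}
```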

cc @griesemer

@josharian added the Performance, help wanted, and NeedsFix labels on Mar 28, 2019
@josharian added this to the Go1.13 milestone on Mar 28, 2019
@josharian self-assigned this on Mar 29, 2019
@gopherbot

Change https://golang.org/cl/170257 mentions this issue: math/big: optimize amd64 asm shlVU and shrVU for shift==0 case

josharian added a commit to josharian/go that referenced this issue May 7, 2019
DO NOT MAIL

TODO: shrVU too
TODO: benchmarks
TODO: fuzz for confidence
TODO: better commit message

When shift == 0, shlVU and shrVU reduce to a memcopy. When z.ptr == x.ptr, it further reduces to a no-op. The pure Go implementation has these optimizations, as of https://go-review.googlesource.com/c/go/+/164967. The arm64 implementation has one of them (see golang#31084 (comment)). We should add both to the amd64 implementation.

cc @griesemer

Fixes golang#31097

Change-Id: I3979d7c82a63e1840c8191636a8947e8f440af3b
@andybons modified the milestone from Go1.13 to Go1.14 on Jul 8, 2019
@rsc modified the milestone from Go1.14 to Backlog on Oct 9, 2019
@nightlyone
Contributor

Can this be done in the wrappers/callers instead, so that the per-arch assembly as well as the generic implementation can simply assume the optimization has already been applied?

That would also let SSA see where these conditions might be constant, either now or in the future.

@josharian
Contributor Author

Good question. As of this moment there aren’t any pure Go wrappers for these functions; they all go straight to the assembly implementations. Now that we have mid-stack inlining, it might make sense to change that and do optimizations like this in the wrappers, so callers can skip the call entirely. Want to experiment and send a CL for 1.16 if appropriate?
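To make that concrete, a hypothetical sketch of such a wrapper (shlVU_asm is an assumed name for the per-arch assembly entry point, not an existing math/big identifier):

```go
// Hypothetical wrapper sketch: a small, inlinable pure Go function that
// handles the trivial cases itself and only calls into the per-arch
// assembly for real shifts.
func shlVU(z, x []Word, s uint) (c Word) {
	if s == 0 {
		if len(z) > 0 && &z[0] == &x[0] {
			return 0 // aliased operands: a zero shift is a no-op
		}
		copy(z, x)
		return 0
	}
	return shlVU_asm(z, x, s)
}

// shlVU_asm would be declared without a body here and implemented in
// per-arch assembly (e.g. arith_amd64.s); the name is assumed for this sketch.
func shlVU_asm(z, x []Word, s uint) (c Word)
```

With the checks in an inlinable Go wrapper, callers that pass a constant shift of zero could have the branch folded away entirely, which is the point raised above.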
