-
Notifications
You must be signed in to change notification settings - Fork 17.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
cmd/compile: memcombine should learn allignement of byte slices - easy mode - types #71778
Comments
This came up while I was reviewing some library I was considering using in my project and found this cursed (invalid) use of To fix all real world usecases it would be nice to have something like |
Your snippet compiles down to a single `MOVQ' on amd64 but not riscv64. It looks to me like the memcombine pass is not enabled on riscv64 for some reason. I don't see why alignment would matter - the RISC-V spec explicitly allows unaligned loads. |
It does, but it allows them to be implemented by taking a fault and simulating them in the OS. That would be very slow. |
The exact relevant lines of code are: go/src/cmd/compile/internal/ssa/config.go Line 187 in 81c66e7
go/src/cmd/compile/internal/ssa/memcombine.go Lines 19 to 23 in 81c66e7
Which is not enabled for riscv64 :go/src/cmd/compile/internal/ssa/config.go Lines 324 to 337 in 81c66e7
There is Zicclsm for Since linux v6.11 we can use the hwprobe api to check for load and store support using What I'm saying right now is that there are enough arm and riscv64 cores out there which will always be slow that people get around the compiler not being that smart by using unsafe in a liberal manner. |
Go version
go version devel go1.25-b38415d7e9 Sat Feb 15 21:47:27 2025 -0800 linux/amd64
Output of
go env
in your module/workspace:What did you do?
See this piece of code:
What did you see happen?
This code compiles to a a series of 8bits memory loads then shift them left and finally bitwise or them together.
What did you expect to see?
A single 64 bits memory load.
The text was updated successfully, but these errors were encountered: