-
Notifications
You must be signed in to change notification settings - Fork 17.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
cmd/compile: lower % using multiply and subtract when the result of / is known #16416
Comments
The tricky part is doing a % b -> a - b * (a/b) only when the divide is also used in the function. You don't want to do this rewrite if the divide isn't used elsewhere. |
Would SSA phi handling not kill the path to an unused result? From: Keith Randall notifications@github.com The tricky part is doing a % b -> a - b * (a/b) only when the divide is also used in the function. You don't want to do this rewrite if the divide isn't used elsewhere. — |
SSA does delete unused code. I don't think that's relevant here. For the function
We want to compile it to a MOD instruction, not a DIV, MUL, and SUB. In contrast,
We want to compile it to a DIV, MUL, and SUB instead of a DIV and MOD. This assumes we have no DIVMOD instruction, like x86 has. In that case, we want to use that instruction regardless (unless it is for some reason slower than DIV or MOD by itself). |
We already has the concept of multiple results (for ARM 64-bit arithmetic), |
@minux, we could do that but it's a bit of special-case magic. The other option is to always rewrite x % y -> x - y *(x/y), then rewrite it back to mod during lowering if there's only one use of x/y. That would require some coordination with CSE to make sure we count the uses of x/y correctly. Still some magic, maybe. |
Thinking out loud, here's a possibly terrible, half-baked idea. Introduce a Cheaper Op. (Choose? Any?) It takes a variable number of arguments, all of which must contain equivalent values. Rewrite x%y to Cheaper(x%y, x-y*(x/y)). During arch-specific optimizations, as appropriate, rewrite Cheaper(Select(...), ...) to the Select and afterwards (lower in file), rewrite Cheaper(v, w) to w. |
Isn’t this pretty much what the assembler-time instruction selection is doing? Just riffing on the open-thinking… We could have some Mod-like compile-time pseudo instructions N DIV D => Q,R N MOD D => R With the logic like this: PseudoDiv(N,D) => Phi(Q), Phi(R) Where what gets used later determines what instructions gets scheduled up front. This is the same idea as Josh’s but using (abusing?) the existing phi() mechanism of SSA. From: Josh Bleecher Snyder notifications@github.com Thinking out loud, here's a possibly terrible, half-baked idea. Introduce a Cheaper Op. (Choose? Any?) It takes a variable number of arguments, all of which must contain equivalent values. Rewrite x%y to Cheaper(x%y, x-y*(x/y)). During arch-specific optimizations, as appropriate, rewrite Cheaper(Select(...), ...) to the Select and afterwards (lower in file), rewrite Cheaper(v, w) to w. — |
Poked at this a bit. Two observations:
Before continuing on the original issue, I think it's worth knowing how common non-const divmod is. And even then, adding ADIVMOD(U) to ARM is probably the place to start. |
CL 25004 allows amd64 to compute a % b and a / b with a single instruction. However, rewriting code to use it currently risks a regression on architectures with a divmod instruction. We could probably do the rewrite at a generic level instead, and lower a % b to a - b * (a / b) on architectures without divmod. Then we could use the clearer, faster formulation everywhere.
cc @randall77 @MichaelTJones @martisch @bradfitz
The text was updated successfully, but these errors were encountered: