-
Notifications
You must be signed in to change notification settings - Fork 17.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
cmd/compile: use common ancestor blocks in CSE? #16973
Comments
0: common ancestor when all descendants compute E. It might be less of a problem child if we're careful; I recall a lot of the work goes into the partitioning, less into the finding common paths. Recall we already sort by dominator entry-number -- if there is a common ancestor to a bunch of nodes, then the first one is leftmost at its depth, and all the intervals (modulo numbering offset games, which we can deal with in a minute) must fill the interval of the common ancestor. There's some generational glitches -- suppose a is ancestor, and a.LL, a.LR, a.RRL, a.RRR, a.RL all have a common E, then LL+LR implies L, RRL+RRR implies RR, RR+RL implies R, L+R implies A. This might be a clever little recursive algorithm that is not that expensive. Handling the more complex cases might work best if we compute a dominator tree/forest for a flow graph that doesn't include panic edges, and we wouldn't want to extend lifetimes unnecessarily anyway just because a block had a common ancestor with a panic that shared a CSE. I.e., we could compute a different dominator numbering and use that for CSE purposes to get the effect of (1) above |
We need to be careful about moving things which can fault. For instance:
We can't move |
Keith, I think your recent CL making nil checks participate in the memory store chain should help with the last concern. |
FWIW, I've written recently a code hoisting pass here https://github.com/momchil-velikov/go/tree/dev.chill.gvn-hoist (it passes all tests with ssa/check/on, except some in nilptr3.go, where it hoists some nil checks). With this pass, the function in the first post compiles to:
|
The load
*p
occurs regardless of the value of b. That is, the compiler should treat this as equivalent to:It does not. The latter generates better code.
The reason is that neither block containing
*p
is an ancestor of the other, so CSE ignores the*p
values. However, they share a common ancestor in which all of the values' arguments are available.This happened in practice in CL 22712. Perhaps CSE should be taught to recognize situations like this.
Potential downsides:
cc @randall77 @dr2chase @aclements @bradfitz @tzneal
If we fix this, we should consider reverting CL 22712.
The text was updated successfully, but these errors were encountered: