-
Notifications
You must be signed in to change notification settings - Fork 17.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
runtime: significant number of bounds checks in growslice #60071
Comments
Change https://go.dev/cl/493796 mentions this issue: |
Do you have some benchmark results for how much the CL helps? Thanks. |
So, benchmarking all the
I haven't yet dug into the reasons for things getting slower. It might also make sense to try and see how it will interact with #49480 and https://go-review.googlesource.com/c/go/+/493836. |
Hmm, maybe it's due to needing to convert the division result into a uint8 and hence ending up emitting a MOVZX:
Also, maybe it's possible to use a single lookup instead of a two lookup. |
I guess one option would be to teach BCE about division and calculations, such as for https://go.godbolt.org/z/ccf4Gbs8E. But, I have no clue how complicated that would be.
|
My best guess is that the performance is very sensitive to code alignment on Threadripper 2950X amd64 Windows... for some reason. I'm seeing Anyways, I submitted the basic change. |
From my experiments and benchmarks, it seems that the benefit is at most ~2ns... and almost imperceptible due to code alignment affecting benchmark results about the same. So, it seems there's no performance win here. However, supporting division in BCE might be a good idea nevertheless. |
While inspecting the
runtime.growslice
code I noticed that it contains a lot of bounds checks. They all come fromruntime.roundupsize
.What version of Go are you using (
go version
)?However also happens with
go1.20.4
.Does this issue reproduce with the latest release?
Yes and with tip.
What did you do?
Inspecting
runtime.growslice
and seeing:What did you expect to see?
I'm not sure how much they affect performance, but triggering that many in
roundupsize
does seem a bit concerning.I tried to do the trivial patch with zero-filling the tables to exactly 256 elements https://go-review.googlesource.com/c/go/+/493796, however, I'm not convinced that this is the best way to address this. Either way, I'm willing to take a stab at fixing it.
The text was updated successfully, but these errors were encountered: