-
Notifications
You must be signed in to change notification settings - Fork 17.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
runtime: remove unnecessary allocations in convT2E #8892
Comments
@dvyukov, does this issue look relevant to you? |
@ysmolsky I don't understand the question. What do you mean? |
@dvyukov I think he is asking whether you think this issue should still be open. Really this is a question for @aclements and @RLH. Is this a feasible optimization with the current GC? |
It should still be a feasible optimization because we still support Go pointers to point to C heap, right? Another potential implementation would work for value types that are smaller than pointer size (e.g. uint16, uint32 on 64-bits, bool, small structs, etc). Namely: let's say we have amd64, pointer size is 64 bits, let's say we are storing uint32 into an interface{}, when we are storing the value we put 0xffffffff into the high 32-bits. This makes this value invalid/outside-of-heap if treated as pointer. Extraction won't require any special code, we just take the low 32 bits.
This passed bootstrap and almost passed Besides avoiding mallocgc call and not generating garbage, this also makes accesses faster (no indirection). Thoughts? |
This is inspired by a real use case. I need to store lots of integers in interfaces, but these won't fit into staticuint64s (up to thousands/tens of thousands). But they perfectly fit into uint32 and that's what I use as the type. |
I’m on my phone, but you probably also need to adjust ifaceData in the compiler. And ifaceeq and efaceeq in the runtime. (Also, it’s be more impressive to get uint8/uint32 working—uint16s are not heavily used.) Another option, which is less likely to cause rare breakage (e.g. in assembly) is to keep an atomic counter in the slow path of convT32 for smallish values and, when used enough, persistent alloc and populate a staticuint64s-like array with a larger range. Or keep a per-P cache of recently allocated values so that we can re-use them. (I tried this for string interning, and it was pretty straightforward.) |
There's an endianness issue in that code, that's a minor problem, also the
description of the pointer smash (0xffffffff) in upper 32 bits doesn't
match what the code does (uintptr(val) | 1<<63).
But otherwise, interesting, though I am worried about assembly language.
…On Sat, Apr 25, 2020 at 12:21 PM Josh Bleecher Snyder < ***@***.***> wrote:
I’m on my phone, but you probably also need to adjust ifaceData in the
compiler. And ifaceeq and efaceeq in the runtime. (Also, it’s be more
impressive to get uint8/uint32 working—uint16s are not heavily used.)
Another option, which is less likely to cause rare breakage (e.g. in
assembly) is to keep an atomic counter in the slow path of convT32 for
smallish values and, when used enough, persistent alloc and populate a
staticuint64s-like array with a larger range.
Or keep a per-P cache of recently allocated values so that we can re-use
them. (I tried this for string interning, and it was pretty
straightforward.)
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#8892 (comment)>, or
unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAOW6JZM77AJJBGOQP7NRXLROMEXTANCNFSM4GCAPQ3Q>
.
|
The text was updated successfully, but these errors were encountered: