
runtime: use mremap to move "big" stacks when growing on linux #52910

Closed
Jorropo opened this issue May 15, 2022 · 8 comments
Labels
FrozenDueToAge NeedsInvestigation Performance
Milestone
Backlog

Comments

@Jorropo (Member) commented May 15, 2022

mremap allows moving part of the virtual address space while leaving the physical pages unchanged; it also allows growing the moved region with zero-filled pages.

This makes it a good candidate for resizing big stacks, since the move is essentially an ~O(1) operation (it is O(N) in the number of pages, but that cost is minimal and heavily dominated by the kernel-entry overhead).
It might even let us skip taking a few locks, because we wouldn't have to interact with the shared memory pool (although maybe we still would, to keep the address space coherent?).

This would be a non-portable optimization, as this syscall is Linux-specific.
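
For reference, a minimal sketch of the syscall's behavior (raw syscall use, with the MREMAP_MAYMOVE value assumed from the kernel headers; not how the runtime would actually integrate it):

package main

import (
    "fmt"
    "syscall"
    "unsafe"
)

// MREMAP_MAYMOVE from the kernel headers: lets the kernel relocate the mapping.
const mremapMaymove = 0x1

func main() {
    const oldSize = 1 << 20 // 1 MiB
    const newSize = 2 << 20 // grow to 2 MiB

    mem, err := syscall.Mmap(-1, 0, oldSize,
        syscall.PROT_READ|syscall.PROT_WRITE,
        syscall.MAP_PRIVATE|syscall.MAP_ANONYMOUS)
    if err != nil {
        panic(err)
    }
    mem[0] = 1 // touch a page so there is something to move

    // Grow the mapping. The kernel may move it to a new virtual address,
    // carrying the existing physical pages over and zero-filling the new
    // tail; the old contents are never copied byte by byte.
    addr, _, errno := syscall.Syscall6(syscall.SYS_MREMAP,
        uintptr(unsafe.Pointer(&mem[0])), oldSize, newSize,
        mremapMaymove, 0, 0)
    if errno != 0 {
        panic(errno)
    }
    grown := unsafe.Slice((*byte)(unsafe.Pointer(addr)), newSize)

    fmt.Println(grown[0])         // 1: old data preserved without a copy
    fmt.Println(grown[newSize-1]) // 0: the new tail is zero-filled
    // mem may be dangling now if the mapping moved; don't touch it again.
}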

@mknyszek mknyszek added Performance NeedsInvestigation Someone must examine and confirm this is a valid issue and not a duplicate of an existing one. labels May 17, 2022
@mknyszek mknyszek added this to the Backlog milestone May 17, 2022
@mknyszek (Contributor)

CC @golang/runtime

@randall77 (Contributor)

I'm not sure exactly how this would help. We wouldn't have to copy the memory, sure, but we still have to go through the stack and update any pointers into the stack. I believe that is the much more expensive part, because it requires unwinding the stack, looking up funcdata, traipsing through pointer bitmaps, and all that.
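
For illustration, a frame that keeps a pointer into its own stack, the kind of pointer that has to be rewritten whenever the stack moves (a hypothetical minimal example):

package main

import "fmt"

// frame keeps a pointer into its own stack. Moving the stack's bytes
// (by copy or by remap) is not enough: p would still hold the old
// address of x, so the runtime must unwind the stack, consult each
// frame's pointer bitmaps, and rewrite such pointers to the new stack.
func frame() int {
    var x int
    p := &x // a pointer into the current stack frame
    *p = 7
    return *p
}

func main() {
    fmt.Println(frame())
}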

@Jorropo (Member, Author) commented May 17, 2022

OK, thanks, I didn't know about that part (I naively assumed everything that wasn't accessed through a stack-relative pointer was moved to the heap).
This probably wouldn't have a significant impact then.

@Jorropo Jorropo closed this as completed May 17, 2022
@CAFxX (Contributor) commented May 18, 2022

Just as food for thought, mremap may still be helpful when a large slice containing noscan elements needs to grow during append. This may be a somewhat niche use case (especially as we may need to prove that the original array is dead after the append), but in the cases where it applies it would likely be very effective.

This may warrant a different ticket though.

(Also worth pointing out that while mremap is Linux-specific, other OSes like Windows should allow achieving the same effect with different APIs.)
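
A sketch of the case I have in mind (sizes are arbitrary; today this append does a plain allocate-and-memmove):

package main

func main() {
    // buf's backing array is large and noscan (plain bytes, no pointers).
    buf := make([]byte, 64<<20)
    extra := make([]byte, 1<<20)

    // Today this append allocates a bigger array and memmoves ~64 MiB.
    // If the old array were provably dead afterwards, mremap could move
    // its pages to the new address instead of copying them.
    buf = append(buf, extra...)
    _ = buf
}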

@Jorropo (Member, Author) commented May 18, 2022

@CAFxX I don't think it can work under current conditions, because mremap unmaps the old range, so if another reference exists it will SIGSEGV when accessed after the remap.
Consider code like this:

s1, s2 := getSlices()
s3 := append(s1, s2...) // assume s1 is big and a remap happens
s1[0] = 1               // SIGSEGV: s1's mapping no longer exists; its pages were remapped to s3

There are options for dealing with that:

  • MREMAP_DONTUNMAP (Linux 5.7) will not unmap the previous range; instead it is left in place and reads back as zero-filled pages (see the sketch after this list).
    I don't think that helps much on its own: OK, it doesn't SIGSEGV, but we can't replace the data with zeros either.
  • MREMAP_DONTUNMAP + userfaultfd: this would allow writing a custom page-fault handler that does a COW of the slice when it is touched. However, this solution sounds really complex (only the old mapping is trapped, so we would need to set up a trap on the new one too, ...), I don't know the cost of userfaultfd to begin with, and it would only be faster assuming not all of the mapping is written to (so I would rather call this optimization "lazy slice copy").
  • A third option would be to add an MREMAP_COW flag to Linux, which would do what you'd expect.
    But given kernel release cycles, the speed at which people update their software, ... it would realistically only become usable in a few years.
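
A sketch of the first option, to make its semantics concrete (constant values assumed from <linux/mman.h>; needs Linux >= 5.7, and MREMAP_DONTUNMAP requires MREMAP_MAYMOVE with equal old and new sizes):

package main

import (
    "fmt"
    "syscall"
    "unsafe"
)

// Values assumed from the kernel headers; MREMAP_DONTUNMAP needs
// Linux >= 5.7, must be combined with MREMAP_MAYMOVE, and only works
// on private anonymous mappings with new size == old size.
const (
    mremapMaymove   = 0x1
    mremapDontunmap = 0x4
)

func main() {
    const size = 1 << 20

    old, err := syscall.Mmap(-1, 0, size,
        syscall.PROT_READ|syscall.PROT_WRITE,
        syscall.MAP_PRIVATE|syscall.MAP_ANONYMOUS)
    if err != nil {
        panic(err)
    }
    old[0] = 42

    // Move the physical pages to a new address while leaving the old
    // range mapped; the old range degrades to zero-fill-on-demand pages
    // (or to userfaultfd traps, if registered).
    addr, _, errno := syscall.Syscall6(syscall.SYS_MREMAP,
        uintptr(unsafe.Pointer(&old[0])), size, size,
        mremapMaymove|mremapDontunmap, 0, 0)
    if errno != 0 {
        panic(errno)
    }
    moved := unsafe.Slice((*byte)(unsafe.Pointer(addr)), size)

    fmt.Println(moved[0]) // 42: the data travelled with the pages
    fmt.Println(old[0])   // 0: no SIGSEGV, but the old data reads as zeros
}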

@CAFxX (Contributor) commented May 18, 2022

@Jorropo sure, that's why I mentioned:

we may need to prove that the original array is dead after the append

If the array is not dead, it becomes much more complicated (and probably not worth the complexity), as you correctly point out.

@Jorropo (Member, Author) commented May 18, 2022

@CAFxX

My bad, but if we are able to prove that, I think better optimizations are possible:

  • Removing the growslice to begin with (or coalescing appends together), as in the sketch after this list.
  • Using stack allocations.
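
For instance, a hypothetical coalescing rewrite (assuming the compiler can prove the intermediate slice values are dead):

package main

import "fmt"

// Hypothetical rewrite: if the intermediate slice values are provably
// dead, two successive appends can be coalesced into a single allocation
// sized up front, removing the extra growslice (and copy) entirely.
func coalesced(s, a, b []byte) []byte {
    out := make([]byte, 0, len(s)+len(a)+len(b))
    out = append(out, s...)
    out = append(out, a...)
    return append(out, b...)
}

func main() {
    fmt.Println(string(coalesced([]byte("x"), []byte("y"), []byte("z"))))
}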

There would be counterexamples, like when too many conditionals are involved (then we couldn't do the optimizations I'm thinking about, but we could still remap), but I doubt this shows up in real code.

If you think this is worthwhile, I think you should open a new issue. :)

@CAFxX (Contributor) commented May 18, 2022

Sure.

Just to address your points: proving the array is dead unfortunately would not automatically mean we can coalesce (e.g. if we're appending data coming from external sources), and stack allocations definitely would not help under my assumption of large slices (both because of the size threshold for stack allocations and, more generally, because we would again run into the same problem mentioned above).

@golang golang locked and limited conversation to collaborators May 18, 2023