
proposal: x/exp/maps: Shrink #54454

Closed
CAFxX opened this issue Aug 15, 2022 · 10 comments


CAFxX commented Aug 15, 2022

I propose adding a Shrink function to x/exp/maps[1], with the following semantics, to partially address #20135 and #19094 (and the many external issues linked in those discussions):

// Shrink attempts to shrink the provided map to release unneeded memory.
// It can be used by applications when they expect the map to not
// significantly grow in the near future.
// Shrink never modifies the contents of the map, only its internal capacity.
// Whether the map is shrunk depends on the runtime and the internal
// state of the map (including e.g. how big and full the map is).
// Large, mostly-empty maps are likely to be shrunk, whereas mostly-full
// maps are unlikely to be shrunk.
// No concurrent read or write operation should be performed on the map
// by other goroutines during a call to Shrink.
func Shrink[M ~map[K, V], K comparable, V any](m M)

As an initial implementation, Shrink would likely just check if any of the following conditions is true, in which case it would perform the appropriate action:

  1. the map is mostly empty (e.g. average load factor < 6.5/4) -> appropriate action: clone map and assign in-place
  2. the map is growing -> appropriate action: complete map growth

In the future, this may be potentially extended to consider memory pressure, or alternatively turned into a no-op if the runtime were to fully address both #20135 and #19094 (although even with those implemented, Shrink could still be used to handle the case of a map that receives no reads and writes).

Users today can call Clone or perform a manual clone of an existing map, but they have no visibility into the internal state of the map, so they cannot know whether doing so will actually decrease memory usage or whether they are just wasting CPU cycles. Users could approximate this by manually tracking the high watermark and comparing it to the current length of the map, but 1) this can be quite difficult to do, as all map writes need to be instrumented, and 2) it still does not address the "map is growing" case.

With the proposed addition, users could instead simply call Shrink periodically, when the map is not in use, to ensure that memory is not leaked. Note that this proposal also addresses the case in which a map stops receiving writes (or reads+writes) entirely, which would not be covered by any existing or proposed built-in runtime mechanism, as they all require at least some operation to piggyback on.

Footnotes

  1. While this would be exposed in x/exp/maps, the bulk of the implementation would likely need to live in the runtime.

@CAFxX CAFxX added the Proposal label Aug 15, 2022
@gopherbot gopherbot added this to the Proposal milestone Aug 15, 2022

dsnet commented Aug 15, 2022

Currently the maps package has no package dependencies. Presumably the implementation of this would at least require unsafe and possibly some hooks into runtime as well, right?

@randall77 (Contributor):

I think I'd rather fix #20135 and #19094, then we wouldn't need this.


CAFxX commented Aug 15, 2022

I think I'd rather fix #20135 and #19094, then we wouldn't need this.

Definitely agree that directly fixing the underlying issues would be better, but it's also true that those issues have been open for years, and there seems to be no consensus on how to proceed (or even whether we can proceed).

This proposal should be seen as an effective intermediate step, as it offers a clean way forward even if we later do fix #20135 and/or #19094.

Presumably the implementation of this would at least require unsafe and possibly some hooks into runtime as well, right?

I think the simplest way is to implement it in runtime, and just expose it via x/exp/maps. This would require a single method (AFAICT no hooks) to be exported via go:linkname from the runtime and into x/exp/maps (and you are correct, this would require importing unsafe in x/exp/maps).

@randall77 (Contributor):

and there seem to be no consensus on how to proceed (or even whether we can proceed).

Shrink-on-delete seems straightforward, we just need someone to implement it. Josh already checked in most of the mechanism needed for the tooManyOverflowBuckets change. "can" = yes, "how" = allocate a person to it.
Grow-on-read seems harder, but I don't see any obvious blockers. We'd need a design sketch first, I think, before "can" is certain. Once we have that, "how" is again people-limited.


CAFxX commented Aug 16, 2022

Shrink-on-delete seems straightforward, we just need someone to implement it. Josh already checked in most of the mechanism needed for the tooManyOverflowBuckets change.

My interpretation of Josh's comment was the opposite, i.e. that the mechanisms needed for the tooManyOverflowBuckets were somewhat unlikely to help with #20135.

As an additional hurdle, mapclear currently does not change the bucket count. Would the fix for #20135 change this, even though #20138 explicitly argued against changing the bucket count?

And, while I haven't given this much thought, I can imagine some tricky tradeoffs in handling a map that starts growing in the middle of a shrink operation.

On the topic of capacity, and intersecting with the previous point, what should the runtime do with a map that was created with a large capacity hint while it's growing? What if the hint was too large, and the map's length never gets anywhere close to that capacity hint?

Lastly, how do we plan to ensure that the fix for #20135 won't hurt workloads some people may care about, especially since there may be no available workaround for them?

All of this for the simpler case of shrink-on-delete[1]; the case of grow-on-read is even fuzzier and may not be solved for quite a while (especially considering the potential performance implications).

Mind you, I am not saying these are insurmountable issues, or that this proposal is an actual alternative to solving them properly. I am just arguing that it has been 5+ years, and that maybe we shouldn't let perfect be the enemy of good (especially in case good won't hurt us in the future, as we would have an easy way to guarantee compatibility across versions).

Footnotes

  1. that I am assuming would be implemented synchronously and incrementally, similarly to the way map growth is; some of these issues would go away if we instead used an asynchronous, incremental, GC-mediated approach, closer to the way sync.Pool cleanup is done, but I suspect that would bring its own share of new issues

@randall77 (Contributor):

I'm just advocating that we've long avoided adding API for situations where we could fix the issue without adding API. I understand that could lead to long delays in getting things implemented, but that's the tradeoff we have historically made. Top of mind recently, see #29951.

My interpretation of #20135 (comment) was the opposite, i.e. that the mechanisms needed for the tooManyOverflowBuckets were somewhat unlikely to help with #20135.

It is a good question. I remember when reviewing that change that it seemed applicable to me. Maybe Josh was seeing something I didn't.

As an additional hurdle, mapclear currently does not change the bucket count. Would the fix for #20135 change this, even though #20138 explicitly argued against changing the bucket count?

That is a good point. We probably don't want to shrink if people do the map clear idiom + reinsert a new set of stuff. Perhaps we only start shrinking after 2N operations, so that N delete + N add involves no shrinking. There's definitely a tradeoff here between responsiveness when you use fewer entries and extra work that needs to be done to regrow.

And, while I haven't given this too much thought, I can imagine some tricky tradeoffs when dealing with how to deal with the case of a map that starts growing in the middle of a shrink operation.

I suspect we could deal with this the way inserts work. When we grow from capacity N to 2N, we do enough work at each insert that the grow work is guaranteed to be done by the time the number of entries reaches 2N.
Similarly, when there are fewer than N/4 entries in an N-capacity map, we could shrink to capacity N/2. If we move at least 1 of the N/4 entries on each subsequent operation, then we're guaranteed we won't exceed our new N/2 capacity before the shrink is done. And if we move 2 entries per operation, we can guarantee that the shrink is done before we'd want to shrink again (at N/8 entries in our new N/2-capacity map).

On the topic of capacity, and intersecting with the previous point, what should the map do with a map with a high hint while it's growing? What if the hint was too large, and the map length never reaches anywhere close to that capacity hint?

I think this is a case of tuning a heuristic. We do at some point need to give up on the hint and start shrinking. (I guess the other option is to always respect the hint, but I think that overassumes accurate hints.)
We may need to remember the hint for a period of time. But maybe that's not necessary. Maybe as long as inserts, or (inserts-deletes), are happening at a sufficient positive rate, we don't shrink.

Lastly, how do we plan to ensure that the fix for #20135 won't hurt workloads some people may care about, especially since there may be no available workaround for them?

It's going to hurt someone, I'm not too hung up on that. Performance optimization is seldom a win-win. I'm looking for something as good or better than an 80% win / 20% lose.


rsc commented Mar 15, 2023

It sounds like we believe this need can be addressed with better tuning, rather than adding new API. That would suggest we should decline this proposal. Do I have that right?


rsc commented Mar 29, 2023

This proposal has been added to the active column of the proposals project
and will now be reviewed at the weekly proposal review meetings.
— rsc for the proposal review group


rsc commented Apr 6, 2023

Based on the discussion above, this proposal seems like a likely decline.
— rsc for the proposal review group


rsc commented Apr 12, 2023

No change in consensus, so declined.
— rsc for the proposal review group

@rsc rsc closed this as completed Apr 12, 2023
@golang golang locked and limited conversation to collaborators Apr 11, 2024