-
Notifications
You must be signed in to change notification settings - Fork 17.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
runtime: TestArenaCollision has many "out of memory" failures on linux-ppc64le-power9osu #35514
Comments
I'm having a very difficult time reproducing this. 10000 consecutive runs and nothing. The test's failure should be independent of the full runtime test since it spins up a new process for it, so I'm led to believe that this is actually just the machine saying it's out of memory (and not the test doing something weird/wrong in the face of new changes). With that being said, my next best guess is that the machine has swap enabled and it's trying to reserve swap space for all the |
I noticed this the other day and I am not able to reproduce the problem either on any of our systems. |
Also affected linux/arm64
|
I suspect the problem is that |
Change https://golang.org/cl/207758 mentions this issue: |
Change https://golang.org/cl/207757 mentions this issue: |
This was happening fairly frequently but then it stopped. The most recent occurrence is 2019-11-19T15:30:58-580337e/linux-ppc64le-power9osu I'm calling this fixed. |
As a post-mortem, I suspect https://go-review.googlesource.com/c/go/+/207497 actually helped a lot here, since each L2 entry is only 1 MiB in size now, and the page bitmap had the biggest footprint. https://go-review.googlesource.com/c/go/+/207758 reduces the footprint of the test by an order of magnitude, which should make concerns about a discontiguous address space go away for good with the new page allocator. |
This change adds a new inUse field to the allocator which tracks ranges of addresses that are owned by the heap. It is updated on each heap growth. These ranges are tracked in an array which is kept sorted. In practice this array shouldn't exceed its initial allocation except in rare cases and thus should be small (ideally exactly 1 element in size). In a hypothetical worst-case scenario wherein we have a 1 TiB heap and 4 MiB arenas (note that the address ranges will never be at a smaller granularity than an arena, since arenas are always allocated contiguously), inUse would use at most 4 MiB of memory if the heap mappings were completely discontiguous (highly unlikely) with an additional 2 MiB leaked from previous allocations. Furthermore, the copies that are done to keep the inUse array sorted will copy at most 4 MiB of memory in such a scenario, which, assuming a conservative copying rate of 5 GiB/s, amounts to about 800µs. However, note that in practice: 1) Most 64-bit platforms have 64 MiB arenas. 2) The copies should incur little-to-no page faults, meaning a copy rate closer to 25-50 GiB/s is expected. 3) Go heaps are almost always mostly contiguous. Updates #35514. Change-Id: I3ad07f1c2b5b9340acf59ecc3b9ae09e884814fe Reviewed-on: https://go-review.googlesource.com/c/go/+/207757 Run-TryBot: Michael Knyszek <mknyszek@google.com> TryBot-Result: Gobot Gobot <gobot@golang.org> Reviewed-by: Cherry Zhang <cherryyz@google.com> Reviewed-by: Austin Clements <austin@google.com>
Prior to this change, if the heap was very discontiguous (such as in TestArenaCollision) it's possible we could map a large amount of memory as R/W and commit it. We would use only the start and end to track what should be mapped, and we would extend that mapping as needed to accomodate a potentially fragmented address space. After this change, we only map exactly the part of the summary arrays that we need by using the inUse ranges from the previous change. This reduces the GCSys footprint of TestArenaCollision from 300 MiB to 18 MiB. Because summaries are no longer mapped contiguously, this means the scavenger can no longer iterate directly. This change also updates the scavenger to borrow ranges out of inUse and iterate over only the parts of the heap which are actually currently in use. This is both an optimization and necessary for correctness. Fixes #35514. Change-Id: I96bf0c73ed0d2d89a00202ece7b9d089a53bac90 Reviewed-on: https://go-review.googlesource.com/c/go/+/207758 Run-TryBot: Michael Knyszek <mknyszek@google.com> TryBot-Result: Gobot Gobot <gobot@golang.org> Reviewed-by: Austin Clements <austin@google.com>
2019-11-11T20:17:10-b1159ba/linux-ppc64le-power9osu
2019-11-11T18:00:18-ad7ce39/linux-ppc64le-power9osu
2019-11-10T20:36:44-47bc240/linux-ppc64le-power9osu
2019-11-08T22:44:29-42db1da/linux-ppc64le-power9osu
2019-11-08T20:50:17-9e914f5/linux-ppc64le-power9osu
2019-11-08T19:28:49-f6ff806/linux-ppc64le-power9osu
2019-11-08T18:00:54-a2cd2bd/linux-ppc64le-power9osu
2019-11-08T17:01:05-a5a6f61/linux-ppc64le-power9osu
2019-11-08T16:44:48-374c284/linux-ppc64le-power9osu
CC @aclements @mknyszek @laboger @ceseo
The text was updated successfully, but these errors were encountered: