
runtime: maybe allgs should shrink after peak load #34457

Open
cch123 opened this issue Sep 22, 2019 · 8 comments
Labels
compiler/runtime: Issues related to the Go compiler and/or runtime.
NeedsInvestigation: Someone must examine and confirm this is a valid issue and not a duplicate of an existing one.
Milestone
Unplanned

Comments

cch123 (Contributor) commented Sep 22, 2019

What version of Go are you using (go version)?

$ go version
go version go1.12.4 linux/amd64

Does this issue reproduce with the latest release?

Y

What operating system and processor architecture are you using (go env)?

any

What did you do?

When serving peak load, the program creates a large number of goroutines; after the load subsides, the leftover goroutine (g) objects keep consuming memory and cause extra CPU usage.

This can be reproduced by:

package main

import (
	"log"
	"net/http"
	_ "net/http/pprof" // expose /debug/pprof endpoints for heap inspection
	"time"
)

func sayhello(wr http.ResponseWriter, r *http.Request) {}

func main() {
	// Simulate a load peak: one million goroutines that all exist
	// concurrently for ten seconds and then exit.
	for i := 0; i < 1000000; i++ {
		go func() {
			time.Sleep(time.Second * 10)
		}()
	}
	// Keep the process alive and serve pprof so the heap can be
	// inspected after the goroutines have exited.
	http.HandleFunc("/", sayhello)
	err := http.ListenAndServe(":9090", nil)
	if err != nil {
		log.Fatal("ListenAndServe:", err)
	}
}

After 10 seconds, once all of the goroutines have exited, the number of in-use objects still remains the same.

[flame graph screenshot: flame3]
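One minimal way to check this, assuming the repro above is running locally (the command below is not part of the original report), is to pull the heap profile from the pprof endpoint the program already exposes and look at the in-use objects attributed to runtime.malg:

go tool pprof -inuse_objects http://localhost:9090/debug/pprof/heap
(pprof) top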

What did you expect to see?

The g objects accumulated in the global allgs list shrink back to a reasonable size once the goroutines have exited.

What did you see instead?

Many in-use objects allocated by runtime.malg remain even after the goroutines have exited.

@cch123 cch123 changed the title maybe allgs should shrink after peak load runtime : maybe allgs should shrink after peak load Sep 22, 2019
@cch123 cch123 changed the title runtime : maybe allgs should shrink after peak load runtime: maybe allgs should shrink after peak load Sep 22, 2019
zboya commented Sep 23, 2019

In fact, allgs is never shrunk, which is not good for stability. The runtime should provide a strategy to reduce it, for example having sysmon detect that more than half of the gs are dead and then release them.

agnivade (Contributor) commented

@aclements @mknyszek

@agnivade agnivade added the NeedsInvestigation Someone must examine and confirm this is a valid issue and not a duplicate of an existing one. label Sep 23, 2019
@agnivade agnivade added this to the Unplanned milestone Sep 23, 2019
changkun commented (this comment has been minimized)

aclements (Member) commented

Your observation is correct. Currently the runtime never frees the g objects created for goroutines, though it does reuse them. The main reason for this is that the scheduler often manipulates g pointers without write barriers (a lot of scheduler code runs without a P, and hence cannot have write barriers), and this makes it very hard to determine when a g can be garbage collected.
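For illustration only (this sketch is not from the thread), the "never freed but reused" behavior can be observed by spawning two equal waves of goroutines and comparing live heap object counts after forced GCs: the count jumps after the first wave and does not fall back, but it barely grows after the second wave because the pooled gs are reused.

package main

import (
	"fmt"
	"runtime"
	"sync"
)

// spawn starts n goroutines that all exist concurrently before any
// of them exits, mimicking a load peak.
func spawn(n int) {
	var wg sync.WaitGroup
	release := make(chan struct{})
	for i := 0; i < n; i++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			<-release // block so all n goroutines are alive at once
		}()
	}
	close(release)
	wg.Wait()
}

// heapObjects forces a GC and reports the live heap object count.
func heapObjects() uint64 {
	runtime.GC()
	var m runtime.MemStats
	runtime.ReadMemStats(&m)
	return m.HeapObjects
}

func main() {
	fmt.Println("before:        ", heapObjects())
	spawn(1000000)
	fmt.Println("after 1st wave:", heapObjects())
	spawn(1000000) // gs from the first wave are reused here
	fmt.Println("after 2nd wave:", heapObjects())
}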

One possible solution is to use an RCU-like reclamation scheme over the Ms that understands when each M's scheduler passes through a quiescent state. Then we could schedule unused gs to be reclaimed after a grace period, when all of the Ms have been in a quiescent state. Unfortunately, we can't simply use STWs to detect this grace period because those stop all Ps, so, just like the write barriers, those won't protect against scheduler instances manipulating gs without a P.

@changkun, I'm not sure what your benchmark is measuring. Calling runtime.GC from within a RunParallel doesn't make sense. The garbage collector is already concurrent, and calling runtime.GC doesn't start another garbage collection until the first one is done. Furthermore, if there are several pending runtime.GC calls, they'll all be coalesced into a single GC. If the intent is to just measure how long a GC takes, just call runtime.GC without the RunParallel.
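As a hedged sketch of what that suggestion looks like (this is not changkun's original benchmark), measuring GC time directly without RunParallel could be as simple as the following, placed in a _test.go file:

package main

import (
	"runtime"
	"testing"
)

// BenchmarkGC reports how long one full, forced GC cycle takes.
func BenchmarkGC(b *testing.B) {
	for i := 0; i < b.N; i++ {
		runtime.GC() // blocks until the forced collection completes
	}
}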

changkun commented (this comment has been minimized)

aclements (Member) commented

Calling runtime.GC within a RunParallel does not measure contention on allglock. The GCs are serialized by runtime.GC itself, so they're not fighting over allglock, and they're coalesced by runtime.GC, so calling runtime.GC N times concurrently can result in anywhere from 1 to N GCs depending on vagaries of scheduling.

Benchmark aside, though, I think we're all clear on the issue that allgs is never collected and that impacts GC time and heap size.

Since gs are just heap allocated, it would make the most sense to collect them during GC like other heap allocations. The question is when it's safe to unlink them from allgs and allow them to be collected, given that the normal GC reachability invariants don't apply to gs. (At the same time, we don't want to be over-aggressive about unlinking them from allgs either, since we want the allocation pooling behavior to reduce the cost of starting a goroutine.) This is certainly doable, though it would require a fair amount of care.

matteo-gz commented

So should we just defer runtime.GC() every time?

davecheney (Contributor) commented

@matteo-gz GC is run between benchmarks. For asking questions, see:
