
cmd/go: 'go test' should dynamically compute best timeout #12446

Open
dsnet opened this issue Sep 2, 2015 · 8 comments
Labels
NeedsDecision Feedback is required from experts, contributors, and/or the community before a change can be made.

Comments

@dsnet (Member) commented Sep 2, 2015

Using go1.5

I discovered that "go test" crashes after the default timeout of 600s has passed. I was running all the benchmarks in compress/flate with -benchtime 10s. Since each benchmark runs for about 10s or more, and there are 36 benchmarks in that package, the run takes at least 360s, which is already close to the default of 600s.

In situations like these, the go test tool should be able to figure out that a 600s timeout is cutting things close when running 36 benchmarks for 10s each, and choose a more appropriate timeout.

@dsnet (Member, Author) commented Sep 2, 2015

One possible timeout is: `max(10*time.Minute, 5*benchTime*numBenchmarks)`

The 5x multiplier was chosen because the benchmark iteration counts follow the progression 1x, 2x, 5x, 10x, 20x, 50x, etc. Thus, at worst, the last run of a Benchmark may take about 2.5x longer than the targeted time (e.g. when the 5x run follows a 2x run that just missed the target). Also, since the summation of all the runs of a given Benchmark resembles a geometric series with r=1/2, we get another factor of 2x. Thus, 2*2.5 = 5x.
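
A minimal sketch of this heuristic in Go (`benchTime` and `numBenchmarks` are assumptions here; cmd/go would have to know both up front):

```go
package timeoutcalc

import "time"

// computeTimeout sketches the heuristic above: at least 10 minutes, or
// 5x the total targeted benchmark time, whichever is larger. benchTime
// and numBenchmarks are hypothetical inputs that cmd/go would supply.
func computeTimeout(benchTime time.Duration, numBenchmarks int) time.Duration {
	computed := 5 * benchTime * time.Duration(numBenchmarks)
	if computed < 10*time.Minute {
		return 10 * time.Minute
	}
	return computed
}
```

For the compress/flate example above, computeTimeout(10*time.Second, 36) would yield 1800s rather than the 600s default.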

@dsnet (Member, Author) commented Oct 4, 2016

The addition of sub-benchmarks has made this feature harder to implement (if not impractical) since the number of benchmarks is not known ahead of time.

@davecheney (Contributor) commented

I don't think this bug should be addressed. That is to say, I don't think we should change the go tool, as doing so would enable people to continue to do the wrong thing.

To benchmark accurately, you should have a before and an after sample and run both several times back to back to rule out thermal variation. This is what the -c flag gives you: a binary that you can run with whatever timeout and count flags you need on dedicated benchmarking hardware.

Working around this inside go test encourages people to benchmark on shared cloud CI hardware, which is inaccurate.

@josharian (Contributor) commented

See also #19394.

@bcmills (Contributor) commented Jan 23, 2019

> The addition of sub-benchmarks has made this feature harder to implement (if not impractical) since the number of benchmarks is not known ahead of time.

One option would be to apply the timeout to only the test portion, and allow the benchmarks to continue running for -benchtime each. (For example, cmd/go could look for the messages that indicate the beginning and end of benchmarks and pause the timeout during that interval.)
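
One way to picture the pausing mechanics (a sketch only; how cmd/go would actually detect benchmark boundaries in the output is an assumption):

```go
package pausetimer

import "time"

// pausableTimeout tracks a budget for the test (non-benchmark) portion.
// resume would be called while ordinary test output is flowing, and
// pause when a benchmark-start marker is seen.
type pausableTimeout struct {
	remaining time.Duration // budget left for the non-benchmark portion
	started   time.Time     // start of the current timed interval
	running   bool
}

// pause stops the clock, e.g. on a hypothetical benchmark-start marker.
func (p *pausableTimeout) pause() {
	if p.running {
		p.remaining -= time.Since(p.started)
		p.running = false
	}
}

// resume restarts the clock, e.g. on a hypothetical benchmark-end marker.
func (p *pausableTimeout) resume() {
	p.started = time.Now()
	p.running = true
}

// expired reports whether the test-portion budget is used up.
func (p *pausableTimeout) expired() bool {
	if p.running {
		return time.Since(p.started) >= p.remaining
	}
	return p.remaining <= 0
}
```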

@bcmills added the NeedsDecision label Jan 23, 2019
@josharian (Contributor) commented

> allow the benchmarks to continue running for -benchtime each

That sounds eminently sensible. Unfortunately, -benchtime measures the time the benchmark timer is running, not the time the benchmark code is running. So any benchmark that does non-trivial untimed work using StartTimer and StopTimer will run afoul of this. Also, benchmark warm-up and over-estimation commonly cause a benchmark without StartTimer/StopTimer/ResetTimer to take a multiple of -benchtime. (I want to improve that, e.g. #27168 (comment), but no fix will ever be complete.)
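
For instance, a benchmark shaped like this (in a `_test.go` file; `expensiveSetup` and `process` are hypothetical stand-ins) spends untimed wall-clock time on every iteration, so its real duration can be a large multiple of -benchtime:

```go
package bench

import "testing"

// expensiveSetup and process are hypothetical stand-ins for real work.
func expensiveSetup() []byte { return make([]byte, 1<<20) }
func process(data []byte)    { _ = data }

func BenchmarkWithSetup(b *testing.B) {
	for i := 0; i < b.N; i++ {
		b.StopTimer()
		data := expensiveSetup() // untimed, but still burns wall-clock time
		b.StartTimer()
		process(data) // only this portion counts toward -benchtime
	}
}
```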

@josharian (Contributor) commented

I keep getting bitten by this.

Yet another option here is to statically detect when the user is guaranteed to hit a timeout: multiply benchtime * count * leaf-benchmarks, and if the product is greater than the timeout, don't even run any benchmarks; exit immediately. This would require running all benchmarks for 0 iterations up front to discover all sub-benchmarks. (There are no guarantees that this would work, but the whole calculation is only best effort anyway.)
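
A sketch of that check (assuming the zero-iteration discovery pass could yield a `leafBenchmarks` count; the other values come straight from the flags):

```go
package timeoutcheck

import "time"

// wouldTimeOut sketches the proposed up-front check: if the minimum time
// the benchmarks need already exceeds the timeout, fail fast rather than
// running for the full -timeout and crashing at the end.
func wouldTimeOut(benchTime, timeout time.Duration, count, leafBenchmarks int) bool {
	needed := benchTime * time.Duration(count) * time.Duration(leafBenchmarks)
	return needed > timeout
}
```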

Another option is to have the real timeout be count * timeout. I thought that had been suggested, but I don't see it here.

> One option would be to apply the timeout to only the test portion, and allow the benchmarks to continue running for -benchtime each.

I think that this is tantamount (in intent) to having no timeout at all for the benchmark portion, which also seems fine to me.

@stevenh (Contributor) commented Feb 23, 2023

One thing that doesn't seem to have been mentioned: I would expect the intent of -timeout combined with -count to be that each of the count's iterations is limited to the value of -timeout, NOT that the total time is a multiple of it, since the user can already get the latter by scaling -timeout to account for -count if that is the intention. For example, under this reading, go test -count=100 -timeout=30s would allow each of the 100 runs up to 30s, rather than budgeting 3000s for the whole lot.

The reason for this is to allow the user to track down the cause of a test that hangs randomly. Sure, you could write a wrapper script that runs with -count=1 N times, but that seems like a missed opportunity.
