x/build: trybots should include all platforms that can contribute release-blockers #29239

griesemer · 2018-12-13T22:29:09Z

If a build failure for a given platform P can be considered a release-blocker (such as #29221), then trybots should also run on P. Otherwise we have to rely on the actual build failure before we can fix the issue.

Example: In #29221, some newly added math tests failed on s390x, yet the trybots didn't notice.

If it's too expensive (too slow) to run the trybots for some platforms all the time, maybe we could consider doing it at least some time before any of the imminent release stages. For instance, if we are planning to cut a Beta or RC, we might want to consider starting to run the trybots a week before on all platforms where failures might block the release.

cc: @golang/osp-team
cc: @bradfitz
cc: @dmitshur
cc: @andybons

andybons · 2018-12-14T18:52:22Z

If we consider breaking the platform to be a release-blocking bug, then I agree it should be run on every trybot run triggered by a CL. That said, for builders that we can't spin up arbitrary instances of like the IBM Z-series, it could create a bottleneck for all runs across every CL.

One thing we could do (though it's not a trivial task) is create a submit queue that requires all builders across first-class ports to succeed; if they do, the patch gets submitted. Intermediate runs during review would then use a smaller subset of builders that runs quicker, while the final submission would still catch this sort of issue.

@bradfitz did we ever have email alerts when a builder failed? Is there precedent for adding them?

bradfitz · 2018-12-14T18:56:48Z

> @bradfitz did we ever have email alerts when a builder failed? Is there precedent for adding them?

Yes. That is #12509. (You even commented on it. :))

bradfitz · 2018-12-14T19:04:51Z

A submit queue would also ensure that changes which merge cleanly as text but conflict semantically don't break builds.

bcmills · 2018-12-14T19:24:21Z

> for builders that we can't spin up arbitrary instances of like the IBM Z-series, it could create a bottleneck

We could batch up TryBot changes on those platforms: pile up all of the changes that need TryBot testing in a queue, and have a single TryBot run test all of the changes in the queue as a batch.

(The downside to that approach, of course, is a higher false-positive rate, or the need to break up the batch and retry failing tests individually in case of regressions.)
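The batch-then-split idea above can be sketched roughly as follows. This is only an illustration, not how the coordinator works: `runTests` and `findBreakers` are hypothetical names, the builder is simulated by a plain function, and the sketch assumes a failure can be traced to individual changes (a failure that only appears when two changes are combined would not be isolated by it).

```go
package main

import "fmt"

// runTests is a hypothetical stand-in for one TryBot run on a scarce builder.
// It reports whether the tests pass with all of the given changes applied.
type runTests func(changes []string) bool

// findBreakers tests the whole batch in a single run. Only if that run fails
// does it split the batch in half and retry each half, recursing until the
// failing changes are isolated.
func findBreakers(batch []string, pass runTests) []string {
	if len(batch) == 0 || pass(batch) {
		return nil
	}
	if len(batch) == 1 {
		// Return a fresh slice so callers never alias the input queue.
		return []string{batch[0]}
	}
	mid := len(batch) / 2
	bad := findBreakers(batch[:mid], pass)
	return append(bad, findBreakers(batch[mid:], pass)...)
}

func main() {
	// Simulated builder: the run fails whenever CL3 is part of the batch.
	pass := func(changes []string) bool {
		for _, c := range changes {
			if c == "CL3" {
				return false
			}
		}
		return true
	}
	queue := []string{"CL1", "CL2", "CL3", "CL4"}
	fmt.Println(findBreakers(queue, pass))
}
```

In the good case the whole queue costs one run; each regression adds a number of extra runs that grows with the logarithm of the batch size, which is the retry cost mentioned above.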

bradfitz · 2018-12-14T20:03:23Z

TryBots are supposed to be fast, though.

If we want the SlowBots, that is more #24539 and ... another bug I can't find right now about letting users pick which trybots to run instead of the default set.

For submit queue, that is basically #12482 combined with #9858.

bradfitz · 2019-10-21T17:30:49Z

Slowbots (#34501) are now live.

Of the first-class ports (the ones that we define as being release blockers), the two we're missing as trybots are linux/arm and darwin/amd64. We used to have both of those but they were too unreliable.

Things we can improve that would satisfy this bug, probably in this order:

- improve slowbot reporting (make trybot status URLs permanent, survive restarts)
- start with making CLs that look like they're touching arm code or darwin code auto-invoke slowbots
- make slowbots report trybot happiness once trybots are happy, even if the slow ones are still running
- make slowbots time out after inability to get a builder for, say, an hour
- add some global test caching (x/build/cmd/coordinator: use cmd/go's build caching #28950) so we don't waste builder resources running tests on, say, doc changes
- include linux/arm and darwin/* as slowbots always
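One way the "CLs that look like they're touching arm code or darwin code" check could work is to look only at file names, relying on Go's `_GOOS` / `_GOARCH` file name suffix convention. The sketch below is an assumption about how such a heuristic might look, not the coordinator's actual logic: `extraPorts` is a hypothetical function, the file paths are made up for illustration, and the mapping from a suffix to a port is a guess.

```go
package main

import (
	"fmt"
	"path"
	"sort"
	"strings"
)

// extraPorts guesses, from file names alone, which of the two missing
// first-class ports a CL might affect. It looks for "arm" or "darwin"
// among the underscore-separated parts that follow the first part of
// the base name.
func extraPorts(files []string) []string {
	want := map[string]bool{}
	for _, f := range files {
		base := path.Base(f)
		name := strings.TrimSuffix(base, path.Ext(base))
		name = strings.TrimSuffix(name, "_test")
		parts := strings.Split(name, "_")
		for _, p := range parts[1:] {
			switch p {
			case "arm":
				want["linux/arm"] = true // assumed mapping
			case "darwin":
				want["darwin/amd64"] = true // assumed mapping
			}
		}
	}
	var ports []string
	for p := range want {
		ports = append(ports, p)
	}
	sort.Strings(ports)
	return ports
}

func main() {
	// Illustrative file list for a CL; the paths are hypothetical.
	files := []string{
		"src/runtime/sys_darwin_amd64.s",
		"src/math/sqrt_arm.s",
		"doc/notes.html",
	}
	fmt.Println(extraPorts(files))
}
```

A name-based check like this misses files that are restricted by build constraints inside the file rather than by name, so it would only be a starting point, in the spirit of "start with" in the list above.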

griesemer added the NeedsDecision label Dec 13, 2018

andybons added this to the Unreleased milestone Dec 13, 2018

andybons changed the title ~~trybot: should run on all platforms that can contribute release-blockers~~ x/build: trybots should include all platforms that can contribute release-blockers Dec 13, 2018

gopherbot added the Builders label Dec 13, 2018

martisch mentioned this issue Sep 24, 2019

x/build: add "slowbots" support #34501

Closed

bradfitz added the NeedsFix label Oct 21, 2019

gopherbot removed the NeedsDecision label Oct 21, 2019

This was referenced Mar 16, 2021

x/build: add linux-arm-aws to the set of trybots #45064

Closed

x/build: add linux-arm64-aws to the set of trybots #45065

Closed

bcmills mentioned this issue Jun 24, 2021

x/build: configure dashboard builders to test the same configurations that will be shipped in binary releases #46900

Open
