New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
net: TestDialParallel failures due to longer-than-expected delay on Windows #52173
Comments
Since (It may be possible to resolve by making the timing bounds even more lax, or by somehow eliminating the timing bounds entirely if the test is meaningful without them.) |
FWIW, I've been attempting to reproduce a different Windows issue, but have managed to reproduce this twice thus far in ~1hr of testing with 25 windows-amd64-longtest gomotes running all.bat using https://github.com/mknyszek/goswarm/. Edit: I got 58(!) cases of this over the weekend. |
Change https://go.dev/cl/408354 mentions this issue: |
I am still reproducing this after https://go.dev/cl/408354, though at a much lower rate. In the past ~24hr of 25 windows-amd64-longtest gomotes running all.bat I have reproduced this once.
|
@prattmic is this OK after beta 1? |
@neild will know better, but I imagine so, it is quite rare now. |
Yes, this is okay after beta1. The test is irredeemably flaky. I think it needs to be rewritten. |
Change https://go.dev/cl/410754 mentions this issue: |
Change https://go.dev/cl/410957 mentions this issue: |
CL 410754 introduces a race accessing the global testHookDialTCP hook. Avoiding this race is difficult, since Dial can return while goroutines it starts are still running. Add a version of this hook to sysDialer, so it can be set on a per-test basis. (Perhaps other uses of this hook should be moved to use the sysDialer-local hook, but this change fixes the immediate data race.) For #52173. Change-Id: I8fb9be13957e91f92919cae7be213c38ad2af75a Reviewed-on: https://go-review.googlesource.com/c/go/+/410957 Run-TryBot: Damien Neil <dneil@google.com> Reviewed-by: Ian Lance Taylor <iant@golang.org> Reviewed-by: Cherry Mui <cherryyz@google.com> TryBot-Result: Gopher Robot <gobot@golang.org>
greplogs --dashboard -md -l -e 'FAIL: TestDialParallel .*(?:\n .*)*got .*; want <= .*'
2022-04-05T18:01:26-9e16cc1/windows-amd64-longtest
Note that the test currently uses a hard-coded heuristic for the timing bounds:
https://cs.opensource.google/go/go/+/master:src/net/dial_test.go;l=302-309;drc=da7891f6f36c48f2931ed916ed305330c06f9bd7
(See previously #35616; CC @ianlancetaylor @neild.)
The text was updated successfully, but these errors were encountered: