New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
net: TestDialTimeout flake #11872
Comments
It's probably because the test cases, both TestDialTimeout and TestDialTimeoutFDLeak, depend on a wrong assumption with runtime scheduler. Looks like the flakiness in net/http is irrelevant, though.
|
Can you give https://golang.org/cl/12691 a shot? |
CL https://golang.org/cl/12691 mentions this issue. |
I don't really understand why that CL will fix this (or, indeed, what it's really fixing), but I'll update and restart my run.bash loop. |
Previously I was able to reproduce this twice in 322 iterations. With this change I've gotten through 1,500 iterations without a failure. |
Thanks for the confirmation. The root cause comes from the current corner-cutting socktest package implementation. It tries to track socket calls with socket descriptor numbers as a key for testing, and has no care about "quick socket descriptor number recycling" for simplicity. Therefore it may confuse socket descriptors in a situation like the following:
This may happen in selfConnect, in the case of TCP simultaneous open, and Linux is one of the platforms that can make TCP simultaneous open happen easily. |
…he moment (we have a test which can fail on linux (see: golang/go#11872 (comment))
I've been running run.bash in a loop on my linux/amd64 workstation for the past few days and out of ~500 runs, I've had three failures of TestDialTimeout with:
It has my GC changes from CL 12674, but I don't think those are related. The latest commit from master is ae1ea2a. This shows up occasionally on the dashboard as well:
2015-06-05T18:51:23-d64cdde/linux-amd64-nocgo
2015-06-05T18:51:23-d64cdde/linux-amd64-noopt
2015-06-11T14:33:40-0beb931/solaris-amd64-smartos
2015-06-14T20:54:01-48d865a/linux-amd64-noopt
2015-07-05T03:36:56-1edf489/linux-amd64-noopt
2015-07-14T00:07:31-337b7e7/linux-amd64
2015-07-15T23:28:42-08dbd8a/solaris-amd64-smartos
2015-07-17T22:46:05-955c0fd/android-arm-crawshaw
2015-07-23T17:10:28-e0ac5c5/linux-386-sid
This may be related to #11474, though the error sounds different.
@mikioh
The text was updated successfully, but these errors were encountered: