-
Notifications
You must be signed in to change notification settings - Fork 17.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
x/net/http2: many tests are flaky: Error while expecting an RSTStream frame: timeout waiting for frame #18111
Comments
/cc @tombergan |
We're seeing this across the board on basically on all GOOS now. |
Closed dup #18235. See some debugging notes there. |
With @tombergan's new debugging, https://build.golang.org/log/b8e6726f6ff2fecdaebc69ebbd16e2f3c34a7ea4
|
Oh, is this due to "http2: schedule RSTStream writes onto its stream's queue" (golang/net@00ed5e9) ? |
Are you suggesting the stream is somehow closed before the RSTStream is flushed from the queue? A stream's queue is wiped when the stream is closed, so this is vaguely possible, although I haven't thought through the specific buggy sequence. |
If that's the problem, it might also be related to the remaining flake here: |
I haven't thought through it all yet either. But that's my hunch, since a bunch of tests (all involving RSTStream frames) are now failing where the tests had been stable for ages. |
This looks like the buggy line: What if we instead modify endStream to return true for RST_STREAM, then wait for the RST_STREAM to be written before closing the stream? |
Yeah, that should probably work. Only thing I'd be concerned about is what else could happen in the meantime (between the Alternative, just for completeness (but not well-considered) is: the Your fix sounds good, though. Want to send a CL? It probably doesn't need a test, since we have tons of failing builders already which are effectively the test. If the existing tests pass, that's a good sign. |
Sure, I'll send a CL soonish. |
CL https://golang.org/cl/34238 mentions this issue. |
CL https://golang.org/cl/34495 mentions this issue. |
Updates bundled x/net/http2 to git rev 1195a05d for: http2: fix incorrect panic https://golang.org/cl/34498 http2: fix race in writePushPromise https://golang.org/cl/34493 http2: speed up TestTransportFlowControl in short mode https://golang.org/cl/33241 http2: don't flush a stream's write queue in sc.resetStream https://golang.org/cl/34238 http2: allow Transport to connect to https://[v6literal]/ without port https://golang.org/cl/34143 http2: log Framer reads and writes when a server test fails https://golang.org/cl/34130 Updates #18326 Updates #18273 Updates #18111 Updates #18248 Updates #18235 Change-Id: I18c7a297fc94d6a843284efcfc43e0fdab9b5f41 Reviewed-on: https://go-review.googlesource.com/34495 Run-TryBot: Chris Broadfoot <cbro@golang.org> TryBot-Result: Gobot Gobot <gobot@golang.org> Reviewed-by: Brad Fitzpatrick <bradfitz@golang.org>
resetStream(st) queues a RST_STREAM frame and then calls closeStream(st). Unfortunately, closeStream(st) flushes any writes pending on st, which can drop the RST_STREAM that was just queued. (If we are lucky, the RST_STREAM will fit in the send buffer and won't be dropped, however, if we are unlucky the RST_STREAM will be dropped.) I fixed this bug by removing closeStream(st) from resetStream. Instead, closeStream happens after the RST_STREAM frame is written. This is a more direct implementation of the diagram in RFC 7540 Section 5.1, which says that a stream does not transition to "closed" until after the RST_STREAM has been sent. A side-effect is that the stream may stay open for longer than it did previously (since it won't close until *after* the RST_STREAM frame is actually written). Situations like the following are a problem: - Client sends a DATA frame that exceeds its flow-control window - Server returns streamError(ErrCodeFlowControl) from processData - RST_STREAM is queued behind other frames - Server process the request body and releases flow-control - Client sends another DATA frame, this one fits in the flow-control window. Server should NOT process this frame. To avoid the above problem, we set a bool st.resetQueued=true when RST_STREAM is queued, then ignore all future incoming HEADERS and DATA frames on that stream. I also removed st.sentReset and st.gotReset, which were used to ignore frames after RST_STREAM is sent. Now we just check if the stream is closed. Fixes golang/go#18111 Change-Id: Ieb7c848989431add5b7d95f40d6d91db7edc4980 Reviewed-on: https://go-review.googlesource.com/34238 Reviewed-by: Brad Fitzpatrick <bradfitz@golang.org> Run-TryBot: Brad Fitzpatrick <bradfitz@golang.org> TryBot-Result: Gobot Gobot <gobot@golang.org>
https://storage.googleapis.com/go-build-log/0b1b6d4c/openbsd-amd64-gce58_f9489e71.log
The text was updated successfully, but these errors were encountered: