-
Notifications
You must be signed in to change notification settings - Fork 17.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
testing: when using a custom TestMain, m.Run does not return if one of the tests it runs panics #37206
Comments
I've verified func testMain(m *testing.M) int {
defer func() {
if e := recover(); e != nil {
fmt.Println("there was a panic")
}
}()
setup()
defer teardown()
return m.Run()
} The string "there was a panic" doesn't get printed. It does get printed if there is a panic before func testMain(m *testing.M) int {
defer func() {
if e := recover(); e != nil {
fmt.Println("there was a panic")
}
}()
setup()
defer teardown()
panic("about to m.Run")
return m.Run()
} The code in This needs further investigation. /cc @mpvl @josharian per owners. |
#34129 is somewhat relevant, although the implementation for that issue probably will not address this one. (CC @changkun @abuchanan-nr) |
Change https://golang.org/cl/219977 mentions this issue: |
@bcmills Thanks for CC me. I basically agree with @dmitshur . The Go team must decide either document the behavior or allow testing.M.Run to return properly if there is a test panic. For the following code snippet:
outputs:
If we decide to allow testing.M.Run to return, then it will execute
|
We can never make this perfect. If some code in a test calls go func() { panic("die die die") }() then The only question here is how we should handle a test that panics in the goroutine used to run the test. Should we try to handle that specific case? Or should we just treat it like a panic in a different goroutine, which is in effect what we do now? I don't have a strong opinion about that. But the testing package is already fairly baroque. Is it really worth complicating it further to handle one specific case when we can't handle other similar cases? |
No, we can't do anything about it yet. I didn't write any test regarding user goroutine panics and left this case to the CL's reviewers, and I think this particular case can simply be ignored because this type of panic happens in a user goroutine. The testing package has done everything it could achieve (panics happens directly in tests and subtests). If we interested in handling this particular case, more investigation could be done with subsequent CLs. |
I suppose I'm asking for opinions. Is it worth complicating the testing package to return from |
Thank you for explaining the current situation and trade-offs in #37206 (comment), @ianlancetaylor. My opinion is that we should hold off on complicating the |
I was answering an opinion. This particular case can simply be ignored because this type of panic happens in a user goroutine. |
If I get CL 134395 cleaned up and merged, it will be fairly straightforward for users to use an Moreover, I suspect that the vast majority of tests do not explicitly spawn goroutines, let alone goroutines that may panic. It seems reasonable for users to expect that their best-effort cleanup using |
At least part of the complexity of the implementation seems to arise because the runtime does not provide a mechanism for a program to re-raise an existing panic without altering its stack trace. (I've discussed that deficiency with at least @aclements before, but I don't see an issue filed for it.) If we provided a general mechanism to re-panic without altering the stack trace, then I think the (CC @danscales) |
This might be already off-topic: Was there any existed discussion on registering global handler of panics? It seems (server-side) Go users nowadays avoid to use panic/recover pair, just because they do not want their service down by accident, and there is no way to capture a panic outside a goroutine. The runtime also does not have this privilege. |
I don't think a global panic handler would be an appropriate solution, here or in servers. An uncaught |
I don't see any way of doing it, handling all panics seems to be equivalent to the status in GC and mutators. That is to say, a user goroutine may panic intentionally, the runtime should not do anything with it by default. This basically matches what I said in the beginning: "The testing package has done everything it could achieve with the CL. If we interested in handling this particular panic case, more investigation could be done in subsequent CLs." cc @dmitshur |
Good point. Agree. I just randomly threw some virgin ideas, since I didn't experience much use cases regarding capturing a panic outside a goroutine except this, not suggesting anything :) |
I personally am not convinced by "let's do this now and figure out how to do more later." Sometimes that is the right approach, but sometimes it's important to at least understand how we could do more later. Right now people are confused because when they |
I don't think we do need to “figure out how to do more later”. We should structure our own code to propagate panics from non-main goroutines back to the main goroutine, and encourage users to do the same. Especially for non- |
I am also agreed with you about the part " ... 'let's do this now and figure out how to do more later.' ..." is not a good decision in this case. However, the current situation is I don't see any way of doing it better than the proposed CL because we cannot differentiate an accidental panic and an intentional panic in a spawned goroutine from the runtime unless people give a clear indication to the TestMain or equivalent. This knob makes the usage and behavior even more complex than the proposed CL. If we carefully think about panics propagate out from a test goroutine, things become more subtle. Go users were taught to fail a test by I personally argue this is the best action we can offer and there are no further thoughts from my side. |
Change https://golang.org/cl/221321 mentions this issue: |
It appears that the proposed solution would propagate the panic to TestMain. An alternative would be that a panic within a test instead triggers an immediate failure of that test but allows the other tests to continue, such that the panic is never propagated to TestMain but arguably never needs to be. On the downside, this may encourage test writers to deliberately call panic to fail -- but I'm not sure we're at risk of that becoming common practice. On the plus side, I think this alternative might be:
|
@progressnerd, an unexpected panic potentially leaves the program in an arbitrarily corrupted state. Swamping the panic in noise from subsequent test failures — or worse, deadlocks — would make the panic harder to diagnose, in addition to delaying the output of the panic, for at best marginal additional information from the test. |
Automatically drop any old test databases. Initially I was under the impression that users could make use of `TestMain` to ensure that test databases are correctly cleaned up after the test suite completes, but it turns out that if a test panics then `TestMain` may not return [1]. As a result there doesn't seem to be a straightforward way to ensure that test databases created by this library are eventually cleaned up, which means that the number of test databases can continually grow across test runs. Mainly this is an annoyance, but eventually it can also start to cause problems. Since this library is explicitly about setting up and tearing down test databases, we really should make an effort to ensure that the number of test databases doesn't continuously grow in certain situations. One potential option would be to replace the supervisor `GetTestDB(testing.TB) TestDB` with something like: ```go func (s *testSupervisor) WithTestDB(t testing.TB, fn func(TestDB)) { defer func() { if r := recover(); r != nil { t.Errorf("recovered panic: %v", r) } }() dbResource, err := s.inner.getTestDB(ctx) if err != nil { t.Fatalf("get test db: %s", err) } t.Cleanup(func() { // release dbResource to the pool. ... }) return fn(dbResource.Data()) } ``` While this would help, there will still be problems if other tests in the suite panic. Also it feels less ergonomic (in my opinion) and would make it harder for users to wrap this library (which is expected). There is also a similar problem, albeit to a lesser extent, with test databases that are explicitly persisted through the `KeepDatabasesForFailed` option. These changes update the supervisor to automatically drop any old test databases, meaning those with the `pg_test` prefix, immediately. This means that by default, after a test suite finishes there should not be any test databases remaining. I am not sure if this is the correct approach. Mainly, I don't think this will actually work if tests from multiple packages are run in parallel and multiple packages make use of `pgtest`. Also, it isn't the end of the world to just have the user manually drop these test databases themself with something like: ``` psql postgres -c "\l" | grep pg_test | awk '{print $1}' | xargs -I{} psql postgres -c "DROP DATABASE {};" ``` I was just trying to verify the `TestMain` approach worked as expected even if the tests didn't exit normally, and once I found out this wasn't the cased tried to come up with a solution. If we find that automatically dropping old test databases is problematic, it might be best to just add some extra documentation explaining the behaviour. [1] golang/go#37206
I'm new to Go and I just hit this issue while trying to make my tests clean up after themselves.
This is exactly the behavior I was expecting! |
@johndunlap, I suspect that's what most people would prefer. I actually built a library that achieved this by reflectively modifying test cases, and it was greatly appreciated by me and others who used it, but it was a bit too hard to maintain that kind-of-hacky, reflection-based implementation. Continuing with tests cases after one fails always risks swamping that test failure with noise from subsequent test cases, but the balance of usefulness, in my experience, has always been heavily on the side of continuing. Nearly every test panic -- if not literally every -- could have been recovered by the test runner and produced useful info by running subsequent tests. At the end of the day, the test as a whole is either going to pass or fail either way -- so the only question here is what is usually most useful to the user. |
What version of Go are you using (
go version
)?Does this issue reproduce with the latest release?
Yes
What operating system and processor architecture are you using (
go env
)?go env
OutputWhat did you do?
I wrote a function, a test for it, and
TestMain
to perform setup and teardown for testing this function:main.go
p_test.go
main_test.go
What did you expect to see?
I expected to see "setting up" and "tearing down" at some point, and a panic "foo" message with a stack trace.
What did you see instead?
Only setup and panic output appeared:
The program probably called
os.Exit
at some point while running the tests, or panicked in a separate go routine with no way for me to recover. I would expect both setup and teardown functions to be called at some point.The text was updated successfully, but these errors were encountered: