-
Notifications
You must be signed in to change notification settings - Fork 17.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
database/sql: make maxBadConnRetries configurable #48309
Comments
any update? |
There are two ways to approach this, especially that we do a better job of marking connections as bad before they go into the pool to begin with.
I'm open to (2), but I would prefer it not to introduce more locks and shared state. Want to suggest a change? |
I'd like to chip in as I recently opened go-sql-driver/mysql#1385. I think this comes down to a similar problem space. So I would like this setting to be configurable so I can set it to 1 and directly experience any failure. I would expect normal applications to use the current behaviour and not touch any special configurations unless they specifically want to. However, I would like to be able to turn this off, and thus get a much better idea of issues seen when monitoring the health of my database servers. I would imagine others may have a similar issue which is partly where this issue comes from. I also notice that sql.Stats provide some very simple statistics and one of the issues I notice is that we can't see how frequently connections are retried, so there's no visibility in the "stability" of the backend servers being talked to. Basically the current retry logic while good from the application perspective hides issues which we might want to know about. Is there a chance that this could be corrected and you handle one of the 2 points above and also perhaps provide a global counter for deliberate retries or some non-logical errors which perhaps the low level driver can return to you? It seems clear that co-ordination with the low level driver authors is needed so that things work as required. I'm not sure how such co-ordination is best handled, but I do see the low level driver layer can not change things that |
@sjmudd [1] https://cs.opensource.google/go/go/+/refs/tags/go1.19.5:src/database/sql/sql.go;l=852 |
Yes, and I would like it to be incorporated into https://github.com/go-sql-driver/mysql but believe there are some concerns about compatibility issues with the different versions of go that is supported. I would like also to see exposed something like:
I believe that there was some concern about making What I see now is about 1 in 100,000 connections failing though that's after the retry having been applied so I would expect the actual error count to be higher. Given I'm polling a large number of servers all the time this number is actually significant as some of these connections are actually to healthy servers which need to be fixed when another server fails. My primary interest is to be able to collect statistics from database/sql and optionally be able to turn off the retries. With that in place I can then get the detail necessary back from a modified The MySQL driver through database/sql to identify what specific conditions are generating these errors. |
Could I ask where I know database/sql is backend agnostic layer, but I'm also aware of MySQL being notorious for "MySQL gone away errors" often triggered by the server disconnecting connections which were idle for too long. So the reason for this setting or at least the attempt to reconnect if you get an error would seem to match that scenario. Perhaps other backends suffer from similar behaviour? This just hides the problem from the user "most of the time" which is good in some ways. However, the lack of visibility as to how often this is taking place is a concern as it is hiding a reliability problem which perhaps needs addressing outside of the database/sql layer. |
Slightly related but of relevance is that MySQL had a patch provided by a colleague for this specific problem which now makes "updated clients" see an error message in such a case (e.g. adjust your MySQL setting of So having a way to configure the retries, and having visibility on the problem when all things are in place would allow people monitoring errors more closely to see errors and then act upon them and thus have a better experience. |
What version of Go are you using (
go version
)?v1.16.2
Does this issue reproduce with the latest release?
Yes
What operating system and processor architecture are you using (
go env
)?n/a
What did you do?
The value of
maxBadConnRetries
is hardcoded to 2 -> https://github.com/golang/go/blob/master/src/database/sql/sql.go#L1511.I was wondering if there is a specific reason for this, as there seems to be attempts to make this variable in the past which were abandoned -> #15608
Making it variable would be very much appreciated in our particular use-case.
Scenario
Note that we cannot simply block connections to the entire DB instance as it needs to be contactable during this period.
The text was updated successfully, but these errors were encountered: