-
Notifications
You must be signed in to change notification settings - Fork 40.6k
Data race in httptest.Server - pkg/storage/etcd, pkg/registry/generic/etcd #18928
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
The failure comes from: TestOIDCAuthentication |
xref: #18917 Same spot I fail on: https://paste.fedoraproject.org/303002/50488092/ |
cc @yifan-gu |
I've just seen the same error in a different test:
This failure comes from: So this may not be related to oidc itself, but maybe to httptest.Server |
I've also seen exactly the same in pkg/storage/etcd. |
This is an interesting one - it is from TestWatchPurposefulShutdown, and there we already have rt.Gosched, so this will not help. |
I think I have a hypothesis what can be happening here (although, I'm not able to reproduce it locally). What I think is happening here is that:
I will prepare a PR fixing it (although unfortunately I'm not able to reproduce it locally, so this will just be a guess...) |
I've sent #18956 - although I hope that it will solve most of problems here. |
However - my PR will not fix OIDC test - I think that one is unrelated... |
It seems that my fix #18956 didn't help... |
One more try to fix tests has just been merged #18999 (although I'm still not able to reproduce them locally) |
Since TestOIDCAuthentication is a different issue - I'm creating a separate issue for that one. |
Unfortunately the second fix also didn't help (it seems that flakes are rare even after the first fix, but they are still present... - I will debug more. |
One more attempt is in #19006 |
@fgrzadkowski - hopefully it was finally fixed in the fourth attempt :) |
I'll merge #19187 to make this flakes go away. |
If anyone see this happening again please ping this issue. |
Oh sorry, I see @xiang90 is working on a less hacky solution, as described here Re-assigning to him. |
But if we think @wojtek-t 's sleep will fix the problem good enough for now, we should lower the priority of this from P0 to P1 or P2... |
@davidopp Right. Agree to de-prioritize it. I will try to get a more robust one next week. |
Thanks, bumped to P2. |
@xiang90 We're prioritizing flakes for the time being. Is this something you'll be able to get to soon, or should we find someone else to work on it? |
@ihmccreery I am in China recently. I can look into this later. But I assume this issue has been fixed by a work around. And go will also fix the root cause soon. This issue is just to make the test shutdown in a more graceful way. |
As @brendandburns mentioned, this might be fixed by #19458 |
iirc the long term fix is in etcd proper. |
I'm taking this due to #20123 having the fix. |
@timothysc etcd/client does not really manage the transport layer. We generate and pass in the transport in k8s and k8s has full management of the shutdown. We need to change that part if we want a more graceful shutdown. |
The failure is the following:
@timothysc @kubernetes/goog-testing @yifan-gu
The text was updated successfully, but these errors were encountered: