TestMultipleVMs failure - "exit status 148" and "race detected during execution of test" #358
Comments
@xibz indicated he previously encountered exit code 148 when Firecracker hit seccomp violations: firecracker-microvm/firecracker#897. So that's something to look into as a possibility. EDIT: I also found this explicitly mentioned in their changelog: https://github.com/firecracker-microvm/firecracker/blob/master/CHANGELOG.md#changed-5
I can reproduce this locally. I saw there is an unreleased fix for an issue that causes a seccomp violation during a Firecracker panic, so I tried with 0.20 (from here), but I still see VMs exiting with 148. The exit 148 always happens right as the VM is supposed to shut down and often (but not always) includes these logs about a bad syscall:
According to
Opened an issue with Firecracker here with some more details I found: firecracker-microvm/firecracker#1456. In the meantime, I think we can clean up the error about "os: process already finished", as it should ideally be more like "VMM exited with status 148" or something like that.
Also, I haven't reproduced the
I merged #362, so while the issue persists, the error is no longer "os: process already finished"; it's now:
and we no longer send SIGTERM to the already-dead process.
The Firecracker team confirmed here there is a bug around handling
The Firecracker team has fixed the issue, and we have upgraded our _submodules/firecracker to include the fix.
There was an ephemeral failure of TestMultipleVMs here (run as part of #356): https://buildkite.com/firecracker-microvm/firecracker-containerd/builds/1315#1dd8d6e8-953c-417b-97f7-e029aea0a82f
I haven't looked in depth yet, but the logs for vmID 78 (which got one of the "os: process already finished" errors) show some odd-looking lines:
I'm not sure off the top of my head what caused the exit code to be 148. We also shouldn't need to send SIGTERM if the VM has already exited (but that may be unrelated).