-
Notifications
You must be signed in to change notification settings - Fork 633
[core] Refactoring the return value of teardown #1595
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
…nto teardown-return-value
Thanks for adding this @Michaelvll! What's a good way to test this PR/simulate down failures? |
…cretevitamin/sky-experiments into teardown-return-value
Sorry for missing the question. One way to test it is to disconnect the internet and |
…nto teardown-return-value
This is also important to avoid leakage of spot clusters in managed spot, as we didn't actually catch the exceptions when the skypilot/sky/spot/recovery_strategy.py Lines 37 to 39 in 5bb759a
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, if it's confirmed that the following works: sky down -a
(or multiple clusters) and only some of the clusters throw a RuntimeError.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just tested sky down
multiple clusters: some succeeded and some failed, with the correct progress bar.
I am testing again with the smoke tests and will merge it after all tests pass:
-
pytest tests/test_smoke.py
Fixes #1649
This is a minor refactoring that avoids using the return value to indicate the failure of the teardown. It also avoids the
sky down
's progress bar progress when a teardown fails. cc @mrahejaTested (run the relevant ones):
pytest tests/test_smoke.py