[ws-manager] The container could not be located when the pod was terminated #12021

### Bug description

We have observed an error in production, in preview-env, and in integration tests that leaves the workspace with the status "The container could not be located when the pod was terminated". We checked the related GCP logs and found that no data loss occurred.

### Questions

- Is "The container could not be located when the pod was deleted.  The container used to be Running" also happening?
  > Yes! Check this [GCP log](https://console.cloud.google.com/logs/query;cursorTimestamp=2023-01-04T05:22:19.977165741Z;query=%22workspace%20failed%22%0Atimestamp%3D%222023-01-04T05:22:19.977165741Z%22%0AinsertId%3D%22domvix05xh6189z6%22;timeRange=PT3H?project=workspace-clusters).
- Is this only happening on stop? Milan's scenario seems to indicate otherwise.

### Plan

1. [x] Verify whether data loss currently occurs when this error happens.
2. [ ] Check Milan's case to understand if it happened during the running workspace phase.

<details>
  <summary>Old description</summary>

  We try to get the status of the pod when it is no longer running, at a point in time we cannot pin down. [logs](https://cloudlogging.app.goo.gl/EALr6Wbfh8q4VNBS7)

Impact on the user:
1. The workspace is generally left in a failed state. Users can try to restart it, as failed is a terminal phase.
2. User data may be lost.

This error message (`The container could not be located when the pod was terminated`) comes from the kubelet:
https://github.com/kubernetes/kubernetes/blob/4aa451e8458a7cbf78ed464e9e47e87d424541ce/pkg/kubelet/kubelet_pods.go#L1810-L1817
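
To make the failure easier to spot when inspecting a pod's container statuses, here is a minimal sketch of a check for the two messages quoted in this issue. The `terminatedState` type and the `isContainerNotLocated` helper are hypothetical stand-ins written for illustration (they only mirror the `Message` field of a terminated container state); this is not ws-manager's actual code.

```go
package main

import (
	"fmt"
	"strings"
)

// terminatedState is a hypothetical stand-in for the terminated state of a
// container status; only the field needed for this check is modeled.
type terminatedState struct {
	Message string
}

// messagePrefix is the part shared by both kubelet messages quoted in this issue
// ("... was terminated" and "... was deleted.  The container used to be Running").
const messagePrefix = "The container could not be located when the pod was"

// isContainerNotLocated reports whether the terminated state carries one of
// those messages. A nil state (container not terminated) never matches.
func isContainerNotLocated(t *terminatedState) bool {
	return t != nil && strings.HasPrefix(t.Message, messagePrefix)
}

func main() {
	st := &terminatedState{
		Message: "The container could not be located when the pod was terminated",
	}
	fmt.Println(isContainerNotLocated(st))
}
```

Matching on the shared prefix is an assumption made so that one check covers both variants; matching the full strings would be stricter.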

Potentially related to this Kubernetes bug: https://github.com/kubernetes/kubernetes/issues/104107

### Steps to reproduce

I don't know

### Workspace affected

_No response_

### Expected behavior

This error message does not appear in production.

### Example repository

_No response_

### Anything else?

This has been happening in `gen59`, `gen60` and `gen61`, too. [Logs.](https://cloudlogging.app.goo.gl/86UASif6LeCtkS8T9)

### Definition of done

Let's spend some time researching whether this is a Kubernetes bug or could in fact be caused by other circumstances too. Please timebox this to 2 hours, then share the results with the team in Slack so we can socialize next steps.

Why research? Because the workspaces impacted by this bug end with a Failed status. cc @geropl I'm not sure whether a workspace ending in a failed status will negatively impact UBP. I assume not, but wanted to check.

<img src="https://front.com/assets/img/favicons/favicon-32x32.png" height="16" width="16" alt="Front logo" /> [Front conversations](https://app.frontapp.com/open/top_3sf4a)
</details>


