Pods stuck in ContainerCreating after cluster autoscaling (AWS CNI race condition) #247

@deliahu

Description

See:

To reproduce

  1. Run a cluster with two t3.medium instances and cluster autoscaling enabled
  2. Deploy iris and let it run to completion
  3. Deploy pipelines/iris

This triggers cluster autoscaling. Once the new node has joined the cluster, the pending Spark job gets scheduled, but its pod stays stuck in ContainerCreating.
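
To confirm the symptom after step 3, a minimal sketch along these lines can list the affected pods. It assumes the input is the parsed output of `kubectl get pods -o json`; the function name `stuck_pods` and the sample pod and node names are hypothetical and only serve as illustration.

```python
from __future__ import annotations

from typing import Any


def stuck_pods(pod_list: dict[str, Any]) -> list[tuple[str, str]]:
    """Return (pod name, node name) for each pod that has at least one
    container waiting with reason ContainerCreating."""
    stuck = []
    for pod in pod_list.get("items", []):
        statuses = pod.get("status", {}).get("containerStatuses", [])
        for container_status in statuses:
            waiting = container_status.get("state", {}).get("waiting") or {}
            if waiting.get("reason") == "ContainerCreating":
                node = pod.get("spec", {}).get("nodeName", "")
                stuck.append((pod["metadata"]["name"], node))
                break  # one waiting container is enough to report the pod
    return stuck


# Hypothetical sample: one pod stuck on the new node, one running normally.
sample = {
    "items": [
        {
            "metadata": {"name": "spark-driver"},
            "spec": {"nodeName": "node-3"},
            "status": {
                "containerStatuses": [
                    {"state": {"waiting": {"reason": "ContainerCreating"}}}
                ]
            },
        },
        {
            "metadata": {"name": "operator"},
            "spec": {"nodeName": "node-1"},
            "status": {"containerStatuses": [{"state": {"running": {}}}]},
        },
    ]
}

print(stuck_pods(sample))
```

Against a live cluster, the same function could be fed with `json.load(sys.stdin)` after piping `kubectl get pods -o json` into the script. A pod that reports a node name but stays in this list is one that was scheduled onto a node and never finished container setup, which matches the behavior described above.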

Metadata

Labels

blocked (Blocked on another task or external event), bug (Something isn't working)
