Description
Bug description
While observing a customer call we saw that an upgrade of a real-world environment was taking over 4 minutes; this is precariously close to the 5 minute helm upgrade
timeout.
Additional debugging pointed the finger at shiftfs-module-loader. This init container compiles the shiftfs kernel module (which is slow) and the version of Kubernetes on some distributions updates daemonsets one node at a time. Combining the slow runtime of shiftfs-module-loader with the linear scaling properties of daemonsets, it's possible for the installation process to be very lengthy on larger clusters.
Steps to reproduce
n/a
Workspace affected
No response
Expected behavior
The helm upgrade process should have a timeout period long enough that it can't accidentally be reached. A value of 10 - 15 minutes is less risky.
Example repository
No response
Anything else?
No response