Cooldown-based eager marshaling #278

jiahuif · 2022-02-14T23:18:03Z

With #251, the marshaling of our OpenAPI spec is delayed until the first request, so that the spec is not re-marshaled every time a new CRD is added, which saves a lot of marshaling time. However, I've encountered the following frustration with our current implementation.

The API server is idle after CRDs are added and before first request comes. Considering in a real-world scenario, the user usually start adding workload after a few minutes, there is usually plenty of time for the API server to "warm-up" before first request from the user.
In a replicated/HA control plane, there are multiple instances of API server. The first request warms up only one of the instances. In other words, for N instances, the first N requests are "warm-up"s and slow.

This PR addresses the issues by adding a cooldown. If the spec is not updated within the cooldown, build the cache by marshaling the spec before the first request comes. This way, we can avoid having to frequently re-marshal the spec while keeping first requests fast.

k8s-ci-robot · 2022-02-14T23:18:22Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: jiahuif
To complete the pull request process, please assign sttts after the PR has been reviewed.
You can assign the PR to them by writing /assign @sttts in a comment when ready.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

jiahuif · 2022-02-15T18:31:27Z

/assign @jpbetz @apelisse @DangerOnTheRanger

apelisse · 2022-02-16T01:32:11Z

I haven't read the code yet but thank you for the great explanation of the pull-request. I don't know if this is something we want for OpenAPI v2 since eventually we have to keep the feature enabled but we're hoping to decrease the number of users to 0, which means that the cache would NEVER have to be populated (that's the target goal). Now, we could do what you suggest for OpenAPI v3, but since it's split across group, we also don't know how often the groups are pulled vs refreshed. Do we have any specific data or reason to implement this?

jpbetz · 2022-02-16T15:40:42Z

cc @DangerOnTheRanger

This seems like a good idea (trying to proactively warm the cache so that the 1st request after an update is less likely to be high latency). I don't know exactly what the ideal cooldown time should be.

k8s-ci-robot · 2022-03-22T05:35:49Z

@jiahuif: PR needs rebase.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

k8s-triage-robot · 2022-06-20T06:11:21Z

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue or PR as fresh with /remove-lifecycle stale
Mark this issue or PR as rotten with /lifecycle rotten
Close this issue or PR with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

k8s-triage-robot · 2022-07-20T06:36:27Z

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue or PR as fresh with /remove-lifecycle rotten
Close this issue or PR with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle rotten

k8s-triage-robot · 2022-08-19T07:33:54Z

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Reopen this issue or PR with /reopen
Mark this issue or PR as fresh with /remove-lifecycle rotten
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/close

k8s-ci-robot · 2022-08-19T07:34:12Z

@k8s-triage-robot: Closed this PR.

In response to this:

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied

After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied

After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Reopen this issue or PR with /reopen

Mark this issue or PR as fresh with /remove-lifecycle rotten

Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

jiahuif added 2 commits January 17, 2022 11:00

handler v2: eager marshaling with cooldown.

9da3187

handler v3: eager marshaling with cooldown.

8f35e44

k8s-ci-robot added cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. size/M Denotes a PR that changes 30-99 lines, ignoring generated files. labels Feb 14, 2022

k8s-ci-robot requested review from jpbetz and sttts February 14, 2022 23:18

k8s-ci-robot assigned apelisse, DangerOnTheRanger and jpbetz Feb 15, 2022

k8s-ci-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Mar 22, 2022

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jun 20, 2022

k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Jul 20, 2022

k8s-ci-robot closed this Aug 19, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cooldown-based eager marshaling #278

Cooldown-based eager marshaling #278

jiahuif commented Feb 14, 2022

k8s-ci-robot commented Feb 14, 2022

jiahuif commented Feb 15, 2022

apelisse commented Feb 16, 2022

jpbetz commented Feb 16, 2022 •

edited

Loading

k8s-ci-robot commented Mar 22, 2022

k8s-triage-robot commented Jun 20, 2022

k8s-triage-robot commented Jul 20, 2022

k8s-triage-robot commented Aug 19, 2022

k8s-ci-robot commented Aug 19, 2022

Cooldown-based eager marshaling #278

Cooldown-based eager marshaling #278

Conversation

jiahuif commented Feb 14, 2022

k8s-ci-robot commented Feb 14, 2022

jiahuif commented Feb 15, 2022

apelisse commented Feb 16, 2022

jpbetz commented Feb 16, 2022 • edited Loading

k8s-ci-robot commented Mar 22, 2022

k8s-triage-robot commented Jun 20, 2022

k8s-triage-robot commented Jul 20, 2022

k8s-triage-robot commented Aug 19, 2022

k8s-ci-robot commented Aug 19, 2022

jpbetz commented Feb 16, 2022 •

edited

Loading