Skip to content

Conversation

natasha41575
Copy link
Contributor

@natasha41575 natasha41575 commented Sep 22, 2025

  • One-line PR description: Graduate InPlacePodVerticalScaling to GA.
  • Other comments:

We believe we are ready to graduate InPlacePodVerticalScaling to GA in kubernetes 1.35.

Per the graduation requirements:

  • VPA integration has moved to beta.
  • No major bugs have been reported for 3 months.
  • Pod-scoped resources will be handled by this enhancement, and we expect this to land in v1.35.
  • UpdatePodSandboxResources has been implemented in crio, and is slated to land in the containerd 2.2 release in November (PR link). This is not the critical piece of functionality for in-place pod resize, so while we do want and expect it to land, we do not consider it blocking.
  • Re-evaluation of the following decisions:
    • Resize atomicity - Resizes will stay atomic. Allowing partial resizes adds significant complexity and the use case is unclear.
    • Exposing allocated resources in the pod status - We will continue to expose allocated resources in the pod status.
    • QOS class changes - This is still considered out of scope for this KEP.

Most importantly, user feedback is green:

  • Internal usage metrics show that usage is high. There have been no major bugs reported.
  • There are now multiple integrations with in-place pod resize including Cast.ai, ScaleOps, and VPA. Scale-ops has even announced that their in-place pod resize integration is GA: https://scaleops.com/blog/now-ga-scaleops-launches-in-place-pod-resizing/.
  • We have feedback from a user of Cast.ai that they are very happy with it.
  • We have a customer that ran benchmarking of their workloads, and showed that they have 30% cost efficiency when using in-place pod resize vs without it.
  • ...and a few others.

/sig node autoscaling scheduling

/cc @tallclair @dchen1107

/cc @dom4ha
for sig-scheduling

/cc @jackfrancis
for sig-autoscaling

@k8s-ci-robot k8s-ci-robot added sig/node Categorizes an issue or PR as relevant to SIG Node. sig/autoscaling Categorizes an issue or PR as relevant to SIG Autoscaling. sig/scheduling Categorizes an issue or PR as relevant to SIG Scheduling. labels Sep 22, 2025
@k8s-ci-robot k8s-ci-robot added cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. kind/kep Categorizes KEP tracking issues and PRs modifying the KEP directory size/L Denotes a PR that changes 100-499 lines, ignoring generated files. labels Sep 22, 2025
- Resize atomicity
- Exposing allocated resources in the pod status
- QOS class changes
- The subset of pod resize tests [here](https://github.com/kubernetes/kubernetes/blob/1aec2eb0030d2f121b4cf78998e9391d9389f1a0/test/e2e/common/node/pod_resize.go) under `doPodResizeTests` and `doPodResizeErrorTests` that meet the Conformance test requirements are promoted to Conformance.
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We have a tracking bug for the Conformance endpoints: kubernetes/kubernetes#133607

@dom4ha
Copy link
Member

dom4ha commented Sep 23, 2025

/label lead-opted-in
/milestone v1.35

@natasha41575
Copy link
Contributor Author

/cc @wojtek-t

@jpbetz jpbetz mentioned this pull request Oct 1, 2025
95 tasks
@jpbetz
Copy link
Contributor

jpbetz commented Oct 1, 2025

/approve
For PRR (Everything appears to have been filled out by Beta, which is what I like to see!)

@tallclair tallclair self-assigned this Oct 1, 2025
Copy link
Member

@tallclair tallclair left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

/lgtm

'cpu' and 'memory' as names. It supports the following restart policy values:
* `PreferNoRestart` - default value; resize the Container without restart, if possible.
* `NotRequired` - Equivalent to `PreferNoRestart`, deprecated with v1.33.
* `NotRequired` - default value; resize the Container without restart, if possible.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Oops, thanks for cleaning this up.

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Oct 2, 2025
@helayoty helayoty moved this to Needs Review in SIG Scheduling Oct 2, 2025
@k8s-ci-robot k8s-ci-robot removed the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Oct 3, 2025
Copy link

@jackfrancis jackfrancis left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

/lgtm

for SIG Autoscaling

cc @adrianmoisey

@github-project-automation github-project-automation bot moved this from Needs Review to In Progress in SIG Scheduling Oct 3, 2025
@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Oct 3, 2025
@k8s-ci-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: jackfrancis, jpbetz, natasha41575
Once this PR has been reviewed and has the lgtm label, please assign mrunalp for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. kind/kep Categorizes KEP tracking issues and PRs modifying the KEP directory lead-opted-in Denotes that an issue has been opted in to a release lgtm "Looks good to me", indicates that a PR is ready to be merged. sig/autoscaling Categorizes an issue or PR as relevant to SIG Autoscaling. sig/node Categorizes an issue or PR as relevant to SIG Node. sig/scheduling Categorizes an issue or PR as relevant to SIG Scheduling. size/L Denotes a PR that changes 100-499 lines, ignoring generated files.
Projects
Status: In Progress
Development

Successfully merging this pull request may close these issues.

6 participants