Commit Graph

126914 Commits

Author SHA1 Message Date
ndixita
26f11c4586 QOS changes for Pod Level resources 2024-11-08 03:00:54 +00:00
ndixita
6db40446de Scheduler changes:
1. Use pod-level resource when feature is enabled and resources are set at pod-level
2. Edge case handling: When a pod defines only CPU or memory limits at pod-level (but not both), and container-level requests/limits are unset, the pod-level requests stay empty for the resource without a pod-limit. The container's request for that resource is then set to the default request value from schedutil.
2024-11-08 03:00:54 +00:00
ndixita
8a8dc27b4e Adding the logic to validate pod-level resources as following:
1. The effective container requests cannot be greater than pod-level requests
2. Inidividual container limits cannot be greater than pod-level limits
3. Only CPU & Memory are supported at pod-level
4. Inplace container resources updates are not supported if pod-level resources are set
Note: effective container requests cannot be greater than pod-level limits is supported by transitivity. Effective container requests <= pod-level requests && pod-level requests <= pod-level limits; Therefore effective container requests <= pod-level limits

Signed-off-by: ndixita <ndixita@google.com>
2024-11-08 03:00:54 +00:00
ndixita
a2ddde877c Adding the logic to set default pod-level request as following:
1. If pod-level limit is set, pod-level request is unset and container-level request is set: derive pod-level request from container-level requests
2. If pod-level limit is set, pod-level request is unset and container-level request is unset: set pod-level request equal to pod-level limit
2024-11-08 03:00:54 +00:00
ndixita
502e0f55c4 Adding support for pod level resources in kubectl
1. Add support for pod level resources in kubectl
2. Reuse the existing method to describe container resources and generalize it to describe both pod and container level resources
2024-11-08 03:00:53 +00:00
ndixita
85488b5f10 Generated files and compatability data from API changes 2024-11-08 03:00:50 +00:00
ndixita
d7f488b5e3 API changes for Pod Level Resources
1. Add Resources struct to PodSpec struct in both external and internal API packages
2. Adding feature gate and logic for dropping disabled fields for Pod Level Resources
KEP: enhancements/keps/sig-node/2837-pod-level-resource-spec
2024-11-08 02:45:04 +00:00
Kubernetes Prow Robot
210deea063
Merge pull request #128681 from soltysh/client-go_port_forward_reset
Client go port forward reset, error handling and tests
2024-11-07 23:33:03 +00:00
Kubernetes Prow Robot
b09acdfc72
Merge pull request #128674 from richabanker/master
Fix OIDC flags
2024-11-07 23:32:56 +00:00
Mark Rossetti
3c9380c449
Memory manager support for Windows nodes (#128560) 2024-11-07 23:32:49 +00:00
Kubernetes Prow Robot
8504758a2e
Merge pull request #125757 from Nordix/esotsal/125205
[FG:InPlacePodVerticalScaling] Fix backoff problem when quickly reverting resize patch
2024-11-07 23:32:42 +00:00
Kubernetes Prow Robot
b7113809cb
Merge pull request #128670 from liggitt/externaljwt-broadcast
Move broadcast of key updates into sync, fixup of externaljwt generation / test
2024-11-07 22:20:58 +00:00
Kubernetes Prow Robot
ab30adcbae
Merge pull request #128356 from lauralorenz/crashloopbackoff-maintain10minuterecoverythreshold
KEP-4603: Maintain current 10 minute recovery threshold for container backoff regardless of changes to the maximum duration
2024-11-07 22:20:50 +00:00
Kubernetes Prow Robot
3300aa1783
Merge pull request #128247 from mattcary/autodelete-ga
Promote StatefulSetAutoDeletePVC to stable in 1.32
2024-11-07 22:20:43 +00:00
Kubernetes Prow Robot
6e25c2a456
Merge pull request #128668 from liggitt/externaljwt
Update staging docs to add externaljwt
2024-11-07 21:07:17 +00:00
Kubernetes Prow Robot
601feb8ccb
Merge pull request #128661 from sanposhiho/split-test
fix: move the queueing integration tests to another package to address the timeout issue
2024-11-07 21:07:09 +00:00
Kubernetes Prow Robot
b82013ecd6
Merge pull request #127326 from stlaz/ctb_new_signer
trustbundles: add a new kube-apiserver-serving signer
2024-11-07 21:07:02 +00:00
Kubernetes Prow Robot
1dd81aa1c9
Merge pull request #126653 from zhifei92/fix-podstatus
fix the issue of losing the pending phase after a node restart.
2024-11-07 21:06:54 +00:00
Kubernetes Prow Robot
1ce20b2b6f
Merge pull request #126336 from HirazawaUi/remove-runonce-mode
Kubelet: Remove runonce mode
2024-11-07 21:06:46 +00:00
carlory
b209a62483 Fix OIDC flags 2024-11-07 12:28:03 -08:00
Maciej Szulik
0b1617ccef
Clean error handling in port-forward
This commit introduces:
1. Cleanups in port-forwarding error handling code, which ensures that
   we only compare lowercased text always.
2. E2E verifying that when a pod is removed a port-forward is stopped.

Signed-off-by: Maciej Szulik <soltysh@gmail.com>
2024-11-07 21:19:59 +01:00
Nic
dbe6b6657b
fix: draining remote stream after port-forward connection broken
Signed-off-by: Nic <qianyong@api7.ai>
2024-11-07 21:19:59 +01:00
Kubernetes Prow Robot
847be85000
Merge pull request #128657 from ffromani/unshare-containermap-among-managers
node: cm: don't share containerMap instances between managers
2024-11-07 19:45:20 +00:00
Kubernetes Prow Robot
aee1a91896
Merge pull request #128644 from huww98/multi-volume-part-1
kubelet: don't check for mounted before update dsw PV size
2024-11-07 19:45:11 +00:00
Kubernetes Prow Robot
25101d33bc
Merge pull request #128518 from tallclair/pleg-watch-conditions
[FG:InPlacePodVerticalScaling] PLEG watch conditions: rapid polling for expected changes
2024-11-07 19:45:01 +00:00
Kubernetes Prow Robot
fb033826a8
Merge pull request #128170 from sanposhiho/async-preemption
feature(KEP-4832): asynchronous preemption
2024-11-07 19:44:54 +00:00
Kubernetes Prow Robot
9660e5c4cd
Merge pull request #127360 from knight42/feat/split-stdout-stderr-server-side
API: add a new `Stream` field to `PodLogOptions`
2024-11-07 19:44:45 +00:00
Jordan Liggitt
070f74bbdb
Add definitions for non-dockerized proto generation 2024-11-07 14:17:26 -05:00
Jordan Liggitt
c68a11a09f
Deflake supported key ack 2024-11-07 14:14:39 -05:00
Jordan Liggitt
dc41c91a07
Move broadcast of key updates into sync 2024-11-07 14:07:33 -05:00
Sotiris Salloumis
68fcc9cf8a Fix slow reconcile when quickly reverting resize patch 2024-11-07 19:51:47 +01:00
Laura Lorenz
a0b83a7741 Maintain 10 minute recovery threshold for container backoff
Signed-off-by: Laura Lorenz <lauralorenz@google.com>
2024-11-07 18:46:11 +00:00
Kubernetes Prow Robot
9c571abeec
Merge pull request #128658 from stlaz/ctb_alpha_extend
clustertrustbundles: extend alpha for 2 more releases
2024-11-07 18:32:58 +00:00
Kubernetes Prow Robot
7243fa60e1
Merge pull request #128641 from benluddy/e2e-cbor-client-compat
KEP-4222: Fix JSON fallback for clients using default content-type and add E2E client test.
2024-11-07 18:32:50 +00:00
Kubernetes Prow Robot
631d83b56f
Merge pull request #128569 from tenzen-y/add-job-evaluation-orders-comment
Job: Add evaluation step comments in the syncJob
2024-11-07 18:32:43 +00:00
Kubernetes Prow Robot
50362ac7d0 Promote StatefulSetAutoDeletePVC to stable for 1.32. 2024-11-07 09:43:49 -08:00
Kensei Nakada
6e19d5ac79 fix: move the queueing integration tests to another package to address the timeout issue 2024-11-08 02:33:57 +09:00
Stanislav Láznička
a4b83e77d9
trustbundles: add a new kube-apiserver-serving signer 2024-11-07 18:07:06 +01:00
Kubernetes Prow Robot
af41aa1d9f
Merge pull request #128664 from carlory/fix-node
mark the status of fake node as NotReady
2024-11-07 16:10:52 +00:00
Kubernetes Prow Robot
a660b51201
Merge pull request #128498 from googs1025/flake/TestReconcilerAPIServerLeaseMultiCombined
flake(TestReconcilerAPIServerLeaseMultiCombined): fix TestReconcilerAPIServerLeaseMultiCombined flake
2024-11-07 16:10:43 +00:00
Kubernetes Prow Robot
e5f5975f96
Merge pull request #128472 from sanposhiho/qhint-beta
feat: graduate SchedulerQueueingHints to beta
2024-11-07 16:10:36 +00:00
Kubernetes Prow Robot
c93ba4e96f
Merge pull request #124817 from carlory/cleanup-VolumePluginMgr
remove loggedDeprecationWarnings from VolumePluginMgr
2024-11-07 16:10:28 +00:00
Francesco Romani
2a99bfc3d1 node: cm: don't share containerMap instances between managers
Since the GA graduation of memory manager in https://github.com/kubernetes/kubernetes/pull/128517
we are sharing the initial container map across managers.

The intention of this sharing was not to actually share a data
structure, but
1. save the relatively expensive relisting from runtime
2. have all the managers share a consistent view - even though the
   chance for misalignement tend to be tiny.

The unwanted side effect though is now all the managers race
to modify a data shared, not thread safe data structure.

The fix is to clone (deepcopy) the computed map when passing it
to each manager. This restores the old semantic of the code.

This issue brings the topic of possibly managers go out of sync
since each of them maintain a private view of the world.
This risk is real, yet this is how the code worked for
most of the lifetime, so the plan is to look at this and evaluate
possible improvements later on.

Signed-off-by: Francesco Romani <fromani@redhat.com>
2024-11-07 16:02:55 +01:00
Jordan Liggitt
ecb5fc450b
Update staging docs to add externaljwt 2024-11-07 09:59:27 -05:00
Kubernetes Prow Robot
33c64b380a
Merge pull request #128646 from pohly/dra-kubelet-separate-beta-api
DRA kubelet: separate beta and alpha gRPC APIs
2024-11-07 14:57:45 +00:00
Kubernetes Prow Robot
e30492f77a
Merge pull request #128495 from olyazavr/refresh-probed-plugins
refresh probed plugins on init to avoid probe race/erroneous unmounts
2024-11-07 14:57:37 +00:00
Omer Aplatony
9d816f1587
Replace PollImmediate with PollUntilContextTimeout (#128147)
* Replace PollImmediate with PollUntilContextTimeout

Signed-off-by: Omer Aplatony <omerap12@gmail.com>

* Add context to RetryErrorCondition function

Signed-off-by: Omer Aplatony <omerap12@gmail.com>

* lint: fix error comparison in scale package

Signed-off-by: Omer Aplatony <omerap12@gmail.com>

* Fix RetryErrorCondition function signature

Signed-off-by: Omer Aplatony <omerap12@gmail.com>

* revert to if err statement

Signed-off-by: Omer Aplatony <omerap12@gmail.com>

---------

Signed-off-by: Omer Aplatony <omerap12@gmail.com>
2024-11-07 14:57:29 +00:00
Yuki Iwai
5dda60ee4e Job: Add evaluation step comments in the syncJob
Signed-off-by: Yuki Iwai <yuki.iwai.tz@gmail.com>
2024-11-07 23:36:44 +09:00
Stanislav Láznička
9705024349
make update 2024-11-07 15:33:39 +01:00
Stanislav Láznička
c8b5401175
move CTB alpha deprecation 2 releases 2024-11-07 15:33:34 +01:00