Commit Graph

126634 Commits

Author SHA1 Message Date
holder
39726b119f fix: fix state validate error after memorymanager with static policy start
(cherry picked from commit b91951f847d0b159c9d8ef32688cc96489ac1884)
2024-10-29 12:08:16 +02:00
Patrick Ohly
4419568259 DRA: treat AdminAccess as a new feature gated field
Using the "normal" logic for a feature gated field simplifies the
implementation of the feature gate.

There is one (entirely theoretic!) problem with updating from 1.31: if a claim
was allocated in 1.31 with admin access, the status field was not set because
it didn't exist yet. If a driver now follows the current definition of "unset =
off", then it will not grant admin access even though it should. This is
theoretic because drivers are starting to support admin access with 1.32, so
there shouldn't be any claim where this problem could occur.
2024-10-29 10:22:31 +01:00
Patrick Ohly
9a7e4ccab2 DRA admin access: add feature gate
The new DRAAdminAccess feature gate has the following effects:
- If disabled in the apiserver, the spec.devices.requests[*].adminAccess
  field gets cleared. Same in the status. In both cases the scenario
  that it was already set and a claim or claim template get updated
  is special: in those cases, the field is not cleared.

  Also, allocating a claim with admin access is allowed regardless of the
  feature gate and the field is not cleared. In practice, the scheduler
  will not do that.
- If disabled in the resource claim controller, creating ResourceClaims
  with the field set gets rejected. This prevents running workloads
  which depend on admin access.
- If disabled in the scheduler, claims with admin access don't get
  allocated. The effect is the same.

The alternative would have been to ignore the fields in claim controller and
scheduler. This is bad because a monitoring workload then runs, blocking
resources that probably were meant for production workloads.
2024-10-29 09:50:11 +01:00
Patrick Ohly
f3fef01e79 DRA API: AdminAccess in DeviceRequestAllocationResult
Drivers need to know that because admin access may also grant additional
permissions. The allocator needs to ignore such results when determining which
devices are considered as allocated.

In both cases it is conceptually cleaner to not rely on the content of the
ClaimSpec.
2024-10-29 09:50:07 +01:00
Kubernetes Prow Robot
5f594f4215
Merge pull request #128401 from tenzen-y/use-same-receiver-name
Job: Consistentely use the same reveiver name in the controller
2024-10-29 08:16:55 +00:00
Kubernetes Prow Robot
66b3dc1a38
Merge pull request #128400 from tenzen-y/use-uid-typed-instead-of-string
Job: Refactor uncountedTerminatedPods to avoid casting everywhere
2024-10-29 06:32:56 +00:00
yunwang
b00f221a85 fix InPlacePodVerticalScaling restore bug: the content wrote to and read from file pod_status are different 2024-10-29 13:39:58 +08:00
Yuki Iwai
d4959d8d29 Job: Consistentely use the same reveiver name in the controller
Signed-off-by: Yuki Iwai <yuki.iwai.tz@gmail.com>
2024-10-29 14:11:10 +09:00
Yuki Iwai
a23e7a42d3 Job: Refactor uncountedTerminatedPods to avoid casting everywhere
Signed-off-by: Yuki Iwai <yuki.iwai.tz@gmail.com>
2024-10-29 13:12:35 +09:00
jessestutler
f7003c76b4 Refactor the dynamicResources struct to DynamicResources 2024-10-29 11:44:42 +08:00
yunwang0911
114d4df4b4
Update pkg/kubelet/status/state/state_checkpoint.go
Co-authored-by: Tim Allclair <timallclair@gmail.com>
2024-10-29 11:39:54 +08:00
yunwang0911
8edc80c470
Update pkg/kubelet/status/state/state_checkpoint_test.go
Co-authored-by: Tim Allclair <timallclair@gmail.com>
2024-10-29 11:39:41 +08:00
yunwang0911
9d18e900c8
Update pkg/kubelet/status/state/checkpoint.go
Co-authored-by: Tim Allclair <timallclair@gmail.com>
2024-10-29 11:39:32 +08:00
yunwang0911
b67a5623b1
Update pkg/kubelet/status/state/checkpoint.go
Co-authored-by: Tim Allclair <timallclair@gmail.com>
2024-10-29 11:39:18 +08:00
zhifei92
5c01709387 Fix the apiserver panic caused by adding a container when updating a pod
add unit test

refactor:  Merge the test cases into TestMarkPodProposedForResize.

chore:  Add the comment and fix the test names
2024-10-29 10:52:46 +08:00
Kubernetes Prow Robot
ce47f7b416
Merge pull request #128395 from thockin/master
Mark api/openapi-spec/**/*.json as generated files
2024-10-29 02:28:54 +00:00
Kubernetes Prow Robot
685b8b3ba1
Merge pull request #126981 from kannon92/stable-empty-dir-promotion
KEP-1967: promote size backed memory volumes to stable
2024-10-29 01:00:54 +00:00
Kubernetes Prow Robot
36081ff24f
Merge pull request #128321 from yongruilin/reset-label-allow-list
feat(metrics): Add method to reset label allow lists
2024-10-28 23:56:59 +00:00
Kubernetes Prow Robot
961344ddb2
Merge pull request #128124 from PiotrProkop/topology-manager-options-stable
topologymanager: Promote support for improved multi-numa alignment in Topology Manager to GA
2024-10-28 23:56:53 +00:00
Ben Luddy
f831368428
Test response content negotiation with CBOR enablement. 2024-10-28 19:21:21 -04:00
Kubernetes Prow Robot
a15840a34b
Merge pull request #128348 from sanposhiho/patch-12
Fix: use pod-high-priority.yaml to trigger preemption in PreemptionAsync test case
2024-10-28 22:19:17 +00:00
Kubernetes Prow Robot
30ee35516b
Merge pull request #128323 from benluddy/cbor-client-gen
KEP-4222: Support CBOR for generated clients based on client-go feature gating.
2024-10-28 22:19:08 +00:00
Kubernetes Prow Robot
86b99869cb
Merge pull request #128299 from SergeyKanzhelev/updateDHS
Update Device Health fields description for KEP-4680
2024-10-28 22:19:01 +00:00
Kubernetes Prow Robot
eb5c8965be
Merge pull request #128013 from seans3/admission-configuration-strict
AdmissionConfiguration now uses strict validation
2024-10-28 22:18:54 +00:00
Kubernetes Prow Robot
8aae9aabf3
Merge pull request #127661 from pohly/dra-resourceclaim-metrics
DRA resourceclaims: maintain metric of total and allocated claims
2024-10-28 21:12:53 +00:00
Tim Hockin
4ccb7265c4
Mark api/openapi-spec/**/*.json as generated files 2024-10-28 13:33:50 -07:00
Kubernetes Prow Robot
1e7f75292d
Merge pull request #126798 from borg-land/easy-rsa-patch
Use easyrsa from Github
2024-10-28 19:02:53 +00:00
Kubernetes Prow Robot
2fabf5fa63
Merge pull request #128029 from bouaouda-achraf/oom-adjust-score-pod-aware
kubelet(OOM-score-adj): change OOM score adjustment calculation for sidecar container
2024-10-28 17:42:54 +00:00
Ben Luddy
969357b99c
Update vendor (client now calls ParseAccept from goautoneg). 2024-10-28 13:15:28 -04:00
Ben Luddy
67b9dc1f3e
Wire client feature gates affecting RESTClient content config. 2024-10-28 13:15:28 -04:00
Kubernetes Prow Robot
78fed9dbcd
Merge pull request #128390 from aroradaman/netlink-bump
bump: github.com/vishvananda/netlink
2024-10-28 16:36:54 +00:00
Ed Bartosh
c1cd8495a5 kubelet: define custom buckets for DRA metrics 2024-10-28 18:04:51 +02:00
Patrick Ohly
9d1b0654e0 DRA: add wg/device-management label automatically
This makes PRs show up automatically in the WG's project
board (https://github.com/orgs/kubernetes/projects/95/views/1).
2024-10-28 16:36:04 +01:00
Daman Arora
440c6dfd9a bump: github.com/vishvananda/netlink
Signed-off-by: Daman Arora <aroradaman@gmail.com>
2024-10-28 20:36:14 +05:30
Kubernetes Prow Robot
646b0bc49b
Merge pull request #128354 from SataQiu/join-control-plane-e2e
kubeadm: support joining control plane nodes in dryrun mode without a real initialized control plane
2024-10-28 14:40:55 +00:00
Daman Arora
6657d220d3 proxy: cleanup UpdateServiceMapResult
Signed-off-by: Daman Arora <aroradaman@gmail.com>
2024-10-28 20:10:46 +05:30
Daman Arora
c398af07fa proxy: refactor UpdateEndpointsMapResult
Signed-off-by: Daman Arora <aroradaman@gmail.com>
2024-10-28 20:10:34 +05:30
Daman Arora
1ad8880c0f proxy/conntrack: reconciler
Signed-off-by: Daman Arora <aroradaman@gmail.com>
2024-10-28 20:08:53 +05:30
Daman Arora
ba3940c2e0 proxy/conntrack: interface update
Signed-off-by: Daman Arora <aroradaman@gmail.com>
2024-10-28 20:08:53 +05:30
SataQiu
dc48aed791 kubeadm: support joining control plane nodes in dryrun mode without a real initialized control plane 2024-10-28 21:37:58 +08:00
carlory
677e5709c9 remove unused vac code 2024-10-28 17:25:28 +08:00
Achraf BOUAOUDA
e6a518a644 kubelet(OOM-score-adj): change OOM score adjustment calculation for sidecar container 2024-10-28 09:12:59 +01:00
yunwang
cf8cdaf5a7 fix InPlacePodVerticalScaling restore bug: the content wrote to and read from file pod_status are different 2024-10-28 14:50:06 +08:00
Kubernetes Prow Robot
9ec52fc063
Merge pull request #128283 from bzsuni/bz/ut/kubeadm
kubeadm: Add UT for cmd/kubeadm/app/cmd/util/join.go
2024-10-28 06:42:54 +00:00
Michal Wozniak
cad648035a Job Pod Failure policy - cover testing of negative exit codes 2024-10-28 07:24:26 +01:00
Michal Wozniak
af772412be Job Pod Failure policy refactor e2e test to pass on Windows 2024-10-28 07:23:40 +01:00
bzsuni
80fe671cc9 Add UT for cmd/kubeadm/app/cmd/util/join.go
Signed-off-by: bzsuni <bingzhe.sun@daocloud.io>
2024-10-28 11:12:06 +08:00
Kubernetes Prow Robot
1148e5ee5f
Merge pull request #128363 from aojea/e2e_context_cancellation
e2e: stop controller if context cancelled
2024-10-27 22:28:53 +00:00
Antonio Ojea
ad0bc4857b e2e: stop controller if context cancelled 2024-10-27 21:23:00 +00:00
Ed Bartosh
9a044cd46a kubelet: intercept DRA GRPC to record metrics 2024-10-27 10:47:14 +02:00