Commit Graph

124291 Commits

Author SHA1 Message Date
Patrick Ohly
616a014347 DRA: move ResourceSlice publishing into DRA drivers
This is a first step towards making kubelet independent of the resource.k8s.io
API versioning because it now doesn't need to copy structs defined by that API
from the driver to the API server. The next step is removing the other
direction (reading ResourceClaim status and passing the resource handle to
drivers).

The drivers must get deployed so that they have their own connection to the API
server. Securing at least the writes via a validating admission policy should
be possible.

As before, the kubelet removes all ResourceSlices for its node at startup, then
DRA drivers recreate them if (and only if) they start up again. This ensures
that there are no orphaned ResourceSlices when a driver gets removed while the
kubelet was down.

While at it, logging gets cleaned up and updated to use structured, contextual
logging as much as possible. gRPC requests and streams now use a shared,
per-process request ID and streams also get logged.
2024-07-18 09:09:19 +02:00
Patrick Ohly
8d814298bb kubelet: grant permission for DeleteCollection
2e34e187c9 enabled kubelet to do List and Watch
requests with the caveat that kubelet should better use a field selector (which
it does). The same is now also needed for DeleteCollection because kubelet will
use that to clean up in one operation instead of using multiple.
2024-07-18 09:09:19 +02:00
Patrick Ohly
3d4bc44a2f dra e2e node: addd test case for ResourceSlice handling during kubelet startup
Any redundant object must get deleted, but not the ones of other names.
2024-07-18 09:09:19 +02:00
Michal Wozniak
1be4df6e02 Cleanup Job controller isPodFailed function 2024-07-18 09:08:23 +02:00
Kubernetes Prow Robot
1b3d7d06c5
Merge pull request #126102 from carlory/cleanup-after-JobPodFailurePolicy-ga
cleanup after JobPodFailurePolicy is promoted to GA
2024-07-17 21:27:06 -07:00
googs1025
a3978e8315 scheduler: Add ctx param and error return to EnqueueExtensions.EventsToRegister() 2024-07-18 12:22:17 +08:00
carlory
dae05f3b88 cleanup after JobPodFailurePolicy is promoted to GA 2024-07-18 10:00:56 +08:00
Kubernetes Prow Robot
d0545c8eb4
Merge pull request #126073 from a7i/fake-apply-scale-subresource
fix: fake clientset ApplyScale subresource from 'status' to 'scale'
2024-07-17 17:35:52 -07:00
Kubernetes Prow Robot
45cb3a1bd0
Merge pull request #126173 from bergerhoffer/cli-help
A few minor help text tweaks
2024-07-17 16:29:34 -07:00
Kubernetes Prow Robot
b68a58d372
Merge pull request #126141 from Nordix/esotsal/fix-126135
test/e2e_node:  Fix pod_resize tests in CI
2024-07-17 16:29:25 -07:00
Mark Rossetti
3683010a7c
Adding e2e test to validate memory-pressure eviction on Windows
Signed-off-by: Mark Rossetti <marosset@microsoft.com>
2024-07-17 15:11:30 -07:00
Mark Rossetti
0411a3d565
Add support for memory pressure evictiong on Windows
Signed-off-by: Mark Rossetti <marosset@microsoft.com>
2024-07-17 15:11:30 -07:00
Kubernetes Prow Robot
5d40866fae
Merge pull request #125994 from carlory/fix-job-api
clean up codes after PodDisruptionConditions was promoted to GA
2024-07-17 14:37:09 -07:00
Kubernetes Prow Robot
ef2cb782be
Merge pull request #126172 from haircommander/proc-mount-loosten-test
e2e_node: loosen proc mount test
2024-07-17 13:10:09 -07:00
Kubernetes Prow Robot
42e22cc99d
Merge pull request #126162 from danwinship/kube-proxy-config-owners
Add me to kube-proxy config api approvers
2024-07-17 13:10:00 -07:00
Kubernetes Prow Robot
b23f41e192
Merge pull request #125940 from thockin/master
Clarify errors in ProjectedVolume validation
2024-07-17 13:09:51 -07:00
Kubernetes Prow Robot
6aa300c48b
Merge pull request #125482 from mimowo/failure-policy-e2e-conformance
Promote JobPodFailurePolicy and PodDisruptionConditions e2e tests to Conformance
2024-07-17 13:09:37 -07:00
Kubernetes Prow Robot
d879103c28
Merge pull request #125820 from macsko/add_separate_lock_for_pod_nominator_scheduling_queue
Add a separate lock for pod nominator in scheduling queue
2024-07-17 12:06:10 -07:00
Kubernetes Prow Robot
ef8d67f865
Merge pull request #125809 from aojea/cloud_hostname
add unit tests for hostname node.status.addresses
2024-07-17 12:05:52 -07:00
Andrea Hoffer
5252f79f9b A few minor help text tweaks 2024-07-17 13:30:34 -04:00
Peter Hunt
3d8cb4fa89 e2e_node: loosen proc mount test
the exact number of lines/ro lines is not important, just that there are more than 0 ro lines
and more than 1 line total.

this helps accomodate different architectures that implement different kernel APIs

Signed-off-by: Peter Hunt <pehunt@redhat.com>
2024-07-17 13:26:23 -04:00
Kubernetes Prow Robot
c3bcd4fff0
Merge pull request #126139 from enj/enj/i/revert_list_cache
Revert "Move ConsistentListFromCache to Beta default"
2024-07-17 09:59:14 -07:00
Kubernetes Prow Robot
a8110d7174
Merge pull request #125259 from sohankunkerkar/bump-proc-mount-beta
[KEP-4265] promote ProcMountType feature gate to beta
2024-07-17 09:59:07 -07:00
Peter Hunt
f53069eac6 PSA: update test fixtures
Signed-off-by: Peter Hunt <pehunt@redhat.com>
2024-07-17 12:28:59 -04:00
Michal Wozniak
25d938b481 Promote JobPodFailurePolicy and PodDisruptionConditions e2e tests to Conformance 2024-07-17 18:13:08 +02:00
Patrick Ohly
6604ff94d8 kubelet: enhance podresources tests
The manual deep comparison code is hard to maintain (would need to be updated
in https://github.com/kubernetes/kubernetes/pull/125488) and error prone.

In fact, one test case failed when doing a full automatic comparison with
cmp.Diff because it wasn't setting allMemory.
2024-07-17 17:50:10 +02:00
Peter Hunt
840e4a82d4 PSA: allow container_engine_t selinux type
container_engine_t (a68865582e/container_contexts (L6))
is a type that is tuned to allow running container engines (podman/docker) inside of a container. Since it's among the recognized process types as
container_t, container_kvm_t, container_init_t, it should be recognized in the security standard as well

Signed-off-by: Peter Hunt <pehunt@redhat.com>
2024-07-17 11:44:51 -04:00
Kubernetes Prow Robot
8ba158c7f4
Merge pull request #126164 from soltysh/cleanup_runners_2
Cleanup unused test functions - cont-ed
2024-07-17 08:25:07 -07:00
Hemant Kumar
68df11f4f8 Use context of polling function for API calls in e2e 2024-07-17 10:59:55 -04:00
Hemant Kumar
2115c3e7d8 Fix e2e test with new resizer 2024-07-17 10:43:36 -04:00
Maciej Szulik
ce01dfc492
Cleanup unused test functions - cont-ed
Following up the work started in 0c0bd6d this is further cleaning up the
test/utils directory getting rid of unused functions.
2024-07-17 16:22:11 +02:00
Patrick Ohly
b9d00841a6 kubelet: improve checkpoint errors
Recording the expected and actual checksum in the error makes it possible to
provide that information, for example in a failed test like the ones for DRA.
Otherwise developers have to manually step through the test with a debugger to
figure out what the new checksum is.
2024-07-17 16:07:31 +02:00
Dan Winship
5c372faca6 Add me to kube-proxy config api approvers 2024-07-17 09:41:53 -04:00
Kubernetes Release Robot
03fe89c233 CHANGELOG: Update directory for v1.30.3 release 2024-07-17 11:51:02 +00:00
Kubernetes Release Robot
ff78057631 CHANGELOG: Update directory for v1.28.12 release 2024-07-17 10:51:59 +00:00
Kubernetes Prow Robot
89283e0219
Merge pull request #124958 from bells17/qhint-volume-binding-storageclass
volumebinding: scheduler queueing hints - StorageClass
2024-07-17 02:47:06 -07:00
Maciej Skoczeń
5def93b10a Add a separate lock for pod nominator in scheduling queue 2024-07-17 07:58:59 +00:00
Kubernetes Release Robot
29e4f5a893 CHANGELOG: Update directory for v1.27.16 release 2024-07-17 07:48:22 +00:00
Kubernetes Release Robot
bf9aab74b9 CHANGELOG: Update directory for v1.29.7 release 2024-07-17 07:26:06 +00:00
Kubernetes Prow Robot
ad72be434d
Merge pull request #125417 from bitoku/splitfs
KEP-4191: Split Image Filesystem add end-to-end tests
2024-07-16 23:27:06 -07:00
bells17
4c3c4128af
volumebinding: scheduler queueing hints - StorageClass 2024-07-17 15:03:17 +09:00
Sotiris Salloumis
3a01281d2f test/e2e_node: pod_resize tests
add NodeAlphaFeature label, as the feature is in alpha to be skipped in CI
add missing Arm64 check
2024-07-17 07:55:44 +02:00
Kubernetes Prow Robot
a6460c4f3e
Merge pull request #126036 from macsko/scheduler_perf_throughput_thresholds
Allow to set scheduling throughput thresholds in scheduler_perf tests
2024-07-16 21:43:13 -07:00
Kubernetes Prow Robot
9247a21be6
Merge pull request #124959 from bells17/qhint-volume-binding-pvc
volumebinding: scheduler queueing hints - PersistentVolumeClaim
2024-07-16 21:43:06 -07:00
Monis Khan
aeb51a16e3
Revert "Move ConsistentListFromCache to Beta default"
This reverts commit 0c0e19b343.

During stress test for SVM controller, the controller is unable to
make a list call due to following error:

resourceversion.go:155: I0716 21:49:26.973127] storage-version-migrator-controller: Error syncing SVM resource, retrying svm="crdsvm" err="error getting latest resourceVersion for stable.example.com/v1, Resource=testcrds: Timeout: Too large resource version: 28976, current: 20349"

With the feature disabled, the stress test passes.

Signed-off-by: Monis Khan <mok@microsoft.com>
2024-07-16 23:12:16 -04:00
Kubernetes Prow Robot
8aff9d3192
Merge pull request #126072 from aroradaman/proxy-config-v1alpah2-windows
kube-proxy: internal config: add Linux and Windows section
2024-07-16 19:37:12 -07:00
Kubernetes Prow Robot
a00c834ebf
Merge pull request #123303 from haircommander/proc-mount-e2e-tests
KEP-4265: add e2e tests for ProcMountType
2024-07-16 19:37:05 -07:00
Kubernetes Prow Robot
52c0ed4673
Merge pull request #124342 from zhifei92/fix-error-check
fix error checking in kl.killPod within SyncPod
2024-07-16 16:05:07 -07:00
Peter Hunt
a20a8225cf e2e_node: skip proc mount tests on nodes without userns support in the runtime
Signed-off-by: Peter Hunt <pehunt@redhat.com>
Co-authored-by: Sohan Kunkerkar <sohank2602@gmail.com>
2024-07-16 17:46:23 -04:00
Peter Hunt
4f57a1448d test/e2e: add test to ensure proc mount is stripped from windows pods
Signed-off-by: Peter Hunt <pehunt@redhat.com>
2024-07-16 17:46:23 -04:00