10980 Commits

Author SHA1 Message Date
Kubernetes Prow Robot
f82670d8ec Merge pull request #120680 from ruiwen-zhao/pod-start-bucket
Use a wider-range of metric buckets for PodStartDuration
2023-10-25 20:16:34 +02:00
Jan Safranek
e511edf11f Fix SELinux unit tests
Use device mountable volume, to make it impossible to share the same global
mount with different SELinux contexts.

And fix pod2Name to actually refer to pod2.
2023-10-25 10:43:29 +02:00
Jan Safranek
2f5903b4cf Move SELinux warning metric to be counted once per pod
volume_manager_selinux_volume_context_mismatch_warnings_total should be
counted only once per volume + pod. The previous location is evaluated
periodically, so bump the metric only when a new pod is added to volume.
2023-10-25 10:43:29 +02:00
Kubernetes Prow Robot
8453eb0c24 Merge pull request #121069 from jsafrane/ocp-add-plugin-label
Add volume plugin label to SELinux metrics
2023-10-25 08:13:20 +02:00
Kubernetes Prow Robot
0b884c0e3a Merge pull request #121494 from aojea/fix/flake-unit-test
Fix/flake TestNewManagerImplStartProbeMode unit test
2023-10-25 06:57:04 +02:00
Antonio Ojea
8e0be64b8f remove data race on the devicemanager client plugin
Change-Id: I45b85440a792e5ed2f75a344ec1f0332854d8d6d
2023-10-24 21:35:13 +00:00
Kubernetes Prow Robot
413b15a8d9 Merge pull request #120914 from kannon92/kubelet-disk-api-cri-update
Kubelet disk api cri update
2023-10-24 21:00:40 +02:00
Kubernetes Prow Robot
87981480f3 Merge pull request #119186 from seans3/stream-translator-proxy
Stream Translator Proxy and FallbackExecutor for WebSockets
2023-10-24 17:10:34 +02:00
Kubernetes Prow Robot
6d7d249372 Merge pull request #121077 from chrishenzie/readwriteoncepod-ga
Graduate ReadWriteOncePod to GA
2023-10-24 05:26:05 +02:00
Kubernetes Prow Robot
441d4b54ae Merge pull request #120397 from ty-dc/StaticCheck
cleanup: omit comparison with bool constants
2023-10-24 05:25:52 +02:00
Shiming Zhang
35f4d29d73 Fix unit test
2023-10-24 11:06:35 +08:00
Sean Sullivan
168998e87b StreamTranslator and FallbackExecutor for WebSockets
2023-10-23 15:33:38 -07:00
Kubernetes Prow Robot
e1824b6a47 Merge pull request #117615 from aheng-ch/checkpoint
Fix: do not assign an empty value to the resource (CPU or memory) if it's not defined in the container
2023-10-24 00:30:08 +02:00
Kevin Hannon
8ae0d390d7 add container filesystem to CRI api
Co-authored-by: Mike Brown <brownwm@us.ibm.com>
2023-10-23 15:04:30 -04:00
Kubernetes Prow Robot
76fc18c528 Merge pull request #120099 from TommyStarK/gh_119469
dra: refactoring overall flow of prepare/unprepare resources
2023-10-23 19:51:53 +02:00
TommyStarK
55e3662b72 dra: refactoring overall flow of prepare/unprepare resources
Signed-off-by: TommyStarK <thomasmilox@gmail.com>
2023-10-23 15:11:27 +02:00
Kubernetes Prow Robot
f41ede6241 Merge pull request #118534 from swatisehgal/sample-dp-register-by-default
node: sample-device-plugin: register to kubelet by default and ensure re-registration to kubelet on kubelet restarts
2023-10-23 13:41:19 +02:00
Kubernetes Prow Robot
3aec335a8f Merge pull request #121207 from cyclinder/sysctl_log_level
kubelet/sysctl: update log level
2023-10-21 01:48:53 +02:00
Kubernetes Prow Robot
d916a83d31 Merge pull request #120185 from charles-chenzz/tc_for_localstoragecapisolation
eviction manager: add test case with localStorageCapacityIsolation open
2023-10-20 23:50:46 +02:00
Chris Henzie
2dbd405583 Graduate ReadWriteOncePod to GA
2023-10-20 10:40:39 -07:00
Kubernetes Prow Robot
7aa89084a3 Merge pull request #121240 from HirazawaUi/add-safe-sysctl
Add ipv4 TCP settings safe sysctls
2023-10-20 03:10:34 +02:00
Kubernetes Prow Robot
de054fbf94 Merge pull request #120942 from sairameshv/evented_pleg_fix
Fix issue in enabling evented pleg feature gate
2023-10-19 10:45:22 +02:00
Kubernetes Prow Robot
cfafffa611 Merge pull request #121019 from kl52752/rate-limiting
Move grpc rate limiter from podresource folder
2023-10-19 08:15:26 +02:00
Kubernetes Prow Robot
4734021993 Merge pull request #121258 from saschagrunert/cri-fs-err
Populate CRI filesystem info error
2023-10-19 04:03:45 +02:00
Kubernetes Prow Robot
39697a9f3b Merge pull request #120782 from PI-Victor/fix/refactor_port_resolver_test
kubelet/lifecycle handlers: refactor port resolver
2023-10-19 04:03:26 +02:00
Kubernetes Prow Robot
3cb3e8b7dc Merge pull request #116892 from SataQiu/fix-kubelet-20230323
kubelet: perform the admission checks that preemption will not help first to avoid meaningless pod eviction
2023-10-19 02:47:50 +02:00
Rodrigo Campos
2508f468a8 kubelet/userns: Add more unit tests
This covers all public methods and overall test coverage is above 80%
again.

Signed-off-by: Rodrigo Campos <rodrigoca@microsoft.com>
2023-10-18 11:59:54 +02:00
Kubernetes Prow Robot
a7b8357a55 Merge pull request #118165 from champly/master
kubelet: fix comment typo
2023-10-17 23:28:25 +02:00
Kubernetes Prow Robot
0095ae3b25 Merge pull request #120195 from Ithrael/fix/error-handling-condition-in-test
fix(test): fix error handling condition in test
2023-10-17 20:08:01 +02:00
Kubernetes Prow Robot
3d77b95bcf Merge pull request #118704 from dgl/crio-socket-fix
Match on cri-o socket suffix only
2023-10-17 20:07:52 +02:00
Kubernetes Prow Robot
7824ac0f3e Merge pull request #114336 from claudiubelu/fixes-test-get-file-type
unittests: Fixes hostutil.GetFileType for Windows
2023-10-17 20:07:39 +02:00
Kubernetes Prow Robot
639f63c4e5 Merge pull request #121261 from kannon92/revert-119882-podres-client-wait
Revert "podresources: e2e: force eager connection"
2023-10-17 16:14:29 +02:00
Swati Sehgal
9a354fc9d0 node: sample-dp: Add retry to handle device plugin restart failure
Add retry mechanism to handle cases where after kubelet restarts, the device
plugin unix socket(s) were created but not ready to serve yet.

Signed-off-by: Swati Sehgal <swsehgal@redhat.com>
2023-10-17 12:19:10 +01:00
Swati Sehgal
d0d133298d node: sample-dp: Use fsnotify for kubelet restart detection
Add kubeletSocket file to fsnotify instead of polling and waiting for deletion
of device plugin unix socket as a way of detecting kubelet restart. We need to
ensure that the device plugin re-registers itself after kubelet restart depending
on the configured registration mode (auto-registration or controller registration).

Signed-off-by: Swati Sehgal <swsehgal@redhat.com>
2023-10-17 12:19:10 +01:00
Swati Sehgal
211d8cc80a node: sample-dp: stubRegisterControlFunc for controlling registration
If the user specifies the intent to control registration process, we rely on
registration triggers (deletion of control file) to prompt registration.

This behaviour is expected to be consistent across kubelet restarts and therefore
across the watch calls where we watch for changes to the unix socket so we make
this part of Stub object instead of a parameter.

Co-authored-by: Francesco Romani <fromani@redhat.com>
Signed-off-by: Swati Sehgal <swsehgal@redhat.com>
2023-10-17 12:19:10 +01:00
Swati Sehgal
c4c9d61d66 node: sample-dp: Handle re-registration for controlled registrations
In case `REGISTER_CONTROL_FILE` is specified, we want to ensure that the
registration is triggered by deletion of the control file. This is
applicable both when the registration happens for the first time and
subsequent ones because of kubelet restarts.

Signed-off-by: Swati Sehgal <swsehgal@redhat.com>
2023-10-17 12:19:07 +01:00
Swati Sehgal
6714e678d3 node: sample-dp: register by default and re-register on restarts
In issue: 115107 we added an environment variable to control the registration of sample
device plugin to kubelet. The intent of this patch is to ensure that the default
behaviour of the plugin is to register to kubelet (in case no environment
variable is specified).

In addition to that, we want to ensure that the plugin registers itself not just once.
It should re-register itself to kubelet in case of node reboot or kubelet restarts.

Signed-off-by: Swati Sehgal <swsehgal@redhat.com>
2023-10-17 12:14:09 +01:00
Sai Ramesh Vanka
1715fc0ca0 Fix issue in enabling evented pleg feature gate
Fixes https://github.com/kubernetes/kubernetes/issues/120941
The GetNewerThan() call does not block until the pod status/cache is updated, and it returns an empty pod status.
Hence, whenever the `SyncLoop ADD/UPDATE/RECONCILE` functions are called multiple times within a very short time interval,
the kubelet makes multiple `CreateContainer` CRI API calls, which results in the creation of duplicate containers within a given pod.
The initially created container stays `Running` and the later container stays `Exiting`, leaving the pod in the `CrashLoopBackOff` state forever.

Signed-off-by: Sai Ramesh Vanka <svanka@redhat.com>
2023-10-17 13:07:01 +05:30
Kubernetes Prow Robot
873eb57a4e Merge pull request #119522 from YTGhost/validation-for-static-pods-name
add validation for static pods to have a name
2023-10-17 03:38:15 +02:00
Kubernetes Prow Robot
c5815fee72 Merge pull request #113825 from harche/ep_comments
Keep PLEG interruptions in a separate interface
2023-10-17 03:37:57 +02:00
Kubernetes Prow Robot
f6ba25fdbd Merge pull request #119026 from AxeZhan/sleepAction
Introducing Sleep Action for PreStop Hook
2023-10-16 21:19:44 +02:00
Kubernetes Prow Robot
c7d270302c Merge pull request #121059 from matte21/improve_err_message_in_cpu_assignments
Improve error message in Kubelet CPU assignment logic
2023-10-16 16:48:54 +02:00
Kubernetes Prow Robot
0de29e1d43 Merge pull request #120911 from gjkim42/devicemanager-remove-deprecated-sets-string
pkg/kubelet/cm: Remove deprecated sets.String and sets.Int
2023-10-16 16:48:40 +02:00
Kevin Hannon
dd9c3358f5 Revert "podresources: e2e: force eager connection"
2023-10-16 09:46:04 -04:00
Sascha Grunert
39dcad8a19 Populate CRI filesystem info error
Usually we just log the error but since it's used by the GC we now
populate it up the call stack.

Signed-off-by: Sascha Grunert <sgrunert@redhat.com>
2023-10-16 13:09:04 +02:00
HirazawaUi
1132fd0afd add tcp_fin_timeout, tcp_keepalive_intvl and tcp_keepalive_probes to safe sysctls
2023-10-15 23:05:40 +08:00
AxeZhan
3a96afdfef implementation
2023-10-15 13:57:48 +08:00
Kubernetes Prow Robot
378866edba Merge pull request #120518 from saschagrunert/metrics-container-start
kubelet: fix metric `container_start_time_seconds` timestamp
2023-10-15 07:05:37 +02:00
Kubernetes Prow Robot
95bd8b95a7 Merge pull request #100448 from saschagrunert/cri-stats-log
Do not error log CRI stats for not cached partitions
2023-10-14 23:49:12 +02:00
Kubernetes Prow Robot
4911aad463 Merge pull request #115702 from xyz-li/master
Fix: kubelet will not output logs after log file is rotated
2023-10-14 22:42:04 +02:00