Commit Graph

128012 Commits

Author SHA1 Message Date
Jan Safranek
9ebe4422a9 Add support for Debian and Ubuntu
Both Debian and Ubuntu + containerd can run with SELinux enabled. Add their
defaults to the SELinux e2e tests.
2025-02-07 14:42:27 +01:00
Kubernetes Prow Robot
20b12ad5c3
Merge pull request #129685 from swatisehgal/cpu-mgr-logs-improvements
CPU Manager logging improvements
2025-02-07 03:50:02 -08:00
Kubernetes Prow Robot
e094e5e89c
Merge pull request #129684 from swatisehgal/mm-mgr-logs-improvements
Memory Manager logging improvements
2025-02-07 03:49:55 -08:00
Daman Arora
6971779d0d pkg/proxy/conntrack: refactor log levels for conntrack cleanup
Signed-off-by: Daman Arora <aroradaman@gmail.com>
2025-02-07 15:35:26 +05:30
Daman Arora
07c279d06b bump netlink
Signed-off-by: Daman Arora <aroradaman@gmail.com>
2025-02-07 15:35:26 +05:30
Kubernetes Prow Robot
a02fe24385
Merge pull request #129341 from pohly/log-client-go-watch
client-go watch: context support
2025-02-07 01:35:55 -08:00
SataQiu
7edc7fbd47 kubeadm: add --etcd-upgrade flag to kubeadm upgrade plan 2025-02-07 15:40:00 +08:00
Kubernetes Prow Robot
bfde2edba7
Merge pull request #126423 from AndrewSirenko/negative-e2e
Add volume modify recovery e2e test
2025-02-06 23:05:55 -08:00
carlory
dbefbd593e remove unnecessary unnecessary 2025-02-07 15:03:32 +08:00
Kubernetes Prow Robot
ee22760391
Merge pull request #130017 from aramase/aramase/f/enable_serviceaccountnodeaudiencerestriction_1.33
Enable ServiceAccountNodeAudienceRestriction feature gate by default in v1.33
2025-02-06 14:40:06 -08:00
Kubernetes Prow Robot
7b38ff48af
Merge pull request #129993 from aramase/aramase/i/fix_129935
Fix service account node audience restriction for in-tree pv to csi migration
2025-02-06 14:39:55 -08:00
Kubernetes Prow Robot
e6be5f9602
Merge pull request #129416 from siyuanfoundation/refactor
KEP-4330: Refactor compatibility version code
2025-02-06 12:17:56 -08:00
Anish Ramasekar
cd9fc8bc71
Enable ServiceAccountNodeAudienceRestriction feature gate by default in v1.33
Signed-off-by: Anish Ramasekar <anish.ramasekar@gmail.com>
2025-02-06 14:11:17 -06:00
Anish Ramasekar
5738ee4def
Disable ServiceAccountNodeAudienceRestriction feature gate by default in v1.32
Signed-off-by: Anish Ramasekar <anish.ramasekar@gmail.com>
2025-02-06 14:10:20 -06:00
Anish Ramasekar
62809dd0de
node audience restriction: use csi translator to convert intree inline_vol/pv to csi
Signed-off-by: Anish Ramasekar <anish.ramasekar@gmail.com>
2025-02-06 13:17:16 -06:00
Swati Sehgal
ecd67e2537 node: memory-mgr: Adhere to the message style guidelines
Ensure that the log messages adhere to the message style guildelines
as captured [here](https://github.com/kubernetes/community/blob/master/contributors/devel/sig-instrumentation/logging.md#message-style-guidelines).

Signed-off-by: Swati Sehgal <swsehgal@redhat.com>
2025-02-06 18:53:08 +00:00
Anish Ramasekar
d6c50c391d
node audience restriction: add unit tests to simulate intree inline_vol/pv to csi failure
Signed-off-by: Anish Ramasekar <anish.ramasekar@gmail.com>
2025-02-06 12:13:34 -06:00
Swati Sehgal
f449697457 node: device-mgr: Adhere to the message style guidelines
Ensure that the log messages adhere to the message style guildelines
as captured [here](https://github.com/kubernetes/community/blob/master/contributors/devel/sig-instrumentation/logging.md#message-style-guidelines).

Signed-off-by: Swati Sehgal <swsehgal@redhat.com>
2025-02-06 18:13:10 +00:00
Swati Sehgal
40c86d8e06 node: device-mgr: Ensure consistent use of named arguments
Throughout the devicemanager codebase the named argument to represent
resource for logging pupose is `resourceName` as opposed to `resource`.
The latter can only be seen in topology_hints.go files. To ensure consistency
with the rest of the codebase and also because we want to adhere to the
recommendations in the Kubernetes documentation about named arguments:
https://github.com/kubernetes/community/blob/master/contributors/devel/sig-instrumentation/migration-to-structured-logging.md#name-arguments
we update the key from `resource` to `resourceName`.

Signed-off-by: Swati Sehgal <swsehgal@redhat.com>
2025-02-06 17:22:24 +00:00
Swati Sehgal
19e406a357 node: device-mgr: Update klog.Infof(..., err) to klog.ErrorS(err,...)
Signed-off-by: Swati Sehgal <swsehgal@redhat.com>
2025-02-06 17:21:53 +00:00
Swati Sehgal
9cd041156f node: device-mgr: Add logs in the happy path
We have reasonable amount of logs when things go wrong.
While debugging, it can be useful to have logs to indicate that
things have gone as expected.

Signed-off-by: Swati Sehgal <swsehgal@redhat.com>
2025-02-06 17:21:53 +00:00
Swati Sehgal
a585bd74c4 node: device-mgr: Keep log level consistent across all gRPC calls
Signed-off-by: Swati Sehgal <swsehgal@redhat.com>
2025-02-06 17:21:53 +00:00
Swati Sehgal
4b613ae96f node: device-mgr: Bump log level for expected skips
Signed-off-by: Swati Sehgal <swsehgal@redhat.com>
2025-02-06 17:21:53 +00:00
Swati Sehgal
584e224b43 node: device-mgr: Add metadata to logs
Ensure that if possible, we provide sufficient metadata
inclusing pod name and UID to allow filtering by pod name or its
UID.

Signed-off-by: Swati Sehgal <swsehgal@redhat.com>
2025-02-06 17:21:49 +00:00
Swati Sehgal
a10b3c3555 node: memory-mgr: Add logs when memory allocation is skipped due to QoS
Signed-off-by: Swati Sehgal <swsehgal@redhat.com>
2025-02-06 16:42:19 +00:00
Swati Sehgal
6240febf4c node:memory-mgr: Add logs on the happy path
We have reasonable amount of logs when things go wrong.
While debugging, it can be useful to have logs to indicate that
things have gone as expected especially when it comes to
important events like successful startup of memory manager
and successful allocation of resources.

Signed-off-by: Swati Sehgal <swsehgal@redhat.com>
2025-02-06 16:41:34 +00:00
Swati Sehgal
ee7f2616c6 node: memory-mgr: Update klog.Infof(..., err) to klog.ErrorS(err,...)
Signed-off-by: Swati Sehgal <swsehgal@redhat.com>
2025-02-06 16:41:02 +00:00
Swati Sehgal
07d83acce5 node: memory-mgr: Update log levels and add logs to capture state update
Signed-off-by: Swati Sehgal <swsehgal@redhat.com>
2025-02-06 16:41:02 +00:00
Swati Sehgal
34fd61a3ef node: memory-mgr: Add metadata to logs
Ensure that whereever possible, we provide sufficient metadata
inclusing pod name and UID to allow filtering by pod name or its
UID.

Signed-off-by: Swati Sehgal <swsehgal@redhat.com>
2025-02-06 16:40:56 +00:00
Swati Sehgal
7997c93cfd node: cpu-mgr: Adhere to the message style guidelines
Ensure that the log messages adhere to the message style guildelines
as captured [here](https://github.com/kubernetes/community/blob/master/contributors/devel/sig-instrumentation/logging.md#message-style-guidelines).

Signed-off-by: Swati Sehgal <swsehgal@redhat.com>
2025-02-06 16:30:02 +00:00
Swati Sehgal
ca2c46a273 node: cpu-mgr: Add logs when CPU allocation is skipped
CPU Allocation is skipped in CPU Manager with static policy
in case the pod doesn't belong to Guaranteed QoS or the CPUs
requested are not integral.

We add logs to capture these skips.

Signed-off-by: Swati Sehgal <swsehgal@redhat.com>
2025-02-06 16:26:40 +00:00
Swati Sehgal
01a546fe53 node: cpu-mgr: Add logs on the happy path
While debugging, it can be useful to have logs to indicate that
things have gone as expected especially when it comes to
important events like successful startup of CPU manager
and successful allocation of resources.

Signed-off-by: Swati Sehgal <swsehgal@redhat.com>
2025-02-06 16:25:07 +00:00
Filipe Xavier
14c0bc19ac kubelet: improve allocated resources checkpointing
changed calls to set allocation from container level to pod level on status manager.
2025-02-06 09:20:39 -03:00
Kubernetes Prow Robot
491a23f079
Merge pull request #129999 from pohly/test-e2e-node-timeout
E2E node: fix --timeout default
2025-02-06 03:59:55 -08:00
Patrick Ohly
46a17f60e4 E2E node: fix --timeout default
For unknown reasons, hack/make-rules/test-e2e-node.sh adds -timeout instead of
--timeout. Therefore the fallback code in test/e2e_node/remote/remote.go didn't
find it and added its own --timeout=60m after it. This effectively limits E2E
node test runs to 60 minutes, regardless of what is specified in the job:

    W0206 09:53:51.425532    7151 remote.go:158] ginkgo flags are missing explicit --timeout (ginkgo defaults to 60 minutes)
    I0206 09:53:51.425565    7151 remote.go:165] updated ginkgo flags: -timeout=24h --label-filter="Feature: containsAny DynamicResourceAllocation && Feature: isSubsetOf { Beta, DynamicResourceAllocation } && !Flaky && !Slow"  --no-color -v --timeout=60m
    ...
    I0206 09:53:57.767096    7151 ssh.go:146] Running the command ssh, with args: ... timeout -k 30s 3600.000000s ./ginkgo -timeout=24h --label-filter="Feature: containsAny DynamicResourceAllocation && Feature: isSubsetOf { Beta, DynamicResourceAllocation } && !Flaky && !Slow"  --no-color -v --timeout=60m ...

Note that the timeout for the test was 60m in this case (hence the "timeout -k
30s 3600.000000s") but it could also be something larger.
2025-02-06 11:45:12 +01:00
Kubernetes Prow Robot
9a03243789
Merge pull request #129929 from serathius/deprecate-separate-rpc
Flip SeparateCacheWatchRPC feature gate to false and deprecate it
2025-02-05 17:18:16 -08:00
Siyuan Zhang
8fc3a33454 Refactor compatibility version code
Replace DefaultComponentGlobalsRegistry with new instance of componentGlobalsRegistry in test api server.

Signed-off-by: Siyuan Zhang <sizhang@google.com>

move kube effective version validation out of component base.

Signed-off-by: Siyuan Zhang <sizhang@google.com>

move DefaultComponentGlobalsRegistry out of component base.

Signed-off-by: Siyuan Zhang <sizhang@google.com>

move ComponentGlobalsRegistry out of featuregate pkg.

Signed-off-by: Siyuan Zhang <sizhang@google.com>

remove usage of DefaultComponentGlobalsRegistry in test files.

Signed-off-by: Siyuan Zhang <sizhang@google.com>

change non-test DefaultKubeEffectiveVersion to use DefaultBuildEffectiveVersion.

Signed-off-by: Siyuan Zhang <sizhang@google.com>

Restore useDefaultBuildBinaryVersion in effective version.

Signed-off-by: Siyuan Zhang <sizhang@google.com>

rename DefaultKubeEffectiveVersion to DefaultKubeEffectiveVersionForTest.

Signed-off-by: Siyuan Zhang <sizhang@google.com>

pass options.ComponentGlobalsRegistry into config for controller manager and scheduler.

Signed-off-by: Siyuan Zhang <sizhang@google.com>

Pass apiserver effective version to DefaultResourceEncodingConfig.

Signed-off-by: Siyuan Zhang <sizhang@google.com>

change statusz registry to take effective version from the components.

Signed-off-by: Siyuan Zhang <sizhang@google.com>

Address review comments

Signed-off-by: Siyuan Zhang <sizhang@google.com>

update vendor

Signed-off-by: Siyuan Zhang <sizhang@google.com>
2025-02-05 16:10:53 -08:00
Kubernetes Prow Robot
22f25efc2c
Merge pull request #128991 from Henrywu573/cm-statuz
Add statusz endpoint for kube-controller-manager
2025-02-05 15:54:15 -08:00
Kubernetes Prow Robot
72d74869e9
Merge pull request #129114 from bart0sh/PR167-fix-DRA-registration-test
kubelet: fix DRA registration test
2025-02-05 14:38:26 -08:00
Kubernetes Prow Robot
0634e21fb5
Merge pull request #128367 from vivzbansal/sidecar-2
[FG:InPlacePodVerticalScaling] Implement resize for sidecar containers
2025-02-05 14:38:15 -08:00
Kubernetes Release Robot
e54be1e133 CHANGELOG: Update directory for v1.33.0-alpha.1 release 2025-02-05 20:11:40 +00:00
Kubernetes Prow Robot
925cf7db71
Merge pull request #129930 from serathius/deprecate-watch-from-storage
Deprecate WatchFromStorageWithoutResourceVersion
2025-02-05 10:18:23 -08:00
Kubernetes Prow Robot
1527a145b1
Merge pull request #129921 from srivastav-abhishek/fix-etcd-test
Additional timeout to receive all watchEvents
2025-02-05 10:18:17 -08:00
Henry(Qishan) Wu
8bd4e1bab2 Update test/integration/serving/serving_test.go
Co-authored-by: Antonio Ojea <antonio.ojea.garcia@gmail.com>
2025-02-05 09:48:08 -08:00
Kubernetes Prow Robot
d2ad0cc7c0
Merge pull request #129956 from chrischdi/pr-kubeadm-cp-local-mode-fixes
kubeadm: Promote ControlPlaneKubeletLocalMode feature gate to beta - second attempt
2025-02-05 07:02:16 -08:00
Kubernetes Prow Robot
8b1307894d
Merge pull request #129962 from cpanato/update-go-123-main
[go] Bump images, dependencies and versions to go 1.23.5 and distroless iptables
2025-02-05 05:48:16 -08:00
Kubernetes Prow Robot
54eddbd50f
Merge pull request #128989 from Henrywu573/kube-proxy-statuz-new
Add statusz endpoint for kube-proxy
2025-02-05 04:34:15 -08:00
Christian Schlotter
6c093b1699
kubeadm: fix dry-run for kubelet-wait-bootstrap phase 2025-02-05 12:40:08 +01:00
Kubernetes Prow Robot
c7489b20f2
Merge pull request #129750 from googs1025/scheduler/add_integration_for_queuesortplugin
feature: add scheduler queuesort plugins integration test
2025-02-05 03:08:17 -08:00
Marek Siarkowicz
065bf2004d Deprecate WatchFromStorageWithoutResourceVersion
Around the 1.31 release, we discovered that a change introduced in 1.27 allowead
clients to open WATCH requests directly to etcd. This had detrimental consequences,
enabling abusive clients to bypass caching and overwhelm etcd.
Unlike the API server, etcd lacks protection against such behavior.

To mitigate this, we redirected all WATCH requests to be served from the cache.
The WatchFromStorageWithoutResourceVersion feature gate was retained as an escape hatch.
However, since we have no plans to allow direct WATCH requests to etcd again,
this flag is now obsolete.

Direct WATCH requests to etcd offer no advantage, as they don't provide stronger
consistency guarantees. WATCH operations are inherently inconsistent; unlike LIST
operations, they do not confirm the resource version with a quorum. While Kubernetes
uses the WithRequireLeader option on WATCH requests to prevent maintaining connections
to isolated etcd members, the API server provides the same level of guarantee through
its health checks, which fail if it cannot connect to etcd member.  Therefore,
the WatchFromStorageWithoutResourceVersion feature gate can be deprecated and removed.
2025-02-05 11:42:18 +01:00