Christian Schlotter
20fbdeac96
kubeadm: fix upgrade to be able to rollback ControlPlaneLocalMode
2025-02-05 11:33:55 +01:00
Christian Schlotter
bb36212342
kubeadm: Promote ControlPlaneKubeletLocalMode feature gate to beta
2025-02-05 11:33:34 +01:00
Kubernetes Prow Robot
569d1896e6
Merge pull request #129620 from neolit123/1.33-update-all-cp-components-check
...
kubeadm: graduate WaitForAllControlPlaneComponents to Beta
2025-02-05 01:54:17 -08:00
Marek Siarkowicz
b1ad53c533
Disable StorageNamespaceIndex feature gate when BtreeWatchCache is enabled and deprecate it
...
Previously, the cache used a map keyed by the full object key,
requiring iteration and filtering by namespace for namespace-scoped requests.
This index allowed for faster responses by avoiding this iteration.
With the introduction of the BtreeWatchCache, this optimization is no longer necessary.
The B-tree structure allows efficient prefix-based searches,
including fetching objects by namespace.
Furthermore, the B-tree returns elements ordered by key, eliminating the need for separate sorting.
Performance improvements with the BtreeWatchCache have been validated through benchmarks matching K8s scalability dimentions (see table below).
These results demonstrate that the B-tree approach provides comparable or better performance than the map with index.
Therefore, the StorageNamespaceIndex feature flag can be safely flipped to false and subsequently deprecated.
| Benchmark | Btree with Index (current) | Btree without Index | Map with Index | Map without Index (sanity check) |
| --------------------------------------------------------------------------------- | -------------------------- | ---------------------- | ---------------------- | -------------------------------- |
| StoreList (10k Namespaces, 150k Pods, 5k Nodes, RV=, Namespace Scope) | 20.77µs ± 10% | 20.14µs ± 13% (~0%) | 19.73µs ± 6% (~0%) | 1067.34µs ± 10% (+5037.73%) |
| StoreList (10k Namespaces, 150k Pods, 5k Nodes, RV=NotOlderThan, Namespace Scope) | 3.943µs ± 6% | 3.928µs ± 6% (~0%) | 3.665µs ± 3% (-7.05%) | 944.641µs ± 1% (+23857.41%) |
| StoreList (50 Namespaces, 150k Pods, 5k Nodes, RV=, Namespace Scope) | 303.3µs ± 2% | 258.2µs ± 2% (-14.85%) | 340.1µs ± 3% (+12.15%) | 1668.6µs ± 4% (+450.23%) |
| StoreList (50 Namespaces, 150k Pods, 5k Nodes, RV=NotOlderThan, Namespace Scope) | 286.2µs ± 3% | 234.7µs ± 1% (-17.99%) | 326.9µs ± 2% (+14.22%) | 1347.7µs ± 4% (+370.91%) |
| StoreList (100 Namespaces, 110k Pods, 1k Nodes, RV=, Namespace Scope) | 125.3µs ± 2% | 112.3µs ± 5% (-10.38%) | 137.5µs ± 2% (+9.81%) | 1395.1µs ± 8% (+1013.78%) |
| StoreList (100 Namespaces, 110k Pods, 1k Nodes, RV=NotOlderThan, Namespace Scope) | 120.6µs ± 2% | 113.2µs ± 1% (-6.13%) | 133.8µs ± 1% (+10.92%) | 1719.1µs ± 5% (+1325.35%) |
| Geometric Mean | 68.94µs | 62.73µs (-9.02%) | 72.72µs (+5.48%) | 1.326ms (+1823.40%) |
2025-02-05 10:49:22 +01:00
Kubernetes Prow Robot
481cc1a392
Merge pull request #129560 from bart0sh/PR168-DRA-fix-All-allocation-mode
...
DRA: fix allocation mode `All`
2025-02-05 00:38:16 -08:00
Henry Wu
c5f66bfe70
Add statusz endpoint for kube-proxy
2025-02-04 22:20:31 -08:00
Kubernetes Prow Robot
c4434c3161
Merge pull request #129910 from bitoku/fix-129836
...
Fix flaky test for container life cycle
2025-02-04 16:23:09 -08:00
Kubernetes Prow Robot
fab0d76574
Merge pull request #129731 from gjkim42/promote-sidecar-containers-to-ga
...
Promote SidecarContainers feature to GA
2025-02-04 16:22:58 -08:00
Kubernetes Prow Robot
f82439f536
Merge pull request #129486 from iholder101/bugfix/swap-container-cri-stats
...
[KEP-2400] [Bugfix]: Ensure container-level swap metrics are collected
2025-02-04 08:14:59 -08:00
Patrick Ohly
1a8d8c9b4a
client-go watch: NewIndexerInformerWatcherWithContext -> WithLogger
...
The ability to automatically stop on context cancellation was new functionality
that adds complexity and wasn't really used in Kubernetes. If someone wants
this, they can add it outside of the function.
A *WithLogger variant avoids the complexity and is consistent with
NewStreamWatcherWithLogger over in apimachinery.
2025-02-04 16:32:55 +01:00
Kubernetes Prow Robot
7f9fdd65eb
Merge pull request #129968 from sanposhiho/patch-15
...
fix: remove the mention that DRA uses Pending
2025-02-04 01:36:59 -08:00
Kubernetes Prow Robot
a376ae5dad
Merge pull request #128845 from SergeyKanzhelev/staticPodUpgrade
...
static pod upgrade test with hostNetwork
2025-02-03 23:30:58 -08:00
Kubernetes Prow Robot
28ba942659
Merge pull request #129844 from cici37/bumCEL
...
Bump cel-go to v0.23.2
2025-02-03 17:26:58 -08:00
Cici Huang
e1ab6073ab
Add more tests for optional.
2025-02-04 00:14:43 +00:00
Kensei Nakada
3701e39327
fix: remove the mention that DRA uses Pending
2025-02-04 06:45:05 +09:00
Cici Huang
8a3d0d68a2
Update the env option.
2025-02-03 18:07:23 +00:00
Cici Huang
7b1c7c639e
Fixed the estimated cost for opt map.
2025-02-03 18:06:51 +00:00
Cici Huang
c1e0443232
Bump cel-go to v0.23.2.
2025-02-03 18:06:51 +00:00
cpanato
0ca45bd4f8
Bump images, dependencies and versions to go 1.23.5 and distroless iptables
...
Signed-off-by: cpanato <ctadeu@gmail.com>
2025-02-03 18:26:38 +01:00
Kubernetes Prow Robot
82e3a671e7
Merge pull request #129920 from googs1025/feature/integration_scoring
...
feature: Added score integration tests for missing part plugins: TaintToleration plugin
2025-02-03 08:46:57 -08:00
Kubernetes Prow Robot
1b7a059187
Merge pull request #128999 from macsko/improve_goroutines_metric_writes_in_parallelizer_until
...
Improve Goroutines metric calls in parallelizer.Until
2025-02-03 07:30:57 -08:00
Kubernetes Prow Robot
183ef23273
Merge pull request #129937 from pohly/dra-scheduler-perf-limits
...
scheduler_perf: add thresholds to DRA test cases
2025-02-03 05:24:57 -08:00
googs1025
5281152f07
feature: Added score integration tests for missing part plugins: TaintToleration plugin
2025-02-03 21:20:00 +08:00
Patrick Ohly
e2ff03486d
scheduler_perf: add thresholds to DRA test cases
...
They were enabled yesterday and executed seven times, with results that (so
far) seem to be fairly stable with just one run that was slower across the
board.
The links in the YAML can be used to navigate to each test case quickly. The
thresholds were chose with a 20% security margin below what seems to be a
common result.
2025-02-03 13:10:10 +01:00
Kubernetes Prow Robot
fc268ecd09
Merge pull request #129823 from googs1025/chore/log_improve
...
fix(dra plugin): when there is no resourceclaim, return directly
2025-02-02 16:28:56 -08:00
Kubernetes Prow Robot
6e3546228d
Merge pull request #129895 from tallclair/refactor-allocation
...
Delete unused code: allocated state ClearState
2025-02-02 09:40:55 -08:00
Gunju Kim
8d27bf2108
Leave TODOs in pkg/kubelet/kuberuntime for later removal
...
This leaves TODOs in pkg/kubelet/kuberuntime to remove these redundant
code paths later, since they are supposed to be a subset of the new code
paths.
2025-02-02 17:45:50 +09:00
Gunju Kim
0bee0bcaa7
Promote SidecarContainers feature to GA
2025-02-02 17:45:36 +09:00
Kubernetes Prow Robot
b4f902f037
Merge pull request #129897 from vinayakankugoyal/testfix
...
Fix kubelet_authz_test.go
2025-01-31 08:52:56 -08:00
Vinayak Goyal
81f09811ca
Fix kubelet_authz_test.go
2025-01-31 15:38:18 +00:00
Marek Siarkowicz
e0f548183c
Graduate BtreeWatchCache feature gate to GA
2025-01-31 15:33:24 +01:00
Marek Siarkowicz
4a5bbc4c15
Flip SeparateCacheWatchRPC feature gate to false and deprecate it.
...
Watch requests to etcd are mapped to a single stream that has a limited throughput.
By opening a lot of concurrent watch requests to single resource, users
could starve other watches from getting any events.
Separating the RPC was meant to protect the watch opened by cache.
However, as we are no longer planning to allow users to open watch directly to etcd,
the flag is not needed.
2025-01-31 14:08:15 +01:00
Kubernetes Prow Robot
295ed0a607
Merge pull request #129901 from bart0sh/PR165-migrate-sysctl-to-contextual-logging
...
kubelet: Migrate pkg/kubelet/sysctl to contextual logging
2025-01-31 01:52:57 -08:00
Kubernetes Prow Robot
0a62cb2209
Merge pull request #129914 from soltysh/expand_portfowrad_errors
...
e2e: expand error conditions when test-ing port-forward
2025-01-30 21:33:23 -08:00
Abhishek Kr Srivastav
f6b527cb54
Additional timeout to receive all watchEvents
2025-01-31 10:27:15 +05:30
Kubernetes Prow Robot
4bf6cdb60c
Merge pull request #129554 from zylxjtu/bootid
...
Add the check of node rebooting in windows reboot_node e2e test
2025-01-30 17:33:23 -08:00
Kubernetes Prow Robot
0e9ca10eeb
Merge pull request #129504 from felipeagger/cleanup/ckpt-pod-alloc-state-mem
...
Cleanup Kubelet Checkpointing: Refactor state mem
2025-01-30 13:05:33 -08:00
Kubernetes Prow Robot
4e54a67d57
Merge pull request #128679 from ffromani/unshare-containermap-among-managers-cleanup
...
node: cm: use maps.Clone instead of reinvent it
2025-01-30 13:05:25 -08:00
Kubernetes Prow Robot
cec0492ddf
Merge pull request #129913 from mimowo/fix-remaining-integration-test
...
Fix the remaining flaky integration tests in Job controller
2025-01-30 11:07:23 -08:00
Maciej Szulik
4619a89eba
e2e: expand error conditions when test-ing port-forward
...
Signed-off-by: Maciej Szulik <soltysh@gmail.com>
2025-01-30 19:07:28 +01:00
Michal Wozniak
0d5d714c2a
Fix the remaining flaky integration tests in Job controller
2025-01-30 18:20:34 +01:00
Kubernetes Prow Robot
59f3aa1e34
Merge pull request #129894 from mimowo/fix-flaky-job-controller-test
...
Fix the flaky Job integration tests
2025-01-30 08:49:30 -08:00
Kubernetes Prow Robot
79addb569b
Merge pull request #128925 from richabanker/zpages-cleanup
...
Move zpages common code to an httputil package
2025-01-30 08:49:23 -08:00
Ayato Tokubi
da5a76bd39
Fix flaky test for container life cycle
...
Signed-off-by: Ayato Tokubi <atokubi@redhat.com>
2025-01-30 16:23:51 +00:00
Filipe Xavier
f7df3ec07f
move log restored state to better location.
2025-01-30 12:06:12 -04:00
Kubernetes Prow Robot
6ef2458aaa
Merge pull request #129859 from neolit123/1.33-use-etcdapi-timeout
...
kubeadm: use the v1beta4 EtcdAPICall timeout for etcd calls
2025-01-30 07:33:30 -08:00
Kubernetes Prow Robot
586f0fad5c
Merge pull request #125046 from tklauser/min-max-builtins
...
Use Go 1.21 min/max builtins
2025-01-30 07:33:23 -08:00
Ed Bartosh
829fa63b5b
DRA: fix allocation mode All
...
`All` allocation mode should mean 'at least one' for DRA.
Allocation should fail if `All` devices requested and none found.
2025-01-30 16:34:25 +02:00
Francesco Romani
e766b04ade
node: cm: use maps.Clone instead of reinvent it
...
As pointed out in https://github.com/kubernetes/kubernetes/pull/128657#discussion_r1832973928
Signed-off-by: Francesco Romani <fromani@redhat.com>
2025-01-30 14:45:04 +01:00
Kubernetes Prow Robot
209538059e
Merge pull request #129885 from macsko/default_topology_spreading_scheduler_perf_test_case
...
Add scheduler_perf test case for default PodTopologySpreading constraints
2025-01-30 05:05:32 -08:00