Commit Graph

125996 Commits

Author SHA1 Message Date
Kubernetes Prow Robot
4812ea8aa5
Merge pull request #125570 from sanchezl/test-additional-types
KEP-4222: Cover aggregator and apiextension types in unstructured roundtrip test.
2024-10-12 03:18:20 +01:00
Paco Xu
e9d999d5c2 add PSA testdata 1.32 2024-10-12 09:22:55 +08:00
Kubernetes Prow Robot
762a85e25d
Merge pull request #125923 from haircommander/cpuset-fix-restart
kubelet/cm: fix bug where kubelet restarts from missing cpuset cgroup
2024-10-12 00:12:20 +01:00
Sean Sullivan
553b2041e1 LeaderMigrationConfig now uses strict validation 2024-10-11 14:59:35 -07:00
Peter Hunt
b94c5387b8 e2e_node: use restart instead of start stop
Signed-off-by: Peter Hunt <pehunt@redhat.com>
2024-10-11 16:53:33 -04:00
Luis Sanchez
aaa7364f60 cover additional types in unstructured roundtrip test
Co-authored-by: Ben Luddy <bluddy@redhat.com>
2024-10-11 16:07:39 -04:00
Kubernetes Prow Robot
c45f3ab813
Merge pull request #127976 from chengjoey/fix/named-ports
fix eps named ports does not work in sidecar(initContainer with restartPolicy=Always)
2024-10-11 19:14:21 +01:00
Kubernetes Prow Robot
2f7df335ad
Merge pull request #127615 from macsko/add_node_add_event_benchmark_to_scheduler_perf
Add scheduler_perf test case for NodeAdd event handling
2024-10-11 18:10:19 +01:00
Francesco Romani
cc87438f2f e2e_node: add a test to verify the kubelet starts
with systemd cgroup driver and cpumanager none policy.

This was originally planned to be a correctness check for
https://issues.k8s.io/125923, but it was difficult to reproduce the bug,
so it's now a regression test against it.

Signed-off-by: Francesco Romani <fromani@redhat.com>
Signed-off-by: Peter Hunt <pehunt@redhat.com>
2024-10-11 11:29:16 -04:00
Peter Hunt
77d03e42cd kubelet/cm: move CPU reading from cm to cm/cpumanager
Authored-by: Francesco Romani <fromani@redhat.com>
Signed-off-by: Peter Hunt <pehunt@redhat.com>
2024-10-11 11:29:16 -04:00
Peter Hunt
c51195dbd0 kubelet/cm: fix bug where kubelet restarts from missing cpuset cgroup
on None cpumanager policy, cgroupv2, and systemd cgroup manager, kubelet
could get into a situation where it believes the cpuset cgroup was created
(by libcontainer in the cgroupfs) but systemd has deleted it, as it wasn't requested
to create it. This causes one unnecessary restart, as kubelet fails with

`failed to initialize top level QOS containers: root container [kubepods] doesn't exist.`

This only causes one restart because the kubelet skips recreating the cgroup
if it already exists, but it's still a bother and is fixed this way

Signed-off-by: Peter Hunt <pehunt@redhat.com>
2024-10-11 10:49:16 -04:00
Kubernetes Prow Robot
1b6c993cee
Merge pull request #127952 from macsko/allow_to_specify_feature_gates_on_workload_level_scheduler_perf
Allow to set feature gates on workload level in scheduler_perf
2024-10-11 15:28:19 +01:00
Kubernetes Prow Robot
8cbb11519c
Merge pull request #127998 from skitt/golang-x-oct-2024
October 2024 golang.org/x bump
2024-10-11 14:16:28 +01:00
Kubernetes Prow Robot
a0e146a4b0
Merge pull request #127988 from pohly/e2e-daemonset-health-check
e2e daemonset: stronger health check of DaemonSet status
2024-10-11 14:16:21 +01:00
Kubernetes Prow Robot
6e5e8f374e
Merge pull request #127985 from dims/update-moby-runc-dependencies-oct-10
Update moby/runc dependencies
2024-10-11 12:50:19 +01:00
joey
6bce72a794
fix eps named ports does not work in sidecar(initContainer with restartPolicy=Always)
Signed-off-by: joey <zchengjoey@gmail.com>
2024-10-11 17:53:17 +08:00
Maciej Skoczeń
e676d0e76a Allow to specify feature gates on workload level in scheduler_perf 2024-10-11 08:41:08 +00:00
Patrick Ohly
3ec84373c1 e2e daemonset: stronger health check of DaemonSet status
The error was only generated if both checks (generated pods and ready pods)
failed. This looks like a logic error, failing if either of those isn't
matching expectations seems better.
2024-10-11 10:36:36 +02:00
Maciej Skoczeń
6dbb5d84b3 Move integration tests perf utils to scheduler_perf package 2024-10-11 08:27:08 +00:00
Stephen Kitt
6c5a528727
October 2024 golang.org/x bump
Nothing major here, but nothing liable to cause pain to downstreams
either.

* https://github.com/golang/crypto/compare/v0.26.0...v0.28.0 (there’s
  a SHA3 fix there but it’s only relevant for 32-bit platforms)
* https://github.com/golang/net/compare/v0.28.0...v0.30.0 (mostly
  http2; route address parsing fix on Darwin)
* https://github.com/golang/oauth2/compare/v0.21.0...v0.23.0 (Google
  license fix)
* https://github.com/golang/sys/compare/v0.23.0...v0.26.0 (faster
  getrandom() on Linux through the vDSO; improved RISC-V support)
* https://github.com/golang/term/compare/v0.23.0...v0.25.0
* https://github.com/golang/time/compare/v0.3.0...v0.7.0 (0-limit
  handling fix in x/time/rate; Google license fix)
* https://github.com/golang/tools/compare/v0.24.0...v0.26.0

This doesn’t include golang.org/x/exp; that doesn’t have any relevant
changes. There’s an apidiff fix but we always pull in the latest
apidiff anyway.

Signed-off-by: Stephen Kitt <skitt@redhat.com>
2024-10-11 10:22:13 +02:00
Davanum Srinivas
521f2d106b
Update moby/runc dependencies
Signed-off-by: Davanum Srinivas <davanum@gmail.com>
2024-10-11 03:52:09 -04:00
Maciej Skoczeń
25850caf8a Add scheduler_perf test case for NodeAdd event handling 2024-10-11 07:40:06 +00:00
Kubernetes Prow Robot
9ffc095f88
Merge pull request #127892 from utam0k/test-qhint-volume-restriction
Add integration test for VolumeRestriction in requeueing scenarios
2024-10-11 07:32:20 +01:00
Kubernetes Prow Robot
05b076b741
Merge pull request #123627 from liggitt/go-workspaces-update-vendor
simplify update-vendor.sh to use `go work sync`
2024-10-11 04:26:22 +01:00
Kubernetes Prow Robot
5c96d8d34d
Merge pull request #127485 from seans3/websocket-upgrade-error-info
Upgrade websocket failure add extra error info
2024-10-11 02:00:19 +01:00
Kubernetes Prow Robot
1c45ca8059
Merge pull request #127984 from tkashem/refactor-storage-internal-error
KEP-4795: storage.InternalError should retain the inner error
2024-10-11 00:32:26 +01:00
Kubernetes Prow Robot
3bf17e2340
Merge pull request #127959 from ffromani/fix-smtalign-error-message
node: cpumanager: fix smtalign error message and minor cleanup
2024-10-11 00:32:20 +01:00
Sean Sullivan
eec2be8168 Upgrade websocket failure add extra error info 2024-10-10 15:54:56 -07:00
Jordan Liggitt
27c41e75b6
Regenerate go.work.sum 2024-10-10 17:58:55 -04:00
Jordan Liggitt
ada932a6e2
Simplify update-vendor.sh to use go work sync 2024-10-10 17:58:54 -04:00
Kubernetes Prow Robot
c15581b277
Merge pull request #127695 from kaisoz/wait-for-job-failfast
Fail fast when waiting for job conditions in e2e tests
2024-10-10 22:28:19 +01:00
Lubomir I. Ivanov
30f9893374 kubeadm: refactor the dry-run logic
The current dryrun client implemnetation is suboptimal
and sparse. It has the following problems:

- When an object CREATE or UPDATE reaches the default dryrun client
the operation is a NO-OP, which means subsequent GET calls must
fully emulate the object that exists in the store.
- There are multiple implmentations of a DryRunGetter interface
such the one in init_dryrun.go but there are no implementations
for reset, upgrade, join.
- There is a specific DryRunGetter that is backed by a real
client in clientbacked_dryrun.go, but this is used for upgrade
and does not work in conjuction with a fake client.

This commit does the following changes:

- Removes all existing *dryrun*.go implementations.
- Add a new DryRun implementation in dryrun.go that implements
3 clients - fake clientset, real clientset, real dynamic client.
- The DryRun object uses the method chaining pattern.
- Allows the user opt-in into real clients only if needed, by passing
a real kubeconfig. By default only constructs a fake client.
- The default reactor chain for the fake client, always logs the
object action, then for GET or LIST actions attempts to use the
real dynamic client to get the object. If a real object does not
exist it attempts to get the object from the fake object store.
- The user can prepend or append reactors to the chain.
- All known needed reactors for operations during init, join,
reset, upgrade are added as methods of the DryRun struct.
- Adds detailed unit test for the DryRun struct and its methods
including reactors.

Additional changes:
- Use the new DryRun implementation in all command workflows -
init, join, reset, upgrade.
- Ensure that --dry-run works even if there is no active cluster
by returning faked objects. For join, a faked cluster-info
with a fake bootstrap token and CA are used.
2024-10-11 00:15:59 +03:00
Tomas Tormo
3b1a5bfc9c Fail fast when waiting for job conditions in e2e tests 2024-10-10 20:18:21 +00:00
Kubernetes Prow Robot
fc318e3ba4
Merge pull request #127942 from liggitt/json123
sigs.k8s.io/json go 1.23 bump
2024-10-10 18:58:20 +01:00
Kubernetes Prow Robot
95612e7b3b
Merge pull request #127878 from AxeZhan/sidecar
[scheduler] calculate pod requests resources with sidecar containers
2024-10-10 17:54:19 +01:00
AxeZhan
8b15843d00 remove unused GetNonzeroRequests function 2024-10-10 23:52:25 +08:00
Kubernetes Prow Robot
d88b4e3b6e
Merge pull request #127777 from tkashem/refactor-delete-option
KEP-4795: refactor: add delete options for Delete method in storage interface
2024-10-10 16:36:27 +01:00
Kubernetes Prow Robot
61d9bae274
Merge pull request #127348 from RyanAoh/kep-1860-ga
Promote LoadBalancerIPMode to GA
2024-10-10 16:36:19 +01:00
Abu Kashem
f24ec7e00d
refactor: storage.InternalError should retain the inner error
This is so we can do proper error handling, at the same time
we want to maintain backward compatibility
2024-10-10 11:00:36 -04:00
Jordan Liggitt
8eff759b6a
Update sigs.k8s.io/json to go1.23 2024-10-10 11:00:11 -04:00
Kubernetes Prow Robot
daf76e6ead
Merge pull request #127778 from tkashem/refactor-conditional-delete
KEP-4795: refactor: etcd store conditional delete
2024-10-10 14:46:22 +01:00
Abu Kashem
fecab0713b
refactor etcd store conditional delete 2024-10-10 08:42:44 -04:00
Aohan Yang
da5738d9aa Set feature gate emulation version during test 2024-10-10 19:26:31 +08:00
Kubernetes Prow Robot
a6ea7b8218
Merge pull request #122923 from tkashem/timeout-documenting-test
document behavior of per handler read/write timeout with test(s)
2024-10-10 09:48:23 +01:00
Francesco Romani
838f911dea cpumanager: smtalign: fix error message
Fix error message if availablePhysicalCPUs = 0.
Without this change, the logic was mistakenly emitting
the old error message, which is confusing for troubleshooting.

Plus, a tiny quality of life improvement:
cpumanager static policy wants to use `cpuGroupSize` multiple times.
The value represents how many VCPUs per PCPUs the machine has.
So, let's cache (and log!) the value in the policy data.
We don't support dynamic update of the HW topology anyway.

Signed-off-by: Francesco Romani <fromani@redhat.com>
2024-10-10 10:18:44 +02:00
AxeZhan
b1f07bb36c add tests for scheduler 2024-10-10 15:53:19 +08:00
AxeZhan
6fbc3a618f using NonMissingContainerRequests 2024-10-10 15:53:07 +08:00
Kubernetes Prow Robot
1f9d2577da
Merge pull request #127869 from ak20102763/explain_o_shorthand
Added shorthand for kubectl explain --output
2024-10-10 08:08:25 +01:00
Kubernetes Prow Robot
fe218437e0
Merge pull request #127974 from jpbetz/mvp-test-cleanup
peerproxy flake: Use t.Cleanup instead of defer to shut down servers
2024-10-10 03:54:22 +01:00
Kubernetes Prow Robot
582dcc2aca
Merge pull request #127221 from toVersus/test/restartable-init-termination
[Sidecar Containers] Expand test coverage for Node E2E tests on pod termination behavior
2024-10-10 02:48:23 +01:00