2429 Commits

Author SHA1 Message Date
Paco Xu
e71353c942 fail if the runtime/kubelet process is not fetched 2022-01-25 17:43:51 +08:00
Imran Pochi
6071f6e8ab Addressing review comments
This commit is to be squashed and merged with the first commit.

Signed-off-by: Imran Pochi <imran@kinvolk.io>
2022-01-20 11:45:22 +05:30
Sergey Kanzhelev
7e7bc6d53b remove DynamicKubeletConfig logic from kubelet 2022-01-19 22:38:04 +00:00
Kubernetes Prow Robot
32f83b2b60 Merge pull request #107595 from wzshiming/test/skip_not_found_cmd
Skip tests when no signal emitting command
2022-01-18 16:14:05 -08:00
Kubernetes Prow Robot
feb758027c Merge pull request #106907 from cyclinder/remove_dockershim_flags
Clean up dockershim flags in the kubelet
2022-01-18 09:09:09 -08:00
Shiming Zhang
00790153b2 Skip tests when no signal emitting command 2022-01-17 16:03:28 +08:00
Kubernetes Prow Robot
8815a3119c Merge pull request #107517 from mason1kwok/removes-test-skiper
Removes code that was not completely cleared before
2022-01-16 09:03:28 -08:00
cyclinder
07999dac70 Clean up dockershim flags in the kubelet
Signed-off-by: cyclinder <qifeng.guo@daocloud.io>
Co-authored-by: Ciprian Hacman <ciprian@hakman.dev>
Signed-off-by: Ciprian Hacman <ciprian@hakman.dev>
2022-01-14 16:02:50 +02:00
Kubernetes Prow Robot
3bd422dc76 Merge pull request #107293 from dims/jan-1-owners-cleanup
Cleanup OWNERS files - Jan 2021 Week 1
2022-01-13 10:30:30 -08:00
mason1kwok
5307ae7159 cleanup code from critical_pod_test 2022-01-13 14:44:05 +08:00
Kubernetes Prow Robot
8c69e5d25b Merge pull request #107361 from wzshiming/test/skip
Skip tests when no signal emitting command
2022-01-12 14:21:38 -08:00
fengzixu
f202164c87 fix e2e test 2022-01-11 13:50:17 +00:00
fengzixu
ed7fd0ced5 add volumeHealth label to metrics 2022-01-11 13:50:17 +00:00
fengzixu
bab1755274 fix: correct metrics expression 2022-01-11 13:50:17 +00:00
Davanum Srinivas
9682b7248f OWNERS cleanup - Jan 2021 Week 1
Signed-off-by: Davanum Srinivas <davanum@gmail.com>
2022-01-10 08:14:29 -05:00
Sascha Grunert
fcca81aeae Fix AppArmor unloaded profile e2e test
With the removal of the kubelet AppArmor profile validation in
https://github.com/kubernetes/kubernetes/pull/97966 we passed the
responsibility of the desired behavior to the container runtime.
Therefore we have to change the e2e test which silently broke after the
PR merge.

Signed-off-by: Sascha Grunert <sgrunert@redhat.com>
2022-01-10 13:22:11 +01:00
Kubernetes Prow Robot
d2c9456963 Merge pull request #106287 from Dragoncell/update_cadvisor_version
update cadvisor version in test
2022-01-08 15:50:19 -08:00
Jiaming Xu
d6d63b44fa update cadvisor version in test
update NodePrePullImageList

fix conflicts

fix conflicts

remove script

add cadvisor as dependency

address comments
2022-01-07 22:29:12 +00:00
Shiming Zhang
3516e76f3d Skip tests when no signal emitting command 2022-01-06 11:38:36 +08:00
Kubernetes Prow Robot
2b0ddeb23b Merge pull request #106621 from shuheiktgw/check_private_ssh_key_existence
Check the private SSH key existence in the node e2e tests
2022-01-05 18:06:35 -08:00
Imran Pochi
61c4732c71 Revert "Merge pull request #104308 from ehashman/revert-103608-imran/e2e-lock-contention"
This reverts commit 9d09c9d246

This E2E test was reverted becuase the test was failing continously.
More on the issue here #104307

This commit re-reverts and brings back the LockContention test, with
the addition of [Serial] tag to the test.
2022-01-04 07:34:07 +05:30
Ciprian Hacman
a0abe5aa33 Clean up dockershim in tests
Signed-off-by: Ciprian Hacman <ciprian@hakman.dev>
2021-12-22 13:05:34 +02:00
Paco Xu
f0e7025371 skip reduceAllocatableMemoryUsage if cgroup v2 is enabled 2021-12-16 14:46:50 +08:00
Davanum Srinivas
497e9c1971 Cleanup OWNERS files (No Activity in the last year)
Signed-off-by: Davanum Srinivas <davanum@gmail.com>
2021-12-15 10:34:02 -05:00
Davanum Srinivas
9405e9b55e Check in OWNERS modified by update-yamlfmt.sh
Signed-off-by: Davanum Srinivas <davanum@gmail.com>
2021-12-09 21:31:26 -05:00
David Porter
95264a418d kubelet: set failed phase during graceful shutdown
Revert to previous behavior in 1.21/1.20 of setting pod phase to failed
during graceful node shutdown.

Setting pods to failed phase will ensure that external controllers that
manage pods like deployments will create new pods to replace those that
are shutdown. Many customers have taken a dependency on this behavior
and it was breaking change in 1.22, so this change reverts back to the
previous behavior.

Signed-off-by: David Porter <david@porter.me>
2021-12-09 13:17:40 -08:00
Kubernetes Prow Robot
4aba017280 Merge pull request #106307 from cynepco3hahue/e2e_node_fix_memory_manager_multi_NUMA
e2e_node: provide an option to specify hugepages on the specific NUMA node
2021-12-07 17:26:25 -08:00
shuheiktgw
79484ed7b6 Check key existence in the getPrivateSSHKey function 2021-11-27 15:00:24 +09:00
Kubernetes Prow Robot
c3e6b66643 Merge pull request #106533 from haircommander/summary-page-fault-test
test: update major page fault values for summary test
2021-11-23 15:09:45 -08:00
shuheiktgw
db4a743b77 Check the private SSH key existence in the node e2e tests 2021-11-23 17:20:42 +09:00
Kubernetes Prow Robot
e31aafc4fd Merge pull request #106348 from endocrimes/dani/rm-gpu
e2e_node: unify device tests
2021-11-22 19:46:16 -08:00
Jonathan Lebon
3ebd93cd02 test-e2e-node: support pure SSH mode
Right now, `run_remote.go` only supports GCE instances. But actually
running the tests is completely independent of GCE and could work just
as well on any SSH-accessible machine.

This patch adds a new `--mode` switch, which defaults to `gce` for
backwards compatibility, but can be set to `ssh`. In that mode, the GCE
API is not used at all, and we simply connect to the hosts given via
`--hosts`.

This is still better than `run_local.go` because the latter mixes build
environment with test environment, which doesn't fit well with
container-optimized operating systems.

This is part of an effort to setup the e2e node tests on Fedora CoreOS
(see https://github.com/coreos/fedora-coreos-tracker/issues/990).

Patch best viewed with whitespace ignored.
2021-11-22 10:13:15 -05:00
Jonathan Lebon
591f4cdb77 run_remote.go: factor out prepareGceImages()
Mostly a pure code move. Only changed the `klog.Fatalf` to `fmt.Errorf`.
Prep for future patch.
2021-11-22 10:12:29 -05:00
Jonathan Lebon
032dbd2063 run_remote.go: move registerGceHostIP() call to testImage()
I.e. don't assume that `testHost` is called on a GCE host. Prep for
future patch.
2021-11-22 10:12:28 -05:00
Jonathan Lebon
36233b985b run_remote.go: factor out registerGceHostIP()
Prep for future patch.
2021-11-22 10:12:28 -05:00
Kubernetes Prow Robot
21d3acc787 Merge pull request #106544 from ehashman/fix-flake-restart
Deflake "Kubelet should correctly account for terminated pods after restart"
2021-11-20 00:04:59 -08:00
Elana Hashman
6ddf86d422 Set startTimeout back to 3m, restore wait loop at end of test 2021-11-19 11:30:43 -08:00
Elana Hashman
b4a8861af3 Tweak resource requests for Kubelet restart test 2021-11-18 14:57:22 -08:00
Peter Hunt
76df8acb80 test: update major page fault values for summary test
as well as use a variable instead of a constant

Signed-off-by: Peter Hunt <pehunt@redhat.com>
2021-11-18 09:24:41 -05:00
Kubernetes Prow Robot
3b9bd229b2 Merge pull request #106493 from endocrimes/dani/endocrimes-test-reviewer
node e2e: endocrimes as reviewer
2021-11-17 22:41:01 -08:00
Kubernetes Prow Robot
d766ab88f7 Merge pull request #106501 from ehashman/cri-graduation-v1
Make CRI v1 the default and allow a fallback to v1alpha2
2021-11-17 19:57:01 -08:00
Kubernetes Prow Robot
91b7fb4dc9 Merge pull request #102915 from wzshiming/feat/graceful-shutdown-based-on-pod-priority
Graceful Node Shutdown Based On Pod Priority
2021-11-17 18:45:03 -08:00
Sascha Grunert
de37b9d293 Make CRI v1 the default and allow a fallback to v1alpha2
This patch makes the CRI `v1` API the new project-wide default version.
To allow backwards compatibility, a fallback to `v1alpha2` has been added
as well. This fallback can either used by automatically determined by
the kubelet.

Signed-off-by: Sascha Grunert <sgrunert@redhat.com>
2021-11-17 11:05:05 -08:00
Sergey Kanzhelev
b7affcced1 implement :grpc probe action 2021-11-17 17:31:23 +00:00
Danielle Lancashire
e60ad8ebc6 e2e_node: add endocrimes as reviewer 2021-11-17 15:35:35 +01:00
Shiming Zhang
7c656b55e4 Update shutdown cases 2021-11-17 11:47:12 +08:00
Shiming Zhang
df7e4c1a3d Add e2e for GracefulNodeShutdownBasedOnPodPriority 2021-11-17 11:47:12 +08:00
Elana Hashman
303b05cded Fix timeout flake in restart kubelet e2e 2021-11-15 13:42:58 -08:00
Kubernetes Prow Robot
159fcbb01e Merge pull request #106408 from cynepco3hahue/e2e_node_quota_isci_test_fix_panic_nil_pointer_exception
e2e_node: fix nil pointer exception under quota lsci test
2021-11-15 11:27:02 -08:00
Antonio Ojea
5eb584d1cb Node tests fixes (#106371)
* capture loop variable

* capture the loop variable and don't fail on not found errors

* capture loop variable

* Revert "Mark restart_test as flaky"

This reverts commit 990e9506de.

* skip e2e node restart test with dockershim

* Update test/e2e_node/restart_test.go

Co-authored-by: Mike Miranda <mikemp96@gmail.com>

* capture loop using index

Co-authored-by: Mike Miranda <mikemp96@gmail.com>
2021-11-14 19:54:47 -08:00