kata-containers

mirror of https://github.com/kata-containers/kata-containers.git synced 2026-05-13 18:43:36 +00:00

Author	SHA1	Message	Date
Dan Mihai	a0bd9e02ca	tests: policy-job: detect create container errors early During the ${wait_time} for an expected condition, if CreateContainerRequest was NOT expected to fail: detect possible CreateContainerRequest failures early and abort the wait. For example, before this change: not ok 1 Successful job with auto-generated policy in 107111ms ok 2 Policy failure: unexpected environment variable in 7920ms ok 3 Policy failure: unexpected command line argument in 7874ms ok 4 Policy failure: unexpected emptyDir volume in 7823ms ok 5 Policy failure: unexpected projected volume in 7812ms ok 6 Policy failure: unexpected readOnlyRootFilesystem in 7903ms ok 7 Policy failure: unexpected UID = 222 in 7720ms After this change: not ok 1 Successful job with auto-generated policy in 10271ms ok 2 Policy failure: unexpected environment variable in 8018ms ok 3 Policy failure: unexpected command line argument in 7886ms ok 4 Policy failure: unexpected emptyDir volume in 7621ms ok 5 Policy failure: unexpected projected volume in 7843ms ok 6 Policy failure: unexpected readOnlyRootFilesystem in 7632ms ok 7 Policy failure: unexpected UID = 222 in 7619ms Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2025-11-03 15:55:55 +00:00
Dan Mihai	992c91371c	tests: policy-deployment-sc: detect create container errors early During the ${wait_time} for an expected condition, if CreateContainerRequest was NOT expected to fail: detect possible CreateContainerRequest failures early and abort the wait. For example, before this change: ok 1 Successful sc deployment with auto-generated policy and container image volumes in 14769ms ok 2 Successful sc with fsGroup/supplementalGroup deployment with auto-generated policy and container image volumes in 8384ms not ok 3 Successful sc deployment with security context choosing another valid user in 136149ms ok 4 Successful layered sc deployment with auto-generated policy and container image volumes in 8862ms ok 5 Policy failure: unexpected GID = 0 for layered securityContext deployment in 7941ms ok 6 Policy failure: malicious root group added via supplementalGroups deployment in 11612ms After: ok 1 Successful sc deployment with auto-generated policy and container image volumes in 15230ms ok 2 Successful sc with fsGroup/supplementalGroup deployment with auto-generated policy and container image volumes in 9364ms not ok 3 Successful sc deployment with security context choosing another valid user in 11060ms ok 4 Successful layered sc deployment with auto-generated policy and container image volumes in 9124ms ok 5 Policy failure: unexpected GID = 0 for layered securityContext deployment in 7919ms ok 6 Policy failure: malicious root group added via supplementalGroups deployment in 11666ms Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2025-11-03 15:55:55 +00:00
Dan Mihai	704ee76f1e	tests: policy-deployment-sc: reduced redundancy Call common function instead of copy/paste of three commands. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2025-11-03 15:55:55 +00:00
Dan Mihai	2cafb10a6a	tests: policy-pod: detect create container errors early During the ${wait_time} for an expected condition, if CreateContainerRequest was NOT expected to fail: detect possible CreateContainerRequest failures early and abort the wait. For example, before this change: not ok 1 Successful pod with auto-generated policy in 110801ms not ok 2 Able to read env variables sourced from configmap using envFrom in 94104ms not ok 3 Successful pod with auto-generated policy and runtimeClassName filter in 95838ms not ok 4 Successful pod with auto-generated policy and custom layers cache path in 110712ms ok 5 Policy failure: unexpected container image in 8113ms ok 6 Policy failure: unexpected privileged security context in 7943ms ok 7 Policy failure: unexpected terminationMessagePath in 11530ms ok 8 Policy failure: unexpected hostPath volume mount in 7970ms ok 9 Policy failure: unexpected config map in 7933ms not ok 10 Policy failure: unexpected lifecycle.postStart.exec.command in 112677ms ok 11 RuntimeClassName filter: no policy in 2302ms not ok 12 ExecProcessRequest tests in 93946ms not ok 13 Successful pod: runAsUser having the same value as the UID from the container image in 94003ms ok 14 Policy failure: unexpected UID = 0 in 8016ms ok 15 Policy failure: unexpected UID = 1234 in 7850ms After: not ok 1 Successful pod with auto-generated policy in 12182ms not ok 2 Able to read env variables sourced from configmap using envFrom in 10121ms not ok 3 Successful pod with auto-generated policy and runtimeClassName filter in 11738ms not ok 4 Successful pod with auto-generated policy and custom layers cache path in 26592ms ok 5 Policy failure: unexpected container image in 7742ms ok 6 Policy failure: unexpected privileged security context in 7949ms ok 7 Policy failure: unexpected terminationMessagePath in 7789ms ok 8 Policy failure: unexpected hostPath volume mount in 7887ms ok 9 Policy failure: unexpected config map in 7818ms not ok 10 Policy failure: unexpected lifecycle.postStart.exec.command in 9120ms ok 11 RuntimeClassName filter: no policy in 2081ms not ok 12 ExecProcessRequest tests in 9883ms not ok 13 Successful pod: runAsUser having the same value as the UID from the container image in 9870ms ok 14 Policy failure: unexpected UID = 0 in 11161ms ok 15 Policy failure: unexpected UID = 1234 in 7814ms Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2025-11-03 15:55:55 +00:00
Fabiano Fidêncio	c539a9e90e	tests: k8s: parallel: Increase timeout We've seen a few cases where we fail the test due to timeout and when we print the pods we just see that they've been created. With that in mind, let's just increase the timeout a little bit. Example: ``` not ok 1 Parallel jobs in 6250ms (in test file k8s-parallel.bats, line 41) `kubectl wait --for=condition=Ready --timeout=$timeout pod -l jobgroup=${job_name}' failed No resources found in kata-containers-k8s-tests namespace. [bats-exec-test:71] INFO: k8s configured to use runtimeclass job.batch/process-item-test1 created job.batch/process-item-test2 created job.batch/process-item-test3 created NAME STATUS COMPLETIONS DURATION AGE process-item-test1 Running 0/1 0s process-item-test2 Running 0/1 0s process-item-test3 Running 0/1 0s error: no matching resources found No resources found in kata-containers-k8s-tests namespace. No resources found in kata-containers-k8s-tests namespace. DEBUG: system logs of node 'aks-nodepool1-25989463-vmss000000' since test start time (2025-11-01 16:39:03) -- No entries -- job.batch "process-item-test1" deleted job.batch "process-item-test2" deleted job.batch "process-item-test3" deleted ``` Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-11-01 18:09:37 +01:00
Fabiano Fidêncio	8a5ebd5d16	tests: k8s: run QoS tests on a bigger instance It's been failing to start quite regularly on the smaller instance. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-11-01 17:54:58 +01:00
Fabiano Fidêncio	c75a46d17f	tests: Do not enable NFD on s390x As we're failing on the uninstall, which seems related to a bug on NFD itself, but I don't have access to a s390x machine to debug, let's skip the enablement for now and enable it back once we've experimented it better on s390x. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-10-31 16:30:13 +01:00
Fabiano Fidêncio	67e38e0f92	tests: Do not enable NFD on cbl-mariner As we're failing to install NFD on CBL Mariner, let's skip the enablement there, and enable it once we've experimented it better there. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-10-31 16:30:13 +01:00
Fabiano Fidêncio	1bc873397b	tests: Use NFD as part of the tests As we have the ability to deploy NFD as a sub-chart of our chart, let's make sure we test it during our CI. We had to increase the timeout values, where we had timeouts set, to deploy / undeploy kata, as now NFD is also deployed / undeployed. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-10-31 16:30:13 +01:00
Fabiano Fidêncio	e30e2b5f45	tests: k8s: Remove tests running on GitHub provided runner We have 2 tests running on GitHub provided runners: * devmapper * CRI-O - devmapper situation For devmapper, we're currently testing devmapper with s390x as part of one of its jobs. More than that, this test has been failing here due to a lack of space in the machine for quite some time, and no-action was taken to bring it back either via GARM or some other way. With that said, let's rely on the s390x CI to test devmapper and avoid one extra failure on our CI by removing this one. - cri-o situation CRI-O is being tested with a fixed version of kubernetes that's already reached its EOL, and a CRI-O version that matches that k8s version. There has been attempts to raise issues, and also to provide a PR that does at least part of the work ... leaving the debugging part for the maintainers of the CI. However, there was no action on those from the maintainers. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-10-30 11:46:59 +01:00
Manuel Huber	8dc78057d6	ci: Refactor NVIDIA NIM test Change NIM bats file logic to allow skipping test cases which require multiple GPUs. This can be helpful for test clusters where there is only one node with a single GPU, or for local test environments with a single-node cluster with a single GPU. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2025-10-28 19:12:16 +01:00
Manuel Huber	be32b77baf	ci: Add NVIDIA CUDA vectoradd test This change adds a CUDA vectoradd test case and makes enabling NVRC tracing optional and idempotent. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2025-10-28 19:12:16 +01:00
Alex Lyn	25ab615da5	Merge pull request #11913 from Apokleos/dedicated-error-rs CI: Add dedicated expected error message for runtime-rs	2025-10-27 10:47:07 +08:00
Dan Mihai	61ee4d7f8b	Merge pull request #11951 from burgerdev/watchable genpolicy: allow non-watchable ConfigMaps	2025-10-24 08:38:55 -07:00
Dan Mihai	ac3ea973ee	Merge pull request #11958 from microsoft/danmihai1/policy-tests-upstream5 tests: k8s: auto-generate policy for additional tests	2025-10-24 07:18:00 -07:00
Alex Lyn	e539432a91	CI: Add dedicated expected error message for runtime-rs Runtime-rs has its dedicated error message, we need handle it separately. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2025-10-24 20:08:59 +08:00
Markus Rudy	acc7974602	genpolicy: allow non-watchable ConfigMaps If a ConfigMap has more than 8 files it will not be mounted watchable [1]. However, genpolicy assumes that ConfigMaps are always mounted at a watchable path, so containers with large ConfigMap mounts fail verification. This commit allows mounting ConfigMaps from watchable and non-watchable directories. ConfigMap mounts can't be meaningfully verified anyway, so the exact location of the data does not matter, except that we stay in the sandbox data dirs. [1]: `0ce3f5fc6f/docs/design/inotify.md (L11-L21)` Fixes: #11777 Signed-off-by: Markus Rudy <mr@edgeless.systems>	2025-10-23 15:45:17 +02:00
Fabiano Fidêncio	94adc58342	tests: Ensure helm secret for kata-deploy installation is cleaned up Every now and then, in case a failure happens, helm leaves the secret behind without cleaning it up, leading to issues in the consecutive runs. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-10-23 11:15:13 +02:00
Fabiano Fidêncio	12a515826d	tools: Install Golang from a reliable mirror (follow-up) Aurélien has moved to a reliable mirror for our tests, but we missed that our tools Dockerfiles could benefit from the same change, which is added now. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-10-23 11:15:13 +02:00
Hyounggyu Choi	2c805900a4	Merge pull request #11891 from stevenhorsman/signature-tests-with-initdata tests/k8s: Add initdata variants of signature verification and registry authentication tests	2025-10-22 20:27:26 +02:00
Dan Mihai	d7176ffcc8	tests: k8s-sandbox-vcpus-allocation generated policy Auto-generate policy for k8s-sandbox-vcpus-allocation.bats. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2025-10-21 21:36:49 +00:00
Dan Mihai	25299bc2a9	tests: k8s-block-volume.bats generated policy Auto-generate policy for k8s-block-volume.bats. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2025-10-21 21:36:40 +00:00
Dan Mihai	02a8ec0f63	tests: k8s-measured-rootfs auto generated policy Generate Agent Policy for the pod from k8s-measured-rootfs.bats. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2025-10-21 21:36:27 +00:00
Zvonko Kaiser	1ff8b066c6	Merge pull request #11941 from fidencio/topic/kata-deploy-add-missing-helm-docs helm: Add missing documentation	2025-10-21 16:04:55 -04:00
Dan Mihai	ebaecbd3d6	Merge pull request #11949 from microsoft/danmihai1/optional-secret-volume genpolicy: allow optional secret volumes	2025-10-21 12:27:13 -07:00
Dan Mihai	f11853ab33	tests: k8s-optional-empty-secret.bats policy Auto-generate policy in k8s-optional-empty-secret.bats, now that genpolicy suppprts optional secret-based volumes. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2025-10-21 15:27:31 +00:00
Fabiano Fidêncio	552378cf1e	helm: Add missing documentation We've recently added support for: * deploying and setting up a snapshotter, via _experimentalSetupSnapshotter * enabling experimental_force_guest_pull, via _experimentalForceGuestPull However, we never updated the documentation for those, thus let's do it now. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-10-21 16:20:21 +02:00
Aurélien Bombo	22aa27ff5e	tests: Install Go from reliable mirror Downloading Go from storage.googleapis.com fails intermittently with a 403 (see error below) so we switch to go.dev as referenced at https://go.dev/dl/. /tmp/install-go-tmp.Rw5Q4thEWr ~/work/kata-containers/kata-containers /usr/bin/go [install_go.sh:85] INFO: removing go version go1.24.9 linux/amd64 [install_go.sh:94] INFO: Download go version 1.24.6 % Total % Received % Xferd Average Speed Time Time Time Current Dload Upload Total Spent Left Speed 0 0 0 0 0 0 0 0 --:--:-- --:--:-- --:--:-- 0 100 298 100 298 0 0 2610 0 --:--:-- --:--:-- --:--:-- 2614 [install_go.sh:97] INFO: Install go gzip: stdin: not in gzip format tar: Child returned status 1 tar: Error is not recoverable: exiting now [install_go.sh:99] ERROR: sudo tar -C /usr/local/ -xzf go1.24.6.linux-amd64.tar.gz https://github.com/kata-containers/kata-containers/actions/runs/18602801597/job/53045072109?pr=11947#step:5:17 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2025-10-21 08:47:41 -05:00
Aurélien Bombo	edbb4b633c	Merge pull request #11890 from microsoft/saulparedes/optional_initdata genpolicy: take path to initdata from command line if provided	2025-10-16 11:04:57 -05:00
stevenhorsman	9b086376a4	tests/k8s: Skip initdata tests on tdx The new initdata variants of the tests are failing on the tdx runner, so as discussed, skip them for now: Issue #11945 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-10-15 14:52:08 +01:00
stevenhorsman	09149407fd	tests/k8s: Delete k8s-initdata.bats Now we have wider coverage of initdata testing in k8s-guest-pull-image-signature.bats then remove the old testing. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-10-15 14:52:08 +01:00
stevenhorsman	bdc0a3cf19	tests/k8s: Add initdata variant of registry creds tests Our current set of authenticated registry tests involve setting kernel_params to config the image pull process, but as of kata-containers#11197 this approach is not the main way to set this configuration and the agent config has been removed. Instead we should set the configuration in the `cdh.toml` part of the initdata, so add new test cases for this. In future, when we have been through the deprecation process, we should remove the old tests Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-10-15 14:52:08 +01:00
stevenhorsman	7fbbd170ee	tests/k8s: Add initdata variants of oci signature tests Our current set of signature tests involve setting kernel_parameters to config the image pull process, but as of https://github.com/kata-containers/kata-containers/pull/11197 this approach is not the main way to set this configuration and the agent config has been removed. Instead we should set the configuration in the `cdh.toml` part of the initdata, so add new test cases for this. In future, when we have been through the deprecation process, we should remove the old tests Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-10-15 14:52:08 +01:00
stevenhorsman	90ad5cd884	tests/k8s: Refactor initdata annotation Create a shared get_initdata method that injects a cdh image section, so we don't duplicate the initdata structure everywhere Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-10-15 14:52:08 +01:00
Fabiano Fidêncio	d46474cfc0	tests: Run apt-get update before installing a package Otherwise it'll just break. :-) Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-10-13 23:33:46 +02:00
Saul Paredes	ba7a5953c8	tests: k8s-policy-pod.bats: test unspecified initdata path use auto_generate_policy_no_added_flags, so we don't pass --initdata-path to genpolicy Signed-off-by: Saul Paredes <saulparedes@microsoft.com>	2025-10-13 10:47:53 -07:00
Saul Paredes	395f237fc2	tests: k8s: use default-initdata.toml when auto-generating policy - copy default-initdata.toml in create_tmp_policy_settings_dir, so it can be modified by other tests if needed - make auto_generate_policy use default-initdata.toml by default - add auto_generate_policy_no_added_flags, so it may be used by tests that don't want to use default-initdata.toml by default Signed-off-by: Saul Paredes <saulparedes@microsoft.com>	2025-10-13 10:47:53 -07:00
Fabiano Fidêncio	e782d1ad50	ci: k8s: Test experimental_force_guest_pull Now that we have added the ability to deploy kata-containers with experimental_force_guest_pull configured, let's make sure we test it to avoid any kind of regressions. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-10-10 20:08:10 +02:00
Fabiano Fidêncio	1bc89d09ae	tests: Consider SNAPSHOTTER in the cluster name Otherwise we have no way to differentiate running tests on qemu-coco-dev with different snapshotters. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-10-10 20:08:10 +02:00
Fabiano Fidêncio	a1f90fe350	tests: k8s: Unify k8s TEE tests There's no reason to have the code duplication between the SNP / TDX tests for CoCo, as those are basically using the same configuration nowadays. Note that for the TEEs case, as the nydus-snapshotter is deployed by the admin, once, instead of deploying it on every run ... I'm actually removing the nydus-snapshotter steps so we make it clear that those steps are not performed by the CI. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-10-10 09:51:59 +02:00
Dan Mihai	364d3cded0	tests: k8s-nested-configmap-secret policy Add auto-generated agent policy in k8s-nested-configmap-secret.bats. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2025-10-08 23:37:54 +00:00
Fabiano Fidêncio	544f688104	tests: Add ability to deploy vanilla k8s with erofs As done in the previous commit, let's expand the vanilla k8s deployment to also allow the erofs host side configuration. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-10-08 10:34:09 +02:00
Fabiano Fidêncio	3ac6579ca6	tests: Add support for deploying vanilla k8s We already have support for deploying a few flavours of k8s that are required for different tests we perform. Let's also add the ability to deploy vanilla k8s, as that will be very useful in the next commits in this series. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-10-08 10:34:09 +02:00
Fabiano Fidêncio	287db1865f	tests: Relax regex used to install containerd Let's make sure that we can get non-official releases as well, otherwise we won't be able to test a coming release of containerd, to know whether it solves issues that we face or not, before it's actually released. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-10-08 10:34:09 +02:00
Dan Mihai	08272ab673	Merge pull request #11884 from kata-containers/sprt/priv-test tests/k8s: Add test for privileged containers	2025-10-07 19:18:06 -07:00
Dan Mihai	650863039b	tests: k8s-volume: auto-generate policy Auto-generate the agent policy, instead of using the insecure "allow all" policy. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2025-10-07 23:35:06 +02:00
Dan Mihai	5ed76b3c91	tests: k8s-volume: retry failed exec Use grep_pod_exec_output to retry possible failing "kubectl exec" commands. Other tests have been hitting such errors during CI in the past. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2025-10-07 23:35:06 +02:00
Aurélien Bombo	6e451e3da0	tests/k8s: Add test for privileged containers This adds an integration test to verify that privileged containers work properly when deploying Kata with kata-deploy. This is a follow-up to #11878. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2025-10-07 09:59:05 -05:00
Fabiano Fidêncio	6f17125ea4	tests: Allow using the new way to deploy nydus-snapshotter This allows us to stop setting up the snapshotter ourselves, and just rely con kata-deploy to do so. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-10-07 10:32:46 +02:00
Fabiano Fidêncio	e1269afe8a	tests: Only use Authorization when GH_TOKEN is available The code, how it was, would lead to the following broke command: `--header "Authorization: Bearer: "` Let's only expand that part of the command if ${GH_TOKEN} is passed, otherwise we don't even bother adding it. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-10-07 10:32:46 +02:00

1 2 3 4 5 ...

1698 Commits