kata-containers

mirror of https://github.com/kata-containers/kata-containers.git synced 2026-04-05 03:24:15 +00:00

Author	SHA1	Message	Date
Manuel Huber	b6cf00a374	tests: parametrize storage parameters - trusted-storage.yaml.in: use $PV_STORAGE_CAPACITY and $PVC_STORAGE_REQUEST so that PV/PVC size can vary per test. - confidential_common.sh: add optional size (MB) argument to create_loop_device. - k8s-guest-pull-image.bats: pass PV_STORAGE_CAPACITY and PVC_STORAGE_REQUEST when generating storage config. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-04-01 10:22:26 +02:00
Manuel Huber	79efe3e041	tests: gpu: use container data storage feature Use the container data storage feature for the k8s-nvidia-nim.bats test pod manifests. This reduces the pods' memory requirements. For this, enable the block-encrypted emptydir_mode for the NVIDIA GPU TEE handlers. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-03-23 11:43:11 -07:00
Aurélien Bombo	a98e328359	tests: Add test for trusted ephemeral data storage This tests the feature on CoCo machines. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2026-03-09 14:52:17 -05:00
Dan Mihai	3f845af9d4	tests: k8s: basic test for subPathExpr Add basic genpolicy test coverage for subPathExpr and corresponding container mounts. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2026-03-04 16:28:29 +00:00
Manuel Huber	566bb306f1	tests: enable policy for openvpn on nydus Specify runAsUser, runAsGroup, supplementalGroups values embedded in the image's /etc/group file explicitly in the security context. With this, both genpolicy and containerd, which in case of using nydus guest-pull, lack image introspection capabilities, use the same values for user/group/additionalG IDs at policy generation time and at runtime when the OCI spec is passed. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-02-24 08:08:15 +01:00
Chiranjeevi Uddanti	88203cbf8d	tests: Add regression test for sandbox_cgroup_only=false Add unit test for get_ch_vcpu_tids() and integration test that creates a pod with sandbox_cgroup_only=false to verify it starts successfully. Signed-off-by: Chiranjeevi Uddanti <244287281+chiranjeevi-max@users.noreply.github.com> Co-authored-by: Antigravity <antigravityagent@google.com>	2026-02-18 20:20:14 +01:00
Ajay Victor	83935e005c	ci: Add integration tests for secret & configmap propagation Enhance k8s-configmap.bats and k8s-credentials-secrets.bats to test that ConfigMap and Secret updates propagate to volume-mounted pods. - Enhanced k8s-configmap.bats to test ConfigMap propagation * Added volume mount test for ConfigMap consumption * Added verification that ConfigMap updates propagate to volume-mounted pods - Enhanced k8s-credentials-secrets.bats to test Secret propagation * Added verification that Secret updates propagate to volume-mounted pods Fixes #8015 Signed-off-by: Ajay Victor <ajvictor@in.ibm.com>	2026-02-14 08:56:21 +05:30
Fabiano Fidêncio	5c0269881e	tests: Make editorconfig-checker happy - Trim trailing whitespace and ensure final newline in non-vendor files - Add .editorconfig-checker.json excluding vendor dirs, .patch, .img, .dtb, .drawio, *.svg, and pkg/cloud-hypervisor/client so CI only checks project code - Leave generated and binary assets unchanged (excluded from checker) Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Co-authored-by: Cursor <cursoragent@cursor.com>	2026-02-10 21:58:28 +01:00
Manuel Huber	525192832f	tests: Clean up superfluous GPU annotation This annotation was required for GPU cold-plug before using a newer device plugin and before querying the pod resources API. As this annotation is no longer required, cleaning it up. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-02-09 11:28:24 -08:00
Fabiano Fidêncio	dda1b30c34	tests: nvidia-nim: Use sealed secrets for NGC_API_KEY Convert the NGC_API_KEY from a regular Kubernetes secret to a sealed secret for the CC GPU tests. This ensures the API key is only accessible within the confidential enclave after successful attestation. The sealed secret uses the "vault" type which points to a resource stored in the Key Broker Service (KBS). The Confidential Data Hub (CDH) inside the guest will unseal this secret by fetching it from KBS after attestation. The initdata file is created AFTER create_tmp_policy_settings_dir() copies the empty default file, and BEFORE auto_generate_policy() runs. This allows genpolicy to add the generated policy.rego to our custom CDH configuration. The sealed secret format follows the CoCo specification: sealed.<JWS header>.<JWS payload>.<signature> Where the payload contains: - version: "0.1.0" - type: "vault" (pointer to KBS resource) - provider: "kbs" - resource_uri: KBS path to the actual secret Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-02-04 12:34:44 +01:00
Steve Horsman	4d1095e653	Merge pull request #12350 from manuelh-dev/mahuber/term-grace-period tests: Remove terminationGracePeriod in manifests	2026-01-29 15:17:17 +00:00
Dan Mihai	20ca4d2d79	runtime: DEFDISABLEBLOCK := true 1. Add disable_block_device_use to CLH settings file, for parity with the already existing QEMU settings. 2. Set DEFDISABLEBLOCK := true by default for both QEMU and CLH. After this change, Kata Guests will use by default virtio-fs to access container rootfs directories from their Hosts. Hosts that were designed to use Host block devices attached to the Guests can re-enable these rootfs block devices by changing the value of disable_block_device_use back to false in their settings files. 3. Add test using container image without any rootfs layers. Depending on the container runtime and image snapshotter being used, the empty container rootfs image might get stored on a host block device that cannot be safely hotplugged to a guest VM, because the host is using the same block device. 4. Add block device hotplug safety warning into the Kata Shim configuration files. Signed-off-by: Dan Mihai <dmihai@microsoft.com> Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Signed-off-by: Cameron McDermott <cameron@northflank.com>	2026-01-28 19:47:49 +01:00
Manuel Huber	6438fe7f2d	tests: Remove terminationGracePeriod in manifests Do not kill containers immediately, instead use Kubernetes' default termination grace period. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-01-23 16:18:44 -08:00
Manuel Huber	78c41b61f4	tests: nvidia: Update images, probes and timeouts Changes in NIM/RAG samples: - update image references - update memory requirements, timeouts, model name - sanitize some of the probes and print-out Further refinements can be made in the future. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2025-12-18 10:57:14 +01:00
Manuel Huber	0373428de4	tests: nvidia: Use secret for NGC API key This is a slight change in the manifest to at least use a secret for the environment variable. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2025-12-18 10:57:14 +01:00
Fabiano Fidêncio	50b853eb93	tests: nvidia: Always rely on the "kata" default runtime class This is a pattern already followed by all the other tests. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-12-12 16:31:42 +01:00
Manuel Huber	1781fb8b06	tests: nvidia: cc: Use CUDA image from NVCR Pull from nvcr.io to avoid hitting unauthenticated pull rate limits. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2025-12-12 12:52:33 +01:00
Manuel Huber	bf26ad9532	nvidia: tests: remove outer CDI annotations With the new device plugin being used by CI runners, these annotations are no longer necessary. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2025-12-12 12:52:33 +01:00
Fabiano Fidêncio	71f78cc87e	tests: cc: gpu: Lower the amount of memory required by the pods We've made the pods require a ridiculous amount of memory, just for the sake of getting them running. Now that those are running, tests are passing, CI is required, let's work to lower the amount of memory needed as everything else is working as expected. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-12-06 00:16:43 +01:00
Manuel Huber	34efa83afc	tests: nvidia: cc: Add attestation test Add the attestation bats test case to the NVIDIA CI and provide a second pod manifest for the attestation test with a GPU. This will enable composite attestation in a subsequent step. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2025-12-05 11:48:55 +01:00
stevenhorsman	5c618dc8e2	tests: Switch nginx images to use version.yaml details - Swap out the hard-coded nginx registry and versions for reading the test image details from version.yaml which can also ensure that the quay.io mirror is used rather than the docker hub versions which can hit pull limits - Try setting imagePullPolicy Always to fix issues with the arm CI Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-12-02 10:04:09 +01:00
Manuel Huber	7e20118c8e	tests: nvidia: move secret definitions to bottom The add_allow_all_policy_to_yaml in tests_common.sh needs some improvements so that this function can support pod manifests with different resource kinds. For now, moving the Secret definition to the bottom so that we can create a default policy for the Pod. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2025-11-21 09:24:15 +01:00
Manuel Huber	477ca3980b	tests: nvidia: cc: Re-enable multi GPU test case Use the pod name variable so that kubectl wait finds the pod. Currently, kubectl waits for nvidia-nim-llama-3-2-nv-embedqa-1b-v2, not for nvidia-nim-llama-3-2-nv-embedqa-1b-v2-tee Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2025-11-20 10:05:46 +01:00
Fabiano Fidêncio	94ed4051b0	tests: nvidia: cc: Increase RAM for NIM pods Those need to pull the models inside the guest, and the guest has 50% of its memory "allowed" to be used as tmpfs, so, we gotta use the RAM that we have. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-11-18 22:29:42 +01:00
Fabiano Fidêncio	e5062a056e	tests: nvidia: cc: Adjust timeouts on NIM pods Timeout increases for confidential computing slowness: * livenessProbe: * initialDelaySeconds: 15 → 120 seconds * timeoutSeconds: 1 → 10 seconds * failureThreshold: 3 → 10 * readinessProbe: * initialDelaySeconds: 15 → 120 seconds * timeoutSeconds: 1 → 10 seconds * failureThreshold: 3 → 10 * startupProbe: * initialDelaySeconds: 40 → 180 seconds * timeoutSeconds: 1 → 10 seconds * failureThreshold: 180 → 300 Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-11-18 22:29:42 +01:00
Fabiano Fidêncio	4d314e8676	tests: nvidia: nims: Adjust to CC There are several changes needed in order to get this test working with CC, and yet we still are skipping it. Basically, we need to: * Pull an authenticated image inside the guest, which requires: * Using Trustee to release the credential * We still depend on a PR to be merged on Trustee side * https://github.com/confidential-containers/trustee/pull/1035 * We still depend on a Trustee bump (including the PR above) on our side Apart from those changes, I ended up "duplicating" the tests by adding a "-tee" version of those, which already have: * The proper kbs annotations set up * Dropped host mounts * Increases the memory needed Last but not least, as "bats" probably means "being a terrible script", I had to re-arrange a few things otherwise the tests would not even run due to bats-isms that I am sincerely not able to pin-point. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-11-10 13:01:30 +01:00
Manuel Huber	c6dc176a03	tests: nvidia: cc: Enable NIMs tests Same deal as the previous commit, just enabling the tests here, with the same list of improvements that we will need to go through in order to get it working in a perfect way. Signed-off-by: Manuel Huber <manuelh@nvidia.com> Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-11-06 16:28:33 +01:00
Manuel Huber	8ca77f2655	tests: nvidia: cc: Run CUDA vectorAdd tests on CC mode While the primary goal of this change is to detect regressions to the NVIDIA SNP GPU scenario, various improvements to reflect a more realistic CC setting are planned in subsequent changes, such as: * moving away from the overlayfs snapshotter * disabling filesystem sharing * applying a pod security policy * activating the GPUs only after attestation * using a refined approach for GPU cold-plugging without requiring annotations * revisiting pod timeout and overhead parameters (the podOverhead value was increased due to CUDA vectorAdd requiring about 6Gi of podOverhead, as well as the inference and embedqa requiring at least 12Gi, respectively, 14Gi of podOverhead to run without invoking the host's oom-killer. We will revisit this aspect after addressing points 1. and 2.) Signed-off-by: Manuel Huber <manuelh@nvidia.com> Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-11-06 16:28:33 +01:00
Manuel Huber	be32b77baf	ci: Add NVIDIA CUDA vectoradd test This change adds a CUDA vectoradd test case and makes enabling NVRC tracing optional and idempotent. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2025-10-28 19:12:16 +01:00
Saul Paredes	395f237fc2	tests: k8s: use default-initdata.toml when auto-generating policy - copy default-initdata.toml in create_tmp_policy_settings_dir, so it can be modified by other tests if needed - make auto_generate_policy use default-initdata.toml by default - add auto_generate_policy_no_added_flags, so it may be used by tests that don't want to use default-initdata.toml by default Signed-off-by: Saul Paredes <saulparedes@microsoft.com>	2025-10-13 10:47:53 -07:00
Aurélien Bombo	6e451e3da0	tests/k8s: Add test for privileged containers This adds an integration test to verify that privileged containers work properly when deploying Kata with kata-deploy. This is a follow-up to #11878. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2025-10-07 09:59:05 -05:00
Dan Mihai	5e46f814dd	Merge pull request #11832 from kata-containers/sprt/dev-hostpath runtime: Simplify mounting guest devices when using hostPath volumes	2025-10-06 12:36:36 -07:00
Manuel Huber	4b7c1db064	ci: Add test case for openvpn Introduce new test case which verifies that openvpn clients and servers can run as Kata pods and can successfully establish a connection. Volatile certificates and keys are generated by an initialization container and injected into the client and server containers. This scenario requires TUN/TAP support for the UVM kernel. Signed-off-by: Manuel Huber <mahuber@microsoft.com> Co-authored-by: Manuel Huber <manuelh@nvidia.com>	2025-10-02 11:40:49 +02:00
Aurélien Bombo	5c21b1faf3	runtime: Simplify mounting guest devices when using hostPath volumes This change crystallizes and simplifies the current handling of /dev hostPath mounts with virtually no functional change. Before this change: - If a mount DESTINATION is in /dev and it is a non-regular file on the HOST, the shim passes the OCI bind mount as is to the guest (e.g. /dev/kmsg:/dev/kmsg). The container rightfully sees the GUEST device. - If the mount DESTINATION does not exist on the host, the shim relies on k8s/containerd to automatically create a directory (i.e. non-regular file) on the HOST. The shim then also passes the OCI bind mount as is to the guest. The container rightfully sees the GUEST device. - For other /dev mounts, the shim passes the device major/minor to the guest over virtio-fs. The container rightfully sees the GUEST device. After this change: - If a mount SOURCE is in /dev and it is a non-regular file on the HOST, the shim passes the OCI bind mount as is to the guest. The container rightfully sees the GUEST device. - The shim no longer relies on k8s/containerd to create missing mount directories. Instead it explicitly handles missing mount SOURCES, and treats them like the previous bullet point. - The shim no longer uses virtio-fs to pass /dev device major/minor to the guest, instead it passes the OCI bind mount as is. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2025-10-01 15:32:21 -05:00
Hyounggyu Choi	c961f70b7e	ci: Remove DL3007 ignore comment for base image The Hadolint warning DL3007 (pin the version explicitly) is no longer applicable. We have updated the base image to use a specific version digest, which satisfies the linter's requirement for reproducible builds. This commit removes the corresponding inline ignore comment. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2025-09-25 15:46:39 +02:00
Hyounggyu Choi	e9653eae6e	GHA: Pin Alpine to 3.20 for tee-unencrypted image We recently hit the following error during build: ``` RUN ssh-keygen -t ed25519 -f /etc/ssh/ssh_host_ed25519_key -P "" OpenSSL version mismatch. Built against 3050003f, you have 30500010 ``` This happened because `alpine:latest` moved forward and the `ssh-keygen` binary in the base image was compiled against a newer OpenSSL version that is not available at runtime. Pinning the base image to the stable release (3.20) avoids the mismatch and ensures consistent builds. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2025-09-25 11:49:04 +02:00
Cameron Baird	bdd98ec623	ci: Add test case for iptables, exercised via istio init container Introduce new test case in k8s-iptables.bats which verifies that workloads can configure iptables in the UVM. Users discovered that they weren't able to do this for common usecases such as istio. Proper support for this should be built into UVM kernels. This test ensures that current and future kernel configurations don't regress this functionality. Signed-off-by: Cameron Baird <cameronbaird@microsoft.com>	2025-09-04 07:18:45 +02:00
Zvonko Kaiser	8be41a4e80	gpu: Add embedding service For a simple RAG pipeline add an embedding service Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-08-14 16:34:21 +00:00
Zvonko Kaiser	8768e08258	gpu: Add embedding service For a simple RAG pipeline add an embedding service Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-07-30 13:45:59 +00:00
Zvonko Kaiser	254dbd9b45	gpu: Add Pod spec for NIM llama Pod spec for the NIM inferencing service Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-07-30 13:45:59 +00:00
Xuewei Niu	6aa3517393	tests: Prevent the shim from being killed in k8s-oom test The actual memory usage on the host is equal to the hypervisor memory usage plus the user memory usage. The OOM killer might kill the shim when the memory limit on the host is the same as that of the container and the container consumes all available memory. In this case, containerd will never receive an OOM event, but will get a "task exit" event instead. That makes the `k8s-oom.bats` test fail. The fix is to add a new container to increase the sandbox memory limit. When the container "oom-test" is killed by the OOM killer, there is still available memory for the shim, so it will not be killed. Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2025-07-24 23:44:21 +08:00
Archana Choudhary	e093919b42	tests: update container image for ci and unit test This patch updates the container image for the CI test workloads: - `k8s-layered-sc-deployment.yaml` - `k8s-pod-sc-deployment.yaml` - `k8s-pod-sc-nobodyupdate-deployment.yaml` - `k8s-pod-sc-supplementalgroups-deployment.yaml` - `k8s-policy-deployment.yaml` Also updates unit tests: - `test_create_container_security_context` - `test_create_container_security_context_supplemental_groups` This fixes tests failing due to an image pull error as the previous image is no longer available in the container registry. Signed-off-by: Archana Choudhary <archana1@microsoft.com> Signed-off-by: Saul Paredes <saulparedes@microsoft.com>	2025-06-20 10:46:56 -07:00
Cameron Baird	19d502de76	ci: Add test cases for fsGroup and supplementalGroup fields Introduce new test case to the security context bats file which verifies that policy works properly for a deployment yaml containing fsGroup and supplementalGroup configuration. Signed-off-by: Cameron Baird <cameronbaird@microsoft.com>	2025-05-13 21:48:58 +00:00
stevenhorsman	f2a2117252	tests: k8s: Retry output of kubectl exec in k8s-cpu-ns We are seeing failures in this test, where the output of the kubectl exec command seems to be blank, so try retrying the exec like #11024 Fixes: #11133 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-04-30 15:01:08 +01:00
Cameron Baird	fc75aee13a	ci: Add CI tests for runAsGroup, GID policy Introduce tests to check for policy correctness on a redis deployment with 1. a pod-level securityContext 2. a container-level securityContext which shadows the pod-level securityContext 3. a pod-level securityContext which selects an existing user (nobody), causing a new GID to be selected. Redis is an interesting container image to test with because it includes a /etc/passwd file with existing user/group configuration of 1000:1000 baked in. Signed-off-by: Cameron Baird <cameronbaird@microsoft.com>	2025-04-28 16:28:31 +00:00
Ryan Savino	1dbe3fb8bc	tests: fix confidential ssh Dockerfile Need to set correct permissions for ssh directories and files Fixes: #11005 Signed-Off-By: Ryan Savino <ryan.savino@amd.com>	2025-03-10 18:31:05 -05:00
Fabiano Fidêncio	18137b1583	tests: k8s: confidential: Increase log_buf_len to 4M Relying on dmesg is really not ideal, as we may lose important info, mainly those which happen very early in the boot, depending on the size of kernel ring buffer. So, for this specific test, let's increase the kernel ring buffer, by default, to 4M. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2025-01-23 12:04:13 +01:00
Fabiano Fidêncio	44ff602c64	tests: k8s: Be more aggressive to get OOM Let's increase the amount of bytes allocated per VM worker, so we can hit the OOM sooner. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2025-01-06 21:48:20 +01:00
Saul Paredes	711d12e5db	policy: support optional metadata uid field This prevents a deserialization error when uid is specified Signed-off-by: Saul Paredes <saulparedes@microsoft.com>	2024-12-02 11:24:58 -08:00
Saul Paredes	461efc0dd5	tests: remove manifest v1 test This test was meant to show support for pulling images with v1 manifest schema versions. The nginxhttps image has been modified in https://hub.docker.com/r/ymqytw/nginxhttps/tags such that we are no longer able to pull it: $ docker pull ymqytw/nginxhttps:1.5 Error response from daemon: missing signature key We may remove this test since schema version 1 manifests are deprecated per https://docs.docker.com/engine/deprecated/#pushing-and-pulling-with-image-manifest-v2-schema-1 : "These legacy formats should no longer be used, and users are recommended to update images to use current formats, or to upgrade to more current images". This schema version was used by old docker versions. Further OCI spec https://github.com/opencontainers/image-spec/blob/main/manifest.md#image-manifest-property-descriptions only supports schema version 2. Signed-off-by: Saul Paredes <saulparedes@microsoft.com>	2024-11-08 13:38:51 -08:00
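
Commit dda1b30c34 above describes the sealed secret layout as sealed.<JWS header>.<JWS payload>.<signature>, with a payload that carries version, type, provider and resource_uri. The following is a minimal sketch of that layout only, not the code used by the tests. The JSON key names are copied from the commit message and may not match the exact keys in the CoCo specification; the header and signature are placeholders rather than a real JWS; the helper names and the example KBS path are hypothetical.

```python
import base64
import json


def b64url(data: bytes) -> str:
    """Base64url-encode without padding, as is conventional for JWS segments."""
    return base64.urlsafe_b64encode(data).rstrip(b"=").decode("ascii")


def build_sealed_secret(resource_uri: str) -> str:
    """Assemble a string shaped like sealed.<header>.<payload>.<signature>.

    Assumption: key names follow the commit message; header and signature
    are placeholders, so the result is NOT a cryptographically valid JWS.
    """
    payload = {
        "version": "0.1.0",
        "type": "vault",      # pointer to a KBS resource, per the commit message
        "provider": "kbs",
        "resource_uri": resource_uri,
    }
    header = b64url(b"placeholder-jws-header")
    body = b64url(json.dumps(payload).encode("utf-8"))
    signature = "placeholder-signature"
    return f"sealed.{header}.{body}.{signature}"


def decode_payload(sealed: str) -> dict:
    """Split the four dot-separated segments and decode the payload segment."""
    prefix, _header, body, _signature = sealed.split(".")
    if prefix != "sealed":
        raise ValueError("not a sealed secret string")
    padded = body + "=" * (-len(body) % 4)  # restore stripped base64 padding
    return json.loads(base64.urlsafe_b64decode(padded))


if __name__ == "__main__":
    # Hypothetical KBS path, for illustration only.
    secret = build_sealed_secret("kbs:///default/example/api-key")
    print(secret)
    print(decode_payload(secret))
```

Decoding the payload this way is only useful for inspecting what a sealed secret points at; per the commit message, the actual unsealing is done by the Confidential Data Hub inside the guest, which fetches the resource from KBS after attestation.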


109 Commits