kata-containers

mirror of https://github.com/kata-containers/kata-containers.git synced 2026-04-10 22:12:35 +00:00

Author	SHA1	Message	Date
Fabiano Fidêncio	b3ae6ef99c	Merge pull request #12760 from fitzthum/bump-nvat Bump trustee and guest-components to add nvswitch / ppcie support	2026-04-07 19:07:50 +02:00
Aurélien Bombo	79fab93041	Merge pull request #12779 from rophy/fix/strip-cr-from-tty-exec tests: strip \r from kubectl exec output for TTY containers	2026-04-07 10:19:21 -05:00
Tobin Feldman-Fitzthum	e40abcf72d	nvidia: add nvrc.smi.srs=1 to default nvidia kernel params The attestation-agent no longer sets nvidia devices to ready automatically. Instead, we should use nvrc for this. Since this is required for all nvidia workloads, add it to the default nv kernel params. With bounce buffers, the timing of attesting a device versus setting it to ready is not so important. Signed-off-by: Tobin Feldman-Fitzthum <tfeldmanfitz@nvidia.com>	2026-04-07 14:28:50 +00:00
Tobin Feldman-Fitzthum	7385938c57	tests: fix default KBS Policy path We recently moved the default policy in the Trustee repo. Now it's in the same place as all the other policies. Update the test code to match. Signed-off-by: Tobin Feldman-Fitzthum <tfeldmanfitz@nvidia.com>	2026-04-07 05:46:27 +00:00
Rophy Tsai	f7d9024249	tests: strip \r from kubectl exec output for TTY containers The busybox-pod.yaml test fixture sets tty: true on the second container. When a container has a TTY, kubectl exec may return \r\n line endings. The invisible \r causes string comparisons to fail: container_name=$(kubectl exec ... -- env | grep CONTAINER_NAME) [ "$container_name" == "CONTAINER_NAME=second-test-container" ] This comparison fails because $container_name contains a trailing \r character. Fix by piping through tr -d '\r' after grep. This is harmless when \r is absent and fixes the mismatch when present. Fixes: #9136 Signed-off-by: Rophy Tsai <rophy@users.noreply.github.com>	2026-04-07 01:35:10 +00:00
Dan Mihai	9b770793ba	Merge pull request #12728 from manuelh-dev/mahuber/empty-dir-fsgrou-policy genpolicy: adjust GID after passwd GID handling and set fs_group for encrypted emptyDir volumes	2026-04-06 10:22:34 -07:00
Fabiano Fidêncio	1300145f7a	tests: add k3s/rke2 to OCI 1.3.0 drop-in overlay condition k3s and rke2 ship containerd 2.2.2, which requires the OCI 1.3.0 drop-in overlay. Move them from the separate OCI 1.2.1 branch into the OCI 1.3.0 condition alongside nvidia-gpu, qemu-snp, qemu-tdx, and custom container engine versions. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-04-06 18:50:20 +02:00
llink5	f7878cc385	runtime: fix Docker 26+ networking by rescanning after Start Docker 26+ configures container networking (veth pair, IP addresses, routes) after task creation rather than before. Kata's endpoint scan runs during CreateSandbox, before the interfaces exist, resulting in VMs starting without network connectivity (no -netdev passed to QEMU). Add RescanNetwork() which runs asynchronously after the Start RPC. It polls the network namespace until Docker's interfaces appear, then hotplugs them to QEMU and informs the guest agent to configure them inside the VM. Additional fixes: - mountinfo parser: find fs type dynamically instead of hardcoded field index, fixing parsing with optional mount tags (shared:, master:) - IsDockerContainer: check CreateRuntime hooks for Docker 26+ - DockerNetnsPath: extract netns path from libnetwork-setkey hook args with path traversal protection - detectHypervisorNetns: verify PID ownership via /proc/pid/cmdline to guard against PID recycling - startVM guard: rescan when len(endpoints)==0 after VM start Fixes: #9340 Signed-off-by: llink5 <llink5@users.noreply.github.com>	2026-04-02 21:23:16 +02:00
Manuel Huber	dd868dee6d	tests: nvidia: onboard NIM service test Onboard a test case for deploying a NIM service using the NIM operator. We install the operator helm chart on the fly as this is a fast operation, spinning up a single operand. Once a NIM service is scheduled, the operator creates a deployment with a single pod. For now, the TEE-based flow uses an allow-all policy. In future work, we strive to support generating pod security policies for the scenario where NIM services are deployed and the pod manifest is being generated on the fly. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-04-02 16:58:54 +02:00
Manuel Huber	57e42b10f1	tests: nvidia: Do not use elevated privileges Do not run the NIM containers with elevated privileges. Note that, using hostPath requires proper host folder permissions, and that using emptyDir requires a proper fsGroup ID. Once issue 11162 is resolved, we can further refine the securityContext fields for the TEE manifests. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-04-01 10:23:26 -07:00
Manuel Huber	a762b136de	tests: generate policy for pod-empty-dir-fsgroup The logic in the k8s-empty-dirs.bats file did not add a security policy for the pod-empty-dir-fsgroup.yaml manifest. With this change, we add the policy annotation. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-04-01 10:23:26 -07:00
Manuel Huber	177f5c308e	tests: gpu: use container image layer storage Use the container image layer storage feature for the k8s-nvidia-nim.bats test pod manifests. This reduces the pods' memory requirements. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-04-01 10:22:26 +02:00
Manuel Huber	b6cf00a374	tests: parametrize storage parameters - trusted-storage.yaml.in: use $PV_STORAGE_CAPACITY and $PVC_STORAGE_REQUEST so that PV/PVC size can vary per test. - confidential_common.sh: add optional size (MB) argument to create_loop_device. - k8s-guest-pull-image.bats: pass PV_STORAGE_CAPACITY and PVC_STORAGE_REQUEST when generating storage config. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-04-01 10:22:26 +02:00
Hyounggyu Choi	11cd5f2808	tests: Configure devmapper properly regardless of containerd version The following differences are observed between containerd 1.x and 2.x: ``` [plugins.'io.containerd.snapshotter.v1.devmapper'] snapshotter = 'overlayfs' ``` and ``` [plugins."io.containerd.snapshotter.v1.devmapper"] snapshotter = "overlayfs" ``` The current devmapper configuration only works with double quotes. Make it work with both single and double quotes via tomlq. In the default configuration for containerd 2.x, the following configuration block is missing: ``` [[plugins.'io.containerd.transfer.v1.local'.unpack_config]] platform = "linux/s390x" # system architecture snapshotter = "devmapper" ``` Ensure the configuration block is added for containerd 2.x. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2026-04-01 07:14:52 +02:00
Hyounggyu Choi	8cebcf0113	Merge pull request #12742 from BbolroC/remove-skipped-emptydir-tests-for-ibm-sel tests: Remove skip condition for emptyDir-related tests on IBM SEL	2026-03-27 14:35:48 +01:00
Fabiano Fidêncio	f0ad9f1709	tests: snp: policy: Adjust to containerd 2.3.0 As the AMD maintainers switched to the 2.3.0-beta.0 containerd (due to the nydus fixes that landed there). Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-03-27 11:14:54 +01:00
Fabiano Fidêncio	1b8189731a	tests: hand nydus snapshotter setup over to kata-deploy Now that kata-deploy deploys and manages nydus-for-kata-tee on all platforms, the separate standalone nydus-snapshotter DaemonSet deployment is no longer needed. - Short-circuit deploy_nydus_snapshotter and cleanup_nydus_snapshotter to no-ops with an explanatory message. - Add qemu-snp to the workaround case so AMD SEV-SNP baremetal runners also get USE_EXPERIMENTAL_SETUP_SNAPSHOTTER=true and kata-deploy picks up the snapshotter setup on every run. - Drop the x86_64 arch guard and the hypervisor sub-case from the EXPERIMENTAL_SETUP_SNAPSHOTTER block, allowing any architecture and hypervisor to use the kata-deploy-managed path when the flag is set. Made-with: Cursor Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-03-27 11:14:54 +01:00
Hyounggyu Choi	de3afd3076	tests: Remove skip condition for s390x in trusted ephemeral storage test Remove the skip condition for s390x in k8s-trusted-ephemeral-data-storage.bats. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2026-03-26 18:58:13 +01:00
Hyounggyu Choi	911aee5ad7	tests: Remove skip condition for emptyDir-related tests on IBM SEL Fixes: #10002 Since #11537 resolves the issue, remove the skip conditions for the k8s e2e tests involving emptyDir volume mounts. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2026-03-26 15:39:33 +01:00
Fabiano Fidêncio	814ae53d77	tests: Use the helm chart to setup nydus for TDX Now that containerd 2.3.0-beta.0 has been released, it brings fixes for multi-snapshotters that allow us to test the baremetal machines in the same way we test the non-baremetal ones. Let's start doing the switch for TDX as timezone is friendlier with Mikko. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-03-24 19:13:59 +01:00
Manuel Huber	79efe3e041	tests: gpu: use container data storage feature Use the container data storage feature for the k8s-nvidia-nim.bats test pod manifests. This reduces the pods' memory requirements. For this, enable the block-encrypted emptydir_mode for the NVIDIA GPU TEE handlers. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-03-23 11:43:11 -07:00
Steve Horsman	2728b493d5	Merge pull request #12681 from manuelh-dev/mahuber/ci-pip-py-venv tests: cc: setup function for python venv	2026-03-23 14:33:30 +00:00
Fabiano Fidêncio	fe817bb47b	Merge pull request #12705 from fidencio/topic/tests-nginx-connectibity-2nd-try tests: nginx-connectivity: Use `-O index.html` to override the downloaded file	2026-03-23 13:08:51 +01:00
Fabiano Fidêncio	514a2b1a7c	Merge pull request #12264 from fidencio/topic/nvidia-gpu-cc-use-nydus-snapshotter nvidia: cc: Use nydus-snapshotter	2026-03-23 12:50:15 +01:00
Fabiano Fidêncio	83f37f4beb	tests: nginx-connectivity: Override index.html (2nd try) We need to explicitly pass `-O index.html` as the busybox' wget has a different behaviour than GNU's wget. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-03-23 11:11:44 +01:00
Fabiano Fidêncio	e44dfccf7a	Revert "tests: nginx-connectivity: Allow overriding the downloded file" This reverts commit `4403289123`. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-03-23 11:06:23 +01:00
Fabiano Fidêncio	4403289123	tests: nginx-connectivity: Allow overriding the downloded file In case a wget fails for one reason or another, it'll leave behind an 'index.html' file. Let's make sure we allow overriding that file so the retry loop doesn't fail for no reason. Fixes: #12670 Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-03-23 04:08:24 +01:00
Fabiano Fidêncio	740d380b8e	tests: nvidia: cc: Use nydus-snapshotter So we can test what we just changed in the config files. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-03-22 10:10:34 +01:00
Manuel Huber	5765bc97b4	tests: cc: setup function for python venv We recently had a failure on a new CI runner where ${HOME}/.cicd/venv/bin/activate was not present. The relevant call originated from ensure_sev_snp_measure. Thus, add a function ensure_cicd_python_venv before callers to pip install. Currently, the NVIDIA NIM test and the confidential attestation tests use pip to install dependencies. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-03-18 17:07:47 -07:00
Aurélien Bombo	f8e234c6f9	Merge pull request #12650 from kata-containers/sprt/remove-csi ci: Stop building/deploying CSI driver	2026-03-16 16:53:02 -05:00
Manuel Huber	e13748f46d	tests: Adapt trusted ephemeral storage test With the new CDH version, the LUKS header is moved off of the disk into guest memory. We hence adapt the test's filesystem type checks. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-03-16 09:43:17 -07:00
Manuel Huber	5bbc0abb81	tests: use pre-created, signed sealed secrets With signature support for sealed secrets, use pre-created signed sealed secrets and provision the signing public key to the KBS. Add instructions for re-creating these signed secrets. Improve k8s-sealed-secrets.bats by reducing repeated kubectl logs calls. A test run showed a SIGPIPE error on one of the grep-logs while the printouts of the initial kubectl logs invocation showed that the expected values were actually in the logs. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-03-16 09:43:17 -07:00
Manuel Huber	4a7022d2f4	tests: nvidia: call genpolicy auth for all tests Call the setup_genpolicy_registry_auth in run_kubernetes_nv_tests.sh. Authenticate before exercising any tests. Recently, we have seen UnauthorizedError messages for the CUDA vectorAdd image. While this image is not gated behind authentication, rate limiting may be a possible issue. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-03-13 09:03:01 -07:00
Aurélien Bombo	dd2c4c0db3	Revert "coco: ci: Add no-op steps to deploy CSI driver" This reverts commit `5e4990bcf5`. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2026-03-11 12:55:23 -05:00
Fabiano Fidêncio	374b0abe29	tests: Fix kubelet data dir for k0s in trusted ephemeral storage test k0s uses /var/lib/k0s/kubelet instead of /var/lib/kubelet as its kubelet data directory. Introduce get_kubelet_data_dir() in tests_common.sh and use it in k8s-trusted-ephemeral-data-storage.bats instead of hardcoding /var/lib/kubelet. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-03-09 14:52:17 -05:00
Aurélien Bombo	68bdbef676	tests: Improve logging for some tests Use modern test semantics to ease debugging. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2026-03-09 14:52:17 -05:00
Aurélien Bombo	3dd77bf576	tests: Introduce new env variables to ease development It can be useful to set these variables during local testing: * AZ_REGION: Region for the cluster. * AZ_NODEPOOL_TAGS: Node pool tags for the cluster. * GENPOLICY_BINARY: Path to the genpolicy binary. * GENPOLICY_SETTINGS_DIR: Directory holding the genpolicy settings. I've also made it so that tests_common.sh modifies the duplicated genpolicy-settings.json (used for testing) instead of the original git-tracked one. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2026-03-09 14:52:17 -05:00
Aurélien Bombo	a98e328359	tests: Add test for trusted ephemeral data storage This tests the feature on CoCo machines. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2026-03-09 14:52:17 -05:00
Alex Lyn	a35dcf952e	ci: Fix YAML parsing flakiness caused by mktemp random suffixes In some CI runs, `mktemp` generates random characters that accidentally form file extensions like `.cSV` or `.Xml`. This triggers downstream parsing errors because the YAML content is misidentified as CSV/XML. The issue looks like this: ``` '/tmp/bats-run-KodZEA/.../pod-guest-pull-in-trusted-storage.yaml.in.cSV': ... ``` This commit fixes the issue by: 1. Moving the `XXXXXX` placeholder before the `.yaml` extension. 2. Ensuring the generated file always ends in `.yaml`. This prevents format misidentification while maintaining filename uniqueness and security. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-03-06 09:21:29 +08:00
Fabiano Fidêncio	8f35c31b30	Merge pull request #12542 from fidencio/topic/genpolicy-distribute-different-settings-rather-than-patching-for-ci genpolicy: settings.d drop-ins and scenario example drop-ins	2026-03-05 07:37:30 +01:00
Fabiano Fidêncio	b5e0a5b7d6	Merge pull request #12555 from fidencio/topic/tests-use-local-pv-pvc-for-policy-tests k8s-policy-pvc: use local PV/PVC when no default StorageClass exists	2026-03-05 07:37:11 +01:00
Fabiano Fidêncio	a0b9d965e5	k8s-policy-pvc: use local PV/PVC when no default StorageClass exists Create local block storage (loop device, StorageClass, PV) in the test only when the cluster has no default StorageClass, matching the approach used in k8s-volume.bats. Set our StorageClass as default so the PVC binds to our PV; tear it down after the test. When a default already exists (e.g. AKS), skip creation and cleanup so we do not change the cluster's default storage class. Fixes: #9846 Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-03-04 21:50:51 +01:00
Fabiano Fidêncio	d40afe592c	genpolicy: add settings drop-in directory and RFC 6902 JSON Patch support Allow genpolicy -j to accept a directory instead of a single file. When given a directory, genpolicy loads genpolicy-settings.json from it and applies all genpolicy-settings.d/*.json files (sorted by name) as RFC 6902 JSON Patches. This gives precise control over settings with explicit operations (add, remove, replace, move, copy, test), including array index manipulation and assertions. Ship composable drop-in examples in drop-in-examples/: - 10-* files set platform base settings (non-CoCo, AKS, CBL-Mariner) - 20-* files overlay specific adjustments (OCI version, guest pull) Users copy the combination they need into genpolicy-settings.d/. Replace the old adapt_common_policy_settings_* jq-patching functions in tests_common.sh with install_genpolicy_drop_ins(), which copies the right combination of 10-* and 20-* drop-ins for the CI scenario. Tests still generate 99-test-overrides.json on the fly for per-test request/exec overrides. Packaging installs 10-* and 20-* drop-ins from drop-in-examples/ into the tarball; the default genpolicy-settings.d/ is left empty. Made-with: Cursor Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-03-04 20:13:21 +01:00
Dan Mihai	3f845af9d4	tests: k8s: basic test for subPathExpr Add basic genpolicy test coverage for subPathExpr and corresponding container mounts. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2026-03-04 16:28:29 +00:00
Fabiano Fidêncio	56c3618c1d	tests: kata-deploy: wait for API recovery after uninstall kata-deploy's SIGTERM cleanup restarts the CRI runtime, which on k3s/rke2 takes down the API server temporarily. The helm uninstall may complete with errors, and the next test suite would start with a dead API. Add a wait loop after uninstall to ensure the API is available before proceeding. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-03-04 11:26:31 +01:00
Fabiano Fidêncio	9725df658f	tests: k8s: policy: set OCI bundle 1.2.1 for k3s/rke2 k3s and rke2 use containerd that expects OCI bundle 1.2.1; otherwise autogenerated policy tests fail. Add adapt_common_policy_settings_for_k3s_rke2 and call it from adapt_common_policy_settings when KUBERNETES is k3s or rke2. Tested with k3s v1.34.4+k3s1, rke2 v1.34.4+rke2r1. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-03-03 12:55:10 +01:00
stevenhorsman	66e58d6490	tests: Delete install_go.sh Having a script to install go is legacy from Jenkins, so delete it, so there is less code in our repo. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-02-27 12:42:43 +00:00
Hyounggyu Choi	be5ae7d1e1	Merge pull request #12573 from BbolroC/support-memory-hotplug-go-runtime-s390x runtime: Support memory hotplug via virtio-mem on s390x	2026-02-27 09:59:40 +01:00
Aurélien Bombo	2a13f33d50	Merge pull request #12565 from microsoft/danmihai1/clh-51.1 versions: update cloud hypervisor to v51.1	2026-02-26 07:52:57 -06:00
Hyounggyu Choi	b1847f9598	tests: Run TestContainerMemoryUpdate() on s390x only with virtio-mem Let's run `TestContainerMemoryUpdate` on s390x only when virtio-mem is enabled. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2026-02-26 14:21:34 +01:00
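
The TTY line-ending fix described in commit f7d9024249 can be sketched in a few lines of shell. This is a minimal illustration under stated assumptions, not the project's test code: the `strip_cr` helper and the hard-coded sample string are hypothetical stand-ins for the real `kubectl exec ... -- env | grep CONTAINER_NAME` pipeline.

```shell
#!/bin/sh
# Hypothetical helper (not from the repository): remove carriage returns from stdin.
strip_cr() {
    tr -d '\r'
}

# Simulated output of a TTY-enabled container: the value ends with \r.
# Command substitution strips the trailing newline, but not the \r.
raw=$(printf 'CONTAINER_NAME=second-test-container\r\n')
expected="CONTAINER_NAME=second-test-container"

# Without stripping, the comparison fails because of the invisible trailing \r.
if [ "$raw" = "$expected" ]; then
    echo "raw: match"
else
    echo "raw: mismatch"
fi

# After piping through tr -d '\r', the comparison succeeds.
clean=$(printf '%s\n' "$raw" | strip_cr)
if [ "$clean" = "$expected" ]; then
    echo "clean: match"
else
    echo "clean: mismatch"
fi
```

Stripping does nothing when no \r is present, so the same pipeline should also be safe for containers without a TTY.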

1121 Commits