kata-containers

mirror of https://github.com/kata-containers/kata-containers.git synced 2026-03-18 02:32:26 +00:00

Author	SHA1	Message	Date
Fabiano Fidêncio	59f487d7ab	do-not-merge: tests/cri-containerd: temporarily use containerd fork with getRuncOptions fix The cri-containerd integration tests fail with the shim sandboxer when running non-runc runtimes (e.g. Kata). The root cause is a bug in containerd's client/task.go: getRuncOptions() unconditionally tries to unmarshal the container's stored runtimeOptions into containerd.runc.v1.Options, but Kata containers store runtimeoptions.v1.Options. This causes: failed to create containerd task: failed to get runtime v2 options: can't unmarshal type "runtimeoptions.v1.Options" to output "containerd.runc.v1.Options" A fix has been submitted upstream. Until it is merged and released, clone containerd from the fork that carries the fix so that `make cri-integration` (which builds and runs its own containerd daemon) picks up the corrected binary. TODO: revert once the fix is in an upstream containerd release and versions.yaml is updated accordingly. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-03-06 17:16:19 +01:00
Fabiano Fidêncio	1f9260d978	tests: exclude TestContainerRestart from the cri-containerd test list Creating a new container in the same sandbox VM after the previous container has exited and been removed has never been supported by kata-containers (neither with the go-based nor the rust-based runtime). When the last container is removed the kata VM shuts down, so any attempt to start a new container in the same sandbox fails. This test exercises a use-case kata does not currently support, and it has never been part of the passing list for good reason. Mark it explicitly excluded with a comment so it is clear this is a deliberate omission rather than an oversight. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-03-06 17:15:57 +01:00
Fabiano Fidêncio	458a64e9b9	tests: Use podsandbox sandboxer for the runc sanity check The check_daemon_setup function verifies that containerd + runc are functional before the real kata tests run. Using the shim sandboxer for this runc check hits a known containerd bug where the OCI spec is not populated before NewBundle is called, so config.json is never written and containerd-shim-runc-v2 fails at startup. See https://github.com/containerd/containerd/issues/11640 The sandboxer choice is irrelevant for this sanity check, so use podsandbox which works correctly with runc. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-03-06 11:25:53 +01:00
Alex Lyn	a35dcf952e	ci: Fix YAML parsing flakiness caused by mktemp random suffixes In some CI runs, `mktemp` generates random characters that accidentally form file extensions like `.cSV` or `.Xml`. This triggers downstream parsing errors because the YAML content is misidentified as CSV/XML. The issues look like as below: ``` '/tmp/bats-run-KodZEA/.../pod-guest-pull-in-trusted-storage.yaml.in.cSV': ... ``` This commit fixes the issue by: 1. Moving the `XXXXXX` placeholder before the `.yaml` extension. 2. Ensuring the generated file always ends in `.yaml`. This prevents format misidentification while maintaining filename uniqueness and security. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-03-06 09:21:29 +08:00
Fabiano Fidêncio	8f35c31b30	Merge pull request #12542 from fidencio/topic/genpolicy-distribute-different-settings-rather-than-patching-for-ci genpolicy: settings.d drop-ins and scenario example drop-ins	2026-03-05 07:37:30 +01:00
Fabiano Fidêncio	b5e0a5b7d6	Merge pull request #12555 from fidencio/topic/tests-use-local-pv-pvc-for-policy-tests k8s-policy-pvc: use local PV/PVC when no default StorageClass exists	2026-03-05 07:37:11 +01:00
Fabiano Fidêncio	a0b9d965e5	k8s-policy-pvc: use local PV/PVC when no default StorageClass exists Create local block storage (loop device, StorageClass, PV) in the test only when the cluster has no default StorageClass, matching the approach used in k8s-volume.bats. Set our StorageClass as default so the PVC binds to our PV; tear it down after the test. When a default already exists (e.g. AKS), skip creation and cleanup so we do not change the cluster's default storage class. Fixes: #9846 Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-03-04 21:50:51 +01:00
Fabiano Fidêncio	d40afe592c	genpolicy: add settings drop-in directory and RFC 6902 JSON Patch support Allow genpolicy -j to accept a directory instead of a single file. When given a directory, genpolicy loads genpolicy-settings.json from it and applies all genpolicy-settings.d/.json files (sorted by name) as RFC 6902 JSON Patches. This gives precise control over settings with explicit operations (add, remove, replace, move, copy, test), including array index manipulation and assertions. Ship composable drop-in examples in drop-in-examples/: - 10- files set platform base settings (non-CoCo, AKS, CBL-Mariner) - 20-* files overlay specific adjustments (OCI version, guest pull) Users copy the combination they need into genpolicy-settings.d/. Replace the old adapt_common_policy_settings_* jq-patching functions in tests_common.sh with install_genpolicy_drop_ins(), which copies the right combination of 10-* and 20-* drop-ins for the CI scenario. Tests still generate 99-test-overrides.json on the fly for per-test request/exec overrides. Packaging installs 10-* and 20-* drop-ins from drop-in-examples/ into the tarball; the default genpolicy-settings.d/ is left empty. Made-with: Cursor Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-03-04 20:13:21 +01:00
Dan Mihai	3f845af9d4	tests: k8s: basic test for subPathExpr Add basic genpolicy test coverage for subPathExpr and corresponding container mounts. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2026-03-04 16:28:29 +00:00
Fabiano Fidêncio	56c3618c1d	tests: kata-deploy: wait for API recovery after uninstall kata-deploy's SIGTERM cleanup restarts the CRI runtime, which on k3s/rke2 takes down the API server temporarily. The helm uninstall may complete with errors, and the next test suite would start with a dead API. Add a wait loop after uninstall to ensure the API is available before proceeding. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-03-04 11:26:31 +01:00
Fabiano Fidêncio	966d710df5	tests: increase kata-deploy wait timeout to 15 minutes kata-deploy restarts the CRI runtime during install, which can cause the kata-deploy pod to be killed and recreated by the DaemonSet controller. On k3s and rke2 in particular, the restart can take several minutes. Increase the default timeout from 600s (10m) to 900s (15m) to accommodate this. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-03-04 11:26:31 +01:00
Fabiano Fidêncio	a2216ec05a	tests: set up full K3s/RKE2 V3 containerd template when needed If the rendered config-v3.toml does not import the drop-in dir, write the full k3s ContainerdConfigTemplateV3 (with hardcoded import path) so kata-deploy can use drop-in. This allows us to test with K3s/RKE2 before my patch there gets released. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-03-04 11:26:31 +01:00
Fabiano Fidêncio	3e807300ac	tests: k0s: Ensure --logging=containerd=debug is passed As the default is `info` and that actually overrides whatever is set in the drop-in file used by k0s. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-03-03 12:55:10 +01:00
Fabiano Fidêncio	876c6c832d	tests: set runtime-request-timeout to 600s for k0s, k3s, rke2, microk8s Align with kubeadm and bare metal by setting the kubelet CRI runtime-request-timeout to 600s in deploy functions for k0s (worker profile), k3s (--kubelet-arg), rke2 (config.yaml), and microk8s (args/kubelet + restart). Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-03-03 12:55:10 +01:00
Fabiano Fidêncio	9725df658f	tests: k8s: policy: set OCI bundle 1.2.1 for k3s/rke2 k3s and rke2 use containerd that expects OCI bundle 1.2.1; otherwise autogenerated policy tests fail. Add adapt_common_policy_settings_for_k3s_rke2 and call it from adapt_common_policy_settings when KUBERNETES is k3s or rke2. Tested with k3s v1.34.4+k3s1, rke2 v1.34.4+rke2r1. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-03-03 12:55:10 +01:00
Dan Mihai	3ea23528a5	docs: require user/group/fsGroup/supplementalGroups Add a nydus guest-pull limitation explaining that specifying runAsUser, runAsGroup, fsGroup, and supplementalGroups are required. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Signed-off-by: Dan Mihai <dmihai@microsoft.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2026-03-02 23:48:36 +01:00
stevenhorsman	993a4846c8	versions: Bump go to 1.25.7 Now that go 1.26 is out, 1.24 is not supported, so bump to 1.25 as per our policy. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-03-02 16:33:47 +01:00
stevenhorsman	66e58d6490	tests: Delete install_go.sh Having a script to install go is legacy from Jenkins, so delete it, so there is less code in our repo. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-02-27 12:42:43 +00:00
Hyounggyu Choi	be5ae7d1e1	Merge pull request #12573 from BbolroC/support-memory-hotplug-go-runtime-s390x runtime: Support memory hotplug via virtio-mem on s390x	2026-02-27 09:59:40 +01:00
Aurélien Bombo	2a13f33d50	Merge pull request #12565 from microsoft/danmihai1/clh-51.1 versions: update cloud hypervisor to v51.1	2026-02-26 07:52:57 -06:00
Hyounggyu Choi	b1847f9598	tests: Run TestContainerMemoryUpdate() on s390x only with virtio-mem Let's run `TestContainerMemoryUpdate` on s390x only when virtio-mem is enabled. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2026-02-26 14:21:34 +01:00
Steve Horsman	da0ca483b0	Merge pull request #12572 from fitzthum/bump-trustee versions: bump Trustee to latest version	2026-02-26 08:48:37 +00:00
Dan Mihai	2361dc7ca0	tests: k8s: reinstate testing on mariner hosts Reinstate mariner host testing - including the Agent Policy tests on these hosts - now that a new CLH version brought in the required fixes. This reverts commit `ea53779b90`. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2026-02-25 21:01:25 +00:00
Tobin Feldman-Fitzthum	b4b5db2f1c	tests: fixup SNP attestation test for new Trustee version Trustee now returns the binary SNP TCB claims as hex rather than base64 (for consistency with other platforms). Fortunately, the sev-snp-measure tool has a flag for setting the output type of the launch digest. I think hex is the default, but let's keep the flag here to be explicit. Signed-off-by: Tobin Feldman-Fitzthum <tfeldmanfitz@nvidia.com>	2026-02-25 09:57:36 -08:00
Steve Horsman	a655605e8f	Merge pull request #12566 from manuelh-dev/mahuber/fail-exp-timeout tests: Extend fail timeout for failure test	2026-02-25 16:11:53 +00:00
Steve Horsman	7ffb7719b5	Merge pull request #12562 from kata-containers/prep-for-go-1.25-switch Prep for go 1.25 switch	2026-02-25 11:13:30 +00:00
stevenhorsman	9b307a5fa6	metrics: Uncapitalise error strings Fix `T1005: error strings should not be capitalized (staticcheck)` This is to comply with go conventitions as errors are normally appended, so there would be a spurious captialisation in the middle of the message Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-02-24 14:33:04 +00:00
stevenhorsman	6eb67327d0	tests: Use ReplaceAll over Replace strings.ReplaceAll was introduced in Go 1.12 as a more readable and self-documenting way to say "replace everything". Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-02-24 14:33:04 +00:00
stevenhorsman	487f530d89	ci: Update golangci configuration Add a setting to skip the `T1005: error strings should not be capitalized (staticcheck)` rule to avoid impact to our error strings Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-02-24 14:22:09 +00:00
Hyounggyu Choi	3d71be3dd3	tests: Improve assertion handling for runtime-rs hypervisor Since runtime-rs added support for virtio-blk-ccw on s390x in #12531, the assertion in k8s-guest-pull-image.bats should be generalized to apply to all hypervisors ending with `-runtime-rs`. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2026-02-24 11:48:07 +01:00
stevenhorsman	2ac89f4569	versions: Update golangci-lint Bump to the latest version to pick up support for Go 1.25 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-02-24 10:02:48 +00:00
Manuel Huber	566bb306f1	tests: enable policy for openvpn on nydus Specify runAsUser, runAsGroup, supplementalGroups values embedded in the image's /etc/group file explicitly in the security context. With this, both genpolicy and containerd, which in case of using nydus guest-pull, lack image introspection capabilities, use the same values for user/group/additionalG IDs at policy generation time and at runtime when the OCI spec is passed. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-02-24 08:08:15 +01:00
Fupan Li	0bfb6b3c45	Merge pull request #12531 from BbolroC/blkdev-hotplug-s390x-runtime-rs runtime-rs: Support for block device hotplug on s390x	2026-02-24 13:03:59 +08:00
Fabiano Fidêncio	a0d954cf7c	tests: Enable auto-generated policies for experimental_force_guest_pull We want to run with auto-generated policies when using experimental_force_guest_pull. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-02-23 22:15:18 +01:00
Manuel Huber	e15c18f05c	tests: Extend fail timeout for failure test Extend the timeout for the assert_pod_fail function call for the test case "Test we cannot pull a large image that pull time exceeds createcontainer timeout inside the guest" when the experimental force guest-pull method is being used. In this method, the image is first pulled on the host before creating the pod sandbox. While image pull times can suddenly spike, we already time out in the assert_pod_fail function before the image is even pulled on the host. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-02-23 12:56:23 -08:00
Hyounggyu Choi	4e533f82e7	tests: Remove skip condition for runtime-rs on s390x in k8s-block-volume This commit removes the skip condition for qemu-runtime-rs on s390x in k8s-block-volume.bats. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2026-02-23 09:00:29 +01:00
Fabiano Fidêncio	96c20f8baa	tests: k8s: set CreateContainerRequest (on free runners) timeout to 600s Set KubeletConfiguration runtimeRequestTimeout to 600s mainly for CoCo (Confidential Containers) tests, so container creation (attestation, policy, image pull, VM start) does not hit the default CRI timeout. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-02-21 08:44:47 +01:00
Fabiano Fidêncio	a6b7a2d8a4	tests: assert_pod_fail accept RunContainerError and StartError Treat waiting.reason RunContainerError and terminated.reason StartError/Error as container failure, so tests that expect guest image-pull failure (e.g. wrong credentials) pass when the container fails with those states instead of only BackOff. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-02-21 08:44:47 +01:00
Fabiano Fidêncio	42d980815a	tests: skip k8s-policy-pvc on non-AKS Otherwise it'll fail as we cannot bind the device. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-02-21 08:44:47 +01:00
Fabiano Fidêncio	1b9b53248e	tests: k8s: coco: rely more on free runners Run all CoCo non-TEE variants in a single job on the free runner with an explicit environment matrix (vmm, snapshotter, pull_type, kbs, containerd_version). Here we're testing CoCo only with the "active" version of containerd. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Co-authored-by: Cursor <cursoragent@cursor.com>	2026-02-21 08:44:47 +01:00
Fabiano Fidêncio	1fa3475e36	tests: k8s: rely more on free runners We were running most of the k8s integration tests on AKS. The ones that don't actually depend on AKS's environment now run on normal ubuntu-24.04 GitHub runners instead: we bring up a kubeadm cluster there, test with both containerd lts and active, and skip attestation tests since those runtimes don't need them. AKS is left only for the jobs that do depend on it. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Co-authored-by: Cursor <cursoragent@cursor.com>	2026-02-21 08:44:47 +01:00
Zvonko Kaiser	6d1eaa1065	Merge pull request #12461 from manuelh-dev/mahuber/guest-pull-bats tests: enable more scenarios for k8s-guest-pull-image.bats	2026-02-20 08:48:54 -05:00
Dan Mihai	ea53779b90	ci: k8s: temporarily disable mariner host Disable mariner host testing in CI, and auto-generated policy testing for the temporary replacements of these hosts (based on ubuntu), to work around missing: 1. cloud-hypervisor/cloud-hypervisor@0a5e79a, that will allow Kata in the future to disable the nested property of guest VPs. Nested is enabled by default and doesn't work yet with mariner's MSHV. 2. cloud-hypervisor/cloud-hypervisor@bf6f0f8, exposed by the large ttrpc replies intentionally produced by the Kata CI Policy tests. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2026-02-19 20:42:50 +01:00
Dan Mihai	3e2153bbae	ci: k8s: easier to modify az aks create command Make `az aks create` command easier to change when needed, by moving the arguments specific to mariner nodes onto a separate line of this script. This change also removes the need for `shellcheck disable=SC2046` here. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2026-02-19 20:42:50 +01:00
Manuel Huber	fd340ac91c	tests: remove skips for some guest-pull scenarios Issue 10838 is resolved by the prior commit, enabling the -m option of the kernel build for confidential guests which are not users of the measured rootfs, and by commit `976df22119`, which ensures relevant user space packages are present. Not every confidential guest has the measured rootfs option enabled. Every confidential guest is assumed to support CDH's secure storage features, in contrast. We also adjust test timeouts to account for occasional spikes on our bare metal runners (e.g., SNP, TDX, s390x). Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-02-19 10:10:55 -08:00
Steve Horsman	6a67250397	Merge commit from fork runtime-go/rs: Disable virtio-pmem for Cloud Hypervisor	2026-02-19 09:00:56 +00:00
Chiranjeevi Uddanti	88203cbf8d	tests: Add regression test for sandbox_cgroup_only=false Add unit test for get_ch_vcpu_tids() and integration test that creates a pod with sandbox_cgroup_only=false to verify it starts successfully. Signed-off-by: Chiranjeevi Uddanti <244287281+chiranjeevi-max@users.noreply.github.com> Co-authored-by: Antigravity <antigravityagent@google.com>	2026-02-18 20:20:14 +01:00
Aurélien Bombo	8ff9cd1f12	Merge pull request #12455 from ajaypvictor/secret-cm-without-sharedfs ci: Add integration tests for secret & configmap propagation	2026-02-18 12:06:48 -06:00
Aurélien Bombo	336b922d4f	tests/cbl-mariner: Stop disabling NVDIMM explicitly This is not needed anymore since now disable_image_nvdimm=true for Cloud Hypervisor. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2026-02-18 11:52:51 -06:00
Dan Mihai	eee25095b5	tests: mariner annotations for k8s-openvpn This test uses YAML files from a different directory than the other k8s CI tests, so annotations have to be added into these separate files. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2026-02-18 07:17:04 +01:00

1 2 3 4 5 ...

1914 Commits