kata-containers

mirror of https://github.com/kata-containers/kata-containers.git synced 2026-07-02 07:02:16 +00:00

Author	SHA1	Message	Date
Steve Horsman	59b27c4645	Merge pull request #13057 from microsoft/danmihai1/deploy-check-hypervisor-name gha: k8s: reject unsupported KATA_HYPERVISOR values	2026-05-17 18:43:49 +01:00
Dan Mihai	ddc36060d2	gha: k8s: reject unsupported KATA_HYPERVISOR values Exit early with an error message instead of starting kata-deploy if the value of KATA_HYPERVISOR is not expected during CI. For example: "cloud-hypervisor" was renamed recently to "clh-runtime-rs" and user scripts depending on the old name were getting tangled in kata-deploy instead of just rejecting the old value quickly. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2026-05-16 01:04:31 +00:00
Dan Mihai	b85fc8ed13	tests: export target_branch="${branch}" Avoid running "git remote show origin" repeatedly when common.bash gets sourced multiple times and target_branch was not specified by the caller. Repeated "git remote show origin" calls inflicted the additional overhead of authenticating and communicating with the remote git repository. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2026-05-16 00:59:44 +00:00
Dan Mihai	0f3df5d1e4	Merge pull request #13025 from manuelh-dev/mahuber/img-pull-policy tests: generate guest-pull image pull agent security policies	2026-05-15 14:09:00 -07:00
Fabiano Fidêncio	c19bdbf23b	tests: nvidia-nim: use trusted storage templates for runtime-rs Now that runtime-rs supports block-encrypted emptyDir volumes, remove the no-trusted-storage workaround templates and the is_runtime_rs branching in the NIM test. Runtime-rs now uses the same TEE templates as the Go runtime with emptyDir + PVC at 48Gi memory, instead of the 128Gi workaround that compensated for lacking trusted storage. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Assisted-by: Cursor <cursoragent@cursor.com>	2026-05-14 22:56:11 +02:00
Fabiano Fidêncio	54aaa1ea2a	tests: enable trusted ephemeral storage for runtime-rs Remove the runtime-rs skip from the trusted ephemeral data storage test now that runtime-rs implements block-encrypted emptyDir volumes. Also remove the genpolicy drop-in that disabled encrypted_emptydir for runtime-rs and the corresponding copy logic in tests_common.sh. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Assisted-by: Cursor <cursoragent@cursor.com>	2026-05-14 22:56:11 +02:00
Manuel Huber	ed4233bf91	rootfs: cdh: Update CDH to new version Update CDH to a newer version and: - adjust the NVIDIA root filesystem build to reflect the change from using libcryptsetup to using the cryptsetup binary. - adjust image-pull test cases to conduct parallel write operations on the /dev/trusted_store backed guest image pull location since issue #12721 has been solved on CDH side. Fixes #12721 Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-05-13 20:20:45 +02:00
Greg Kurz	d2dc0a923c	Merge pull request #13030 from stevenhorsman/go-1.25.10-bump Go 1.25.10 bump	2026-05-13 08:09:51 +02:00
Dan Mihai	3799473041	Merge pull request #13010 from microsoft/danmihai1/label-references genpolicy: support env variable values sourced from metadata.labels values	2026-05-12 15:41:11 -07:00
Manuel Huber	da4307efb7	tests: generate policies for guest-pull images Replace guest-pull image allow-all placeholders with explicit auto-generated policies for each generated pod manifest. Generate policy after the final YAML edits so initdata and image pull secrets are represented in the policy inputs. Assisted-by: OpenAI Codex <codex@openai.com> Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-05-12 15:03:15 -07:00
Manuel Huber	6a274b5110	tests: seed auto-generated policy from initdata Teach auto_generate_policy to reuse a cc_init_data annotation by decoding it into the temporary default-initdata.toml file. This lets tests preserve CDH initdata while genpolicy appends the generated agent security policy for the workload. Assisted-by: OpenAI Codex <codex@openai.com> Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-05-12 15:03:14 -07:00
Manuel Huber	e774b13c95	tests: share genpolicy registry auth setup helper Move the Docker auth setup into common.bash so tests beyond the NVIDIA runner can provide credentials for genpolicy image pulls. Make the registry, username, password and output directory explicit while preserving the nvcr.io setup used by the NIM tests. Assisted-by: OpenAI Codex <codex@openai.com> Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-05-12 15:03:14 -07:00
Greg Kurz	733ccb3254	Merge pull request #12996 from stevenhorsman/swap-agent-ctl-to-skopeo&umoci agent-ctl: Swap rootfs bundle pull implementation	2026-05-12 19:12:27 +02:00
stevenhorsman	4a65aca9cf	versions: bump golang to 1.25.10 Bump the go version to resolve CVEs: - GO-2026-4918 - GO-2026-4971 - GO-2026-4976 - GO-2026-4977 - GO-2026-4980 - GO-2026-4981 - GO-2026-4982 - GO-2026-4986 Signed-off-by: stevenhorsman <steven@uk.ibm.com> Assisted-by: IBM Bob	2026-05-12 11:56:13 +01:00
Manuel Huber	c265e4905f	tests: nvidia: avoid NIM journal dumps on success BATS_TEST_COMPLETED is per-test and remains empty in teardown_file. Track file-level state so successful NIM runs skip the journal dump while setup or test failures still include node diagnostics. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-05-10 09:10:01 -07:00
Manuel Huber	1c081ff434	tests: nvidia: place NIM service into namespace Place the NIM service into our test namespace. We are still observing various situations where for some reasons, the NIM service appears in the default namespace in our CI. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-05-10 07:36:23 +00:00
Fabiano Fidêncio	f7be57efe2	Merge pull request #13007 from manuelh-dev/mahuber/dbg-nim-svc tests: nvidia: Wait for NIM operator pod and print	2026-05-08 20:58:51 +02:00
Manuel Huber	714adec3f8	tests: nvidia: Wait for NIM operator pod and print Wait for the NIM operator pod to run before deploying NIM services. Add a temporary debug function to print resource placement into the different namespaces. Remove this function again when the NIM tests are stabilized. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-05-08 06:27:48 +00:00
Ubuntu	b95be5332a	genpolicy: env variables from metadata.labels Add basic genpolicy support for container environment variables sourced from metadata.labels. In this implementation, the relevant labels must be available as input to the policy tool. This is slightly different from the way variables sourced from metadata.annotations are treated by the tool: when the relevant annotation is not available as input, the generated Policy allows any value. Depending on metadata.labels use cases that we might encounter maybe the labels will be handled the same way as the annotations in the future. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2026-05-07 23:35:56 +00:00
Dan Mihai	39b9c318e2	tests: k8s: merge two policy-pod test cases One of these test cases was a subset of the other, so remove that redundancy. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2026-05-07 22:39:23 +00:00
stevenhorsman	e92d954b51	agent-ctl: Swap rootfs bundle pull implementation Switch the rootfs bundle pull implementatio from using image-rs to use skopeo and umoci to remove the really long crate dependency tail that image-rs brings. Generated-by: IBM Bob Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-05-07 21:11:27 +01:00
Fabiano Fidêncio	8dde5f39b7	tests: dump kata-deploy pod describe+logs on install timeout When kubectl wait times out the pod never reached Ready, so the existing log collection (which runs after wait succeeds) produces "-- No entries --" with zero useful information. Capture kubectl describe and kubectl logs (including previous container) immediately on timeout so the next CI run shows exactly why the pod is stuck (ImagePullBackOff, OOMKilled, probe failures, containerd restart hang, etc.). Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Assisted-by: Cursor <cursoragent@cursor.com>	2026-05-07 13:40:55 +02:00
Fabiano Fidêncio	0f3160276b	ci: k8s: skip no-op Helm uninstall on free runners In cleanup_kata_deploy, bail out early when no kata-deploy Helm release exists so baremetal-* pre-deploy cleanup on fresh clusters does not block on helm uninstall --wait (up to 10m). Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Assisted-by: Cursor <cursoragent@cursor.com>	2026-05-07 13:40:55 +02:00
Fabiano Fidêncio	19c194aa94	ci: Add runtime-rs GPU shims to NVIDIA GPU CI workflow Add qemu-nvidia-gpu-runtime-rs and qemu-nvidia-gpu-snp-runtime-rs to the NVIDIA GPU test matrix so CI covers the new runtime-rs shims. Introduce a `coco` boolean field in each matrix entry and use it for all CoCo-related conditionals (KBS, snapshotter, KBS deploy/cleanup steps). This replaces fragile name-string comparisons that were already broken for the runtime-rs variants: `nvidia-gpu (runtime-rs)` was incorrectly getting KBS steps, and `nvidia-gpu-snp (runtime-rs)` was not getting the right env vars. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-05-07 10:33:26 +02:00
Fabiano Fidêncio	acfb9f9762	Merge pull request #12954 from zvonkok/modular-makefile build: remove gha-adjust-to-use-prebuilt-components.sh	2026-05-07 10:32:32 +02:00
manuelh-dev	8473144ee5	Merge pull request #12989 from microsoft/danmihai1/ignore-unnecessary-fields genpolicy: ignore additional irrelevant fields	2026-05-06 23:54:39 -07:00
Greg Kurz	16bc6db59e	static-checks: Drop vendor checks The repo doesn't track vendor code anymore. Also, I could not find any evidence that this code is actually called. The reference to URL ``` https://github.com/kata-containers/community/blob/main/VENDORING.md ``` that was recently removed by https://github.com/kata-containers/community/pull/442 is another indication that this flow is outdated. Drop it. Signed-off-by: Greg Kurz <groug@kaod.org>	2026-05-06 09:49:53 +02:00
Greg Kurz	af54cd8a27	tests: Remove vendor directory Now shipped in the vendored code tarball. Signed-off-by: Greg Kurz <groug@kaod.org>	2026-05-06 09:32:05 +02:00
Dan Mihai	fcee4864e7	genpolicy: ignore additional PodAffinity fields 1. Ignore PodAffinity's preferredDuringSchedulingIgnoredDuringExecution. 2. Ignore additional PodAffinityTerm fields. 3. Add basic tests for the new fields. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2026-05-06 01:38:02 +00:00
Dan Mihai	b6349f50ab	genpolicy: ignore preemptionPolicy Ignore the pod preemptionPolicy field from input YAML - irrelevant for building the Policy. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2026-05-06 00:35:27 +00:00
Dan Mihai	9f4a7a9d55	Merge pull request #12978 from microsoft/danmihai1/empty-env-var genpolicy: support empty environment variables	2026-05-05 14:10:35 -07:00
Dan Mihai	99dd897814	genpolicy: support empty environment variables K8s supports them, so genpolicy should support them too. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2026-05-05 18:53:25 +00:00
Fabiano Fidêncio	29e63c21a1	tests: k8s-cron-job: set runtimeClassName to kata The cron-job test workload was missing `runtimeClassName: kata`, which meant the cron job was not actually being executed under the Kata runtime, defeating the purpose of the test. Set it explicitly, consistent with the sibling `job.yaml` workload. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-05-05 11:21:05 +02:00
Fabiano Fidêncio	27c3dfbb8c	Merge pull request #12943 from fidencio/topic/kata-deploy-add-http-health-probes kata-deploy: add HTTP health probes (healthz/readyz)	2026-05-05 09:30:17 +02:00
Dan Mihai	0a6dc2fae0	ci: mariner: use OCI version 1.2.1 Mariner moved from version 1.2.0 to version 1.2.1. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2026-05-05 02:23:30 +00:00
Fabiano Fidêncio	9e3bd6b576	tests: fix kata-deploy lifecycle test reliability Fix two issues in kata-deploy-lifecycle.bats that caused failures on k3s, k0s and rke2: run_on_host(): - `kubectl run --rm -i` causes k3s/rke2 to inject session-recording banners into stdout, polluting command output and breaking string assertions. Replace with a create/wait/logs/delete sequence so only the container's actual stdout is captured. "Artifacts are fully cleaned up after uninstall": - After a CRI restart the kubelet may briefly report "Unknown" for the container runtime version. Retry for up to 60s before asserting. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-05-03 22:09:08 +02:00
Fabiano Fidêncio	ed4f6ebc9e	tests: use readiness probes to wait for kata-deploy install Now that kata-deploy has a proper readiness probe (/readyz returns 200 only after install completes), replace the ad-hoc wait strategies with kubectl wait --for=condition=Ready on the kata-deploy pods. Note: helm --wait is ineffective for single-node clusters with maxUnavailable=1 (the DaemonSet is considered ready with 0 ready pods), so the CI uses kubectl wait on the pod readiness condition directly. gha-run-k8s-common.sh: - Drop the waitForProcess polling loop for Running pods - Drop the `sleep 60s` with its FIXME comment - Add kubectl wait --for=condition=Ready instead helm-deploy.bash: - Drop the extra `kubectl rollout status` after helm - Drop the `sleep 60` - The existing --wait on the helm command now suffices Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-05-03 22:09:08 +02:00
Fabiano Fidêncio	8c3c7aa871	ci: Drop ITA_KEY usage from CI workflows The ITA_KEY secret was conditionally passed to TDX jobs for Intel Trust Authority attestation, but it is no longer needed. Remove it from all workflow files and the test helper export. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-05-03 18:05:51 +02:00
Zvonko Kaiser	a4129e41f3	build: remove gha-adjust-to-use-prebuilt-components.sh No longer used; its two responsibilities are now expressed directly in the workflows and the Makefile. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2026-04-30 00:49:13 +00:00
Aurélien Bombo	f3dc71a770	Revert "tests: k8s: policy: improve settings selection for runtime-rs hypervisors" This reverts commit `cafdd278ba`.	2026-04-28 10:58:01 -05:00
Aurélien Bombo	cf6a91a104	runtime-rs/config: rename cloud-hypervisor to clh This aligns on the previous commit and runtime-go. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2026-04-28 10:58:01 -05:00
Aurélien Bombo	e4fbddb91a	ci: rename cloud-hypervisor to clh-runtime-rs This aligns on qemu-runtime-rs and makes more sense. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2026-04-28 10:58:01 -05:00
Saul Paredes	7c8df3b9e6	Revert "test: temp skip failing tests on AKS" This reverts commit `90e94ab305`.	2026-04-27 09:36:51 -07:00
Saul Paredes	3273c4e1cc	Revert "ci: Skip tests not working with k8s 1.36.0" This reverts commit `df68536cd6`.	2026-04-27 08:08:27 -07:00
Saul Paredes	51f234cb56	tests: describe pods deployment when testing deployment output For k8s 1.36.0, the events of a pod are no longer included in the "kubectl describe pod" output when describing a deployment. Describe using the "app" label instead. Signed-off-by: Saul Paredes <saulparedes@microsoft.com>	2026-04-27 08:07:58 -07:00
Mikko Ylinen	9cccfb5cb5	tests: align qemu-tdx kbs tests to use Trustee AS No need to deviate from how other CoCo targets use Trustee and enables us to add more tests (e.g., RVPS) that ITA Trustee implemention does not support. Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com>	2026-04-25 22:53:15 +02:00
Fabiano Fidêncio	df68536cd6	ci: Skip tests not working with k8s 1.36.0 At first we thought this only happened with AKS, but it seems this is a change in k8s 1.36.0 as the tests now started failing outside of AKS as well. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2026-04-25 08:56:42 +02:00
Fabiano Fidêncio	e6c6aad7af	ci: k8s: temporarily remove smb tests All the CIs are failing on the tests and in order to avoid blocking upstream while allowing enough time for the developers to properly fix it, let's just not execute the test. This commit should be reverted once a fix is proposed. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-04-24 21:13:23 +02:00
Aurélien Bombo	15296fc9fe	Merge pull request #12374 from microsoft/cameronbaird/add-cifs kernel: add required configs for CIFS support	2026-04-24 10:42:09 -05:00
Fabiano Fidêncio	b7eb3ae402	tests: Fix shellcheck issues in helm-deploy.bash Address shellcheck warnings including proper variable quoting, use of [[ ]] over [ ], declaring and assigning variables separately, and adding appropriate shellcheck disable directives where needed. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Made-with: Cursor	2026-04-24 08:14:08 +02:00

1 2 3 4 5 ...

2103 Commits