kata-containers

mirror of https://github.com/kata-containers/kata-containers.git synced 2026-07-01 22:50:54 +00:00

Author	SHA1	Message	Date
Alex Lyn	e21621140f	ci: Add qemu-runtime-rs and clh-runtime-rs test with nydus It aims to enable nydus tests for qemu-runtime-rs and clh-runtime-rs. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-06-11 21:42:48 +02:00
Fabiano Fidêncio	4935bf8bc6	tests: align kata-monitor containerd version selector Switch kata-monitor workflows from the deprecated "active" key to "latest" so CI resolves containerd versions from versions.yaml correctly after the key rename. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-06-09 21:25:45 +02:00
Fabiano Fidêncio	620d641458	ci: rename kata-deploy publish jobs These jobs build and push the kata-deploy OCI image, so call them publish-kata-deploy-image-* instead of -payload-, matching the kata-monitor image jobs and making the workflow easier to read. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-06-09 14:33:30 +02:00
Fabiano Fidêncio	92a9691470	tests: add kata-monitor helm chart k8s test Add a single-job k8s test that installs the kata-deploy helm chart with monitor.enabled=true, pointed at the per-PR kata-monitor image built earlier in the same run, and exercises both the rollout and the user-visible behaviour: * the kata-monitor DaemonSet rolls out and the pod stays up without container restarts; * a real kata-runtime probe pod is scheduled, then /metrics and /sandboxes are scraped through the apiserver pod-proxy to prove kata-monitor sees the sandbox (non-zero running-shim count plus at least one per-sandbox kata_shim_* metric); * after the probe pod is deleted, /metrics drops back to a zero running-shim count. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Assisted-by: OpenAI Codex <codex@openai.com>	2026-06-09 14:33:30 +02:00
Fabiano Fidêncio	63fec205fe	tests: run kata-monitor functional tests against the dedicated image Exercise the published kata-monitor container image (the one built by publish-kata-monitor-payload-amd64) rather than the on-disk binary, so integration regressions like the recent glibc/musl mismatch surface at PR time. The kata-monitor-tests.sh script keeps the binary fallback for ad-hoc local runs. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Assisted-by: OpenAI Codex <codex@openai.com>	2026-06-09 14:33:30 +02:00
Fabiano Fidêncio	d5bc1177c0	tests: focus kata-monitor CI on containerd active Drop the stale CRI-O matrix entry (its cri-tools pin was several releases behind) along with the exclude that hid the containerd job, and pin the remaining job to containerd's "active" track (currently v2.2) via CONTAINERD_VERSION. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Assisted-by: OpenAI Codex <codex@openai.com>	2026-06-09 14:33:30 +02:00
Fabiano Fidêncio	0d6234e7be	ci: share kata image publishing workflows Unify kata-deploy and kata-monitor image publishing behind a single reusable workflow, and rename workflow files to generic kata-images names. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Assisted-by: OpenAI Codex <codex@openai.com>	2026-06-09 14:33:30 +02:00
Fabiano Fidêncio	e122d7ffb0	versions: bump containerd to 2.3 and define minimum/latest test matrix Bump the containerd version used by CI from v1.7.25 to v2.3.0. Rename the version-range fields in versions.yaml and throughout the GitHub Actions workflows from lts/active/version/sandbox_api to minimum/latest to make their meaning self-evident: minimum: "v1.7" # oldest containerd branch under test latest: "v2.3" # newest containerd branch under test Drop the bare version field (superseded by the matrix) and the sandbox_api alias (covered by latest). Update all containerd_version matrix entries in the workflow files accordingly, and update gha-run-k8s-common.sh to resolve the new key names. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Assisted-by: Cursor <noreply@cursor.com>	2026-06-08 19:20:14 +02:00
Fabiano Fidêncio	230e01b04e	Merge pull request #13126 from kata-containers/topic/runtimes-introduce-azure-specific-configs runtime/runtime-rs: introduce Azure specific configs	2026-06-02 09:17:09 +02:00
Greg Kurz	8a49ecb159	Merge pull request #13097 from BbolroC/fix-shim-components-for-s390x ci: Refactor boot-image-se build and update shim components	2026-06-01 11:43:42 +02:00
manuelh-dev	953b306ff3	Merge pull request #12979 from manuelh-dev/mahuber/erofs-tmpfs-mount runtime-rs/agent: support EROFS snapshots without a rwlayer	2026-05-29 13:50:27 -07:00
Zvonko Kaiser	7f906ec95d	build: add kata-deploy-publish target Mirror the CI payload publish flow in local builds, including image and helm chart publishing, while reusing the same chart upload helper in payload-after-push to avoid duplicated chart packaging logic. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2026-05-29 16:22:12 +02:00
Hyounggyu Choi	3175bf683e	GHA: Remove secret CI_HKD_PATH from workflows As the boot-image-se builds a fake image, the secret CI_HKD_PATH is not necessary anymore. Remove it from the workflows. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2026-05-29 11:35:40 +02:00
Hyounggyu Choi	640fa488a5	ci: Refactor boot-image-se build and update shim components - Add FAKE_SE_IMAGE mode support in SE image build scripts for CI without real SE setup - Simplify workflow by removing build-asset-boot-image-se job - Integrate fake-boot-image-se into build matrix instead of separate job - Skip attestation for fake-boot-image-se builds - Update qemu-se and qemu-se-runtime-rs shim components to use: - rootfs-initrd-confidential instead of rootfs-image-confidential - boot-image-se component This change streamlines the s390x SE build process and makes it easier to test without requiring actual Secure Execution infrastructure. This fixes deployment issues on non-TEE systems where TEE-specific artifacts (like boot-image-se for IBM SEL) are not included in the kata-deploy image, while ensuring TEE systems still get all required components. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2026-05-29 11:35:40 +02:00
Manuel Huber	7d9a143747	ci: cover EROFS snapshotter default_size=0 path kata-deploy currently hard-codes the EROFS snapshotter default_size to "10G", so the CoCo EROFS CI lane only exercises the path where the snapshotter provides an rwlayer. Use the generic containerd.userDropIn support for the EROFS default_size and thread it through the Kubernetes CI helpers. Keep the kata-deploy default at "10G" to preserve current behavior, but allow the workflow to set "0" for the runtime-rs no-rwlayer path. Expand the existing EROFS snapshotter job to run both values. The override is written to containerd as a TOML string so "0" is not parsed as an integer. Assisted-by: OpenAI Codex <codex@openai.com> Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-05-28 22:54:56 +00:00
Fabiano Fidêncio	bddf1ecab4	build: stop producing cloud-hypervisor-glibc artifacts Drop cloud-hypervisor-glibc from local and CI kata-deploy build targets now that Azure CLH uses the standard cloud-hypervisor artifact set. This removes obsolete build matrix entries and installer target handling. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-05-28 23:32:37 +02:00
Fabiano Fidêncio	81ce51a9aa	ci: target Azure CLH runtimes directly in AKS tests Switch AKS Mariner matrix entries to clh-azure handlers and remove the temporary host-OS based helm value overrides. Update integration test wiring and required test labels so CI tracks the new runtime names. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-05-28 23:32:37 +02:00
Fabiano Fidêncio	c65d64873b	kata-deploy: prebuild payload-specific component artifacts Build and publish the kata-deploy binary and CoCo guest-pull nydus snapshotter as dedicated per-arch artifacts, then consume those tarballs when assembling the kata-deploy image. This avoids rebuilding those components in the payload image path (where they would be built serially) and reduces overall CI build time. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-05-25 22:13:41 +02:00
Fabiano Fidêncio	cbcdd999e4	Merge pull request #12957 from Apokleos/fix-sb-api runtime-rs: Fix sandbox-api lifecycle and CRI status handling	2026-05-23 09:26:14 +02:00
Alex Lyn	b5349f4d78	versions: bump containerd to 2.3 for sandbox API tests containerd 2.3 requires Go 1.26.3, but Kata still pins Go 1.25.10. Use Go 1.26.3 for the sandbox-api job so that make cri-integration can build containerd from source. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-05-22 10:46:16 +08:00
Alex Lyn	328fccfbbd	ci: Re-enable run-containerd-sandboxapi job The job was disabled because TestImageLoad was failing when using the shim sandboxer with runc due to a containerd bug (config.json not being written to the bundle directory). Now that check_daemon_setup uses podsandbox for the runc sanity check, the root cause of the failure is worked around on our side and the job can be re-enabled. Also update the runner to ubuntu-24.04. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-05-22 10:45:26 +08:00
Fabiano Fidêncio	9a0acc6c4c	kata-deploy: ship individual component tarballs; drop merged tarball Update the Dockerfile to copy each kata-static-<name>.tar.zst directly into the image alongside shim-components.json, replacing the old artifact-extractor stage that unpacked a single merged tarball. Update the publish-kata-deploy-payload and release CI workflows to download individual per-component artifacts instead of waiting for a merged tarball, and simplify kata-deploy-build-and-upload-payload.sh accordingly. The kata-deploy image build is no longer blocked on the merge step. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-05-20 20:52:36 +02:00
Fabiano Fidêncio	c87e327876	kata-deploy: split shim-v2 into shim-v2-go and shim-v2-rust Split the monolithic shim-v2 build target into separate shim-v2-go and shim-v2-rust targets in kata-deploy-binaries.sh, the local-build Makefile, and the four architecture CI workflows. The Go and Rust shims now each produce their own kata-static-<name>.tar.zst artifact, allowing downstream consumers to select only the shim variant they need. MEASURED_ROOTFS is set per-arch for the Rust job in CI. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-05-20 20:52:36 +02:00
Fabiano Fidêncio	2c1dec0c14	Merge pull request #13035 from stevenhorsman/docs-static-checks-cleanup ci: remove docs URL alive check workflow	2026-05-18 17:59:03 +02:00
Amulyam24	631dc72ceb	gha: move static checks to self hosted runners for ppc64le Move build checks to self hosted runners for Power. Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2026-05-13 11:07:52 +01:00
stevenhorsman	9398ab3433	ci: remove docs URL alive check workflow The docs URL alive check workflow has been disabled for months and has never passed since we moved to GHA, so remove the workflow and all associated code. Note: Although the static-checks.sh --doc code had a wider scope than the URL check, it wasn't being called anywhere else, so it was removed too. Signed-off-by: stevenhorsman <steven@uk.ibm.com> Assisted-by: IBM Bob	2026-05-13 09:12:00 +01:00
stevenhorsman	37e7bf0773	ci: correct environment variable syntax in stale issues workflow The stale issues workflow was using shell syntax ${AGE} instead of GitHub Actions syntax ${{ env.AGE }} for the days-before-issue-stale parameter. This prevented the workflow from correctly reading the calculated AGE value. Also added days-before-stale: -1 to disable default stale behavior and ensure only issue-specific settings apply. Signed-off-by: stevenhorsman <steven@uk.ibm.com> Assisted-By: IBM Bob Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-05-11 09:31:36 +01:00
Fabiano Fidêncio	19c194aa94	ci: Add runtime-rs GPU shims to NVIDIA GPU CI workflow Add qemu-nvidia-gpu-runtime-rs and qemu-nvidia-gpu-snp-runtime-rs to the NVIDIA GPU test matrix so CI covers the new runtime-rs shims. Introduce a `coco` boolean field in each matrix entry and use it for all CoCo-related conditionals (KBS, snapshotter, KBS deploy/cleanup steps). This replaces fragile name-string comparisons that were already broken for the runtime-rs variants: `nvidia-gpu (runtime-rs)` was incorrectly getting KBS steps, and `nvidia-gpu-snp (runtime-rs)` was not getting the right env vars. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-05-07 10:33:26 +02:00
Fabiano Fidêncio	acfb9f9762	Merge pull request #12954 from zvonkok/modular-makefile build: remove gha-adjust-to-use-prebuilt-components.sh	2026-05-07 10:32:32 +02:00
Greg Kurz	c18932b5ab	build-checks: Remove `make vendor` The `generate_vendor.sh` script already knows how to create a tarball with all the rust and go vendored code within the repo. It is used by the release workflow to provide vendored code to downstream consumers that might need it. There isn't any vendored code in the repo anymore. It thus doesn't seem useful to run `make vendor` in CI. Stop doing it. Signed-off-by: Greg Kurz <groug@kaod.org>	2026-05-06 09:49:50 +02:00
Greg Kurz	1c1945f997	ci: Add go mod tidy check to static checks Ensures go.mod and go.sum files are kept up-to-date on PRs that modify Go code, go modules, or the Go version in versions.yaml. The workflow can also be run directly from the GitHub UI, in order to check the tidiness of the target branch. Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> Signed-off-by: Greg Kurz <groug@kaod.org>	2026-05-06 09:31:57 +02:00
Fabiano Fidêncio	6033f25e0e	Merge pull request #12965 from kata-containers/dependabot/github_actions/actions/checkout-6.0.2 build(deps): bump actions/checkout from 4.2.2 to 6.0.2	2026-05-04 19:37:54 +02:00
Fabiano Fidêncio	a3d6829ed4	Merge pull request #12964 from kata-containers/dependabot/github_actions/editorconfig-checker/action-editorconfig-checker-2.2.0 build(deps): bump editorconfig-checker/action-editorconfig-checker from 2.1.0 to 2.2.0	2026-05-04 19:37:42 +02:00
Fabiano Fidêncio	7c61c55011	Merge pull request #12966 from kata-containers/dependabot/github_actions/streetsidesoftware/cspell-action-8.4.0 build(deps): bump streetsidesoftware/cspell-action from 8.3.0 to 8.4.0	2026-05-04 19:37:28 +02:00
dependabot[bot]	b2931c6759	build(deps): bump actions/checkout from 4.2.2 to 6.0.2 Bumps [actions/checkout](https://github.com/actions/checkout) from 4.2.2 to 6.0.2. - [Release notes](https://github.com/actions/checkout/releases) - [Changelog](https://github.com/actions/checkout/blob/main/CHANGELOG.md) - [Commits](https://github.com/actions/checkout/compare/v4.2.2...de0fac2e4500dabe0009e67214ff5f5447ce83dd) --- updated-dependencies: - dependency-name: actions/checkout dependency-version: 6.0.2 dependency-type: direct:production update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <support@github.com>	2026-05-04 17:05:59 +00:00
Fabiano Fidêncio	3d43259463	Merge pull request #12974 from fidencio/topic/ci-tdx-nightly-run-with-runtime-rs ci: tdx: Remove ITA key usage and run qemu-tdx-runtime-rs on nightly	2026-05-04 19:04:03 +02:00
Fabiano Fidêncio	8655d87892	ci: nvidia: Disable NVRC trace logging on nightly runs On nightly CI, run the NVIDIA GPU tests without setting nvrc.log=trace. This gives us end-to-end test coverage that more closely matches how users would actually run Kata Containers with NVIDIA GPUs, since trace logging is not enabled by default in production. NVRC trace logging remains enabled for PR runs, where the extra verbosity is useful for debugging failures. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-05-03 18:13:07 +02:00
Fabiano Fidêncio	51d5f2ea7b	ci: Run runtime-rs tests for TDX on nightly As we're in the process of stabilising runtime-rs for the coming 4.0.0 release, we had better start running as many tests as possible with it. The TDX runtime-rs job is gated to nightly runs only (pr-number == "nightly") since we only have a single TDX machine and cannot afford to run both qemu-tdx and qemu-tdx-runtime-rs on every PR. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-05-03 18:05:58 +02:00
Fabiano Fidêncio	8c3c7aa871	ci: Drop ITA_KEY usage from CI workflows The ITA_KEY secret was conditionally passed to TDX jobs for Intel Trust Authority attestation, but it is no longer needed. Remove it from all workflow files and the test helper export. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-05-03 18:05:51 +02:00
stevenhorsman	9715a7cca2	release: fix release workflow concurrency deadlock Architecture-specific release workflows were using the same concurrency group when called from release.yaml, causing GitHub Actions to detect a deadlock and cancel the builds. Fix by appending architecture suffix to each workflow's concurrency group, allowing parallel execution without conflicts. Assisted-by: IBM Bob Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-05-02 20:13:17 +01:00
dependabot[bot]	7a1fa7842d	build(deps): bump streetsidesoftware/cspell-action from 8.3.0 to 8.4.0 Bumps [streetsidesoftware/cspell-action](https://github.com/streetsidesoftware/cspell-action) from 8.3.0 to 8.4.0. - [Release notes](https://github.com/streetsidesoftware/cspell-action/releases) - [Changelog](https://github.com/streetsidesoftware/cspell-action/blob/main/CHANGELOG.md) - [Commits](`9cd41bb518...de2a73e963`) --- updated-dependencies: - dependency-name: streetsidesoftware/cspell-action dependency-version: 8.4.0 dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com>	2026-05-01 18:06:22 +00:00
dependabot[bot]	883edd798f	build(deps): bump editorconfig-checker/action-editorconfig-checker Bumps [editorconfig-checker/action-editorconfig-checker](https://github.com/editorconfig-checker/action-editorconfig-checker) from 2.1.0 to 2.2.0. - [Release notes](https://github.com/editorconfig-checker/action-editorconfig-checker/releases) - [Commits](`4b6cd6190d...840e866d93`) --- updated-dependencies: - dependency-name: editorconfig-checker/action-editorconfig-checker dependency-version: 2.2.0 dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com>	2026-05-01 18:05:35 +00:00
Zvonko Kaiser	35dfb11fe4	build: replace prebuilt-components sed hack with DEPS= Mutating the Makefile in-place to strip prereqs was fragile and limited to one target per invocation. DEPS= skips deps declaratively and propagates through recursive make, so multi-target builds can opt out in one shot. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2026-04-30 00:48:46 +00:00
Fabiano Fidêncio	ef15324b04	Revert "ci: Only run arm64 k8s tests on nightly builds" This reverts commit `c5b159c556`, as now we have 3 runners plugged into the CI. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-04-29 07:38:12 +02:00
Steve Horsman	2435970fe8	Merge pull request #12933 from fidencio/topic/runtime-rs-decouple-dragonball-from-non-x86-checks runtime-rs: drop misleading unsupported arches gating	2026-04-28 18:36:16 +01:00
Aurélien Bombo	e4fbddb91a	ci: rename cloud-hypervisor to clh-runtime-rs This aligns on qemu-runtime-rs and makes more sense. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2026-04-28 10:58:01 -05:00
Fabiano Fidêncio	8ab97a60f3	ci: install protobuf-compiler for runtime-rs build-checks The `runtime-rs` component of `build-checks.yaml` declared `rust` as its only dependency, but the runtime-rs build pulls in `prost-build v0.8.0` (via `ttrpc-codegen` -> `containerd-shim-protos`, and via the in-tree `hypervisor` crate), and `prost-build`'s build script needs a `protoc` binary at compile time. This worked on x86_64 and aarch64 only because `prost-build v0.8.0` ships bundled `protoc` binaries for those targets. On s390x (and ppc64le, when the matrix gets there) there is no bundled binary, so the build fails with: Failed to find the protoc binary. The PROTOC environment variable is not set, there is no bundled protoc for this platform, and protoc is not in the PATH The reason this didn't show up in CI before is that `make test` and `make check` for runtime-rs were wrapped in arch-specific `ifeq` blocks in `src/runtime-rs/Makefile` that turned them into no-ops on s390x/ppc64le/riscv64gc. The previous commit dropped those gates so `make {test,check}` now actually run on every arch, which exposes this latent CI gap. Match what `agent`, `libs`, `agent-ctl`, `kata-ctl` and `genpolicy` already declare and add `protobuf-compiler` to runtime-rs's needs. The existing `Install protobuf-compiler` step in this workflow already runs `sudo apt-get -y install protobuf-compiler`, which the s390x/ppc64le runners support (those other components have been using it on s390x for some time). Made-with: Cursor Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Made-with: Cursor	2026-04-28 16:25:31 +02:00
stevenhorsman	09ac10e8df	workflows: Remove workflow concurrency It seems like some of our workflow concurrency rules are clashing with the job-level ones for some reason and cancelling jobs, so remove these problematic workflow rules. Co-authored-by: Fabiano Fidêncio <fabiano@fidencio.org> Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-04-28 14:56:07 +01:00
stevenhorsman	d5411e00f6	workflows: Fix version on pinned action docker/build-push-action@bcafcacb16 seemed to be given the wrong version in the comment, so update this to be correct Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-04-28 13:10:36 +01:00
stevenhorsman	063a13ccd0	workflows: Bump zizmor to 1.22 Bump zizmor to the 1.22 version to pick up new rule updates. Later bumps to follow once this has proven stable Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-04-28 13:10:36 +01:00

1160 Commits