kata-containers

mirror of https://github.com/kata-containers/kata-containers.git synced 2026-04-09 13:32:08 +00:00

Author	SHA1	Message	Date
Fabiano Fidêncio	461907918d	kata-deploy: pin nydus-snapshotter via versions.yaml Resolve externals.nydus-snapshotter version and url in the Docker image build with yq from the repo-root versions.yaml instead of Dockerfile ARG defaults. Drop the redundant workflow that only enforced parity between those two sources. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-04-07 10:07:06 +08:00
Fabiano Fidêncio	ccfdf5e11b	Merge pull request #12754 from llink5/fix/docker26-networking-9340 runtime: fix Docker 26+ networking by rescanning after Start	2026-04-03 13:15:38 +02:00
Alex Lyn	4a1c2b6620	Merge pull request #12309 from kata-containers/stale-issues-by-date workflows: Create workflow to stale issues based on date	2026-04-03 09:31:34 +08:00
llink5	f7878cc385	runtime: fix Docker 26+ networking by rescanning after Start Docker 26+ configures container networking (veth pair, IP addresses, routes) after task creation rather than before. Kata's endpoint scan runs during CreateSandbox, before the interfaces exist, resulting in VMs starting without network connectivity (no -netdev passed to QEMU). Add RescanNetwork() which runs asynchronously after the Start RPC. It polls the network namespace until Docker's interfaces appear, then hotplugs them to QEMU and informs the guest agent to configure them inside the VM. Additional fixes: - mountinfo parser: find fs type dynamically instead of hardcoded field index, fixing parsing with optional mount tags (shared:, master:) - IsDockerContainer: check CreateRuntime hooks for Docker 26+ - DockerNetnsPath: extract netns path from libnetwork-setkey hook args with path traversal protection - detectHypervisorNetns: verify PID ownership via /proc/pid/cmdline to guard against PID recycling - startVM guard: rescan when len(endpoints)==0 after VM start Fixes: #9340 Signed-off-by: llink5 <llink5@users.noreply.github.com>	2026-04-02 21:23:16 +02:00
Steve Horsman	58101a2166	Merge pull request #12656 from stevenhorsman/actions/checkout-bump workflows: Update actions/checkout version	2026-04-01 17:34:39 +01:00
Aurélien Bombo	78289d19f7	gha: Pin actionlint version Pin to the latest released version as a security measure. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2026-03-31 10:51:17 -05:00
Aurélien Bombo	3122fa651e	gha: Avoid noisy deployment logs in PRs GitHub recently announced that developers can now use environments without auto-deployment, which allows us to avoid the noisy deployment logs in our PRs: https://github.blog/changelog/2026-03-19-github-actions-late-march-2026-updates/#github-actions-now-allows-developers-to-use-environments-without-auto-deployment Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2026-03-31 10:51:13 -05:00
stevenhorsman	99eaa8fcb1	workflows: Create workflow to stale issues based on date The standard stale/action is intended to be run regularly with a date offset, but we want to have one we can run against a specific date in order to run the stale bot against issues created since a particular release milestone, so calculate the offset in one step and use it in the next. At the moment we want to run this to stale issues before 9th October 2022 when Kata 3.0 was release, so default to this. Note the stale action only processes a few issues at a time to avoid rate limiting, so why we want a cron job to it can get through the backlog, but also to stale/unstale issues that are commented on. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-03-31 15:57:37 +01:00
stevenhorsman	b3179bdd8e	workflows: Update actions/checkout version Update the action to resolve the following warning in GHA: > Node.js 20 actions are deprecated. The following actions are running > on Node.js 20 and may not work as expected: > actions/checkout@11bd71901b. > Actions will be forced to run with Node.js 24 by default starting June 2nd, 2026. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-03-30 10:45:28 +01:00
Fabiano Fidêncio	514a2b1a7c	Merge pull request #12264 from fidencio/topic/nvidia-gpu-cc-use-nydus-snapshotter nvidia: cc: Use nydus-snapshotter	2026-03-23 12:50:15 +01:00
Fabiano Fidêncio	864f181faf	Merge pull request #12694 from manuelh-dev/mahuber/nv-test-timeout tests: nvidia: Increase run test timeout	2026-03-23 09:13:20 +01:00
Alex Lyn	d2c2ec6e23	Merge pull request #12633 from LandonTClipp/docs_materialx docs: Move to mkdocs-material, port Helm to docs site	2026-03-23 09:29:25 +08:00
Fabiano Fidêncio	740d380b8e	tests: nvidia: cc: Use nydus-snapshotter So we can test what we just changed in the config files. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-03-22 10:10:34 +01:00
Agam Dua	91d6c39f06	kernel: Fix debug build and add debug symbols to installation Fixed a bug with the debug kernel build where common/ was repeated after the common path variable, resulting in the debug confs never being picked up. This exposed a subsequent bug where the debug conf was included in other builds, this is also fixed by creating a separate directory for debug confs with one file at the moment, debug.conf that contains debug configurations and bpf specific configs. To enable kernel builds (specifically for bpf) the dwarves package was added to the kernel dockerfile for the pahole package. Signed-off-by: Agam Dua <agam_dua@apple.com>	2026-03-20 14:50:23 -07:00
Agam Dua	5ab0744c25	ci: Add pipeline for building and distributing the debug kernel Add the debug kernel to the kata tarball alongside the other kernels. Also update the kernel README documentation to describe the new debug kernel build process. Signed-off-by: Agam Dua <agam_dua@apple.com>	2026-03-20 14:50:23 -07:00
LandonTClipp	795869152d	docs: Move to mkdocs-material, port Helm to docs site This supersedes https://github.com/kata-containers/kata-containers/pull/12622. I replaced Zensical with mkdocs-materialx. Materialx is a fork of mkdocs-material created after mkdocs-material was put into maintenance mode. We'll use this platform until Zensical is more feature complete. Added a few of the existing docs into the site to make a more user-friendly flow. Signed-off-by: LandonTClipp <11232769+LandonTClipp@users.noreply.github.com>	2026-03-20 14:51:39 -05:00
Manuel Huber	8903b12d34	tests: nvidia: Increase run test timeout Increase the timeout as a few new features and tests are going to be onboarded for the NVIDIA GPU CI. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-03-20 11:12:52 -07:00
Manuel Huber	ae59cf26a0	kata-deploy: Check kata-tarball size limits For kata tarballs we eventually release to GitHub, check their size against the GitHub size limit. With this, we fail in case of an ongoing release process in 'CI \| Publish Kata Containers payload' instead of only later on in the 'Release Kata Containers' action, and we fail during PR builds, avoiding this situation at all. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-03-20 10:40:55 -07:00
stevenhorsman	e62df07b6a	static-checks: Delete kata-spell-check The old hunspell based spell-check was causing contributors challenges and proving a barrier to doc updates. We've replaced it with a cspell based-solution, so clean up the old approach. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-03-19 10:22:54 +00:00
stevenhorsman	c2cedd7c02	workflows: Add spellcheck workflow Add a separate spellcheck workflow, so we can replace the complex hunspell approach embedded in static-checks Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-03-19 10:22:54 +00:00
Manuel Huber	660e3bb653	gpu: Obsolete the NVIDIA initrd build As the NVIDIA stack has shifted to using an image for both the confidential and non-confidential variants, we retire the initrd build. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-03-16 21:29:58 -04:00
Aurélien Bombo	f8e234c6f9	Merge pull request #12650 from kata-containers/sprt/remove-csi ci: Stop building/deploying CSI driver	2026-03-16 16:53:02 -05:00
stevenhorsman	064a960aaa	workflows: Bump OSV scanner Bump to the latest version to pick up bug fixes Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-03-12 07:00:11 +00:00
Aurélien Bombo	dd2c4c0db3	Revert "coco: ci: Add no-op steps to deploy CSI driver" This reverts commit `5e4990bcf5`. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2026-03-11 12:55:23 -05:00
Aurélien Bombo	d598e0baf1	Revert "ci: Implement build step for CSI driver" This partially reverts commit `fb87bf221f`. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2026-03-11 12:55:23 -05:00
Aurélien Bombo	aae54f704c	ci: Stop deploying the CSI driver The design moved away from CSI driver so stop deploying that. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2026-03-09 14:52:17 -05:00
Fabiano Fidêncio	4e024bfb43	Revert "tests: Skip testing k3s/rke2 with nydus snapshotter" This reverts commit `ab25592533`, as now we're deploying k3s/rke2 in a way that we properly test them. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-03-04 11:26:31 +01:00
Fabiano Fidêncio	b0345d50e8	build: kernel: Do not expect a modules tarball for vanilla kernel When I added this I had in mind the period that we still relied on the SEV module being generated, which we don't do for quite a long time. This wrong assumption caused the cache to ALWAYS fail, increasing our build time considerably for no reason. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-03-03 20:14:42 +01:00
Fabiano Fidêncio	ab25592533	tests: Skip testing k3s/rke2 with nydus snapshotter We depend on a k3s commit so we can properly test it, or we need to change our CI quite a bit to deploy a full template with that imports in. For now, let's just skip the testing in k3s/rke2 and we'll address it in a different PR. ref: `b51167a996` Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-03-03 12:55:10 +01:00
Fabiano Fidêncio	fa3c3eb2ce	ci: Add autogenerated policy tests on k0s, k3s, rke2 and microk8s These tests run only on nightly and when triggering the dev CI manually. They cover both nydus snapshotter with guest-pull and experimental-force-guest-pull, using qemu-coco-dev and qemu-coco-dev-runtime-rs, and are included in the run-kata-coco-tests workflow behind the extensive-matrix-autogenerated-policy flag. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-03-03 12:55:10 +01:00
Amulyam24	0754a17fed	gha: pass the arch for setup-go on ppc64le By default, setup-go installs ppc64 binary instead of ppc64le, resulting in an exec format error. Pass the arch explicitly to fix this. Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2026-03-03 16:41:10 +05:30
Steve Horsman	501d8d1916	Merge pull request #12596 from kata-containers/remove-install_go workflow \| tests: Remove install go	2026-03-02 12:36:58 +00:00
Steve Horsman	2695007ef8	Merge pull request #12584 from stevenhorsman/switch-actionlint-workflow workflow: Update actionlint workflows	2026-02-27 13:03:58 +00:00
stevenhorsman	b71bb47e21	workflow: Use setup-go to install go Rather than having our own script, just use the github action to install go when needed. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-02-27 12:42:43 +00:00
Steve Horsman	3442fc7d07	Merge pull request #12477 from kata-containers/workflow-improvements workflow: Recommended improvements	2026-02-27 11:57:22 +00:00
stevenhorsman	308442e887	workflow: Update actionlint workflows The actionlint gh extension is outdated and the wrapping seems unnecessary when there is a github action that seems to be maintained, so let's update to use that Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-02-26 11:52:19 +00:00
stevenhorsman	856ba08c71	workflows: Swap our go install for setup-go Unfortunately, due to golang/go#75031, there is an issue that results in `go: no such tool "covdata"` with a automatically installed 1.25 toolchain, so the approach to skip the install_go.sh script (which causes double install problems) didn't work. Try the alternative approach of using setup-go action, which should do a more comprehensive job Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-02-25 13:46:40 +00:00
stevenhorsman	b187983f84	workflows: build_checks: skip the go install if installed Some of our static checks are hitting issues with duplicate go versions installed. Given that we in go.mod we set the version to match our required toolchain, if go is already installed we can let go handle the toolchain version management instead of installing a second version Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-02-24 14:33:04 +00:00
Fabiano Fidêncio	a0d954cf7c	tests: Enable auto-generated policies for experimental_force_guest_pull We want to run with auto-generated policies when using experimental_force_guest_pull. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-02-23 22:15:18 +01:00
Fabiano Fidêncio	1523c48a2b	tests: k8s: Align coco / erofs job declaration Later on we may even think about merging those, but for now let's at least make sure the envs used are the same / declared in a similar place for each job. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-02-21 08:44:47 +01:00
Fabiano Fidêncio	1b9b53248e	tests: k8s: coco: rely more on free runners Run all CoCo non-TEE variants in a single job on the free runner with an explicit environment matrix (vmm, snapshotter, pull_type, kbs, containerd_version). Here we're testing CoCo only with the "active" version of containerd. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Co-authored-by: Cursor <cursoragent@cursor.com>	2026-02-21 08:44:47 +01:00
Fabiano Fidêncio	1fa3475e36	tests: k8s: rely more on free runners We were running most of the k8s integration tests on AKS. The ones that don't actually depend on AKS's environment now run on normal ubuntu-24.04 GitHub runners instead: we bring up a kubeadm cluster there, test with both containerd lts and active, and skip attestation tests since those runtimes don't need them. AKS is left only for the jobs that do depend on it. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Co-authored-by: Cursor <cursoragent@cursor.com>	2026-02-21 08:44:47 +01:00
Fabiano Fidêncio	f4dcb66a3c	ci: add workflow to push ORAS tarball cache Add push-oras-tarball-cache workflow that runs on push to main when versions.yaml changes (and on workflow_dispatch). It populates the ghcr.io ORAS cache with gperf and busybox tarballs from versions.yaml. Remove the push_to_cache call from download-with-oras-cache.sh since it was never triggered in CI. Cache population is now done solely by the new workflow and by populate-oras-tarball-cache.sh when run manually. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-02-13 12:57:48 +01:00
stevenhorsman	c5aadada98	workflows: Pin all actions Previously zizmor only mandated pinning of third-party actions, but has recommended rolling this out to all actions now. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-02-12 16:26:45 +00:00
stevenhorsman	cdd7c35c10	workflows: Remove unneeded strategy In a refactor we've remove the `matrix` section of this strategy, so the whole section isn't needed any more, so clean this up. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-02-12 16:26:45 +00:00
Fabiano Fidêncio	5c0269881e	tests: Make editorconfig-checker happy - Trim trailing whitespace and ensure final newline in non-vendor files - Add .editorconfig-checker.json excluding vendor dirs, .patch, .img, .dtb, .drawio, *.svg, and pkg/cloud-hypervisor/client so CI only checks project code - Leave generated and binary assets unchanged (excluded from checker) Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Co-authored-by: Cursor <cursoragent@cursor.com>	2026-02-10 21:58:28 +01:00
Manuel Huber	a6ca5c6628	ci: add editorconfig checker This adds a basic configuration for editorconfig checker. The supplied configuration checks against trailing whitespaces and issues with newlines. Example: \| tools/packaging/kernel/configs/fragments/x86_64/numa.conf: \| Wrong line endings or no final newline \| tools/packaging/release/generate_vendor.sh: \| 44: Trailing whitespace Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-02-09 15:03:26 -08:00
Fabiano Fidêncio	ab515712d4	kernel: Unify kernel and kernel-confidential Build a single kernel for both kernel and kernel-confidential on x86_64 and s390x. The kernel is built with TEE support (-x) on those arches only. This helps to simplilfy and to maintain the code, and having a single kernel was the original plan since forever. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-02-09 18:28:23 +01:00
Fabiano Fidêncio	c5b5433866	kernel: Unify nvidia-gpu and nvidia-gpu-confidential Build a single kernel for both nvidia-gpu and nvidia-gpu-confidential, simplifying and reducing code maintenance. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-02-09 18:28:23 +01:00
Manuel Huber	a786582d0b	rootfs: deprecate initramfs dm-verity mode Remove the initramfs folder, its build steps, and use the kernel based dm-verity enforcement for the handlers which used the initramfs mode. Also, remove the initramfs verity mode capability from the shims and their configs. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-02-05 23:04:35 +01:00

1 2 3 4 5 ...

1054 Commits