kata-containers

mirror of https://github.com/kata-containers/kata-containers.git synced 2026-01-25 14:44:16 +00:00

Author	SHA1	Message	Date
Fabiano Fidêncio	5b82b160e2	runtime-rs: Add arm64 QEMU support Add the necessary configuration and code changes to support QEMU on arm64 architecture in runtime-rs. Changes: - Set MACHINETYPE to "virt" for arm64 - Add machine accelerators "usb=off,gic-version=host" required for proper arm64 virtualization - Add arm64-specific kernel parameter "iommu.passthrough=0" - Guard vIOMMU (Intel IOMMU) to skip on arm64 since it's not supported These changes align runtime-rs with the Go runtime's arm64 QEMU support. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Signed-off-by: Kevin Zhao <kevin.zhao@linaro.org>	2026-01-23 19:48:31 +01:00
Steve Horsman	2cd76796bd	Merge pull request #12305 from stevenhorsman/fix-stalebot-permissions ci: Fix stalebot permissions	2026-01-22 10:02:43 +00:00
Hyounggyu Choi	bc131a84b9	GHA: Set timeout for kata-deploy and kbs cleanup It was observed that some kata-deploy cleanup steps could hang, causing the workflow to never finish properly. In these cases, a QEMU process was not cleaned up and kept printing debug logs to the journal. Over time, this maxed out the runner’s disk usage and caused the runner service to stop. Set timeouts for the relevant cleanup steps to avoid this. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2026-01-22 10:32:24 +01:00
stevenhorsman	19efeae12e	workflow: Fix stalebot permissions When looking into stale bot more for issues, I realised that our existing stale job would need permissions to work. Unfortunately the behaviour of the actions without these permissions is to log, but still finish as successful. This means it was hard to spot we had an issue. Add the required permissions to get this working again and improve the message Also add concurrency rule to make zizmor happy Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-01-21 17:28:59 +00:00
Fabiano Fidêncio	e0158869b1	tests: Add common bats test runner function Add run_bats_tests() function to common.bash that provides consistent test execution and reporting across all test suites (k8s, nvidia, kata-deploy). This removes duplicated test runner code from run_kubernetes_tests.sh, run_kubernetes_nv_tests.sh, and run-kata-deploy-tests.sh. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-01-20 12:31:55 +01:00
Fabiano Fidêncio	96e1fb4ca6	tools: Remove runk The runk tool hasn't been supported for a few years, with no maintainers since ManaSugi stopped being involved in the project and the CI was disabled in 2024. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-01-19 14:43:53 +01:00
Fabiano Fidêncio	d7aa793dde	Revert "ci: Run a nightly job using the kata-deploy rust" This reverts commit `6130d7330f`, as we're officially swithcing to the rust version of kata-deploy. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-01-19 14:07:49 +01:00
Amulyam24	859313d904	ci: move the job payload after push to an alternate runner for ppc64le To unlock the release, move the job to publish kata payload after push to an alternate runner(IBM owned) for ppc64le. Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2026-01-16 11:14:42 +05:30
Fabiano Fidêncio	4e99860fd2	workflows: nvidia: Adjust to kernel / roots build decouple We don't need to store the kernel headers anymore. We do need to store the kernel modules, instead. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-01-14 20:45:54 +01:00
LandonTClipp	94fde1356c	docs: Add Zensical Doc Site Generation This commit adds a Github workflow for building a Github Pages site for the markdown files in the docs/ directory. Zensical is a new markdown-based static site generation framework built by the creators of Material for Mkdocs. https://zensical.org/ This commit does not clean the doc structure, so site navigation is initially going to be messy. Signed-off-by: LandonTClipp <11232769+LandonTClipp@users.noreply.github.com>	2026-01-13 12:42:02 +01:00
Fabiano Fidêncio	9fec31f400	tools: kubectl: Add kubectl version as a tag This is a suggestion from Choi, so we can easily test with a specific kubectl version and also easily understand which kubectl version is being used in case of failure. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-01-12 15:48:44 +01:00
Fabiano Fidêncio	26dfcb627b	tools: Build kubectl image This image will be used by our helm charts to verify that a kata-containers deployment is correct. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-01-12 15:48:44 +01:00
Mikko Ylinen	e02e226431	packaging: build OVMF for Intel TDX again OVMF build for Intel TDX (aka "TDVF") was disabled in favor of Ubuntu/ CentOS pre-upstream releases of Intel TDX. See `4292c4c3b1`. It's time to re-enable the build and move runtime configurations to use it (the latter will be done in a later commit). This is a partial revert of `4292c4c3b` with the following changes: - Stop calling OVMF for Intel TDX "TDVF" and follow the naming distros use for TDX enabled build: OVMF.inteltdx.fd. - Single binary OVMF.inteltdx.fd is supported using -bios QEMU param. - Secure Boot infrastructure is disabled since Kata does not support it. Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com>	2026-01-08 10:21:47 +01:00
shwetha-s-poojary	1929ca8879	workflows: payload: do not remove AGENT_TOOLSDIRECTORY Remove line that deletes $AGENT_TOOLSDIRECTORY Signed-off-by: shwetha-s-poojary <shwetha.s-poojary@ibm.com>	2025-12-19 05:24:36 -08:00
Fabiano Fidêncio	5415cf4e0f	workflows: payload: Remove unneeded stuff from the runner Otherwise we may hit a `no space left on device` when building the rust kata-deploy binary. This happens mostly because of the muli-staging build used to generate a distroless final container. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-12-17 09:57:02 +01:00
Fabiano Fidêncio	6130d7330f	ci: Run a nightly job using the kata-deploy rust Let's shamelessly duplicate the nightly job to have at least nightly runs using the rust implementation of kata-deploy. The reason for doing that is to be pragmatic, as pragmatic as possible, and avoid switching away of the scripts before 3.24.0 release, while still testing both ways till the switch happens. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-12-17 09:57:02 +01:00
Fabiano Fidêncio	830d15d4c8	tests: Adapt to using kata-tools Instead of relying and the fully bloated kata tarball. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-12-16 12:55:07 +01:00
Fabiano Fidêncio	a2534e7bc8	kata-tools: Release as its own tarball We're only releasing those for amd64 as that's the only architecture we've been building the packages for. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-12-16 12:55:07 +01:00
Fabiano Fidêncio	6d2f393be4	build: Split tools build from the other artefacts build Let's ensure we can create a specific "tools" tarball, which will help those who only need to pull those either for testing or production usage. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-12-16 12:55:07 +01:00
Fabiano Fidêncio	ded6d1636f	kata-deploy: Remove deprecated features from 3.23.0 Let's remove the deprecated features that were marked for removal after Kata Containers 3.23.0: kata-deploy.sh: - Remove non-arch-specific variable fallbacks (SHIMS, DEFAULT_SHIM, SNAPSHOTTER_HANDLER_MAPPING, ALLOWED_HYPERVISOR_ANNOTATIONS, PULL_TYPE_MAPPING, EXPERIMENTAL_FORCE_GUEST_PULL). Each arch now has its own default value. - Remove CREATE_RUNTIMECLASSES and CREATE_DEFAULT_RUNTIMECLASS variables and associated functions (create_runtimeclasses, delete_runtimeclasses, adjust_shim_for_nfd). RuntimeClasses are now managed by Helm chart, not the daemonset script. - Unsupported architectures now fail with an error instead of falling back to non-arch-specific defaults. Helm chart: - Remove all deprecated env values (createRuntimeClasses, createDefaultRuntimeClass, debug, shims, shims_, defaultShim, defaultShim_, allowedHypervisorAnnotations, snapshotterHandlerMapping, snapshotterHandlerMapping_, agentHttpsProxy, agentNoProxy, pullTypeMapping, pullTypeMapping_, _experimentalSetupSnapshotter, _experimentalForceGuestPull, _experimentalForceGuestPull_*). - Remove backward compatibility code from _helpers.tpl that checked for legacy env values. - Remove legacy env.shims check from runtimeclasses.yaml. - Remove CREATE_RUNTIMECLASSES and CREATE_DEFAULT_RUNTIMECLASS env vars from kata-deploy.yaml and post-delete-job.yaml. - Update RBAC to only include runtimeclasses get/patch permissions (needed for NFD patching), removing create/delete/list/update/watch. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-12-13 16:32:00 +01:00
Fabiano Fidêncio	46c7d6c9f8	ci: arm64-non-k8s: temporarily skip the tests The runner is down for a few weeks. I may end up bringing in my personal runner, but I'm not confident I can easily do this before the holidays, thus I'm skipping the tests for now. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-12-11 12:14:32 +01:00
Fabiano Fidêncio	3db7b88eff	tests: remove containerd guest pull stability tests Remove the existing containerd guest pull stability tests workflow as we're going to rebuild all the VMs used for testing and introduce new, more focused stability tests for nydus-snapshotter. The new tests will be added soon, as part of another PR. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-12-08 16:29:11 +01:00
Manuel Huber	34efa83afc	tests: nvidia: cc: Add attestation test Add the attestation bats test case to the NVIDIA CI and provide a second pod manifest for the attestation test with a GPU. This will enable composite attestation in a subsequent step. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2025-12-05 11:48:55 +01:00
Manuel Huber	3427b5c00e	ci: nvidia: Install kata-artifacts In preparation for Kata agent security policy testing, installing Kata tools to provide genpolicy. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2025-12-01 17:59:19 +00:00
Steve Horsman	8534afb9e8	Merge pull request #12150 from stevenhorsman/add-gatekeeper-triggers ci: Add two extra gatekeeper triggers	2025-11-28 09:34:41 +00:00
Fabiano Fidêncio	776e08dbba	build: Add nvidia image rootfs builds So far we've only been building the initrd for the nvidia rootfs. However, we're also interested on having the image beind used for a few use-cases. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-11-27 22:46:07 +01:00
stevenhorsman	531311090c	ci: Add two extra gatekeeper triggers We hit a case that gatekeeper was failing due to thinking the WIP check had failed, but since it ran the PR had been edited to remove that from the title. We should listen to edits and unlabels of the PR to ensure that gatekeeper doesn't get outdated in situations like this. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-11-27 16:45:04 +00:00
Steve Horsman	c5ae8c4ba0	Merge pull request #12144 from BbolroC/use-runs-on-to-choose-runners GHA: Use `runs-on` only for choosing proper runners	2025-11-27 09:54:39 +00:00
Alex Lyn	df8315c865	Merge pull request #12130 from Apokleos/stability-rs tests: Enable stability tests for runtime-rs	2025-11-27 14:27:58 +08:00
Steve Horsman	aae483bf1d	Merge pull request #12096 from Amulyam24/enable-ibm-runners ci: re-enable IBM runners for ppc64le and s390x	2025-11-26 13:51:21 +00:00
Amulyam24	43a004444a	ci: re-enable IBM runners for ppc64le and s390x This PR re-enables the IBM runners for ppc64le/s390x build jobs and s390x static checks. Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2025-11-26 16:20:01 +05:30
Hyounggyu Choi	6f761149a7	GHA: Use `runs-on` only for choosing proper runners Fixes: #12123 `include` in #12069, introduced to choose a different runner based on component, leads to another set of redundant jobs where `matrix.command` is empty. This commit gets back to the `runs-on` solution, but makes the condition human-readable. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2025-11-26 11:35:30 +01:00
stevenhorsman	4c59cf1a5d	workflows: Add Report tests to all workflows In the CoCo tests jobs @wainersm create a report tests step that summarises the jobs, so they are easier to understand and get results for. This is very useful, so let's roll it out to all the bats tests. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-11-26 09:28:36 +00:00
Alex Lyn	7f4d856e38	tests: Enable nydus tests for qemu-runtime-rs We need enable nydus tests for qemu-runtime-rs, and this commit aims to do it. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2025-11-25 17:45:57 +08:00
Alex Lyn	23393d47f6	tests: Enable stability tests for qemu-runtime-rs on nontee Enable the stability tests for qemu-runtime-rs CoCo on non-TEE environments Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2025-11-25 16:18:37 +08:00
Alex Lyn	f1d971040d	tests: Enable run-nerdctl-tests for qemu-runtime-rs Enable nerdctl tests for qemu-runtime-rs Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2025-11-25 16:14:50 +08:00
Alex Lyn	c7842aed16	tests: Enable stability tests for runtime-rs As previous set without qemu-runtime-rs, we enable it in this commit. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2025-11-25 16:12:12 +08:00
Manuel Huber	6c6fc50aa5	tests: nvidia: cc: allow-all policy and init-data Add an allow-all policy for the CC GPU tests and ensure the init-data device is being created (hypervisor annotations). Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2025-11-21 09:24:15 +01:00
Steve Horsman	87b180383e	Merge pull request #11802 from kata-containers/dependabot/github_actions/oras-project/setup-oras-1.2.4 build(deps): bump oras-project/setup-oras from 1.2.2 to 1.2.4	2025-11-19 09:58:37 +00:00
Fabiano Fidêncio	5beb1af202	tests: Pass EXPERIMENTAL_FORCE_GUEST_PULL to the test Right now we have only been passing the env var to the deployment script, but we really need to pass it to the tests script as well. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-11-18 14:46:48 +01:00
Steve Horsman	1d0d066869	Merge pull request #12069 from Amulyam24/static-checks-ppc github: run agent checks for Power on ppc64le instead of ubuntu-24.04-ppc64le	2025-11-14 10:18:37 +00:00
stevenhorsman	79082171ca	workflows: Add Delete AKS cluster timeout When testing this branch, on several occasions the Delete AKS cluster step has hung for multiple hours, so add a timeout to prevent this. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-11-13 14:18:43 +00:00
stevenhorsman	0335012824	tests/k8s: Enable tests for qemu-coco-dev-runtime-rs Add the runtime class to the non-tee tests and enable it to run in the test code Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-11-13 14:18:43 +00:00
Amulyam24	b32b54c4af	github: do not run agent checks for Power on ubuntu-24.04-ppc64le The new environment of Power runners for agent checks is causing two test case failures w.r.to selinux and inode which needs further understanding and is mostly an issue due to environemnt change and not to do with the agent. Fall back to running agent checks on original ppc64le self hosted runners. Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2025-11-13 15:56:43 +05:30
stevenhorsman	ba56a2c372	workflows: Switch to ubuntu-22.04-arm runner As the arm 22.04 runner isn't working at the moment, let's test the 24.04 version to see if that is better. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-11-12 15:37:09 +00:00
Fabiano Fidêncio	a04cdbc40f	tests: Enforce qemu-coco-dev for experimental_force_guest_pull The fact that we were not explicitly setting the VMM was leading to us testing with the default runtime class (qemu). :-/ Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-11-12 16:07:05 +01:00
dependabot[bot]	c715d8648c	build(deps): bump oras-project/setup-oras from 1.2.2 to 1.2.4 Bumps [oras-project/setup-oras](https://github.com/oras-project/setup-oras) from 1.2.2 to 1.2.4. - [Release notes](https://github.com/oras-project/setup-oras/releases) - [Commits](`5c0b487ce3...22ce207df3`) --- updated-dependencies: - dependency-name: oras-project/setup-oras dependency-version: 1.2.4 dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com>	2025-11-12 09:45:27 +00:00
Fabiano Fidêncio	6d3c20bc45	riscv: Introduce its own nightly tests By doing this, the ones interested on RISC-V support can still have a ood visibility of its state, without the extra noise in our CI. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-11-12 09:46:17 +01:00
Fabiano Fidêncio	d82eb8d0f1	ci: Drop docker tests We have had those tests broken for months. It's time to get rid of those. NOTE that we could easily revert this commit and re-add those tests as soon as we find someone to maintain and be responsible for such integration. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-11-11 17:02:02 +01:00
Fabiano Fidêncio	464764c7e0	tests: nvidia: kbs: Ensure KBS_INGRESS=nodeport I've missed doing this doing the KBS deployment set up. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-11-10 13:01:30 +01:00

1 2 3 4 5 ...

1017 Commits