kata-containers

mirror of https://github.com/kata-containers/kata-containers.git synced 2025-08-17 15:38:00 +00:00

Author	SHA1	Message	Date
Pavel Mores	eb47f15b10	runtime-rs: support ProtectionDevice in qemu-rs As an example, or a test case, we add some implementation of SEV/SEV-SNP. Within the QEMU command line generation, the 'Cpu' object is extended to accomodate the EPYC-v4 CPU type for SEV-SNP. 'Machine' is extended to support the confidential-guest-support parameter which is useful for other TEEs as well. Support for emitting the -bios command line switch is added as that seems to be the preferred way of supplying a path to firmware for SEV/SEV-SNP. Support for emitting '-object sev-guest' and '-object sev-snp-guest' with an appropriate set of parameters is added as well. Signed-off-by: Pavel Mores <pmores@redhat.com>	2025-02-26 09:11:35 +01:00
Pavel Mores	87deb68ab7	runtime-rs: add implementation of ProtectionDevice ProtectionDevice is a new device type whose implementation structure matches the one of other devices in the device module. It is split into an inner "config" part which contains device details (we implement SEV/SEV-SNP for now) and the customary outer "device" part which just adds a device instance ID and the customary Device trait implementation. Signed-off-by: Pavel Mores <pmores@redhat.com>	2025-02-26 09:11:35 +01:00
Pavel Mores	a3f973db3b	runtime-rs: extend SEV/SEV-SNP detection by including a details struct This matches the existing TDX handling where additional details are retrieved right away after TDX is detected. Note that the actual details (cbitpos) acquisition is NOT included at this time. This change might seem bigger than it is. The change itself is just in protection.rs, the rest are corresponding adjustments. Signed-off-by: Pavel Mores <pmores@redhat.com>	2025-02-26 09:11:35 +01:00
Pavel Mores	c549d12da7	runtime-rs: parse SEV-SNP related config file settings The 'sev_snp_guest' default value of 'false' is in compliance with the golang runtime behaviour. Signed-off-by: Pavel Mores <pmores@redhat.com>	2025-02-26 09:11:35 +01:00
Markus Rudy	d58f38dfab	genpolicy: add get_process_fields to CronJob This function was accidentally left unimplemented for CronJob, resulting in runAsUser not being supported there. Fixes: #10653 Signed-off-by: Markus Rudy <mr@edgeless.systems>	2025-02-26 09:00:04 +01:00
Ruoqing He	ec020399b9	ci: Enable partial components build-check on riscv Since we have RISC-V builders available now, let's start with `agent-ctl`, `trace-forwarder` and `genpolicy` components to run build-checks on these `riscv-builder`s, and gradually add the rest components when they are ready, to catch up with other architectures eventually. This workflow could be mannually triggered, `riscv-builder` will be the default instance when that is the case. Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>	2025-02-26 15:38:39 +08:00
Markus Rudy	1f6833bd0d	runtime: add cause to CDI errors Adding devices by CDI annotation can fail for a variety of reasons. If that happens, it's helpful to know the root cause of the issue (CDI spec missing, malformatted, requested device not present, etc.). This commit adds the root cause of the CDI device addition to the errors reported back to the caller. Since this error is bubbled up all the way back to the shimv2 task.Create handler, it will be visible in Kubernetes logs and enable fixing the root cause. Signed-off-by: Markus Rudy <mr@edgeless.systems>	2025-02-26 08:36:15 +01:00
Paul Meyer	9981cdd8a8	genpolicy: fail when layer can't be processed Currently, if a layer can't be processed, we log this a warning and continue execution, finally exit with a zero exit code. This can lead to the generation of invalid policies. One reason a layer might not be processed is that the pull of that layer fails. We need all layers to be processed successfully to generate a valid policy, as otherwise we will miss the verity hash for that layer or we might miss the USER information from a passwd stored in that layer. This will cause our VM to not get through the agent's policy validation. Returning an error instead of printing a warning will cause genpolicy to fail in such cases. Signed-off-by: Paul Meyer <katexochen0@gmail.com>	2025-02-26 08:30:59 +01:00
Fabiano Fidêncio	b3b570e4c4	agent: Fix non-guest-pull build As the guest-pull is a very Confidental Containers specific feature, let's make sure we, at least, don't break folks who decide to build Kata Containers' agent without having this feature enabled (for instance, for the sake of the agent size). Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2025-02-25 21:48:41 +01:00
Zvonko Kaiser	04c56a0aaf	Merge pull request #10931 from zvonkok/iommufd-fix gpu: IOMMUFD fix	2025-02-25 12:50:24 -05:00
Ruoqing He	ed50e31625	build: Reorganize target selection Architectures here with `musl` available are minority, which is more suitable for enumeration. With this change, we are implicitly choosing gnu target for `ppc64le`, `riscv64` and `s390x`. Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>	2025-02-26 00:56:54 +08:00
Ruoqing He	562911e170	build: Add riscv mapping for common.bash While installing Rust and Golang in our CI workflow, `arch_to_golang` and `arch_to_rust` are needed for inferring the correct arch string for riscv64 architecture. Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>	2025-02-26 00:56:54 +08:00
Ruoqing He	62e2473c32	build: Add riscv64 to utils.mk Since `ARCH` for `riscv64` is `riscv64gc`, we'll need to override it in `utils.mk`, and forcing `gnu` target for `riscv64` because `musl` target is not yet made ready. Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>	2025-02-26 00:56:54 +08:00
Zvonko Kaiser	804e5cd332	gpu: IOMMUFD provide proper ID We need a proper ID otherwise QEMU sometimes fails with invalid ID. Use the same pattern as with the old VFIO implementation. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-02-25 16:24:17 +00:00
stevenhorsman	c97e9e1592	workflows: Add codeql config I noticed that CodeQl using the default config hasn't scanned since May 2024, so figured it would be worth trying an explicit configuration to see if that gets better results. It's mostly the template, but updated to be more relevant: - Only scan PRs and pushes to the `main` branch - Set a pinned runner version rather than latest (with mac support) - Edit the list of languages to be scanned to be more relevant for kata-containers Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-02-25 15:05:43 +00:00
Fabiano Fidêncio	e09ae2cc0b	Merge pull request #10921 from RuoqingHe/drop-redundant-override build: Drop redundant ARCH override	2025-02-25 14:54:36 +01:00
Fabiano Fidêncio	c01e7f1ed5	Merge pull request #10932 from kata-containers/topic/consolidate-publish-workflow workflows: Refactor publish workflows	2025-02-25 14:50:40 +01:00
stevenhorsman	5000fca664	workflows: Add build-checks to manual CI Currently the ci-on-push workflow that runs on PRs runs two jobs: gatekeeper-skipper.yaml and ci.yaml. In order to test things like for the error ``` too many workflows are referenced, total: 21, limit: 20 ``` on topic branches, we need ci-devel.yaml to have an extra workflow to match ci-on-push, so add the build-checks as this is helpful to run on topic branches anyway. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-02-25 11:38:49 +00:00
stevenhorsman	23434791f2	workflows: Refactor publish workflows Replace the four different publish workflows with a single one that take input parameters of the arch and runner, so reduce the amount of duplicated code and try and avoid the ``` too many workflows are referenced, total: 21, limit: 20 ``` error	2025-02-25 10:49:09 +00:00
Fabiano Fidêncio	e3eb9e4f28	Merge pull request #10929 from kata-containers/topic/enable-arm-tests arm: ci: k8s: Enable CI	2025-02-24 19:34:28 +01:00
Fabiano Fidêncio	a6186b6244	ci: k8s: arm: Skip "Check the number vcpus are ..." test See https://github.com/kata-containers/kata-containers/issues/10928 Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2025-02-24 18:43:24 +01:00
Fabiano Fidêncio	1798804c32	ci: k8s: arm: Skip "Pod quota" test See https://github.com/kata-containers/kata-containers/issues/10927 Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2025-02-24 18:43:24 +01:00
Fabiano Fidêncio	053827cacc	ci: k8s: arm: Skip "Running within memory constraints" test See https://github.com/kata-containers/kata-containers/issues/10926 Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2025-02-24 18:43:24 +01:00
Fabiano Fidêncio	7bd444fa52	ci: Run k8s tests on arm64 Let's take advantege of the current arm64 runners, and make sure we have those tests running there as well. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org> Signed-off-by: Kevin Zhao <kevin.zhao@linaro.org>	2025-02-24 18:43:20 +01:00
Aurélien Bombo	16aa6b9b4b	Merge pull request #10911 from kata-containers/sprt/fix-cgroup-race agent: Fix race condition with cgroup watchers	2025-02-24 10:28:58 -06:00
Ruoqing He	265a751837	build: Drop redundant ARCH override There are many `override ARCH = powerpc64le` after where `utils.mk` is included, which are redundant. Drop those redundant `override`s. Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>	2025-02-24 22:04:28 +08:00
Fabiano Fidêncio	aa30f9ab1f	versions: Use jammy for x86_64 confidential initrd Set confidential initrd to use jammy rootfs Signed-off-by: Ryan Savino <ryan.savino@amd.com>	2025-02-22 23:57:16 -06:00
Aurélien Bombo	adca339c3c	ci: Fix GH throttling in run-nerdctl-tests Specify a GH API token to avoid the below throttling error: https://github.com/kata-containers/kata-containers/actions/runs/13450787436/job/37585810679?pr=10911#step:4:96 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2025-02-21 17:52:17 -06:00
Aurélien Bombo	111803e168	runtime: cgroups: Remove commented out code Doesn't seem like we're going to use this and it's confusing when inspecting code. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2025-02-21 17:52:17 -06:00
Aurélien Bombo	1f8c15fa48	Revert "tests: Skip k8s job test on qemu-coco-dev" This reverts commit `a8ccd9a2ac`. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2025-02-21 17:52:17 -06:00
Aurélien Bombo	7542dbffb8	Revert "tests: disable k8s-policy-job.bats on coco-dev" This reverts commit `47ce5dad9d`. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2025-02-21 17:52:17 -06:00
Aurélien Bombo	a1ed923740	agent: Fix race condition with cgroup watchers In the CI, test containers intermittently fail to start after creation, with an error like below (see #10872 for more details): # State: Terminated # Reason: StartError # Message: failed to start containerd task "afd43e77fae0815afbc7205eac78f94859e247968a6a4e8bcbb987690fcf10a6": No such file or directory (os error 2) I've observed this error to repro with the following containers, which have in common that they're all very short-lived by design (more tests might be affected): * k8s-job.bats * k8s-seccomp.bats * k8s-hostname.bats * k8s-policy-job.bats * k8s-policy-logs.bats Furthermore, appending a `; sleep 1` to the command line for those containers seemed to consistently get rid of the error. Investigating further, I've uncovered a race between the end of the container process and the setting up of the cgroup watchers (to report OOMs). If the process terminates first, the agent will try to watch cgroup paths that don't exist anymore, and it will fail to start the container. The added error context in notifier.rs confirms that the error comes from the missing cgroup: https://github.com/kata-containers/kata-containers/actions/runs/13450787436/job/37585901466#step:17:6536 The fix simply consists in creating the watchers before we start the container but still after we create it -- this is non-blocking, and IIUC the cgroup is guaranteed to already be present then. Fixes: #10872 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2025-02-21 17:52:11 -06:00
Fabiano Fidêncio	aaa7008cad	versions: Add a comment about "jammy" being 22.04 I missed that when I added the other comments, so, for the sake of consistency, let's just add it there as well. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2025-02-21 16:02:38 -06:00
Fabiano Fidêncio	a7d33cc0cb	build: Ensure MEASURED_ROOTFS is only used for images We never ever tested MEASURED_ROOTFS with initrd, and I sincerely do not know why we've been setting that to "yes" in the initrd cases. Let's drop it, as it may be causing issues with the jobs that rely on the rootfs-initrd-confidential. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2025-02-21 15:32:20 -06:00
Dan Mihai	b90c537f79	Merge pull request #10881 from mythi/build-fixes minor build fixes	2025-02-21 09:54:55 -08:00
Jeremi Piotrowski	304978ad47	Merge pull request #10784 from arvindskumar99/disable_nesting_checks Disabling Nesting Check for SNP upstream	2025-02-21 12:39:18 +01:00
Xuewei Niu	cdb29a4fd1	Merge pull request #10780 from RuoqingHe/setup-dragonball-workspace dragonball: Appease clippy, setup workspace and centralize RustVMM	2025-02-21 14:04:19 +08:00
Hyounggyu Choi	58647bb654	Merge pull request #10743 from zvonkok/iommufd-gpu-fix IOMMUFD GPU enhancement	2025-02-20 23:43:00 +01:00
Zvonko Kaiser	7cca2c4925	gpu: Use a dedicated VFIO group vs iommufd entry We do not want to abuse the sysfsentry lets use a dedicated devfsentry. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-02-20 18:27:52 +00:00
Zvonko Kaiser	9add633258	qemu: Add command line for IOMMUFD For each IOMMUFD device create an object and assign it to the device, we need additional information that is populated now correctly to decide if we run the old VFIO or new VFIO backend. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-02-20 18:27:50 +00:00
Fabiano Fidêncio	19a7f27736	Merge pull request #10906 from BbolroC/remove-measured-rootfs-check-for-shimv2-on-s390x shim-v2: Remove MEASURED_ROOTFS assignment for s390x	2025-02-20 15:53:50 +01:00
arvindskumar99	c0a3ecb27b	config: Disabling nesting check for SNP Adding disable_nesting_checks to accomodate SNP on Azure Signed-off-by: arvindskumar99 <arvinkum@amd.com>	2025-02-20 12:24:08 +01:00
Hyounggyu Choi	1a9dabd433	shim-v2: Remove MEASURED_ROOTFS assignment for s390x As a follow-up for #10904, we do not need to set MEASURED_ROOTFS to no on s390x explicitly. The GHA workflow already exports this variable. This commit removes the redundant assignment. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2025-02-20 10:43:36 +01:00
Greg Kurz	f51d84b466	Merge pull request #10904 from BbolroC/turn-off-measured-rootfs-s390x-gha-workflows GHA: Turn off MEASURED_ROOTFS in build-kata-static-tarball-s390x	2025-02-20 10:24:23 +01:00
Aurélien Bombo	601c403603	Merge pull request #10818 from burgerdev/plumbing agent: clear log pipes if denied by policy	2025-02-19 16:28:58 -06:00
Aurélien Bombo	cb3467535c	tests: Add policy test for ReadStreamRequest This test verifies that, when ReadStreamRequest is blocked by the policy, the logs are empty and the container does not deadlock. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2025-02-19 14:03:41 -06:00
Hyounggyu Choi	ca40462a1c	Merge pull request #10903 from BbolroC/fixes-for-cri-containerd-on-ubuntu24 tests: Support systemd unit files in /usr/lib as well as /lib	2025-02-19 19:45:55 +01:00
Hyounggyu Choi	d973d41efb	GHA: Turn off MEASURED_ROOTFS in build-kata-static-tarball-s390x This is the first attempt to remove the following code: ``` if [ "${ARCH}" == "s390x" ]; then export MEASURED_ROOTFS=no fi ``` from install_shimv2() in kata-deploy-binaries.sh. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2025-02-19 18:19:19 +01:00
Zvonko Kaiser	238db32126	Merge pull request #10868 from zvonkok/qemu-tdx-experimental-workflow QEMU TDX experimental workflow	2025-02-19 10:09:27 -05:00
Zvonko Kaiser	f0eef73a89	gpu: Add no_patches.txt for TDX flavour As alwasy if we do not have any patches create the no_patches.txt for the specific tag gpu_tdx_... Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-02-19 14:59:04 +00:00

... 5 6 7 8 9 ...

15709 Commits