kata-containers

mirror of https://github.com/kata-containers/kata-containers.git synced 2026-03-18 02:32:26 +00:00

Author	SHA1	Message	Date
Aurélien Bombo	e17f96251d	runtime{,-rs}/clh: Disable virtio-pmem This disables virtio-pmem support for Cloud Hypervisor by changing Kata config defaults and removing the relevant code paths. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2026-02-18 11:47:53 -06:00
Markus Rudy	8365afa336	qemu: log exit code after failure When qemu exits prematurely, we usually see a message like msg="Cannot start VM" error="exiting QMP loop, command cancelled" This is an indirect hint, caused by the QMP server shutting down. It takes experience to understand what it even means, and it still does not show what's actually the problem. With this commit, we're taking the error return from the qemu subprocess and surface it in the logs, if it's not nil. This means we automatically capture any non-zero exit codes in the logs. Signed-off-by: Markus Rudy <mr@edgeless.systems>	2026-02-17 21:03:13 +01:00
Aurélien Bombo	981f693a88	Merge pull request #11140 from balintTobik/hyperv_warning runtime: refactor hypervisor devices cgroup creation	2026-02-13 15:16:09 -06:00
stevenhorsman	55a89f6836	runtime: doc: Remove usage of golang.org/x/net/context This package is deprecated and we aren't using it any more Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-02-13 17:55:23 +01:00
Balint Tobik	295a6a81d0	runtime: refactor hypervisor devices cgroup creation Separatly added hypervisor devices to cgroup to omit not relevant warnings and fail if none of them are available. Also fix a testcase reload removed kernel modules to later testcases and skip some tests on ARM because lack of virtualization support Fixes #6656 Signed-off-by: Balint Tobik <btobik@redhat.com>	2026-02-13 09:23:08 +01:00
stevenhorsman	e84d234721	doc: Update broken/slow URLs Update the URLs to better/existing links Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-02-10 21:58:28 +01:00
Fabiano Fidêncio	5c0269881e	tests: Make editorconfig-checker happy - Trim trailing whitespace and ensure final newline in non-vendor files - Add .editorconfig-checker.json excluding vendor dirs, .patch, .img, .dtb, .drawio, *.svg, and pkg/cloud-hypervisor/client so CI only checks project code - Leave generated and binary assets unchanged (excluded from checker) Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Co-authored-by: Cursor <cursoragent@cursor.com>	2026-02-10 21:58:28 +01:00
Paul Meyer	c5ad3f9b26	Merge pull request #12472 from katexochen/p/disable-nvdimm-cc runtime: disable nvdimm for confidential guest	2026-02-10 14:54:40 +01:00
Paul Meyer	a5f554922c	runtime: disable nvdimm for confidential guest There is code to disable this at runtime when confidential_guest is enabled anyway[^1], but it will omit a warning every time. All the touched configuration files set confidential_guest to true, so we already know nvdimm isn't supported. [^1]: `16a7ed6e14/src/runtime/virtcontainers/qemu_amd64.go (L144-L148)` Signed-off-by: Paul Meyer <katexochen0@gmail.com>	2026-02-10 08:38:18 +01:00
Konstantin Khlebnikov	5d99a141d9	runtime: add hypervisor options for NUMA topology With enable_numa=true hypervisor will expose host NUMA topology as is: map vm NUMA nodes to host 1:1 and bind vpus to relates CPUS. Option "numa_mapping" allows to redefine NUMA nodes mapping: - map each vm node to particular host node or several numa nodes - emulate numa on host without numa (useful for tests) Signed-off-by: Konstantin Khlebnikov <koct9i@gmail.com> Co-authored-by: Zvonko Kaiser <zkaiser@nvidia.com>	2026-02-09 20:09:25 +01:00
Fabiano Fidêncio	ab515712d4	kernel: Unify kernel and kernel-confidential Build a single kernel for both kernel and kernel-confidential on x86_64 and s390x. The kernel is built with TEE support (-x) on those arches only. This helps to simplilfy and to maintain the code, and having a single kernel was the original plan since forever. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-02-09 18:28:23 +01:00
Fabiano Fidêncio	c5b5433866	kernel: Unify nvidia-gpu and nvidia-gpu-confidential Build a single kernel for both nvidia-gpu and nvidia-gpu-confidential, simplifying and reducing code maintenance. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-02-09 18:28:23 +01:00
Alex Lyn	41e8acbc5e	runtime: Map empty ReadStdout/ReadStderr response to io.EOF After the kata-agent "drain-after-exit" change, stdout/stderr EOF is signaled by a successful ReadStdout/ReadStderr reply with empty Data (len==0), instead of an RPC error. However, runtime-go currently returns (0, nil) to io.CopyBuffer() when resp.Data is empty, which violates Go io.Reader semantics and can cause `kubectl exec` to hang after the command output is already printed. To avoid exec hang: In readProcessStream(), map an empty response (len(resp.Data)==0) into (0, io.EOF). This allows the stdout/stderr copy goroutines to terminate, closes exitIOch, and unblocks the wait path so exec can complete normally. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-02-09 15:56:13 +01:00
stevenhorsman	b909c41128	runtime: Bump x/net to v0.49.0 Bump x/net to resolve CVEs: - GO-2026-4441 - GO-2026-4440 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-02-09 14:49:31 +01:00
stevenhorsman	b29312289f	versions: Bump go to 1.24.13 Bump go to 1.24.13 to fix CVE GO-2026-4337 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-02-09 14:49:31 +01:00
Manuel Huber	a786582d0b	rootfs: deprecate initramfs dm-verity mode Remove the initramfs folder, its build steps, and use the kernel based dm-verity enforcement for the handlers which used the initramfs mode. Also, remove the initramfs verity mode capability from the shims and their configs. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-02-05 23:04:35 +01:00
Manuel Huber	7958be8634	runtime: Make kernel_verity_params overwritable Similar to the kernel_params annotation, add a kernel_verity_params annotation and add logic to make these parameters overwritable. For instance, this can be used in test logic to provide bogus dm-verity hashes for negative tests. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-02-05 23:04:35 +01:00
Manuel Huber	f639c3fa17	runtime: Enable kernelinit dm-verity variant This change introduces the kernel_verity_parameters knob to the Go based shim, picking up dm-verity information in a new config field (the corresponding build variable is already produced by the shim build). The change extends the shim to parse dm-verity information from this parameter and to construct the kernel command line appropriately, based on the indicated initramfs or kernelinit build variant. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-02-05 23:04:35 +01:00
Manuel Huber	83a0bd1360	gpu: use dm-verity for the non-TEE GPU handler Use a dm-verity protected rootfs image for the non-TEE NVIDIA GPU handler as well. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-02-05 23:04:35 +01:00
Manuel Huber	d37db5f068	rootfs: Restore "gpu: Handle root_hash.txt ..." This reverts commit `923f97bc66` in order to re-instantiate the logic from commit `e4a13b9a4a`. The latter commit was previously reverted due to the NVIDIA GPU TEE handler using an initrd, not an image. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-02-05 23:04:35 +01:00
Manuel Huber	6d0bb49716	runtime: nvidia: Use img and sanitize whitespaces Shift NVIDIA shim configurations to use an image instead of an initrd, and remove trailing whitespaces from the configs. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-02-05 23:04:35 +01:00
Greg Kurz	e430b2641c	Merge pull request #12435 from bpradipt/crio-annotation shim: Add CRI-O annotation support for device cold plug	2026-02-05 09:29:19 +01:00
Manuel Huber	30c7325e75	runtimes: Sanitize trailing whitespaces Clean up trailing whitespaces, making life easier for those who have configured their IDE to clean these up. Suggest to not add new code with trailing whitespaces etc. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-02-03 11:46:30 -08:00
Pradipta Banerjee	8a449d358f	shim: Add CRI-O annotation support for device cold plug Add support for CRI-O annotations when fetching pod identifiers for device cold plug. The code now checks containerd CRI annotations first, then falls back to CRI-O annotations if they are empty. This enables device cold plug to work with both containerd and CRI-O container runtimes. Annotations supported: - containerd: io.kubernetes.cri.sandbox-name, io.kubernetes.cri.sandbox-namespace - CRI-O: io.kubernetes.cri-o.KubeName, io.kubernetes.cri-o.Namespace Signed-off-by: Pradipta Banerjee <pradipta.banerjee@gmail.com>	2026-02-03 04:51:15 +00:00
Mikko Ylinen	927be7b8ad	runtime: tdx: move to use QEMU from kata-deploy Currently, a working TDX setup expects users to install special TDX support builds from Canonical/CentOS virt-sig for TDX to work. kata-deploy configured TDX runtime handler to use QEMU from the distro's paths. With TDX support now being available in upstream Linux and Ubuntu 24.04 having an install candidate (linux-image-generic-6.17) for a new enough kernel, move TDX configuration to use QEMU from kata-deploy. While this is the new default, going back to the original setup is possible by making manual changes to TDX runtime handlers. Note: runtime-rs is already using QEMUPATH for TDX. Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com>	2026-02-02 11:10:52 +02:00
Fabiano Fidêncio	500146bfee	versions: Bump Go to 1.24.12 Update Go from 1.24.11 to 1.24.12 to address security vulnerabilities in the standard library: - GO-2026-4342: Excessive CPU consumption in archive/zip - GO-2026-4341: Memory exhaustion in net/url query parsing - GO-2026-4340: TLS handshake encryption level issue in crypto/tls Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-01-29 00:23:26 +01:00
Dan Mihai	20ca4d2d79	runtime: DEFDISABLEBLOCK := true 1. Add disable_block_device_use to CLH settings file, for parity with the already existing QEMU settings. 2. Set DEFDISABLEBLOCK := true by default for both QEMU and CLH. After this change, Kata Guests will use by default virtio-fs to access container rootfs directories from their Hosts. Hosts that were designed to use Host block devices attached to the Guests can re-enable these rootfs block devices by changing the value of disable_block_device_use back to false in their settings files. 3. Add test using container image without any rootfs layers. Depending on the container runtime and image snapshotter being used, the empty container rootfs image might get stored on a host block device that cannot be safely hotplugged to a guest VM, because the host is using the same block device. 4. Add block device hotplug safety warning into the Kata Shim configuration files. Signed-off-by: Dan Mihai <dmihai@microsoft.com> Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Signed-off-by: Cameron McDermott <cameron@northflank.com>	2026-01-28 19:47:49 +01:00
Joji Mekkattuparamban	1440dd7468	shim: enforce iommufd for confidential guest vfio Confidential guests cannot use traditional IOMMU Group based VFIO. Instead, they need to use IMMUFD. This is mainly because the group abstraction is incompatible with a confidential device model. If traditional VFIO is specified for a confidential guest, detect the error and bail out early. Fixes #12393 Signed-off-by: Joji Mekkattuparamban <jojim@nvidia.com>	2026-01-28 00:11:38 +01:00
tak-ka3	29e7dd27f1	runtime: Add -info flag support for containerd v2.0+ Add support for the -info flag that containerd v2.0+ passes to shims. The flag outputs RuntimeInfo protobuf to stdout containing the shim name and version information. Fixes #12133 Signed-off-by: tak-ka3 <takumi.hiraoka@acompany-ac.com>	2026-01-22 19:26:44 +01:00
Steve Horsman	ba47bb6583	Merge pull request #11421 from kata-containers/dependabot/go_modules/src/runtime/github.com/urfave/cli-1.22.17 build(deps): bump github.com/urfave/cli from 1.22.14 to 1.22.17 in /src/runtime	2026-01-21 11:46:02 +00:00
XanderC	93beb58c5d	runtime: fix network initialization for non-hotplug VMMs In startVM(), for VMMs without hotplug support (e.g., Firecracker or QEMU microvm), the runtime runs prestart hooks but misses rescanning the network namespace. This causes VMs to boot with uninitialized network configs, as updates from CNI plugins are not captured. This patch adds a network rescan via AddEndpoints after prestart hooks for the non-hotplug path, ensuring correct network info is passed to the VMM configuration before the VM starts. Fixes #11500 Signed-off-by: XanderC <xanderc@qq.com>	2026-01-17 23:56:59 +01:00
Fabiano Fidêncio	33b1f0786e	Revert "arm64: Do not use DAX with the rootfs image" This reverts commit `2acb94ef2d`, as we have a kernel patch approved fixing the issue. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-01-17 19:15:53 +01:00
Manuel Huber	956f43c6c6	runtime: skip MoveTo for systemd cgroups Systemd-managed cgroups use the slice:prefix:name format, which is not a filesystem path. Calling MoveTo() on such paths fails with "invalid group path" and can abort cleanup before Delete() runs. In some cases, this causes pod teardown delays. Skip MoveTo for systemd-formatted sandbox/overhead cgroup paths when sandbox_cgroup_only is true; systemd moves tasks on unit deletion. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-01-16 16:41:38 +01:00
Manuel Huber	6753c3ac08	runtime: nvidia: Disable NVDIMM Disable NVDIMM. When using GPU passthrough, using NVDIMM would create a r/o file-backed memory region. When using a GPU, QEMU tries to DMA- map guest memory for the device, resulting in a mapping error: memory listener initialization failed: Region mem0: vfio_container_dma_map ... -22 (Invalid argument). For the CC configs, NVDIMM is disabled by default in qemu_amd64.go with a warning, but we also explicitly disable the setting in the shim configuration file. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-01-14 22:51:07 +01:00
Fabiano Fidêncio	2acb94ef2d	arm64: Do not use DAX with the rootfs image Kernel 6.18.x has an issue with DAX, which is not yet fixed upstream: ``` [ 0.737679] EXT4-fs (pmem0p1): mounted filesystem 79676804-7c8b-491a-b2a6-9bae3c72af70 ro with ordered data mode. Quota mode: disabled. [ 0.737891] VFS: Mounted root (ext4 filesystem) readonly on device 259:1. [ 0.739119] devtmpfs: mounted [ 0.739476] Freeing unused kernel memory: 1920K [ 0.740156] Run /sbin/init as init process [ 0.740229] with arguments: [ 0.740286] /sbin/init [ 0.740321] with environment: [ 0.740369] HOME=/ [ 0.740400] TERM=linux [ 0.743162] Unable to handle kernel paging request at virtual address fffffdffbf000008 [ 0.743285] Mem abort info: [ 0.743316] ESR = 0x0000000096000006 [ 0.743371] EC = 0x25: DABT (current EL), IL = 32 bits [ 0.743444] SET = 0, FnV = 0 [ 0.743489] EA = 0, S1PTW = 0 [ 0.743545] FSC = 0x06: level 2 translation fault [ 0.743610] Data abort info: [ 0.743656] ISV = 0, ISS = 0x00000006, ISS2 = 0x00000000 [ 0.743720] CM = 0, WnR = 0, TnD = 0, TagAccess = 0 [ 0.743785] GCS = 0, Overlay = 0, DirtyBit = 0, Xs = 0 [ 0.743848] swapper pgtable: 4k pages, 48-bit VAs, pgdp=00000000b9d17000 [ 0.743931] [fffffdffbf000008] pgd=10000000bfa3d403, p4d=10000000bfa3d403, pud=1000000040bfe403, pmd=0000000000000000 [ 0.744070] Internal error: Oops: 0000000096000006 [#1] SMP [ 0.748888] CPU: 0 UID: 0 PID: 1 Comm: init Not tainted 6.18.4 #1 NONE [ 0.749421] pstate: 004000c5 (nzcv daIF +PAN -UAO -TCO -DIT -SSBS BTYPE=--) [ 0.749969] pc : dax_disassociate_entry.constprop.0+0x20/0x50 [ 0.750444] lr : dax_insert_entry+0xcc/0x408 [ 0.750802] sp : ffff80008000b9e0 [ 0.751083] x29: ffff80008000b9e0 x28: 0000000000000000 x27: 0000000000000000 [ 0.751682] x26: 0000000001963d01 x25: ffff0000004f7d90 x24: 0000000000000000 [ 0.752264] x23: 0000000000000000 x22: ffff80008000bcc8 x21: 0000000000000011 [ 0.752836] x20: ffff80008000ba90 x19: 0000000001963d01 x18: 0000000000000000 [ 0.753407] x17: 0000000000000000 x16: 0000000000000000 x15: 0000000000000000 [ 0.753970] x14: ffffbf3154b9ae70 x13: 0000000000000000 x12: ffffbf3154b9ae70 [ 0.754548] x11: ffffffffffffffff x10: 0000000000000000 x9 : 0000000000000000 [ 0.755122] x8 : 000000000000000d x7 : 000000000000001f x6 : 0000000000000000 [ 0.755707] x5 : 0000000000000000 x4 : 0000000000000000 x3 : fffffdffc0000000 [ 0.756287] x2 : 0000000000000008 x1 : 0000000040000000 x0 : fffffdffbf000000 [ 0.756871] Call trace: [ 0.757107] dax_disassociate_entry.constprop.0+0x20/0x50 (P) [ 0.757592] dax_iomap_pte_fault+0x4fc/0x808 [ 0.757951] dax_iomap_fault+0x28/0x30 [ 0.758258] ext4_dax_huge_fault+0x80/0x2dc [ 0.758594] ext4_dax_fault+0x10/0x3c [ 0.758892] __do_fault+0x38/0x12c [ 0.759175] __handle_mm_fault+0x530/0xcf0 [ 0.759518] handle_mm_fault+0xe4/0x230 [ 0.759833] do_page_fault+0x17c/0x4dc [ 0.760144] do_translation_fault+0x30/0x38 [ 0.760483] do_mem_abort+0x40/0x8c [ 0.760771] el0_ia+0x4c/0x170 [ 0.761032] el0t_64_sync_handler+0xd8/0xdc [ 0.761371] el0t_64_sync+0x168/0x16c [ 0.761677] Code: f9453021 f2dfbfe3 cb813080 8b001860 (f9400401) [ 0.762168] ---[ end trace 0000000000000000 ]--- [ 0.762550] note: init[1] exited with irqs disabled [ 0.762631] Kernel panic - not syncing: Attempted to kill init! exitcode=0x0000000b ``` For now, we limit the rootfs that we ship to ARM64 to not use DAX, in the future we'll re-enable it as soon as the patch lands on mainstream kernel. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-01-14 11:46:40 +01:00
dependabot[bot]	2edb161c53	build(deps): bump github.com/urfave/cli in /src/runtime Bumps [github.com/urfave/cli](https://github.com/urfave/cli) from 1.22.14 to 1.22.17. - [Release notes](https://github.com/urfave/cli/releases) - [Changelog](https://github.com/urfave/cli/blob/main/docs/CHANGELOG.md) - [Commits](https://github.com/urfave/cli/compare/v1.22.14...v1.22.17) --- updated-dependencies: - dependency-name: github.com/urfave/cli dependency-version: 1.22.17 dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com>	2026-01-13 09:04:41 +00:00
Manuel Huber	9e30283952	runtime: nvidia: change kernel parameters Remove the agent hotplug timeout parameter from the kernel command line. Having shifted to VFIO cold-plug, this parameter is no longer needed. Remove the no longer required parameter for TDX and thus align the SNP and TDX configurations. Add a parameter to avoid the kernel to mount the /dev tmpfs. NVRC and later on kata-agent attempt this. While kata-agent does not panic when mounting /dev fails, NVRC makes mounting /dev a hard requirement. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-01-12 16:11:28 -08:00
Mikko Ylinen	cc6277b735	Revert "tdx: Update GPU config for the latest TDX stack" Prefer the "full feature TDVF" instead of the generic OVMF build. See Option-B in https://github.com/tianocore/edk2/tree/master/OvmfPkg/IntelTdx#configurations-and-features for the extra hardening supported. FIRMWAREPATH_NV also seems to be TDX specific unlike the Makefile suggests. Therefore, it can be dropped completely. This reverts commit `66ccc25724`.	2026-01-08 10:21:47 +01:00
Mikko Ylinen	e02e226431	packaging: build OVMF for Intel TDX again OVMF build for Intel TDX (aka "TDVF") was disabled in favor of Ubuntu/ CentOS pre-upstream releases of Intel TDX. See `4292c4c3b1`. It's time to re-enable the build and move runtime configurations to use it (the latter will be done in a later commit). This is a partial revert of `4292c4c3b` with the following changes: - Stop calling OVMF for Intel TDX "TDVF" and follow the naming distros use for TDX enabled build: OVMF.inteltdx.fd. - Single binary OVMF.inteltdx.fd is supported using -bios QEMU param. - Secure Boot infrastructure is disabled since Kata does not support it. Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com>	2026-01-08 10:21:47 +01:00
Fabiano Fidêncio	88cdfab604	runtime: nvidia: Align static_sandbox_resource_mgmt Let's ensure we have those aligned for both CC and non-CC use-case. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-12-17 17:04:51 +01:00
Fabiano Fidêncio	995770dbeb	runtime: nvidia: Use cold-plug by default Now that we have the way to do cold-plug, let's ensure we also use it for the non-CC use case. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-12-17 17:04:51 +01:00
dependabot[bot]	2137b1fa3a	build(deps): bump github.com/containernetworking/plugins in /src/runtime Bumps [github.com/containernetworking/plugins](https://github.com/containernetworking/plugins) from 1.7.1 to 1.9.0. - [Release notes](https://github.com/containernetworking/plugins/releases) - [Commits](https://github.com/containernetworking/plugins/compare/v1.7.1...v1.9.0) --- updated-dependencies: - dependency-name: github.com/containernetworking/plugins dependency-version: 1.9.0 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com>	2025-12-10 16:10:24 +01:00
LandonTClipp	b50a73912d	runtime: Config test extension for IOMMUFDID Adding additional cases for the IOMMUFDID method to check for non-IOMMUFD paths are passed. The method should do the right thing. Signed-off-by: LandonTClipp <11232769+LandonTClipp@users.noreply.github.com>	2025-12-10 15:46:28 +01:00
LandonTClipp	d5e4cf6b4d	runtime: Add test for ExecuteVFIODeviceAdd Copilot made a good point that we should have a test for this. Thus, this commit. Signed-off-by: LandonTClipp <11232769+LandonTClipp@users.noreply.github.com>	2025-12-10 15:46:28 +01:00
LandonTClipp	137866f793	runtime: Allow QMP commands to be logged in debug level Logging the QMP commands gives us a lot of flexibility to troubleshoot issues with what is being sent to QEMU. Signed-off-by: LandonTClipp <11232769+LandonTClipp@users.noreply.github.com>	2025-12-10 15:46:28 +01:00
LandonTClipp	a3b5764f67	runtime: Fix import cycle and add unit test for IOMMUFDID() An import cycle was introduced because of a mutual need for the constant that describes the prefix of IOMMUFD files. We need to extract this out into a higher-level package. Signed-off-by: LandonTClipp <11232769+LandonTClipp@users.noreply.github.com>	2025-12-10 15:46:28 +01:00
LandonTClipp	09438fd54f	runtime: Add IOMMUFD Object Creation for QEMU QMP Commands The QMP commands sent to QEMU did not properly set up IOMMUFD objects in the codepath that handles VFIO device hot-plugging. This is mainly relevant in the Kubernetes use-case where the VFIO devices are not available when QEMU is first launched. Signed-off-by: LandonTClipp <11232769+LandonTClipp@users.noreply.github.com>	2025-12-10 15:46:28 +01:00
Manuel Huber	cb8fd2e3b1	runtime: gpu: Skip CDI annos for pause container The pause container does not need CDI annotations, these are only intended for workload containers. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2025-12-10 13:26:04 +01:00
Zvonko Kaiser	f8ad17499d	gpu: VFIO handling container vs sandbox If the sandbox has cold-plugged a IOMMUFD device but the device-plugins sends us a /dev/vfio/<NUM> device we need to check if the IOMMUFD device and the VFIO device are the same We have the sibling.BDF we now need to extract the BDF of the devPath that is either /dev/vfio/<NUM> or /dev/vfio/devices/vfio<NUM> Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-12-05 16:53:31 +01:00
Fabiano Fidêncio	923f97bc66	rootfs: Temporarily revert "gpu: Handle root_hash.txt correctly" This reverts commit `e4a13b9a4a`, as it caused some issues with the GPU workflows. Reverting it is better, as it unblocks other PRs. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-12-05 11:47:37 +01:00

1 2 3 4 5 ...

2184 Commits