kata-containers

mirror of https://github.com/kata-containers/kata-containers.git synced 2026-03-06 04:42:04 +00:00

Author	SHA1	Message	Date
Aurélien Bombo	68e67d7f8a	docs: Use more accurate wording for /dev hostPath behavior I got lazy when I first added this section in `5c21b1f`, so updating the language to specify that any non-regular host file (under /dev) qualifies, not just devices. This matches the actual code, see: `330bfff4be/src/runtime/virtcontainers/mount.go (L57-L83)` Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2026-03-02 11:32:01 +00:00
Steve Horsman	b147cb1319	Merge pull request #12587 from fidencio/topic/runtime-add-configurable-kubelet-root-dir runtimes: add configurable kubelet root dir	2026-02-28 19:06:14 +00:00
Xuewei Niu	8a4ae090e6	Merge pull request #12513 from lifupan/event_publish send the task create/start/delete event to containerd	2026-02-28 14:41:46 +08:00
Zvonko Kaiser	afe09803a1	gpu: Ignore OVMF and use the Kernel for proper PCI setup Sometimes OVFM provides incorrect values to the kernel we override it by telling the kernel to handle the PCI space setup like allocating the proper window sizes and assigning the proper busses to each device. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2026-02-27 22:54:31 +01:00
Manuel Huber	88f746dea8	runtime: nvidia: Use OVMF for NV GPU handler Shift to using OVMF instead of using SeaBios. Signed-off-by: Manuel Huber <manuelh@nvidia.com> Update src/runtime/Makefile Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2026-02-27 22:54:31 +01:00
Zvonko Kaiser	eec397ac08	qemu: Remove PCIe root port BAR reserve sizing Stop computing and setting mem-reserve and pref64-reserve on PCIe root ports and switch ports. Remove getBARsMaxAddressableMemory() which scanned host GPU BARs to pre-calculate these values. The previous approach only considered GPU devices (IsGPU(), class 0x0302) when scanning for BAR sizes, so devices like NVSwitches (class 0x0680) with their 32MB non-prefetchable BAR0 were not accounted for and received the 4MB default. Additionally, GetTotalAddressableMemory() classifies BARs by 32/64-bit address width rather than by the prefetchable flag that QEMU's mem-reserve vs pref64-reserve maps to. Modern QEMU introspects VFIO device BARs when they are attached to root ports and sizes the MMIO windows accordingly. Modern OVMF (edk2-stable202502+) automatically calculates the 64-bit PCI MMIO aperture based on the BARs of actually present devices during PCI enumeration. Omitting the reserve parameters lets QEMU and OVMF handle MMIO window sizing correctly for all device types including GPUs, NVSwitches, and NICs without requiring host-side BAR scanning. This also removes the nvpci dependency from qemu_arch_base.go. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2026-02-27 22:54:31 +01:00
Zvonko Kaiser	bb7fd335f3	qemu: Remove OVMF X-PciMmio64Mb fw_cfg hint Modern OVMF (edk2-stable202502 and later) automatically sizes the 64-bit PCI MMIO aperture based on the BARs of actually attached devices during PCI enumeration. The opt/ovmf/X-PciMmio64Mb fw_cfg hint is no longer needed to ensure large-BAR devices like NVIDIA GPUs receive adequate MMIO space. The previous approach was fragile: the runtime scanned host PCI devices to estimate the required aperture size, but only considered GPU devices (class 0x0302), missing NVSwitches and other devices with large BARs. Removing this code avoids confusion about MMIO sizing responsibility. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2026-02-27 22:54:31 +01:00
Fabiano Fidêncio	330bfff4be	kata-deploy: Fix nydus snapshotter config (on v3 config version) On containerd v3 config, disable_snapshot_annotations must be set under the images plugin, not the runtime plugin. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-02-27 18:20:30 +01:00
Fabiano Fidêncio	0a73638744	runtime: add configurable kubelet root dir Different kubernetes distributions, such as k0s, use a different kubelet root dir location instead of the default /var/lib/kubelet, so ConfigMap and Secret volume propagation were failing. This adds a kubelet_root_dir config option that the go runtime uses when matching volume paths and kata-deploy now sets it automatically for k0s via a drop-in file. runtime-rs does not need this option: it identifies ConfigMap/Secret, projected, and downward-api volumes by volume-type path segment (kubernetes.io~configmap, etc.), not by kubelet root prefix. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-02-27 14:10:57 +01:00
Steve Horsman	2695007ef8	Merge pull request #12584 from stevenhorsman/switch-actionlint-workflow workflow: Update actionlint workflows	2026-02-27 13:03:58 +00:00
Steve Horsman	3442fc7d07	Merge pull request #12477 from kata-containers/workflow-improvements workflow: Recommended improvements	2026-02-27 11:57:22 +00:00
Markus Rudy	d9d886b419	agent-policy: read bundle-id from OCI spec rootfs The host path of bundles is not portable and could be literally anything depending on containerd configuration, so we can't rely on a specific prefix when deriving the bundle-id. Instead, we derive the bundle-id from the target root path in the guest. NOTE: fixes https://github.com/kata-containers/kata-containers/issues/10065 Signed-off-by: Markus Rudy <mr@edgeless.systems> Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-02-27 10:24:38 +01:00
Hyounggyu Choi	be5ae7d1e1	Merge pull request #12573 from BbolroC/support-memory-hotplug-go-runtime-s390x runtime: Support memory hotplug via virtio-mem on s390x	2026-02-27 09:59:40 +01:00
Steve Horsman	c6014ddfe4	Merge pull request #12574 from sathieu/kata-deploy-kubectl-image kata-deploy: allow to configure kubectl image	2026-02-27 08:42:06 +00:00
Steve Horsman	1048132eb1	Merge pull request #12564 from stevenhorsman/remove-unused-dependencies Try and remove unused crates	2026-02-26 13:53:44 +00:00
Aurélien Bombo	2a13f33d50	Merge pull request #12565 from microsoft/danmihai1/clh-51.1 versions: update cloud hypervisor to v51.1	2026-02-26 07:52:57 -06:00
Hyounggyu Choi	b1847f9598	tests: Run TestContainerMemoryUpdate() on s390x only with virtio-mem Let's run `TestContainerMemoryUpdate` on s390x only when virtio-mem is enabled. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2026-02-26 14:21:34 +01:00
Hyounggyu Choi	b9f3d5aa67	runtime: Support memory hotplug with virtio-mem on s390x This commit adds logic to properly handle memory hotplug for QemuCCWVirtio in the ExecMemdevAdd() path. The new logic is triggered only when virtio-mem is enabled. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2026-02-26 14:21:34 +01:00
Hyounggyu Choi	19771671c2	runtime: Handle virtio-mem resize in hotplugAddMemory() ResizeMemory() already contains the virtio-mem resize logic. However, hotplugAddMemory(), which is invoked via a different path, lacked this handling and always fell back to the pc-dimm path, even when virtio-mem was configured. This commit adds virtio-mem resize handling to hotplugAddMemory(). It also adds corresponding unit tests. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2026-02-26 14:21:34 +01:00
Fabiano Fidêncio	8c91e7889c	helm-chart: support digest pinning for images When image.reference or kubectlImage.reference already contains a digest (e.g. quay.io/...@sha256:...), use the reference as-is instead of appending :tag. This avoids invalid image strings like 'image@sha256🔤' when tag is empty and allows users to pin by digest. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-02-26 13:39:51 +01:00
Mathieu Parent	b61d169472	kata-deploy: allow to configure kubectl image This can be used to: - pin tag (current is 20260112) - pin digest - use another image Signed-off-by: Mathieu Parent <mathieu.parent@insee.fr>	2026-02-26 13:12:03 +01:00
stevenhorsman	308442e887	workflow: Update actionlint workflows The actionlint gh extension is outdated and the wrapping seems unnecessary when there is a github action that seems to be maintained, so let's update to use that Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-02-26 11:52:19 +00:00
stevenhorsman	82c27181d8	kata-deploy: Remove unused crates cargo machete has identified `serde` and `thiserror` as being unused, so remove them from Cargo.toml Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-02-26 09:38:35 +00:00
stevenhorsman	bdbfe9915b	kata-ctl: Remove unused crates cargo machete has identified the follow crates as unused: - containerd-shim-protos - safe-path - strum - ttrpc strum is neded (and maybe isn't picked up due to it being used by macros?), so add it to the ignore list and remove the rest Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-02-26 09:38:35 +00:00
stevenhorsman	b4365bdcaa	genpolicy: Remove unused crates `cargo machete` has identified `openssl` and `serde-transcode` as being un-used. openssl is required, so add it to the ignore list and just remove serde-transcode Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-02-26 09:38:35 +00:00
stevenhorsman	382c6d2a2f	agent-ctl: Remove unused crates `log` and `rustjail` are flagged by cargo machete as unused, so lets remove them to reduce the footprint of crates in this tool Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-02-26 09:38:35 +00:00
stevenhorsman	e43a17c2ba	runtime-rs: Remove unused crates - Remove unused crates to reduce our size and the work needed to do updates - Also update package.metadata.cargo-machete with some crates that are incorrectly coming up as unused Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-02-26 09:37:46 +00:00
stevenhorsman	8177a440ca	libs: Remove unused crates Remove unused crates to reduce our size and the work needed to do updates Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-02-26 09:37:46 +00:00
stevenhorsman	ed7ef68510	dragonball: Remove unused crates Remove the crates that cargo machete has assessed as being unused Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-02-26 09:37:15 +00:00
stevenhorsman	c1b8c6bce6	dragonball: Update cargo-machete config cargo machete can't understand `host-device = ["dep:vfio-bindings"`, so tell it to ignore `vfio-bindings` and not suggest it's unused Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-02-26 09:37:14 +00:00
stevenhorsman	1139a9bb8a	trace-forwarder: Try and remove unused crates I ran cargo machete in trace-forwarder and it suggested that some of the packages were not used, including a chain with a vulnerability, so try and remove them to resolve RUSTSEC-2021-0139 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-02-26 09:37:14 +00:00
Steve Horsman	675c0c3450	Merge pull request #12553 from kata-containers/dependabot/cargo/src/tools/agent-ctl/keccak-0.1.6 build(deps): bump keccak from 0.1.5 to 0.1.6 in /src/tools/agent-ctl	2026-02-26 08:53:57 +00:00
Steve Horsman	9a921bb396	Merge pull request #12575 from kata-containers/build-checks-go-install-setup-go workflows: Swap our go install for setup-go	2026-02-26 08:51:56 +00:00
Steve Horsman	da0ca483b0	Merge pull request #12572 from fitzthum/bump-trustee versions: bump Trustee to latest version	2026-02-26 08:48:37 +00:00
Alex Lyn	57b0148356	Merge pull request #12400 from Apokleos/enhance-snp-rs runtime-rs: Enhance Qemu/SNP Protection	2026-02-26 15:29:33 +08:00
Dan Mihai	2361dc7ca0	tests: k8s: reinstate testing on mariner hosts Reinstate mariner host testing - including the Agent Policy tests on these hosts - now that a new CLH version brought in the required fixes. This reverts commit `ea53779b90`. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2026-02-25 21:01:25 +00:00
Dan Mihai	7973e4e2a8	runtime: clh: disable nested vCPUs on MSHV The recently-added nested property is true by default, but is not supported yet on MSHV. See cloud-hypervisor/cloud-hypervisor#7408 for additional information. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2026-02-25 21:01:25 +00:00
Dan Mihai	24ac2ccb5c	runtime-rs: clh: specify raw image format Specify raw image format for all guest block devices. - Attempting to auto-detect the image format from CLH would be riskier for the Host. - Creating a new raw image file, auto-detecting its format, and then creating a filesystem from the Guest onto the block device is no longer supported by CLH v51. Therefore, Kata CI's k8s-block-volume.bats would fail without specifying the raw format when hot plugging its block device. - See cloud-hypervisor/cloud-hypervisor@b3e8e2a for additional information. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2026-02-25 21:01:25 +00:00
Dan Mihai	dc398e801c	runtime: clh: specify raw image format Specify raw image format for all guest block devices. - Attempting to auto-detect the image format from CLH would be riskier for the Host. - Creating a new raw image file, auto-detecting its format, and then creating a filesystem from the Guest onto the block device is no longer supported by CLH v51. Therefore, Kata CI's k8s-block-volume.bats would fail without specifying the raw format when hot plugging its block device. - See cloud-hypervisor/cloud-hypervisor@b3e8e2a for additional information. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2026-02-25 21:01:25 +00:00
Dan Mihai	0629354ca0	versions: update cloud hypervisor to v51.1 ``` v51.1 ===== This is a bug fix release. The following issues have been addressed: * Fix image_type in OpenAPI definition (#7734) v51.0 ===== This release has been tracked in v51.0 group of our roadmap project. Security Fixes This release fixes a security vulnerability in disk image handling. Details can be found in GHSA-jmr4-g2hv-mjj6. * A new `backing_files=on\|off` option has been added to `--disk` to explicitly control whether QCOW2 backing files are permitted. This defaults to `off` to prevent the loading of backing files entirely. (#7685) * Explicit image type specification via the user interface, removing reliance on format autodetection (#7728). * Prevent sector-zero writes for autodetected raw images (#7728). Significant QCOW2 v3 Improvements A large number of QCOW2 v3 specification features have been implemented: * RAW backing file support for QCOW2 overlays (#7570) * Zero bit in L2 entries (#7627) * Incompatible feature bit validation (#7612) * Dirty bit support (#7636) * Variable refcount widths (1 to 64-bit) (#7633) * Corrupt bit detection and marking (#7639) * Autoclear feature bits handling (#7648) * Thread safety fix for multiple virtio queues (`num_queues > 1`) (#7661) * Correct zero-fill for reads beyond backing file size (#7678) * Live disk resize support (#7687) ACPI Generic Initiator Support ACPI Generic Initiator Affinity (SRAT Type 5) support has been added to associate VFIO-PCI devices with dedicated memory/CPU-less NUMA nodes. This enables the guest OS to make NUMA-aware memory allocation decisions for device workloads. A new `device_id` parameter has been added to `--numa` for specifying VFIO devices. (#7626) Block Device DISCARD and WRITE_ZEROES Support The `virtio-blk` device now supports `DISCARD` and `WRITE_ZEROES` operations for QCOW2 and RAW image formats. This enables thin provisioning and efficient space reclamation when guests trim filesystems. A new `sparse=on\|off` option has been added to `--disk` to control disk space management: `sparse=on` (default) enables thin provisioning with space reclamation, while `sparse=off` provides thick provisioning with consistent I/O latency. (#7666) Notable Performance Improvements * Transparent Huge Pages (THP) support has been extended to cover anonymous shared memory (`shared=on`) via `madvise`. Previously, THP was only used for non-shared memory. (#7646) * The `vhost-user-net` device now uses the default set of vhost-user virtio features, including `VIRTIO_F_RING_INDIRECT_DESC`, which provides a performance improvement. (#7653) MSHV Support Improvements * Optimize CPU state update after emulation by only updating special registers when changed (#7603) * Enable SMT for guests with `threads_per_core > 1` (#7668) * Stub `save_data_tables()` to unblock VM pause/resume (#7692) * Handle `GHCB_INFO_SPECIAL_DBGPRINT` VMG exit in SEV-SNP guest exit handler (#7703) * Fix CVM boot failure on MSHV (#7548) * Fix CPU topology detection for multithreaded configurations (#7576) Notable Bug Fixes * Fix VFIO device hot-remove leaving group and container file descriptors open, preventing re-add (#7676) * Fix snapshot restore when backing file is on read-only storage with `shared=false` (#7674) * Enforce `VIRTIO_BLK_F_RO` even if guest does not negotiate it (#7705) * Fix read-only block device FLUSH requests from OVMF preventing VMs from booting (#7706) * Fix vhost-user device not properly dropping unowned file descriptors (#7679) * Fix `vhost-user-block` `get_config` interoperability (#7617) * Fix vsock TOCTOU race condition by copying packet header from guest memory before processing (#7530) * Fix vsock handling of large TX packets spanning multiple data descriptors (#7680) * Add `gettid()` to all seccomp filters (#7596) * Fix MAC address parsing that wrongly allowed `+` instead of hex characters (#7579) * Improve UUID parse error message and `--net` fd help text (#7702) * Fix various inconsistencies in our OpenAPI specification file (#7716, #7726) * Various documentation fixes (#7602, #7606) ``` Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2026-02-25 21:01:25 +00:00
Tobin Feldman-Fitzthum	b4b5db2f1c	tests: fixup SNP attestation test for new Trustee version Trustee now returns the binary SNP TCB claims as hex rather than base64 (for consistency with other platforms). Fortunately, the sev-snp-measure tool has a flag for setting the output type of the launch digest. I think hex is the default, but let's keep the flag here to be explicit. Signed-off-by: Tobin Feldman-Fitzthum <tfeldmanfitz@nvidia.com>	2026-02-25 09:57:36 -08:00
Steve Horsman	a655605e8f	Merge pull request #12566 from manuelh-dev/mahuber/fail-exp-timeout tests: Extend fail timeout for failure test	2026-02-25 16:11:53 +00:00
stevenhorsman	856ba08c71	workflows: Swap our go install for setup-go Unfortunately, due to golang/go#75031, there is an issue that results in `go: no such tool "covdata"` with a automatically installed 1.25 toolchain, so the approach to skip the install_go.sh script (which causes double install problems) didn't work. Try the alternative approach of using setup-go action, which should do a more comprehensive job Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-02-25 13:46:40 +00:00
Alex Lyn	2fb6376a99	dragonball: Reduce warnings in dragonball when using 1.91 rust tools Some warnings come up when we use bumped rust-1.91, this commit aims to eliminate warnings. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-02-25 21:04:35 +08:00
Alex Lyn	dc87c1f732	runtime-rs: Add support for configurable Qemu/SEV-SNP guest policy This commit enables the SEV-SNP guest policy to be explicitly configured via the runtime configuration in runtime-rs. To provide both ease of use and maximum flexibility, the following logic is implemented: 1. If the user provides a custom `snp_guest_policy` in the configuration, this value is passed directly to the QEMU SEV-SNP guest object. 2. If the user does not specify a policy, the driver defaults to `0x30000`, matching QEMU's standard default for SEV-SNP guests. This enhancement allows users to fine-tune security constraints through the policy bitmask, while ensuring a sensible and functional default for standard SNP deployments. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-02-25 21:04:35 +08:00
Alex Lyn	9fc708ec4f	kata-types: Add SNP launch configuration fields to SecurityInfo This commit introduces three new fields to the `SecurityInfo` struct to support SEV-SNP (Secure Nested Paging) attestation and measurement capabilities: (1) `snp_id_block`: A 96-byte Base64-encoded ID block for the SNP_LAUNCH_FINISH command. (2) `snp_id_auth`: A 4096-byte Base64-encoded authentication structure accompanying the ID block. (3) `snp_guest_policy`: A bitmask for the SNP guest policy, passed to the SNP_LAUNCH_START command. These fields enable users to provide identity information to the SNP firmware, allowing for remote attestation and verified guest launches. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-02-25 21:04:35 +08:00
Alex Lyn	f9ffc95c3c	runtime-rs: Introduce a SNP policy field in ObjectSevSnpGuest A bitmask for the SNP guest policy is introduced in ObjectSevSnpGuest to help pass to Qemu cmdline. And defaults to 0x30000 (QEMU's default) to maintain standard behavior it just looks like as: "policy=0x30000" Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-02-25 21:04:35 +08:00
Alex Lyn	21e0df4c06	runtime-rs: Add kernel irqchip with split for SNP Add more param with split when qemu launches for SNP. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-02-25 21:04:35 +08:00
Alex Lyn	ebe87d0e6f	runtime-rs: Disable memory hotplug setting within SEV-SNP For SEV-SNP, memory overcommit is not supported. we only set the memory size. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-02-25 21:04:35 +08:00
Alex Lyn	830667c041	runtime-rs: Add two methods for Qemu Memory to control memory set Introduce two methods to help set max memory and num_slots. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-02-25 21:04:35 +08:00

1 2 3 4 5 ...

18060 Commits