kata-containers

mirror of https://github.com/kata-containers/kata-containers.git synced 2026-05-04 12:31:27 +00:00

Author	SHA1	Message	Date
Alex Lyn	d8db044c63	runtime-rs: Add erofs rootfs handling logic in handler_rootfs Add handling for multi-layer EROFS rootfs in RootFsResource handler_rootfs method. It will correctly handle the multi-layers erofs rootfs. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-04-19 13:24:31 +02:00
Alex Lyn	8d7051436a	runtime-rs: Add support for erofs rootfs with multi-layer Add erofs_rootfs.rs implementing ErofsMultiLayerRootfs for multi-layer EROFS rootfs with VMDK descriptor generation. It's the core implementation of Erofs rootfs within runtime. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-04-19 13:24:31 +02:00
Alex Lyn	cb706219ae	runtime-rs: Change Rootfs::get_storage return type Change Rootfs::get_storage to return Option<Vec<Storage>> to support multi-layer rootfs with multiple storages. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-04-18 22:46:33 +02:00
Alex Lyn	c06bc388c2	runtime-rs: Add format argument to hotplug_block_device method Add format argument to hotplug_block_device for flexibly specifying different block formats. With this, we can support kinds of formats, currently raw and vmdk are supported, and some other formats will be supported in future. Aside the formats, the corresponding handling logics are also required to properly handle its options needed in QMP blockdev-add. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-04-18 22:46:33 +02:00
Alex Lyn	15740439eb	runtime-rs: Add BlockDeviceFormat enum to support more block formats In practice, we need more kinds of block formats, not limited to `Raw`. This commit aims to add BlockDeviceFormat enum for kinds of block device formats support, like RAW, VMDK, etc. And it will do some following actions to make this changes work well, including format field in BlockConfig. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-04-18 19:00:44 +02:00
Alex Lyn	8ed4fa1406	runtime-rs: Add RUNTIME_ALLOW_MOUNTS to RuntimeInfo Add RUNTIME_ALLOW_MOUNTS annotation to RuntimeInfo to specify custom mount types allowed by the runtime. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-04-18 19:00:44 +02:00
Fabiano Fidêncio	d04bb98e09	runtime-rs: Increase reconnect_timeout_ms for confidential VMs The Go runtime's CoCo dev config uses dial_timeout = 45s, but all runtime-rs confidential VM configs had reconnect_timeout_ms set to 3000ms (3s) or 5000ms (SE). This is too short for confidential VMs, especially on arm64 where UEFI firmware (AAVMF) adds significant boot time on top of the measured boot process, causing ECONNRESET errors on the vsock connection before the agent is ready. Bump reconnect_timeout_ms to 45000ms across all confidential VM configs (coco-dev, SNP, TDX, SE) to match the Go runtime. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Made-with: Cursor	2026-04-18 00:48:13 +02:00
Saul Paredes	6f6e45522e	Merge pull request #11562 from Apokleos/clh-initdata runtime-rs: Add CoCo/protected device for initdata within runtime-rs/Cloud Hypervisor	2026-04-17 11:09:19 -07:00
Fabiano Fidêncio	690f5a2b62	Merge pull request #12862 from fidencio/topic/runtime-rs-enable-measured-rootfs-tests runtime-rs: enable measured rootfs for qemu-coco-dev-runtime-rs	2026-04-17 18:48:47 +02:00
Fabiano Fidêncio	1ec0e344e5	runtime-rs: enable measured rootfs for qemu-coco-dev-runtime-rs Add kernel_verity_params to the qemu-coco-dev-runtime-rs configuration so the runtime can assemble dm-verity kernel parameters, and remove the test skip that was disabling measured rootfs tests for this hypervisor. Fixes: #12851 Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-04-17 15:22:17 +02:00
Fabiano Fidêncio	eda3bc6190	runtime-rs: wire GetDiagnosticData for termination logs Add runtime-rs support for the GetDiagnosticData RPC. This extends the Agent trait, types, and protocol translation layer with the new request/response types. During container stop, when shared_fs is "none" and the terminationMessagePolicy annotation is "File", the runtime copies the termination log from the guest via GetDiagnosticData. The call is best-effort to avoid blocking container teardown. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-04-17 13:16:25 +02:00
Alex Lyn	c546b3c585	Merge pull request #12843 from microsoft/saul/build-opt runtime-rs: add build optimization flags	2026-04-16 09:05:20 +08:00
Alex Lyn	2f6319f130	runtime-rs: Fix unformatted code in runtime-rs When build runtime-rs, one unformatted code block comes up,as below: ``` - config - .hypervisor - .entry("qemu".to_owned()) - .and_modify(\|hv\| { - hv.cpu_info.default_vcpus = default_vcpus; - hv.cpu_info.default_maxvcpus = default_maxvcpus; - hv.memory_info.default_memory = default_memory; - hv.memory_info.default_maxmemory = default_maxmemory; - }); + config.hypervisor.entry("qemu".to_owned()).and_modify(\|hv\| { + hv.cpu_info.default_vcpus = default_vcpus; + hv.cpu_info.default_maxvcpus = default_maxvcpus; + hv.memory_info.default_memory = default_memory; + hv.memory_info.default_maxmemory = default_maxmemory; + }); ``` Let's format it now. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-04-15 14:48:23 +02:00
Saul Paredes	9404104aba	runtime-rs: add build optimization flags Enable the following optimizations when building runtime-rs in release mode: - lto: true - codegen-units=1: Setting these reduce the binary size and improve performance at the cost of longer build times. Without these flags: - build time: 4m 55s - binary size: 51 MB With these flags: - build time: 7m 21s - binary size: 38MB Per https://github.com/kata-containers/kata-containers/issues/1125 and local experiments, a smaller binary size leads to a smaller shim memory footprint. - https://nnethercote.github.io/perf-book/build-configuration.html#codegen-units - https://nnethercote.github.io/perf-book/build-configuration.html#link-time-optimization Signed-off-by: Saul Paredes <saulparedes@microsoft.com>	2026-04-14 15:52:38 -07:00
stevenhorsman	5bcc006447	runtime-rs: Add missing license The ch-config crate was missing a license Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-04-11 08:46:32 +01:00
Fabiano Fidêncio	3ce3644c3c	Merge pull request #12807 from PiotrProkop/blk-sector-rust runtime-rs: allow specifying logical/physical sector size for block devices	2026-04-11 00:42:45 +02:00
Fabiano Fidêncio	36a2d8e7f2	agent: Make launch_process_timeout configurable The hardcoded DEFAULT_LAUNCH_PROCESS_TIMEOUT of 6 seconds in the kata agent is insufficient for environments with NVIDIA GPUs and NVSwitches, where the attestation-agent needs significantly more time to collect evidence during initialization (e.g. ~2 seconds per NVSwitch). When the timeout expires, the agent (PID 1) exits with an error, causing the guest kernel to perform an orderly shutdown before the attestation-agent has finished starting. Make this timeout configurable via the kernel parameter agent.launch_process_timeout (in seconds), preserving the 6-second default for backward compatibility. The Go runtime is wired up to pass this value from the TOML config's [agent.kata] section through to the kernel command line. The NVIDIA GPU configs set the new default to 15 seconds. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Made-with: Cursor	2026-04-10 14:47:01 +02:00
PiotrProkop	82de35c720	runtime-rs: allow specifying logical/physical sector size for block devices Add two new configuration knobs that control the logical and physical sector sizes advertised by virtio-blk devices to the guest: block_device_logical_sector_size (config file) block_device_physical_sector_size (config file) io.katacontainers.config.hypervisor.blk_logical_sector_size (annotation) io.katacontainers.config.hypervisor.blk_physical_sector_size (annotation) The annotation names are abbreviated relative to the config file keys because Kubernetes enforces a 63-character limit on annotation name segments, and the full names would exceed it. Both settings default to 0 (let QEMU decide). When set, they are passed as logical_block_size and physical_block_size in the QMP device_add command during block device hotplug. Setting logical_sector_size smaller then container filesystem block size will cause EINVAL on mount. The physical_sector_size can always be set independently. Values must be 0 or a power of 2 in the range [512, 65536]; other values are rejected with an error at sandbox creation time. Signed-off-by: PiotrProkop <pprokop@nvidia.com>	2026-04-10 11:14:51 +02:00
Fabiano Fidêncio	218077506b	Merge pull request #12769 from RuoqingHe/runtime-rs-allow-install-on-riscv runtime-rs: Allow installation on RISC-V platforms	2026-04-10 10:24:40 +02:00
Steve Horsman	9e8069569e	Merge pull request #12734 from Apokleos/rm-v9p-rs runtime-rs: Remove virtio-9p Shared Filesystem Support	2026-04-09 16:15:55 +01:00
Hyounggyu Choi	f15f7f49f1	Merge pull request #12787 from fidencio/topic/runtime-rs-qemu-arm64-use-static-sandbox-resource-mgmt runtime: qemu: Enable static sandbox resource management on ARM & s390x	2026-04-09 09:18:11 +02:00
Ruoqing He	98ee385220	runtime-rs: Consolidate unsupported arch Consolidate arch we don't support at the moment, and avoid hard coding error messages per arch. Signed-off-by: Ruoqing He <ruoqing.he@lingcage.com>	2026-04-09 04:18:50 +00:00
Ruoqing He	26ffe1223b	runtime-rs: Allow install on riscv64 platform runtime-rs works with QEMU on RISC-V platforms, let's enable installation on RISC-V. Signed-off-by: Ruoqing He <ruoqing.he@lingcage.com>	2026-04-09 04:18:50 +00:00
Amanda Liem	79f844d057	runtime: SNP img-based rootfs with dm-verity Follow-on to kata-containers/kata-containers#12396 Switch SNP config from initrd-based to image-based rootfs with dm-verity. The runtime assembles the dm-mod.create kernel cmdline from kernel_verity_params, and with kernel-hashes=on the root hash is included in the SNP launch measurement. Also add qemu-snp to the measured rootfs integration test. Signed-off-by: Amanda Liem <aliem@amd.com>	2026-04-08 16:46:32 +00:00
Fabiano Fidêncio	e0141991d3	runtime-rs: Enable static sandbox resource management on s390x runtime-rs memory hotplug hard-codes the `pc-dimm` device driver, which is an x86-only QEMU device model. On s390x, the `s390-ccw-virtio` machine type does not support `pc-dimm` at all — the Go runtime handles this by using `virtio-mem-ccw` instead (controlled by the `enable_virtio_mem` config knob, defaulting to true on s390x). runtime-rs has no virtio-mem support, so any attempt to dynamically hotplug memory on s390x fails with: 'pc-dimm' is not a valid device model name This is a pre-existing limitation on main — it has never worked. It is now visible because commit `45dfb6ff25` ("runtime-rs: Fix initial vCPU / memory with static_sandbox_resource_mgmt") expanded runtime-rs test coverage, causing k8s-memory.bats and k8s-oom.bats to actually exercise this code path on s390x. Let's enforce using static_sandbox_resources_mgmt also for s390x so the VM is sized upfront at creation time, bypassing the broken dynamic hotplug path entirely. If someone decides to implement hotplug support for s390x, the work would basically be an implemntation of virtio-mem-ccw support in the runtime-rs QEMU backend (boot-time device creation, qom-set based resize, and virtio-mem aware memory accounting), mirroring what the Go runtime already does, but I'm not game for this (sorry). Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Made-with: Cursor	2026-04-08 16:36:00 +02:00
Fabiano Fidêncio	ffab9b7eee	runtime: qemu: Enable static sandbox resource management on ARM runtime-rs lacks several features needed for CPU hotplug on ARM: pflash/UEFI firmware passthrough, SMP topology in -smp, nr_cpus kernel parameter, and QMP vCPU add handling for the virt machine type (which requires core-id only placement with socket/thread/die set to -1). Without static sandbox resource management, these gaps cause failures in tests like k8s-memory.bats where the VM is not correctly sized for the workload. Enable static_sandbox_resource_mgmt for aarch64 in the QEMU runtime-rs configuration so the VM is pre-sized at creation time, sidestepping the need for hotplug entirely. Together with this we're aligning the go runtime to the very same behaviour. Fixes: #10928 Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Made-with: Cursor	2026-04-08 16:36:00 +02:00
Fabiano Fidêncio	0e5e4802d7	runtime-rs: Fix initial vCPU / memory with static_sandbox_resource_mgmt InitialSizeManager::setup_config() is responsible for applying the sandbox workload sizing (computed from containerd/CRI-O sandbox annotations) to the hypervisor configuration before VM creation. Previously, the workload vCPU count was only logged but never actually added to default_vcpus, so the VM was always created with only the base vCPUs from the configuration/annotations. This caused the k8s-sandbox-vcpus-allocation test to fail with qemu-snp-runtime-rs: a pod with default_vcpus=0.75 and a container CPU limit of 1.2 should see ceil(0.75 + 1.2) = 2 vCPUs, but only got 1. Additionally, the workload memory was being added to default_memory unconditionally, diverging from the Go runtime which only applies both CPU and memory additions when static_sandbox_resource_mgmt is enabled. In the non-static path, adding workload resources here would cause double-counting: once from setup_config() at sandbox creation, and again from update_cpu_resources()/update_mem_resources() when individual containers are added. Guard both additions behind static_sandbox_resource_mgmt, matching the Go runtime's behavior in src/runtime/pkg/oci/utils.go: if sandboxConfig.StaticResourceMgmt { sandboxConfig.HypervisorConfig.NumVCPUsF += sandboxConfig.SandboxResources.WorkloadCPUs sandboxConfig.HypervisorConfig.MemorySize += sandboxConfig.SandboxResources.WorkloadMemMB } Fixes: k8s-sandbox-vcpus-allocation test failure on qemu-snp-runtime-rs Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Made-with: Cursor	2026-04-08 16:36:00 +02:00
Alex Lyn	d6546f2a56	runtime-rs: Remove virtio-9p from configuration*.toml.in As virtio-9p is never supported in runtime-rs, we have more choices to replace it with blockfile snapshotter or erofs snapshotter(in future). It's time to remove its documents and reduce misleading guidance. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-04-07 23:15:39 +02:00
Fabiano Fidêncio	9a5aaf7ecb	runtime-rs: move create_container_timeout before [mem_agent] section The create_container_timeout key was placed after the [agent.@PROJECT_TYPE@.mem_agent] TOML section header, which meant TOML parsed it as a field of mem_agent rather than of the parent agent table. This was silently ignored before, but now that MemAgent has #[serde(deny_unknown_fields)] it causes a parse error. Move the key above the [mem_agent] section so it belongs to the correct [agent.@PROJECT_TYPE@] table. Also fix configuration-qemu-coco-dev which had a duplicate entry: keep only the correctly placed one with the COCO timeout value. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-04-07 11:23:59 +02:00
Fabiano Fidêncio	a6e891e733	runtime-rs: s/dbg_monitor_socket/extra_monitor_socket/g Let's align this with what's been already used for the go runtime. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-04-07 10:50:42 +02:00
Paul Meyer	b32c5234f4	runtime-rs: deny unknown fields in config ..where possible. Failing on unknown fields makes migration easier, as we do not silently ignore configuration options that previously worked in runtime-go. However, serde can't deny unknown fields where flatten is used, so this can't be used everywhere sadly. There were also errors in test fixtures that were unnoticed. These are fixed here, too. Signed-off-by: Paul Meyer <katexochen0@gmail.com> Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-04-07 10:50:25 +02:00
Alex Lyn	d88d7faf4c	runtime-rs: Add support SEV-SNP host data in Clh This commit aims to add support SEV-SNP host data within cloud-hypervisor, which will help pass down the initdata coming from upper layer. In SEV-SNP cases, with it enabled, the Cloud-hypervisor will receive host-data string, otherwise it's set default None. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-04-07 10:30:36 +08:00
alex.lyn	7040c66d07	runtime-rs: Pass protection device config to VM boot This commit completes the integration of the protection device configuration into the Cloud Hypervisor VM boot process. Previously, the `protection_device` returned by `get_shared_devices()` was ignored. This change now correctly extracts the `protection_device` and passes it to the `VmConfig` creation during `boot_vm`. Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2026-04-07 09:53:37 +08:00
alex.lyn	6dde0a91ca	runtime-rs: Process CoCo protection devices for InitData This commit extends the Cloud Hypervisor integration within `runtime-rs` to correctly process and extract CoCo related device configurations. Specifically, it includes: - Adds handling for `DeviceType::Protection` to `add_device`, allowing these devices to be queued for processing. - Retrieve the pending device list, identify and extract `ProtectionDevConfig` and implements the logic to handle the `mrconfigid` from protection device. This ensures that initdata related prameters are properly consumed by the runtime and made it available for Cloud Hypervisor to utilize during the setup of CVM. Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2026-04-07 09:52:02 +08:00
alex.lyn	e6fff6f145	runtime-rs: Pass protection device config to payload conversion This commit implements the conversion logic for the newly introduced CoCo/Initdata configurations, ensuring that `ProtectionDevConfig` data, particularly the `mrconfigid`, is correctly passed. And the main Changes are as below: - Importing `ProtectionDevConfig` for use in conversion logic. - Extracting `protection_device` from `NamedHypervisorConfig` and passing it during the conversion to `VmConfig`. - Updating the `TryFrom` implementation for `PayloadConfig` to accept the `ProtectionDevConfig` and use its `mrconfigid` to populate the `PayloadConfig`'s `mrconfigid` field. - Adjusting unit tests to reflect the updated `PayloadConfig` conversion signature and `mrconfigid` handling. Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2026-04-07 09:49:32 +08:00
alex.lyn	c13f0486b3	runtime-rs: Add protection device and payload configuration Introduces `ProtectionDevConfig` and `mrconfigid` fields to support CoCo/InitData feature in Cloud Hypervisor. This change enables the runtime-rs/cloud-hypervisor to: - Configure and manage 'protection devices' associated with hypervisor instances (Specially when the CVM starts). - Include 'MRCONFIGID' in the payload configuration, allowing integrity verification via initdata of the VM's payload within TEEs. This is a foundational step towards robust CVM support, enhancing the trustworthiness and integrity verification with initdata inside TEE platforms. Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2026-04-07 09:49:30 +08:00
Fabiano Fidêncio	0a739b3b55	Merge pull request #12755 from katexochen/runtime-rs-config-cleanup runtime-rs: cleanup config	2026-04-06 13:14:58 +02:00
Alex Lyn	35cafe8715	runtime-rs: configure TDX machine options with kernel_irqchip=split When TDX confidential guest support is enabled, set `kernel_irqchip=split` for TDX CVM: ... -machine \ q35,accel=kvm,kernel_irqchip=split,confidential-guest-support=tdx \ ... Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-04-05 10:18:47 +02:00
Fabiano Fidêncio	f074ceec6d	Merge pull request #12682 from PiotrProkop/fix-direct-io-kata runtime-rs: fix setting directio via config file	2026-04-03 16:11:57 +02:00
Yuting Nie	517882f93d	runtime-rs: Fix typo in share_fs error message There's a typo in the error message which gets prompted when an unsupported share_fs was configured. Fixed shred -> shared. Signed-off-by: Yuting Nie <yuting.nie@spacemit.com>	2026-04-03 05:23:46 +00:00
Fabiano Fidêncio	09194d71bb	Merge pull request #12767 from nubificus/fix/fc-rs runtime-rs: Fix FC API fields	2026-04-02 18:24:35 +02:00
PiotrProkop	67af63a540	runtime-rs: fix setting directio via config file This fix applies the config file value as a fallback when block_device_cache_direct annotation is not explicitly set on the pod. Signed-off-by: PiotrProkop <pprokop@nvidia.com>	2026-04-01 16:59:04 +02:00
Anastassios Nanos	02c82b174a	runtime-rs: Fix FC API fields A FC update caused bad requests for the runtime-rs runtime when specifying the vcpu count and block rate limiter fields. Signed-off-by: Anastassios Nanos <ananos@nubificus.co.uk>	2026-04-01 14:50:51 +00:00
Amulyam24	dcb7d025c7	runtime-rs: Use libc::TUNSETIFF instead of wrapper TUNSETIFF() While attaching the tap device, it fails on ppc64le with EBADF "cannot create tap device. File descriptor in bad state (os error 77)\"): unknown” Refactor the ioctl call to use the standard libc::TUNSETIFF constant. Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2026-04-01 09:36:45 +01:00
Amulyam24	8d25ff2c36	runtime-rs: implement set_capabilities for qemu After the qemu VM is booted, while storing the guest details, it fails to set capabilities as it is not yet implemented for QEMU, this change adds a default implementation for it. Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2026-04-01 09:36:45 +01:00
Amulyam24	778524467b	runtime-rs: enable building runtime-rs on ppc64le Adds changes in Makefile to build runtime-rs on ppc64le with QEMU support. Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2026-04-01 09:36:45 +01:00
Paul Meyer	086c0ed18e	runtime-rs: cleanup config Some fields were misspelled, misplaced at an outdated path or copied over from runtime-go but aren't supported in runtime-rs. Cleaning them up to avoid confusion and ease migration. Signed-off-by: Paul Meyer <katexochen0@gmail.com>	2026-03-31 10:51:25 +02:00
Alex Lyn	48ef2220e8	docs: Update runtime-rs README with accurate architecture documentation Add comprehensive hypervisor support table (Dragonball, QEMU, Cloud Hypervisor, Firecracker, Remote). Document all runtime handlers (VirtContainer, LinuxContainer, WasmContainer) and resource types. List all configuration files including CoCo variants (TDX, SNP, SE). Add shim-ctl crate to crates table for development tooling reference. Add Feature Flags section documenting dragonball and cloud-hypervisor options. Simplify and restructure content for clarity while preserving technical accuracy. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-03-29 19:17:03 +02:00
Alex Lyn	a747b9f774	docs: Improve and refine hypervisor README documentation Enhance documentation in the hypervisor README.md file with: (1) Standardized terminology and formatting (VMM capitalization) (2) Improved paragraph transitions and logical flow (3) Fixed punctuation errors in code blocks Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-03-29 19:17:03 +02:00
Alex Lyn	65b2a75aca	runtime-rs: Fix typo USE_BUILDIN_DB with USE_BUILTIN_DB Corrects the typo 'BUILDIN' to the standard 'BUILTIN' across the codebase to improve code quality and documentation consistency. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-03-29 19:17:03 +02:00

1 2 3 4 5 ...

1190 Commits