kata-containers

mirror of https://github.com/kata-containers/kata-containers.git synced 2026-07-02 07:02:16 +00:00

Author	SHA1	Message	Date
Fabiano Fidêncio	d3a9669be5	runtime-rs: implement EncryptedEmptyDirVolume Add the core volume handler for block-encrypted emptyDir support in runtime-rs, bringing it to parity with the Go runtime (PR #10559). When emptydir_mode is set to "block-encrypted", host emptyDir bind mounts are intercepted and handled as follows: 1. A sparse disk image (disk.img) is created inside the emptyDir folder, sized to match the host filesystem capacity. 2. A mountInfo.json is written under the kata direct-volume root with volume_type "blk", fs_type "ext4", and metadata encryptionKey=ephemeral. 3. The disk image is plugged into the guest VM as a virtio-blk device via the hypervisor device manager. 4. An agent::Storage is built with driver_options containing encryption_key=ephemeral and shared=true, so the kata-agent delegates formatting and encryption to CDH using LUKS2. The volume is registered in the dispatch chain before the regular block-volume check, and ephemeral disk metadata is tracked for sandbox-level cleanup at teardown. Also re-exports EMPTYDIR_MODE_* constants from kata-types::config so downstream crates can reference them. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Assisted-by: Cursor <cursoragent@cursor.com>	2026-05-14 22:56:11 +02:00
Alex Lyn	1441b2b84a	runtime-rs: Fix warnings in rust runtime Unformatted Rust code in the Rust runtime, its libs, and the agent sources leaves uncommitted changes in the tree whenever `cargo fmt --all` is run. Format the code to remove that noise. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-05-08 14:56:00 +08:00
Alex Lyn	a51e0b630e	agent: Update VFIO device handling for GPU cold-plug Extend the in-guest agent's VFIO device handler to support the cold-plug flow. When the runtime cold-plugs a GPU before the VM boots, the agent needs to bind the device to the vfio-pci driver inside the guest and set up the correct /dev/vfio/ group nodes so the workload can access the GPU. This updates the device discovery logic to handle the PCI topology that QEMU presents for cold-plugged vfio-pci devices and ensures the IOMMU group is properly resolved from the guest's sysfs. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com> Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-05-07 10:33:26 +02:00
Greg Kurz	bb933f65e4	vendor: Remove `make vendor` across the repo `make vendor` isn't required anymore. People who need vendored code should use the `tools/packaging/release/generate_vendor.sh` script instead. Assisted-by: Claude AI Signed-off-by: Greg Kurz <groug@kaod.org>	2026-05-06 09:49:52 +02:00
Markus Rudy	044c96a9d6	agent: remove standard-oci-runtime feature This feature was only added for runk, which was removed entirely in `96e1fb4ca6`. Fixes: #12849 Signed-off-by: Markus Rudy <mr@edgeless.systems>	2026-04-28 10:35:14 +02:00
Spyros Seimenis	d7385eee99	genpolicy: make FileType::from portable across Darwin libc::S_IF* are u16 on Darwin/BSD and u32 on Linux. The match in FileType::from and its tests mix both widths and don't compile on Darwin. Cast everything to u32; on Linux that's a no-op, hence the clippy::unnecessary_cast allow (rust-lang/rust-clippy#6466). Fixes: #12916 Signed-off-by: Spyros Seimenis <sse@edgeless.systems>	2026-04-27 12:14:04 +03:00
Steve Horsman	d5785b4eba	Merge pull request #12872 from stevenhorsman/bump-rust-to-1.93 Bump rust to 1.93	2026-04-27 09:01:00 +01:00
Fabiano Fidêncio	74d9d043f0	agent: raise regorus policy length limits regorus 0.9.0 introduced a hard, per-engine ceiling on parsed-policy size (1024 columns / 1 MiB / 20 000 lines, see lexer.rs:30 in microsoft/regorus). The 1024-column cap rejects realistic policies emitted by `genpolicy`: the `NVIDIA_REQUIRE_CUDA` environment variable on `nvcr.io/nvidia/k8s/cuda-sample` is roughly 1.3 KiB on a single line, so the agent's `set_policy()` returns an error, the agent (PID 1) exits, the guest kernel reboots, and the runtime eventually times out connecting to the agent's vsock. regorus PR #624 ("feat: make policy length limits configurable per engine") adds `Engine::set_policy_length_config`, but it has not been released yet -- the latest published version is still 0.9.1, which predates that change. Pin `regorus` to the upstream commit that includes #624 and call the new setter from `AgentPolicy::new_engine()` with values that comfortably fit any policy we expect to evaluate (64 KiB per line, 16 MiB per file, 200 000 lines) while still rejecting pathological/minified input. Once a regorus release > 0.9.1 ships with #624, the dependency can be moved back to crates.io. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-04-26 10:18:26 +02:00
Markus Rudy	c8fe6a60d0	genpolicy: update regorus to 0.9.1 The version we used before was released in 2024, it's about time to use a newer version. The new version of the crate comes with a license, which addresses a `cargo deny` finding. Signed-off-by: Markus Rudy <mr@edgeless.systems>	2026-04-26 10:18:26 +02:00
stevenhorsman	d1a20b1887	agent: Fix let_unit_value warning in pipestream tests Remove unnecessary let binding for unit value expression to fix clippy warning in Rust 1.93. Assisted-by: IBM Bob Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-04-25 11:27:39 +01:00
stevenhorsman	7ab2f0eeb6	agent: Fix needless_borrow warning in container tests Remove unnecessary reference operator from expression that is immediately dereferenced by the compiler to fix clippy warning in Rust 1.93. Assisted-by: IBM Bob Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-04-25 11:27:39 +01:00
stevenhorsman	f6b694eac3	agent: Fix bool_assert_comparison warnings in rustjail tests Replace assert_eq! with literal bool values with assert! or assert! with negation for more idiomatic assertions to fix clippy warnings in Rust 1.93. Assisted-by: IBM Bob Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-04-25 11:27:39 +01:00
stevenhorsman	d6adb912d9	agent: Fix unnecessary_cast warnings Replace 'as u8' casts with type suffix literals (_u8) for binary literals to fix clippy warnings in Rust 1.93. Assisted-by: IBM Bob Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-04-25 11:27:39 +01:00
stevenhorsman	dd9cca74e7	agent: Fix search_is_some warning in rustjail tests Replace .iter().any(\|&ap\| ap == p) with .contains(&p) for more idiomatic code to fix clippy warning in Rust 1.93. Assisted-by: IBM Bob Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-04-25 11:27:39 +01:00
stevenhorsman	395804cc98	agent: Fix needless_borrow warnings in rustjail tests Remove unnecessary reference operators from format!() calls passed to Command::arg() to fix clippy warnings in Rust 1.93. Assisted-by: IBM Bob Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-04-25 11:27:39 +01:00
stevenhorsman	7554502af7	agent: Fix useless_vec warnings in rustjail tests Replace vec![] with array literals [] for immutable test data to fix clippy warnings in Rust 1.93. Assisted-by: IBM Bob Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-04-25 11:27:39 +01:00
stevenhorsman	4dfc0eb101	agent: Fix non_octal_byte_escapes warning in rustjail tests Replace octal escape sequences (\040) with hex escape sequences (\x20) for space characters in mountinfo test data to fix clippy warning in Rust 1.93. Assisted-by: IBM Bob Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-04-25 11:27:39 +01:00
stevenhorsman	2092127210	agent: Fix octal_escapes warning in rustjail tests Replace decimal literal with cast (0660 as u32) with proper octal literal syntax (0o660) to fix clippy warning in Rust 1.93. Assisted-by: IBM Bob Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-04-25 11:27:39 +01:00
stevenhorsman	2413ef55dd	agent: Remove unnecessary unwrap Replace is_some() check followed by unwrap() with if let pattern to address clippy::unnecessary_unwrap warning in Rust 1.93. Assisted-by: IBM Bob Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-04-25 11:27:38 +01:00
Fabiano Fidêncio	966e9b7f80	agent: skip non-PCI addresses in PCIDEVICE env vars Device plugins may set PCIDEVICE_* environment variables with non-PCI identifiers (e.g. "mlx5_core.sf.10" for mlx5 Scalable Functions). The update_env_pci() function assumed all values were PCI BDF addresses and failed to parse them, causing container creation to fail with: "PCI address mlx5_core.sf.10 should have the format DDDD:BB:SS.F" Skip PCIDEVICE_* entries whose values don't parse as PCI addresses, leaving them untouched for the workload. The corresponding _INFO variable is also left as-is since no mapping is collected. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-04-25 12:26:20 +02:00
Markus Rudy	639ff3578d	genpolicy: restrict symlinks in CopyFile Allowing arbitrary symlinks in the shared directory is unsafe for confidential VM use cases. In order to make CopyFile safe both for the VM and for the consuming containers, we implement the following rules for symlinks (in addition to the existing rules for other files): 1. Symlinks may not be placed directly into the shared directory. 2. Symlinks must not point 'upwards', i.e. contain `..` as a path element. 3. Symlinks must be relative. These rules ensure that all writes initiated by CopyFile are restricted to the shared directory (protecting the VM), and that symlinks can't point outside their mount points (protecting the container). These new restrictions mean that we can't support arbitrary mount sources (which might not follow these rules), but the usual k8s suspects (ConfigMap, Secret, ServiceAccountToken) should still pass. In order to aid writing the policy, we convert the CopyFileRequest to a structure that does not contain binary data, but well-defined strings and types. Signed-off-by: Markus Rudy <mr@edgeless.systems>	2026-04-22 15:46:12 +02:00
Markus Rudy	d6bd666b3f	agent: fix naming for symlinks in CopyFile The agent referred to the `data` field of an incoming CopyFileRequest as the 'src'. This is misleading, because 'source' is not mentioned in the specification (where links are just a path with attached bytes), and because the documentation for the `ln` utility calls the path LINK_NAME and the data TARGET. This commit fixes the glitch and calls the first argument to `symlinkat` the target. Signed-off-by: Markus Rudy <mr@edgeless.systems>	2026-04-22 15:46:12 +02:00
Markus Rudy	5c362adcff	agent: add required features for standalone build Building the kata-agent-policy crate only succeeded when its parents (agent and genpolicy) pulled in the required features. This commit adds the required features to the crate itself, such that it can be built standalone and IDEs don't show errors while browsing it. Signed-off-by: Markus Rudy <mr@edgeless.systems>	2026-04-22 15:46:12 +02:00
Alex Lyn	ce3473d272	agent: Kill processes before removing container directory in destroy() When using multi-layer EROFS snapshotter, the destroy() method fails to kill container processes, causing process leaks in shared PID namespace scenarios. Problem Background: 1. Multi-layer EROFS creates temporary mount points under the container's root directory: - /run/kata-containers/<cid>/multi-layer/upper (ext4, writable) - /run/kata-containers/<cid>/multi-layer/lower-0 (EROFS, read-only) 2. The original destroy() method executed in this order: (1) umount rootfs (2) fs::remove_dir_all(&self.root) <- FAILS with "Read-only file system" (3) cgroup cleanup and process killing <- NEVER EXECUTED 3. When remove_dir_all() encounters the read-only EROFS mount point, it returns EROFS error (os error 30), causing destroy() to exit early without killing processes. Why This Fix: 1. The test case k8s-kill-all-process-in-container.bats creates an init container with a background process (tail -f /dev/null), expecting it to be killed when the init container is destroyed. 2. With shared PID namespace (shareProcessNamespace: true), the orphaned process continues running, causing the test to fail. Solution: 1. Reorder the destroy() method to kill processes BEFORE attempting to remove the container directory: (1) Get PIDs from cgroup and send SIGKILL (2) Destroy cgroup (3) umount rootfs (4) fs::remove_dir_all(&self.root) 2. This ensures processes are always killed regardless of filesystem cleanup status, matching the behavior of overlayfs snapshotter. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-04-19 13:24:31 +02:00
Alex Lyn	c745d18e00	agent: Add virtio-scsi for multilayer erofs storage handler Add support for the virtio-scsi driver when handling vmdk and rwlayer storage in kata-agent. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-04-19 13:24:31 +02:00
Alex Lyn	37a542c20f	agent: Refactor multi-layer EROFS handling with unified flow Refactor the multi-layer EROFS storage handling to improve code maintainability and reduce duplication. Key changes: (1) Extract update_storage_device() to unify device state management for both multi-layer and standard storages (2) Simplify handle_multi_layer_storage() to focus on device creation, returning MultiLayerProcessResult struct instead of managing state (3) Unify the processing flow in add_storages() with clear separation: (4) Support multiple EROFS lower layers with dynamic lower-N mount paths (5) Improve mkdir directive handling with deferred {{ mount 1 }} resolution This reduces code duplication, improves readability, and makes the storage handling logic more consistent across different storage types. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-04-19 13:24:31 +02:00
Alex Lyn	27c59f15a0	agent: Register MultiLayerErofsHandler and process multiple EROFS Introduce MultiLayerErofsHandler and method of handle_multi_layer_storage for multi-layer storage: (1) Register MultiLayerErofsHandler to STORAGE_HANDLERS to handle multi-layer EROFS storage with driver type 'multi-layer-erofs'. (2) Add handle_multi_layer_erofs function to process multiple EROFS storages with X-kata.multi-layer marker together in guest. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-04-19 13:24:31 +02:00
Alex Lyn	6ce9180333	agent: Add support for EROFS rootfs handling in kata-agent Add multi_layer_erofs.rs, implementing the guest-side processing logic for multi-layer EROFS rootfs with overlay mount support. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-04-19 13:24:31 +02:00
Fabiano Fidêncio	64c139208f	agent: add GetDiagnosticData RPC with termination log support Add a new extensible GetDiagnosticData RPC that retrieves diagnostic information from the guest VM. The request carries a log_type string field to specify what kind of data is requested, and a container_id field to identify the target container. The first supported log_type is "termination_log", which reads the Kubernetes termination message file from inside the guest. This is needed for shared_fs=none configurations where the host cannot directly access the guest filesystem. On the Go runtime side, the container stop() path now calls GetDiagnosticData to copy the termination message to the host when running with NoSharedFS and the terminationMessagePolicy annotation is set to "File". The call is best-effort: failures are logged as warnings rather than blocking container teardown. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Signed-off-by: Silenio Quarti <silenio_quarti@ca.ibm.com>	2026-04-17 13:01:13 +02:00
Fabiano Fidêncio	36a2d8e7f2	agent: Make launch_process_timeout configurable The hardcoded DEFAULT_LAUNCH_PROCESS_TIMEOUT of 6 seconds in the kata agent is insufficient for environments with NVIDIA GPUs and NVSwitches, where the attestation-agent needs significantly more time to collect evidence during initialization (e.g. ~2 seconds per NVSwitch). When the timeout expires, the agent (PID 1) exits with an error, causing the guest kernel to perform an orderly shutdown before the attestation-agent has finished starting. Make this timeout configurable via the kernel parameter agent.launch_process_timeout (in seconds), preserving the 6-second default for backward compatibility. The Go runtime is wired up to pass this value from the TOML config's [agent.kata] section through to the kernel command line. The NVIDIA GPU configs set the new default to 15 seconds. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Made-with: Cursor	2026-04-10 14:47:01 +02:00
Alex Lyn	2bac201364	agent: Remove virtio-9p storage handler Remove the Virtio9pHandler implementation and its registration from the storage handler manager: (1) Remove Virtio9pHandler struct and StorageHandler implementation. (2) Remove DRIVER_9P_TYPE and Virtio9pHandler from STORAGE_HANDLERS registration. (3) Update watcher.rs comments to remove 9p references. This completes the removal of virtio-9p support in the agent. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-04-07 23:15:39 +02:00
stevenhorsman	5390e470d3	agent: Remove Cargo.lock Following on from #12690, the agent is part of the repo workspace, so no longer needs a lock file. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-04-01 09:11:28 +01:00
Jiahao Wang	1163b6581f	agent: Change TARGET_PATH to root workspace After the agent was moved to the root workspace, the build products are now under the repo root. Change TARGET_PATH accordingly to tell the Makefile where to look up the output. Signed-off-by: Jiahao Wang <jiahao.wang@lingcage.com>	2026-03-29 06:35:54 +00:00
Jiahao Wang	29e5d5d951	build: Move agent to root workspace This commit adds the kata agent to the root workspace, as a follow-up to #12413. Remove the agent from the exclude list and make it a member of the root workspace. Signed-off-by: Jiahao Wang <jiahao.wang@lingcage.com>	2026-03-29 06:35:38 +00:00
Fabiano Fidêncio	aa6890eae1	Merge pull request #12675 from manuelh-dev/mahuber/cdh-storage-options agent: add mkfs_opts parameter to cdh_secure_mount	2026-03-23 15:18:38 +01:00
stevenhorsman	38a655487f	vsock-exporter: Switch bincode for serde_json bincode is not maintained, so switch to serde_json to resolve RUSTSEC-2025-0141 Assisted-By: Bob Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-03-19 10:45:17 +00:00
stevenhorsman	e1d7d5bef8	agent: Remove async-std It's a dev-dependency that doesn't seem to be used, so remove it and resolve RUSTSEC-2025-0052 Assisted-By: Bob Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-03-19 10:45:17 +00:00
stevenhorsman	e4eda5e1d8	agent: Bump tracing-subscriber - Bump tracing-subscriber to 0.3.20 to resolve RUSTSEC-2025-0055 - Switch deprecated `slog_info!` for `slog::info!` Generated-By: Bob Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-03-19 10:45:17 +00:00
Manuel Huber	62d74bb1fd	agent: add mkfs_opts parameter to cdh_secure_mount Add an mkfs_opts parameter to cdh_secure_mount so that its users can parametrize these options depending on their needs. For now, there are two users providing explicit values (container image layer storage and container data storage features). Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-03-18 15:37:32 -07:00
stevenhorsman	501578cc5a	agent: Remove non-idiomatic unwrap Calling .unwrap() after an .is_some() check is considered non-idiomatic, as it performs redundant work and makes the code more verbose. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-03-17 16:04:58 +00:00
Manuel Huber	169f92ff09	agent: cdh: Update CDH and API With the new CDH version, the secure_mount API changes. Further, the new CDH version no longer uses the luks-encrypt-storage script but utilizes libcryptsetup as well as mkfs.ext4 and dd. Hence, adapt some of the CDH and Kata components build steps Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-03-16 09:43:17 -07:00
Aurélien Bombo	9fe03fb170	genpolicy: Support trusted ephemeral data storage * Introduces a new cluster_config setting encrypted_emptydir defaulting to true. * Adapts genpolicy for encrypted emptyDirs. Crucially, the rules.rego change checks that the mount and the storage are well-formed together: * i_storage.source matches a known regex. * i_storage.mount_point == $(spath)/BASE64(i_storage.source) * i_storage.mount_point == p_storage.mount_point * i_storage.mount_point == i_mount.source Note that policy enforcement is necessary to prevent rogue device injection. E.g. the agent could not blindly encrypt all block devices as some use cases only need dm-verity. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2026-03-09 14:52:17 -05:00
Aurélien Bombo	eaa711617e	agent: Support trusted ephemeral data storage Handles block-based emptyDirs plugged via virtio-blk and virtio-scsi by encrypting and formatting them. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2026-03-09 14:52:17 -05:00
stevenhorsman	8177a440ca	libs: Remove unused crates Remove unused crates to reduce our size and the work needed to do updates Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-02-26 09:37:46 +00:00
Alex Lyn	d298df7014	kata-types: Add cross-platform host_memory_mib() helper for host memory Introduce host_memory_mib() with OS-specific implementations (Linux/Android via nix::sysinfo, macOS via sysctl) selected at compile time. This improves portability and allows consistent host memory sizing/validation across different platforms. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-02-25 21:04:26 +08:00
Alex Lyn	b3d60698af	runtime-rs: move host memory adjustment into MemoryInfo using nix sysinfo Memory-related information is serialized at sandbox initialization, specifically when the configuration TOML is parsed, so this commit refactors the MemoryInfo initialization logic: (1) Remove memory sizing/host-memory adjustment logic from QEMU cmdline Memory::new() (2) Initialize/adjust memory values via kata-types MemoryInfo (single source of truth) (3) Replace sysinfo::System::new_with_specifics with nix::sys::sysinfo::sysinfo() to get host RAM Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-02-25 19:32:44 +08:00
Jacek Tomasiak	8025fa0457	agent: Don't pass empty options to mount With some older kernels some fs implementations don't handle empty options strings well. This leads to failures in "setup rootfs" step. E.g. `cgroup: cgroup2: unknown option ""`. This is fixed by mapping empty string to `None` before passing to `nix::mount`. Signed-off-by: Jacek Tomasiak <jtomasiak@arista.com> Signed-off-by: Jacek Tomasiak <jacek.tomasiak@gmail.com>	2026-02-16 14:55:59 +01:00
stevenhorsman	90dbd3f562	agent: Bump rkyv version to 0.7.46 Bump to remediate RUSTSEC-2026-0001 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-02-14 00:33:45 +01:00
stevenhorsman	ffcb10b6a3	agent: Bump time crate to 0.3.47 Update time to resolve CVE-2026-25727. Note: this involved bumping the versions of slog-term and slog-json and bumping the MSRV to 1.88.0 which time 0.3.47 requires. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-02-09 21:44:51 +01:00
stevenhorsman	e49a61eea2	agent: Bump bytes to 1.11.1 To remediate CVE-2026-25541 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-02-09 21:43:23 +01:00
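
Commit 639ff3578d above lists three rules for symlinks in CopyFile. Rules 2 and 3 (no `..` path element, relative targets only) can be illustrated with a minimal Rust sketch. The function name `symlink_target_allowed` is hypothetical; per the commit message the real checks live in the generated policy, not in a helper like this. Rule 1 is not covered here because it depends on the link's own path rather than its target.

```rust
use std::path::{Component, Path};

/// Hypothetical helper: returns true when a symlink target is relative
/// (rule 3) and contains no `..` path element (rule 2).
fn symlink_target_allowed(target: &str) -> bool {
    let path = Path::new(target);
    // `components()` does not resolve `..`, so each one shows up as
    // `Component::ParentDir` and can be rejected directly.
    path.is_relative()
        && !path
            .components()
            .any(|component| matches!(component, Component::ParentDir))
}
```

Note that a name such as `..data` is an ordinary path element rather than a parent-directory reference, so a target like `..data/token` passes this check.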

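
Commit 966e9b7f80 above skips PCIDEVICE_* values that do not parse as PCI addresses in the `DDDD:BB:SS.F` format quoted in its error message. The following is a rough sketch of such a format check, assuming fixed-width hexadecimal fields and a function number of 0 to 7. The name `is_pci_bdf` is hypothetical, and the agent's actual parser may accept or reject different inputs.

```rust
/// Hypothetical check: does `value` look like a PCI address in the
/// DDDD:BB:SS.F form (domain:bus:slot.function)?
fn is_pci_bdf(value: &str) -> bool {
    let Some((domain, rest)) = value.split_once(':') else {
        return false;
    };
    let Some((bus, rest)) = rest.split_once(':') else {
        return false;
    };
    let Some((slot, function)) = rest.split_once('.') else {
        return false;
    };
    // Each field must have the exact width and contain only hex digits.
    let is_hex = |field: &str, width: usize| {
        field.len() == width && field.chars().all(|c| c.is_ascii_hexdigit())
    };
    is_hex(domain, 4)
        && is_hex(bus, 2)
        && is_hex(slot, 2)
        && matches!(function.as_bytes(), [b'0'..=b'7'])
}
```

With a check like this, a caller could leave a value such as `mlx5_core.sf.10` untouched instead of failing container creation, which is the behavior the commit describes.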

1445 Commits