kata-containers

mirror of https://github.com/kata-containers/kata-containers.git synced 2025-10-22 20:39:41 +00:00

Author	SHA1	Message	Date
Alex Lyn	4c386b51d9	runtime-rs: Add support for handling virtio-scsi devices As virtio-scsi has been set the default block device driver, the runtime also need to correctly handle the virtio-scsi info, specially the SCSI address required within kata-agent handling logic. And getting and assigning the scsi_addr to kata agent device id will be enough. This commit just do such work. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2025-10-10 11:31:04 +08:00
stevenfryto	bde6eb7c3a	runtime-rs: add generic support for running the VMM in non-root mode This commit introduces generic support for running the VMM in rootless mode in runtime-rs: 1.Detect whether the VMM is running in rootless mode. 2.Before starting the VMM process, create a non-root user and launch the VMM with that user’s UID and GID; also add the KVM user's group ID to the VMM process's supplementary groups so the VMM process can access /dev/kvm. 3.Add the setup of the rootless directory located in the dir /run/user/<uid> directory, and modify some path variables to be functions that return the path with the rootless directory prefix when running in rootless mode. Fixes: #11414 Signed-off-by: stevenfryto <sunzitai_1832@bupt.edu.cn>	2025-09-25 19:30:29 +08:00
Alex Lyn	4e793d635e	Merge pull request #11736 from kata-containers/enhance-copyfile runtime-rs: Enhance copyfile when sharedfs is disabled	2025-09-23 14:15:44 +08:00
Alex Lyn	5dd36c6c0f	runtime-rs: Correctly set permission and mode for dir when copy files Correctly set dir's permissions and mode. This update ensures: The dir_mode field of CopyFileRequest is set to DIR_MODE_PERMS (equivalent to Go's 0o750 \| os.ModeDir), which is primarily used for the top-level directory creation permissions. The file_mode field now directly uses metadata.mode() (equivalent to Go's st.Mode) for the target entry. This change aims to resolve potential permission issues or inconsistencies during directory and file creation within the guest environment by precisely matching the expected mode propagation of the Kata agent. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2025-09-22 17:59:57 +08:00
Alex Lyn	429133cedb	runtime-rs: Introduce shared FS volume management in VolumeResource The core purpose of introducing volume_manager to VolumeResource is to centralize the management of shared file system volumes. By creating a single VolumeManager instance within VolumeResource, all shared file volumes are managed by one central entity. This single volume_manager can accurately track the references of all ShareFsVolume instances to the shared volumes, ensuring correct reference counting, proper volume lifecycle management, and preventing issues like volumes being overwritten. This new design ensures that all shared volumes are managed by a central entity, which: (1) Guarantees correct reference counting. (2) Manages the volume lifecycle correctly, avoiding issues like volumes being overwritten. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2025-09-22 15:03:41 +08:00
Alex Lyn	90c99541da	runtime-rs: Integrate VolumeManager into ShareFsVolume lifecycle This commit integrates the new `VolumeManager` into the `ShareFsVolume` lifecycle. Instead of directly copying files, `ShareFsVolume::new` now uses the `VolumeManager` to get a guest path and determine if the volume needs to be copied. It also updates the `cleanup` function to release the volume's reference count, allowing the `VolumeManager` to manage its state and clean up resources when no longer in use. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2025-09-22 15:03:27 +08:00
Alex Lyn	e73daa2f14	runtime-rs: Add sandbox level volume manager within non-sharedfs This commit introduces a new `VolumeManager` to track the state of shared volumes, including their reference count and its corresponding container ids. The manager's goal is to handle the lifecycle of shared filesystem volumes, including: (1) Volume State Tracking: Tracks the mapping from host source paths to guest destination paths. (2) Reference Counting: Manages reference counts for each volume, preventing premature cleanup when multiple containers share the same source. (3) Deterministic guest paths: Generates unique guest paths using random string to avoid naming conflicts. (4) Improved Management: Provides a centralized way to handle volume creation, copying, and release, including aborting file watchers when volumes are no longer in use. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2025-09-22 14:45:16 +08:00
Alex Lyn	313c7313f0	runtime-rs: Refactor code to improve copyfile logic and readability This commit refactors the `CopyFile` related code to streamline the logic for creating guest directories and make the code structure clearer. Its main goal is to improve the overall maintainability and facilitate future feature extensions. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2025-09-22 11:30:47 +08:00
Alex Lyn	f36377070a	runtime-rs: Enhance Copyfile to ensure existing contents synchronized This commit is designed to perform a full sync before starting monitoring to ensure that files which exist before monitoring starts are also synced. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2025-09-22 11:30:35 +08:00
Alex Lyn	5ca403b5d9	runtime-rs: Allow per-device AIO mode configuration for block devices This commit enhances control over block device AIO modes via hotplug. Previously, hotplugging block devices was set with default AIO mode (io_uring). Even if users reset the AIO mode in the configuration file, the changes would not be correctly applied to individual block devices. With this update, users can now explicitly configure the AIO mode for hot-plugging block devices via the configuration, and those settings will be correctly applied. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2025-09-22 10:13:44 +08:00
Alex Lyn	425e93a9b8	runtime-rs: Get more block device info within Device Manager We need more information about block device, just relapce the original method get_block_driver with get_block_device_info and return its BlockDeviceInfo. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2025-09-22 10:13:44 +08:00
Fupan Li	4a92fc1129	runtime-rs: add the sandbox's shm volume support Docker containers support specifying the shm size using the --shm-size option and support sandbox-level shm volumes, so we've added support for shm volumes. Since Kubernetes doesn't support specifying the shm size, it typically uses a memory-based emptydir as the container's shm, and its size can be specified. Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>	2025-09-16 16:32:41 +02:00
Alex Lyn	04dedda6ed	runtime-rs: Bugfix for kata virtual volume overlay fstype As prvious configure with overlayfs is incorrect, which causes the agent policy validation failure. And it's also different with runtime-go's configuration. In this patch, we'll correct its fstype with overlay and align with runtime on this matter. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2025-09-14 16:38:09 +08:00
Hyounggyu Choi	150c90e32a	Merge pull request #11728 from BbolroC/fix-sealed-secret-volume runtime-rs: Adjust path for sealed secret mount check	2025-09-02 16:57:33 +02:00
Hyounggyu Choi	65fdb18c96	runtime-rs: Adjust path for sealed secret mount check Mount validation for sealed secret requires the base path to start with `/run/kata-containers/shared/containers`. Previously, it used `/run/kata-containers/sandbox/passthrough`, which caused test failures where volume mounts are used. This commit renames the path to satisfy the validation check. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2025-08-28 15:38:07 +02:00
Caspian443	617af4cb3b	runtime-rs: Empty block-rootfs Storage.options and align with Go runtime - Set guest Storage.options for block rootfs to empty (do not propagate host mount options). - Align behavior with Go runtime: only add xfs nouuid when needed. Signed-off-by: Caspian443 <scrisis843@gmail.com>	2025-08-26 01:27:21 +00:00
Alex Lyn	903e608c23	runtime-rs: Add only static ARP entries with handle_neighours To make it aligned with runtime-go, we need add only static ARP entries into the targets. Fixes #11697 Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2025-08-19 20:09:20 +08:00
Pavel Mores	e2156721fd	runtime-rs: add tests to exercise floating-point 'default_vcpus' Also included (as commented out) is a test that does not pass although it should. See source code comment for explanation why fixing this seems beyond the scope of this PR. Signed-off-by: Pavel Mores <pmores@redhat.com>	2025-08-07 10:32:44 +02:00
Pavel Mores	1f95d9401b	runtime-rs: change representation of default_vcpus from i32 to f32 This commit focuses purely on the formal change of type. If any subsequent changes in semantics are needed they are purposely avoided here so that the commit can be reviewed as a 100% formal and 0% semantic change. Signed-off-by: Pavel Mores <pmores@redhat.com>	2025-08-07 10:32:44 +02:00
Pavel Mores	cdc0eab8e4	runtime-rs: make sandbox vcpu allocation more accurate This commit addresses a part of the same problem as PR #7623 did for the golang runtime. So far we've been rounding up individual containers' vCPU requests and then summing them up which can lead to allocation of excess vCPUs as described in the mentioned PR's cover letter. We address this by reversing the order of operations, we sum the (possibly fractional) container requests and only then round up the total. We also align runtime-rs's behaviour with runtime-go in that we now include the default vcpu request from the config file ('default_vcpu') in the total. We diverge from PR #7623 in that `default_vcpu` is still treated as an integer (this will be a topic of a separate commit), and that this implementation avoids relying on 32-bit floating point arithmetic as there are some potential problems with using f32. For instance, some numbers commonly used in decimal, notably all of single-decimal-digit numbers 0.1, 0.2 .. 0.9 except 0.5, are periodic in binary and thus fundamentally not representable exactly. Arithmetics performed on such numbers can lead to surprising results, e.g. adding 0.1 ten times gives 1.0000001, not 1, and taking a ceil() results in 2, clearly a wrong answer in vcpu allocation. So instead, we take advantage of the fact that container requests happen to be expressed as a quota/period fraction so we can sum up quotas, fundamentally integral numbers (possibly fractional only due to the need to rewrite them with a common denominator) with much less danger of precision loss. Signed-off-by: Pavel Mores <pmores@redhat.com>	2025-08-07 10:32:44 +02:00
Xuewei Niu	6f6d64604f	Merge pull request #11598 from justxuewei/cgroups	2025-07-25 17:53:03 +08:00
Xuewei Niu	60e3679eb7	runtime-rs: Add full cgroups support on host Add full cgroups support on host. Cgroups are managed by `FsManager` and `SystemdManager`. As the names impies, the `FsManager` manages cgroups through cgroupfs, while the `SystemdManager` manages cgroups through systemd. The two manages support cgroup v1 and cgroup v2. Two types of cgroups path are supported: 1. For colon paths, for example "foo.slice:bar:baz", the runtime manages cgroups by `SystemdManager`; 2. For relative/absolute paths, the runtime manages cgroups by `FsManager`. vCPU threads are added into the sandbox cgroups in cgroup v1 + cgroupfs, others, cgroup v1 + systemd, cgroup v2 + cgroupfs, cgroup v2 + systemd, VMM process is added into the cgroups. The systemd doesn't provide a way to add thread to a unit. `add_thread()` in `SystemdManager` is equivalent to `add_process()`. Cgroup v2 supports threaded mode. However, we should enable threaded mode from leaf node to the root node (`/`) iteratively [1]. This means the runtime needs to modify the cgroups created by container runtime (e.g. containerd). Considering cgroupfs + cgroup v2 is not a common combination, its behavior is aligned with systemd + cgroup v2, which is not allowed to manage process at the thread level. 1: https://www.kernel.org/doc/html/v4.18/admin-guide/cgroup-v2.html#threads Fixes: #11356 Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2025-07-25 14:52:55 +08:00
alex.lyn	613dba6f1f	runtime-rs: Some extra work to enhance copyfile with sharedfs disabled As some reasons, it first should make it align with runtime-go, this commit will do this work. Fixes #11543 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2025-07-25 11:39:20 +08:00
alex.lyn	fbd84fd3f4	runtime-rs: Support virtio-scsi device within handle_block_volume It supports handling scsi device when block device driver is `scsi`. And it will ensure a correct storage source with LUN. Fixes #11516 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2025-07-24 11:57:00 +08:00
alex.lyn	56c0c172fa	runtime-rs: Fix initdata length field missing when create block The init data could not be read properly within kata-agent because the data length field was omitted, a consequence of a mismatch in the data write format. Fixes #11556 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2025-07-15 19:22:17 +08:00
alex.lyn	8f8b196705	runtime-rs: refactor merging metadata within image_pull refactor implementation for merging metadata. Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2025-07-03 17:07:08 +08:00
alex.lyn	7a59d7f937	runtime-rs: Import the public const value from libs Introduce a const value `KATA_VIRTUAL_VOLUME_PREFIX` defined in the libs/kata-types, and it'll be better import such const value from there. Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2025-07-03 09:42:17 +08:00
Champ-Goblem	91cadb7bfe	runtime-rs: Fix calculation of odd memory sizes An odd memory size leads to the runtime breaking during its startup, as shown below: ``` Warning FailedCreatePodSandBox 34s kubelet Failed to create pod sandbox: rpc error: code = Unknown desc = failed to start sandbox "708c81910f4e67e53b4170b6615083339b220154cb9a0c521b3232cdb40d50f9": failed to create containerd task: failed to create shim task: Others("failed to handle message start sandbox in task handler\n\nCaused by:\n 0: start vm\n 1: set vm base config\n 2: set vm configuration\n 3: Failed to set vm configuration VmConfigInfo { vcpu_count: 2, max_vcpu_count: 16, cpu_pm: \"on\", cpu_topology: CpuTopology { threads_per_core: 1, cores_per_die: 1, dies_per_socket: 1, sockets: 1 }, vpmu_feature: 0, mem_type: \"shmem\", mem_file_path: \"\", mem_size_mib: 4513, serial_path: Some(\"/run/kata/708c81910f4e67e53b4170b6615083339b220154cb9a0c521b3232cdb40d50f9/console.sock\"), pci_hotplug_enabled: true }\n 4: vmm action error: MachineConfig(InvalidMemorySize(4513))\n\nStack backtrace:\n 0: anyhow::error::<impl anyhow::Error>::msg\n 1: hypervisor::dragonball::vmm_instance::VmmInstance::handle_request\n 2: hypervisor::dragonball::vmm_instance::VmmInstance::set_vm_configuration\n 3: hypervisor::dragonball::inner::DragonballInner::set_vm_base_config\n 4: <hypervisor::dragonball::Dragonball as hypervisor::Hypervisor>::start_vm::{{closure}}::{{closure}}\n 5: <hypervisor::dragonball::Dragonball as hypervisor::Hypervisor>::start_vm::{{closure}}\n 6: <virt_container::sandbox::VirtSandbox as common::sandbox::Sandbox>::start::{{closure}}::{{closure}}\n 7: <virt_container::sandbox::VirtSandbox as common::sandbox::Sandbox>::start::{{closure}}\n 8: runtimes::manager::RuntimeHandlerManager::handler_task_message::{{closure}}::{{closure}}\n 9: runtimes::manager::RuntimeHandlerManager::handler_task_message::{{closure}}\n 10: <service::task_service::TaskService as containerd_shim_protos::shim::shim_ttrpc_async::Task>::create::{{closure}}\n 11: <containerd_shim_protos::shim::shim_ttrpc_async::CreateMethod as ttrpc::asynchronous::utils::MethodHandler>::handler::{{closure}}\n 12: <tokio::time::timeout::Timeout<T> as core::future::future::Future>::poll\n 13: ttrpc::asynchronous::server::HandlerContext::handle_msg::{{closure}}\n 14: <core::future::poll_fn::PollFn<F> as core::future::future::Future>::poll\n 15: <ttrpc::asynchronous::server::ServerReader as ttrpc::asynchronous::connection::ReaderDelegate>::handle_msg::{{closure}}::{{closure}}\n 16: tokio::runtime::task::core::Core<T,S>::poll\n 17: tokio::runtime::task::harness::Harness<T,S>::poll\n 18: tokio::runtime::scheduler::multi_thread::worker::Context::run_task\n 19: tokio::runtime::scheduler::multi_thread::worker::Context::run\n 20: tokio::runtime::context::runtime::enter_runtime\n 21: tokio::runtime::scheduler::multi_thread::worker::run\n 22: <tokio::runtime::blocking::task::BlockingTask<T> as core::future::future::Future>::poll\n 23: tokio::runtime::task::core::Core<T,S>::poll\n 24: tokio::runtime::task::harness::Harness<T,S>::poll\n 25: tokio::runtime::blocking::pool::Inner::run\n 26: std::sys::backtrace::__rust_begin_short_backtrace\n 27: core::ops::function::FnOnce::call_once{{vtable.shim}}\n 28: std::sys::pal::unix:🧵:Thread:🆕:thread_start") ``` As we cannot control what the users will set, let's just round it up to the next acceptable value. Signed-off-by: Champ-Goblem <cameron@northflank.com> Signed-off-by: Fabiano Fidêncio <fidencio@northflank.com>	2025-06-28 14:29:18 +02:00
Fupan Li	48c8e0f296	runtime-rs: fix the issue return the wrong volume In the pre commit:74eccc54e7b31cc4c9abd8b6e4007c3a4c1d4dd4, it missed return the right rootfs volume. In the is_block_rootfs fn, if the rootfs is based on a block device such as devicemapper, it should clear the volume's source and let the device_manager to use the dev_id to get the device's host path. Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>	2025-06-25 10:02:52 +08:00
Fupan Li	74eccc54e7	runtime-rs: add the blockfile based rootfs support For containerd's Blockfile Snapshotter, it will pass a rootfs mounts with a rawfile as a mount source and mount options with "loop" embeded. To support this type of rootfs, it is necessary to identify this as a blockfile rootfs through the "loop" flag, and then use the volume source of the rootfs as the source of the block device to hot-insert it into the guest. Fixes:#11464 Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>	2025-06-24 22:31:54 +08:00
alex.lyn	6ea1494701	runtime-rs: Add InitData Resource type for block device management To correctly manage initdata as a block device, a new InitData Resource type, inherently a block device, has been introduced within the ResourceManager. As a component of the Sandbox's resources, this InitData Resource needs to be appropriately handled by the Device Manager's handler. Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2025-06-24 10:25:57 +08:00
alex.lyn	8c1482a221	runtime-rs: Introduce coco_data dir and initdata block Implement resource storage infrastructure with initial initdata support: 1. Create dedicated `coco_data` directory for: - Centralized management of CoCo resources; - Future expansion of CoCo artifacts; 2. Atomic initdata block as foundational component in `coco_data`, it will implement creation of compressed initdata blocks with: - Gzip compression with level customization (0-9) - Sector-aligned (512B) image format with magic header - Adaptive buffering (4KB-128KB) based on payload size - Temp-file atomic writes with 0o600 permissions Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2025-06-24 10:25:57 +08:00
alex.lyn	cebb259e51	runtime-rs: Introduce force guest pulling image Container image integrity protection is a critical practice involving a multi-layered defense mechanism. While container images inherently offer basic integrity verification through Content-Addressable Storage (CAS) (ensuring pulled content matches stored hashes), a combination of other measures is crucial for production environments. These layers include: Encrypted Transport (HTTPS/TLS) to prevent tampering during transfer; Image Signing to confirm the image originates from a trusted source; Vulnerability Scanning to ensure the image content is "healthy"; and Trusted Registries with stringent access controls. In certain scenarios, such as when container image confidentiality requirements are not stringent, and integrity is already ensured via the aforementioned mechanisms (especially CAS and HTTPS/TLS), adopting "force guest pull" can be a viable option. This implies that even when pulling images from a container registry, their integrity remains guaranteed through content hashes and other built-in mechanisms, without relying on additional host-side verification or specialized transfer methods. Since this feature is already available in runtime-go and offers synergistic benefits with guest pull, we have chosen to support force guest pull. Fixes #10690 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2025-06-16 16:49:17 +08:00
alex.lyn	c9ffbaf30d	runtime-rs: Support handling Kata Virtual Volume in handle_rootfs In CoCo scenarios, there's no image pulling on host side, and it will disable such operations, that's to say, there's no files sharing between host and guest, especially for container rootfs. We introduce Kata Virtual Volume to help handle such cases: (1) Introduce is_kata_virtual_volume to ensure the volume is kata virtual volume. (2) Introduce VirtualVolume Handling logic in handle_rootfs when the mount is kata virtual volume. Fixes #10690 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2025-06-16 16:49:17 +08:00
alex.lyn	2600fc6f43	runtime-rs: Add Spec annotation to help pass image information We need get the relevent image ref from OCI runtime Spec, especially the annotation of it. Fixes #10690 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2025-06-16 16:49:17 +08:00
alex.lyn	d4e9369d3d	runtime-rs: Implement guest-pull rootfs via virtual volumes This commit introduces comprehensive support for rootfs mount mgmt through Kata Virtual Volumes, specifically enabling the guest-pull mechanism. It enhances the runtime's ability to: (1) Extract image references from container annotations (CRI/CRI-O). (2) Process `KataVirtualVolume` objects, configuring them for guest-pull operations. (3) Set up the agent's storage for guest-pulled images. This functionality streamlines the process of pulling container images directly within the guest for rootfs, aligning with guest-side image management strategies. Fixes #10690 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2025-06-16 16:49:17 +08:00
Ruoqing He	d7dfab92be	runtime-rs: Fix clippy `manual_inspect` Manually fix `manual_inspect` clippy warning reported by rust 1.85.1. ```console error: using `map` over `inspect` --> crates/resource/src/cdi_devices/container_device.rs:50:10 \| 50 \| .map(\|device\| { \| ^^^ \| = help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#manual_inspect = note: `-D clippy::manual-inspect` implied by `-D warnings` = help: to override `-D warnings` add `#[allow(clippy::manual_inspect)]` help: try \| 50 ~ .inspect(\|device\| { 51 \| // push every device's Device to agent_devices 52 ~ devices_agent.push(device.device.clone()); \| ``` Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>	2025-06-11 13:50:10 +00:00
Ruoqing He	4c467f57de	runtime-rs: Fix clippy `needless_return` Fix `needless_return` clippy warning as suggested by rust 1.85.1. ```console error: unneeded `return` statement --> crates/resource/src/rootfs/nydus_rootfs.rs:199:5 \| 199 \| return Some(prefetch_list_path.display().to_string()); \| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ \| = help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#needless_return = note: `-D clippy::needless-return` implied by `-D warnings` = help: to override `-D warnings` add `#[allow(clippy::needless_return)]` help: remove `return` \| 199 - return Some(prefetch_list_path.display().to_string()); 199 + Some(prefetch_list_path.display().to_string()) \| ``` Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>	2025-06-11 13:50:10 +00:00
Ruoqing He	781510202a	runtime-rs: Log error instead of format Log on error condition when `umount` operation fail instead of `format!` error message. Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>	2025-06-08 08:28:22 +00:00
Fabiano Fidêncio	02c46471fd	rust: Update cgroups-rs to its v0.3.5 release We're switching to using a rev as it may take some time for the package to be updated on crates.io. Signed-off-by: Fabiano Fidêncio <fidencio@northflank.com>	2025-05-30 21:49:50 +02:00
Fabiano Fidêncio	d3f81ec337	Merge pull request #11240 from Apokleos/copydir runtime-rs: Propagate k8s configs correctly when sharedfs is disabled	2025-05-27 12:41:21 +02:00
Fupan Li	15cbc545ca	runtime-rs: fix the issue of delete cgroup failed When try to delete a cgroup, it's needed to move all of the tasks/procs in the cgroup into root cgroup and then delete it. Since for cgroup v2, it doesn't support to move thread into root cgroup, thus move the processes instead of moving tasks can fix this issue. Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>	2025-05-22 12:15:02 +08:00
alex.lyn	4b27ca9233	runtime-rs: Implement volume copy allowlist check For security reasons, we have restricted directory copying. Introduces the `is_allowlisted_copy_volume` function to verify if a given volume path is present in an allowed copy directory. This enhances security by ensuring only permitted volumes are copied Currently, only directories under the path `/var/lib/kubelet/pods/<uid>/volumes/{kubernetes.io~configmap, kubernetes.io~secret, kubernetes.io~downward-api, kubernetes.io~projected}` are allowed to be copied into the guest. Copying of other directories will be prohibited. Fixes #11237 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2025-05-20 16:57:10 +08:00
alex.lyn	654e6db91f	runtime-rs: Add inotify-based real-time directory synchronization Introduce event-driven file sync mechanism between host and guest when sharedfs is disabled, which will help monitor the host path in time and do sync files changes: 1. Introduce FsWatcher to monitor directory changes via inotify; 2. Support recursive watching with configurable filters; 3. Add debounce logic (default 500ms cooldown) to handle burst events; 4. Trigger `copy_dir_recursively` on stable state; 5. Handle CREATE/MODIFY/DELETE/MOVED/CLOSE_WRITE events; Fixes #11237 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2025-05-20 16:55:49 +08:00
alex.lyn	79b832b2f5	runtime-rs: Propagate k8s configs correctly when sharedfs is disabled In Kubernetes (k8s), while Kata Pods often use virtiofs for injecting Service Accounts, Secrets, and ConfigMaps, security-sensitive environments like CoCo disable host-guest sharing. Consequently, when SharedFs is disabled, we propagate these configurations into the guest via file copy and bind mount for correct container access. Fixes #11237 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2025-05-20 16:55:49 +08:00
alex.lyn	8da7cd1611	runtime-rs: Impl recursive directory copy with metadata preservation Add async directory traversal using BFS algorithm: (1) Support file type handling: Regular files (S_IFREG) with content streaming; Directories (S_IFDIR) with mode preservation; Symbolic links (S_IFLNK) with target recreation; (2) Maintain POSIX metadata: UID/GID preservation,File mode bits, and Directory permissions (3) Implement async I/O operations for: Directory enumeration, file reading, symlink target resolution Fixes #11237 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2025-05-20 16:55:49 +08:00
Fupan Li	9a03815f18	Merge pull request #11095 from lifupan/ephemeral_volume runtime-rs: add the ephemeral memory based volume support	2025-05-20 09:18:34 +08:00
alex.lyn	d435712ccb	runtime-rs: Introduce PortDevice in resource manager in sandbox A new resource type `PortDevice` is introduced which is dedicated for handling root ports/switch ports during sandbox creation(VM). Fixes #10361 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2025-05-15 20:10:49 +08:00
stevenhorsman	7807e6c29a	versions: Bump byte-unit and rust_decimal Bump the crates to update them and pull in a newer version of borsh to remediate RUSTSEC-2023-0033 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-05-09 16:09:22 +01:00
Fabiano Fidêncio	78bf9d7500	Merge pull request #11232 from lifupan/mtu runtime: add the mtu support for updating routes	2025-05-06 15:55:04 +02:00

1 2 3 4 5

244 Commits