kata-containers

mirror of https://github.com/kata-containers/kata-containers.git synced 2026-04-04 11:03:52 +00:00

Author	SHA1	Message	Date
Sumedh Alok Sharma	c7c811071a	agent-ctl: Add option --vm to boot pod VM for testing. This change introduces a new command line option `--vm` to boot up a pod VM for testing. The tool connects with kata agent running inside the VM to send the test commands. The tool uses `hypervisor` crates from runtime-rs for VM lifecycle management. Current implementation supports Qemu & Cloud Hypervisor as VMMs. In summary: - tool parses the VMM specific runtime-rs kata config file in /opt/kata/share/defaults/kata-containers/runtime-rs/* - prepares and starts a VM using runtime-rs::hypervisor vm APIs - retrieves agent's server address to setup connection - tests the requested commands & shutdown the VM Fixes #11566 Signed-off-by: Sumedh Alok Sharma <sumsharma@microsoft.com>	2025-08-11 11:03:18 +00:00
Alex Lyn	196d7d674d	runtime-rs: Label system journal log with kata Route kata-shim logs directly to systemd-journald under 'kata' identifier. This refactoring enables `kata-shim` logs to be properly attributed to 'kata' in systemd-journald, instead of inheriting the 'containerd' identifier. Previously, `kata-shim` logs were challenging to filter and debug as they appeared under the `containerd.service` unit. This commit resolves this by: 1. Introducing a `LogDestination` enum to explicitly define logging targets (File or Journal). 2. Modifying logger creation to set `SYSLOG_IDENTIFIER=kata` when logging to Journald. 3. Ensuring type safety and correct ownership handling for different logging backends. This significantly enhances the observability and debuggability of Kata Containers, making it easier to monitor and troubleshoot Kata-specific events. Fixes: #11590 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com> Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2025-08-10 16:00:36 +08:00
Alex Lyn	9816ffdac7	Merge pull request #11653 from Apokleos/align-initdata-annoation Align initdata annotation with kata-runtime	2025-08-08 16:24:09 +08:00
Fupan Li	b50777a174	Merge pull request #10580 from pmores/make-vcpu-allocation-more-accurate runtime-rs: make vcpu allocation more accurate	2025-08-08 14:14:40 +08:00
Xuewei Niu	beea0c34c5	Merge pull request #11060 from kata-containers/sprt/vfsd-metadata runtime: virtio-fs: Support "metadata" cache mode	2025-08-08 11:13:57 +08:00
Aurélien Bombo	6d96875d04	runtime: virtio-fs: Support "metadata" cache mode The Rust virtiofsd supports a "metadata" cache mode [1] that wasn't present in the C version [2], so this PR adds support for that. [1] https://gitlab.com/virtio-fs/virtiofsd [2] https://qemu.weilnetz.de/doc/5.1/tools/virtiofsd.html#cmdoption-virtiofsd-cache Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2025-08-07 21:24:40 +08:00
Pavel Mores	00bfa3fa02	runtime-rs: re-adjust config after modifying it with annotations Configuration information is adjusted after loading from file but so far, there has been no similar check for configuration coming from annotations. This commit introduces re-adjusting config after annotations have been processed. A small refactor was necessary as a prerequisite which introduces function TomlConfig::adjust_config() to make it easier to invoke the adjustment for a whole TomlConfig instance. This function is analogous to the existing validate() function. The immediate motivation for this change is to make sure that 0 in "default_vcpus" annotation will be properly adjusted to 1 as is the case if 0 is loaded from a config file. This is required to match the golang runtime behaviour. Signed-off-by: Pavel Mores <pmores@redhat.com>	2025-08-07 10:32:44 +02:00
Pavel Mores	e2156721fd	runtime-rs: add tests to exercise floating-point 'default_vcpus' Also included (as commented out) is a test that does not pass although it should. See source code comment for explanation why fixing this seems beyond the scope of this PR. Signed-off-by: Pavel Mores <pmores@redhat.com>	2025-08-07 10:32:44 +02:00
Pavel Mores	1f95d9401b	runtime-rs: change representation of default_vcpus from i32 to f32 This commit focuses purely on the formal change of type. If any subsequent changes in semantics are needed they are purposely avoided here so that the commit can be reviewed as a 100% formal and 0% semantic change. Signed-off-by: Pavel Mores <pmores@redhat.com>	2025-08-07 10:32:44 +02:00
Pavel Mores	cdc0eab8e4	runtime-rs: make sandbox vcpu allocation more accurate This commit addresses a part of the same problem as PR #7623 did for the golang runtime. So far we've been rounding up individual containers' vCPU requests and then summing them up which can lead to allocation of excess vCPUs as described in the mentioned PR's cover letter. We address this by reversing the order of operations, we sum the (possibly fractional) container requests and only then round up the total. We also align runtime-rs's behaviour with runtime-go in that we now include the default vcpu request from the config file ('default_vcpu') in the total. We diverge from PR #7623 in that `default_vcpu` is still treated as an integer (this will be a topic of a separate commit), and that this implementation avoids relying on 32-bit floating point arithmetic as there are some potential problems with using f32. For instance, some numbers commonly used in decimal, notably all of single-decimal-digit numbers 0.1, 0.2 .. 0.9 except 0.5, are periodic in binary and thus fundamentally not representable exactly. Arithmetics performed on such numbers can lead to surprising results, e.g. adding 0.1 ten times gives 1.0000001, not 1, and taking a ceil() results in 2, clearly a wrong answer in vcpu allocation. So instead, we take advantage of the fact that container requests happen to be expressed as a quota/period fraction so we can sum up quotas, fundamentally integral numbers (possibly fractional only due to the need to rewrite them with a common denominator) with much less danger of precision loss. Signed-off-by: Pavel Mores <pmores@redhat.com>	2025-08-07 10:32:44 +02:00
Christophe de Dinechin	ec480dc438	qemu: Respect the JSON schema for hot plug When hot-plugging CPUs on QEMU, we send a QMP command with JSON arguments. QEMU 9.2 recently became more strict[1] enforcing the JSON schema for QMP parameters. As a result, running Kata Containers with QEMU 9.2 results in a message complaining that the core-id parameter is expected to be an integer: ``` qmp hotplug cpu, cpuID=cpu-0 socketID=1, error: QMP command failed: Invalid parameter type for 'core-id', expected: integer ``` Fix that by changing the core-id, socket-id and thread-id to be integer values. [1]: `be93fd5372` Fixes: #11633 Signed-off-by: Christophe de Dinechin <dinechin@redhat.com>	2025-08-07 09:13:57 +02:00
Alex Lyn	37685c41c7	runtime-rs: Correct the corresponding initdata annotation const Because the initdata annotation definition has changed, its const definition also needs to be corrected with KATA_ANNO_CFG_RUNTIME_INIT_DATA. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2025-08-07 10:45:28 +08:00
Alex Lyn	ede773db17	kata-types: Align the initdata annotation with kata-runtime's definition To make it work within CI, we do alignment with kata-runtime's definition with "io.katacontainers.config.runtime.cc_init_data". Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2025-08-03 22:51:39 +08:00
Markus Rudy	9e38fd2562	tools: add image for Go proto bindings In order to have a reproducible code generation process, we need to pin the versions of the tools used. This is accomplished easiest by generating inside a container. This commit adds a container image definition with fixed dependencies for Golang proto/ttrpc code generation, and changes the agent Makefile to invoke the update-generated-proto.sh script from within that container. Signed-off-by: Markus Rudy <mr@edgeless.systems>	2025-07-31 17:58:25 +01:00
Markus Rudy	f7a36df290	runtime: generate proto files The generated Go bindings for the agent are out of date. This commit was produced by running src/agent/src/libs/protocols/hack/update-generated-proto.sh with protobuf compiler versions matching those of the last run, according to the generated code comments. Since there are new RPC methods, those needed to be added to the HybridVSockTTRPCMockImp. Signed-off-by: Markus Rudy <mr@edgeless.systems>	2025-07-31 17:58:25 +01:00
Dan Mihai	c11c972465	genpolicy: config layer logging clean-up Use a simple debug!() for logging the config_layer string, instead of transcoding, etc. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2025-07-28 18:30:13 +00:00
Dan Mihai	30bfa2dfcc	genpolicy: use CoCo settings by default - "confidential_emptyDir" becomes "emptyDir" in the settings file. - "confidential_configMap" becomes "configMap" in settings. - "mount_source_cpath" becomes "cpath". - The new "root_path" gets used instead of the old "cpath" to point to the container root path. - "confidential_guest" is no longer used. By default it gets replaced by "enable_configmap_secret_storages"=false, because CoCo is using CopyFileRequest instead of the Storage data structures for ConfigMap and/or Secret volume mounts during CreateContainerRequest. - The value of "guest_pull" becomes true by default. - "image_layer_verification" is no longer used - just CoCo's guest pull is supported. - The Request input files from unit tests are changing to reflect the new default settings values described above. - tests/integration/kubernetes/tests_common.sh adjusts the settings for platforms that are not set-up for CoCo during CI (i.e., platforms other than SNP, TDX, and CoCo Dev). Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2025-07-28 18:30:13 +00:00
Dan Mihai	94995d7102	genpolicy: skip pulling layers for guest-pull Skip pulling container image layers when guest-pull=true. The contents of these layers were ignored due to: - #11162, and - tarfs snapshotter support having been removed from genpolicy. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2025-07-28 18:30:13 +00:00
Dan Mihai	f6016f4f36	genpolicy: remove tarfs snapshotter support AKS Confidential Containers are using the tarfs snapshotter. CoCo upstream doesn't use this snapshotter, so remove this Policy complexity from upstream. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2025-07-28 18:30:10 +00:00
Xuewei Niu	2a3c8b04df	Merge pull request #11613 from RuoqingHe/clippy-fix-for-libs-20250721 mem-agent: Ignore Cargo.lock	2025-07-28 17:45:29 +08:00
RuoqingHe	3f46347dc5	Merge pull request #11618 from RuoqingHe/fix-dragonball-default-build dragonball: Fix warnings in default build	2025-07-28 11:24:46 +08:00
Ruoqing He	4ca6c2d917	mem-agent: Ignore Cargo.lock `mem-agent` is now a library and does not contain examples; ignore Cargo.lock to get rid of the untracked-file noise produced by `cargo run` or `cargo test`. Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>	2025-07-28 10:32:46 +08:00
Ruoqing He	3ec10b3721	runtime: clh: Re-generate client code against v47.0 Re-generates the client code against Cloud Hypervisor v47.0. Note: The client code of cloud-hypervisor's OpenAPI is automatically generated by openapi-generator. Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>	2025-07-25 20:44:14 +02:00
Xuewei Niu	6f6d64604f	Merge pull request #11598 from justxuewei/cgroups	2025-07-25 17:53:03 +08:00
Ruoqing He	639273366a	dragonball: Gate `MmapRegion` behind `virtio-fs` `MmapRegion` is only used while `virtio-fs` is enabled during testing dragonball, gate the import behind `virtio-fs` feature. Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>	2025-07-25 09:09:35 +00:00
Ruoqing He	2e81ac463a	dragonball: Allow unused to suppress warnings Some variables go unused when certain features are not enabled; use `#[allow(unused)]` to suppress those warnings for the time being. Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>	2025-07-25 09:07:19 +00:00
Ruoqing He	5f7da1ccaa	dragonball: Silence never read fields Some fields in structures used for testing purposes are never read; rename them to signal that. Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>	2025-07-25 09:07:19 +00:00
Ruoqing He	225e6fffbc	dragonball: Gate `VcpuManagerError` behind `host-device` `VcpuManagerError` is only needed when `host-device` feature is enabled, gate the import behind that feature. Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>	2025-07-25 09:07:19 +00:00
Ruoqing He	0502b05718	dragonball: Remove `with-serde` feature assertion Code inside `test_mac_addr_serialization_and_deserialization` test does not actually require this `with-serde` feature to test, removing the assertion here to enable this test. Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>	2025-07-25 09:05:55 +00:00
Xuewei Niu	60e3679eb7	runtime-rs: Add full cgroups support on host Add full cgroups support on host. Cgroups are managed by `FsManager` and `SystemdManager`. As the names imply, `FsManager` manages cgroups through cgroupfs, while `SystemdManager` manages cgroups through systemd. Both managers support cgroup v1 and cgroup v2. Two types of cgroups path are supported: 1. For colon paths, for example "foo.slice:bar:baz", the runtime manages cgroups with `SystemdManager`; 2. For relative/absolute paths, the runtime manages cgroups with `FsManager`. vCPU threads are added to the sandbox cgroups only with cgroup v1 + cgroupfs; in the other combinations (cgroup v1 + systemd, cgroup v2 + cgroupfs, cgroup v2 + systemd) the VMM process is added to the cgroups. systemd doesn't provide a way to add a thread to a unit, so `add_thread()` in `SystemdManager` is equivalent to `add_process()`. Cgroup v2 supports threaded mode. However, threaded mode has to be enabled iteratively from the leaf node up to the root node (`/`) [1]. This means the runtime would need to modify the cgroups created by the container runtime (e.g. containerd). Considering that cgroupfs + cgroup v2 is not a common combination, its behavior is aligned with systemd + cgroup v2, which does not allow managing processes at the thread level. 1: https://www.kernel.org/doc/html/v4.18/admin-guide/cgroup-v2.html#threads Fixes: #11356 Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2025-07-25 14:52:55 +08:00
alex.lyn	613dba6f1f	runtime-rs: Some extra work to enhance copyfile with sharedfs disabled For various reasons, copyfile first needs to be aligned with runtime-go; this commit does that work. Fixes #11543 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2025-07-25 11:39:20 +08:00
Steve Horsman	c762a3dd4f	Merge pull request #11372 from kata-containers/dependabot/cargo/src/dragonball/openssl-af8515b6e0 build(deps): bump the openssl group across 4 directories with 1 update	2025-07-24 13:27:24 +01:00
Xuewei Niu	635272f3e8	runtime-rs: Ignore SIGTERM signal in shim When enabling systemd cgroup driver and sandbox cgroup only, the shim is under a systemd unit. When the unit is stopping, systemd sends SIGTERM to the shim. The shim can't exit immediately, as there are some cleanups to do. Therefore, ignoring SIGTERM is required here. The shim should complete the work within a period (Kata sets it to 300s by default). Once a timeout occurs, systemd will send SIGKILL. Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2025-07-24 17:15:15 +08:00
Xuewei Niu	79f29bc523	runtime-rs: QEMU get_thread_ids() returns real vCPU's tids The information is obtained through QMP query_cpus_fast. Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2025-07-24 17:15:15 +08:00
alex.lyn	b40d65bc1b	runtime-rs: support block device driver virtio-scsi within qemu-rs It is important that we continue to support VirtIO-SCSI. While VirtIO-BLK is a common choice, virtio-scsi offers significant performance advantages in specific scenarios, particularly when utilizing iothreads and with NVMe Fabrics. By supporting both virtio-blk and virtio-scsi, we give users the flexibility to choose the optimal storage interface (virtio-blk or virtio-scsi) based on their specific workload requirements and hardware configurations. The virtio-scsi controller is already created when the QEMU VM starts with the block device driver set to `virtio-scsi`. This commit uses QMP to blockdev_add the backend block device and device_add the frontend virtio-scsi device. Fixes #11516 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2025-07-24 14:00:02 +08:00
alex.lyn	e683a7fd37	runtime-rs: Change the device_id with block device index The block device index is a unique id of a block device and identifies it just as device_id does. Because the index is needed to calculate the SCSI LUN, and to reduce unneeded arguments when reusing `hotplug_block_device`, replace device_id with the block device index. Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2025-07-24 11:57:00 +08:00
alex.lyn	4521cae0c0	runtime-rs: Support AIO for hotplugging block device within qemu This commit introduces block device AIO in hotplug_block_device for QEMU via QMP and sets "iouring" as the default. Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2025-07-24 11:57:00 +08:00
alex.lyn	b4d276bc2b	runtime-rs: Handle virtio-scsi within device manager The device manager should handle virtio-scsi correctly in create_block_device when the driver_option is virtio-scsi. Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2025-07-24 11:57:00 +08:00
alex.lyn	fbd84fd3f4	runtime-rs: Support virtio-scsi device within handle_block_volume It supports handling scsi device when block device driver is `scsi`. And it will ensure a correct storage source with LUN. Fixes #11516 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2025-07-24 11:57:00 +08:00
alex.lyn	57645c0786	runtime-rs: Add support for block device AIO This commit introduces three block device AIO modes and sets "iouring" as the default. Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2025-07-24 11:57:00 +08:00
alex.lyn	40e6aacc34	runtime-rs: Introduce scsi_addr within BlockConfig for SCSI devices It's used to help discover scsi devices inside guest and also add a new const value `KATA_SCSI_DEV_TYPE` to help pass information. Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2025-07-24 11:57:00 +08:00
alex.lyn	125383e53c	runtime-rs: Add support for configurable block device aio AIO is the I/O mechanism used by qemu with options: - threads Pthread based disk I/O. - native Native Linux I/O. - io_uring (default mode) Linux io_uring API. This provides the fastest I/O operations on Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2025-07-24 11:56:52 +08:00
dependabot[bot]	ef9d960763	build(deps): bump the openssl group across 4 directories with 1 update Bumps the openssl group with 1 update in the /src/dragonball directory: [openssl](https://github.com/sfackler/rust-openssl). Bumps the openssl group with 1 update in the /src/runtime-rs directory: [openssl](https://github.com/sfackler/rust-openssl). Bumps the openssl group with 1 update in the /src/tools/genpolicy directory: [openssl](https://github.com/sfackler/rust-openssl). Bumps the openssl group with 1 update in the /src/tools/kata-ctl directory: [openssl](https://github.com/sfackler/rust-openssl). Updates `openssl` from 0.10.72 to 0.10.73 - [Release notes](https://github.com/sfackler/rust-openssl/releases) - [Commits](https://github.com/sfackler/rust-openssl/compare/openssl-v0.10.72...openssl-v0.10.73) Updates `openssl` from 0.10.72 to 0.10.73 - [Release notes](https://github.com/sfackler/rust-openssl/releases) - [Commits](https://github.com/sfackler/rust-openssl/compare/openssl-v0.10.72...openssl-v0.10.73) Updates `openssl` from 0.10.72 to 0.10.73 - [Release notes](https://github.com/sfackler/rust-openssl/releases) - [Commits](https://github.com/sfackler/rust-openssl/compare/openssl-v0.10.72...openssl-v0.10.73) Updates `openssl` from 0.10.72 to 0.10.73 - [Release notes](https://github.com/sfackler/rust-openssl/releases) - [Commits](https://github.com/sfackler/rust-openssl/compare/openssl-v0.10.72...openssl-v0.10.73) --- updated-dependencies: - dependency-name: openssl dependency-version: 0.10.73 dependency-type: indirect update-type: version-update:semver-patch dependency-group: openssl - dependency-name: openssl dependency-version: 0.10.73 dependency-type: indirect update-type: version-update:semver-patch dependency-group: openssl - dependency-name: openssl dependency-version: 0.10.73 dependency-type: direct:production update-type: version-update:semver-patch dependency-group: openssl - dependency-name: openssl dependency-version: 0.10.73 dependency-type: indirect update-type: version-update:semver-patch dependency-group: openssl ... Signed-off-by: dependabot[bot] <support@github.com>	2025-07-23 15:17:12 +00:00
alex.lyn	a12ae58431	runtime-rs: Support hotplugging host block devices within qemu-rs Although the previous implementation of hotplugging a block device via QMP can successfully hot-plug a regular-file-based block device, it fails when the backend is /dev/xxx (e.g. /dev/loop0). Analysis shows that it lacks the ability to hotplug host block devices. This commit fills the gap and makes it work for host block devices. Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2025-07-22 15:40:03 +08:00
Steve Horsman	09efcfbd86	Merge pull request #11606 from kata-containers/dependabot/cargo/src/tools/genpolicy/zerocopy-0.6.6 build(deps): bump zerocopy from 0.6.1 to 0.6.6 in /src/tools/genpolicy	2025-07-21 18:58:56 +01:00
dependabot[bot]	a9c8377073	build(deps): bump zerocopy from 0.6.1 to 0.6.6 in /src/tools/genpolicy --- updated-dependencies: - dependency-name: zerocopy dependency-version: 0.6.6 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com>	2025-07-21 12:50:38 +00:00
dependabot[bot]	0b4c434ece	build(deps): bump unsafe-libyaml in /src/tools/kata-ctl Bumps [unsafe-libyaml](https://github.com/dtolnay/unsafe-libyaml) from 0.2.9 to 0.2.11. - [Release notes](https://github.com/dtolnay/unsafe-libyaml/releases) - [Commits](https://github.com/dtolnay/unsafe-libyaml/compare/0.2.9...0.2.11) --- updated-dependencies: - dependency-name: unsafe-libyaml dependency-version: 0.2.11 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com>	2025-07-21 12:46:27 +00:00
stevenhorsman	162ba19b85	agent-ctl: Bump rustls Bump rustls to >=0.23.18 to remediate RUSTSEC-2024-0399 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-07-21 10:41:59 +01:00
stevenhorsman	42339e9cdf	dragonball: Update url crate Update url to 2.5.4 to bump idna to 1.0.3 and remediate RUSTSEC-2024-0421 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-07-21 10:35:05 +01:00
stevenhorsman	1795361589	runk: Update rustjail Update the rustjail crate to pull in the latest security fixes Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-07-21 10:31:18 +01:00
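
Note on cdc0eab8e4 (runtime-rs: make sandbox vcpu allocation more accurate): the commit message describes summing container requests as quota/period fractions and rounding up only once at the end. A minimal sketch of that idea follows; it is not the runtime-rs code. `CpuRequest`, `sandbox_vcpus` and `per_container_round_up` are hypothetical names, and the `default_vcpus` contribution from the config file is left out.

```rust
/// Hypothetical container CPU request: a CFS quota/period pair in microseconds.
#[derive(Clone, Copy)]
struct CpuRequest {
    quota: u64,
    period: u64,
}

/// Greatest common divisor, used to keep the running fraction reduced.
fn gcd(a: u128, b: u128) -> u128 {
    if b == 0 { a } else { gcd(b, a % b) }
}

/// Sum every request as one exact fraction num/den, then round up once.
fn sandbox_vcpus(requests: &[CpuRequest]) -> u64 {
    let (mut num, mut den): (u128, u128) = (0, 1);
    for r in requests.iter().filter(|r| r.period > 0) {
        // num/den + quota/period, rewritten over a common denominator.
        num = num * r.period as u128 + r.quota as u128 * den;
        den *= r.period as u128;
        let g = gcd(num, den);
        num /= g;
        den /= g;
    }
    // Integer ceiling of num/den.
    ((num + den - 1) / den) as u64
}

/// The order of operations the commit moves away from:
/// round each container up first, then add the results.
fn per_container_round_up(requests: &[CpuRequest]) -> u64 {
    requests
        .iter()
        .filter(|r| r.period > 0)
        .map(|r| (r.quota + r.period - 1) / r.period)
        .sum()
}

fn main() {
    // Ten containers, each asking for 0.1 CPU (10 ms of every 100 ms).
    let reqs = [CpuRequest { quota: 10_000, period: 100_000 }; 10];
    println!("sum, then round up: {}", sandbox_vcpus(&reqs));
    println!("round up, then sum: {}", per_container_round_up(&reqs));

    // The f32 path the commit message warns about, printed for comparison.
    let f32_sum: f32 = (0..10).map(|_| 0.1f32).sum();
    println!("f32 sum: {} (ceil: {})", f32_sum, f32_sum.ceil());
}
```

Keeping the total as an integer numerator and denominator means the only rounding is the final ceiling, which is the property the commit message argues for.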
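
Note on 60e3679eb7 (runtime-rs: Add full cgroups support on host): the commit message says colon paths such as "foo.slice:bar:baz" go to `SystemdManager`, other paths go to `FsManager`, and vCPU threads are placed individually only with cgroup v1 + cgroupfs. The sketch below illustrates that dispatch under stated assumptions. The types and functions (`CgroupBackend`, `select_backend`, `places_vcpu_threads`) are hypothetical, the `slice`/`prefix`/`name` field names are borrowed from the usual OCI systemd path convention, and treating anything that is not exactly three colon-separated parts as a filesystem path is this sketch's own choice.

```rust
/// Hypothetical result of classifying a cgroups path.
#[derive(Debug, PartialEq)]
enum CgroupBackend<'a> {
    /// Colon path, e.g. "foo.slice:bar:baz": managed through systemd.
    Systemd { slice: &'a str, prefix: &'a str, name: &'a str },
    /// Relative or absolute path: managed through cgroupfs.
    Fs { path: &'a str },
}

#[derive(Clone, Copy)]
enum CgroupVersion {
    V1,
    V2,
}

/// Pick a backend from the shape of the path alone.
fn select_backend(path: &str) -> CgroupBackend<'_> {
    let parts: Vec<&str> = path.split(':').collect();
    match parts.as_slice() {
        &[slice, prefix, name] => CgroupBackend::Systemd { slice, prefix, name },
        _ => CgroupBackend::Fs { path },
    }
}

/// Per the commit message, vCPU threads are added individually only for
/// cgroup v1 + cgroupfs; every other combination adds the whole VMM process.
fn places_vcpu_threads(version: CgroupVersion, backend: &CgroupBackend<'_>) -> bool {
    matches!((version, backend), (CgroupVersion::V1, CgroupBackend::Fs { .. }))
}

fn main() {
    for path in ["foo.slice:bar:baz", "/kata/sandbox", "kata/sandbox"] {
        let backend = select_backend(path);
        println!(
            "{path}: {backend:?}, v1 threads: {}, v2 threads: {}",
            places_vcpu_threads(CgroupVersion::V1, &backend),
            places_vcpu_threads(CgroupVersion::V2, &backend),
        );
    }
}
```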


5393 Commits