kata-containers

mirror of https://github.com/kata-containers/kata-containers.git synced 2026-07-01 22:50:54 +00:00

Author	SHA1	Message	Date
Fabiano Fidêncio	110843d6e1	Merge pull request #13138 from manuelh-dev/mahuber/runt-rs-mem-file-removal runtime(-rs): remove file_mem_backend config option	2026-06-12 17:13:04 +02:00
Fupan Li	9553614f32	Merge pull request #12772 from Apokleos/nydus-standalone runtime-rs: Nydus standalone mode support in runtime-rs	2026-06-12 10:36:17 +08:00
Manuel Huber	70d8f1bf3d	runtime: remove file_mem_backend config option Remove the Go runtime file_mem_backend and valid_file_mem_backends config knobs, along with the corresponding sandbox annotation handling. The runtime still enables file-backed shared memory automatically for virtio-fs by using /dev/shm as the backing directory. This only removes the user-selectable backend path. Signed-off-by: Manuel Huber <manuelh@nvidia.com> Assisted-by: OpenAI Codex <codex@openai.com>	2026-06-12 00:07:16 +00:00
Manuel Huber	86fd65271c	runtime-rs: remove file_mem_backend config option While the config knob is being parsed, it is being unused in the rust shim. This renders the config knob useless. Remove the file_mem_backend config option as there is no current users for it. As this option is being usable in the go shim, we leave it intact. For the rust shim, /dev/shm is still being used in a similar way to the go shim when filesystem sharing is enabled (virtio-fs). Future use cases where other file_mem_backends are being utilized are currently planning to define these backends in a similar manner: based on the configuration/platform, determine the proper file memory backend, but do not let end users determine the file memory backend. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-06-12 00:07:16 +00:00
Fabiano Fidêncio	b323697f37	Merge pull request #13111 from Apokleos/monitor-disk-usage Metrics: Add support for monitoring disk usage via statfs	2026-06-12 00:41:31 +02:00
Alex Lyn	fa84eecd2d	runtime-rs: Implement ShareVirtioFsNydus for standalone mode Introduce `ShareVirtioFsNydus` to enable standalone Nydus rootfs support. This implementation acts as the bridge between runtime-rs and the external `nydusd` daemon. Key Capabilities: (1) Trait Implementation: Implements `ShareFs` (for VM device/storage) and `NydusShareFs` (for RAFS lifecycle) traits. (2) Daemon Lifecycle Management: Handles `nydusd` spawning, supervision, and graceful shutdown. (3) Native Overlay Support: Configures `nydusd` with `passthrough_fs` backend to provide native overlay (upperdir/workdir) support. (4) API Integration: Utilizes `NydusClient` for granular control over RAFS mount/umount operations. (5) QEMU Integration: Enables `virtio-fs-nydus` device support, facilitating standalone mode execution. This implementation allows Kata containers to utilize an external `nydusd` process for Nydus rootfs management, providing a cleaner separation between the runtime and the Nydus daemon lifecycle. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-06-11 21:42:48 +02:00
Alex Lyn	edfe9ea403	runtime-rs: refine ShareFs abstraction with lifecycle and Nydus traits Refactor the `ShareFs` trait to improve modularity and support standalone Nydus mode: (1) Added `stop()` method to manage daemon teardown. (2) Introduced a dedicated trait for Nydus-specific data-plane operations. This refactoring cleans up the `ShareFs` trait by consolidating daemon lifecycle handling and isolating Nydus-specific extensions, paving the way for cleaner standalone Nydus implementation. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-06-11 21:42:48 +02:00
Alex Lyn	720a8688b4	runtime-rs: Add daemon manager for nydusd process lifecycle Implement Nydusd to manage nydusd daemon process: (1) start: spawn process, validate paths, wait for API ready, setup passthrough fs. (2) stop: kill process, cleanup socket files. (3) mount_rafs/mount_rafs_with_overlay: high-level filesystem mount operations. (4) build_args: construct virtiofs mode command line arguments. This provides process lifecycle management with internal NydusClient Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-06-11 21:42:48 +02:00
Alex Lyn	c1ebf269f7	runtime-rs: Add nydus client for nydusd API communication via HTTP Implement NydusClient to interact with nydusd daemon via Unix socket: (1) check_status: query daemon state via GET /api/v1/daemon. (2) mount/umount: manage filesystem mounts via POST/DELETE /api/v1/mount. (3) wait_until_ready: poll daemon until RUNNING state. This provides a lightweight, stateless HTTP client layer for nydusd API. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-06-11 21:42:48 +02:00
Alex Lyn	4c63b8e3de	agent: handle ENOSYS in overlayfs storage handler In standalone nydusd mode with virtio-fs passthrough, the guest-side mkdir may fail with ENOSYS. Update the overlayfs storage handler to skip directory creation when the directory already exists, logging a warning instead of failing. This ensures container rootfs setup succeeds when nydusd's native overlay manages the directory structure. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-06-11 21:25:18 +02:00
Alex Lyn	8eb564dfb8	kata-sys-util: handle ENOSYS gracefully in mount destination creation When using virtio-fs with nydusd's passthrough_fs, mkdir operations may return ENOSYS on certain filesystem configurations. This causes mount destination creation to fail unexpectedly. Handle ENOSYS errors gracefully alongside AlreadyExists by verifying the directory exists after the failed mkdir attempt, allowing the mount to proceed if the directory is already present. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-06-11 21:25:18 +02:00
Alex Lyn	b50f803a4e	kata-types: add virtio-fs-nydus shared fs configuration support Add "virtio-fs-nydus" as a recognized shared filesystem type in the hypervisor configuration. This enables the standalone nydusd mode where nydusd runs as a separate process alongside virtiofsd. The key changes: (1) Add VIRTIO_FS_NYDUS constant for the new shared fs type. (2) Register virtio-fs-nydus in adjust() and validate() paths, reusing the same virtio-fs validation logic since both use vhost-user protocol Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-06-11 21:25:18 +02:00
Alex Lyn	854e76fb47	kata-types: Enhance related stuff for independent io threads Refactor comments and tests stuff for independent iothreads. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-06-11 20:47:20 +02:00
Alex Lyn	b0ebbc685d	runtime-rs: Add support for independent iothreads for virtio blk devices As independent iothreads can work in both virtio-scsi and virtio-blk devices, this commit aims to enable such feature in virtio-blk-pci devices. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-06-11 20:47:20 +02:00
Alex Lyn	980ecfdd96	runtime-rs: Add support iodependent iothreads within virtio-blk 1. Determine iothread for virtio-blk devices, only attach iothread when: (1) enable_iothreads is true (2) indep_iothreads > 0 (3) block driver is not virtio-scsi (i.e., it's virtio-blk) And for more complex cases, some enhancements will be done in future 2. Add iothread parameter for virtio-blk devices if specified. If iothreads set and passed, we will have to set it correctly for virtio-blk devices via qmp with device_add arguments. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-06-11 20:47:20 +02:00
Alex Lyn	36e626649d	runtime-rs: Add support independent IO threads in qemu cmdline To make it work well for independent IO threads for virtio-blk devices. A new method for independent IO threads for virtio-blk hotplug devices within qemu command line. Note that as ObjectIoThread has been done for days, it can be directly reused in this case. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-06-11 20:47:20 +02:00
Alex Lyn	86d165c0cc	kata-types: Introduce a dedicated annotation for indep_iothreads To make it more flexible when users want to set this feature, one more way to make it valid is via annotations. The dedicated annnotation of "io.katacontainers.config.hypervisor.indep_iothreads" is introduced within k8s clusters. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-06-11 20:47:20 +02:00
Alex Lyn	bdc57b16e5	runtime-rs: Add configurable indep_iothreads in configurations It's useful and helpful to set indep_iothreads with enable_iothreads for high IO performance. And we need provide an entry for people to set it if needed. This commit will introduce two configurable items: - Makefile: DEFINDEPIOTHREADS when make build. - configurations: indep_iothreads for people to set. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-06-11 20:47:20 +02:00
Alex Lyn	d086d324e0	kata-types: Introduce independent IO thread for virtio-blk devices The 'indep_iothreads' field is introduced in Hypervisor to make it configurable for number of independent IO threads for virtio-blk devices. When set to a value greater than 0, creates independent IO threads that can be attached to virtio-blk devices during hotplug. Note that it requires 'enable_iothreads' to be true for virtio-blk devices to use these threads. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-06-11 20:47:20 +02:00
Alex Lyn	5a00053b38	kata-agent: Implement filesystem space usage collection via statfs Add update_guest_filesystem_metrics() that collects disk space usage (total/used/available) for all read-write mounted filesystems inside the guest VM. This enables monitoring guest disk usage in kata/coco pod through the existing GetMetrics RPC. And its output metrics looks like as below: - kata_guest_filesystem_bytes{mount="/",device="vda",item="total\|used\|available"} - kata_guest_filesystem_inodes{mount="/",device="vda",item="total\|used\|available"} Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-06-11 20:47:05 +02:00
Alex Lyn	6c66724591	kata-agent: Add filesystem space usage metric declarations Add two new GaugeVec metrics to expose guest filesystem space usage: (1) kata_guest_filesystem_bytes{mount, device, item}: space in bytes (total/used/available) (2) kata_guest_filesystem_inodes{mount, device, item}: inode counts (total/used/available) Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2026-06-11 20:47:05 +02:00
stevenhorsman	fb4600d66a	runtime-rs: Fix test breakage In #13147, for some reason a test block was added in the middle of code and the code was stale when merged, which meant that a second `mod test` section was added, breaking our tests. Merge the two to fix this. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-06-11 19:03:33 +02:00
Fabiano Fidêncio	21657b9cd9	Merge pull request #13147 from manuelh-dev/mahuber/debug-go-rust runtime-rs: Honor enable_debug for logs and adjust debugging documentation	2026-06-11 08:57:36 +02:00
Hyounggyu Choi	7cc6767fa2	runtime*: use static_sandbox_resource_mgmt defaults for qemu-se Switch qemu-se config templates to use the TEE/CoCo-specific static_sandbox_resource_mgmt defaults instead of the generic QEMU defaults. qemu-se-runtime-rs config now uses DEFSTATICRESOURCEMGMT_COCO while runtime qemu-se config now uses DEFSTATICRESOURCEMGMT_TEE. This aligns static sandbox resource management behavior with confidential container expectations for qemu-se variants. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2026-06-09 14:45:50 +02:00
Alex Lyn	6500e018c0	Merge pull request #13093 from RainaYL/rainax/tdx_boot_pr dragonball: Add steps to boot TDX VM	2026-06-09 10:13:57 +08:00
Fabiano Fidêncio	4dc288401e	runtime-rs: make sandbox cgroup runtime attach idempotent The dragonball nerdctl CI job can race when creating and attaching the runtime process to the sandbox cgroup, surfacing an os error 17 (AlreadyExists) during shim task creation. Let's retry add_proc once on this pre-existing cgroup condition so startup remains robust. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Assisted-by: Codex <codex@openai.com>	2026-06-08 13:11:34 +02:00
Fabiano Fidêncio	4d569c22b4	runtime-rs: enforce a minimum vsock reconnect window Low-CPU sandboxes can take longer than a few seconds to complete guest boot and start the agent. Let's clamp the reconnect timeout to a safe minimum so sandbox startup does not fail early with transient vsock ECONNRESET. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Assisted-by: Codex <codex@openai.com>	2026-06-08 13:11:34 +02:00
Fabiano Fidêncio	ed34d7811d	runtime-rs: supplement static sizing from sandbox annotations When static sandbox resource management is enabled, CRI CPU/memory sizing may live only in sandbox annotations and be missing from the OCI spec. Let's fill missing sizing fields from annotations before applying static VM sizing so runtime-rs follows the expected Kubernetes behavior for constrained pods. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Assisted-by: Codex <codex@openai.com>	2026-06-08 13:11:34 +02:00
Fabiano Fidêncio	e93558e810	runtime-rs: default static sizing-related config flags to true Add top-level runtime-rs Makefile options `DEFSANDBOXCGROUP_ONLY` and `DEFSTATICRESOURCEMGMT`, both defaulting to true, and use them for the runtime defaults that previously disabled these paths. This aligns runtime-rs defaults with static sandbox resource management, which sizes sandbox memory up front instead of relying on memory hotplug, helping avoid architecture-specific hotplug limitations. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-06-08 12:57:40 +02:00
Steve Horsman	2ac6bb173b	Merge pull request #13036 from stevenhorsman/jaeger-to-otlp-tracing-switch trace-forwarder: migrate from Jaeger to OTLP exporter	2026-06-05 14:30:26 +01:00
Steve Horsman	1624ebe362	Merge pull request #13135 from kata-containers/dependabot/cargo/tar-0.4.46 build(deps): bump tar from 0.4.45 to 0.4.46	2026-06-05 09:44:46 +01:00
stevenhorsman	b737ae48bf	trace-forwarder: migrate from Jaeger to OTLP exporter Migrate trace-forwarder from the deprecated opentelemetry-jaeger exporter to the modern opentelemetry-otlp exporter. This change remediates GHSA-2f9f-gq7v-9h6m (CVE-2026-43868), a medium-severity vulnerability in Apache Thrift. The opentelemetry-jaeger crate is no longer maintained and depends on vulnerable thrift versions (0.13.0 and 0.16.0). The opentelemetry-otlp exporter does not use thrift and is actively maintained. Changes: - Replace opentelemetry-jaeger with opentelemetry-otlp in Cargo.toml - Update tracer.rs to use OTLP exporter instead of Jaeger exporter - Replace --jaeger-host/--jaeger-port flags with --otlp-endpoint flag - Update server.rs to use TracerProvider instead of SpanExporter - Update documentation to reflect OTLP migration - Add examples for common OTLP-compatible collectors Breaking change: Users must update their trace-forwarder invocations to use --otlp-endpoint instead of --jaeger-host and --jaeger-port. Default endpoint: http://localhost:4317 (OTLP gRPC) Generated-by: IBM Bob Signed-off-by: stevenhorsman <steven@uk.ibm.com> Co-authored-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2026-06-04 19:39:47 +01:00
Dan Mihai	c78ccc2e9f	Merge pull request #13088 from kata-containers/dependabot/cargo/openssl-0.10.80 build(deps): bump openssl from 0.10.79 to 0.10.80	2026-06-04 11:38:08 -07:00
Fabiano Fidêncio	743b0a4839	Merge pull request #13165 from stevenhorsman/bump-go-to-1.25.11 versions: bump golang to 1.25.11	2026-06-04 20:24:57 +02:00
Fabiano Fidêncio	80e2473440	runtime-rs: shut down shim daemon on a failed create When CreateContainer fails before the runtime instance is registered (e.g. a hypervisor/cgroup error), no sandbox exists to drive the normal teardown. containerd's follow-up Shutdown RPC then reaches get_runtime_instance(), fails with "runtime not ready", and returns before the service loop is ever told to stop. Because the shim ignores SIGTERM, the containerd-shim-kata-v2 daemon is left running and orphaned. Make the Shutdown RPC force the daemon to exit when there is no runtime instance, emitting the same Action::Shutdown that sandbox.shutdown() sends on the normal path. This guarantees the shim process is reaped after a failed create instead of leaking. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com> Assisted-by: Cursor <noreply@cursor.com>	2026-06-04 14:12:01 +02:00
Fabiano Fidêncio	2a1ce7b8c4	Merge pull request #12539 from mythi/no-vcpu-hotplug Disable CPU hotplug when confidential guest setting enabled	2026-06-04 10:56:52 +02:00
dependabot[bot]	4ab63d0a5d	build(deps): bump tar from 0.4.45 to 0.4.46 Bumps [tar](https://github.com/composefs/tar-rs) from 0.4.45 to 0.4.46. - [Release notes](https://github.com/composefs/tar-rs/releases) - [Commits](https://github.com/composefs/tar-rs/compare/0.4.45...0.4.46) --- updated-dependencies: - dependency-name: tar dependency-version: 0.4.46 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com>	2026-06-04 07:52:44 +00:00
dependabot[bot]	d155f1a4ab	build(deps): bump openssl from 0.10.79 to 0.10.80 Bumps [openssl](https://github.com/rust-openssl/rust-openssl) from 0.10.79 to 0.10.80. - [Release notes](https://github.com/rust-openssl/rust-openssl/releases) - [Commits](https://github.com/rust-openssl/rust-openssl/compare/openssl-v0.10.79...openssl-v0.10.80) --- updated-dependencies: - dependency-name: openssl dependency-version: 0.10.80 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com>	2026-06-04 07:51:50 +00:00
stevenhorsman	879912be25	versions: bump golang to 1.25.11 Bump the go version to resolve CVEs: - GO-2026-5037 - GO-2026-5038 - GO-2026-5039 Signed-off-by: stevenhorsman <steven@uk.ibm.com> Generated-By: IBM Bob	2026-06-04 08:49:17 +01:00
Mikko Ylinen	e475d870fb	runtime: qemu: don't set maxcpus when confidential guest is enabled QEMU maxcpus enables CPU hotplug capabilities but it's unused when confidential guest is enabled. Change Go runtime code to skip setting maxcpus QEMU cmdline if CPU hotplug is not needed. Commit `07db945b09` built a relationship between kernel's cmdline nr_cpus and the maxcpus config. Now that maxcpus is dropped for confidential guests, drop nr_cpus from kernel commandline too. This hopefully helps with the reference values computation too. Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com>	2026-06-03 15:27:35 +03:00
Mikko Ylinen	2e625d0bab	runtime-rs: qemu: don't set maxcpus when confidential guest is enabled QEMU maxcpus enables CPU hotplug capabilities but it's unused when confidential guest is enabled. Change runtime-rs code to skip setting maxcpus QEMU cmdline if CPU hotplug is not needed. Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com>	2026-06-03 15:27:35 +03:00
stevenhorsman	46d704a7ab	log-parser: bump golang.org/x/sys dependency Bump golang.org/x/sys from v0.1.0 to v0.44.0 to resolve CVE: - GO-2026-5024 Signed-off-by: stevenhorsman <steven@uk.ibm.com> Generated-By: IBM Bob	2026-06-03 09:56:54 +01:00
stevenhorsman	08ab789d9a	csi-kata-directvolume: bump golang.org/x dependencies Bump golang.org/x/net from v0.53.0 to v0.55.0 and golang.org/x/sys from v0.43.0 to v0.44.0 to resolve CVEs: - GO-2026-5024 - GO-2026-5025 - GO-2026-5026 - GO-2026-5027 - GO-2026-5028 - GO-2026-5029 - GO-2026-5030 Signed-off-by: stevenhorsman <steven@uk.ibm.com> Generated-By: IBM Bob	2026-06-03 09:56:54 +01:00
stevenhorsman	c0f549860e	runtime: bump golang.org/x dependencies Bump golang.org/x/net from v0.53.0 to v0.55.0 and golang.org/x/sys from v0.43.0 to v0.44.0 to resolve CVEs: - GO-2026-5024 - GO-2026-5025 - GO-2026-5026 - GO-2026-5027 - GO-2026-5028 - GO-2026-5029 - GO-2026-5030 Signed-off-by: stevenhorsman <steven@uk.ibm.com> Generated-By: IBM Bob	2026-06-03 09:56:54 +01:00
Fabiano Fidêncio	a2bb3f64b0	Merge pull request #12436 from mythi/tdx-updates-2026-3 runtime(-rs): tdx: use TDX QGS via unix-domain-socket by default	2026-06-03 08:50:26 +02:00
Fabiano Fidêncio	ecd9344dd1	Merge pull request #13144 from stevenhorsman/bump-rust-to-1.94 Bump rust to 1.94	2026-06-02 09:58:56 +02:00
Fabiano Fidêncio	230e01b04e	Merge pull request #13126 from kata-containers/topic/runtimes-introduce-azure-specific-configs runtime/runtime-rs: introduce Azure specific configs	2026-06-02 09:17:09 +02:00
Manuel Huber	57ee67a6aa	runtime-rs: Honor enable_debug for logs Make enable_debug promote the effective component log level from the default info level to debug for runtime, agent, and hypervisor logs. Keep an explicit log_level value authoritative so users can still choose trace, warn, or another level. Signed-off-by: Manuel Huber <manuelh@nvidia.com> Assisted-by: OpenAI Codex <codex@openai.com>	2026-06-01 21:29:08 +00:00
stevenhorsman	b1928cc22f	runtime-rs: run cargo fmt for Rust 1.94 Run cargo fmt on runtime-rs to ensure consistent formatting with Rust 1.94 toolchain. Signed-off-by: stevenhorsman <steven@uk.ibm.com> Generated-By: IBM Bob Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2026-06-01 17:32:06 +01:00
stevenhorsman	f9c95a279e	dragonball: Remove unnecessary unsafe blocks in cpuid Rust 1.94 now warns about unnecessary unsafe blocks around __get_cpuid_max(), __cpuid_count(), and host_cpuid() calls. Remove the unsafe blocks as they are no longer needed. This fixes the following clippy warnings in dbs-arch: - warning: unnecessary `unsafe` block at brand_string.rs:106 - warning: unnecessary `unsafe` block at brand_string.rs:114 - warning: unnecessary `unsafe` block at common.rs:28 - warning: unnecessary `unsafe` block at common.rs:36 Signed-off-by: stevenhorsman <steven@uk.ibm.com> Generated-By: IBM Bob	2026-06-01 17:07:16 +01:00

1 2 3 4 5 ...

6551 Commits