kata-containers

mirror of https://github.com/kata-containers/kata-containers.git synced 2025-08-14 06:06:12 +00:00

Author	SHA1	Message	Date
Alex Lyn	556255cdd5	ci: bugfix Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2025-08-12 11:39:43 +01:00
Alex Lyn	4a091438a9	runtime-rs: support initdata within nontee scenarios NoProtection cases Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2025-08-12 11:39:43 +01:00
Alex Lyn	aa973f8b59	kata-types: adjust initdata with runtime.cc_init_data Debug cc_init_data Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2025-08-12 11:39:43 +01:00
Alex Lyn	873df29b3f	CI: debug guest pull image Debug guest pull image Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2025-08-12 11:39:43 +01:00
Sumedh Alok Sharma	0398073c55	agent-ctl: Add option --vm to boot pod VM for testing. This change introduces a new command line option `--vm` to boot up a pod VM for testing. The tool connects with kata agent running inside the VM to send the test commands. The tool uses `hypervisor` crates from runtime-rs for VM lifecycle management. Current implementation supports Qemu & Cloud Hypervisor as VMMs. In summary: - tool parses the VMM specific runtime-rs kata config file in /opt/kata/share/defaults/kata-containers/runtime-rs/* - prepares and starts a VM using runtime-rs::hypervisor vm APIs - retrieves agent's server address to setup connection - tests the requested commands & shuts down the VM Fixes #11566 Signed-off-by: Sumedh Alok Sharma <sumsharma@microsoft.com>	2025-08-12 11:39:43 +01:00
stevenhorsman	e451e3dcd0	WIP: tests/k8s: call teardown_common in some policy tests The teardown_common will print the description of the running pods, kill them all and print the system's syslogs afterwards. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-08-12 11:29:34 +01:00
stevenhorsman	8a53ac5618	workflows: Add Delete AKS cluster timeout When testing this branch, on several occasions the Delete AKS cluster step has hung for multiple hours, so add a timeout to prevent this. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-08-11 11:00:54 +01:00
stevenhorsman	284c2db931	tests/k8s: call teardown_common in k8s-job.bats The teardown_common will print the description of the running pods, kill them all and print the system's syslogs afterwards. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-08-11 11:00:54 +01:00
stevenhorsman	6b395c8556	DO NOT MERGE: Comment out tests to save ci cycles	2025-08-11 11:00:54 +01:00
stevenhorsman	86ecaffb78	tests/k8s: Enable tests for qemu-runtime-rs-coco-dev Add the runtime class to the non-tee tests and enable it to run in the test code Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-08-11 11:00:24 +01:00
stevenhorsman	2741e42b34	kata-deploy: Add kata-qemu-runtime-rs-coco-dev runtime class Add the runtime class and shim references for the new non-tee runtime-rs class Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-08-11 11:00:24 +01:00
stevenhorsman	3c7cfd0c36	runtime-rs: Add qemu-runtime-rs-coco-dev Create non-tee runtime class for runtime-rs qemu CoCo development without requiring TEE hardware. Based on the qemu-runtime-rs config, but with updated guest image, kernel and shared_fs Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-08-11 11:00:24 +01:00
Alex Lyn	f0b3fb2796	runtime-rs: Support share-rw=true when hotplug block device within qemu Support for the share-rw=true parameter has been added. While this parameter is essential for maintaining data consistency across multiple QEMU instances sharing a backend disk image, its implementation also serves to standardize parameters with the block device hotplug functionality in kata-runtime/qemu. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2025-08-11 11:00:24 +01:00
Alex Lyn	e0e5cf2180	runtime-rs: Add idempotency to hotplug block device operations Due to the lack of atomicity in the operation, a partial failure can lead to an inconsistent QEMU state, which pollutes subsequent operations. This can easily trigger a "Duplicate nodes" error. To prevent this, we should query the state before performing the operation. We should ensure its validation and idempotency; making the function idempotent allows it to be safely retried. Fixes #11649 Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2025-08-11 11:00:24 +01:00
Alex Lyn	ef175797b6	runtime-rs: move get_scsi_id_lun upper within hotplug_block_device Move the closure get_scsi_id_lun upper within hotplug_block_device and make it more helpful. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2025-08-11 11:00:24 +01:00
Hyounggyu Choi	407252a863	Merge pull request #11641 from Apokleos/kata-log runtime-rs: Label system journal log with kata	2025-08-11 08:44:31 +02:00
Alex Lyn	196d7d674d	runtime-rs: Label system journal log with kata Route kata-shim logs directly to systemd-journald under 'kata' identifier. This refactoring enables `kata-shim` logs to be properly attributed to 'kata' in systemd-journald, instead of inheriting the 'containerd' identifier. Previously, `kata-shim` logs were challenging to filter and debug as they appeared under the `containerd.service` unit. This commit resolves this by: 1. Introducing a `LogDestination` enum to explicitly define logging targets (File or Journal). 2. Modifying logger creation to set `SYSLOG_IDENTIFIER=kata` when logging to Journald. 3. Ensuring type safety and correct ownership handling for different logging backends. This significantly enhances the observability and debuggability of Kata Containers, making it easier to monitor and troubleshoot Kata-specific events. Fixes: #11590 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com> Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2025-08-10 16:00:36 +08:00
Aurélien Bombo	be148c7f72	Merge pull request #11666 from kata-containers/sprt/static-check-exclude-security-md ci: static-checks: add SECURITY.md to exclude list	2025-08-08 12:50:29 -05:00
Fabiano Fidêncio	dcbdf56281	Merge pull request #11660 from zvonkok/remove-stable ci: Remove stable	2025-08-08 14:18:25 +02:00
Xuewei Niu	1d2f2d6350	Merge pull request #11219 from fidencio/topic/version-qemu-bump-to-10.0.0 version: Bump QEMU to v10.0.0	2025-08-08 19:04:45 +08:00
RuoqingHe	aaf8de3dbf	Merge pull request #11669 from kevinzs2048/add-timeout ci: cri-containerd: add 5s timeout for creating sandbox with crictl	2025-08-08 18:25:58 +08:00
Alex Lyn	9816ffdac7	Merge pull request #11653 from Apokleos/align-initdata-annoation Align initdata annotation with kata-runtime	2025-08-08 16:24:09 +08:00
Kevin Zhao	1aa65167d7	CI: cri-containerd: add 5s timeout for creating sandbox with crictl After moving Arm64 CI nodes to a new one, we faced an interesting timeout issue when executing the command with crictl runp; the error is usually: code = DeadlineExceeded Fixes: #11662 Signed-off-by: Kevin Zhao <kevin.zhao@linaro.org>	2025-08-08 15:41:39 +08:00
Fupan Li	b50777a174	Merge pull request #10580 from pmores/make-vcpu-allocation-more-accurate runtime-rs: make vcpu allocation more accurate	2025-08-08 14:14:40 +08:00
Xuewei Niu	beea0c34c5	Merge pull request #11060 from kata-containers/sprt/vfsd-metadata runtime: virtio-fs: Support "metadata" cache mode	2025-08-08 11:13:57 +08:00
Fabiano Fidêncio	f9e16431c1	version: Bump QEMU to v10.0.3 As the new release of QEMU is out, let's switch to it and take advantage of bug fixes and improvements. QEMU changelog: https://wiki.qemu.org/ChangeLog/10.0 Signed-off-by: Fabiano Fidêncio <fidencio@northflank.com>	2025-08-07 22:31:30 +02:00
Greg Kurz	f9a6359674	Merge pull request #11667 from c3d/bug/11633-qmp qemu: Respect the JSON schema for hot plug	2025-08-07 16:04:12 +02:00
Aurélien Bombo	6d96875d04	runtime: virtio-fs: Support "metadata" cache mode The Rust virtiofsd supports a "metadata" cache mode [1] that wasn't present in the C version [2], so this PR adds support for that. [1] https://gitlab.com/virtio-fs/virtiofsd [2] https://qemu.weilnetz.de/doc/5.1/tools/virtiofsd.html#cmdoption-virtiofsd-cache Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2025-08-07 21:24:40 +08:00
Pavel Mores	69f21692ed	runtime-rs: enable vcpu allocation tests in CI This series should make runtime-rs's vcpu allocation behaviour match the behaviour of runtime-go so we can now enable pertinent tests which were skipped so far due to the difference between both shims. Signed-off-by: Pavel Mores <pmores@redhat.com>	2025-08-07 10:32:44 +02:00
Pavel Mores	00bfa3fa02	runtime-rs: re-adjust config after modifying it with annotations Configuration information is adjusted after loading from file but so far, there has been no similar check for configuration coming from annotations. This commit introduces re-adjusting config after annotations have been processed. A small refactor was necessary as a prerequisite which introduces function TomlConfig::adjust_config() to make it easier to invoke the adjustment for a whole TomlConfig instance. This function is analogous to the existing validate() function. The immediate motivation for this change is to make sure that 0 in "default_vcpus" annotation will be properly adjusted to 1 as is the case if 0 is loaded from a config file. This is required to match the golang runtime behaviour. Signed-off-by: Pavel Mores <pmores@redhat.com>	2025-08-07 10:32:44 +02:00
Pavel Mores	e2156721fd	runtime-rs: add tests to exercise floating-point 'default_vcpus' Also included (as commented out) is a test that does not pass although it should. See source code comment for explanation why fixing this seems beyond the scope of this PR. Signed-off-by: Pavel Mores <pmores@redhat.com>	2025-08-07 10:32:44 +02:00
Pavel Mores	1f95d9401b	runtime-rs: change representation of default_vcpus from i32 to f32 This commit focuses purely on the formal change of type. If any subsequent changes in semantics are needed they are purposely avoided here so that the commit can be reviewed as a 100% formal and 0% semantic change. Signed-off-by: Pavel Mores <pmores@redhat.com>	2025-08-07 10:32:44 +02:00
Pavel Mores	cdc0eab8e4	runtime-rs: make sandbox vcpu allocation more accurate This commit addresses a part of the same problem as PR #7623 did for the golang runtime. So far we've been rounding up individual containers' vCPU requests and then summing them up which can lead to allocation of excess vCPUs as described in the mentioned PR's cover letter. We address this by reversing the order of operations, we sum the (possibly fractional) container requests and only then round up the total. We also align runtime-rs's behaviour with runtime-go in that we now include the default vcpu request from the config file ('default_vcpu') in the total. We diverge from PR #7623 in that `default_vcpu` is still treated as an integer (this will be a topic of a separate commit), and that this implementation avoids relying on 32-bit floating point arithmetic as there are some potential problems with using f32. For instance, some numbers commonly used in decimal, notably all of single-decimal-digit numbers 0.1, 0.2 .. 0.9 except 0.5, are periodic in binary and thus fundamentally not representable exactly. Arithmetics performed on such numbers can lead to surprising results, e.g. adding 0.1 ten times gives 1.0000001, not 1, and taking a ceil() results in 2, clearly a wrong answer in vcpu allocation. So instead, we take advantage of the fact that container requests happen to be expressed as a quota/period fraction so we can sum up quotas, fundamentally integral numbers (possibly fractional only due to the need to rewrite them with a common denominator) with much less danger of precision loss. Signed-off-by: Pavel Mores <pmores@redhat.com>	2025-08-07 10:32:44 +02:00
Christophe de Dinechin	ec480dc438	qemu: Respect the JSON schema for hot plug When hot-plugging CPUs on QEMU, we send a QMP command with JSON arguments. QEMU 9.2 recently became more strict[1] enforcing the JSON schema for QMP parameters. As a result, running Kata Containers with QEMU 9.2 results in a message complaining that the core-id parameter is expected to be an integer: ``` qmp hotplug cpu, cpuID=cpu-0 socketID=1, error: QMP command failed: Invalid parameter type for 'core-id', expected: integer ``` Fix that by changing the core-id, socket-id and thread-id to be integer values. [1]: `be93fd5372` Fixes: #11633 Signed-off-by: Christophe de Dinechin <dinechin@redhat.com>	2025-08-07 09:13:57 +02:00
Alex Lyn	37685c41c7	runtime-rs: Correct the corresponding initdata annotation const As we have changed the initdata annotation definition, we also need to correct its const definition with KATA_ANNO_CFG_RUNTIME_INIT_DATA. Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2025-08-07 10:45:28 +08:00
Alex Lyn	163f04a918	Merge pull request #11651 from microsoft/danmihai1/debug-kubectl-logs tests: k8s-sandbox-vcpus-allocation debug info	2025-08-07 10:27:29 +08:00
Aurélien Bombo	e3b4d87b6d	ci: static-checks: add SECURITY.md to exclude list This adds SECURITY.md to the list of GH-native files that should be excluded by the reference checker. Today this is useful for downstreams who already have a SECURITY.md file for compliance reasons. When Kata onboards that file, this commit will also be required. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2025-08-06 11:24:52 -05:00
Zvonko Kaiser	1b1b3af9ab	ci: Remove trigger for stable branch We do not support stable branches anymore, remove the trigger for it. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-08-06 09:22:24 +08:00
Hyounggyu Choi	af01434226	Merge pull request #11646 from kata-containers/sprt/param-static-checks ci: static-checks: Auto-detect repo by default	2025-08-05 22:13:20 +02:00
Alex Lyn	ede773db17	kata-types: Align the initdata annotation with kata-runtime's definition To make it work within CI, we do alignment with kata-runtime's definition with "io.katacontainers.config.runtime.cc_init_data". Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2025-08-03 22:51:39 +08:00
Dan Mihai	05eca5ca25	tests: k8s-sandbox-vcpus-allocation debug info Print more details about the behavior of "kubectl logs", trying to understand errors like: https://github.com/kata-containers/kata-containers/actions/runs/16662887973/job/47164791712 not ok 1 Check the number vcpus are correctly allocated to the sandbox (in test file k8s-sandbox-vcpus-allocation.bats, line 37) `[ `kubectl logs ${pods[$i]}` -eq ${expected_vcpus[$i]} ]' failed with status 2 No resources found in kata-containers-k8s-tests namespace. ... k8s-sandbox-vcpus-allocation.bats: line 37: [: -eq: unary operator expected Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2025-08-01 20:09:17 +00:00
Aurélien Bombo	c47bff6d6a	Merge pull request #11637 from kata-containers/sprt/remove-install-az-cli gha: Remove unnecessary install-azure-cli step	2025-08-01 09:34:46 -05:00
Fabiano Fidêncio	82f141a02e	Merge pull request #11632 from burgerdev/codegen runtime: reproducible generation of Golang proto bindings	2025-07-31 23:49:18 +02:00
Fabiano Fidêncio	7198c8789e	Merge pull request #11639 from zvonkok/gpu_guest_components gpu: guest components	2025-07-31 21:42:31 +02:00
Aurélien Bombo	9585e608e5	ci: static-checks: Auto-detect repo by default This auto-detects the repo by default (instead of having to specify KATA_DEV_MODE=true) so that forked repos can leverage the static-checks.yaml CI check without modification. An alternative would have been to pass the repo in static-checks.yaml. However, because of the matrix, this would've changed the check name, which is a pain to handle in either the gatekeeper/GH UI. Example fork failure: https://github.com/microsoft/kata-containers/actions/runs/16656407213/job/47142421739#step:8:75 I've tested this change to work in a fork. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2025-07-31 14:33:24 -05:00
Zvonko Kaiser	8422411d91	gpu: Add coco guest components The second stage needs to consider the coco guest components Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-07-31 17:11:21 +00:00
Markus Rudy	3fd354b991	ci: add codegen to static-checks Signed-off-by: Markus Rudy <mr@edgeless.systems> Fixes: #11631 Co-authored-by: Steve Horsman <steven@uk.ibm.com>	2025-07-31 17:58:25 +01:00
Markus Rudy	9e38fd2562	tools: add image for Go proto bindings In order to have a reproducible code generation process, we need to pin the versions of the tools used. This is accomplished easiest by generating inside a container. This commit adds a container image definition with fixed dependencies for Golang proto/ttrpc code generation, and changes the agent Makefile to invoke the update-generated-proto.sh script from within that container. Signed-off-by: Markus Rudy <mr@edgeless.systems>	2025-07-31 17:58:25 +01:00
Markus Rudy	f7a36df290	runtime: generate proto files The generated Go bindings for the agent are out of date. This commit was produced by running src/agent/src/libs/protocols/hack/update-generated-proto.sh with protobuf compiler versions matching those of the last run, according to the generated code comments. Since there are new RPC methods, those needed to be added to the HybridVSockTTRPCMockImp. Signed-off-by: Markus Rudy <mr@edgeless.systems>	2025-07-31 17:58:25 +01:00
Fabiano Fidêncio	d077ed4c1e	Merge pull request #11645 from kata-containers/topic/fix-kbuild-sign-pin-issue build: nvidia: Fix KBUILD_SIGN_PIN breakage	2025-07-31 18:31:34 +02:00
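
The rounding order described in commit cdc0eab8e4 can be illustrated with a small sketch. This is not the runtime-rs code (which is Rust); the function names and the way `default_vcpus` is folded into the total are assumptions made for illustration. Each container request is a (quota, period) pair, and the sum is kept in integers by rewriting all quotas over a common period before rounding up once.

```python
from math import lcm  # available from Python 3.9


def vcpus_rounded_per_container(requests):
    """Earlier behaviour per the commit message: round each request up, then sum.

    `requests` is a list of (quota, period) integer pairs (hypothetical shape).
    """
    return sum(-(-quota // period) for quota, period in requests)


def vcpus_rounded_once(requests, default_vcpus=0):
    """Sum the quota/period fractions exactly, then round the total up once.

    Assumption: an integer `default_vcpus` is simply added to the total.
    """
    if not requests:
        return default_vcpus
    # Rewrite every request over a common period so the sum stays integral.
    common_period = lcm(*(period for _, period in requests))
    total_quota = sum(
        quota * (common_period // period) for quota, period in requests
    )
    total_quota += default_vcpus * common_period
    # Ceiling division on integers; no floating point involved.
    return -(-total_quota // common_period)


# Ten containers, each asking for a tenth of a vCPU.
tenths = [(10_000, 100_000)] * 10
per_container = vcpus_rounded_per_container(tenths)
summed_first = vcpus_rounded_once(tenths)
```

With ten requests of 0.1 vCPU each, rounding per container allocates ten vCPUs, while summing first allocates one. Because only integers are involved, the 0.1-added-ten-times precision problem mentioned in the commit message does not arise.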

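The fix in commit ec480dc438 comes down to sending the CPU topology fields as JSON integers rather than strings. A minimal sketch of that difference follows; it assumes the hot plug is issued through QMP `device_add`, and the helper name and the driver string are illustrative, not taken from the Kata source (where the real code is Go).

```python
import json


def cpu_hotplug_command(driver, cpu_id, socket_id, core_id, thread_id):
    """Build a QMP device_add command with integer topology fields.

    Hypothetical helper: the int() conversions are the point of the sketch,
    so that "core-id" and friends serialize as JSON numbers, not strings.
    """
    return {
        "execute": "device_add",
        "arguments": {
            "driver": driver,
            "id": cpu_id,
            "socket-id": int(socket_id),
            "core-id": int(core_id),
            "thread-id": int(thread_id),
        },
    }


# Values arrive as strings here to mimic the pre-fix situation.
command = cpu_hotplug_command("host-x86_64-cpu", "cpu-0", "1", "0", "0")
wire = json.dumps(command)
```

Serialized this way, `core-id` appears as `0` rather than `"0"`, which is the shape the stricter schema check quoted in the commit message expects.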
16596 Commits