Based on the guidance from @Xynnn007 in #10851
> The new version of image-rs will do attestation once
> `ClientBuilder::build().await` is called, while the old version
> will do so lazily when the first image pull request comes.
> Looks like it's called in rpc::start() in the kata-agent, when
> I'm afraid the network hasn't been initialized yet.
> I am not sure if the guest network is prepared after
> the DNS is configured (in create_sandbox);
> if so, we can move (the init_image_service) right after that.
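So let's move init_image_service right after the DNS setup in create_sandbox. A minimal sketch of the resulting order, with stand-in helpers (the real agent code differs):
```rust
use anyhow::Result;

// Hypothetical stand-ins for the agent's real helpers.
fn setup_guest_dns() -> Result<()> { Ok(()) }
async fn init_image_service() -> Result<()> { Ok(()) }

// create_sandbox sketch: init_image_service() now runs only after DNS is
// configured, so the attestation triggered by ClientBuilder::build() has
// a working network path.
async fn create_sandbox() -> Result<()> {
    setup_guest_dns()?;
    init_image_service().await?;
    Ok(())
}
```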
Signed-off-by: stevenhorsman <steven@uk.ibm.com>
This brings in the commit bumping ttrpc to 0.8.4, which fixes
connection issues with kernel 6.12.9+.
As image-rs has a new builder pattern and several of the values in the
image client config have been renamed, let's change the agent to account
for this.
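Roughly the following shape, as a hedged sketch (the stand-in builder below only mirrors the pattern; the real image-rs API and the renamed config fields may differ):
```rust
// Stand-in types mirroring the new builder pattern; not the real image-rs.
#[derive(Default)]
struct ClientBuilder {
    work_dir: String,
}

struct ImageClient {
    work_dir: String,
}

impl ClientBuilder {
    fn work_dir(mut self, dir: &str) -> Self {
        self.work_dir = dir.into();
        self
    }
    // With the new image-rs, build() is also the point where attestation
    // happens (see the previous commit).
    async fn build(self) -> Result<ImageClient, String> {
        Ok(ImageClient { work_dir: self.work_dir })
    }
}

async fn init_image_service() -> Result<ImageClient, String> {
    ClientBuilder::default()
        .work_dir("/run/kata-containers/image")
        .build()
        .await
}
```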
Signed-off-by: Tobin Feldman-Fitzthum <tobin@linux.ibm.com>
Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>
Signed-off-by: stevenhorsman <steven@uk.ibm.com>
cgroups v2 enforces stricter delegation rules, preventing operations on
cgroups outside our ownership boundary. When running Docker-in-Docker (DinD),
processes must be attached to an "init" subcgroup within the systemd unit.
This fix detects and uses the init subcgroup when proxying process attachment.
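A minimal sketch of that detection, assuming the "init" subcgroup name from the description above (the runtime's real helper differs):
```rust
use std::path::{Path, PathBuf};

// Prefer the "init" subcgroup inside the systemd unit's cgroup when it
// exists; under cgroups v2 delegation we may only operate inside our own
// subtree, so DinD processes must be attached there.
fn attach_target(unit_cgroup: &Path) -> PathBuf {
    let init = unit_cgroup.join("init");
    if init.is_dir() {
        init
    } else {
        unit_cgroup.to_path_buf()
    }
}
```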
Fixes: #10733
Signed-off-by: Antoine Gaillard <antoine.gaillard@datadoghq.com>
When trying to deploy nydus on kcli locally we get the
following failure:
```
root@sh-kata-ci1:~# kubectl get pods -n nydus-system
NAMESPACE NAME READY STATUS RESTARTS AGE
nydus-system nydus-snapshotter-5kdqs 0/1 CrashLoopBackOff 4 (84s ago) 7m29s
```
Digging into this I found that the nydus-snapshotter service
is failing with:
```
ubuntu@kata-k8s-worker-0:~$ journalctl -u nydus-snapshotter.service
-- Logs begin at Wed 2025-02-12 15:06:08 UTC, end at Wed 2025-02-12 15:20:27 UTC. --
Feb 12 15:10:39 kata-k8s-worker-0 systemd[1]: Started nydus snapshotter.
Feb 12 15:10:39 kata-k8s-worker-0 containerd-nydus-grpc[6349]: /usr/local/bin/containerd-nydus-grpc:
/lib/x86_64-linux-gnu/libc.so.6: version `GLIBC_2.32' not found (required b>
Feb 12 15:10:39 kata-k8s-worker-0 containerd-nydus-grpc[6349]: /usr/local/bin/containerd-nydus-grpc:
/lib/x86_64-linux-gnu/libc.so.6: version `GLIBC_2.34' not found (required b>
Feb 12 15:10:39 kata-k8s-worker-0 systemd[1]: nydus-snapshotter.service: Main process exited, code=exited, status=1/FAILURE
```
I think this is because Ubuntu 20.04 ships glibc:
```
ubuntu@kata-k8s-worker-0:~$ ldd --version
ldd (Ubuntu GLIBC 2.31-0ubuntu9.16) 2.31
```
so it's too old for the nydus snapshotter.
Also, Ubuntu 20.04 is EoL soon, so bumping is better anyway.
Signed-off-by: stevenhorsman <steven@uk.ibm.com>
Some problems hidden in the `dbs` crates were revealed after making these
crates workspace components; fix them according to `cargo clippy` suggestions.
Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>
Peer pods have a Linux namespace of type network. We want to make sure that all
containers in the same pod use the same namespace. Therefore, we add the first
namespace path to the state and check all other requests against it.
This commit also adds the corresponding integration test in the policy crate,
showcasing the benefit of having Rust integration tests for the policy.
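A simplified sketch of the stateful check (the struct and method names are illustrative, not the actual policy crate API):
```rust
use std::sync::Mutex;

#[derive(Default)]
struct PolicyState {
    // Network namespace path recorded from the pod's first container.
    netns_path: Mutex<Option<String>>,
}

impl PolicyState {
    // Record the first namespace path, then require every later
    // CreateContainerRequest in the pod to use the same one.
    fn check_netns(&self, path: &str) -> bool {
        let mut stored = self.netns_path.lock().unwrap();
        match stored.as_deref() {
            None => {
                *stored = Some(path.to_string());
                true
            }
            Some(first) => first == path,
        }
    }
}
```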
Signed-off-by: Leonard Cohnen <lc@edgeless.systems>
The generated rego policies for `CreateContainerRequest` are stateful, and that
state is handled in the policy crate. We use this policy crate in the
genpolicy integration test to check that those state changes are
handled correctly without spinning up an agent or even a cluster.
This also makes it easy to test at, e.g., the CreateContainerRequest level
instead of relying on changing the YAML that is applied to a cluster.
Signed-off-by: Leonard Cohnen <lc@edgeless.systems>
This commit allows genpolicy to be invoked programmatically, so that other
Rust tools that don't want to consume genpolicy as a binary can generate
policies. One such use case is the policy integration test implemented in the
following commits.
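As a rough illustration of the call shape (the module below is a stub; the real exposed API is not confirmed here):
```rust
use anyhow::Result;

// Hypothetical library-style entry point; real genpolicy exposes its own
// types, this stub only illustrates programmatic invocation.
mod genpolicy_stub {
    use super::Result;

    pub struct Config {
        pub yaml: String,
    }

    pub fn generate_policy(config: &Config) -> Result<String> {
        // Real genpolicy would parse the Kubernetes YAML and emit rego.
        Ok(format!(
            "package agent_policy\n# derived from {} bytes of YAML",
            config.yaml.len()
        ))
    }
}

fn main() -> Result<()> {
    let rego = genpolicy_stub::generate_policy(&genpolicy_stub::Config {
        yaml: "apiVersion: v1\nkind: Pod".into(),
    })?;
    println!("{rego}");
    Ok(())
}
```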
Signed-off-by: Leonard Cohnen <lc@edgeless.systems>
The policy module augments the policy generated with genpolicy by keeping
state and providing it to each invocation.
Therefore, it is no longer sufficient to test the passing of requests in
the genpolicy crate.
Since Rust integration tests cannot call functions that are not exposed
publicly, this commit factors the agent's policy module out into its
own crate and exposes the necessary functions to be consumed by the agent
and an integration test. The integration test itself is implemented in the
following commits.
Signed-off-by: Leonard Cohnen <lc@edgeless.systems>
Since CPU SNP support is upstreamed and available via our
default QEMU target, we're repurposing SNP-experimental
for the GPU+SNP enablement.
The first step is to update the version we're basing it off of.
Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>
The latest containerd has an issue with its e2e test, so we apply
the following fix to work around it. For more info about this issue,
please see:
https://github.com/containerd/containerd/pull/11240
Once that PR is merged and a new version is released, we can remove
this workaround.
Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>
A test case is added based on the integrated cri-containerd case.
The difference between the cri-containerd integration test case and the
sandbox API test case is the "sandboxer" setting in the sandbox runtime
handler. If "sandboxer" is set to "" or "podsandbox", containerd uses the
legacy shimv2 API, and if it is set to "shim", it uses the sandbox API to
launch the pod.
In addition, add a containerd v2.0.0 version, because containerd officially
supports the sandbox API from version 2.0.0.
Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>
Add and register the sandbox API service, so that runtime-rs
can handle the sandbox API RPC calls from containerd.
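Schematically, with stand-in types (the real code uses the ttrpc-generated Sandbox bindings and runtime-rs's actual server, and the service names below are assumptions):
```rust
use std::collections::HashMap;

// Stand-in for a ttrpc method table; runtime-rs builds these from the
// generated create_* helpers instead.
type Method = fn(&[u8]) -> Vec<u8>;

#[derive(Default)]
struct Server {
    services: HashMap<String, HashMap<String, Method>>,
}

impl Server {
    fn register_service(mut self, name: &str, methods: HashMap<String, Method>) -> Self {
        self.services.insert(name.to_string(), methods);
        self
    }
}

fn main() {
    // Register the new SandboxService alongside the existing TaskService.
    let server = Server::default()
        .register_service("containerd.task.v2.Task", HashMap::new())
        .register_service("containerd.runtime.sandbox.v1.Sandbox", HashMap::new());
    println!("registered {} services", server.services.len());
}
```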
Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>
For Kata Containers, we add a SandboxService for these new calls alongside
the existing TaskService, including processing requests and replies, and
properly calling VirtSandbox's interfaces. By splitting the start logic of
the sandbox, virt_container is compatible with calls from both the
SandboxService and the TaskService. In addition, we modify the processing
of resource configuration to solve the problem that the SandboxService does
not have a spec file when creating a pod.
The sandbox API is supported from containerd 1.7, but enabling it differs
between 1.7 and 2.0.
To enable it from 2.0, you can support the sandbox API for a specific
runtime by adding sandboxer = "shim"; taking the kata runtime as an example:
```
[plugins."io.containerd.grpc.v1.cri".containerd.runtimes.kata]
runtime_type = "io.containerd.kata.v2"
sandboxer = "shim"
privileged_without_host_devices = true
pod_annotations = ["io.katacontainers.*"]
```
For containerd version 1.7, you can enable it by:
1: adding the env var ENABLE_CRI_SANDBOXES=true
2: adding sandbox_mode = "shim" to the runtime config
Acknowledgement
This work was based on @wllenyj's POC code:
(f5b62a2d7c)
Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>
Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>
For the processing of init sandbox, the task API init has some
special processing steps that the sandbox API init does not,
so these two types of init are separated here.
Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>
When starting the sandbox, the sandbox id is passed on the
shim command line, so we only need to get the container id from the
OCI spec when starting a pod container, not when starting the pod sandbox.
Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>
When the sandbox API is enabled, the pause container is
removed and the sandbox start API only passes an empty bundle
directory, which means there is no OCI spec file under it; thus
the cgroup config can't get the cgroup path from the pause container's
OCI spec. So we should set a default cgroup path for the sandbox API
case.
In the future, we can push containerd to pass the cgroup path during
the sandbox start phase.
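A minimal sketch of the fallback (the default path format here is an assumption, not the runtime's actual value):
```rust
// If the pause container's OCI spec provided a cgroup path, use it;
// otherwise (sandbox API: empty bundle, no spec) fall back to a default
// derived from the sandbox id.
fn sandbox_cgroup_path(spec_cgroup: Option<&str>, sandbox_id: &str) -> String {
    match spec_cgroup {
        Some(path) => path.to_string(),
        None => format!("/kata_{sandbox_id}"),
    }
}
```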
Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>
Remove block_device_cache_direct from the fc config in runtime-rs, because
fc doesn't support this option.
Fixes: #10849
Signed-off-by: Hui Zhu <teawater@antgroup.com>
Add block_device_cache_direct to the ch and dragonball configs in
runtime-rs, because they support this option.
Fixes: #10849
Signed-off-by: Hui Zhu <teawater@antgroup.com>
This commit changes `config` in CloudHypervisorInner to a plain
HypervisorConfig to reduce changes to its type.
Fixes: #10849
Signed-off-by: Hui Zhu <teawater@antgroup.com>
Otherwise we may end up simply unpacking kata-containers specific
binaries into the same location where the system ones are needed, leading to
a broken system (most likely what happened with the metrics CI, and also
what's happening with the GHA runners).
Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>
We've been hitting issues with the CentOS 9 Stream machine, which Intel
doesn't have cycles to debug.
After raising this in the Confidential Containers community meeting,
we got the green light from Red Hat (Ariel Adam) to just disable the
CentOS 9 Stream based CI for now.
Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>
IBM SE ensures the initrd is measured by genprotimg and verified by the
ultravisor.
Let's not build the measured rootfs on s390x.
Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>