kata-containers

mirror of https://github.com/kata-containers/kata-containers.git synced 2025-04-27 11:31:05 +00:00

Author	SHA1	Message	Date
Zvonko Kaiser	1d9915147d	release: Remove artifacts for release We need to make sure the release does not have any residual binaries left for the release payload Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-02-17 20:16:48 +00:00
Anastassios Nanos	ae1be28ddd	Merge pull request #10880 from nubificus/3.14.0-release release: Bump version to 3.14.0	2025-02-17 20:25:30 +02:00
Zvonko Kaiser	72833cb00b	Merge pull request #10878 from zvonkok/agent_cdi_timeout gpu: agent cdi timeout	2025-02-17 12:49:51 -05:00
Zvonko Kaiser	fda095a4c9	Merge pull request #10786 from zvonkok/gpu-config-update gpu: Update config files	2025-02-17 12:45:54 -05:00
Anastassios Nanos	c7347cb76d	release: Bump version to 3.14.0 Bump VERSION and helm-chart versions Signed-off-by: Anastassios Nanos <ananos@nubificus.co.uk>	2025-02-17 16:47:24 +00:00
Fabiano Fidêncio	639bc84329	Merge pull request #10787 from fidencio/topic/bump-kernel-to-6.12.11 version: Bump kernel to 6.12.13	2025-02-17 17:39:14 +01:00
Fabiano Fidêncio	7ae5fa463e	versions: Bump coco-guest-components So attestation-agent and others have a version including the ttrpc bump to v0.8.4, allowing us to use the latest LTS kernel. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2025-02-17 15:16:54 +01:00
Fabiano Fidêncio	1381cab6f0	build: Fix rootfs cache logic We've been appending to the wrong variable for quite some time, it seems, leading to not actually regenerating the rootfs when needed. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2025-02-17 13:55:36 +01:00
Fabiano Fidêncio	7fc7328bbc	versions: Bump kernel to 6.12.13 Let's try to keep up with the LTS patch releases. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2025-02-17 13:47:35 +01:00
Simon Kaegi	f5edbfd696	kernel: support loop device in v6.8+ kernels Set CONFIG_BLK_DEV_WRITE_MOUNTED=y to restore previous kernel behaviour. Kernel v6.8+ will by default block buffer writes to block devices mounted by filesystems. This unfortunately is what we need to use mounted loop devices needed by some teams to build OSIs and as an overlay backing store. More info on this config item [here](https://cateee.net/lkddb/web-lkddb/BLK_DEV_WRITE_MOUNTED.html) Fixes: #10808 Signed-off-by: Simon Kaegi <simon.kaegi@gmail.com>	2025-02-17 13:47:35 +01:00
Fabiano Fidêncio	d96e8375c4	Merge pull request #10885 from stevenhorsman/bump-agent-crates-to-resolve-CVEs agent: Bump agent crates to resolve CVEs	2025-02-17 12:11:43 +01:00
stevenhorsman	e5a284474d	deps: Update cookie-store & publicsuffix Run: ``` cargo update -p cookie-store cargo update -p publicsuffix ``` to update the version of idna and resolve CVE-2024-12224 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-02-14 17:30:03 +00:00
stevenhorsman	5656fc6139	deps: Bump reqwest Bump reqwest to 0.12.12 to pick up fixes Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-02-14 17:30:03 +00:00
stevenhorsman	3a3849efff	deps: Update quinn-proto Update quin-proto to fix CVE-2024-45311 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-02-14 17:30:03 +00:00
Fabiano Fidêncio	64ceb0832a	Merge pull request #10851 from fidencio/topic/bump-image-rs-to-bring-in-ttrpc-0.8.4 agent: Bump image-rs to 514c561d93	2025-02-14 18:21:56 +01:00
Fabiano Fidêncio	d5878437a4	Merge pull request #10845 from DataDog/dind-subcgroup-fix Add process to init subcgroup when we're using dind with cgroups v2	2025-02-14 18:12:24 +01:00
Steve Horsman	469c651fc0	Merge pull request #10879 from nubificus/fix_version packaging(release): Properly handle version tag for the release bundle	2025-02-14 14:40:37 +00:00
Zvonko Kaiser	908aacfa78	gpu: Update the logging around CDI Removed a rogue printf and updated the logging to say that we're waiting for CDI spec(s) to be generated rather than saying there is an error, it's not we have a timeout after that it is an error. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-02-14 14:32:00 +00:00
Zvonko Kaiser	4bda16565b	gpu: Update timeouts With the create_container_timeout the dial_timeout is lest important. Add the custom timeout for GPUs in create_container_timeout Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-02-14 14:29:18 +00:00
Zvonko Kaiser	66ccc25724	tdx: Update GPU config for the latest TDX stack We need extra kernel_params for TDX Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-02-14 14:29:18 +00:00
Zvonko Kaiser	d4dd87a974	gpu: Update config files With the recent changed to cgroupsv1 and AGENT_INIT=no we need update to the config files. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-02-14 14:29:18 +00:00
Anastassios Nanos	b13db29aaa	packaging(release): Properly handle version tag for the release bundle The tags created automatically for published Github releases are probably not annotated, so by simply running `git describe` we are not getting the correct tag. Use a `git describe --tags` to allow git to look at all tags, not just annotated ones. Signed-off-by: Anastassios Nanos <ananos@nubificus.co.uk>	2025-02-14 12:41:08 +00:00
Zvonko Kaiser	2499d013bd	gpu: Update handle_cdi_devices AgentConfig now has the cdi_timeout from the kernel cmdline, update the proper function signature and use it in the for loop. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-02-13 20:11:48 +00:00
Zvonko Kaiser	d28410ed75	Merge pull request #10877 from AdithyaKrishnan/main CI: Deprecate SEV	2025-02-13 14:55:11 -05:00
Zvonko Kaiser	95aa21f018	gpu: Add CDI timeout via kernel config Some systems like a DGX where we have 8 H100 or 8 H800 GPUs need some extended time to be initialized. We need to make sure we can configure CDI timeout, to enable even systems with 16 GPUs. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-02-13 19:23:19 +00:00
Adithya Krishnan Kannan	6cc5b79507	CI: Deprecate SEV Phase 1 of Issue #10840 AMD has deprecated SEV support on Kata Containers, and going forward, SNP will be the only AMD feature supported. As a first step in this deprecation process, we are removing the SEV CI workflow from the test suite to unblock the CI. Will be adding future commits to remove redundant SEV code paths. Signed-Off-By: Adithya Krishnan Kannan <AdithyaKrishnan.Kannan@amd.com>	2025-02-13 12:20:21 -06:00
Steve Horsman	0a39f59a9b	Merge pull request #10874 from stevenhorsman/skip-consistently-failing-block-volume-test tests: Skip block volume test on fc, stratovirt	2025-02-13 15:39:45 +00:00
Zvonko Kaiser	a0766986e7	Merge pull request #10832 from RuoqingHe/update-yq ci: Update yq to v4.44.5 to support riscv64	2025-02-13 08:33:02 -05:00
stevenhorsman	56fb2a9482	tests: Skip block volume test on fc, stratovirt The block volume test has failed on 10/10 nightlies and all the PRs I've seen, so skip it until it can be assessed. See #10873 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-02-13 11:50:35 +00:00
stevenhorsman	2d266df846	test: Update expected error in signed image tests We are seeing a different error in the new version of image-rs, so update our tests to match. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-02-13 11:44:51 +00:00
stevenhorsman	d28a512d29	agent: Wait for network before init_image_service Based on the guidance from @Xynnn007 in #10851 > The new version of image-rs will do attestation once ClientBuilder.build().await() is called, while the old version will do so lazily the first image pull request comes. Looks like it's called in rpc::start() in kata-agent, when I'm afraid the network hasn't been initialized yet. > I am not sure if the guest network is prepared after the DNS is configured (in create_sandbox), if so we can move (the init_image_service) right after that. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-02-13 11:44:51 +00:00
Tobin Feldman-Fitzthum	a13d5a3f04	agent: Bump image-rs to 514c561d93 As this brings in the commit bumping ttrpc to 0.8.4, which fixes connection issues with kernel 6.12.9+. As image-rs has a new builder pattern and several of the values in the image client config have been renamed, let's change the agent to account for this. Signed-off-by: Tobin Feldman-Fitzthum <tobin@linux.ibm.com> Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org> Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-02-13 11:44:51 +00:00
Steve Horsman	8614e5efc4	Merge pull request #10869 from stevenhorsman/bump-kcli-ubuntu-version ci: k8s: Bump kcli image version	2025-02-13 09:59:20 +00:00
Antoine Gaillard	4b5b788918	agent: Use init subcgroup for process attachment in DinD cgroups v2 enforces stricter delegation rules, preventing operations on cgroups outside our ownership boundary. When running Docker-in-Docker (DinD), processes must be attached to an "init" subcgroup within the systemd unit. This fix detects and uses the init subcgroup when proxying process attachment. Fixes #10733 Signed-off-by: Antoine Gaillard <antoine.gaillard@datadoghq.com>	2025-02-13 10:44:51 +01:00
Dan Mihai	958cd8dd9f	Merge pull request #10613 from 3u13r/feat/policy/refactor-out-policy-crate-and-network-namespace policy: add policy crate and add network namespace check to policy	2025-02-12 18:28:09 -08:00
Alex Lyn	e1b780492f	Merge pull request #10839 from RuoqingHe/appease-clippy dragonball: Appease clippy	2025-02-13 09:12:15 +08:00
Zvonko Kaiser	acd2a933da	Merge pull request #10864 from fidencio/topic/packaging-move-to-ubuntu-22-04 packaging: Move builds to Ubuntu 22.04	2025-02-12 14:29:41 -05:00
Wainer Moschetta	62e239ceaa	Merge pull request #10810 from arvindskumar99/nydus_perm_install Skipping SNP and SEV from deploying and deleting Snapshotter	2025-02-12 14:38:56 -03:00
stevenhorsman	fd7bcd88d0	ci: k8s: Bump kcli image version When trying to deploy nydus on kcli locally we get the following failure: ``` root@sh-kata-ci1:~# kubectl get pods -n nydus-system NAMESPACE NAME READY STATUS RESTARTS AGE nydus-system nydus-snapshotter-5kdqs 0/1 CrashLoopBackOff 4 (84s ago) 7m29s ``` Digging into this I found that the nydus-snapshotter service is failing with: ``` ubuntu@kata-k8s-worker-0:~$ journalctl -u nydus-snapshotter.service -- Logs begin at Wed 2025-02-12 15:06:08 UTC, end at Wed 2025-02-12 15:20:27 UTC. -- Feb 12 15:10:39 kata-k8s-worker-0 systemd[1]: Started nydus snapshotter. Feb 12 15:10:39 kata-k8s-worker-0 containerd-nydus-grpc[6349]: /usr/local/bin/containerd-nydus-grpc: /lib/x86_64-linux-gnu/libc.so.6: version `GLIBC_2.32' not found (required b> Feb 12 15:10:39 kata-k8s-worker-0 containerd-nydus-grpc[6349]: /usr/local/bin/containerd-nydus-grpc: /lib/x86_64-linux-gnu/libc.so.6: version `GLIBC_2.34' not found (required b> Feb 12 15:10:39 kata-k8s-worker-0 systemd[1]: nydus-snapshotter.service: Main process exited, code=exited, status=1/FAILURE ``` I think this is because 20.04 has version: ``` ubuntu@kata-k8s-worker-0:~$ ldd --version ldd (Ubuntu GLIBC 2.31-0ubuntu9.16) 2.31 ``` so it's too old for the nydus snapshotter. Also 20.04 is EoL soon, so bumping is better. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-02-12 15:38:18 +00:00
Zvonko Kaiser	fbc8454d3d	Merge pull request #10866 from zvonkok/enable-cc-gpu-build gpu: enable confidential initrd build	2025-02-12 09:26:08 -05:00
Ruoqing He	897e2e2b6e	dragonball: Appease clippy Some problem hidden in `dbs` crates are revealed after making these crates workspace components, fix according to `cargo clippy` suggests. Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>	2025-02-12 19:44:34 +08:00
Leonard Cohnen	ec0af6fbda	policy: check the linux network namespace Peer pods have a linux namespace of type network. We want to make sure that all container in the same pod use the same namespace. Therefore, we add the first namespace path to the state and check all other requests against that. This commit also adds the corresponding integration test in the policy crate showcasing the benefit of having rust integration tests for the policy. Signed-off-by: Leonard Cohnen <lc@edgeless.systems>	2025-02-12 10:41:15 +01:00
Leonard Cohnen	7aca7a6671	policy: use agent policy crate in genpolicy test The generated rego policies for `CreateContainerRequest` are stateful and that state is handled in the policy crate. We use this policy crate in the genpolicy integration test to be able to test if those state changes are handled correctly without spinning up an agent or even a cluster. This also allows to easily test on a e.g., CreateContainerRequest level instead of relying on changing the yaml that is applied to a cluster. Signed-off-by: Leonard Cohnen <lc@edgeless.systems>	2025-02-12 10:41:15 +01:00
Leonard Cohnen	d03738a757	genpolicy: expose create as library This commit allows to programmatically invoke genpolicy. This allows for other rust tools that don't want to consume genpolicy as binary to generate policies. One such use-case is the policy integration test implemented in the following commits. Signed-off-by: Leonard Cohnen <lc@edgeless.systems>	2025-02-12 10:41:15 +01:00
Leonard Cohnen	cf54a1b0e1	agent: move policy module into separate crate The policy module augments the policy generated with genpolicy by keeping and providing state to each invocation. Therefore, it is not sufficient anymore to test the passing of requests in the genpolicy crate. Since in Rust, integration tests cannot call functions that are not exposed publicly, this commit factors out the policy module of the agent into its own crate and exposes the necessary functions to be consumed by the agent and an integration tests. The integration test itself is implemented in the following commits. Signed-off-by: Leonard Cohnen <lc@edgeless.systems>	2025-02-12 10:41:15 +01:00
Fupan Li	ec7b2aa441	Merge pull request #10850 from teawater/direct Clean the config block_device_cache_direct of runtime-rs	2025-02-12 09:45:37 +08:00
Zvonko Kaiser	5431841a80	Merge pull request #10814 from kata-containers/shellcheck-gha gha: Add shellcheck	2025-02-11 18:30:41 -05:00
Zvonko Kaiser	b231a795d7	gha: Add shellcheck We need to start to fix our scripts. Lets run shellcheck and see what needs to be reworked. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-02-11 16:00:34 +00:00
Zvonko Kaiser	befb2a7c33	gpu: Confidential Initrd Start building the confidential initrd Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-02-11 15:41:36 +00:00
Fupan Li	5b809ca440	CI: a workaround for containerd v2.x e2e test the latest containerd had an issue for its e2e test, thus we should do the following fix to workaround this issue. For much info about this issue, please see: https://github.com/containerd/containerd/pull/11240 Once this pr was merged and release new version, we can remove this workaround. Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>	2025-02-11 15:21:53 +01:00

1 2 3 4 5 ...

15333 Commits