kata-containers

mirror of https://github.com/kata-containers/kata-containers.git synced 2025-09-02 01:16:27 +00:00

Author	SHA1	Message	Date
Markus Rudy	a1baaf6fe2	genpolicy: ignore groups with same name as user containerd does not automatically add groups to the list of additional GIDs when the groups have the same name as the user: https://github.com/containerd/containerd/blob/f482992/pkg/oci/spec_opts.go#L852-L854 This is a bug and should be corrected, but it has been present since at least 1.6.0 and thus affects almost all containerd deployments in existence. Thus, we adopt the same behavior and ignore groups with the same name as the user when calculating additional GIDs. Signed-off-by: Markus Rudy <mr@edgeless.systems>	2025-06-04 10:29:49 +02:00
Xuewei Niu	77ca2fe88b	runtime-rs: Reduce the number of duplicate log entries being printed When connecting to guest through vsock, a log is printed for each failure. The failure comes from two main reasons: (1) the guest is not ready or (2) some real errors happen. Printing logs for the first case leads to log clutter, and your logs will like this: ``` Feb 07 02:47:24 ubuntu containerd[520]: {"msg":"connect uds \"/run/kata/... Feb 07 02:47:24 ubuntu containerd[520]: {"msg":"connect uds \"/run/kata/... Feb 07 02:47:24 ubuntu containerd[520]: {"msg":"connect uds \"/run/kata/... Feb 07 02:47:24 ubuntu containerd[520]: {"msg":"connect uds \"/run/kata/... Feb 07 02:47:24 ubuntu containerd[520]: {"msg":"connect uds \"/run/kata/... ``` To avoid this, the sock implmentations save the last error and return it after all retries are exhausted. Users are able to check all errors by setting the log level to trace. Reorganize the log format to "{sock type}: {message}" to make it clearer. Apart from that, errors return by the socks use `self`, instead of `ConnectConfig`, since the `ConnectConfig` doesn't provide any useful information. Disable infinite loop for the log forwarder. There is retry logic in the sock implmentations. We can consider the agent-log unavailable if `sock.connect()` encounters an error. Fixes: #10847 Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2025-06-04 12:25:32 +08:00
Xuewei Niu	3f8dd821e6	dragonball: Remove a useless dead_code attribute The vhost-user-fs has been added to Dragonball, so we can remove `update_memory`'s dead_code attribute. Fixes: #8691 Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2025-06-04 11:34:16 +08:00
Ruoqing He	77e68b164e	agent: Upgrade `ttrpc-codegen` to 0.5.0 Propagate `ttrpc-codegen` upgrade from `libs/protocols` to `agent`. Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>	2025-06-04 01:16:46 +00:00
Ryan Savino	1e686dbca7	agent: Remove casting and fix Arc declaration Removed unnecessary dynamic dispatch for services. Properly dereferenced service Box values and stored in Arc. Co-authored-by: Ruoqing He <heruoqing@iscas.ac.cn> Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn> Signed-Off-By: Ryan Savino <ryan.savino@amd.com>	2025-06-04 01:16:46 +00:00
Ruoqing He	0471f01074	libs: Bump `ttrpc-codegen` and `protobuf` Previous version of `ttrpc-codegen` is generating outdated `#![allow(box_pointers)]` which was deprecated. Bump `ttrpc-codegen` from v0.4.2 to v0.5.0 and `protobuf` from vx to v3.7.1 to get rid of this. Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>	2025-06-04 01:16:18 +00:00
Markus Rudy	eeb3d1384b	genpolicy: compare additionalGIDs as sets The additional GIDs are handled by genpolicy as a BTreeSet. This set is then serialized to an ordered JSON array. On the containerd side, the GIDs are added to a list in the order they are discovered in /etc/group, and the main GID of the user is prepended to that list. This means that we don't have any guarantees that the input GIDs will be sorted. Since the order does not matter here, comparing the list of GIDs as sets is close enough. Signed-off-by: Markus Rudy <mr@edgeless.systems>	2025-06-03 20:18:35 +02:00
Aurélien Bombo	8c3f8f8e21	Merge pull request #11339 from kata-containers/sprt/require-agent-ctl ci: Require agent-ctl tests	2025-06-03 11:58:33 -04:00
Steve Horsman	74e47382f8	Merge pull request #11016 from stevenhorsman/dependabot-configuration workflows: Add dependabot config	2025-06-03 15:12:32 +01:00
Steve Horsman	8176eefdac	Merge pull request #10748 from zvonkok/helm-doc doc: Add Helm Chart entry	2025-06-03 14:48:19 +01:00
Markus Rudy	02ad39ddf1	genpolicy: push down warning about missing passwd file The warning used to trigger even if the passwd file was not needed. This commit moves it down to where it actually matters. Signed-off-by: Markus Rudy <mr@edgeless.systems>	2025-06-03 11:19:29 +02:00
Markus Rudy	ec969e4dcd	genpolicy: remove redundant group check https://github.com/kata-containers/kata-containers/pull/11077 established that the GID from the image config is never used for deriving the primary group of the container process. This commit removes the associated logic that derived a GID from a named group. Signed-off-by: Markus Rudy <mr@edgeless.systems>	2025-06-03 10:59:10 +02:00
Zvonko Kaiser	985e965adb	doc: Added Helm Chart README.md We need more and accurate documentation. Let's start by providing an Helm Chart install doc and as a second step remove the kustomize steps. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com> Co-authored-by: Steve Horsman <steven@uk.ibm.com>	2025-06-02 23:26:16 +00:00
Dan Mihai	dc0da567cd	Merge pull request #11340 from microsoft/danmihai1/image-size-alignment image: custom guest rootfs image file size alignment	2025-06-02 14:33:21 -07:00
Dan Mihai	c2c194d860	kata-deploy: smaller guest image file for mariner Align up the mariner Guest image file size to 2M instead of the default 128M alignment. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2025-06-02 16:15:17 +00:00
Dan Mihai	65385a5bf9	image: custom guest rootfs image file size alignment The Guest rootfs image file size is aligned up to 128M boundary, since commmit `2b0d5b2`. This change allows users to use a custom alignment value - e.g., to align up to 2M, users will be able to specify IMAGE_SIZE_ALIGNMENT_MB=2 for image_builder.sh. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2025-06-02 16:15:17 +00:00
Steve Horsman	c575048aa7	Merge pull request #11329 from Xynnn007/fix-initdata-snp Fix \| Support initdata for SNP	2025-06-02 15:24:12 +01:00
stevenhorsman	ae352e7e34	ci: Add dependabot groups - Create groups for commonly seen cargo packages so that rather than getting up to 9 PRs for each rust components, bumps to the same package are grouped together. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-06-02 14:45:31 +01:00
stevenhorsman	a94388cf61	ci: Add dependabot config - Create a dependabot configuration to check for updates to our rust and golang packages each day and our github actions each month Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-06-02 14:45:31 +01:00
Xynnn007	8750eadff2	test: turn SNP on for initdata tests After the last commit, the initdata test on SNP should be ok. Thus we turn on this flag for CI. Fixes #11300 Signed-off-by: Xynnn007 <xynnn@linux.alibaba.com>	2025-06-02 20:33:19 +08:00
Xynnn007	39aa481da1	runtime: fix initdata support for SNP the qemu commandline of SNP should start with `sev-snp-guest`, and then following other parameters separeted by ','. This patch fixes the parameter order. Signed-off-by: Xynnn007 <xynnn@linux.alibaba.com>	2025-06-02 20:33:19 +08:00
Fabiano Fidêncio	57f3cb8b3b	Merge pull request #11344 from fidencio/topic/kernel-add-tuntap-move-memagent-stuff kernel: Add CONFIG_TUN (needed for VPNs) and move mem-agent related configs to common	2025-06-01 21:32:07 +02:00
RuoqingHe	51cc960cdd	Merge pull request #11346 from fidencio/topic/bump-cgroups-rs rust: Update cgroups-rs to its v0.3.5 release	2025-05-31 04:13:05 +02:00
Fabiano Fidêncio	48f8496209	Merge pull request #11327 from Champ-Goblem/agent/increase-limit-nofile agent: increase LimitNOFILE in the systemd service	2025-05-30 21:56:01 +02:00
Fabiano Fidêncio	02c46471fd	rust: Update cgroups-rs to its v0.3.5 release We're switching to using a rev as it may take some time for the package to be updated on crates.io. Signed-off-by: Fabiano Fidêncio <fidencio@northflank.com>	2025-05-30 21:49:50 +02:00
Fabiano Fidêncio	dadbfd42c8	kernel: Move mem-agent configs to the common kernel build There's no benefit on keeping those restricted to the dragonball build, when they can be used with other VMMs as well (as long as they support the mem-agent). Signed-off-by: Fabiano Fidêncio <fidencio@northflank.com>	2025-05-30 21:49:22 +02:00
Champ-Goblem	a37080917d	kernel: Add CONFIG_TUN for VPN services TUN/TAP is a must for VPN related services. Signed-off-by: Champ-Goblem <cameron@northflank.com> Signed-off-by: Fabiano Fidêncio <fidencio@northflank.com>	2025-05-30 21:49:22 +02:00
Fabiano Fidêncio	b8a7350a3d	Merge pull request #11324 from Champ-Goblem/runtime/fix-cgroup-deletion runtime: fix cgroupv2 deletion when sandbox_cgroup_only=false	2025-05-30 21:23:07 +02:00
Champ-Goblem	ef642fe890	runtime: fix cgroupv2 deletion when sandbox_cgroup_only=false Currently, when a new sandbox resource controller is created with cgroupsv2 and sandbox_cgroup_only is disabled, the cgroup management falls back to cgroupfs. During deletion, `IsSystemdCgroup` checks if the path contains `:` and tries to delete the cgroup via systemd. However, the cgroup was originally set up via cgroupfs and this process fails with `lstat /sys/fs/cgroup/kubepods.slice/kubepods-besteffort.slice/....scope: no such file or directory`. This patch updates the deletion logic to take in to account the sandbox_cgroup_only=false option and in this case uses the cgroupfs delete. Fixes: #11036 Signed-off-by: Champ-Goblem <cameron@northflank.com>	2025-05-30 17:51:31 +02:00
Champ-Goblem	f4007e5dc1	agent: increase LimitNOFILE in the systemd service Increase the NOFILE limit in the systemd service, this helps with running databases in the Kata runtime. Signed-off-by: Champ-Goblem <cameron@northflank.com>	2025-05-30 17:49:29 +02:00
Fabiano Fidêncio	3f5dc87284	Merge pull request #11333 from stevenhorsman/csi-driver-permissions-fix workflow: add packages: write to csi-driver publish	2025-05-30 17:45:47 +02:00
Zvonko Kaiser	4586511c01	doc: Add Helm Chart entry Since 3.12 we're shipping the helm-chart per default with each release. Update the documentation to use helm rather then the kata-deploy manifests. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-05-30 14:45:01 +00:00
Aurélien Bombo	c03b38c7e3	ci: Require agent-ctl tests This adds `run-kata-agent-apis` to the list of required tests. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2025-05-29 14:09:42 -05:00
stevenhorsman	586d9adfe5	workflow: add packages: write to csi-driver publish This one was missed in the earlier PR Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-05-29 15:57:07 +01:00
Steve Horsman	3da213a8c8	Merge pull request #11326 from kata-containers/top-level-workflow-permissions Top level workflow permissions	2025-05-29 10:03:06 +01:00
stevenhorsman	c34416f53a	workflows: Add explicit permissions where needed We have a number of jobs that either need,or nest workflows that need gh permissions, such as for pushing to ghcr, or doing attest build provenance. This means they need write permissions on things like `packages`, `id-token` and `attestations`, so we need to set these permissions at the job-level (along with `contents: read`), so they are not restricted by our safe defaults. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-05-28 19:34:28 +01:00
stevenhorsman	088e97075c	workflow: Add top-level permissions Set: ``` permissions: contents: read ``` as the default top-level permissions explicitly to conform to recommended security practices e.g. https://github.com/ossf/scorecard/blob/main/docs/checks.md#token-permissions	2025-05-28 19:34:28 +01:00
Dan Mihai	353d0822fd	Merge pull request #11314 from katexochen/p/svc-name-regex genpolicy: fix svc_name regex	2025-05-28 10:08:38 -07:00
Steve Horsman	7a9d919e3e	Merge pull request #11322 from kata-containers/workflow-permissions workflows: Add explicit permissions for attestation	2025-05-28 17:28:22 +01:00
Steve Horsman	2667d4a345	Merge pull request #11323 from stevenhorsman/gatekeeper-workflow-permissions-ii workflow: Update gatekeeper permissions	2025-05-28 17:05:24 +01:00
stevenhorsman	4d4fb86d34	workflow: Update gatekeeper permissions I shortsightedly forgot that gatekeeper would need to read more than just the commit content in it's python scripts, so add read permissions to actions issues which it uses in it's processing Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-05-28 15:58:27 +01:00
Steve Horsman	fed63e0801	Merge pull request #11319 from stevenhorsman/remove-old-workflows workflows: Delete workflows	2025-05-28 15:38:19 +01:00
Steve Horsman	49f86aaa0d	Merge pull request #11320 from stevenhorsman/gatekeeper-workflow-permissions workflows: gatekeeper: Update permissions	2025-05-28 15:38:06 +01:00
stevenhorsman	3ff602c1e8	workflows: Add explicit permissions for attestation We have a number of jobs that nest the build-static-tarball workflows later on. Due to these doing attest build provenance, and pushing to ghcr.io, t hey need write permissions on `packages`, `id-token` and `attestations`, so we need to set these permissions on the top-level jobs (along with `contents: read`), so they are not blocked. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-05-28 12:56:52 +01:00
stevenhorsman	2f0dc2ae24	workflows: gatekeeper: Update permissions Restrict the permissions of gatekeeper flow to read contents only for better security Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-05-28 09:57:19 +01:00
stevenhorsman	f900b0b776	workflows: Delete workflows Some legacy workflows require write access to github which is a security weakness and don't provide much value, so lets remove them. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-05-28 09:45:42 +01:00
Alex Lyn	aab6caa141	Merge pull request #10362 from Apokleos/vfio-hotplug-runtime-rs runtime-rs: add support hotplugging vfio device for qemu-rs	2025-05-28 13:21:58 +08:00
Fabiano Fidêncio	ac934e001e	Merge pull request #11244 from katexochen/p/guest-pull-config runtime: add option to force guest pull	2025-05-27 16:00:09 +02:00
alex.lyn	e69a4d203a	runtime-rs: Increase QMP read timeout to mitigate failures It frequently causes "Resource Temporarily Unavailable (OS Error 11)" with the original 250ms read timeout When passing through devices via VFIO in QEMU. The root cause lies in synchronization timeout windows failing to accommodate inherent delays during critical hardware init phases in kernel space. This commit would increase the timeout to 5000ms which was determined through some tests. While not guaranteeing complete resolution for all hardware combinations, this change significantly reduces timeout failures. Fixes # 10361 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2025-05-27 21:06:57 +08:00
Paul Meyer	c4815eb3ad	runtime: add option to force guest pull This enables guest pull via config, without the need of any external snapshotter. When the config enables runtime.experimental_force_guest_pull, instead of relying on annotations to select the way to share the root FS, we always use guest pull. Co-authored-by: Markus Rudy <mr@edgeless.systems> Signed-off-by: Paul Meyer <katexochen0@gmail.com>	2025-05-27 12:42:00 +02:00

1 2 3 4 5 ...

16123 Commits