kata-containers

mirror of https://github.com/kata-containers/kata-containers.git synced 2026-03-18 10:44:10 +00:00

Author	SHA1	Message	Date
Fabiano Fidêncio	c5b5433866	kernel: Unify nvidia-gpu and nvidia-gpu-confidential Build a single kernel for both nvidia-gpu and nvidia-gpu-confidential, simplifying and reducing code maintenance. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2026-02-09 18:28:23 +01:00
Manuel Huber	d9d1073cf1	gpu: Install packages for devkit Introduce a new function to install additional packages into the devkit flavor. With modprobe, we avoid errors on pod startup related to loading nvidia kernel modules in the NVRC phase. Note, the production flavor gets modprobe from busybox, see its configuration file containing CONFIG_MODPROBE=y. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-02-06 09:58:32 +01:00
Manuel Huber	a3c4e0b64f	rootfs: Introduce kernelinit dm-verity mode This change introduces the kernelinit dm-verity mode, allowing initramfs-less dm-verity enforcement against the rootfs image. For this, the change introduces a new variable with dm-verity information. This variable will be picked up by shim configurations in subsequent commits. This will allow the shims to build the kernel command line with dm-verity information based on the existing kernel_parameters configuration knob and a new kernel_verity_params configuration knob. The latter specifically provides the relevant dm-verity information. This new configuration knob avoids merging the verity parameters into the kernel_params field. Avoiding this, no cumbersome escape logic is required as we do not need to pass the dm-mod.create="..." parameter directly in the kernel_parameters, but only relevant dm-verity parameters in semi-structured manner (see above). The only place where the final command line is assembled is in the shims. Further, this is a line easy to comment out for developers to disable dm-verity enforcement (or for CI tasks). This change produces the new kernelinit dm-verity parameters for the NVIDIA runtime handlers, and modifies the format of how these parameters are prepared for all handlers. With this, the parameters are currently no longer provided to the kernel_params configuration knob for any runtime handler. This change alone should thus not be used as dm-verity information will no longer be picked up by the shims. systemd-analyze on the coco-dev handler shows that using the kernelinit mode on a local machine, less time is spent in the kernel phase, slightly speeding up pod start-up. On that machine, the average of 172.5ms was reduced to 141ms (4 measurements, each with a basic pod manifest), i.e., the kernel phase duration is improved by about 18 percent. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2026-02-05 23:04:35 +01:00
Zvonko Kaiser	a59f791bf5	gpu: Move CUDA repo selection to versions.yaml We want to enable local and remote CUDA repository builds. Moving the cuda and tools repo to versions.yaml with a unified build for both types. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2026-01-26 22:19:40 +01:00
Zvonko Kaiser	428cc5d586	gpu: Chroot Cleanup With the newest NVRC we do not need the supported GPUs anymore. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2026-01-17 19:27:24 +01:00
Zvonko Kaiser	adce41c432	gpu: Bump NVRC Version The new NVRC version works for CC and non-CC use cases, no --feature confidential needed anymore. Bump versions.yaml and adjust deployment instructions. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2026-01-15 01:51:10 +00:00
Zvonko Kaiser	ffc8725164	gpu: rootfs update decoupling Remove all the driver build instructions, sicne those are now done in the kernel target. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2026-01-14 20:45:54 +01:00
Manuel Huber	3966864376	gpu: introduce devkit build flag Introduce a new devkit parameter which will produce a rootfs without chisselling. This results in a larger rootfs with various packages and binaries being included, for instance, enabling the use of the debug console. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2025-11-19 15:50:03 +01:00
Manuel Huber	2c9e0f9f4f	gpu: add signed-by to package sources Pin to specific key. CUDA package sources in /etc/apt/sources.list.d already use a specific key. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2025-11-19 15:50:03 +01:00
Zvonko Kaiser	94abe4fc00	osbuilder: nvrc: Consume NVRC release instead of building it Let's ensure that we consume NVRC releases straight from GitHub instead of building the binaries ourselves. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com> Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-10-31 12:10:20 +01:00
Zvonko Kaiser	5ff218823c	gpu: Remove unneeded libraries The libs in question were added when moving to developer.nvidia.com but switching back to ubuntu only based builds they are not needed. Remove them to keep the rootfs as minimal as possible. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-10-29 08:03:36 +01:00
Zvonko Kaiser	6d9b4059f5	gpu: Add libs for CC In the case of CC we need additional libraries in the rootfs. Add them conditionally if type == confidential. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-10-29 08:03:36 +01:00
Zvonko Kaiser	39848e0983	gpu: rootfs fixes Build only from Ubuntu repositories do not mix with developer.nvidia.com Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com> Update tools/osbuilder/rootfs-builder/nvidia/nvidia_chroot.sh Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-10-26 19:36:55 +01:00
Manuel Huber	af34308c83	gpu: remove version suffixes for imex and nscq This change ensures that the NVIDIA package repository for nvidia-imex and libnvidia-nspc is being used as source. The NVIDIA repository does not publish these packages with a -580 version suffix, which made us fall back to the packages from the Ubuntu repository. These two packages were recently updated by Ubuntu to depend on nvidia-kernel-common-580-server (this happened from version 580.82.07-0ubuntu1 to version 580.95.05-0ubuntu1). This conflicts with nvidia-kernel-common-580 which gets installed by nvidia-headless-no-dkms-580-open, thus causing a build failure. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2025-10-21 15:42:51 +02:00
Manuel Huber	4ad8c31b5a	gpu: build nv rootfs with guest pull support While the local-build's folder's Makefile dependencies for the confidential nvidia rootfs targets already declare the pause image and coco-guest-components dependencies, the actual rootfs composition does not contain the pause image bundle and relevant certificates for guest pull. This change ensure the rootfs gets composed with the relevant files. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2025-10-16 09:20:49 -07:00
Manuel Huber	8221361915	gpu: Use variable to differentiate rootfs variants With this change we namespace the stage one rootfs tarball name and use the same name across all uses. This will help overcome several subtle local build problems. Signed-off-by: Manuel Huber <manuelh@nvidia.com>	2025-10-15 12:39:44 +02:00
Zvonko Kaiser	b00013c717	kernel: Add KBUILD_SIGN_PIN pass through This is needed to the kernel setup picks up the correct config values from our fragments directories. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-10-10 15:45:34 -04:00
Zvonko Kaiser	37bd5e3c9d	gpu: Add kernel CONFIG check We need to make sure that the kernel we're using has the correct configs set, otherwise the module signing will not work. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-10-10 15:45:34 -04:00
Zvonko Kaiser	91739d4425	gpu: PPCIE support DGX like systems For DGX like systems we need additional binaries and libraries, enable the Kata AND CoCo use-case. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com> Update tools/osbuilder/rootfs-builder/nvidia/nvidia_rootfs.sh Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-10-09 00:00:12 +00:00
Zvonko Kaiser	7061f64db5	gpu: Fix confidential build NVRC introduced the confidential feature flag and we haven't updated the rootfs build to accomodate. If rootfs_type==confidential user --feature=confidential Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-10-08 10:01:27 +02:00
Zvonko Kaiser	2260f66339	gpu: Some fixes regarding the rootfs v580 With the 580 driver version we need new dependencies in the rootfs. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-10-08 10:01:27 +02:00
Zvonko Kaiser	2693daf503	gpu: Install dcgm export from the CUDA repo Do not use the repo to install the exporter, we rely on the version tested with Ubuntu <version> Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-10-02 18:05:13 +02:00
Zvonko Kaiser	56c6512781	gpu: Bump to noble and rearrange repos Moving the CUDA repo to the top for all essential packages and adding a repo priority favouring NVIDIA based repos. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-10-02 18:05:13 +02:00
Zvonko Kaiser	3743eb4cea	gpu: Add ligcc for RUST libc=gnul builds Since we cannot build all components with libc=musl and static RUSTFLAG we still need to ship libcc for AA or other guest components. Without this change the guest components do not work and we see /usr/local/bin/attestation-agent: error while loading shared libraries: libgcc_s.so.1: cannot open shared object file: No such file or directory Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-09-26 15:08:58 -04:00
Zvonko Kaiser	e6f12d8f86	gpu: Add latest driver per default Lets make sure that we use latest driver for CI and release. There was a sort step missing. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-09-20 23:50:35 +00:00
Fabiano Fidêncio	ad240a39e6	kata-deploy: tools: tests: Use zstd instead of xz Although the compress ratio is not as optimal as using xz, it's way faster to compress / uncompress, and it's "good enough". This change is not small, but it's still self-contained, and has to get in at once, in order to help bisects in the future. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2025-08-21 19:53:55 +02:00
Zvonko Kaiser	8422411d91	gpu: Add coco guest components The second stage needs to consider the coco guest components Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-07-31 17:11:21 +00:00
Zvonko Kaiser	da17b06d28	gpu: Pin toolkit version New versions have incompatibilites, pin toolkit to a working version Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-07-14 22:07:21 +00:00
Zvonko Kaiser	97a4a1574e	gpu: Remove gpu-admin-tools NVRC got a new feature reading the CC mode directly from register Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-07-14 21:59:31 +00:00
Zvonko Kaiser	c3b2d69452	gpu: NVRC static build We had the proper config.toml configuration for static builds but were building the glibc target and not the musl target. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-07-03 15:31:00 +00:00
Champ-Goblem	d6c45027f5	nvidia-rootfs: only copy `kata-opa` if `AGENT_POLICY` is enabled In the nvidia rootfs build, only copy in `kata-opa` if `AGENT_POLICY` is enabled. This fixes builds when `AGENT_POLICY` is disabled and opa is not built. Signed-off-by: Champ-Goblem <cameron@northflank.com>	2025-06-11 11:25:10 +02:00
Zvonko Kaiser	445cad7754	gpu: Set the ARCH explicilty for driver builds Kernel Makefiles changed how to deduce the right arch lets set it explicilty to enable arm and amd builds. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-05-01 17:13:20 +00:00
Zvonko Kaiser	2f28be3ad9	gpu: Update creation permissions We need to make sure the device files are created correctly in the rootfs otherwise kata-agent will apply permission 0o000. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-04-14 21:02:34 +00:00
Zvonko Kaiser	eb2f75ee61	gpu: fix init symlinks With the recent changes we need to make sure NVRC is symlinked for init and sbin/init Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-03-03 17:21:59 +00:00
Zvonko Kaiser	94579517d4	shellcheck: Update nvidia_rootfs.sh With the new rules we need more updates. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-02-28 16:36:05 +00:00
Zvonko Kaiser	af1d6c2407	shecllcheck: Update nvidia_chroot.sh Make shellcheck happy with the new rules new updates needed Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-02-28 16:27:51 +00:00
Zvonko Kaiser	5ab3192c51	gpu: Update nvidia_rootfs.sh We need to handle KBUILD_SIGN_PIN so that the kbuild can decrypte the signing key Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-02-28 01:31:35 +00:00
Zvonko Kaiser	39d3b7fb90	gpu: Update NVIDIA chroot script We need to place the signing key and cert at the right place and hide the KBUILD_SIGN_PIN from echo'ing or xtrace Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-02-28 01:31:35 +00:00
Zvonko Kaiser	eeacd8fd74	gpu: Adapt rootfs build for multi-arch Add aarch64 and x86_64 handling. Especially build the Rust dependency with the correct rust musl target. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-02-04 16:44:21 +00:00
Zvonko Kaiser	cd7001612a	gpu: rootfs adjust for AGENT_INIT=no Since we're defaulting to AGENT_INIT=no for all the initrd/images adapt the NV build to properly get kata-agent installed. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-01-27 17:56:21 +00:00
Zvonko Kaiser	98e0dc1676	gpu: Add set -u to scripts Make the scripts more robust by failing on unset varaibles Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-01-27 17:56:21 +00:00
Zvonko Kaiser	f153229865	gpu: Add driver version selection Besides latest and lts options add an option to specify the exact driver version. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-01-27 17:56:21 +00:00
Zvonko Kaiser	f0bd83b073	gpu: Fix rootfs build The pyinstaller is located per default under /usr/local/bin some prior versions were installing it to ${HOME}. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-01-15 20:37:51 +00:00
Dan Mihai	0f522c09d9	rootfs: reduced console output by default Use "set -x" only when the user specified DEBUG=1. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2025-01-13 19:34:05 +00:00
Zvonko Kaiser	0debf77770	gpu: NVIDIA gpu initrd/image build With each release make sure we ship a GPU enabled rootfs/initrd Fixes: #6554 Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2024-11-21 18:57:23 +00:00

45 Commits