mirror of https://github.com/kata-containers/kata-containers.git synced 2026-07-02 07:02:16 +00:00

Files

Fabiano Fidêncio f763e9cca9 tests: Add NUMA topology / GPU placement tests to the NV CIs

Add k8s-nvidia-numa.bats with five tests that validate NUMA behaviour
on hosts where NUMA is configured by default (qemu-nvidia-gpu,
qemu-nvidia-gpu-snp, qemu-nvidia-gpu-tdx):

1. Multi-node sandbox (large workload spanning all host NUMA nodes):
   - Guest NUMA node count matches host
   - Guest vCPU distribution is balanced across nodes (max-min <= 1)
   - Guest memory is distributed across NUMA nodes
   - Host-side vCPU pinning is balanced across NUMA nodes

2. Right-sized single-node sandbox (small workload fitting one node):
   - Guest collapses to a single NUMA node
   - All host vCPU threads pinned to that one NUMA node

3. GPU passthrough with VFIO, multi-node:
   - Guest NUMA topology is balanced (same as test 1)
   - Guest GPU's NUMA node matches the host GPU's NUMA node
     (resolved via the vfio-pci,host=<BDF> from the QEMU command
     line and /sys/bus/pci/devices/<BDF>/numa_node)
   - QEMU command line contains pxb-pcie and policy=bind
   - Host vCPU pinning is balanced

4. GPU passthrough with VFIO, right-sized single-node: small workload
   plus GPU that fits in a single host NUMA node:
   - Guest collapses to a single NUMA node
   - The chosen node is the GPU's host NUMA node, not just any node
     that fits — verified by matching host-nodes= in the memory
     backend and pxb-pcie numa_node= against the GPU's host node
   - Guest GPU reports the same NUMA node as the host GPU

5. Explicit numa_mapping in the runtime TOML (QEMU-only):
   - Drops a config.d/ fragment that sets numa_mapping = ["1"], so the
     auto-derive + right-sizing path is bypassed entirely
   - Guest sees exactly 1 NUMA node
   - QEMU memory backend is bound to host node 1 (host-nodes=1,
     policy=bind), not host node 0
   - Host-side vCPU threads land on host node 1
   - Drop-in is removed on teardown so subsequent tests are unaffected

Guest-side checks use a dedicated container image
(quay.io/kata-containers/numa) that reads sysfs and prints results to
stdout — no kubectl exec or CoCo policy overrides needed.

Host-side checks (crictl, pgrep, taskset) run directly on the host
via sudo; a standalone numa-pinning-check.sh script handles the vCPU
thread affinity inspection.  The config.d/ helpers used by test 5 are
runtime-agnostic (probe Go vs runtime-rs layout on disk) but the test
is gated to qemu-* shims since runtime-rs does not yet implement
NUMA.

Skips cleanly on single-NUMA hosts, unsupported hypervisors, or when
no nvidia.com/pgpu resources are available (GPU tests only).

Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>
Assisted-by: Cursor <cursoragent@cursor.com>

2026-05-24 22:00:46 +02:00

data

tests: Make editorconfig-checker happy

2026-02-10 21:58:28 +01:00

images

…

containerd-kata.md

docs: Update containerd-kata.md with clear settings

2026-03-29 19:17:03 +02:00

how-to-hotplug-memory-arm64.md

…

how-to-import-kata-logs-with-fluentd.md

docs: Fix broken links

2023-10-26 10:17:01 -07:00

how-to-load-kernel-modules-with-kata.md

docs: Update kernel modules loading document

2026-04-22 16:29:46 +08:00

how-to-pull-images-in-guest-with-kata.md

tests: Make editorconfig-checker happy

2026-02-10 21:58:28 +01:00

how-to-run-docker-with-kata.md

…

how-to-run-kata-containers-with-kinds-of-Block-Volumes.md

docs: Fix volume type and fs type

2026-03-29 19:17:03 +02:00

how-to-run-kata-containers-with-SE-VMs.md

kata-deploy: Remove kustomize yamls, rely on helm-chart only

2025-10-08 16:54:19 +02:00

how-to-run-kata-containers-with-SNP-VMs.md

docs: switch to blockfile snapshotter for SEV-SNP in runtime-rs

2026-03-29 19:17:03 +02:00

how-to-run-rootless-vmm.md

tests: Make editorconfig-checker happy

2026-02-10 21:58:28 +01:00

how-to-set-prometheus-in-k8s.md

docs: Rename run-kata-with-k8s with adding crio

2026-03-29 19:17:03 +02:00

how-to-set-sandbox-config-kata.md

Merge commit from fork

2026-05-19 08:22:12 +02:00

how-to-setup-swap-devices-in-guest-kernel.md

docs: Add how-to-use-memory-agent.md to howto

2025-04-02 17:45:59 +08:00

how-to-use-erofs-build-rootfs.md

docs: add guide for building rootfs with EROFS

2023-02-09 20:07:51 +08:00

how-to-use-erofs-snapshotter-with-kata.md

docs: Add how-to guide for using fsmerged EROFS rootfs with Kata

2026-04-19 13:24:31 +02:00

how-to-use-k8s-with-containerd-and-kata.md

tests: k8s: set CreateContainerRequest (on free runners) timeout to 600s

2026-02-21 08:44:47 +01:00

how-to-use-k8s-with-crio-and-kata.md

docs: Remove containerd settings from crio dedicated document

2026-03-29 19:17:03 +02:00

how-to-use-kata-containers-with-firecracker.md

docs: Update devmapper containerd plugin name

2025-11-05 18:42:29 +01:00

how-to-use-kata-with-docker.md

docs: docker: Update docs to mention runtime-rs and what's tested

2026-04-28 10:22:21 +02:00

how-to-use-memory-agent.md

docs: Spelling updates

2026-03-19 10:22:54 +00:00

how-to-use-numa-with-kata.md

tests: Add NUMA topology / GPU placement tests to the NV CIs

2026-05-24 22:00:46 +02:00

how-to-use-passthroughfd-io-within-runtime-rs.md

docs: Add document for how-to-use passthroughfd-IO within runtime-rs

2026-03-29 19:17:03 +02:00

how-to-use-seccomp-with-runtime-rs.md

docs: add document for seccomp

2025-10-09 13:25:17 +08:00

how-to-use-sysctls-with-kata.md

tests: Make editorconfig-checker happy

2026-02-10 21:58:28 +01:00

how-to-use-template-in-runtime-rs.md

tests: Make editorconfig-checker happy

2026-02-10 21:58:28 +01:00

how-to-use-the-kata-agent-policy.md

docs: require user/group/fsGroup/supplementalGroups

2026-03-02 23:48:36 +01:00

how-to-use-virtio-fs-nydus-with-kata.md

doc: Update crictl pod-config

2023-10-02 14:53:46 +01:00

how-to-use-virtio-fs-with-kata.md

kata-deploy: Remove kustomize yamls, rely on helm-chart only

2025-10-08 16:54:19 +02:00

how-to-use-virtio-mem-with-kata.md

doc: Fix spelling

2023-10-03 10:17:38 +01:00

offline_cpu.sh

docs: Fix shellcheck issues in offline_cpu.sh

2026-04-24 08:14:08 +02:00

privileged.md

docs: Rename run-kata-with-k8s with adding crio

2026-03-29 19:17:03 +02:00

README.md

docs: Add NUMA support guide for Kata Containers with QEMU

2026-05-24 22:00:46 +02:00

run-kata-with-crictl.md

docs: Spelling updates

2026-03-19 10:22:54 +00:00

service-mesh.md

tests: Make editorconfig-checker happy

2026-02-10 21:58:28 +01:00

what-is-vm-cache-and-how-do-I-use-it.md

tests: Make editorconfig-checker happy

2026-02-10 21:58:28 +01:00

what-is-vm-templating-and-how-do-I-use-it.md

tests: Make editorconfig-checker happy

2026-02-10 21:58:28 +01:00

README.md

Howto Guides

Kubernetes Integration

Hypervisors Integration

Currently supported hypervisors with Kata Containers include:

qemu
cloud-hypervisor
firecracker

In the case of firecracker the use of a block device snapshotter is needed for the VM rootfs. Refer to the following guide for additional configuration steps:
- Setup Kata containers with firecracker

Confidential Containers Policy

How to auto-generate policy

README.md

Howto Guides

Kubernetes Integration

Hypervisors Integration

Confidential Containers Policy

Advanced Topics