kata-containers

mirror of https://github.com/kata-containers/kata-containers.git synced 2025-08-17 23:47:07 +00:00

Author	SHA1	Message	Date
Zvonko Kaiser	c6c20ac253	docs: Format the threat-model to 80 chars Truncate long lines to reasonable 80 characters Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2024-05-28 07:39:26 +00:00
Zvonko Kaiser	d4832b3b74	vfio: Fix hotpunplug We need to remove the device from the tracking map, a container restart will increment the bus index and we will get out of root-ports and crash the machine. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2024-05-28 07:37:30 +00:00
Zvonko Kaiser	a7931115a0	Merge pull request #8861 from zvonkok/config-pcie-root-switch-port gpu: reintroduce pcie_root_port and add pcie_switch_port	2024-05-27 13:17:57 +02:00
Fabiano Fidêncio	3276bb52b6	Merge pull request #9721 from fidencio/topic/ci-kata-deploy-improvements-and-fixes kata-deploy / kata-cleanup / ci: Fixes and improvements to kata-deploy / kata-cleanup and its usage in the CI	2024-05-27 12:29:40 +02:00
Zvonko Kaiser	4c93bb2d61	qemu: Add CDI device handling for any container type We need special handling for pod_sandbox, pod_container and single_container how and when to inject CDI devices Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2024-05-27 10:13:01 +00:00
Zvonko Kaiser	c7b41361b2	gpu: reintroduce pcie_root_port and add pcie_switch_port In Kubernetes we still do not have proper VM sizing at sandbox creation level. This KEP tries to mitigates that: kubernetes/enhancements#4113 but this can take some time until Kube and containerd or other runtimes have those changes rolled out. Before we used a static config of VFIO ports, and we introduced CDI support which needs a patched contianerd. We want to eliminate the patched continerd in the GPU case as well. Fixes: #8860 Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2024-05-27 10:13:01 +00:00
Fupan Li	6f6a164451	Merge pull request #9268 from zvonkok/kata-agent-createcontainer kata-agent: CreateContainer Hook	2024-05-27 16:36:22 +08:00
Fabiano Fidêncio	e81e8a4527	tests: kata-deploy: Adjust timeout 10 minutes is waay too long. Let's give it 4 minutes only. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-05-27 06:23:00 +02:00
Fabiano Fidêncio	fba5793c0d	tests: kata-deploy: Run the tests from "${repo_root_dir}" Let's see if it helps with issues like: ``` error: must build at directory: not a valid directory: evalsymlink failure on '"/home/runner/actions-runner/_work/kata-containers/kata-containers/tests/functional/kata-deploy/../../..//tools/packaging/kata-deploy/kata-cleanup/overlays/k0s"' : lstat /home/runner/actions-runner/_work/kata-containers/kata-containers/tests/functional/kata-deploy/": no such file or directory ``` Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-05-27 06:23:00 +02:00
Fabiano Fidêncio	8a8a7ea0e5	tests: kata-deploy: Show more logs in the setup() This will also help us to better understand possible failures with the CI. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-05-27 05:05:06 +02:00
Fabiano Fidêncio	47d9589e9b	tests: kata-deploy: Show output of passing tests This will help us to debug failures and compare passing and failures outputs. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-05-27 05:05:06 +02:00
Fabiano Fidêncio	dbd0d4a090	gha: Only do preventive cleanups for baremetal This takes a few minutes that could be saved, so let's avoid doing this on all the platforms, but simply do this when it's needed (the baremetal use case). Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-05-27 05:05:06 +02:00
Fabiano Fidêncio	ee2ef0641c	tests: k8s: Allow passing "all" to run all the tests Currently only "baremetal" runs all the tests, but we could easily run "all" locally or using the github provided runners, even when not using a "baremetal" system. The reason I'd like to have a differentiation between "all" and "baremetal" is because "baremetal" may require some cleanup, which "all" can simply skip if testing against a fresh created VM. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-05-27 05:05:06 +02:00
Fabiano Fidêncio	556227cb51	tests: Add the possibility to deploy k0s / rke2 For now we've only exposed the option to deploy kata-deploy for k3s and vanilla kubernetes when using containerd. However, I do need to also deploy k0s and rke2 for an internal CI, and having those exposed here do not hurt, and allow us to easily expand the CI at any time in the future. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-05-27 05:05:06 +02:00
Fabiano Fidêncio	e3c2f0b0f1	kata-cleanup: Add k0s kustomization k0s was added to kata-deploy, but it's kata-cleanup counterpart was never added. Let's fix it. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-05-27 05:05:06 +02:00
Fabiano Fidêncio	f15d40f8fb	kata-deploy: Fix k0s deployment k0s deployment has been broken since we moved to using `tomlq` in our scripts. The reason is that before using `tomlq` our script would, involuntarily, end up creating the file. Now, in order to fix the situation, we need to explicitly create the file and let `tomlq` add the needed content. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-05-27 05:05:06 +02:00
Alex Lyn	713c929a64	Merge pull request #9656 from pmores/document-qemu-rs-conventions runtime-rs: document architecture & implementation conventions in qem…	2024-05-27 10:38:58 +08:00
Xuewei Niu	bb7a1c56e9	Merge pull request #9693 from sidneychang/9690/Adjust-indentation	2024-05-27 00:20:34 +08:00
Alex Lyn	55dbf6121a	Merge pull request #9604 from Apokleos/qmp-cmdline01 runtime-rs: add QMP support for Qemu(part I)	2024-05-26 20:22:59 +08:00
Alex Lyn	028b10ce7a	Merge pull request #9687 from l8huang/vfio-pci-gk agent: collect PCI address mapping for both vfio-pci-gk and vfio-pci device	2024-05-26 17:48:25 +08:00
Steve Horsman	b89c3e35dd	Merge pull request #9583 from cncal/update_check_error_message runtime: make kata-runtime check error more understandable when /dev/kvm doesn't exist	2024-05-24 17:49:43 +01:00
Alex Lyn	41fb7aeb89	runtime-rs: add QMP params suppport in cmdline Fixes: #9603 Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2024-05-24 22:16:24 +08:00
Alex Lyn	7ed6c6896b	runtime-rs: add an option dbg_monitor_socket for HMP support This option allows to add a debug monitor socket when `enable_debug = true` to control QEMU within debugging case. Fixes: #9603 Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2024-05-24 22:16:17 +08:00
Lei Huang	3624573b12	agent: collect PCI address mapping for both vfio-pci-gk and vfio-pci device The `update_env_pci()` function need the PCI address mapping to translate the host PCI address to guest PCI address in below environment variables: - PCIDEVICE_<prefix>_<resource-name>_INFO - PCIDEVICE_<prefix>_<resource-name> So collect PCI address mapping for both vfio-pci-gk and vfio-pci devices. Fixes #9614 Signed-off-by: Lei Huang <leih@nvidia.com>	2024-05-23 21:20:01 -07:00
Fupan Li	d73876252e	Merge pull request #9690 from justxuewei/agent-timeout runtime-rs: Remove obsoleted dial_timeout config	2024-05-24 10:31:12 +08:00
Zvonko Kaiser	3affd83e14	Merge pull request #9605 from l8huang/skip-env kata-agent: update env PCIDEVICE_<prefix>_<resource-name>_INFO	2024-05-23 18:45:00 +02:00
Fabiano Fidêncio	44d6cb7791	Merge pull request #9698 from wainersm/k8s_tests_disable_fail_fast tests/k8s: disable "fail-fast" behavior by default	2024-05-23 18:28:00 +02:00
Fabiano Fidêncio	d83cf39ba1	Merge pull request #9680 from kata-containers/dependabot/go_modules/src/runtime/go_modules-5e29427af7 build(deps): bump golang.org/x/net from 0.24.0 to 0.25.0 in /src/runtime in the go_modules group across 1 directory	2024-05-23 12:55:29 +02:00
Fabiano Fidêncio	d9ee950d8f	Merge pull request #9696 from wainersm/skip_custom_dns_test tests/k8s: skip custom DNS tests on confidential jobs	2024-05-22 23:57:21 +02:00
GabyCT	e08ad8d1b7	Merge pull request #9686 from GabyCT/topic/fixbootclh metrics: Fix minvalue for boot time	2024-05-22 15:46:50 -06:00
Wainer dos Santos Moschetta	76735df427	tests/k8s: disable "fail-fast" behavior by default The k8s test suite halts on the first failure, i.e., failing-fast. This isn't the behavior that we used to see when running tests on Jenkins and it seems that running the entire test suite is still the most productive way. So this disable fail-fast by default. However, if you still wish to run on fail-fast mode then just export K8S_TEST_FAIL_FAST=yes in your environment. Fixes: #9697 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-05-22 18:27:44 -03:00
Fabiano Fidêncio	8eb061cd5b	Merge pull request #9681 from GabyCT/topic/etdx gha: Enable install kbs and coco components for TDX, but still skip the CDH test	2024-05-22 23:18:42 +02:00
Wainer dos Santos Moschetta	43766cdb96	tests/k8s: skip custom DNS tests on confidential jobs This test has failed in confidential runtime jobs. Skip it until we don't have a fix. Fixes: #9663 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-05-22 17:08:22 -03:00
Fabiano Fidêncio	904370ecd6	tests: attestation: tdx: Skip test for now Skipping the test will allow us to have the TDX CI running while we debug the test. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-05-22 20:04:13 +02:00
Fabiano Fidêncio	414d716eef	tests: kbs: Enable cli installation also on CentOS One of our machines is running CentOS 9 Stream, and we could easily verify that we can build and install the kbs client there, thus we're expanding the installation script to also support CentOS 9 Stream. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-05-22 20:01:57 +02:00
Fabiano Fidêncio	27d7f4c5b8	tests: kbs: Fix rust installation `externals.coco-kbs.toolchain` is not defined, get the rust_version from `externals.coco-trustee.toolchain` instead. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-05-22 20:01:57 +02:00
Fabiano Fidêncio	fa8b5c76b8	tests: kbs: Add more info for the TDX deployment Ditto in the commit shortlog. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-05-22 20:01:57 +02:00
Fabiano Fidêncio	6ffd7b8425	versions: trustee: Bump version to 6adb8383309cbb7 We're bumping the version in order to bring in the customisation needed for setting up a custom pccs, which is needed for the KBS integration tests with Kata Containers + TDX. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-05-22 20:01:57 +02:00
Fabiano Fidêncio	dbd1fa51cd	tests: kbs: Don't assume /tmp/trustee exists in the machine Instead, check if the directory exists before pushd'ing into it. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-05-22 20:01:57 +02:00
Gabriela Cervantes	f698caccc0	gha: Enable install kbs and coco components for TDX This PR enables the installation and unistallation of the kbs client as well as general coco components needed for the TDX GHA CI. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-05-22 20:01:57 +02:00
GabyCT	eaaab19763	Merge pull request #9685 from GabyCT/topic/fixic tests: Fix indentation in confidential common script	2024-05-22 11:53:33 -06:00
Gabriela Cervantes	29a10f1373	metrics: Fix minvalue for boot time This PR fixes the minvalue for boot time to avoid the random failures of the GHA CI. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-05-22 17:52:51 +00:00
GabyCT	0b32360ab4	Merge pull request #9684 from stevenhorsman/add-arch-to-component-cache-tags ci: cache: Add arch suffix to all cache tags	2024-05-22 09:24:28 -06:00
Fabiano Fidêncio	0e33ecf7fc	Merge pull request #9653 from JakubLedworowski/fixes-9497-ensure-quote-generation-service-is-added-to-qemu-cmd-2 runtime: Enable connection to Quote Generation Service (QGS)	2024-05-22 15:49:23 +02:00
sidneychang	8938f35627	runtime-rs: Adjust indentation in ifneq statements within Makefile. Replace tab indentation with spaces for the three lines within the ifneq statements, aligning them with the surrounding code. Fixes:#9692 Signed-off-by: sidneychang <2190206983@qq.com>	2024-05-22 20:24:35 +08:00
Fabiano Fidêncio	94f7bbf253	Merge pull request #9682 from fidencio/topic/allow-increasing-cpus-and-memory-via-annotation-for-tdx runtime: tdx: Allow default_{cpu,memory} annotations	2024-05-22 12:07:28 +02:00
Xuewei Niu	d31616cec3	runtime-rs: Remove obsoleted dial_timeout config The `dial_timeout` works fine for Runtime-go, but is obsoleted in Runtime-rs. When the pod cannot connect to the Agent upon starting, we need to adjust the `reconnect_timeout_ms` to increase the number of connection attempts to the Agent. Fixes: #9688 Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2024-05-22 17:57:05 +08:00
Jakub Ledworowski	fc680139e5	runtime: Enable connection to Quote Generation Service (QGS) For the TD attestation to work the connection to QGS on the host is needed. By default QGS runs on vsock port 4050, but can be modified by the host owner. Format of the qemu object follows the SocketAddress structure, so it needs to be provided in the JSON format, as in the example below: -object '{"qom-type":"tdx-guest","id":"tdx","quote-generation-socket":{"type":"vsock","cid":"2","port":"4050"}}' Fixes: #9497 Signed-off-by: Jakub Ledworowski <jakub.ledworowski@intel.com>	2024-05-22 11:16:24 +02:00
Alex Lyn	0331859740	Merge pull request #9642 from gkurz/drop-unused-knobs-qemu-rs runtime-rs: Drop some useless QEMU arguments	2024-05-22 16:13:14 +08:00
Alex Lyn	ce030d1804	Merge pull request #9641 from cmaf/runtime-resize-mem-1 runtime: Add missing check in ResizeMemory for CH	2024-05-22 14:05:30 +08:00

1 2 3 4 5 ...

13785 Commits