kata-containers

mirror of https://github.com/kata-containers/kata-containers.git synced 2026-02-22 14:54:23 +00:00

Author	SHA1	Message	Date
Hyounggyu Choi	6a4ff08156	Merge pull request #9632 from BbolroC/do-not-build-agent-policy-for-s390x local-build: Ensure the default rootfs is built with AGENT_POLICY=yes	2024-05-15 06:56:22 +02:00
Fabiano Fidêncio	92bb235723	osbuilder: Log when the default policy is installed This will help us to debug issues in the future (and would have helped in the past as well). :-) Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-05-14 20:45:49 +02:00
Fabiano Fidêncio	75bd97e8df	build: Ensure the default rootfs is built with AGENT_POLICY=yes This is needed, as `b1710ee2c0` made the default agent shipped the one with policy support. However, we simply didn't update the rootfs to reflect that, which then caused the agent to fail to start, as shown by the strace below: ``` open("/etc/kata-opa/default-policy.rego", O_RDONLY|O_LARGEFILE|O_CLOEXEC) = -1 ENOENT (No such file or directory) futex(0x7f401eba0c28, FUTEX_WAKE_PRIVATE, 1) = 1 rt_sigprocmask(SIG_BLOCK, ~[RTMIN RT_1 RT_2], [], 8) = 0 tkill(553681, SIGABRT) = 0 rt_sigprocmask(SIG_SETMASK, [], NULL, 8) = 0 --- SIGABRT {si_signo=SIGABRT, si_code=SI_TKILL, si_pid=553681, si_uid=1000} --- +++ killed by SIGABRT (core dumped) +++ ``` This happens as the default policy must be set when the agent is built with policy support, but the code path that copies that into the rootfs is only triggered if the rootfs itself is built with AGENT_POLICY=yes, which we're now doing for both confidential and non-confidential cases. Sadly this was not caught by CI until the cache was no longer used for the rootfs, which should be solved by the previous commit. Fixes: #9630, #9631 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-05-14 20:39:15 +02:00
Hyounggyu Choi	37060a7d2e	local-build: Stop using cached artifacts when local-build/* is updated This adds information about the files under `tools/packaging/kata-deploy/local-build/*` to the version of the components, and ensures that the cached artefacts are not used when the files of interest are updated. Fixes: #9630 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-05-14 19:47:33 +02:00
Fabiano Fidêncio	9a3392993d	Merge pull request #9629 from ldoktor/tdx_not_supported_warning kata-deploy: Fix tdx_not_supported call	2024-05-14 17:27:56 +02:00
Greg Kurz	f14a1330d4	Merge pull request #9585 from littlejawa/debugging_the_runtime debugging: adding a script and instructions for debugging the GO shim	2024-05-14 15:31:07 +02:00
Lukáš Doktor	d9ae130031	kata-deploy: Fix tdx_not_supported call The `tdx_not_supported_warning` function does not exist; `tdx_not_supported` should be called instead. Fixes: #9628 Signed-off-by: Lukáš Doktor <ldoktor@redhat.com>	2024-05-14 13:26:07 +02:00
Julien Ropé	e7cfc0865a	debugging: adding a script and instructions for debugging the GO shim Using a debugger with the kata runtime is complicated, but it can be done and can be very useful. This commit provides a helper script that simplifies it, and updates the developer's documentation to explain how to use it. Signed-off-by: Julien Ropé <jrope@redhat.com>	2024-05-14 11:12:31 +02:00
Greg Kurz	e2117d3b71	Merge pull request #9571 from emanuellima1/fix-impl-rtc runtime-rs: Fix constructing the RTC struct	2024-05-14 09:17:27 +02:00
Fabiano Fidêncio	4d5e90038c	Merge pull request #9626 from fidencio/topic/prepare-for-3.5.0-release release: Bump VERSIONS file to 3.5.0	2024-05-13 12:52:12 +02:00
Fabiano Fidêncio	0e385452e5	release: Bump VERSIONS file to 3.5.0 Let's bump the VERSIONS file and start preparing for a new release of the project. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-05-13 10:49:09 +02:00
Fabiano Fidêncio	c64b07f981	Merge pull request #9622 from fidencio/topic/unbreak-nvidia-gpu-build build: nvidia-gpu: Fix cache usage of the headers tarball	2024-05-12 14:40:22 +02:00
Fabiano Fidêncio	9713558477	k0s: Use a different port for kube-route's metrics kube-router decided to use :8080 for its metrics, and this seems to be a change that affected k0s 1.30.0+, leading to the kube-router pod crashing all the time, so that nothing can actually be started after that. Due to this issue, let's simply use a different port (:9999) and move on with our tests. Fixes: #9623 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-05-11 23:18:20 +02:00
Fabiano Fidêncio	4cd048444d	build: nvidia-gpu: Fix cache usage of the headers tarball Whenever we count on having the headers tarball, we must unpack the cached content into the expected directory, otherwise we'd simply fail, as we've been failing in our CI, at the end of the process where we generate the tarball from the cached components. It's weird to me, sincerely, that the headers tarball ends up in such a weird place (build/kernel-nvidia-gpu/builddir/), but I'll leave that to Zvonko to figure out whether something better can be done, as the intent of this PR is simply to unblock Kata Containers CI. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-05-11 17:59:53 +02:00
Zvonko Kaiser	4dea73b433	Merge pull request #9616 from zvonkok/nv-kernel-hotfix deploy: Fix wrong pushing of artifacts	2024-05-10 18:38:09 +02:00
Zvonko Kaiser	4d0f42a145	deploy: Fix wrong pushing of artifacts Added explicit case statements for nvidia-gpu and nvidia-gpu-confidential Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2024-05-10 14:08:32 +00:00
Fabiano Fidêncio	20515fed70	Merge pull request #9484 from zvonkok/nvidia-runtimeclasses deploy: Add runtimeClasses relating to the NVIDIA GPU	2024-05-10 03:52:12 +02:00
Emanuel Lima	59c1567f80	runtime-rs: Fix constructing the RTC struct RTC was being built in a wrong fashion on commit #2bc5e3c6e2ab0145fa9e8be95df0d5086c07a517 RTC was being constructed inside the QemuCmdLine struct, but it should've been built inside the devices vector. Signed-off-by: Emanuel Lima <emlima@redhat.com>	2024-05-09 15:00:47 -03:00
Fabiano Fidêncio	2f686b1179	Merge pull request #9608 from fidencio/topic/tdx-depend-on-distro-host-stack-part-II tdx: Adapt kata-deploy to use QEMU / OVMF from the distros	2024-05-09 10:25:19 +02:00
Zvonko Kaiser	da7e6a0f07	deploy: Add runtimeClasses relating to the NVIDIA GPU Fixes: #9483 For the added configurations we need to provide runtimeClasses. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com> Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-05-09 10:00:59 +02:00
Fabiano Fidêncio	96a100f910	Merge pull request #9482 from zvonkok/kernel-headers-tarball kernel: Add caching of kernel-headers	2024-05-09 09:58:30 +02:00
Fabiano Fidêncio	aba56a8adb	tests: measured-rootfs: Skip policy addition Let's skip the policy addition for now, in order to get the TDX CI back up and running, and then we can re-enable it as soon as we get https://github.com/kata-containers/kata-containers/issues/9612 fixed. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-05-09 07:59:12 +02:00
Fabiano Fidêncio	77f457c0e1	runtime: tdx: Drop sept-ve-disable=on This was needed when we were using an old (and not maintained anymore) host stack. Considering what we have as part of the distros today, this can simply be dropped, as I cannot find any reference to it being needed in any up-to-date documentation. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-05-09 07:59:12 +02:00
Fabiano Fidêncio	416d00228c	Revert "qemu: tdx: Adapt command line" (partially) This reverts commit `b7cccfa019`. The `private=on` bit has never made its way upstream, and was removed from the latest iteration that we're using. With that in mind, let's revert its usage in the code. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-05-09 07:59:12 +02:00
Fabiano Fidêncio	1c3037fd25	Revert "govmm: tdx: Expose the private=on|off knob" This reverts commit `582b5b6b19`. The `private=on` bit has never made its way upstream, and was removed from the latest iteration that we're using. With that in mind, let's revert its addition, and later on its usage in the code. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-05-09 07:59:12 +02:00
Fabiano Fidêncio	a9720495de	kata-deploy: Ensure the distro QEMU and OVMF are used for TDX Here we're checking the distro's `/etc/os-release` or `/usr/lib/os-release` in order to get which distro we're deploying the Kata Containers artefacts to, and then to properly adjust the QEMU and OVMF with TDX support that's been shipped with the distros. Together with that, we're also printing the instructions provided by the distro on how to enable and use TDX. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-05-09 07:59:12 +02:00
Fabiano Fidêncio	f48450b360	runtime: config: tdx: Add QEMU / OVMF placeholder var Let's add the PLACEHOLDER_FOR_DISTRO_{QEMU,OVMF}_WITH_TDX_SUPPORT variables instead of actually setting a path, so we can easily replace those as part of our deployment scripts. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-05-09 07:59:12 +02:00
Fabiano Fidêncio	84b94dc2b1	kata-deploy: Expose /host to the daemon-set We'll need to have access to the host os-release file (either under `/etc/os-release` or under `/usr/lib/os-release`), and the simplest approach that comes to my mind is doing what a debug pod would do: mounting `/` as `/host`, which gives us access to those files and lets us correctly set the TDX specific QEMU and OVMF (TDVF) paths for the available tdx configurations. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-05-09 07:59:12 +02:00
Fabiano Fidêncio	f2d40da8e4	versions: build: Remove unused td-shim entry We haven't been using nor testing with td-shim, as Cloud Hypervisor does not officially support TDX yet, and TDVF is supposed to be used with QEMU, instead of td-shim. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-05-09 07:59:12 +02:00
Fabiano Fidêncio	ea82740b19	versions: build: Remove TDX specific QEMU Let's remove everything related to the TDX specific QEMU building / shipping from our repo, as we'll be relying on the one coming from the distros. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-05-09 07:59:12 +02:00
Fabiano Fidêncio	4292c4c3b1	versions: build: Remove TDX specific OVMF (TDVF) Let's remove everything related to the TDVF building / shipping from our repo, as we'll be relying on the one coming from the distro. Later on, we may need to re-add TDVF logic, as we're already using upstream edk2 repo / content, but when that's needed we'll simply revert this commit. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-05-09 07:59:12 +02:00
Alex Lyn	946f0bdfff	Merge pull request #9609 from fidencio/topic/skip-pull-image-tests-on-tees tests: pull-image: Don't run on TEEs	2024-05-09 08:22:55 +08:00
GabyCT	3b8a910393	Merge pull request #9596 from lifupan/main db: fix the issue of failed to init pci root bus	2024-05-08 13:14:20 -06:00
Fabiano Fidêncio	142342012c	tests: pull-image: Don't run on TEEs Let's skip those tests on TEEs as we've been facing a reasonable amount of issues, most likely on the containerd side, related to pulling the image on the guest. Once we're able to fix the issues on containerd, we can get back and re-enable those by reverting this commit. The decision of disabling the tests for TEEs is because the machines may end up in a state where human intervention is necessary to get them back to a functional state, and that's really not optimal for our CI. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-05-08 18:40:22 +02:00
Fabiano Fidêncio	c0bf9e9bc6	Merge pull request #9607 from fidencio/topic/tdx-depend-on-distro-host-stack-part-I ci: Stop building TDX specific QEMU and OVMF	2024-05-08 15:53:15 +02:00
Zvonko Kaiser	fb0b821771	kernel: Add caching of kernel-headers Fixes: #9481 We need to cache the kernel-headers for the NVIDIA GPU initrd/image build. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2024-05-08 11:30:39 +00:00
Fabiano Fidêncio	12dc9f83df	ci: Stop building TDX specific QEMU and OVMF This is the first step of the work to start relying on the artefacts coming from the distros (CentOS 9 Stream, and Ubuntu) themselves. Let's have this first one merged, as this will not run the CI due to the changes being on the yaml itself, and then follow-up with the changes needed on other parts of the project (kata-deploy, runtime, etc). Fixes: #9590 -- part I Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-05-08 11:39:32 +02:00
Alex Lyn	875e6e3815	Merge pull request #9601 from cncal/fix_redundant_log qemu: the error is logged only when it occurs	2024-05-08 08:59:01 +08:00
GabyCT	22087f9db9	Merge pull request #9598 from lifupan/main_shim runtime-rs: fix the issue of the leak of dead shim	2024-05-07 10:14:11 -06:00
GabyCT	a564422b7b	Merge pull request #9582 from cncal/main build: fix the confusing build message if yq doesn't exist in GOPATH/bin	2024-05-07 09:34:27 -06:00
Fabiano Fidêncio	cd84414c63	Merge pull request #9600 from GabyCT/topic/deleteoci versions: Remove oci information from versions file	2024-05-07 13:15:35 +02:00
Fabiano Fidêncio	ddf6b367c7	Merge pull request #9568 from kata-containers/dependabot/go_modules/src/runtime/go_modules-22ef55fa20 build(deps): bump the go_modules group across 5 directories with 8 updates	2024-05-07 13:14:48 +02:00
Steve Horsman	e967db60ab	Merge pull request #9592 from sprt/mariner-before-ch39 tests: adapt Mariner CI to unblock CH v39 upgrade	2024-05-07 11:52:55 +01:00
cncal	15d511af97	qemu: the error is logged only when it occurs Every time I create a container on an arm64 machine, containerd/kata logs a redundant warning as follows: ``` shell time="2024-05-07" level=warning msg="<nil>" arch=arm64 name=containerd-shim-v2 pid=xxx sandbox=fdd1f05 source=virtcontainers/hypervisor ``` I added an error check so that the error is logged only when it occurs. Signed-off-by: cncal <flycalvin@qq.com>	2024-05-07 14:28:04 +08:00
Gabriela Cervantes	aecede11fc	versions: Remove oci information from versions file This PR removes oci information from versions file as this is no longer being used in kata containers repository. Fixes #9599 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-05-06 20:14:00 +00:00
Fupan Li	3694f3d9fe	runtime-rs: fix the issue of the leak of dead shim We should init and assign the runtime instance to runtime handler, otherwise, if the pause container failed to start, which means the runtime instance failed to start, then the following delete & shutdown request wouldn't be run, thus the dead shim would be left. Fixes: #9597 Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>	2024-05-06 17:31:31 +08:00
Fupan Li	26bee78e8d	db: fix the issue of failed to init pci root bus dragonball reserves 2048G of mmio space for the pci root bus by default on physical addresses greater than 4G. However, for some machines with smaller physical address widths, such as 39-bit wide physical addresses, dragonball reserves the mmio space when initializing the memory. It is less than 2048G, so this commit dynamically calculates and allocates the mmio size of each pci root bus. Fixes: #9509 Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>	2024-05-06 11:34:18 +08:00
Aurélien Bombo	0cc2b07a8c	tests: adapt Mariner CI to unblock CH v39 upgrade The CH v39 upgrade in #9575 is currently blocked because of a bug in the Mariner host kernel. To address this, we temporarily tweak the Mariner CI to use an Ubuntu host and the Kata guest kernel, while retaining the Mariner initrd. This is tracked in #9594. Importantly, this allows us to preserve CI for genpolicy. We had to tweak the default rules.rego however, as the OCI version is now different in the Ubuntu host. This is tracked in #9593. This change has been tested together with CH v39 in #9588. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2024-05-03 16:29:12 +00:00
cncal	48d873b52b	build: fix the confusing build message if yq doesn't exist in GOPATH/bin The build message shows that yq was not found when I tried to build runtime binaries, but I've actually installed yq by yum install. Signed-off-by: cncal <flycalvin@qq.com>	2024-05-03 08:34:45 +08:00
Zvonko Kaiser	e5e0983b56	Merge pull request #9476 from zvonkok/nvidia-config-tomls config: Add NVIDIA GPU SNP, TDX configuration files	2024-05-02 10:27:10 +02:00
Fabiano Fidêncio	f04a7a55ed	Merge pull request #9563 from fidencio/topic/agent-use-policy-by-default build: Build the shipped agent with policy enabled	2024-05-01 12:22:05 +02:00
Fabiano Fidêncio	33a8701904	Merge pull request #9573 from littlejawa/kata_deploy_crio_conf kata-deploy: configure debugging for crio	2024-05-01 12:19:10 +02:00
Julien Ropé	c2aed995b7	kata-deploy: configure debugging for crio Fix the configuration for crio's log_level Fixes: #9556 Signed-off-by: Julien Ropé <jrope@redhat.com>	2024-04-30 17:48:43 +02:00
stevenhorsman	3c2232d898	runtime: fix testVersionString logic - The testVersionString logic uses regex to check that the ociVersion is displayed correctly, but with the new go module that version has a `+` in it, so we need to quote this to escape special characters Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-04-30 10:54:49 +01:00
dependabot[bot]	391bc35805	build(deps): bump the go_modules group across 5 directories with 8 updates Bumps the go_modules group with 2 updates in the /src/runtime directory: [github.com/containerd/containerd](https://github.com/containerd/containerd) and [github.com/containers/podman/v4](https://github.com/containers/podman). Bumps the go_modules group with 4 updates in the /src/tools/csi-kata-directvolume directory: [golang.org/x/sys](https://github.com/golang/sys), google.golang.org/protobuf, [golang.org/x/net](https://github.com/golang/net) and [google.golang.org/grpc](https://github.com/grpc/grpc-go). Bumps the go_modules group with 2 updates in the /src/tools/log-parser directory: [golang.org/x/sys](https://github.com/golang/sys) and gopkg.in/yaml.v3. Bumps the go_modules group with 2 updates in the /tests directory: [golang.org/x/sys](https://github.com/golang/sys) and gopkg.in/yaml.v3. Bumps the go_modules group with 2 updates in the /tools/testing/kata-webhook directory: [golang.org/x/sys](https://github.com/golang/sys) and [golang.org/x/net](https://github.com/golang/net). 
Updates `github.com/containerd/containerd` from 1.7.2 to 1.7.11 - [Release notes](https://github.com/containerd/containerd/releases) - [Changelog](https://github.com/containerd/containerd/blob/main/RELEASES.md) - [Commits](https://github.com/containerd/containerd/compare/v1.7.2...v1.7.11) Updates `github.com/containers/podman/v4` from 4.2.0 to 4.9.4 - [Release notes](https://github.com/containers/podman/releases) - [Changelog](https://github.com/containers/podman/blob/v4.9.4/RELEASE_NOTES.md) - [Commits](https://github.com/containers/podman/compare/v4.2.0...v4.9.4) Updates `google.golang.org/protobuf` from 1.29.1 to 1.33.0 Updates `github.com/cyphar/filepath-securejoin` from 0.2.3 to 0.2.4 - [Release notes](https://github.com/cyphar/filepath-securejoin/releases) - [Commits](https://github.com/cyphar/filepath-securejoin/compare/v0.2.3...v0.2.4) Updates `golang.org/x/sys` from 0.15.0 to 0.19.0 - [Commits](https://github.com/golang/sys/compare/v0.15.0...v0.19.0) Updates `google.golang.org/protobuf` from 1.31.0 to 1.33.0 Updates `golang.org/x/net` from 0.19.0 to 0.23.0 - [Commits](https://github.com/golang/net/compare/v0.19.0...v0.23.0) Updates `google.golang.org/grpc` from 1.59.0 to 1.63.2 - [Release notes](https://github.com/grpc/grpc-go/releases) - [Commits](https://github.com/grpc/grpc-go/compare/v1.59.0...v1.63.2) Updates `golang.org/x/sys` from 0.0.0-20191026070338-33540a1f6037 to 0.1.0 - [Commits](https://github.com/golang/sys/compare/v0.15.0...v0.19.0) Updates `gopkg.in/yaml.v3` from 3.0.0-20200313102051-9f266ea9e77c to 3.0.0 Updates `golang.org/x/sys` from 0.0.0-20220429233432-b5fbb4746d32 to 0.19.0 - [Commits](https://github.com/golang/sys/compare/v0.15.0...v0.19.0) Updates `gopkg.in/yaml.v3` from 3.0.0-20210107192922-496545a6307b to 3.0.0 Updates `golang.org/x/sys` from 0.15.0 to 0.19.0 - [Commits](https://github.com/golang/sys/compare/v0.15.0...v0.19.0) Updates `golang.org/x/net` from 0.19.0 to 0.23.0 - 
[Commits](https://github.com/golang/net/compare/v0.19.0...v0.23.0) --- updated-dependencies: - dependency-name: github.com/containerd/containerd dependency-type: direct:production dependency-group: go_modules - dependency-name: github.com/containers/podman/v4 dependency-type: direct:production dependency-group: go_modules - dependency-name: google.golang.org/protobuf dependency-type: direct:production dependency-group: go_modules - dependency-name: github.com/cyphar/filepath-securejoin dependency-type: indirect dependency-group: go_modules - dependency-name: golang.org/x/sys dependency-type: indirect dependency-group: go_modules - dependency-name: google.golang.org/protobuf dependency-type: indirect dependency-group: go_modules - dependency-name: golang.org/x/net dependency-type: direct:production dependency-group: go_modules - dependency-name: google.golang.org/grpc dependency-type: direct:production dependency-group: go_modules - dependency-name: golang.org/x/sys dependency-type: indirect dependency-group: go_modules - dependency-name: gopkg.in/yaml.v3 dependency-type: indirect dependency-group: go_modules - dependency-name: golang.org/x/sys dependency-type: indirect dependency-group: go_modules - dependency-name: gopkg.in/yaml.v3 dependency-type: indirect dependency-group: go_modules - dependency-name: golang.org/x/sys dependency-type: indirect dependency-group: go_modules - dependency-name: golang.org/x/net dependency-type: indirect dependency-group: go_modules ... Signed-off-by: dependabot[bot] <support@github.com>	2024-04-30 09:46:13 +01:00
Wainer Moschetta	eae429a39b	Merge pull request #9552 from wainersm/kata_cc_dev runtime: new qemu-coco-dev configuration	2024-04-30 05:21:49 -03:00
Zvonko Kaiser	28078ded84	Merge pull request #9570 from stevenhorsman/dependabot-commit-check-skip workflow: static-checks: Skip commit checks for dependabout	2024-04-29 23:00:35 +02:00
Pavel Mores	1dd06cf40d	Merge pull request #9551 from pmores/support-iommu runtime-rs: support IOMMU in qemu VMs	2024-04-29 15:26:11 +02:00
stevenhorsman	0bec8721cc	workflow: Skip commit checks for dependabout Dependabot doesn't follow all our commit format guidelines, so add a check and skip these if the author is `dependabot[bot]` Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-04-29 13:45:51 +01:00
Wainer dos Santos Moschetta	631f6f6ed6	gha: switch CoCo tests on non-TEE to use qemu-coco-dev With the addition of the 'qemu-coco-dev' runtimeClass we no longer need to run CoCo tests on non-TEE environments with 'qemu'. As a result the tests also no longer need to set the "io.katacontainers.config.hypervisor.image" annotation to pods. Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-04-29 05:45:11 -03:00
Wainer dos Santos Moschetta	c6708726ff	kata-deploy: install the new kata-qemu-coco-dev runtimeclass Created the runtimeclasses/kata-qemu-coco-dev.yaml file and updated the list of SHIMS. Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-04-29 05:45:11 -03:00
Wainer dos Santos Moschetta	42fb5d7760	runtime: new qemu-coco-dev configuration Created a new configuration to configure Kata for CoCo without requiring TEE hardware so to allow developers implement/test/debug platform agnostic code on their workstations. It will also ease testing of CoCo features on CI with non-TEE supported VMs. This is based off qemu configuration. The following differences applied: - switched to confidential guest image/initrd - switched to confidential kernel - switched to 9p shared_fs Fixes #9487 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-04-29 05:45:10 -03:00
Fabiano Fidêncio	d3b300ff95	build: tests: Remove agent-opa Now that the `kata-agent` is being built with policy support, let's stop building the `kata-opa-agent`, reducing the amount of things we need to test and maintain. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-04-28 12:52:54 +02:00
Fabiano Fidêncio	b1710ee2c0	build: Build the shipped agent with policy enabled Now that the OPA binary is not required anymore, let's start shipping the agent with the policy enabled by default. The agent without policy enabled has 30MB, while it's 34MB with the policy enabled. This 4MB (~10%) increase is, IMHO, worth it in order to reduce the amount of components we have to maintain and test, including the possibility to also reduce the amount of possible rootfs / initrd images. Whoever wants to use the agent without policy enabled can simply do that by building their own agent. :-) Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-04-28 12:52:54 +02:00
Fabiano Fidêncio	7b039eb1b9	Merge pull request #9559 from fidencio/topic/remove-opa-stuff rootfs: Stop building and shipping OPA	2024-04-28 12:52:07 +02:00
Fabiano Fidêncio	fe21d7a58b	rootfs: Stop building and shipping OPA Since OPA binary was replaced by the regorus crate, we can finally stop building and shipping the binary. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-04-26 18:51:28 +02:00
Fabiano Fidêncio	7dd2fde22d	Revert "rootfs: Make OPA build working in docker for s390x and ppc64le" This reverts commit `d523e865c0`, as we will not depend on the OPA binary anymore. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-04-26 18:51:27 +02:00
Hyounggyu Choi	62bad976e0	Merge pull request #9562 from BbolroC/bump-golang build: Update golang version to 1.22.2	2024-04-26 17:58:04 +02:00
Steve Horsman	34a1cdc5c7	Merge pull request #9528 from cncal/patch-1 doc: fix missing document link	2024-04-26 15:22:15 +01:00
Hyounggyu Choi	80cb4a6c18	build: Update golang version to 1.22.2 As we have an issue with a golang version for `run-cri-containerd`, it is required to bump the language. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-04-26 15:50:29 +02:00
Pavel Mores	908ec31d9b	runtime-rs: fix iommu_platform support for qemu vhost-user-fs device iommu_platform support was already added on initial DeviceVhostUserFs introduction, however it incorrectly enabled iommu_platform also on non-CCW (e.g. PCI) systems. Signed-off-by: Pavel Mores <pmores@redhat.com>	2024-04-26 14:48:00 +02:00
Pavel Mores	174fc8f44b	runtime-rs: support iommu_platform for qemu virtio-net device Note that it's only supported on CCW systems. Signed-off-by: Pavel Mores <pmores@redhat.com>	2024-04-26 14:48:00 +02:00
Pavel Mores	0d038f20cc	runtime-rs: support iommu_platform for qemu virtio-serial device iommu_platform is only turned on for CCW systems. PartialEq is added to VirtioBusType to enable the '==' operator. Signed-off-by: Pavel Mores <pmores@redhat.com>	2024-04-26 14:48:00 +02:00
Pavel Mores	66a2dc48ae	runtime-rs: support iommu_platform for qemu vhost-vsock device iommu_platform addition is controlled solely by the configuration file. Signed-off-by: Pavel Mores <pmores@redhat.com>	2024-04-26 14:48:00 +02:00
Pavel Mores	d1e6f9cc4e	runtime-rs: add IOMMU to qemu VM if configured The adding itself is done by a new function add_iommu() that conforms with the add_() convention. Note though that this function is called internally, by the QemuCmdLine constructor, simply because there's nothing to trigger its invocation from QemuInner (unlike the other add_() functions so far). Signed-off-by: Pavel Mores <pmores@redhat.com>	2024-04-26 14:48:00 +02:00
Pavel Mores	0859f47a17	runtime-rs: add representation of '-device intel-iommu' to qemu-rs Following the golang shim example, the values are hardcoded. Signed-off-by: Pavel Mores <pmores@redhat.com>	2024-04-26 14:47:51 +02:00
Pavel Mores	702bf0d35e	runtime-rs: support qemu machine's 'kernel_irqchip' param We will want to set kernel_irqchip when enabling IOMMU and this commit adds the requisite support. Signed-off-by: Pavel Mores <pmores@redhat.com>	2024-04-26 14:42:54 +02:00
Alex Lyn	f72c6ba814	Merge pull request #9519 from emanuellima1/impl-rtc runtime-rs: Add RTC to QEMU cmdline	2024-04-26 17:44:47 +08:00
Dan Mihai	b42ddaf15f	Merge pull request #9530 from microsoft/saulparedes/improve_caching genpolicy: changing caching so the tool can run concurrently with itself	2024-04-25 13:06:23 -07:00
David Esparza	ae317a319f	Merge pull request #9549 from JakubLedworowski/fix-tarball-dockerfile build: Fix tarball not building correctly in docker	2024-04-25 09:40:20 -06:00
James O. D. Hunt	5bd614530f	Merge pull request #9525 from jodh-intel/gha-k8s-ch-dm gha: Enable k8s tests for cloud hypervisor with devicemapper	2024-04-25 09:28:09 +01:00
Fabiano Fidêncio	b4360e7e37	Merge pull request #9510 from microsoft/danmihai1/regorus-policy2 agent: use regorus instead of opa	2024-04-24 21:40:29 +02:00
James O. D. Hunt	ff7349b6f0	gha: Enable k8s tests for cloud hypervisor with devicemapper Enable the k8s tests for cloud hypervisor with devicemapper. Fixes: #9221. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com> Co-authored-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-04-24 16:32:51 +01:00
Dan Mihai	2400a4d249	Merge pull request #9428 from arc9693/archana1/genplicyfixes genpolicy: implement default methods for K8sResource trait	2024-04-24 08:04:19 -07:00
Dan Mihai	ff385eac41	agent: remove unnecessary comment Remove reminder to initialize Policy earlier, because currently there are no plans to initialize earlier. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-04-24 14:53:51 +00:00
Jakub Ledworowski	73366da9f9	build: Fix tarball not building correctly in docker When docker is installed on the host system using script from https://get.docker.com/ it automatically creates a docker group with gid=999. Then during docker build process of tarball, eg. make qemu-tdx-experimental-tarball docker is also installed inside the image with the same script, which also automatically adds docker group with gid=999. Then, the build tries to add a new group docker_on_host with gid=999, which already exists, which breaks the build. Signed-off-by: Jakub Ledworowski <jakub.ledworowski@intel.com>	2024-04-24 15:35:36 +02:00
Calvin Liu	56a73ee704	doc: fix missing document link Document section hardware-requirements locates to /README.md for now. Signed-off-by: Calvin Liu <flycalvin@qq.com>	2024-04-24 17:34:30 +08:00
Fabiano Fidêncio	4e35f11a3d	Merge pull request #9535 from fidencio/topic/fix-crio-debug-drop-in kata-deploy: Stop append `log_level = "debug"` for CRI-O	2024-04-24 10:03:36 +02:00
Dan Mihai	89c85dfe84	Merge pull request #9432 from UiPath/fix-clh-wait clh: isClhRunning waits for full timeout when clh exits	2024-04-23 13:02:45 -07:00
Hyounggyu Choi	608df9b7df	Merge pull request #9494 from BbolroC/guest-pull-gha-s390x CC: Enable guest-pull tests on non-TEE for s390x	2024-04-23 21:22:37 +02:00
Dan Mihai	e5c3f5fa9b	tests: no generated policy for untested platforms Avoid auto-generating Policy on platforms that haven't been tested yet with auto-generated Policy. Support for auto-generated Policy on these additional platforms is coming up in future PRs, so the tests being fixed here were prematurely enabled. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-04-23 16:07:03 +00:00
Emanuel Lima	2bc5e3c6e2	runtime-rs: Add RTC to QEMU cmdline Add RTC by hardcoding the options base=utc,driftfix=slew,clock=host Signed-off-by: Emanuel Lima <emlima@redhat.com>	2024-04-23 10:46:30 -03:00
Fabiano Fidêncio	d190c9d4d9	kata-deploy: Stop append `log_level = "debug"` for CRI-O This should only be done once, and if CRI-O restarts, there's a big chance kata-deploy will also restart and the user would end up with a file that looks like: ``` [crio] log_level = "debug" [crio] log_level = "debug" [crio] log_level = "debug" ... ``` And that would simply cause CRI-O to not start. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-04-23 14:51:35 +02:00
Greg Kurz	42a79801f3	Merge pull request #9524 from littlejawa/fix_createruntime_hook_not_called runtime: Call CreateRuntime hooks at container creation time	2024-04-23 13:43:36 +02:00
Fupan Li	469c4e4f44	Merge pull request #9335 from Tim-Zhang/fix-passfd-fifo-open passfd-io: fix FIFO opening and vsock handling	2024-04-23 09:04:45 +08:00
Alex Lyn	bc2cf95e7a	Merge pull request #9517 from amshinde/update-storage-source-pciblock runtime-rs: Update storage source for pci block devices	2024-04-23 07:32:36 +08:00
Dan Mihai	5d31eb4847	agent: use regorus 0.1.4 Use regorus 0.1.4 from crates.io, instead of its source code repository. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-04-22 23:21:17 +00:00
Dan Mihai	ed6412b63c	tests: k8s: reduce the policy tests output noise Hide some of the kubectl output, to reduce the size and redundancy of this output. Fixes: #9388 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-04-22 19:59:33 +00:00
Dan Mihai	df23eb09a6	agent: use regorus instead of opa Implement Agent Policy using the regorus crate instead of the OPA daemon. The OPA daemon will be removed from the Guest rootfs in a future PR. Fixes: #9388 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-04-22 19:58:30 +00:00
Dan Mihai	58e608d61a	tests: remove k8s-policy-set-keys.bats Remove k8s-policy-set-keys.bats in preparation for using the regorus crate instead of the OPA daemon for evaluating the Agent Policy. This test depended on sending HTTP requests to OPA. Fixes: #9388 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-04-22 19:49:38 +00:00
Dan Mihai	b509c1beee	agent: lock anyhow version to 1.0.58 Lock anyhow version to 1.0.58 because: - Versions between 1.0.59 - 1.0.76 have not been tested yet using Kata CI. However, those versions pass "make test" for the Kata Agent. - Versions 1.0.77 or newer fail during "make test" - see https://github.com/kata-containers/kata-containers/issues/9538. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-04-22 19:49:15 +00:00
Archana Shinde	cc6b671101	runtime-rs: Update storage source for pci block devices In case of block devices using virtio-block, we need to pass the pci-path as the storage source field to the agent. Currently the virt-path is being passed, which works only for mmio block devices. In the future when support is added for scsi, block-ccw and pmem devices, the storage source would need to be handled accordingly. Fixes: #9034 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com> Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2024-04-22 11:36:58 -07:00
Hyounggyu Choi	f10744df99	CC: Enable guest-pull tests on non-TEE for s390x This commit is to add a new CI job to run-k8s-tests-on-zvsi.yaml. Why the job is not configured in run-kata-coco-tests.yaml by having it integrated with `run-k8s-tests-coco-nontee` is: - It uses k3s instead of AKS - It runs on a self-hosted runner These differences make the integrated job not easy to read and maintain when it comes to incorporating other platforms in the near future. Fixes: #9467 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-04-22 17:15:20 +02:00
Greg Kurz	6ca0f09710	Merge pull request #9518 from microsoft/danmihai1/agent-cargo-lock agent: update cargo.lock	2024-04-22 13:36:06 +02:00
Tim Zhang	aeba483ec8	agent: avoid fd leakage of passfd-io In do_create_container and do_exec_process, we should create the proc_io first, so that if an error occurs later on, the io stream is still closed. Signed-off-by: Tim Zhang <tim@hyper.sh> Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>	2024-04-22 17:39:33 +08:00
Tim Zhang	8441187d5e	runtime-rs: fix FIFO handling Fixes: #9334 On Linux, when a FIFO is opened and there are no writers, the reader continuously receives the HUP event. To avoid this, we open stdin in write mode and keep the stdin writer. We also need to open stdout/stderr in read mode and keep that endpoint open until the process is deleted. Otherwise, the process could exit before the containerd side opens and reads the stdout FIFO: runD would write all of the stdout contents into the FIFO and then close the write endpoint. When containerd then opened the stdout FIFO and tried to read, the write side would already be closed, so containerd would block on the read forever. Here we keep the stdout/stderr read endpoint File in the common_process, which is destroyed when containerd sends the delete RPC call. By that time containerd has waited for the stdout read to return, which ensures that the contents of the stdout/stderr FIFOs are not lost. Signed-off-by: Tim Zhang <tim@hyper.sh> Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>	2024-04-22 17:39:33 +08:00
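The stdin half of the FIFO fix above (holding a writer open so the reader does not see HUP) can be illustrated with a small Linux-only sketch. This is Python rather than the Rust used by runtime-rs, and the names `has_hup`, `demo`, `keeper` and `client` are hypothetical; it only shows the kernel behaviour the commit relies on, not the actual change.

```python
import os
import select
import tempfile


def has_hup(fd: int) -> bool:
    """Return True if poll() currently reports POLLHUP on fd."""
    poller = select.poll()
    poller.register(fd, select.POLLIN)
    return any(events & select.POLLHUP for _fd, events in poller.poll(0))


def demo():
    """Compare the reader's view with and without a kept-open writer."""
    with tempfile.TemporaryDirectory() as tmp:
        fifo = os.path.join(tmp, "stdin")
        os.mkfifo(fifo)
        reader = os.open(fifo, os.O_RDONLY | os.O_NONBLOCK)
        # Writer held open on purpose, like the kept stdin writer.
        keeper = os.open(fifo, os.O_WRONLY | os.O_NONBLOCK)
        # A short-lived writer standing in for the real client.
        client = os.open(fifo, os.O_WRONLY | os.O_NONBLOCK)
        os.write(client, b"hi")
        os.close(client)
        os.read(reader, 16)  # drain the data so only HUP could remain
        with_keeper = has_hup(reader)
        os.close(keeper)  # now the FIFO has no writers at all
        without_keeper = has_hup(reader)
        os.close(reader)
        return with_keeper, without_keeper
```

While `keeper` is open the reader sees no HUP even though the client has gone away; once the last writer is closed, HUP is reported on every poll.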
Tim Zhang	d68eb7f0ad	agent: Fix close_stdin for passfd-io In scenario passfd-io, we should wait for stdin to close itself instead of manually intervening in it. Signed-off-by: Tim Zhang <tim@hyper.sh> Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>	2024-04-22 17:39:32 +08:00
Steve Horsman	ff9985fc50	Merge pull request #9490 from wainersm/port_attestation_nontee_job gha: move attestation tests to run-k8s-tests-coco-nontee	2024-04-22 10:23:11 +01:00
Archana Choudhary	4a010cf71b	genpolicy: add default implementations for K8sResource trait This commit adds default implementations for following methods of K8sResource trait: - generate_policy - serialize Fixes: #8960 Signed-off-by: Archana Choudhary <archana1@microsoft.com>	2024-04-21 12:59:02 +00:00
Archana Choudhary	6edc3b6b0a	genpolicy: add default implementation for use_sandbox_pidns This patch adds a default implementation for the use_sandbox_pidns and updates the structs that implement the K8sResource trait to use the default. Fixes: #8960 Signed-off-by: Archana Choudhary <archana1@microsoft.com>	2024-04-21 12:59:02 +00:00
Archana Choudhary	d5d3f9cda7	genpolicy: add default implementation for use_host_network - Provide default implementation for use_host_network - Remove default implementation from structs implementing the trait K8sResource Fixes: #8960 Signed-off-by: Archana Choudhary <archana1@microsoft.com>	2024-04-21 12:59:02 +00:00
Archana Choudhary	9a3eac5306	genpolicy: add default impl for get_containers - Provide default impl for get_containers - Remove default impl from structs implementing the trait K8sResource Fixes: #8960 Signed-off-by: Archana Choudhary <archana1@microsoft.com>	2024-04-21 12:59:02 +00:00
Archana Choudhary	2db3470602	genpolicy: add default impl for get_container_mounts_and_storages - Provide default impl for get_container_mounts_and_storages - Remove default impl from structs implementing the trait K8sResource Fixes: #8960 Signed-off-by: Archana Choudhary <archana1@microsoft.com>	2024-04-21 12:59:02 +00:00
Archana Choudhary	09b0b4c11d	genpolicy: add default implementation for get_sandbox_name - Provide default implementation for get_sandbox_name in K8sResource trait - Remove default implementation from structs implementing the trait K8sResource Fixes: #8960 Signed-off-by: Archana Choudhary <archana1@microsoft.com>	2024-04-21 12:55:32 +00:00
Archana Choudhary	43e9de8125	genpolicy: add default implementation for get_annotations - Provide default implementation for get_annotations. - Remove default implementation from structs implementing the trait K8sResource Fixes: #8960 Signed-off-by: Archana Choudhary <archana1@microsoft.com>	2024-04-21 12:55:32 +00:00
Saul Paredes	2149cb6502	genpolicy: changing caching so the tool can run concurrently with itself Based on 3a1461b0a5186a92afedaaea33ff2bd120d1cea0 Previously the tool would use the layers_cache folder for all instances and hence delete the cache when it was done, interfering with other instances. This change makes it so that each instance of the tool will have its own temp folder to use. Signed-off-by: Saul Paredes <saulparedes@microsoft.com>	2024-04-19 15:46:30 -07:00
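The per-instance temp folder idea above can be sketched as follows. This is an illustrative Python sketch, not genpolicy's Rust code; the `LayerCache` class and its methods are hypothetical names.

```python
import tempfile
from pathlib import Path


class LayerCache:
    """Hypothetical layer cache backed by a private temporary directory."""

    def __init__(self) -> None:
        # Each instance gets its own directory instead of a shared folder.
        self._tmp = tempfile.TemporaryDirectory(prefix="layers_cache_")
        self.path = Path(self._tmp.name)

    def store(self, name: str, data: bytes) -> Path:
        """Write one cached item and return its location."""
        target = self.path / name
        target.write_bytes(data)
        return target

    def close(self) -> None:
        """Delete only this instance's directory."""
        self._tmp.cleanup()
```

Because cleanup is scoped to the instance's own directory, one run finishing cannot delete files that a concurrent run is still using.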
Wainer dos Santos Moschetta	1e35291fd5	gha: move attestation tests to run-k8s-tests-coco-nontee The new run-k8s-tests-coco-nontee job should be the home of attestation tests. Changed run-k8s-tests-coco-nontee to get KBS installed and by the time the KBS variable is exported in the environment then the attestation tests will kick in (likewise they will skip in run-k8s-tests-on-aks). Fixes #9455 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-04-19 14:51:30 -03:00
Steve Horsman	7e12d588c0	Merge pull request #9485 from sparky005/update_golang.org/x/net update golang.org/x/net	2024-04-19 11:26:13 +01:00
Amulya Meka	12964256a4	Merge pull request #9521 from Amulyam24/gha gha: tag k8s tests on ppc64le to ppc64le-runner-01	2024-04-19 15:08:08 +05:30
Julien Ropé	70e798ed35	runtime: Call CreateRuntime hooks at container creation time CreateRuntime hooks are called at the CreateSandbox time, but not after CreateContainer. Fixes: #9523 Signed-off-by: Julien Ropé <jrope@redhat.com>	2024-04-19 10:25:02 +02:00
Alex Lyn	3456483df9	Merge pull request #9513 from stevenhorsman/bump-stale-version gha: stale: Bump stalebot version	2024-04-19 15:15:10 +08:00
Alex Lyn	c147f0f4ed	Merge pull request #9516 from sprt/rlz-340 release: bump version for 3.4.0 release	2024-04-19 15:12:26 +08:00
Amulyam24	8255ed248a	gha: tag k8s tests on ppc64le to ppc64le-runner-01 This PR aims at running the k8s tests to one runner on ppc64le. Fixes: #9520 Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2024-04-19 12:04:25 +05:30
Hyounggyu Choi	304dc1e4da	doc: Update how-to-run-kata-containers-with-SE-VMs.md This is to update a document `how-to-run-kata-containers-with-SE-VMs` on using confidential artifacts to build a secure image. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-04-19 08:31:12 +02:00
Hyounggyu Choi	8fbed9f6a4	local-build: Use confidential kernel and initrd for boot-image-se This is to make `boot-image-se-tarball` use confidential kernel and initrd instead of vanilla version of artifacts. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-04-19 07:09:04 +02:00
Dan Mihai	4242801b1c	agent: update cargo.lock Update Kata Agent's Cargo.lock after the recent changes to Cargo.toml. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-04-18 17:12:48 +00:00
Aurélien Bombo	95971e4a42	release: bump version for 3.4.0 release Release v3.4.0. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2024-04-18 17:08:06 +00:00
Steve Horsman	6dd038fd58	Merge pull request #9501 from zvonkok/check-fixes kata: Remove check for "Fixes" in PR	2024-04-18 17:48:50 +01:00
Hyounggyu Choi	2b9c439fcf	Merge pull request #9508 from BbolroC/gha-s390x-k8s-label gha: Make integration tests for s390x run on s390x-large runners	2024-04-18 18:05:01 +02:00
Adil Sadik	1c5ca0c915	runtime: update golang.org/x/net updates golang.org/x/net to newer version that closes some reported vulnerabilities and security issues Fixes #9486 Signed-off-by: Adil Sadik <sparky.005@gmail.com>	2024-04-18 10:55:02 -04:00
Tim Zhang	221c5b51fe	dragonball: fix EPOLLHUP/EPOLLERR events handling in vsock 1. EPOLLHUP events also need to be read, and the read will return len 0. 2. We should kill the connection when EPOLLERR events are received. Signed-off-by: Tim Zhang <tim@hyper.sh> Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>	2024-04-18 20:47:02 +08:00
Hyounggyu Choi	49a0d57f66	gha: Make integration tests for s390x run on s390x-large runners This is to make a workflow `run-k8s-tests` and `run-cri-containerd` (s390x and zvsi) run only on the runners labeled by `s390x-large`. Fixes: #9507 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-04-18 14:35:24 +02:00
stevenhorsman	cf5c3dc155	gha: stale: Bump stalebot version - Bump the stalebot action version to v9 as that fixes the ``` Node.js 16 actions are deprecated. Please update the following actions to use Node.js 20: actions/stale@v8. ``` warning. Fixes: #9512 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-04-18 11:41:09 +01:00
Steve Horsman	bf16b18180	Merge pull request #9503 from stevenhorsman/stale-pr-remove-date gha: stale: Remove the start-date	2024-04-18 09:36:27 +01:00
Hyounggyu Choi	566a6de594	Merge pull request #9505 from BbolroC/remove-crio-nightly-test-s390x gha: Remove k8s-cri-containerd-rhel9-e2e-tests for s390x	2024-04-18 09:31:07 +02:00
Hyounggyu Choi	cc22dc33f2	Merge pull request #9489 from BbolroC/install-opa-in-docker rootfs: Make OPA build working in docker for s390x and pp…	2024-04-18 00:26:11 +02:00
Dan Mihai	5ceed689eb	Merge pull request #9492 from microsoft/danmihai1/pod-tests tests: k8s: inject agent policy failures (part 3)	2024-04-17 14:01:11 -07:00
Hyounggyu Choi	e046f5e652	gha: Remove k8s-cri-containerd-rhel9-e2e-tests for s390x This commit is simply to remove a CI workflow `k8s-cri-containerd-rhel9-e2e-tests`. Fixes: #9504 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-04-17 15:36:42 +02:00
Zvonko Kaiser	eda3bfe2ef	config: Add NVIDIA GPU SNP, TDX configuration files Fixes: #9475 For TDX and SNP add NVIDIA specific configuration files Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2024-04-17 12:49:13 +00:00
Wainer Moschetta	2d8e7933c5	Merge pull request #9461 from GabyCT/topic/uninstallkbs tests/k8s: Add uninstall kbs client command function	2024-04-17 09:36:37 -03:00
Zvonko Kaiser	d7b24c04e5	Merge pull request #9473 from zvonkok/gpu-image-initrd-versions version: add initrd, image NVIDIA sections	2024-04-17 13:22:05 +02:00
stevenhorsman	7235988605	gha: stale: Remove the start-date As documented in https://github.com/actions/stale?tab=readme-ov-file#start-date > The start date is used to ignore the issues and pull requests created before the start date. > Particularly useful when you wish to add this stale workflow on an existing repository > and only wish to stale the new issues and pull requests. As we don't need to treat PRs older than May 2023 as a special case, remove this option. Fixes: #9502 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-04-17 11:19:56 +01:00
Zvonko Kaiser	395e93acd5	kata: Remove Issue - PR dependency We've discussed this over and over. Let's try to get to an agreement here. I will use this issue to remove the mandatory Issue - PR dependency. Fixes: #9500 Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2024-04-17 09:53:08 +00:00
Archana Shinde	af3b19ed18	Merge pull request #9084 from amshinde/document-intel-gpu-vfio docs: Document Intel Discrete GPUs usage with Kata	2024-04-16 16:17:03 -07:00
Archana Shinde	973a15332a	spell-check: Add missing words to spell-check Add missing words to spell-check dictionaries Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2024-04-16 11:50:02 -07:00
Archana Shinde	6f97dc1f60	static-checks: Rename file in doc to make static checks happy Configuration file for qemu with runtime-rs was recently renamed. Doc contains name for old file. This was somehow not caught in the CI earlier. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2024-04-16 11:50:02 -07:00
Archana Shinde	87f0097b18	docs: Document Intel Discrete GPUs usage with Kata Document describes the steps needed to pass an entire Intel Discrete GPU as well as a GPU SR-IOV interface to a Kata Container. Fixes: #9083 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2024-04-16 11:50:02 -07:00
Dan Mihai	2c4d1ef76b	tests: k8s: inject agent policy failures (part 3) Auto-generate the policy and then simulate attacks from the K8s control plane by modifying the test yaml files. The policy then detects and blocks those changes. These test cases are using K8s Pods. Additional policy failures are injected during CI using other types of K8s resources - e.g., using Jobs and Replication Controllers - from separate PRs. Fixes: #9491 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-04-16 18:15:12 +00:00
Dan Mihai	c26dad8fe5	Merge pull request #9294 from burgerdev/burgerdev/genpolicy-configurable-pause genpolicy: support insecure registries and custom pause containers	2024-04-16 09:39:33 -07:00
GabyCT	9238daf729	Merge pull request #9464 from microsoft/danmihai1/rc-tests tests: k8s: inject agent policy failures (part2)	2024-04-16 10:01:39 -06:00
Hyounggyu Choi	d523e865c0	rootfs: Make OPA build working in docker for s390x and ppc64le The commit is to make the OPA build from source working in `ubuntu-rootfs-osbuilder`. To achieve the goal, the configuration is changed as follows: - Switch the make target to `ci-build-linux-static` not triggering docker-in-docker build - Install go in the builder image for s390x and ppc64le Fixes: #9466 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-04-16 16:49:12 +02:00
Greg Kurz	aca6a1bcb5	Merge pull request #9353 from pmores/pr-8866-follow-up runtime-rs: refactor qemu driver	2024-04-16 16:07:36 +02:00
Fabiano Fidêncio	7bb5490676	Merge pull request #9479 from wainersm/fix_coco_nontee_jobs gha: make run-kata-coco-tests inherit secrets	2024-04-16 13:46:52 +02:00
Hyounggyu Choi	7b11fd2546	Merge pull request #9471 from BbolroC/coco-kernel-version-s390x version: Add coco name and version for {image,initrd} for s390x	2024-04-15 16:03:20 +02:00
Wainer dos Santos Moschetta	77541008fc	gha: make run-kata-coco-tests inherit secrets The new CoCo non-tee job introduced on commit `0d5399ba92` needs to read secrets like AZ_TENANT_ID, so run-kata-coco-tests workflow should inherit the secrets from the caller workflow. Fixes #9477 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-04-15 10:53:44 -03:00
Zvonko Kaiser	78e3ebb011	version: add initrd, image NVIDIA sections Fixes: #9472 For initrd and image, the related NVIDIA will not use the default targets and we will pin them to a specific release. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2024-04-15 13:31:35 +00:00
Wainer Moschetta	c85e1ca674	Merge pull request #9404 from ldoktor/ci-mcp-timeout ci.ocp: Increase the MCP update time	2024-04-15 09:42:14 -03:00
Hyounggyu Choi	3ec209dcf1	Merge pull request #9469 from BbolroC/coco-kernel-config-s390x kernel: Adjust s390x config for confidential containers	2024-04-15 13:55:28 +02:00
Hyounggyu Choi	8fce600493	version: Add coco name and version for {image,initrd} for s390x In order to build a coco {image,initrd}, it is required to specify its name and version in versions.yaml. This commit is to add the configuration for them, respectively. Fixes: #9470 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-04-15 12:53:00 +02:00
Hyounggyu Choi	a792dc3e2b	kernel: Adjust s390x config for confidential containers `CONFIG_TN3270_TTY` and `CONFIG_S390_AP_IOMMU` are dropped for s390x in 6.7.x which is used for a confidential kernel. But they are still used for a vanilla kernel. So we need to add them to the whitelist. Fixes: #9465 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-04-15 10:28:59 +02:00
Hyounggyu Choi	32f58abfde	Merge pull request #9403 from BbolroC/runtime-rs-ci-qemu CI: Enable GHA cri-containerd workflow for runtime-rs with QEMU	2024-04-15 09:31:25 +02:00
Xuewei Niu	402d8a968e	Merge pull request #9430 from UiPath/fix-agent-shutdown agent: shutdown vm on exit when agent is used as init process	2024-04-15 10:47:07 +08:00
Wainer Moschetta	0a04f54a8e	Merge pull request #9454 from GabyCT/topic/pulltype gha: Define unbound PULL TYPE variable	2024-04-12 14:48:56 -03:00
Wainer Moschetta	a0b21d0e14	Merge pull request #9424 from wainersm/cc_guest_pull-encrypted CC: run guest-pull tests on non-TEE jobs	2024-04-12 09:34:35 -03:00
Hyounggyu Choi	cf20a6a4ae	gha: Add qemu-runtime-rs to VMM matrix for run-cri-containerd This commit expands the VMM matrix for run-cri-containerd, adding a new item `qemu-runtime-rs` for a test scenario where the VMM is QEMU and runtime-rs is employed. This expansion affects the workflows for both x86_64 and s390x platforms. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-04-12 12:25:53 +02:00
Hyounggyu Choi	606f8e1ab2	runtime-rs: Adjust configuration for qemu-runtime-rs To make `qemu-runtime-rs` working for CI, we have to rename a configuration template file and `CONFIG_FILE_QEMU` in Makefile. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-04-12 12:25:53 +02:00
Hyounggyu Choi	3c217c6c15	ci\|cri-containerd: Introduce qemu-runtime-rs for KATA_HYPERVISOR `qemu-runtime-rs` will be utilized to handle a test scenario where the VMM is QEMU and runtime-rs is employed. Note: Some of the tests are skipped. They are going to be reintegrated in the follow-up PR (Check out #9375). Fixes: #9371 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-04-12 12:25:53 +02:00
Alexandru Matei	9e01732f7a	agent: shutdown vm on exit when agent is used as init process Linux kernel generates a panic when the init process exits. The kernel is booted with panic=1, hence this leads to a vm reboot. When used as a service the kata-agent service has an ExecStop option which does a full sync and shuts down the vm. This patch mimics this behavior when kata-agent is used as the init process. Fixes: #9429 Signed-off-by: Alexandru Matei <alexandru.matei@uipath.com>	2024-04-12 11:32:31 +03:00
Alexandru Matei	54923164b5	clh: isClhRunning waits for full timeout when clh exits isClhRunning uses signal 0 to test whether the process is still alive or not. This doesn't work because the process is a direct child of the shim. Once it is dead the process becomes zombie. Since no one waits for it the process lingers until its parent dies and init reaps it. Hence sending signal 0 in isClhRunning will always return success whether the process is dead or not. This patch calls wait to reap the process, if it succeeds that means it is our child process, if not we send the signal. Fixes: #9431 Signed-off-by: Alexandru Matei <alexandru.matei@uipath.com>	2024-04-12 11:31:53 +03:00
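The zombie problem described above can be reproduced in a few lines. The sketch below is Python rather than the Go used by the runtime, and `exists_by_signal_zero`, `is_running` and `demo` are hypothetical names; it illustrates the "reap first, fall back to signal 0" approach from the commit message and is not the actual patch.

```python
import os
import time


def exists_by_signal_zero(pid: int) -> bool:
    """Probe with signal 0. An unreaped zombie still counts as existing."""
    try:
        os.kill(pid, 0)
    except ProcessLookupError:
        return False
    except PermissionError:
        return True  # the process exists but belongs to another user
    return True


def is_running(pid: int) -> bool:
    """Try to reap the process first; use signal 0 only for non-children."""
    try:
        reaped, _status = os.waitpid(pid, os.WNOHANG)
    except ChildProcessError:
        # Not our child (or already reaped): signal 0 is all we have.
        return exists_by_signal_zero(pid)
    # waitpid returns pid 0 while the child has not exited yet.
    return reaped == 0


def demo():
    """Fork a child that exits at once and compare both checks."""
    pid = os.fork()
    if pid == 0:
        os._exit(0)  # child: exit immediately, leaving a zombie behind
    naive = exists_by_signal_zero(pid)  # True whether running or zombie
    deadline = time.monotonic() + 5.0
    while is_running(pid) and time.monotonic() < deadline:
        time.sleep(0.01)
    return naive, exists_by_signal_zero(pid)
```

Until the parent waits on the child, signal 0 keeps succeeding, which is why a check based on signal 0 alone ran until the full timeout. After `is_running` has reaped the child, the same probe reports that the process is gone.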
Dan Mihai	e51cbdcff9	tests: k8s: inject agent policy failures (part2) Auto-generate the policy and then simulate attacks from the K8s control plane by modifying the test yaml files. The policy then detects and blocks those changes. These test cases are using K8s Replication Controllers. Additional policy failures will be injected using other types of K8s resources - e.g., using Pods and/or Jobs - in separate PRs. Fixes: #9463 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-04-11 21:08:53 +00:00
Markus Rudy	77540503f9	genpolicy: add support for insecure registries genpolicy is a handy tool to use in CI systems, to prepare workloads before applying them to the Kubernetes API server. However, many modern build systems like Bazel or Nix restrict network access, and rightfully so, so any registry interaction must take place on localhost. Configuring certificates for localhost is tricky at best, and since there are no privacy concerns for localhost traffic, genpolicy should allow to contact some registries insecurely. As this is a runtime environment detail, not a target environment detail, configuring insecure registries does not belong into the JSON settings, so it's implemented as command line flags. Fixes: #9008 Signed-off-by: Markus Rudy <webmaster@burgerdev.de>	2024-04-11 22:29:03 +02:00
Wainer dos Santos Moschetta	4f74617897	tests: pass --overwrite-existing to aks get-credentials By passing --overwrite-existing to `aks get-credentials` it will stop asking if I want to overwrite the existing credentials. This is handy for running the scripts locally. Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-04-11 15:31:40 -03:00
Wainer dos Santos Moschetta	3508f3a43a	tests/k8s: use CoCo image on guest-pull when non-TEE When running on non-TEE environments (e.g. KATA_HYPERVISOR=qemu) the tests should be stressing the CoCo image (/opt/kata/share/kata-containers/kata-containers-confidential.img) although currently the default image/initrd is built to be able to do guest-pull as well. Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-04-11 15:31:40 -03:00
Wainer dos Santos Moschetta	c24f13431d	tests/k8s: enable guest-pull tests on non-TEE Enabled guest-pull tests on non-TEE environment. It now requires the SNAPSHOTTER environment variable to avoid it running on jobs where nydus-snapshotter is not installed. Fixes: #9410 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-04-11 15:31:40 -03:00
Wainer dos Santos Moschetta	0d5399ba92	gha: Create CoCo tests jobs on non-TEE Created the new run-k8s-tests-coco-nontee jobs for running CoCo tests on non-TEE. It currently generates the run-k8s-tests-coco-nontee(qemu, nydus, guest-pull) job only to run the guest-pull tests. Fixes: #9410 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-04-11 15:31:40 -03:00
Gabriela Cervantes	5420595d03	tests/k8s: Add uninstall kbs client command function This PR adds a function to uninstall the kbs client command, especially for when we are running on baremetal devices. Fixes #9460 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-04-11 17:06:11 +00:00
Steve Horsman	6b2d655857	Merge pull request #9457 from justxuewei/fs_manager_tests agent: Fix the issue with the "test_new_fs_manager" test	2024-04-11 17:02:58 +01:00
Fabiano Fidêncio	5611233ed8	Merge pull request #9439 from microsoft/danmihai1/job-tests tests: k8s: inject agent policy failures	2024-04-11 17:21:54 +02:00
Markus Rudy	bc2292bc27	genpolicy: make pause container image configurable CRIs don't always use a pause container, but even if they do the concrete container choice is not specified. Even if the CRI config can be tweaked, it's not guaranteed that registries in the public internet can be reached. To be portable across CRI implementations and configurations, the genpolicy user needs to be able to configure the container the tool should append to the policy. Signed-off-by: Markus Rudy <webmaster@burgerdev.de>	2024-04-11 16:26:35 +02:00
Markus Rudy	8b30fa103f	genpolicy: parse json settings during config init Decouple initialization of the Settings struct from creating the AgentPolicy struct, so that the settings are available for evaluating, extending or overriding command line arguments. Signed-off-by: Markus Rudy <webmaster@burgerdev.de>	2024-04-11 16:17:33 +02:00
Xuewei Niu	50f78ec52c	agent: Fix the issue with the "test_new_fs_manager" test This patch introduces a one-time cpath to mitigate the cgroup residuals. It might break the device cgroup merging rules when the cgroup has children. Fixes: #9456 Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2024-04-11 18:06:05 +08:00
GabyCT	08dcdc62de	Merge pull request #9423 from GabyCT/topic/improvecleanup tests: Improve the kbs_k8s_delete function	2024-04-10 14:28:21 -06:00
Gabriela Cervantes	4a2ee3670f	gha: Define unbound PULL TYPE variable This PR defines the PULL_TYPE variable to avoid unbound variable failures when this is being tested locally. Fixes #9453 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-04-10 17:16:19 +00:00
GabyCT	dab837d71d	Merge pull request #9450 from GabyCT/topic/fixinnydus gha: Fix indentation in gha run script	2024-04-10 11:07:56 -06:00
David Esparza	9e1368dbc5	Merge pull request #9391 from dborquez/add-onednn-openvino-ml-benchs add onednn and openvino ml-benchmarks	2024-04-09 19:03:00 -06:00
Dan Mihai	ea31df8bff	Merge pull request #9185 from microsoft/saulparedes/genpolicy_add_containerd_pull genpolicy: Add optional toggle to pull images using containerd	2024-04-09 12:29:19 -07:00
Gabriela Cervantes	6ebdcf8974	gha: Fix indentation in gha run script This PR fixes an indentation issue in the gha run script. Fixes #9449 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-04-09 16:37:17 +00:00
Greg Kurz	89353249fc	Merge pull request #8988 from beraldoleal/ci-docs docs: adding an initial CI documentation	2024-04-09 18:26:15 +02:00
Dan Mihai	2252490a96	tests: k8s: inject agent policy failures Auto-generate the policy and then simulate attacks from the K8s control plane by modifying the test yaml files. The policy then detects and blocks those changes. These test cases are using K8s Jobs. Additional policy failures will be injected using other types of K8s resources - e.g., using Pods and/or Replication Controllers - in future PRs. Fixes: #9406 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-04-09 15:36:57 +00:00
David Esparza	facf3c9364	metrics: Add onednn benchmark. This PR adds onednn test to exercise additional ML benchmarks. Onednn is an Intel-optimized library for Deep Neural Networks. Fixes: #9390 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2024-04-09 09:05:51 -06:00
David Esparza	3bde511d0d	metrics: Add openvino benchmark. This PR adds openvino test in order to exercise additional ML benchmarks. OpenVino is used to optimize and deploy deep learning models. Fixes: #9389 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2024-04-09 09:05:51 -06:00
David Esparza	b37c5f8ba1	metrics:libs: Add HTTPS and HTTP vars to docker build. Include HTTP and HTTPS env variables in the building docker images because they are required to download packages such as Phoronix. Added a restriction that verifies that docker building images is performed as root. Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2024-04-09 09:05:51 -06:00
David Esparza	3355dd9e2b	metrics:libs: Adds a function to set new kata configuration. Adds a function that receives as a single parameter the name of a valid Kata configuration file which will be established as the default kata configuration to start kata containers. Adds a second function that returns the path to the current kata configuration file. Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2024-04-09 09:05:51 -06:00
David Esparza	cb4380d1c9	metrics: common: Add function to clean the cache. The function clears the Page Cache only. Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2024-04-09 09:05:51 -06:00
David Esparza	3a419ba3b1	metrics: common: Add function to update kata config. Add an extra function that updates kata config to use the max num. of vcpus available and to use the available memory in the system. Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2024-04-09 09:05:51 -06:00
Beraldo Leal	959e56525c	docs: adding an initial CI documentation This is actually a first attempt to document our CI, and all this content was based on the document created by Fabiano Fidencio (kudos to him). We are just moving the content and discussion from Google Docs to here. I used the "poetic license" to add some notes on what I believe our CI will look like in the future. Fixes #9006 Co-authored-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Signed-off-by: Beraldo Leal <bleal@redhat.com>	2024-04-09 09:21:47 -04:00
Saul Paredes	51498ba99a	genpolicy: toggle containerd pull in tests - Add v1 image test case - Install protobuf-compiler in build check - Reset containerd config to default in kubernetes test if we are testing genpolicy - Update docker_credential crate - Add test that uses default pull method - Use GENPOLICY_PULL_METHOD in test Signed-off-by: Saul Paredes <saulparedes@microsoft.com>	2024-04-08 19:28:29 -07:00
Dan Mihai	f60c9eaec3	Merge pull request #9398 from microsoft/danmihai1/policy-test-cleanup tests: k8s: improve the Agent Policy tests	2024-04-08 15:37:07 -07:00
Gabriela Cervantes	fb4c359cc2	tests: Improve the kbs_k8s_delete function This PR improves the kbs_k8s_delete function to verify that the resources were properly deleted for baremetal environments. Fixes #9379 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-04-08 18:03:07 +00:00
Saul Paredes	c96ebf237c	genpolicy: add containerd pull method Add optional toggle to use existing containerd installation to pull and manage container images. This adds support to a wider set of images that are currently not supported by standard pull method, such as those that use v1 manifest. Fixes: #9144 Signed-off-by: Saul Paredes <saulparedes@microsoft.com>	2024-04-08 09:56:59 -07:00
Greg Kurz	8b996b9307	Merge pull request #9331 from egernst/foobar katautils: check number of cores on the system instead of go runtime	2024-04-08 18:38:49 +02:00
Greg Kurz	934beb5ae4	Merge pull request #9421 from gkurz/bump-node-js-20 gha: Bump various actions to use Node.js 20	2024-04-08 18:22:28 +02:00
Wainer Moschetta	fba1d394d7	Merge pull request #9369 from ChengyuZhu6/sandbox-image agent:image: Support different pause image in the guest for guest pull	2024-04-08 11:06:21 -03:00
Steve Horsman	3242f55691	Merge pull request #8870 from LindaYu17/aa2main port attestation agent from CCv0 branch to main branch	2024-04-08 15:01:07 +01:00
James O. D. Hunt	42936cb92c	Merge pull request #9372 from jodh-intel/docs-kata-manager-update docs: kata-manager: Update with latest details	2024-04-08 13:23:23 +01:00
stevenhorsman	864e9c22ba	agent: doc: Add new config doc Document the new guest_components_rest_api config parameter Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-04-08 11:38:53 +01:00
stevenhorsman	29a5652e31	packaging: guest-components, set new environment variables - Set KBC_PROVIDER and ATTESTER rather than TEE_PLATFORM to avoid tss build issues for vTPM attester(s) - There are future plans to make a matching TEE_PLATFORM, so this can be simplified once that is available Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-04-08 11:38:53 +01:00
stevenhorsman	a284a20a14	tests: Filter CoCo tests on ppc64le/arm - At the moment we aren't supporting ppc64le or aarch64 for CoCo, so filter out these tests from running Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-04-08 11:38:53 +01:00
stevenhorsman	a0c03966c2	versions: Bump guest-components - Bump guest-components to try and test compatibility with the latest version Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-04-08 11:38:53 +01:00
stevenhorsman	101a5bf273	packaging: Update guest-components Dockerfile - Switch to Ubuntu 20.04 for building guest-components as the rootfs is based on 20.04, so we need matching GLIBC versions. See #8955 - Add dependencies needed by TDX verifier as we want to build for all platforms Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-04-08 11:38:53 +01:00
Gabriela Cervantes	6d85025e59	test/k8s: Add basic attestation test - Add basic test case to check that a running pod can use the api-server-rest (and attestation-agent and confidential-data-hub indirectly) to get a resource from a remote KBS Fixes #9057 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com> Co-authored-by: Linda Yu <linda.yu@intel.com> Co-authored-by: stevenhorsman <steven@uk.ibm.com> Co-authored-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-04-08 11:38:53 +01:00
Biao Lu	f0edec84f6	agent: Launch api-server-rest If 'rest_api' is configured, let's start the api-server-rest after the attestation-agent and the confidential-data-hub have been started. Fixes: #7555 Signed-off-by: Biao Lu <biao.lu@intel.com> Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Signed-off-by: Linda Yu <linda.yu@intel.com> Co-authored-by: stevenhorsman <steven@uk.ibm.com> Co-authored-by: Jakob Naucke <jakob.naucke@ibm.com> Co-authored-by: Wang, Arron <arron.wang@intel.com> Co-authored-by: zhouliang121 <liang.a.zhou@linux.alibaba.com> Co-authored-by: Alex Carter <alex.carter@ibm.com> Co-authored-by: Suraj Deshmukh <suraj.deshmukh@microsoft.com> Co-authored-by: Xynnn007 <xynnn@linux.alibaba.com>	2024-04-08 11:38:53 +01:00
Biao lu	4d752e6350	agent: Add config for api-server-rest Add configuration for 'rest api server'. Optional configurations are 'agent.rest_api=attestation' will enable attestation api 'agent.rest_api=resource' will enable resource api 'agent.rest_api=all' will enable all (attestation and resource) api Fixes: #7555 Signed-off-by: Biao Lu <biao.lu@intel.com> Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Signed-off-by: Linda Yu <linda.yu@intel.com> Co-authored-by: stevenhorsman <steven@uk.ibm.com> Co-authored-by: Jakob Naucke <jakob.naucke@ibm.com> Co-authored-by: Wang, Arron <arron.wang@intel.com> Co-authored-by: zhouliang121 <liang.a.zhou@linux.alibaba.com> Co-authored-by: Alex Carter <alex.carter@ibm.com> Co-authored-by: Suraj Deshmukh <suraj.deshmukh@microsoft.com> Co-authored-by: Xynnn007 <xynnn@linux.alibaba.com>	2024-04-08 11:06:14 +01:00
Biao Lu	f476d671ed	agent: Launch the confidential data hub Let's introduce a new method to start the confidential data hub and the attestation agent. The former depends on the latter, and it needs to be started before the RPC server. Starting the attestation components is based on whether the confidential containers guest components binaries are found in the rootfs. Fixes: #7544 Signed-off-by: Biao Lu <biao.lu@intel.com> Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Signed-off-by: Linda Yu <linda.yu@intel.com> Co-authored-by: stevenhorsman <steven@uk.ibm.com> Co-authored-by: Jakob Naucke <jakob.naucke@ibm.com> Co-authored-by: Wang, Arron <arron.wang@intel.com> Co-authored-by: zhouliang121 <liang.a.zhou@linux.alibaba.com> Co-authored-by: Alex Carter <alex.carter@ibm.com> Co-authored-by: Suraj Deshmukh <suraj.deshmukh@microsoft.com> Co-authored-by: Xynnn007 <xynnn@linux.alibaba.com>	2024-04-08 11:06:14 +01:00
Greg Kurz	be8f0cb520	Merge pull request #9402 from deagon/feat/debug-threads qemu: show the thread name when enable the hypervisor.debug option	2024-04-08 11:04:36 +02:00
Hyounggyu Choi	e39be7a45e	Merge pull request #9415 from BbolroC/fix-dir-removal-error GHA: Implement secondary GITHUB_WORKSPACE cleanup on 1st failure	2024-04-08 10:44:44 +02:00
ChengyuZhu6	8c897f822c	agent:image: Support different pause image in the guest for guest pull Support different pause images in the guest for guest-pull, such as k8s pause image (registry.k8s.io/pause) and openshift pause image (quay.io/bpradipt/okd-pause). Fixes: #9225 -- part III Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-04-07 09:00:10 +08:00
GabyCT	9d2c5b180e	Merge pull request #9419 from GabyCT/topic/fxlatency metrics: Improve latency test cleanup	2024-04-05 16:31:00 -06:00
Wainer Moschetta	aae7048d4f	Merge pull request #9273 from ldoktor/kcli-coco-kbs tests: Support for kbs setup on kcli	2024-04-05 18:55:58 -03:00
Fabiano Fidêncio	f09bb98f51	Merge pull request #8840 from fidencio/topic/update-tdx-artefacts-to-the-new-host-os tdx: Update TDX artefacts to be used with the Ubuntu 23.10 / CentOS 9 stream OSVs.	2024-04-05 22:36:03 +02:00
Fabiano Fidêncio	cdb8531302	hypervisor: Simplify TDX protection detection Let's rely on the kvm module 'tdx' parameter to do so. This aligns with both OSVs (Canonical, Red Hat, SUSE) and the TDX adoption (https://github.com/intel/tdx-linux) stacks. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-04-05 19:51:27 +02:00
Fabiano Fidêncio	2ee03b5dc3	tdvf: Adapt the build command This is done in order to match the example from: https://github.com/intel/tdx-linux/wiki/Instruction-to-set-up-TDX-host-and-guest#build-tdvf-image Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-04-05 19:51:27 +02:00
Fabiano Fidêncio	b7cccfa019	qemu: tdx: Adapt command line This commit is a mess, but I'm not exactly sure what's the best way to make it less messy, as we're getting QEMU TDX to work while partially reverting `1e34220c41`. With that said, let me cover the content of this commit. Firstly, we're reverting all the changes related to "memory-backend-memfd-private", as that's what was used with the previous host stack, but it seems it didn't fly upstream. Secondly, in order to get QEMU to properly work with TDX, we need to enforce the 'private=on' knob and use the "memory-backend-ram", and we're doing so, and also making sure to test the `private=on` newly added knob. I'm sorry for the confusion, I understand this is not optimal, I just don't see an easy path to do changes without leaving the code broken during those changes. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-04-05 19:51:27 +02:00
Greg Kurz	424a5e243f	gha: Bump to `actions/[down\|up]load-artifact@v4` (all the rest) `Node.js 19` is deprecated. Bump to a new version based on `Node.js 20`. This fixes all remaining sites. Fixes #9245 Signed-off-by: Greg Kurz <groug@kaod.org>	2024-04-05 18:36:51 +02:00
Greg Kurz	dbc5dc7806	gha: Bump to `actions/[down\|up]load-artifact@v4` (k8s tests on garm) `Node.js 19` is deprecated. Bump to a new version based on `Node.js 20`. As explained at [1] : > The contents of an Artifact are uploaded together into an immutable > archive. They cannot be altered by subsequent jobs. Both of these > factors help reduce the possibility of accidentally corrupting > Artifact files. This means that artifacts cannot have the same name. Adapt the `run-k8s-tests-on-garm` workflow accordingly by embedding all the other `${{ vmm.* }}` fields and `${{ inputs.tag }}` in the artifact names that would otherwise collide. Fixes #9245 Signed-off-by: Greg Kurz <groug@kaod.org>	2024-04-05 18:36:51 +02:00
Greg Kurz	62a54ffa70	gha: Bump to `actions/[down\|up]load-artifact@v4` (kata static tarball) `Node.js 19` is deprecated. Bump to a new version based on `Node.js 20`. As explained at [1] : > The contents of an Artifact are uploaded together into an immutable > archive. They cannot be altered by subsequent jobs. Both of these > factors help reduce the possibility of accidentally corrupting > Artifact files. This means that artifacts cannot have the same name. Adapt all `build-kata-static-tarball` workflows accordingly by embedding `${{ matrix.asset }}` in the artifact names that would otherwise collide. Fixes #9245 Signed-off-by: Greg Kurz <groug@kaod.org>	2024-04-05 18:36:51 +02:00
Greg Kurz	7f2ce914a1	gha: Bump to `actions/checkout@v4` `Node.js 19` is deprecated. Bump to a new version based on `Node.js 20`. Fixes #9245 Signed-off-by: Greg Kurz <groug@kaod.org>	2024-04-05 18:36:50 +02:00
Greg Kurz	0a43d26c94	gha: Bump to `docker/login-action@v3` `Node.js 19` is deprecated. Bump to a new version based on `Node.js 20`. Fixes #9245 Signed-off-by: Greg Kurz <groug@kaod.org>	2024-04-05 18:36:50 +02:00
Greg Kurz	06c9c0d7db	gha: Bump to `docker/build-push-action@v5` `Node.js 19` is deprecated. Bump to a new version based on `Node.js 20`. Fixes #9245 Signed-off-by: Greg Kurz <groug@kaod.org>	2024-04-05 18:36:50 +02:00
Greg Kurz	8c21844aef	gha: Bump to `docker/setup-buildx-action@v3` `Node.js 19` is deprecated. Bump to a new version based on `Node.js 20`. Fixes #9245 Signed-off-by: Greg Kurz <groug@kaod.org>	2024-04-05 18:36:50 +02:00
Greg Kurz	03cbe6a011	gha: Bump to `docker/setup-qemu-action@v3` `Node.js 19` is deprecated. Bump to a new version based on `Node.js 20`. Fixes #9245 Signed-off-by: Greg Kurz <groug@kaod.org>	2024-04-05 18:36:50 +02:00
Hyounggyu Choi	4493459937	GHA: Implement secondary GITHUB_WORKSPACE cleanup on 1st failure Occasionally, the removal of GITHUB_WORKSPACE fails for self-hosted runners because one of the subdirectories is not empty. This is likely due to another process occupying the directory at the time. Implementing a secondary cleanup resolves this issue. This commit focuses on the implementation for the secondary cleanup. Fixes: #9317 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-04-05 11:41:51 +02:00
Fabiano Fidêncio	6b4cc5ea6a	Revert "qemu: tdx: Workaround SMP issue with TDX 1.5" This reverts commit `d1b54ede29`. Conflicts: src/runtime/virtcontainers/qemu.go This commit was a hack that was needed in order to get QEMU + TDX to work atop of the stack our CI was running on. As we're moving to "the officially supported by distros" host OS, we need to get rid of this. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-04-05 10:23:52 +02:00
Fabiano Fidêncio	582b5b6b19	govmm: tdx: Expose the private=on\|off knob The private=on\|off knob is required in order to properly launch a TDX guest VM. This is a brand new property that is part of the still in-flight patches adding TDX support on QEMU. Please, see: `3fdd8072da` Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-04-05 10:23:52 +02:00
Fabiano Fidêncio	fe5adae5d9	qemu-tdx: Update to v8.1.0 + TDX patches Let's update the QEMU to the one that's officially maintained by Intel till all the TDX patches make their way upstream. We've had to also update python to explicitly use python3 and add python3-venv as part of the dependencies. Fixes: #8810 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-04-05 10:23:51 +02:00
Alex Lyn	0e0a361f0e	Merge pull request #8782 from Apokleos/device-increate-count bugfix and refactor device increate count	2024-04-05 13:43:49 +08:00
Dan Mihai	6f9f8ae285	Merge pull request #9413 from microsoft/saulparedes/ensure_unique_rg_in_gha gha: ensure unique resource group name	2024-04-04 17:13:09 -07:00
GabyCT	80d926c357	Merge pull request #9411 from microsoft/danmihai1/k8s-job tests: k8s-job: wait for job successful create	2024-04-04 15:14:56 -06:00
Gabriela Cervantes	8e5d401be0	metrics: Improve latency test cleanup This PR improves the latency test cleanup in order to avoid random failures of leaving the pods. Fixes #9418 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-04-04 20:43:53 +00:00
Saul Paredes	f20caac1c0	gha: ensure unique resource group name There's an rg name duplication situation that got introduced by #9385 where 2 different test runs might have same rg name. Add back uniqueness by including the first letter of GENPOLICY_PULL_METHOD to cluster name. Fixes: #9412 Signed-off-by: Saul Paredes <saulparedes@microsoft.com>	2024-04-04 13:13:32 -07:00
GabyCT	aae2679f09	Merge pull request #9409 from GabyCT/topic/ghrunset gha: Define GH_PR_NUMBER variable in gha run k8s common script	2024-04-04 09:46:48 -06:00
Eric Ernst	da01bccd36	katautils: check number of cores on the system instead of go runtime We used to utilize the Go runtime's "NumCPU()", which gives the number of cores available to the Go runtime, which may be a subset of the physical cores if the shim is started from within a cpuset. From the function's description: "NumCPU returns the number of logical CPUs usable by the current process." As an example, if containerd is run from within a smaller CPUset, the maximum size of a pod will be dictated by this CPUset, instead of what will be available on the rest of the system. Since the shim will be moved into its own cgroup that may have a different CPUset, let's stick with checking physical cores. This also aligns with what we have documented for maxVCPU handling. In the event we fail to read /proc/cpuinfo, let's fall back to the Go runtime. Fixes: #9327 Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2024-04-03 16:09:16 -07:00
Dan Mihai	3e72b3f360	tests: k8s-job: wait for job successful create Don't just verify SuccessfulCreate - wait for it if needed. Fixes: #9138 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-04-03 22:11:15 +00:00
Gabriela Cervantes	73f27e28d1	gha: Define GH_PR_NUMBER variable in gha run k8s common script This PR defines the GH_PR_NUMBER variable in gha run k8s common script to avoid failures like unbound variable when running locally the scripts just like the GHA CI. Fixes #9408 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-04-03 18:25:00 +00:00
GabyCT	c5c229b330	Merge pull request #9397 from GabyCT/topic/removeconmon versions: Remove conmon information from versions.yaml	2024-04-03 11:14:43 -06:00
GabyCT	12947b1ba6	Merge pull request #9344 from GabyCT/topic/kerneldoc docs: Remove stale kernel information	2024-04-03 11:13:54 -06:00
Dan Mihai	07c23a05f2	Merge pull request #9385 from microsoft/saulparedes/add_genpolicy_yaml_params gha: add GENPOLICY_PULL_METHOD	2024-04-03 09:20:16 -07:00
Lukáš Doktor	b8382cea88	ci.ocp: Increase the MCP update time updating the machine config takes even longer than 1200s, use 60m to be sure everything is updated. Fixes: #9338 Signed-off-by: Lukáš Doktor <ldoktor@redhat.com>	2024-04-03 15:01:29 +02:00
Alex Lyn	935a1a3b40	runtime-rs: refactor decrease_attach_count with do_decrease_count Try to reduce duplicated code in decrease_attach_count with public new function do_decrease_count. Fixes: #8738 Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2024-04-03 17:19:19 +08:00
Alex Lyn	4f0fab938d	runtime-rs: refactor increase_attach_count with do_increase_count Try to reduce duplicated code in increase_attach_count with public new function do_increase_count. Fixes: #8738 Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2024-04-03 17:19:19 +08:00
Alex Lyn	fff64f1c3e	runtime-rs: introduce dedicated function do_decrease_count Introduce a dedicated public function do_decrease_count to reduce duplicated code in drivers' decrease_attach_count. Fixes: #8738 Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2024-04-03 17:19:08 +08:00
Alex Lyn	5750faaf31	runtime-rs: introduce dedicated function do_increase_count Since there are many implementations of reference counting in the drivers, all of which have the same implementation, we should try to reduce such duplicated code as much as possible. Therefore, a new function is introduced to solve the problem of duplicated code. Fixes: #8738 Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2024-04-03 17:09:17 +08:00
Dan Mihai	f800bd86f6	tests: k8s-sandbox-vcpus-allocation.bats policy Use the "allow all" policy for k8s-sandbox-vcpus-allocation.bats, instead of relying on the Kata Guest image to use the same policy as its default. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-04-03 03:01:33 +00:00
Dan Mihai	4211d93b87	tests: k8s-nginx-connectivity.bats policy Use the "allow all" policy for k8s-nginx-connectivity.bats, instead of relying on the Kata Guest image to use the same policy as its default. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-04-03 03:01:26 +00:00
Dan Mihai	5dcf64ef34	tests: k8s-volume.bats allow all policy Use the "allow all" policy for k8s-volume.bats, instead of relying on the Kata Guest image to use the same policy as its default. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-04-03 03:01:18 +00:00
Dan Mihai	04085d8442	tests: k8s-sysctls.bats allow all policy Use the "allow all" policy for k8s-sysctls.bats, instead of relying on the Kata Guest image to use the same policy as its default. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-04-03 03:01:10 +00:00
Dan Mihai	839993f245	tests: k8s-security-context.bats allow all policy Use the "allow all" policy for k8s-security-context.bats, instead of relying on the Kata Guest image to use the same policy as its default. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-04-03 03:01:03 +00:00
Dan Mihai	02a050b47e	tests: k8s-seccomp.bats allow all policy Use the "allow all" policy for k8s-seccomp.bats, instead of relying on the Kata Guest image to use the same policy as its default. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-04-03 03:00:56 +00:00
Dan Mihai	543e40b80c	tests: k8s-projected-volume.bats allow all policy Use the "allow all" policy for k8s-projected-volume.bats, instead of relying on the Kata Guest image to use the same policy as its default. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-04-03 03:00:47 +00:00
Dan Mihai	3f94e2ee1b	tests: k8s-pod-quota.bats allow all policy Use the "allow all" policy for k8s-pod-quota.bats, instead of relying on the Kata Guest image to use the same policy as its default. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-04-03 03:00:37 +00:00
Dan Mihai	ba23758a42	tests: k8s-optional-empty-secret.bats policy Use the "allow all" policy for k8s-optional-empty-secret.bats, instead of relying on the Kata Guest image to use the same policy as its default. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-04-03 03:00:30 +00:00
Dan Mihai	e4ff6b1d91	tests: k8s-measured-rootfs.bats allow all policy Use the "allow all" policy for k8s-measured-rootfs.bats, instead of relying on the Kata Guest image to use the same policy as its default. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-04-03 03:00:23 +00:00
Dan Mihai	2821326a7e	tests: k8s-liveness-probes.bats allow all policy Use the "allow all" policy for k8s-liveness-probes.bats, instead of relying on the Kata Guest image to use the same policy as its default. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-04-03 03:00:15 +00:00
Dan Mihai	9af3e4cc4a	tests: k8s-inotify.bats allow all policy Use the "allow all" policy for k8s-inotify.bats, instead of relying on the Kata Guest image to use the same policy as its default. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-04-03 03:00:08 +00:00
Dan Mihai	bd45e948cc	tests: k8s-guest-pull-image.bats policy Use the "allow all" policy for k8s-guest-pull-image.bats, instead of relying on the Kata Guest image to use the same policy as its default. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-04-03 03:00:00 +00:00
Dan Mihai	be3797ef7c	tests: k8s-footloose.bats allow all policy Use the "allow all" policy for k8s-footloose.bats, instead of relying on the Kata Guest image to use the same policy as its default. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-04-03 02:59:50 +00:00
Dan Mihai	18f5e55667	tests: k8s-empty-dirs.bats allow all policy Use the "allow all" policy for k8s-empty-dirs.bats, instead of relying on the Kata Guest image to use the same policy as its default. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-04-03 02:59:44 +00:00
Dan Mihai	ef22bd8a2b	tests: k8s: replace run_policy_specific_tests Check from: - k8s-exec-rejected.bats - k8s-policy-set-keys.bats if policy testing is enabled or not, to reduce the complexity of run_kubernetes_tests.sh. After these changes, there are no policy specific commands left in run_kubernetes_tests.sh. add_allow_all_policy_to_yaml() is moving out of run_kubernetes_tests.sh too, but it is not used yet. It will be used in future commits. Fixes: #9395 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-04-03 02:59:28 +00:00
Guoqiang Ding	cd0c31e185	qemu: show the thread name when enable the hypervisor.debug option Add debug-threads=on in the name argument if debug enabled. Fixes: #9400 Signed-off-by: Guoqiang Ding <dgq8211@gmail.com>	2024-04-03 10:36:52 +08:00
Saul Paredes	8a92e81f98	gha: add GENPOLICY_PULL_METHOD Add GENPOLICY_PULL_METHOD that will be used to test pulling container images in genpolicy using the oci-distribution crate and/or the containerd interface. GENPOLICY_PULL_METHOD will start being used in a future PR. Fixes: #9384 Signed-off-by: Saul Paredes <saulparedes@microsoft.com>	2024-04-02 19:03:28 -07:00
Gabriela Cervantes	f3957352f0	versions: Remove conmon information from versions.yaml This PR removes conmon information from versions.yaml as this is no longer being used in the kata containers repository. Fixes #9396 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-04-02 16:25:45 +00:00
Dan Mihai	39805822fc	tests: k8s: reduce policy testing complexity Don't add the "allow all" policy to all the test YAML files anymore. After this change, the k8s tests assume that all the Kata CI Guest rootfs image files either: - Don't support Agent Policy at all, or - Include an "allow all" default policy. This reliance/assumption will be addressed in a future commit. Fixes: #9395 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-04-02 16:18:31 +00:00
Alex Lyn	7795f9c016	Merge pull request #9365 from GabyCT/topic/removerunc versions: Remove runc version information	2024-04-02 09:21:56 +08:00
Alex Lyn	fa8049af6c	Merge pull request #9383 from Apokleos/unified-cgrp-cmdline kata-agent: enabling cgroups-v2 by systemd.unified_cgroup_hierarchy	2024-04-02 09:08:04 +08:00
Alex Lyn	07bfdf4a22	Merge pull request #9275 from Apokleos/swap-hooks-bindmnt kata-agent: Change order of guest hook and bind mount processing	2024-04-02 07:40:10 +08:00
Alex Lyn	c88014834b	kata-agent: enabling cgroups-v2 by systemd.unified_cgroup_hierarchy To have systemd mount cgroups-v2 by default during system boot, we must add the systemd.unified_cgroup_hierarchy=1 parameter to the kernel cmdline, which is passed via kernel_params in configuration.toml. To enable cgroup-v2, just add systemd.unified_cgroup_hierarchy=true[1] to kernel_params. Fixes: #9336 Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2024-04-01 18:45:12 +08:00
alex.lyn	548f252bc4	runtime-rs: bugfix incorrect use of refcount before vfio attach When a pod has multiple containers, there may be cases with more than 2 attach points; in that case we should not return Err when doing attach ops, but just return Ok. Fixes: #8738 Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2024-04-01 11:28:57 +08:00
Alex Lyn	aa9cd232cd	Merge pull request #9358 from GabyCT/topic/nerdrandom gha: Update journal log names for nerdctl artifacts	2024-04-01 09:50:16 +08:00
Alex Lyn	dfa8832406	Merge pull request #9345 from c3d/bug/9342-agent-test-errors agent: Fix errors in `make check`	2024-04-01 09:48:44 +08:00
Dan Mihai	3a7dbcfc17	Merge pull request #9367 from microsoft/danmihai1/infinite-io-stream-copy-loop runtime: remove stream copy infinite loop	2024-03-29 09:37:44 -07:00
Dan Mihai	600f9266f3	runtime: remove stream copy infinite loop This reverts commit `1c5693be86`. Avoid apparent infinite loop when ReadStreamRequest is blocked by policy - for some of the pods. When running the k8s-limit-range.bats test with Policy enabled, the Shim + VMM never get terminated on my cluster. Not sure why the sandbox clean-up works better for other tests, but the k8s-limit-range test pod gets stuck in an infinite loop: stdout io stream copy error happens: error = %wrpc error: code = PermissionDenied desc = \"ReadStreamRequest is blocked by policy ... policy check: ReadStreamRequest ... stdout io stream copy error happens: error = %wrpc error: code = PermissionDenied desc = \"ReadStreamRequest is blocked by policy ... policy check: ReadStreamRequest ... Fixes: #9380 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-03-28 22:43:28 +00:00
James O. D. Hunt	13966f4d1d	docs: kata-manager: Add help for permissions issue The 3.3.0 release installs the `kata-manager` script with overly restrictive permissions (see #9373), so add details to help users handle the situation. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2024-03-28 16:22:10 +00:00
James O. D. Hunt	5589e4e291	docs: kata-manager: Update with latest details Now that v3.3.0 has been released, simplify the `kata-manager` documentation. Fixes: #9227. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2024-03-28 16:22:10 +00:00
James O. D. Hunt	52fe60c94b	docs: kata-manager: Fix heading levels Add an extra heading indent so that there is only a single top-level heading. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2024-03-28 16:21:31 +00:00
Dan Mihai	ebb26edf42	Merge pull request #9347 from microsoft/danmihai1/reduce-exec-test-policy-prints genpolicy: reduce policy debug prints	2024-03-27 15:12:10 -07:00
Gabriela Cervantes	a32418bf32	versions: Remove runc version information This PR removes the runc version information as this is no longer being used in the kata containers scripts. Fixes #9364 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-03-27 20:32:38 +00:00
Steve Horsman	b3acbe0b7f	Merge pull request #8046 from fitzthum/clean-config runtime: remove unimplemented CoCo configurations	2024-03-27 19:39:48 +00:00
Tobin Feldman-Fitzthum	04d021bd12	packaging: remove SERVICEOFFLOAD option Since we're removing the unused service_offload parameter, don't set it in any of the packaging scripts. Signed-off-by: Tobin Feldman-Fitzthum <tobin@ibm.com>	2024-03-27 12:21:13 -05:00
Tobin Feldman-Fitzthum	9856fe5bea	runtime: remove ServiceOffload parameter Since we no longer use the service_offload configuration, remove the ServiceOffload field from the image struct. Signed-off-by: Tobin Feldman-Fitzthum <tobin@ibm.com>	2024-03-27 12:21:13 -05:00
Tobin Feldman-Fitzthum	a18c7ca307	runtime: remove unimplemented CoCo configurations These experimental options were added 2 years ago in anticipation of features that would be added in CoCo. These do not match the features that were eventually added and will soon be ported to main. Fixes: #8047 Signed-off-by: Tobin Feldman-Fitzthum <tobin@ibm.com>	2024-03-27 12:21:06 -05:00
Steve Horsman	53fa1fd82d	Merge pull request #9349 from fidencio/topic/ci-k8s-update-cpuid k8s: confidential: Update cpuid to its latest release	2024-03-27 16:57:36 +00:00
Chengyu Zhu	e66a5cb54d	Merge pull request #9332 from ChengyuZhu6/guest-pull-timeout Support to set timeout to pull large image in guest	2024-03-28 00:34:08 +08:00
Christophe de Dinechin	82c4079fd0	agent: Remove useless loop This is the report from `make check`: ``` error: this loop never actually loops --> src/signal.rs:147:9 \| 147 \| / loop { 148 \| \| select! { 149 \| \| _ = handle => { 150 \| \| println!("INFO: task completed"); ... \| 156 \| \| } 157 \| \| } \| \|_________^ \| = help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#never_loop = note: `#[deny(clippy::never_loop)]` on by default ``` There is only one option: you get something or a timeout. You never retry, so the report is correct. Fixes: #9342 Signed-off-by: Christophe de Dinechin <dinechin@redhat.com>	2024-03-27 17:03:44 +01:00
Christophe de Dinechin	df5c88cdf0	agent: Remove lint error about `.flatten` running forever The lint report is the following: ``` error: `flatten()` will run forever if the iterator repeatedly produces an `Err` --> src/rpc.rs:1754:10 \| 1754 \| .flatten() \| ^^^^^^^^^ help: replace with: `map_while(Result::ok)` \| note: this expression returning a `std::io::Lines` may produce an infinite number of `Err` in case of a read error --> src/rpc.rs:1752:5 \| 1752 \| / reader 1753 \| \| .lines() \| \|________________^ = help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#lines_filter_map_ok = note: `-D clippy::lines-filter-map-ok` implied by `-D warnings` = help: to override `-D warnings` add `#[allow(clippy::lines_filter_map_ok)]` ``` This commit simply applies the suggestion. Fixes: #9342 Signed-off-by: Christophe de Dinechin <dinechin@redhat.com>	2024-03-27 17:03:44 +01:00
Christophe de Dinechin	bfb55312be	agent: Fix `.enumerate` errors during `make check` Running `make check` in the `src/agent` directory gives: ``` error: you seem to use `.enumerate()` and immediately discard the index --> rustjail/src/mount.rs:572:27 \| 572 \| for (_index, line) in reader.lines().enumerate() { \| ^^^^^^^^^^^^^^^^^^^^^^^^^^ \| = help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#unused_enumerate_index = note: `-D clippy::unused-enumerate-index` implied by `-D warnings` = help: to override `-D warnings` add `#[allow(clippy::unused_enumerate_index)]` help: remove the `.enumerate()` call \| 572 \| for line in reader.lines() { \| ~~~~ ~~~~~~~~~~~~~~ Checking tokio-native-tls v0.3.1 Checking hyper-tls v0.5.0 Checking reqwest v0.11.18 error: could not compile `rustjail` (lib) due to 1 previous error warning: build failed, waiting for other jobs to finish... make: *** [../../utils.mk:177: standard_rust_check] Error 101 ``` Fixes: #9342 Signed-off-by: Christophe de Dinechin <dinechin@redhat.com>	2024-03-27 17:03:44 +01:00
Greg Kurz	e1068da1a0	Merge pull request #9326 from gkurz/draft-release Only tag and publish the release when it is fully ready	2024-03-27 15:59:59 +01:00
ChengyuZhu6	c50d3ebacc	tests:k8s: Add a test to pull large images in the guest Add a test to pull large images in the guest. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-03-27 21:58:44 +08:00
ChengyuZhu6	8551ee9533	how-to: add createcontainer timeout to sandbox config documentation add createcontainer timeout annotation to sandbox config documentation. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-03-27 21:58:44 +08:00
ChengyuZhu6	c2dc13ebaa	runtime: support to configure CreateContainer Timeout in configurations support to configure CreateContainerRequestTimeout in the configurations. e.g.: [runtime] ... create_container_timeout = 300 Note: The effective timeout is determined by the lesser of two values: runtime-request-timeout from kubelet config (https://kubernetes.io/docs/reference/command-line-tools-reference/kubelet/#:~:text=runtime%2Drequest%2Dtimeout) and create_container_timeout. In essence, the timeout used for guest pull=runtime-request-timeout<create_container_timeout?runtime-request-timeout:create_container_timeout. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-03-27 21:58:41 +08:00
Chengyu Zhu	87fc17d4d2	Merge pull request #9341 from ChengyuZhu6/guest-pull-doc docs: Add documents for kata guest image management	2024-03-27 21:20:22 +08:00
ChengyuZhu6	95b2f7f129	how-to: Add a document for kata guest image management usage Add a document for kata guest image management usage. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-03-27 20:09:37 +08:00
Greg Kurz	693c9487d4	docs: Adjust release documentation Most of the content of `docs/Stable-Branch-Strategy.md` got de-facto deprecated by the re-design of the release process described in #9064. Remove this file and all its references in the repo. The `## Versioning` section has some useful information though. It is moved to `docs/Release-Process.md`. The documentation of the `PATCH` field is adapted according to new workflow. Fixes #9064 - part VI Signed-off-by: Greg Kurz <groug@kaod.org>	2024-03-27 12:41:48 +01:00
Steve Horsman	45aba769c0	Merge pull request #9346 from cmaf/ci-remove-repo-docs Remove additional links to tests directory	2024-03-27 11:13:32 +00:00
Steve Horsman	a1a615a7c8	Merge pull request #9356 from stevenhorsman/agent-opa-ppc64le-s390x workflows: Build agent-opa for more archs	2024-03-27 08:53:28 +00:00
ChengyuZhu6	2224f6d63f	runtime: support to configure CreateContainer timeout in annotation Support to configure CreateContainerRequestTimeout in the annotations. e.g.: annotations: "io.katacontainers.config.runtime.create_container_timeout": "300" Note: The effective timeout is determined by the lesser of two values: runtime-request-timeout from kubelet config (https://kubernetes.io/docs/reference/command-line-tools-reference/kubelet/#:~:text=runtime%2Drequest%2Dtimeout) and create_container_timeout. In essence, the timeout used for guest pull=runtime-request-timeout<create_container_timeout?runtime-request-timeout:create_container_timeout. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-03-27 15:44:29 +08:00
ChengyuZhu6	39bd462431	runtime: support to set timeout for CreateContainerRequest In the situation to pull images in the guest #8484, it’s important to account for pulling large images. Presently, the image pull process in the guest hinges on `CreateContainerRequest`, which defaults to a 60-second timeout. However, this duration may prove insufficient for pulling larger images, such as those containing AI models. Consequently, we must devise a method to extend the timeout period for large image pull. Fixes: #8141 Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-03-27 15:44:29 +08:00
Gabriela Cervantes	a997e282be	gha: Update journal log names for nerdctl artifacts This PR updates the journal log name for nerdctl artifacts to make sure that we have different names in case we add a parallel GHA job. Fixes #9357 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-03-26 20:03:54 +00:00
GabyCT	c163d9f114	Merge pull request #9329 from GabyCT/topic/seun scripts: Fix unbound variables in k8s setup script	2024-03-26 11:19:33 -06:00
stevenhorsman	9aa675abb9	workflows: Build agent-opa for more archs Since https://github.com/kata-containers/kata-containers/pull/7769, we support building the OPA binary into the ppc64le and s390x arch versions of the rootfs, so build the policy enabled agent to match for those architectures too. Fixes: #9355 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-03-26 17:02:14 +00:00
Lukáš Doktor	a671b3fc6e	tests: Use full svc address to check kbs service the service might not listen on the default port, use the full service address to ensure we are talking to the right resource. Signed-off-by: Lukáš Doktor <ldoktor@redhat.com>	2024-03-26 16:59:02 +01:00
Lukáš Doktor	6b0eaca4d4	tests: Add support for nodeport ingress for the kbs setup this can be used on kcli or other systems where cluster nodes are accessible from all places where the tests are running. Fixes: #9272 Signed-off-by: Lukáš Doktor <ldoktor@redhat.com>	2024-03-26 16:59:00 +01:00
Greg Kurz	5009fabde4	release: Keep it draft until all artifacts have been published The automated release workflow starts with the creation of the release in GitHub. This is followed by the build and upload of the various artifacts, which can be very long (like hours). During this period, the release appears to be fully available in https://github.com/kata-containers/kata-containers/ even though it lacks all the artifacts. This might be confusing for users or automation consuming the release. Create the release as draft and clear the draft flag when all jobs are done. This ensures that the release will only be tagged and made public when it is fully usable. If some job fails because of network timeout or any other transient error, the correct action is to restart the failed jobs until they eventually all succeed. This is by far the quicker path to complete the release process. If the workflow is canceled for some reason, the draft release is left behind. A new run of the workflow will create a brand new draft release with the same name (not an issue with GitHub). The draft release from the previous run should be manually deleted. This step won't be automated as it looks safer to leave the decision to a human. [1] https://github.com/kata-containers/kata-containers/releases Fixes #9064 - part VI Signed-off-by: Greg Kurz <groug@kaod.org>	2024-03-26 14:48:05 +01:00
Pavel Mores	4c72b02e53	runtime-rs: remove the now-unused code of NetDevice The remaining code in network.rs was mostly moved to utils.rs which seems better home for these utility functions anyway (and a closely related function open_named_tuntap() has already lived there). ToString implementation for Address was removed after some consideration. Address should probably ideally implement Display (as per RFC 565) which would also supply a ToString implementation, however it implements Debug instead, probably to enable automatic implementation of Debug for anything that Address is a member of, if for no other reason. Rather than having two identical functions this commit simply switches to using the Debug implementation for printing Address on qemu command line. Fixes #9352 Signed-off-by: Pavel Mores <pmores@redhat.com>	2024-03-26 12:52:40 +01:00
Pavel Mores	c94e55d45a	runtime-rs: make QemuCmdLine own vsock file descriptor Make file descriptors to be passed to qemu owned by QemuCmdLine. See commit 52958f17cd for more explanation. Signed-off-by: Pavel Mores <pmores@redhat.com>	2024-03-26 12:50:41 +01:00
Pavel Mores	0cf0e923fc	runtime-rs: refactor QemuCmdLine::add_network_device() signature add_network_device() doesn't need to be passed NetworkInfo since it already has access to the full HypervisorConfig. Also, one of the goals of QemuCmdLine interface's design is to avoid coupling between QemuCmdLine and the hypervisor crate's device module, if at all possible. That's why add_network_device() shouldn't take device module's NetworkConfig but just parts that are useful in add_network_device()'s implementation. Signed-off-by: Pavel Mores <pmores@redhat.com>	2024-03-26 12:50:41 +01:00
Pavel Mores	a4f033f864	runtime-rs: add should_disable_modern() utility function is_running_in_vm() is enough to figure out whether to disable_modern but it's clumsy and verbose to use. should_disable_modern() streamlines the usage by encapsulating the verbosity. Signed-off-by: Pavel Mores <pmores@redhat.com>	2024-03-26 12:50:41 +01:00
Pavel Mores	12e40ede97	runtime-rs: reimplement add_network_device() using Netdev & DeviceVirtioNet This commit replaces the existing NetDevice-based implementation with one using Netdev and DeviceVirtioNet. Signed-off-by: Pavel Mores <pmores@redhat.com>	2024-03-26 12:50:41 +01:00
Pavel Mores	0a57e2bb32	runtime-rs: refactor NetDevice in qemu driver In keeping with architecture of QemuCmdLine implementation we split the functionality into two objects: Netdev to represent and generate the -netdev part and DeviceVirtioNet for the -device virtio-net-<transport> part. This change is a pure refactor, existing functionality does not change. However, we do remove some stub generalizations and govmm-isms, notably: - we remove the NetDev enum since the only network interface types that kata seems to use with qemu are tuntap and macvtap, both of which are implemented by the same -netdev tap - enum DeviceDriver is also left out since it doesn't seem reasonable to try to represent VFIO NICs (which are completely different from virtio-net ones) with the same struct as virtio-net - we also remove VirtioTransport because there's no use for it so far, but with the expectation that it will be added soon. We also make struct Netdev the owner of any vhost-net and queue file descriptors so that their lifetime is tied ultimately to the lifetime of QemuCmdLine automatically, instead of returning the fds to the caller and forcing it to achieve the equivalent functionality but manually. Signed-off-by: Pavel Mores <pmores@redhat.com>	2024-03-26 12:50:41 +01:00
Pavel Mores	7f23734172	runtime-rs: reduce generate_netdev_fds() dependencies generate_netdev_fds() takes NetworkConfig from which it however only needs a host-side network device name. This commit makes it take the device name directly, making the function useful to callers who don't have the whole NetworkConfig but do have the requisite device name. Signed-off-by: Pavel Mores <pmores@redhat.com>	2024-03-26 12:50:41 +01:00
Pavel Mores	d4ac45d840	runtime-rs: refactor clear_fd_flags() The idea of this function is to make sure O_CLOEXEC is not set on file descriptors that should be inherited by a child (=hypervisor) process. The approach so far is however rather heavy-handed - clearing all flags is unjustifiably aggressive for a low-level function with no knowledge of context whatsoever. This commit refactors the function so that it only does what's expected and renames it accordingly. It also clarifies some of its call sites. Signed-off-by: Pavel Mores <pmores@redhat.com>	2024-03-26 12:50:14 +01:00
Fabiano Fidêncio	cfe75f9422	k8s: confidential: Update cpuid to its latest release Since v2.2.6 it can detect TDX guests on Azure, so let's bump it even if Azure peer-pods are not currently used as part of our CI. Fixes: #9348 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-03-26 10:21:12 +01:00
Chengyu Zhu	d16971e37e	Merge pull request #9325 from ChengyuZhu6/image_service agent:image: Refactor code to improve memory efficiency of image service	2024-03-26 10:38:37 +08:00
Dan Mihai	6c72c29535	genpolicy: reduce policy debug prints Kata CI has full debug output enabled for the cbl-mariner k8s tests, and the test AKS node is relatively slow. So debug prints from policy are expensive during CI. Fixes: #9296 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-03-26 02:21:26 +00:00
Alex Lyn	cec943fc26	Merge pull request #9244 from Apokleos/dgb-gpu runtime-rs/dragonball: add support building kernel with upcall and GPU hotplug	2024-03-26 08:53:54 +08:00
Chelsea Mafrica	4e3deb5a3b	tools: Fix path for installing yq in packaging script The lib.sh script uses the right directory but the wrong path for the script that installs yq; fix it. Fixes #9165 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2024-03-25 15:09:52 -07:00
Chelsea Mafrica	cfb977625e	docs: Remove links to tests repo Remove links to tests repo and update with corresponding location in the current repo. Fixes #9165 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2024-03-25 15:09:52 -07:00
Chelsea Mafrica	d69514766e	src: Remove references to files in tests repo Change scripts and source that uses files in the tests repo to use the corresponding file in the current repo. Fixes #9165 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2024-03-25 15:09:52 -07:00
Gabriela Cervantes	ddef2be4f1	docs: Remove stale kernel information This PR removes stale kernel information from the README document. Fixes #9343 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-03-25 15:57:00 +00:00
Greg Kurz	e9e94d2dbd	release: Give a pretty name to all steps For a prettier rendering in the web UI. Fixes #9064 - part VI Signed-off-by: Greg Kurz <groug@kaod.org>	2024-03-25 15:50:35 +01:00
Greg Kurz	dce6ea57b2	release: Simplify the `create-new-release` action of `release.sh` Now that the version is an invariant for the entire workflow, it isn't required to obtain it with an environment variable. Just rely on the content of the `VERSION` file like other actions. Fixes #9064 - part VI Signed-off-by: Greg Kurz <groug@kaod.org>	2024-03-25 15:50:35 +01:00
Alex Lyn	5c54315a87	dragonball: fix CI failure due to poor UT adaptation. Fixes: #9144 Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2024-03-25 20:25:27 +08:00
Alex Lyn	079d894496	kernel: bump version in kata config version Fixes: #9140 Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2024-03-25 20:25:27 +08:00
Alex Lyn	070c3fa657	docs: add doc about building kernel with upcall and GPU hotplug We need some docs about how to build a guest kernel that supports both Upcall and Nvidia GPU passthrough (hotplug) at the same time. This patch adds them to help users build a guest kernel that supports both Upcall and Nvidia GPU hotplug/unplug. Fixes: #9140 Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2024-03-25 20:25:17 +08:00
ChengyuZhu6	06b9935402	docs: Add a document for kata guest image management design Add a document for kata guest image management design. Related feature: #8484 Fixes: #9225 -- part I Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com> Co-authored-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-03-25 18:17:23 +08:00
Chengyu Zhu	4029d154ba	Merge pull request #9313 from ChengyuZhu6/rtest agent: Refactor unit tests to leverage rstest for parameterization	2024-03-25 10:31:45 +08:00
Alex Lyn	bc309b9865	kernel: add CONFIG_CRYPTO_ECDSA into whitelist CONFIG_CRYPTO_ECDSA is not supported in older kernels such as 5.10.x, which may break the build if we build such a kernel with NVIDIA GPU support. This patch adds CONFIG_CRYPTO_ECDSA to whitelist.conf to avoid breaking the guest kernel build with NVIDIA GPU. Fixes: #9140 Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2024-03-25 08:05:31 +08:00
ChengyuZhu6	f47408fdf4	agent:image: Refactor code to improve memory efficiency of image service Currently, `.lock().await.clone()` results in `Option<ImageService>` being duplicated in memory with each call to `singleton()`. Consequently, if kata-agent receives numerous image pulling requests simultaneously, it will lead to the allocation of multiple `Option<ImageService>` instances in memory, thereby consuming additional memory resources. In image.rs, we introduce two public functions: `merge_bundle_oci()` and `init_image_service()`. These functions will encapsulate the operations on `IMAGE_SERVICE`, ensuring that its internal details remain hidden from external modules such as `rpc.rs`. Fixes: #9225 -- part II Signed-off-by: Xynnn007 <xynnn@linux.alibaba.com> Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-03-25 07:46:50 +08:00
ChengyuZhu6	7a49ec1c80	agent:util: Refactor the unit tests to leverage rstest Refactor the unit tests in util.rs to leverage rstest for parameterization. Fixes: #9314 Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-03-23 10:49:53 +08:00
ChengyuZhu6	2df2b4d30d	agent:namespace: Refactor unit tests to leverage rstest Refactor the unit tests in `namespace.rs` to leverage rstest for parameterization. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-03-23 10:49:48 +08:00
Hyounggyu Choi	d915a79e2d	Merge pull request #9280 from BbolroC/enable-qemu-on-s390x runtime-rs: Enable qemu on s390x	2024-03-22 23:58:42 +01:00
Fabiano Fidêncio	25cd28a32b	Merge pull request #9337 from fidencio/topic/bump-nydus-snapshotter versions: Update nydus-snapshotter to v0.13.11	2024-03-22 22:18:18 +01:00
Hyounggyu Choi	81aaa34bd6	runtime-rs: Add DeviceVirtioSerial and DeviceVirtconsole It is observed that virtiofsd exits immediately on s390x if there are no attached console devices. This commit resolves the issue by migrating `appendConsole()` from runtime and triggering it in `start_vm()`. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-03-22 19:27:13 +01:00
Hyounggyu Choi	2cfe745efb	runtime-rs: Enable memory backend option for Machine for s390x For s390x, it requires an additional option `memory-backend` for `-machine`. Otherwise, virtiofsd exits with HandleRequest(InvalidParam). This commit is to add a field `memory_backend` to `struct Machine` and turn it on for s390x. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-03-22 19:27:13 +01:00
Hyounggyu Choi	9bcfaad625	runtime-rs: Add ccw block device for rootfs Like nvdimm for x86_64, a block device for s390x should be treated differently with `virtio-blk-ccw`. This is to generate a QEMU command line parameter for a block device by using `-blockdev` and `-device` if the `vm_rootfs_driver` is set to `virtio-blk-ccw`. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-03-22 19:27:13 +01:00
David Esparza	3e40051634	Merge pull request #9255 from dborquez/thread_pid_function runtime-rs: ch: Implement full thread/tid/pid handling	2024-03-22 10:05:02 -06:00
Fabiano Fidêncio	d0949759ec	versions: Update nydus-snapshotter to v0.13.11 This version brings in a fix for cleaning up k3s/rke2 environments, which directly impacts the TDX machine that's part of our CI. Fixes: #9318 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-03-22 14:56:18 +01:00
Greg Kurz	e4f6a778a8	Merge pull request #9321 from fidencio/topic/releases-follow-up-VI Revert "release: Skip --generate-notes for this release"	2024-03-22 10:44:40 +01:00
GabyCT	a67382fd00	Merge pull request #9324 from GabyCT/topic/udevguide docs: Update libseccomp instructions in Developers Guide	2024-03-21 14:25:41 -06:00
Gabriela Cervantes	d54cdd3f0c	scripts: Fix unbound variables in k8s setup script This PR fixes the unbound variables error when trying to run the setup script locally in order to avoid errors. Fixes #9328 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-03-21 19:10:16 +00:00
Chengyu Zhu	9a4cb96262	Merge pull request #9312 from ChengyuZhu6/show-feature agent: Add guest-pull to the list of agent features in announce()	2024-03-21 23:35:29 +08:00
David Esparza	b498e140a1	runtime-rs: ch: Implement full thread/tid/pid handling Add in the full details once cloud-hypervisor/cloud-hypervisor#6103 has been implemented, and the feature is available in a Cloud Hypervisor release. Fixes: #8799 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2024-03-21 08:24:53 -06:00
James O. D. Hunt	1e684f5848	Merge pull request #9259 from jodh-intel/tests-add-static-checks-announce tests: static checker: Add announce message	2024-03-21 13:59:36 +00:00
ChengyuZhu6	754399d909	agent: Add guest-pull to the list of agent features in announce() Add guest-pull to the list of agent features in announce(). Fixes: #9225 -- part IV Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-03-21 20:01:52 +08:00
Xuewei Niu	9c4f9dcb35	Merge pull request #9311 from studychao/chao/fix_mtrr Dragonballl: introduce MTRR regs support	2024-03-21 17:24:27 +08:00
Hyounggyu Choi	9b2c08935b	runtime-rs: Pass different device argument based on bus type Currently, `*-pci` is used as an argument for the device config. This is not correct when a different type of bus is used; s390x uses `ccw`. This commit makes it flexible to generate the device argument based on the bus type. The structures `DeviceVhostUserFsPci` and `VhostVsockPci` are renamed to `DeviceVhostUserFs` and `VhostVsock` because the structure names are no longer bound to a certain bus type. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-03-21 09:25:37 +01:00
GabyCT	03f3d3491d	Merge pull request #9265 from GabyCT/topic/fixnydusclean gha: Fix nydus namespace clean up	2024-03-20 16:17:38 -06:00
GabyCT	702a8a440f	Merge pull request #9309 from GabyCT/topic/fixlograndom gha: Update journal log names for kubernetes artifacts	2024-03-20 16:17:17 -06:00
Gabriela Cervantes	05f4dc1902	docs: Update libseccomp instructions in Developers Guide This PR updates the libseccomp instructions in the Developers Guide. Fixes #9323 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-03-20 20:44:24 +00:00
GabyCT	163103d59e	Merge pull request #9307 from GabyCT/topic/fixdocreq docs: Update links in the Documentation Requirements document	2024-03-20 14:29:04 -06:00
Gabriela Cervantes	af18221ab7	docs: Update links in the Documentation Requirements document This PR updates the url links in the Documentation Requirements document. Fixes #9306 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-03-20 15:45:49 +00:00
Gabriela Cervantes	a855ecf21b	gha: Update journal log names for kubernetes artifacts This PR updates the journal log names for kubernetes artifacts in order to make sure that we have different names when we are running parallel GHA jobs. Fixes #9308 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-03-20 15:44:20 +00:00
Gabriela Cervantes	4fb8f8705f	gha: Fix nydus namespace clean up This PR terminates the nydus namespace to avoid the error that the flag needs an argument. Fixes #9264 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-03-20 15:41:39 +00:00
Fabiano Fidêncio	0278fc8a91	Revert "release: Skip --generate-notes for this release" This reverts commit `0fa59ff94b`, as now we'll be able to use the `--generate-notes`, hopefully, without blowing the allowed limit. Fixes: #9064 - part VI Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-03-20 15:48:22 +01:00
James O. D. Hunt	577abd014b	tests: static checker: Add announce message Added an announcement message to the `static-checks.sh` script. It runs platform / architecture specific code so it would be useful to display details of the platform the checker is running on to help with debugging. Fixes: #9258. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2024-03-20 13:41:26 +00:00
James O. D. Hunt	4af4a8ad2b	tests: static checker: Create setup function Move some of the common code into a setup function. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2024-03-20 11:58:28 +00:00
Fabiano Fidêncio	1aec4f737a	Merge pull request #9316 from fidencio/topic/releases-follow-up-V release: Skip --generate-notes for this release	2024-03-20 10:50:14 +01:00
Fabiano Fidêncio	0fa59ff94b	release: Skip --generate-notes for this release This release is a special case, as we've slacked for 6 months and the release content is way too long ... long enough to exceed the allowed limit for the release notes. With this in mind we'll just remove the `--generate-notes` for now, and then revert this commit as soon as the release is out, as releases should be happening every month and, ideally, we won't reach this situation ever again. Fixes: #9064 - part V Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-03-20 10:32:11 +01:00
Hyounggyu Choi	7b3d1adb8c	libs: Bump sysinfo to v0.30.5 It has been observed that the runtime stops running around `sysinfo::total_memory()` while adjusting a config on s390x. This is to update the crate to the latest version which happened to resolve the issue. (No explicit release note for this) Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-03-20 09:27:13 +01:00
Chao Wu	5a4b858ece	Dragonballl: introduce MTRR regs support MTRR, or Memory-Type Range Registers are a group of x86 MSRs providing a way to control access and cache ability of physical memory regions. During our test in runtime-rs + Dragonball, we found out that this register support is a must for passthrough GPU running CUDA application, GPU needs that information to properly use GPU memory. fixes: #9310 Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2024-03-20 14:18:16 +08:00
Fabiano Fidêncio	19eb45a27d	Merge pull request #8484 from ChengyuZhu6/guest-pull Merge basic guest pull image code to main	2024-03-19 23:15:39 +01:00
Hyounggyu Choi	6e782826c7	Merge pull request #9305 from BbolroC/handle-comment-for-skipped-tests CI\|k8s: Handle skipped tests with a comment for filter_out_per_arch	2024-03-19 22:54:03 +01:00
Fabiano Fidêncio	8911d3565f	gha: tests: Filter out confidential tests for aarch64 / ppc64le Those two architectures are not TEE capable, thus we can just skip running those tests there. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-03-19 18:06:01 +01:00
Fabiano Fidêncio	d14e9802b6	gha: k8s: Set {https,no}_proxy correctly for TDX This is needed as the TDX machine is hosted inside Intel and relies on proxies in order to connect to the external world. Not having those set causes issues when pulling the image inside the guest. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-03-19 18:06:00 +01:00
Fabiano Fidêncio	291b14bfb5	kata-deploy: Add the ability to set {https,no}_proxy if needed Let's make sure those two proxy settings are respected, as those will be widely used when pulling the image inside the guest on the Confidential Containers case. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-03-19 18:06:00 +01:00
ChengyuZhu6	5bad18f9c9	agent: set https_proxy/no_proxy before initializing agent policy When the https_proxy/no_proxy settings are configured alongside agent-policy enabled, the process of pulling image in the guest will hang. This issue could stem from the instantiation of `reqwest`’s HTTP client at the time of agent-policy initialization, potentially impacting the effectiveness of the proxy settings during image guest pulling. Given that both functionalities use `reqwest`, it is advisable to set https_proxy/no_proxy prior to the initialization of agent-policy. Fixes: #9212 Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-03-19 18:06:00 +01:00
ChengyuZhu6	db9f18029c	README: Add https_proxy and no_proxy to agent README Add agent.https_proxy and agent.no_proxy to the table in the agent README. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-03-19 18:06:00 +01:00
ChengyuZhu6	e23737a103	gha: refactor code with yq for better clarity refactor code with yq for better clarity: Before: ```bash yq write -i "${tools_dir}/packaging/kata-deploy/kata-deploy/base/kata-deploy.yaml" 'spec.template.spec.containers[0].env[7].value' "${KATA_HYPERVISOR}:${SNAPSHOTTER}" ``` After: ```bash yq write -i \ "${tools_dir}/packaging/kata-deploy/kata-deploy/base/kata-deploy.yaml" \ 'spec.template.spec.containers[0].env[7].value' \ "${KATA_HYPERVISOR}:${SNAPSHOTTER}" ``` Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-03-19 18:06:00 +01:00
ChengyuZhu6	2c0bc8855b	tests: Make sure to install yq before using it Make sure to install yq before using it to modify YAML files. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-03-19 18:06:00 +01:00
ChengyuZhu6	c52b356482	tests: add guest pull image test Add a test case of pulling image inside the guest for confidential containers. Signed-off-by: Da Li Liu <liudali@cn.ibm.com> Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com> Co-authored-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Co-authored-by: stevenhorsman <steven@uk.ibm.com> Co-authored-by: Georgina Kinge <georgina.kinge@ibm.com> Co-authored-by: Megan Wright <Megan.Wright@ibm.com>	2024-03-19 18:06:00 +01:00
ChengyuZhu6	e8c4effc07	tests: refactor the check for hypervisor to a function Extract two reusable functions for confidential tests in confidential_common.sh - check_hypervisor_for_confidential_tests: verifies if the input hypervisor supports confidential tests. - confidential_setup: performs the common setup for confidential tests. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com> Co-authored-by: stevenhorsman <steven@uk.ibm.com> Co-authored-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Co-authored-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-03-19 18:06:00 +01:00
ChengyuZhu6	6e5e4e55d0	rootfs: add ca file to guest rootfs To access the URL, the component to pull image in the guest needs to send a request to the remote. Therefore, we need to add CA to the rootfs. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-03-19 18:06:00 +01:00
ChengyuZhu6	8724d7deeb	packaging: Enable to build agent with PULL_TYPE feature Enable to build kata-agent with PULL_TYPE feature. We build kata-agent with guest-pull feature by default, with PULL_TYPE set to default. This doesn't affect how kata shares images by virtio-fs. The snapshotter controls the image pulling in the guest. Only the nydus snapshotter with proxy mode can activate this feature. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-03-19 18:06:00 +01:00
ChengyuZhu6	cd6a84cfc5	kata-deploy: Setting up snapshotters per runtime handler Setting up snapshotters per runtime handler as the commit (`6cc6ca5a7f`) described. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-03-19 18:05:59 +01:00
ChengyuZhu6	ba242b0198	runtime: support different cri container type check Support handling the image-guest-pull block volume from different CRIs, including cri-o and containerd. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-03-19 18:05:59 +01:00
ChengyuZhu6	874d83b510	agent/image: Use guest provided pause image By default the pause image and runtime config will be provided by the host side. This may pose security risks if the host configures a malicious pause image, so we use the pause image packaged in the rootfs instead. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com> Co-authored-by: Arron Wang <arron.wang@intel.com> Co-authored-by: Julien Ropé <jrope@redhat.com> Co-authored-by: stevenhorsman <steven@uk.ibm.com>	2024-03-19 18:05:59 +01:00
ChengyuZhu6	c269b9e8c6	agent: Add guest-pull feature for kata-agent Add "guest-pull" feature option to determine that the related dependencies would be compiled if the feature is enabled. By default, agent would be built with default-pull feature, which would support all pull types, including sharing images by virtio-fs and pulling images in the guest. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-03-19 18:05:59 +01:00
Aurélien	192250c52e	Merge pull request #9299 from sprt/sprt/mariner-normal-tests ci: aks: also run tests in normal instance for Mariner	2024-03-19 11:34:20 -05:00
ChengyuZhu6	965da9bc9b	runtime: support to pass image information to guest by KataVirtualVolume support to pass image information to guest by KataVirtualVolumeImageGuestPullType in KataVirtualVolume, which will be used to pull image on the guest. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-03-19 17:22:36 +01:00
ChengyuZhu6	cfd14784a0	agent: Introduce ImagePullHandler to support IMAGE_GUEST_PULL volume As we do not employ a forked containerd in confidential-containers, we utilize the KataVirtualVolume, which stores the image information as an integral part of `CreateContainer`. Within this process, we store the image information in rootfs.storage and pass this image url through `CreateContainerRequest`. This approach distinguishes itself from the use of `PullImageRequest`, as rootfs.storage is already set and initialized at this stage. To maintain clarity and avoid any need for modification to the `OverlayfsHandler`, we introduce the `ImagePullHandler`. This dedicated handler is responsible for orchestrating the image-pulling logic within the guest environment. This logic encompasses tasks such as calling the image-rs to download and unpack the image into `/run/kata-containers/{container_id}/images`, followed by a bind mount to `/run/kata-containers/{container_id}`. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-03-19 17:22:36 +01:00
ChengyuZhu6	462051b067	agent/image: merge container spec for images pulled inside guest When being passed an image name through a container annotation, merge its corresponding bundle OCI specification and process into the passed container creation one. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com> Co-authored-by: Arron Wang <arron.wang@intel.com> Co-authored-by: Jiang Liu <gerry@linux.alibaba.com> Co-authored-by: stevenhorsman <steven@uk.ibm.com> Co-authored-by: wllenyj <wllenyj@linux.alibaba.com> Co-authored-by: jordan9500 <jordan.jackson@ibm.com>	2024-03-19 17:22:36 +01:00
ChengyuZhu6	cec1916196	agent: Support https_proxy/no_proxy config for image download in guest Containerd supports setting a proxy for image downloads through an environment variable. In the CC stack, image download is offloaded to the kata agent, so we need to support a similar feature. Currently we add https_proxy and no_proxy; http_proxy is not added since it is insecure. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com> Co-authored-by: Arron Wang <arron.wang@intel.com>	2024-03-19 17:22:36 +01:00
ChengyuZhu6	9cddd5813c	agent/image: Enable image-rs crate to pull image inside guest With image-rs pull_image API, the downloaded container image layers will store at IMAGE_RS_WORK_DIR, and generated bundle dir with rootfs and config.json will be saved under CONTAINER_BASE/cid directory. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com> Co-authored-by: Arron Wang <arron.wang@intel.com> Co-authored-by: Jiang Liu <gerry@linux.alibaba.com> Co-authored-by: stevenhorsman <steven@uk.ibm.com> Co-authored-by: wllenyj <wllenyj@linux.alibaba.com>	2024-03-19 17:22:36 +01:00
ChengyuZhu6	2b3a00f848	agent: export the image service singleton instance Export the image service singleton instance. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com> Co-authored-by: Jiang Liu <gerry@linux.alibaba.com> Co-authored-by: Xynnn007 <xynnn@linux.alibaba.com> Co-authored-by: stevenhorsman <steven@uk.ibm.com> Co-authored-by: wllenyj <wllenyj@linux.alibaba.com>	2024-03-19 17:22:36 +01:00
ChengyuZhu6	1f1ca6187d	agent: Introduce ImageService Introduce structure ImageService, which will be used to pull images inside the guest. Fixes: #8103 Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com> co-authored-by: wllenyj <wllenyj@linux.alibaba.com> co-authored-by: stevenhorsman <steven@uk.ibm.com>	2024-03-19 17:22:33 +01:00
Hyounggyu Choi	b381743dd5	CI\|k8s: Handle skipped tests with a comment for filter_out_per_arch This commit updates `filter_k8s_test.sh` to handle skipped tests that include comments. In addition to the existing parameter expansion, the following expansions have been added: - Removal of a comment - Stripping of trailing spaces Fixes: #9304 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-03-19 17:21:25 +01:00
Chelsea Mafrica	42dfe0e8d1	Merge pull request #9286 from jodh-intel/agent-show-enabled-features agent: Show features enabled at build time	2024-03-19 08:54:49 -07:00
Wainer Moschetta	e6501aa4ad	Merge pull request #9229 from ldoktor/ocp-ci ocp.ci: Various fixes and improvements to the OCP pipeline	2024-03-19 11:13:01 -03:00
James O. D. Hunt	46aec0f15a	Merge pull request #9293 from jodh-intel/kata-manager-fix-containerd-for-docker kata-manager: Fix Docker install	2024-03-19 10:06:44 +00:00
Fabiano Fidêncio	e0a6b6449f	Merge pull request #9302 from BbolroC/fix-permission-issue-on-s390x-runners gha: Place pre-action on s390x runner for kata-deploy during release	2024-03-19 10:42:23 +01:00
Hyounggyu Choi	f2bc819644	gha: Place pre-action on s390x runner for kata-deploy during release This is to place a pre-action step for the kata-deploy job in order to clean up the github workspace directory before checking out the repo. Fixes: #9301 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-03-19 10:18:38 +01:00
Alex Lyn	7af2df408e	Merge pull request #9295 from likebreath/0318/fix_clh_default_netconfig runtime-rs: ch: Provide valid default value for NetConfig	2024-03-19 15:17:18 +08:00
Xuewei Niu	99d0e5fff8	Merge pull request #9270 from zvonkok/kata-agent-bind-mount kata-agent: optional bind flag	2024-03-19 10:39:23 +08:00
Aurélien Bombo	71a1be9c57	ci: aks: also run tests in normal instance for Mariner Currently we're only running the small instance tests. This adds the normal instance tests as well. Fixes: #9298 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2024-03-18 23:33:17 +00:00
Bo Chen	ad4262e86b	runtime-rs: ch: Provide valid default value for NetConfig The current default value of IP `0.0.0.0` with mask `0.0.0.0` will cause an ioctl error when used to create and configure a TAP device with newer versions of Cloud Hypervisor [1]. This patch replaces them with valid values that are the same as in the Go-lang runtime [2]. [1] https://github.com/cloud-hypervisor/cloud-hypervisor/pull/5924 [2] `e3f7852738/src/runtime/virtcontainers/pkg/cloud-hypervisor/client/model_net_config.go (L40-L57)` Fixes: #9254 Signed-off-by: Bo Chen <chen.bo@intel.com>	2024-03-18 15:47:58 -07:00
Fabiano Fidêncio	e3f7852738	Merge pull request #9289 from fidencio/topic/releases-follow-up-IV releases: Simplify the release in order to avoid pushing a commit updating the VERSION file	2024-03-18 17:38:58 +01:00
James O. D. Hunt	a6c3f75872	kata-manager: Fix Docker install Fix the Docker install by removing the second (erroneous) call to `containerd_installed()` in `handle_docker()`. Without this fix, installing using Docker (`-D`) will work iff you already have containerd installed. However, if you do not have containerd installed, the `containerd_installed()` function returns 1, which exits the script as we're running with `set -e`, leaving a broken Docker installation. > Note: containerd is installed via Docker's `get-docker.sh` script. Fixes: #9292. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2024-03-18 14:08:35 +00:00
stevenhorsman	0ab8e61a64	release: Remove release type from arch release Now we don't have minor and major releases and we are now generating a new version in the release workflow, we can tidy up the arch specific releases workflows to remove the extra required inputs Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-03-18 12:27:57 +00:00
Greg Kurz	3cfc1b6ba7	releases: Adjust documentation to the new workflow This drops the documentation of the legacy release scripts and adds a quick description of the scripts of the new workflow. It also highlights the bump of the `VERSION` file. Signed-off-by: Greg Kurz <groug@kaod.org>	2024-03-18 12:57:02 +01:00
Greg Kurz	76c640767e	releases: Drop Makefile It isn't used anymore. Signed-off-by: Greg Kurz <groug@kaod.org>	2024-03-18 12:54:00 +01:00
Greg Kurz	bfe19e68e8	kata-deploy: Adapt `test-kata.sh` to the new release workflow All releases are now created in the `main` branch following the very same workflow. No need to special case pre-releases. Signed-off-by: Greg Kurz <groug@kaod.org>	2024-03-18 12:54:00 +01:00
Fabiano Fidêncio	12578f11bc	releases: Assume VERSION has the correct version to be released This is done in order to avoid having to push a commit to the main branch, which is against the defined rules on GitHub. By doing this, we need to educate ourselves to always bump the VERSION file as soon as a release is cut out. As a side effect of this change, we can drop the release-major and release-minor workflows, as those are not needed anymore. Fixes: #9064 - part IV Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-03-16 13:30:58 +01:00
Fabiano Fidêncio	8ce50269fe	release: Bump the VERSION file to the next release number 3.3.0 it will be. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-03-16 13:21:27 +01:00
Xuewei Niu	9f512c016e	Merge pull request #9282 from gkurz/runtime-rs-fds-for-qemu runtime-rs: Consolidate the handling of fds passed to QEMU	2024-03-16 10:26:11 +08:00
Greg Kurz	1e526a4769	runtime-rs: Consolidate the handling of fds passed to QEMU File descriptors that are passed to QEMU need some special care. We want them to be closed when the QEMU process is started. But at the same time, it is required that the associated rust File structures, either coming from the `std::fs` or the `tokio::fs` crates, are still in scope when the QEMU process is forked. This is currently achieved by keeping File structures in variables at the outer scope of `start_vm()`. This scheme is currently duplicated, with similar justifications in the corresponding comments. Consolidate all this handling in one place with a more generic explanation. Fixes #9281 Signed-off-by: Greg Kurz <groug@kaod.org>	2024-03-15 16:14:59 +01:00
James O. D. Hunt	9ef59488d9	agent: Show features enabled at build time The agent now has a number of optional build-time features that can be enabled. Add details of these features to the following areas: - Version output (`kata-agent --version`) - Announce message (so that the details are always added to the journal at agent startup). - The response message returned by the ttRPC `GetGuestDetails()` API. Fixes: #9285. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2024-03-15 13:29:21 +00:00
Chelsea Mafrica	2c50d3c393	Merge pull request #9278 from wainersm/github_env_fix tests: fix nounset error with $GITHUB_ENV	2024-03-14 16:39:13 -07:00
Greg Kurz	6a112cc7a5	runtime-rs: Fix missing dependency Some previous contribution missed to run cargo clippy. Fix the dependency now so that it doesn't cause noise in future contributions. Signed-off-by: Greg Kurz <groug@kaod.org>	2024-03-14 23:19:38 +01:00
Dan Mihai	b3b00e00a6	Merge pull request #9246 from microsoft/danmihai/default-env genpolicy: default env if image doesn't have env	2024-03-14 11:01:43 -07:00
Dan Mihai	6094f1e31d	Merge pull request #9250 from microsoft/danmihai1/k8s-pid-ns2 tests: k8s: k8s-pid-ns.bats auto-generated policy	2024-03-14 10:10:24 -07:00
Zvonko Kaiser	c15e19c806	kata-agent: optional bind flag Fixes: #9269 From https://github.com/opencontainers/runtime-spec/blob/main/config.md#mounts type (string, OPTIONAL) The type of the filesystem to be mounted. bind may be only specified in the oci spec options -> flags update r#type The agent will ignore bind mounts if they are only specified in the OCI spec options and not in the flags. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2024-03-14 14:42:01 +00:00
Hyounggyu Choi	1dac6b1357	runtime-rs: Configure s390x specific flags for Makefile s390x supports a different machine type `s390-ccw-virtio` and it is not required to configure cpu features by default for the platform. A hypervisor `dragonball` is not supported on s390x so that `DBCMD` is not necessary. `vm-rootfs_driver` should be set to `virtio-blk-ccw`. This commit is to set the architecture-specific flags for Makefile. Fixes: #9158 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-03-14 13:05:35 +01:00
Wainer dos Santos Moschetta	981f95df55	tests: fix nounset error with $GITHUB_ENV Initialize $GITHUB_ENV to avoid nounset error when running the scripts locally out of Github Actions. Fixed commit `9ba5e3d2a8` Fixes #9217 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-03-13 14:57:38 -03:00
Dan Mihai	ac27caf1b4	Merge pull request #9248 from microsoft/danmihai1/k8s-exec.bats2 tests: k8s: k8s-exec.bats auto-generated policy	2024-03-13 09:21:12 -07:00
Alex Lyn	2aa3519520	kata-agent: Change order of guest hook and bind mount processing The guest_hook_path item in configuration.toml allows OCI hook scripts to be executed within Kata's guest environment. Traditionally, these guest hook programs are pre-built and included in Kata's guest rootfs image at a fixed location. While setting guest_hook_path = "/usr/share/oci/hooks" in configuration.toml works, it lacks flexibility. Not all guest hooks reside in the path /usr/share/oci/hooks, and users might have custom locations. To address this, a more flexible and configurable approach is proposed that allows users to specify their desired path. This could include using a sandbox bind mount path for hooks specific to that particular container. However, the current implementation of guest hooks and bind mounts in kata-agent has a reversed order of execution compared to the desired behavior. To achieve the intended functionality, we simply need to swap the order of their implementation. Fixes: #9274 Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2024-03-13 20:30:32 +08:00
Steve Horsman	8f4cbd49d7	Merge pull request #9263 from Amulyam24/gha-fixes gha: ensure that the self hosted runner is in desired state before running the workflow	2024-03-13 10:49:29 +00:00
Amulyam24	3f4b24be8b	gha: ensure that self hosted runner is prepared before running the workflow This PR ensures that the self hosted runner is prepared by taking necessary actions before running the workflow. The script prepare_runner.sh checks the following: 1. Ensure that containerd/docker is up and running 2. Make sure that the repository workspace is cleaned up and has no conflicts 3. Remove/cleanup any leftover files from the previous runs Fixes: #9262 Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2024-03-13 14:20:10 +05:30
Alex Lyn	410afcc913	Merge pull request #8866 from Apokleos/netdev-qemu-rs runtime-rs: add netdev params to cmdline for qemu-rs.	2024-03-13 13:07:43 +08:00
Dan Mihai	e8c2a45ce0	tests: k8s: k8s-pid-ns.bats auto-generated policy Auto-generate policy for k8s-pid-ns.bats. Fixes: #9249 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-03-12 22:34:46 +00:00
Lukáš Doktor	46e62eecb1	ci.ocp: Log the full grepped line rather than the expected msg we are grepping for an expected message but it might contain extra bits of information fruitful for later debugging. Let's include it in the output and the full log in case of an error. Signed-off-by: Lukáš Doktor <ldoktor@redhat.com>	2024-03-12 17:03:46 +01:00
Lukáš Doktor	7ff2eb508e	ci.ocp: Increase the mcp update timeout we're hitting this timeout quite often, looks like newer OCP takes longer to reconfigure. Increase the timeout to 1200. Signed-off-by: Lukáš Doktor <ldoktor@redhat.com>	2024-03-12 16:38:04 +01:00
Lukáš Doktor	cc02329fd1	ci.ocp: Add a cleanup script This script doesn't serve as a complete cleanup, but it can be used as a best-effort cleaner between deploying different versions of kata-containers on the same OCP cluster. Signed-off-by: Lukáš Doktor <ldoktor@redhat.com>	2024-03-12 16:38:04 +01:00
Lukáš Doktor	b811ee0650	ci.ocp: Allow to override the kata-deploy image sometimes we want to test a different than the latest image (eg. when verifying a PR via ghcr images or when bisecting a failure over older builds). Let's add a KATA_DEPLOY_IMAGE variable for that while keeping the latest image by default. Fixes: #9228 Signed-off-by: Lukáš Doktor <ldoktor@redhat.com>	2024-03-12 16:38:04 +01:00
Lukáš Doktor	2936503b24	ci.ocp: Always replace the kata-deploy image in OCP pipeline previously we only replaced the image when the previously defined one matched the "old_img". This is good to avoid modifying developers custom changes, but it might lead to hard-to-debug issues when the image stays different. Let's ensure we always replace the image with the one we asked for. Signed-off-by: Lukáš Doktor <ldoktor@redhat.com>	2024-03-12 16:38:04 +01:00
Lukáš Doktor	6525c94065	ci.ocp: Add a workaround to optionally enable skip_mount_home the latest upstream kata-containers requires the skip_mount_home to be enabled, which is default on OCP 4.14+ but disabled on OCP 4.13-. Let's use a "WORKAROUND_9206_CRIO" (called by kata-containers GH issue) variable to allow users to enable this treatment when needed. Related to: #9206 Signed-off-by: Lukáš Doktor <ldoktor@redhat.com>	2024-03-12 16:38:04 +01:00
Lukáš Doktor	739d627b4e	ci.ocp: Turn selinux relabel failures into warnings Instead of failing the pipeline let's proceed with an error message that selinux setup failed so, in case of a later failure, we know what might have caused it while keeping the coverage in case of a false setup issue. Signed-off-by: Lukáš Doktor <ldoktor@redhat.com>	2024-03-12 16:38:04 +01:00
Lukáš Doktor	76c452d4e0	ci.ocp: Wait for all pods to finish the work previously we only waited for a random pod to finish the selinux relabel, which could be error-prone. Let's wait for all of the pods to contain the expected message. Increase the timeout to 120s as some pods might take a little bit longer to finish. Signed-off-by: Lukáš Doktor <ldoktor@redhat.com>	2024-03-12 16:34:56 +01:00
Lukáš Doktor	f7febd07a0	ci.ocp: Allow to re-apply the selinux workaround in case we re-apply the selinux workaround or if user had already existing similar rule the relabel_selinux was failing. Let's allow it to modify the existing rules as well to avoid such issues. Signed-off-by: Lukáš Doktor <ldoktor@redhat.com>	2024-03-12 16:02:21 +01:00
Lukáš Doktor	fbbea68f1f	ci.ocp: Ignore selinux setup on non-selinux cluster improve our selinux workaround to work well on non-selinux clusters. Signed-off-by: Lukáš Doktor <ldoktor@redhat.com>	2024-03-12 16:02:20 +01:00
Alex Lyn	e2ae8ba79b	runtime-rs: add network device into Qemu's cmdline It will open tuntap device and vhost-net device and store device files. Fixes: #8865 Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2024-03-12 22:28:54 +08:00
Alex Lyn	d3bca4597e	runtime-rs: add open_named_tuntap to open a named tuntap device. The open_named_tuntap function is designed as a public function to open a tuntap device with the specified name. However, in order to reference existing methods in dbs_utils, we still need to keep the reference "path = "../../../dragonball/src/dbs_utils" in dependencies and cannot hide it. Fixes: #8865 Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2024-03-12 22:26:32 +08:00
Alex Lyn	005b333976	runtime-rs: add network helpers and impl ToQemuParams Add network helpers and impl ToQemuParams trait to build netdev params which are put into cmdline for Qemu VM running. Fixes: #8865 Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2024-03-12 22:25:39 +08:00
Alex Lyn	63786934f4	runtime-rs: set network namespace for qemu process and netdev. We need to ensure that add_network_device happens in the netns and move the qemu process into the netns, which keeps the qemu process running in this net namespace. Fixes: #8865 Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2024-03-12 22:21:43 +08:00
Alex Lyn	69a5e5b955	runtime-rs: add network device handler in start_vm. Add network device handler in start_vm, which is specially for Qemu VM running with added net params to command line. Fixes: #8865 Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2024-03-12 22:18:01 +08:00
Alex Lyn	a116b252c8	Merge pull request #9236 from jodh-intel/docs-improve-install-details docs: install: Simplify instructions	2024-03-12 14:29:38 +08:00
Alex Lyn	a31fb35e5d	Merge pull request #9231 from UiPath/fix/clh-pid-init clh: initialize clh pid before using it	2024-03-12 13:43:24 +08:00
Alex Lyn	9f6003adde	runtime-rs: add a new netns field in struct QemuInner. We need to add a new netns field in struct QemuInner, and initialize it with the argument passed down in prepare_vm(). Fixes: #8865 Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2024-03-11 16:02:39 +08:00
Alex Lyn	f571ec84d2	runtime-rs: add a public method to support process entering netns. The enter_netns function is designed as a public method to help VMMs running as an independent process enter a network namespace, reducing duplicate code. Fixes: #8865 Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2024-03-11 15:55:52 +08:00
Alex Lyn	4176fcc3c6	runtime-rs: make the code for cleanup fd flags as public method. It just moves the related code to a public file (utils.rs) and makes it a common method for both vsock and network, or some others. Fixes: #8865 Signed-off-by: Alex Lyn <alex.lyn@antgroup.com> Signed-off-by: Pavel Mores <pmores@redhat.com>	2024-03-11 15:52:20 +08:00
Alex Lyn	b1038704e0	runtime-rs: make NetnsGuard common for hypervisor and resource. In order to better support non-builtin vmm usage of NetnsGuard and reduce code duplication, we need to move it to a common path that can be referenced by both hypervisor and resource manager. This patch just moves the code from network/utils/netns.rs to kata-sys-utils/src/netns.rs Fixes: #8865 Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2024-03-11 15:38:42 +08:00
Alexandru Matei	617b0114b3	clh: initialize clh pid before using it The PID needs to be initialized before calling isClhRunning. waitVMM() uses isClhRunning and is called by launchClh() just before returning from function. Fixes: #9230 Signed-off-by: Alexandru Matei <alexandru.matei@uipath.com>	2024-03-09 13:53:51 +02:00
Dan Mihai	88b7a44271	tests: k8s: k8s-exec.bats auto-generated policy Auto-generate policy for k8s-exec.bats. Fixes: #9247 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-03-08 17:48:20 +00:00
Steve Horsman	54e5ce2464	Merge pull request #9154 from chungeun-choi/change-deprecated-package fixed - Change the deprecated module from 'io/util' to util. 'io/util…	2024-03-08 15:05:43 +00:00
Steve Horsman	e9bbf2f67b	Merge pull request #9203 from fidencio/topic/releases-follow-up-III release: Ensure the release-type is passed to workflows	2024-03-08 14:09:36 +00:00
Alex Lyn	c73597c39d	Merge pull request #9208 from studychao/chao/fix_virt_ci Dragonball: fix unit test problems when switching to new virt github machine	2024-03-08 09:41:05 +08:00
Chengyu Zhu	d49391a555	Merge pull request #8798 from LindaYu17/setpolicy add setpolicy function to kata-runtime tool	2024-03-08 06:31:57 +08:00
Dan Mihai	5398b6466c	Merge pull request #9224 from 3u13r/sidecar-container genpolicy: add restartPolicy to container struct	2024-03-07 12:59:55 -08:00
GabyCT	35d8f82232	Merge pull request #9242 from GabyCT/topic/enabldebugnerd gha: Add collect artifacts step to nerdctl workflow	2024-03-07 13:34:40 -06:00
Wainer Moschetta	91998af173	Merge pull request #9114 from wainersm/ci_kbs_cli CI: add KBS utilities for attestation tests	2024-03-07 16:34:03 -03:00
Dan Mihai	4c3d6fadc8	genpolicy: default env if image doesn't have env Use containerd's default environment for container images that don't specify the Env field. Also, re-enable policy env variable verification, now that these uncommon images are supported too. Fixes: #9239 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-03-07 16:56:06 +00:00
Dan Mihai	b3a02d5e06	Merge pull request #9128 from microsoft/danmihai1/test-genpolicy tests: k8s: auto-generated policy	2024-03-07 08:50:47 -08:00
Fabiano Fidêncio	8faab965a7	gh: Fix payload-after-push tags We now expect the arch specific images to be tagged as kata-containers-latest-${arch}. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-03-07 12:02:51 +00:00
Fabiano Fidêncio	eab78cf1ba	release: Reword the extra notes added as part of the release We're trying to keep just the bare minimum info, as we really would like to not have the list of commits, and mainly the list of new contributors, truncated from the release notes. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-03-07 12:02:51 +00:00
Fabiano Fidêncio	658fb6972b	release: Ensure the release-type is passed to workflows We need to ensure the release type is passed down to workflows, otherwise we'll fail to get the correct release version for tagging the daemonset images. Fixes: #9064 - part III Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-03-07 12:02:51 +00:00
Alex Lyn	a0a50f5e52	Merge pull request #9191 from Apokleos/fix-kata-ctl-exec0 kata-ctl: Support using container short ID to enter guest.	2024-03-07 19:26:40 +08:00
Wainer dos Santos Moschetta	8ea9ac515e	tests/k8s: update kbs repository Recently confidential-containers/kbs repository was renamed to confidential-containers/trustee. Github will automatically resolve the old URL but we better adjust it in code. The trustee repository will be cloned to $COCO_TRUSTEE_DIR. Adjusted file paths and pushd/popd's to use $COCO_KBS_DIR ($COCO_TRUSTEE_DIR/kbs). On versions.yaml changed from `coco-kbs` to `coco-trustee` as in the future we might need other trustee components, so keeping it generic. Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-03-07 11:20:36 +00:00
Wainer dos Santos Moschetta	c669567cd3	tests/k8s: add utils to set KBS policies Added the kbs_set_resources_policy() function to set the KBS policy. Also the kbs_set_allow_all_resources() and kbs_set_deny_all_resources to set the "allow all" and "deny all" policy, respectively. Fixes #9056 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-03-07 11:20:36 +00:00
Wainer dos Santos Moschetta	6f0d38094d	tests/k8s: add utils to set KBS resources Added utility functions to manage resources in KBS: - kbs_set_resource(), where the resource data is passed via argument - kbs_set_resource_from_file(), where the resource data is found in a file Fixes #9056 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-03-07 11:20:36 +00:00
Wainer dos Santos Moschetta	2a374422c5	tests/k8s: add function to install kbs-client Added kbs_install_cli function to build and install the kbs-client executable if not present into the system. Removed the stub from gha-run.sh; now the install kbs-client in the .github/workflows/run-kata-deploy-tests-on-aks.yaml will effectively install the executable. Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-03-07 11:20:36 +00:00
Wainer dos Santos Moschetta	4141875ffd	ci/lib.sh: set GOPATH default value Scripts sourcing ci/lib.sh need to set $GOPATH, otherwise they will fail. This ensures that GOPATH is set to ${HOME}/go unless it is already exported. Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-03-07 11:20:36 +00:00
Wainer dos Santos Moschetta	e410aef4fa	tests/k8s: add utils to get kbs service address Added functions to return the service host, port or full-qualified HTTP address, respectively, kbs_k8s_svc_host(), kbs_k8s_svc_port(), and kbs_k8s_svc_http_addr(). Fixes #9056 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-03-07 11:20:36 +00:00
Leonard Cohnen	e30e8ab7dc	genpolicy: add restartPolicy to container struct This adds support for sidecar container introduced in Kubernetes 1.28 Fixes: #9220 Signed-off-by: Leonard Cohnen <lc@edgeless.systems>	2024-03-07 12:00:14 +01:00
Chungeun Choi	bad263f399	runtime: Replace deprecated module "io/ioutil" with "io" This change updates the module import to use 'io' instead of the deprecated 'io/ioutil'. Fixes: #9166 Signed-off-by: Chungeun Choi <ce.choi@okestro.com>	2024-03-07 10:56:06 +00:00
Alex Lyn	ef9a38e551	shim-interface: add Copyright of AntGroup in file shim-interface.rs Fixes: #9189 Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2024-03-07 15:46:32 +08:00
Alex Lyn	2972a3a675	shim-interface: add UT for get_uds_with_sid Fixes: #9189 Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2024-03-07 15:45:44 +08:00
Alex Lyn	7145243bd3	kata-ctl: Support using container short ID to enter guest. Fixes: #9189 Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2024-03-07 15:44:47 +08:00
Linda Yu	bb77d2d7e6	docs: add docs on how to set policy by kata-runtime Fixes: #8797 Signed-off-by: Linda Yu <linda.yu@intel.com>	2024-03-07 15:00:23 +08:00
Linda Yu	1c5693be86	stream: repeat copybuffer if it is blocked by policy copyBuffer returns and the streams will be closed when error occurs. If the error contains "blocked by policy" it means the log output is disabled by policy with "ReadStreamRequest" and "WriteStreamRequest" set to false. But at this moment, we want the real stream still working (not be seen) because we might want to enable logging for debugging purpose, so we repeat copybuffer in this case to avoid streams being closed. Fixes: #8797 Signed-off-by: Linda Yu <linda.yu@intel.com>	2024-03-07 15:00:23 +08:00
Linda Yu	eda419cb03	kata-runtime: add set policy function to kata-runtime Logging/debugging information may be disabled in production for security reasons, but we should still give customers a way to get logging information at runtime. This PR implements a setpolicy function in the kata-runtime tool; it can set the whole policy, not just the logging rules. setpolicy invokes remote attestation, which means that before setting a policy at runtime, the user has to reconfigure the new policy hash in KBS/AS. usage: kata-runtime policy set policy.rego --sandbox-id XXXXXXXX Fixes: #8797 Signed-off-by: Linda Yu <linda.yu@intel.com>	2024-03-07 15:00:23 +08:00
Dan Mihai	c08b696d9e	tests: k8s: k8s-shared-volume generated policy Auto-generate policy for k8s-shared-volume.bats. Fixes: #9096 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-03-07 05:57:30 +00:00
Dan Mihai	b24758fad8	tests: k8s: k8s-scale-nginx auto-generated policy Auto-generate policy for k8s-scale-nginx.bats. Fixes: #9096 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-03-07 05:57:30 +00:00
Dan Mihai	af9ac8d194	tests: k8s: k8s-replication auto-generated policy Auto-generate policy for k8s-replication.bats. Fixes: #9096 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-03-07 05:57:30 +00:00
Dan Mihai	56689c6800	tests: k8s: k8s-qos-pods auto-generated policy Auto-generate policy for k8s-qos-pods.bats. Fixes: #9096 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-03-07 05:57:30 +00:00
Dan Mihai	0179f53469	tests: k8s: k8s-parallel auto-generated policy Auto-generate policy for k8s-parallel.bats. Fixes: #9096 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-03-07 05:57:30 +00:00
Dan Mihai	73a8b61c2e	Merge pull request #9243 from microsoft/danmihai1/genpolicy-unblock-ci genpolicy: disable env variable verification	2024-03-06 21:44:18 -08:00
Dan Mihai	e61ef30a76	genpolicy: disable env variable verification Disable env variable verification to unblock CI, until container images that don't specify the Env variables will be handled correctly (see #9239). Also, mark the image config Env field as optional, thus allowing policy generation for these container images. Fixes: #9240 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-03-07 01:59:18 +00:00
Gabriela Cervantes	94fdcda7f7	scripts: Add collect artifacts function in nerdctl gha run script This PR adds the collect artifacts function in nerdctl gha run script. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-03-06 19:48:12 +00:00
Gabriela Cervantes	f902ee78d0	gha: Add collect artifacts step to nerdctl workflow This PR adds the collect artifacts step to nerdctl workflow. Fixes #9241 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-03-06 19:41:16 +00:00
GabyCT	640ed591bd	Merge pull request #9219 from GabyCT/topic/fixkerneldoc docs: Remove stale kernel information at README documentation	2024-03-06 10:24:31 -06:00
James O. D. Hunt	b1d4cbd9d1	utils: spell-checker: Fix grep warning Fix the `grep(1)` warning caused by the unnecessary escaping of the hash/sharp symbol. Fixes: #9235. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2024-03-06 13:21:15 +00:00
James O. D. Hunt	5257bfa9a9	docs: install: Simplify instructions Move the "build from source" and "manual installation" details to the developer guide. This makes the installation landing page clearer for users. Fixes: #9234. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2024-03-06 13:14:03 +00:00
Ryan Savino	fdfc825bc4	Merge pull request #9174 from ryansavino/snp-qemu-stable-coco-tag versions: SNP qemu updated to stable coco tagged version	2024-03-06 01:03:10 -06:00
GabyCT	83e39a206c	Merge pull request #9223 from jodh-intel/tests-add-k3s-artifacts tests: Add k3s artifacts	2024-03-05 13:37:21 -06:00
James O. D. Hunt	a67ed2f1c2	tests: Add k3s artifacts The k3s distribution of k8s uses an embedded version of containerd and configures it to log to a file, not the journal. Hence, although we collect the journal as a test artifact, we also need to collect the actual log files for containerd. Also collect the k3s containerd config files to help with debugging. Fixes: #9104. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2024-03-05 17:54:20 +00:00
GabyCT	9fab57acc8	Merge pull request #9217 from wainersm/revert_collect_artifacts gha: export start_time to collect artifacts properly	2024-03-05 11:11:49 -06:00
Gabriela Cervantes	12be4cf828	docs: Remove stale kernel information at README documentation This PR removes stale kernel information at README documentation. Fixes #9218 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-03-05 16:46:45 +00:00
Wainer dos Santos Moschetta	9ba5e3d2a8	gha: export start_time to collect artifacts properly The jobs running on garm will collect journal information. The data gathered is based on the time the tests started running. The $start_time is exported on run_tests() and used in collect_artifacts(). It happens that run_tests() and collect_artifacts() are called on different steps of the workflow and the environment variables aren't preserved between them, i.e, $start_time exported on the first step is not available on the subsequents. To solve that issue, let's save $start_time in the file pointed out by $GITHUB_ENV that Github actions uses to export variables. In case $GITHUB_ENV is empty then probably it is running locally outside of Github, so it won't save the start time value. Fixes #9217 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-03-05 12:15:20 -03:00
James O. D. Hunt	b761a80bd1	Merge pull request #9059 from jodh-intel/kata-manager-add-hypervisor-option kata-manager: Allow hypervisor to be changed	2024-03-05 09:30:04 +00:00
Alex Lyn	bf5edc8e73	Merge pull request #9155 from Jimmy-Xu/fix-build-gpu-kernel gpu: fix build guest kernel with gpu	2024-03-05 16:53:44 +08:00
Greg Kurz	0320198889	Merge pull request #9206 from lifupan/main CI: fix the issue of ci failure on crio	2024-03-05 09:52:13 +01:00
Fupan Li	628f57aca0	Merge pull request #9193 from UiPath/fix/clh-dax clh: Enable DAX for rootfs	2024-03-05 09:39:22 +08:00
Wainer Moschetta	38088a934b	Merge pull request #9184 from wainersm/fix_kata_deploy_bats tests/kata-deploy: fix checker for kata-deploy running	2024-03-04 20:50:37 -03:00
GabyCT	77d048da4d	Merge pull request #9065 from wainersm/ci_install_kbs CI: Install KBS on k8s for attestation tests	2024-03-04 16:59:01 -06:00
GabyCT	a4153f3b71	Merge pull request #9210 from GabyCT/topic/addtestreadme docs: Add general README for tests section	2024-03-04 16:54:28 -06:00
Gabriela Cervantes	5d50262422	docs: Add general tests documentation in main README This PR adds the general tests documentation in main README of the kata containers repository. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-03-04 21:53:01 +00:00
Gabriela Cervantes	d5fa2bebd5	docs: Add general README for tests section This PR adds general README documentation for the tests section in the kata containers repository. Fixes #9209 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-03-04 21:50:37 +00:00
GabyCT	4dea9019ab	Merge pull request #9126 from GabyCT/topic/addartifactsk gha: Storing artifacts for logs of k8s tests garm	2024-03-04 15:41:54 -06:00
Gabriela Cervantes	fc5e040d96	scripts: Apply general fixes to variables in gha-run script This PR applies general fixes to variables in gha-run script. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-03-04 18:54:15 +00:00
James O. D. Hunt	7af892f8d8	docs: Update kata-manager docs for switching hypervisor Add details to the README for `kata-manager` showing how to list available hypervisor configs (packaged and local), and switch between the configurations. Also, update the hypervisors page to show a lot more detail about the hypervisor configurations, including the "short name" used by `kata-manager` for switching hypervisor config. > Note: > > These changes only apply to the current default golang runtime. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2024-03-04 12:24:31 +00:00
James O. D. Hunt	4f6fef1f61	docs: Whitespace fix Remove extraneous whitespace from hypervisors doc. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2024-03-04 12:18:05 +00:00
James O. D. Hunt	1ac3caf656	kata-manager: Allow hypervisor to be changed Add new options to allow the configured hypervisor to be changed: - `-L`: List available _packaged_ hypervisor config short names. - `-e`: List available _local_ hypervisor config names. - `-H <hypervisor>`: Install Kata then switch to the specified hypervisor. - `-S <hypervisor>`: Switch to the specified hypervisor (by config short name [Errors if Kata not installed]). For example, to install Kata and configure it to use Cloud Hypervisor with the golang Kata runtime: ```bash $ kata-manager.sh -H clh ``` To switch back to the default hypervisor: ```bash $ kata-manager.sh -S default ``` To show details of the available packaged configs: ```bash $ kata-manager.sh -L ``` To show details of the local configs: ```bash $ kata-manager.sh -e ``` > Notes: > > - This change only applies to the current default (golang) Kata runtime. > > - Although this is mainly for users wishing to switch hypervisor (by > changing the Kata config file to another of the packaged config files > provided for specific hypervisors), strictly it allows users to change > to _any_ config file. For example, if the user has a config file called > `/etc/kata-containers/configuration-my-custom-config.toml`, they could > switch to this by running: > > ```bash > $ kata-manager.sh -S my-custom-config > ``` > > - The "config short names" are the hypervisor specific part of the configuration file name. > For example, the config short name for file `configuration-qemu.toml` is > `qemu` and the config short name for `configuration-clh.toml` is `clh`. Fixes: #8305. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2024-03-04 12:18:00 +00:00
James O. D. Hunt	0bb558c0b9	kata-manager: Fix symlink handling The `configure_kata()` function modifies the configuration file to enable debug. But it was doing this by calling `sed -i` which, by default, creates a new _file_ from the `configuration.toml` symbolic link. This defeated the point of the symbolic link which is supposed to resolve to the local copy of the pristine config file, so we now use the GNU sed(1) specific `--follow-symlinks` option to retain the symlink. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2024-03-04 11:15:39 +00:00
James O. D. Hunt	455637b30a	kata-manager: Show message when checking file Add an info message just before the archive file is checked. This keeps the user informed about what is happening as it can take a few seconds to perform the checks on slower systems. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2024-03-04 11:15:39 +00:00
James O. D. Hunt	ce350450e8	kata-manager: Sort options in usage Ensure the usage statement lists all options in alphabetical order. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2024-03-04 11:15:39 +00:00
James O. D. Hunt	159d29665a	kata-manager: Whitespace fixes Remove extraneous whitespace. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2024-03-04 11:15:39 +00:00
Chao Wu	9f0eab904b	Dragonball: fix test_signal_handler a) Some unknown syscalls are triggered on the new GitHub virtual machines, which breaks the make test process with SIGSYS after applying SeccompFilter. To fix this, we change the allowlist in this unit test for the seccomp filter into a blocklist to avoid hitting the unknown syscalls. b) The lazy static METRICS is not fully initialized in the unit test, which may lead to unstable results for this UT. fixes: #9207 Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2024-03-04 16:27:27 +08:00
Chao Wu	253fe72435	Dragonball: fix test_handler_insert_region The mmap region's start guest address was hard-coded, and a later check verifies that this address is larger than or equal to mem_end (which defaults to host_phy_mem >> 1) in order to satisfy the requirement for DaxMemory. Since the GitHub virtual machines have more physical memory than the previous CI machines, the hard-coded value no longer works. To fix this, we change the address to mem_end in the unit test so that it is not affected by changes to the host machine. fixes: #9207 Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2024-03-04 16:27:19 +08:00
Jimmy-Xu	5ada7329b8	gpu: fix build guest kernel with nvidia gpu - enable CONFIG_MTRR,CONFIG_X86_PAT on x86_64 for nvidia gpu - optimize -f of build-kernel.sh, clean old kernel path and config before setup - add kernel 5.16.x Fixes: #9143 Signed-off-by: Jimmy-Xu <xjimmyshcn@gmail.com>	2024-03-04 09:40:42 +08:00
Fupan Li	07e0cf1855	CI: fix the issue of ci failure on crio PR #8760 tentatively tried to have the shim to run in its own mount namespace for the sake of improving isolation between the sandbox and the host. Thus crio storage drivers shouldn't create a PRIVATE bind mount on their home directory. Otherwise, the container's rootfs mount wouldn't be propagated to kata runtime's mount namespace, and kata runtime couldn't access the container's rootfs files. So, when kata cooperated with crio, crio should set skip_mount_home=true for its storage overlay. Fixes: #9028 Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>	2024-03-03 20:53:36 +08:00
Wainer dos Santos Moschetta	2c24977cb1	tests/k8s: allow to overwrite the cluster name _print_cluster_name() creates a string based on information like the pull request number and commit SHA. However, when developing the scripts you might want to use an arbitrary name, so the $AKS_NAME variable was introduced; once exported, it overrides the generated name. Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-03-02 12:42:35 -03:00
Wainer dos Santos Moschetta	5e4b7bbd04	tests/k8s: expose KBS service externally Until this point the deployed KBS service is only reachable from within the cluster. This introduces a generic mechanism to apply an Ingress configuration to expose the service externally. The first implemented ingress is for AKS. In case the HTTP application routing isn't enabled in the cluster (this is required for ingress), an add-on is applied. The get_cluster_specific_dns_zone() and enable_cluster_http_application_routing() helper functions were added to gha-run-k8s-common.sh. Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-03-02 12:42:35 -03:00
Wainer dos Santos Moschetta	e1e0b94975	tests/k8s: introduce the CoCo kbs library Introduce the tests/integration/kubernetes/confidential_kbs.sh library that contains functions to manage the KBS on CI. Initially implemented the kbs_k8s_deploy() and kbs_k8s_delete() functions to, respectively, deploy and delete KBS on Kubernetes. Also hooked those functions in the tests/integration/kubernetes/gha-run.sh script to follow the convention of running commands from Github Workflows: $ .tests/integration/kubernetes/gha-run.sh deploy-coco-kbs $ .tests/integration/kubernetes/gha-run.sh delete-coco-kbs Fixes #9058 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-03-02 12:39:26 -03:00
Wainer dos Santos Moschetta	6a28c94d99	tests/k8s: add a kustomize installer Kustomize has been used on some of our internal components (e.g. kata-deploy) to manage k8s deployments. On CI, the `sed` tool has been used to edit kustomization.yaml files, but `kustomize` is more suitable for that purpose. So in order to use that tool in CI scripts in the future, this commit introduces the `install_kustomize()` function, which downloads and installs the binary in /usr/local/bin in case it's not found on $PATH. Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-03-02 12:39:26 -03:00
Xuewei Niu	daab76de36	Merge pull request #9201 from liubogithub/liubo/dev/panic_fix_3 katautils: fix panic on tracing.	2024-03-02 10:27:02 +08:00
GabyCT	4a0cfc4e3f	Merge pull request #9199 from GabyCT/topic/enablecri gha: Enable cri-containerd tests for cloud hypervisor runtime-rs	2024-03-01 12:23:16 -06:00
Steve Horsman	1ec33b8879	Merge pull request #9200 from wainersm/ci_install_kbs-timeout gha: increase timeout of KBS steps	2024-03-01 16:00:21 +00:00
Gabriela Cervantes	7299dbdb43	gha: Store journalctl logs This PR stores the journalctl logs. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-03-01 15:17:20 +00:00
Gabriela Cervantes	342d3a320d	gha: Add collect artifacts function in gha-run script This PR adds the collect artifacts function in gha-run script for the kubernetes tests. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-03-01 15:17:20 +00:00
Gabriela Cervantes	2070e3481e	gha: Storing artifacts for logs of k8s tests garm This PR helps to store the artifacts for different logs for k8s tests on garm. Fixes #9103 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-03-01 15:17:20 +00:00
Greg Kurz	df17bf95d5	Merge pull request #9169 from ldoktor/backport-ocp ci.ocp: Backport service-up detection fixes	2024-03-01 16:09:55 +01:00
Greg Kurz	dc6bda19bf	Merge pull request #9179 from gkurz/fix-k8s-sandbox-vcpus-allocation-check tests: k8s: Adapt k8s-sandbox-vcpus-allocation.bats to kubernetes v1.29	2024-03-01 15:55:07 +01:00
Lukáš Doktor	6fffbaa190	ci.ocp: Backport service-up detection fixes This backports the: 9060e930caf2d20f413df07778d3ab497493161c ci.ocp: Add debug output on HTTP service failure these logs are vital to analyze a setup failure. a10a1e2c9cbc21afc1e80f22b0fb8634d27cbd8d ci.ocp: Improve the service-up detection waiting for the first response is not sufficient as OCP returns html page without error even when the route is not yet established describing the issue (why it doesn't reply with 500?). Waiting for the correct output should do better. commits from the kata-containers/tests repo. Fixes: #8653 Signed-off-by: Lukáš Doktor <ldoktor@redhat.com>	2024-03-01 12:04:20 +01:00
Alex Lyn	13a20957cb	Merge pull request #9164 from Apokleos/directvol-csi-dockerfile csi-kata-directvolume: add Dockerfile for building csi image	2024-03-01 18:12:19 +08:00
Alex Lyn	f69428a1e7	csi-kata-directvolume: add Dockerfile for building csi image Fixes: #9163 Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2024-03-01 10:41:51 +08:00
Liu Bo	b6f8355ea3	katautils: fix panic on tracing. This fixes a panic on tracing on container exit. The root cause is that global var needs to be set by "=" instead of ":=". Fixes: #9102 Signed-off-by: Liu Bo <liub.liubo@gmail.com>	2024-02-29 18:40:23 -08:00
Wainer dos Santos Moschetta	24c163e6e1	tests/kata-deploy: fix checker for kata-deploy running Currently, the check that kata-deploy is running assumes that the daemonset has scheduled at least one pod; however, it might not have, and the kubectl wait command then fails due to "error: no matching resources found". On CI I've observed that failure intermittently. I suspect the service account kata-deploy-sa takes a while to show up, so no kata-deploy pod is scheduled in the meantime. Changed the checker logic to use waitForProcess() to keep testing whether it is already running, or hit the timeout (still 10m). Fixes #9183 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-02-29 22:26:27 -03:00
Wainer dos Santos Moschetta	4410df7233	gha: increase timeout of KBS steps The timeout of the step that deploys KBS in the run-k8s-tests-on-aks workflow should be increased so that there is enough time to check that the service is healthy and exposed. Likewise for the step that builds the kbs-client, which requires enough time to build the executable. Fixes #9058 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-02-29 22:05:58 -03:00
Dan Mihai	11b603e5f1	Merge pull request #9139 from microsoft/saulparedes/genolicy_panic_subpath genpolicy: panic when we see a volume mount subpath	2024-02-29 12:18:56 -08:00
Gabriela Cervantes	beb592b309	gha: Enable cri-containerd tests for cloud hypervisor runtime-rs This PR enables the cri-containerd tests for cloud hypervisor runtime-rs. Fixes #9198 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-02-29 20:18:16 +00:00
GabyCT	a4f5815f6b	Merge pull request #9182 from GabyCT/topic/addclhcri gha: Add cloud-hypervisor (runtime-rs) support to cri-containerd tests	2024-02-29 14:12:01 -06:00
Gabriela Cervantes	0f595cf15b	gha: General variable fixes to gha-run script This PR adds general variable fixes to gha-run script. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-02-29 18:15:27 +00:00
Alexandru Matei	6856e8f678	clh: Enable DAX for rootfs Fixes: #9192 Signed-off-by: Alexandru Matei <alexandru.matei@uipath.com>	2024-02-29 18:01:47 +02:00
Greg Kurz	f3442cdef9	tests: k8s: Adapt k8s-sandbox-vcpus-allocation.bats to kubernetes v1.29 Kubernetes v1.29 introduced a new `PodReadyToStartContainers` condition that gets inserted at index 0 in the conditions array. This means that the expected `PodCompleted` reason can now be either at index 0 with kubernetes v1.28 and older or at index 1 starting with kubernetes v1.29. This is fragile at best since the `kubectl wait` doesn't allow to combine multiple checks. Also, checking the reason is dubious as it doesn't really tell if the pods have actually completed or not. Check the pod phase to be `Succeeded` instead, this guarantees that : > All containers in the Pod have terminated in success, and will not > be restarted. Fixes #9178 Signed-off-by: Greg Kurz <groug@kaod.org>	2024-02-29 17:00:31 +01:00
Greg Kurz	f89120662d	tests: k8s: Wait for all pods concurrently A single invocation of `kubectl wait` can handle all pods. Signed-off-by: Greg Kurz <groug@kaod.org>	2024-02-29 17:00:31 +01:00
Greg Kurz	58bc026656	Merge pull request #9180 from fidencio/topic/actually-add-the-pause-image-into-the-rootfs rootfs: Fix PAUSE_IMAGE_TARBALL addition to the rootfs	2024-02-29 13:56:32 +01:00
Chengyu Zhu	c01ba58b3d	Merge pull request #9176 from ChengyuZhu6/stale_doc docs: renew stale link	2024-02-29 18:35:26 +08:00
Fabiano Fidêncio	1d2f7afd1f	Merge pull request #9188 from fidencio/topic/releases-follow-up-II releases: Second round of follow-up fixes	2024-02-29 10:59:36 +01:00
Fabiano Fidêncio	c9dfe49152	gha: payload: Fix env var declarations This was introduced by `a45988766c`, but didn't follow the correct format for the env declaration. Fixes: #9064 - part II Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-29 10:52:49 +01:00
Fabiano Fidêncio	1c3a769822	gha: payload: Don't use concurrency for this job We want all payloads to be built and published, regardless if there's a new PR merged. This will help people to easily trace / debug issues. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-29 10:52:45 +01:00
Fabiano Fidêncio	02af62b66c	gha: payload: Stop generating payloads for the stable branches We've decided to not maintain stable branches anymore, thus we can only trigger this workflow for the `main` branch. For more details, please, see: https://github.com/kata-containers/kata-containers/issues/9064 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-29 10:42:25 +01:00
Fabiano Fidêncio	b4061a1c23	Merge pull request #9170 from fidencio/topic/releases-follow-up-I release: Add the needed fixes for the release process	2024-02-29 10:36:20 +01:00
ChengyuZhu6	e5d3627794	docs: renew stale link Renew the stale link "https://github.com/containerd/containerd/tree/main/runtime/v2" to the latest "https://github.com/containerd/containerd/tree/main/core/runtime/v2". Fixes: #9177 Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-02-29 15:03:02 +08:00
Fabiano Fidêncio	0022474164	rootfs: Fix PAUSE_IMAGE_TARBALL addition to the rootfs We were never passing the arguments to add the PAUSE_IMAGE to the rootfs, leading to it never being present in the confidential image / initrd. Fixes: #9032 -- part II Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-28 22:42:27 +01:00
GabyCT	aacbbde35d	Merge pull request #9172 from GabyCT/topic/docpradvice docs: Update Code PR advice document	2024-02-28 13:37:28 -06:00
Gabriela Cervantes	3cd319fcc2	scripts: General fixes to the gha-run script This PR implements general fixes to the gha-run script for the cri-containerd tests. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-02-28 19:32:51 +00:00
Gabriela Cervantes	5a498948c8	scripts: Skip cri-containerd in gha-run script This PR skips the cri-containerd in gha-run script for cloud hypervisor runtime-rs. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-02-28 19:30:38 +00:00
Gabriela Cervantes	4bfb9c30e7	gha: Add cloud-hypervisor (runtime-rs) support to cri-containerd tests This PR adds the Cloud Hypervisor driver, integrated with the runtime-rs, as part of the cri-containerd tests. Fixes #9181 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-02-28 19:24:18 +00:00
Wainer Moschetta	c4b8270073	Merge pull request #9009 from wainersm/runk_bats tests/runk: fix the "run ps command" flaky test	2024-02-28 15:58:36 -03:00
Wainer Moschetta	129ce84705	Merge pull request #9116 from wainersm/ci_install_kbs-workflow gha: k8s: prepare AKS workflow to install the CoCo KBS	2024-02-28 14:43:41 -03:00
Gabriela Cervantes	ec1dde1d01	docs: Update Code PR advice document This PR updates the code PR advice document to make the proper references now that we have moved the test repository to the kata containers repository. Fixes #9171 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-02-28 16:14:22 +00:00
Ryan Savino	9e9dae8efb	versions: SNP qemu updated to stable coco tagged version New qemu fork of AMDESE created in confidential-containers project. SNP qemu version now pointed to stable tag at: https://github.com/confidential-containers/qemu/tree/amd-snp-202402240000 Fixes: #9173 Signed-Off-By: Ryan Savino <ryan.savino@amd.com>	2024-02-28 09:28:14 -06:00
Fabiano Fidêncio	068d80a9cb	docs: releases: Update link for the release actions This allows users to go directly to the action page whenever a release needs to be cut. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-28 12:34:56 +01:00
Fabiano Fidêncio	520cd90c43	release: Remove the "test-" from the release version This is not needed anymore as we can run the tests from any branch, and we can patch this locally before doing a test. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-28 12:34:56 +01:00
Fabiano Fidêncio	22b19d0637	release: Add a step to get the release tags GitHub actions is fun and always willing to play tricks with us. This nice little kid decided that `echo "FOO=\"bar zaz\"" >> $GITHUB_ENV` is not valid, and it simply breaks things in a way that is a pain to debug. But hey, we take this path, and after doing so I realised that the correct way to export that is `echo "FOO=bar zaz" >> $GITHUB_ENV`. I know, this looks incorrect, but this fellow never stops surprising us. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-28 12:34:56 +01:00
Fabiano Fidêncio	cdf1e4afde	release: Fix typo in the arm arch For some reason I'd changed arm64 to arm4 in a previous (already merged) commit. :-/ Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-28 12:34:56 +01:00
Fabiano Fidêncio	3db0630bc1	release: Add our own bits to the release notes I'm getting here the most relevant parts of what we had as part of the release-notes.sh script. As the script will not be used anymore, it's been removed. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-28 12:34:56 +01:00
Fabiano Fidêncio	aaf38aca98	release: Fix typo in the _upload_libseccomp_tarball() RELEASE_VERSIOB -> RELEASE_VERSION Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-28 12:34:56 +01:00
Fabiano Fidêncio	397167836b	release: Fix yq installation For some reason we need to force its installation in the GOPATH, otherwise yq is not found. Ideally we should switch to a packaged version of yq, but that's a topic for another series. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-28 12:34:55 +01:00
Fabiano Fidêncio	6915131adc	release: Fix KATA_DEPLOY_{IMAGE_TAGS,REGISTRIES} declaration Otherwise we may end up with an unbound variable. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-28 12:34:55 +01:00
Fabiano Fidêncio	757f958943	release: Adjust tags used to publish our daemonset We need to adjust the tags as when this workflow ends up being called from the release side, we'll receive "refs/tags/main" as the GITHUB_REF, and in that case we must use the release version. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-28 12:34:51 +01:00
Fabiano Fidêncio	d339366a16	release: Get the release version from our internal function This is utterly counter intuitive, but if we change a file during the GitHub Action, the checkout done for the next workflow won't have that file updated, but rather the branch on its original state when the workflow was created. This makes us safe to always "calculate" the next release version from the VERSION file at the time the workflow was triggered. This requires us to have the release type exported for the whole workflow. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-28 12:30:06 +01:00
Fabiano Fidêncio	8023d64b1a	release: Adjust "needs" in the release workflow Without those we'll end up running steps in parallel that should actually wait for a previous step to be completed. While here, let's also correct some of the "needs" that were waiting for the wrong workflow to be finished. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-28 12:30:06 +01:00
Fabiano Fidêncio	d10b818de5	release: Add missing return to _check_required_env_var() Otherwise none of the calls to this function will actually continue after it's called. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-28 12:30:06 +01:00
Fabiano Fidêncio	0aa82e7050	release: Add missing env vars to _check_required_env_var() We missed doing this as part of `50011e89a0`, but we also need to check for: * RELEASE_VERSION * GH_TOKEN * ARCHITECTURE * KATA_STATIC_TARBALL While here, let's fix a ARCHITECURE -> ARCHITECTURE typo. Fixes: #9064 -- part II Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-28 12:30:05 +01:00
Chengyu Zhu	bb4c608b32	Merge pull request #9110 from ChengyuZhu6/agent_option agent: Add all agent configuration options to README	2024-02-28 18:50:44 +08:00
Dan Mihai	352e2af5f0	Merge pull request #9153 from microsoft/danmihai1/clh-bootVM-timeout runtime: clh: minimum 10s timeout for CreateVM + BootVM	2024-02-27 09:58:01 -08:00
Wainer dos Santos Moschetta	b44e0c4e7c	gha: k8s: prepare AKS workflow to install the CoCo KBS Changed the "run k8s tests on AKS" workflows to get the CoCo KBS installed so that we can run attestation tests. The plan is to run attestation tests only on a subset of non-TEE jobs initially, so this commit restricts the KBS installation to the kata-qemu configuration. At this point only stub commands are added to tests/integration/kubernetes/gha-run.sh; they should be implemented in a future commit. Fixes #9058 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-02-27 13:51:15 -03:00
Wainer Moschetta	6186410e35	Merge pull request #8949 from wainersm/tests_nydus tests/nydus: refactor the teardown()	2024-02-27 09:52:44 -03:00
ChengyuZhu6	731c490ded	agent: Add all agent configuration options to README Add all agent configuration options to README so that users can more easily understand what these options do and how to configure them at runtime. Fixes: #9109 Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-02-27 17:35:19 +08:00
Fabiano Fidêncio	4aa40f1bbb	Merge pull request #9146 from fidencio/topic/releases release: Update everything in this repo related to the release and its process	2024-02-27 10:30:49 +01:00
Fabiano Fidêncio	111bb3ec66	release: Add "test-" into the release name This commit should be merged as it's now, then we trigger a test release, fix whatever has to be fixed, and drop it as soon as we know our workflows are working as expected. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-27 08:34:03 +01:00
Fabiano Fidêncio	d69766c0b2	docs: Update the release process Now that we've simplified it by quite a lot, let's update the documentation accordingly. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-27 08:34:03 +01:00
Fabiano Fidêncio	a85481110a	releases: Remove scripts that won't be used anymore Those are not needed anymore as we're automating our release process around GitHub actions. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-27 08:34:03 +01:00
Fabiano Fidêncio	e714c37521	gha: Remove workflows related to backporting stuff We're not doing backports anymore, as we're getting rid of the stable branches in favour of having a better release cadence from the main branch. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-27 08:34:02 +01:00
Fabiano Fidêncio	3229c777e7	kata-deploy: Remove "stable" yamls As we're not maintaining a stable branch anymore, let's get rid of the kata-deploy stable pieces. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-27 08:34:02 +01:00
Fabiano Fidêncio	008293f015	gha: Add release-{major,minor} workflows Those will allow us to cut a release just by a single click, instead of the current process we have. Fixes: #9064 -- part I Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-27 08:34:02 +01:00
Fabiano Fidêncio	f9f04dca2b	gha: release: Update the workflow The release workflow is now updated to be a `workflow_call`, and it includes the steps that had to be manually done in the past, such as updating the needed files and creating the release itself. While on this, the kata-deploy multiarch manifest tags have been updated to match the new release scheme. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-27 08:34:02 +01:00
Fabiano Fidêncio	f0675a163a	release: Add _next_release_version() This function returns the version of the next release (the one about to be cut), and it'll be used as part of our new workflow that will take care of the release. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-27 08:34:02 +01:00
Fabiano Fidêncio	4675364d8d	release: Add _update_version_file() function Let's add a function that will be responsible for bumping the project's version in the VERSION file, and push it to the branch as part of the release process that will be introduced. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-27 08:34:02 +01:00
Fabiano Fidêncio	a99f9026e1	release: Add _create_new_release() This is a helper function that will be used to create a new release as part of our release process workflow (which will still be modified). Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-27 08:34:02 +01:00
Fabiano Fidêncio	fd699625fe	release: Add _upload_libseccomp_tarball() As the name of the function says, it's responsible for uploading the libseccomp source tarballs as part of our release process. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-27 08:34:02 +01:00
Fabiano Fidêncio	d517fa54ac	release: Add _upload_vendored_code_tarball() As hinted by the name of the function, this is used to generate and upload the vendored code we have as its own tarball. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-27 08:34:02 +01:00
Fabiano Fidêncio	94b30fcb14	release: Add _upload_versions_yaml_file() As the name says, this function will be used to upload the versions.yaml file during a given release process of the project. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-27 08:34:02 +01:00
Fabiano Fidêncio	50011e89a0	release: Add _upload_kata_static_tarball This function, as its name says, will be used to upload the kata-static.tar.xz tarballs generated during the release process. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-27 08:34:02 +01:00
Fabiano Fidêncio	a45988766c	release: Add _publish_multiarch_manifest() This function, as its name says, will be used to publish multiarch manifests for the Kata Containers CI and Kata Containers releases. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-27 08:34:01 +01:00
Fabiano Fidêncio	fb2ef32c04	release: Introduce the release.sh helper For now this script does nothing, but we're introducing it in order to reduce the diffs for the next commits in this series. My intention is to have as much as possible related to the release as part of this helper script, and it'll be populated function by function while replacing content that's "hard coded" (and duplicated) on different GitHub actions. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-27 08:34:01 +01:00
GabyCT	1a6c378d26	Merge pull request #9161 from GabyCT/topic/testsreadme docs: Update link for tests in README	2024-02-26 14:50:46 -06:00
Gabriela Cervantes	94615a4fd4	docs: Update link for tests in README This PR updates the link for the tests in README for Kata Containers. Fixes #9160 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-02-26 15:43:33 +00:00
Wainer dos Santos Moschetta	0f8c36d990	tests/nydus: refactor the teardown() This refactors the teardown() of tests/integration/nydus/nydus_tests.sh: * Moved the boilerplate code that kills processes into a loop; * Doesn't leave teardown() if a process failed to get killed, so that the other clean-up routines are run; * Checks that the pid exists before attempting to kill the process, to avoid this misleading message: ``` Usage: kill [options] <pid> [...] Options: <pid> [...] send signal to every <pid> listed -<signal>, -s, --signal <signal> specify the <signal> to be sent -q, --queue <value> integer value to be sent with the signal -l, --list=[<signal>] list all signal names, or convert one to a name -L, --table list all signal names in a nice table -h, --help display this help and exit -V, --version output version information and exit For more details see kill(1). ``` Fixes #8948 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-02-26 11:21:43 -03:00
Wainer dos Santos Moschetta	0f0ce9a81b	tests/runk: replace the busybox image It's recommended to avoid images from docker.io to avoid errors related with hitting the pull limits that happens mostly on bare-metal machines. So this replaced the docker.io's busybox with quay.io/prometheus/busybox. Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-02-26 11:11:05 -03:00
Wainer dos Santos Moschetta	bba8b5b2b4	tests/runk: fix flaky test The "run ps command" test has failed once in a while because it doesn't wait for the sh command to start within the container, consequently `ps` won't report the number of lines expected. Fixes #8975 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com> Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-02-26 11:09:29 -03:00
Wainer dos Santos Moschetta	28a63070f7	gha: fix step name in run-runk-tests Likely copied from the tracing workflow by mistake. Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-02-26 11:09:29 -03:00
Wainer dos Santos Moschetta	8a606eb94d	tests/runk: convert to bats Migrated runk tests from pure shell script to bats to be consistent with other test suites. The install_dependencies() will install the bats tool locally. Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-02-26 11:09:23 -03:00
Xuewei Niu	bb5e33b33a	Merge pull request #9100 from littlejawa/fix_5738_metrics_memory runtime: remove kata_shim_netdev metric	2024-02-26 19:01:21 +08:00
James O. D. Hunt	0ea30f44cf	Merge pull request #9076 from jodh-intel/add-survey-link-to-release-notes packaging: release notes: Don't show shortlist by default, and add survey link	2024-02-26 10:25:19 +00:00
Steve Horsman	483ecbadf0	Merge pull request #9142 from ChengyuZhu6/protoc build-checks: Install protoc in the ci environments	2024-02-26 09:52:31 +00:00
Dan Mihai	f4509b806b	runtime: clh: minimum 10s timeout for CreateVM + BootVM Relax the timeout for calling CLH's CreateVM + BootVM APIs. When hitting the older 1s timeout, killing a half-booted Guest and retrying the same boot sequence could have been wasteful and could have resulted in unstable CI testing on slower Hosts. Fixes: #9152 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-24 19:15:57 +00:00
GabyCT	4f3c83cd12	Merge pull request #9115 from GabyCT/topic/adddief scripts: Add an enhanced die function	2024-02-23 12:03:02 -06:00
Saul Paredes	9b7bd376eb	genpolicy: panic when we see a volume mount subpath Based on https://github.com/kata-containers/runtime/issues/2812 Fixes: #9145 Signed-off-by: Saul Paredes <saulparedes@microsoft.com>	2024-02-23 09:56:51 -08:00
James O. D. Hunt	8c72abe38d	packaging: Add link to survey in release notes Add a link in the release notes to the Kata Container survey, to advertise it, and hopefully encourage users to take the survey. Fixes: #9074. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2024-02-23 09:57:52 +00:00
James O. D. Hunt	0391c0de82	packaging: Add twistie to release notes shortlog Add a "twistie" / arrow (`▶`) that the user can click on to see the full list of commits _if they want to_. This way, the release notes become easier to read and we can display information below the shortlog which would (probably) normally not be seen due to the huge long list of commits. Fixes: #9075. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2024-02-23 09:57:52 +00:00
ChengyuZhu6	3cc55ff8af	build-checks: Install protoc in the ci environments To test PR #8484 for pulling image in the guest with image-rs, the compilation process for the kata-agent relies on protoc: https://github.com/kata-containers/kata-containers/actions/runs/8016317290/job/21898040849?pr=8484 https://github.com/kata-containers/kata-containers/actions/runs/8016534530/job/21898654435?pr=8484 Fixes: #9141 Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-02-23 17:38:13 +08:00
Xuewei Niu	89c76d7d8d	Merge pull request #9125 from gkurz/fix-agent-cgroup-ns agent: Run container workload in its own cgroup namespace (cgroup v2 guest only)	2024-02-23 10:40:17 +08:00
Steve Horsman	e342a9adc4	Merge pull request #9119 from ChengyuZhu6/pause-confidential kata-deploy: Add pause image to confidential rootfs	2024-02-22 17:10:55 +00:00
Steve Horsman	531dcd2f25	Merge pull request #9132 from ChengyuZhu6/nydus-snapshotter-version gha: bump nydus snapshotter version to v0.13.8	2024-02-22 17:10:42 +00:00
Steve Horsman	dfa6e932bb	Merge pull request #9122 from ChengyuZhu6/snapshotter-clean gha: try to cleanup nydus snapshotter before deploying it	2024-02-22 13:30:04 +00:00
Julien Ropé	1c306fe4a6	runtime-rs: stop reporting net dev metrics for the shim For consistency with the go runtime. As the shim itself is not using the network (all its communication with other processes is done with local unix sockets), there is no reason to keep gathering and reporting shim-specific network metrics. Actual network usage of the kata containers can be found from the existing agent network metrics (kata_guest_netdev_stat). Signed-off-by: Julien Ropé <jrope@redhat.com>	2024-02-22 14:00:00 +01:00
Julien Ropé	9de65707ca	runtime: stop reporting net dev metrics for the shim As part of the shim network metrics, the shim is reporting network interfaces from the host with no namespace isolation - this gives insight in interfaces not tied to the kata containers, and causes an increase in resource usage for kata metrics. As the shim itself is not using the network (all its communication with other processes is done with local unix sockets), there is no reason to keep gathering and reporting shim-specific network metrics. Actual network usage of the kata containers can be found from the existing hypervisor network metrics (kata_hypervisor_netdev) and from the agent network metrics (kata_guest_netdev_stat). Fixes: #5738 Signed-off-by: Julien Ropé <jrope@redhat.com>	2024-02-22 14:00:00 +01:00
ChengyuZhu6	8ab3894dc5	gha: try to cleanup nydus snapshotter before deploying it CI failed to deploy nydus snapshotter because it was not cleaned up last time. So we can try to cleanup nydus snapshotter before deploying it. Fixes: #9121 Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com> Co-authored-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-22 18:51:14 +08:00
Alex Lyn	5d3ae360ed	Merge pull request #9130 from Apokleos/bugfix-dragonball-invalidOperation runtime-rs: bugfix for GPU passthrough failed with InvalidOperation.	2024-02-22 17:47:09 +08:00
ChengyuZhu6	f16f709a5e	kata-deploy: Add pause image to confidential rootfs For confidential containers, the pause image needs to be installed in the rootfs. Fixes: #9118 Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-02-22 15:41:16 +08:00
ChengyuZhu6	d8db3fb17f	gha: bump nydus snapshotter version to v0.13.8 Bump nydus snapshotter version to v0.13.8 to fix the bug in v0.13.7 : https://github.com/containerd/nydus-snapshotter/pull/582 Fixes: #9131 Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-02-22 15:35:08 +08:00
Alex Lyn	014e0f4e46	runtime-rs: bugfix for GPU passthrough failed with InvalidOperation. We need to initialize pci_hotplug_enabled to true before we do GPU passthrough with runtime-rs/dragonball. Otherwise it fails with error `InvalidOperation`. Fixes: #9129 Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2024-02-22 10:22:32 +08:00
Dan Mihai	58fbb9f6ec	Merge pull request #9073 from microsoft/danmihai1/test-genpolicy3 tests: k8s: generated policy for additional tests	2024-02-21 14:11:51 -08:00
Dan Mihai	b3c3f992ab	tests: k8s: common clean-up on teardown teardown() gets executed after each test case, so there is no need to clean-up before teardown. Fixes: #9072 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-21 18:08:08 +00:00
Dan Mihai	9c164698d3	tests: k8s: k8s-optional-empty-configmap policy Auto-generate policy for k8s-optional-empty-configmap.bats. Fixes: #9072 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-21 18:08:08 +00:00
Dan Mihai	74a52c6d25	tests: k8s: k8s-oom.bats auto-generated policy Auto-generate policy for k8s-oom.bats. Fixes: #9072 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-21 18:08:08 +00:00
Dan Mihai	26a77d67f4	tests: k8s: k8s-number-cpus auto-generated policy Auto-generate policy for k8s-number-cpus. Fixes: #9072 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-21 18:08:08 +00:00
Dan Mihai	9cbdce15fd	tests: k8s: k8s-memory.bats auto-generated policy Auto-generate policy for k8s-memory.bats. Fixes: #9072 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-21 18:08:08 +00:00
Dan Mihai	40209cc0b7	tests: k8s: k8s-limit-range auto-generated policy Auto-generate policy for k8s-limit-range.bats. Also, fix teardown() namespace. Fixes: #9072 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-21 18:08:08 +00:00
Dan Mihai	df3c0318c6	tests: k8s: add set_namespace_to_policy_settings Add set_namespace_to_policy_settings() for changing the pod namespace in genpolicy settings. Fixes: #9072 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-21 18:08:08 +00:00
Dan Mihai	6e14ce93c9	tests: k8s-kill-all-process-in-container policy Auto-generate policy for k8s-kill-all-process-in-container.bats. Fixes: #9072 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-21 18:08:07 +00:00
Dan Mihai	fad7ba0aea	tests: k8s: k8s-job.bats auto-generated policy Auto-generate policy for k8s-job.bats. Fixes: #9072 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-21 18:08:07 +00:00
Dan Mihai	41c2bcbdc5	tests: k8s: k8s-file-volume auto-generated policy Auto-generate policy for k8s-file-volume.bats. Fixes: #9072 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-21 18:08:07 +00:00
Dan Mihai	d84f50db5b	genpolicy: fix typo in policy logging Improve logging, for easier debugging. Fixes: #9072 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-21 18:08:07 +00:00
Dan Mihai	81e641814f	tests: k8s: k8s-cpu-ns auto-generated policy Auto-generate policy for k8s-cpu-ns.bats. Fixes: #9072 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-21 18:08:07 +00:00
Dan Mihai	bc6d3fc238	tests: k8s: k8s-env.bats auto-generated policy Auto-generate policy for k8s-env.bats. Fixes: #9072 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-21 18:08:07 +00:00
Dan Mihai	0a4fc071ac	tests: k8s: k8s-custom-dns auto-generated policy Auto-generate policy for k8s-custom-dns.bats. Fixes: #9072 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-21 18:08:07 +00:00
Dan Mihai	f693f49e92	tests: k8s: k8s-credentials-secrets policy Auto-generate policy for k8s-credentials-secrets.bats. Fixes: #9072 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-21 18:08:07 +00:00
Dan Mihai	d3d27bbb5b	tests: k8s: k8s-configmap auto-generated policy Auto-generate policy for k8s-configmap.bats. Fixes: #9072 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-21 18:08:07 +00:00
Dan Mihai	b318535536	tests: k8s: auto-generate k8s-caps.bats policy Auto-generated policy for k8s-caps.bats. Fixes: #9072 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-21 18:08:07 +00:00
Greg Kurz	600b951afd	agent: Run container workload in its own cgroup namespace When cgroup v2 is in use, a container should only see its part of the unified hierarchy in `/sys/fs/cgroup`, not the full hierarchy created at the OS level. Similarly, `/proc/self/cgroup` inside the container should display `0::/`, rather than a full path such as : 0::/kubepods.slice/kubepods-besteffort.slice/kubepods-besteffort-podde291f58_8f20_4d44_aa89_c9e538613d85.slice/crio-9e1823d09627f3c2d42f30d76f0d2933abdbc033a630aab732339c90334fbc5f.scope What is needed here is isolation from the OS. Do that by running the container in its own cgroup namespace. This matches what runc and other non VM based runtimes do. Fixes #9124 Signed-off-by: Greg Kurz <groug@kaod.org>	2024-02-21 13:14:13 +01:00
Greg Kurz	14886c7b32	agent: lint code Run cargo-clippy to reduce noise in actual functional changes. Signed-off-by: Greg Kurz <groug@kaod.org>	2024-02-21 13:14:13 +01:00
ChengyuZhu6	cddaf2ce97	kata-deploy: Remove specific kernel/initrd/image leftovers in Makefile Remove specific kernel/initrd/image leftovers in Makefile of local-build, which is the part of #9026. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-02-21 18:24:10 +08:00
Chelsea Mafrica	241a56989a	Merge pull request #9090 from GabyCT/topic/pulldockerimage gha: docker: Pull docker image as part of the dependencies	2024-02-20 14:28:53 -08:00
GabyCT	ea78013c7e	Merge pull request #9079 from GabyCT/topic/removecilink docs: Update CI link into the README	2024-02-20 14:11:13 -06:00
GabyCT	64c09fe6c5	Merge pull request #9088 from GabyCT/topic/fixnydus gha: nydus: Fix indentation in gha run script	2024-02-20 14:09:54 -06:00
Gabriela Cervantes	ff8a6fa9ef	scripts: Add error script This PR adds the error script to display the error message with much more information to help debugging. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-02-20 18:30:03 +00:00
Gabriela Cervantes	43a46d5a6b	scripts: Add an enhanced die function This PR adds an enhanced die function in order to dump more information in a yaml format that will help with the debugging. Fixes #9105 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-02-20 18:27:44 +00:00
Archana Shinde	6d84fe3a37	Merge pull request #8647 from amshinde/cleanup-network Cleanup network to make sure physical interfaces are restored back to original host driver.	2024-02-20 08:59:53 -08:00
Archana Shinde	6d38fa1530	network: Try removing as many changes as possible during network cleanup In case an error is encountered while removing a network endpoint during network cleanup, we currently return immediately with the error. With this change, in case of error we simply log the error and proceed towards removing the next endpoint. With this, we can cleanup the network changes made by the shim as much as possible. This is especially important when multiple interfaces are passed to the network namespace using a network plugin like multus. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2024-02-20 06:08:05 -08:00
Archana Shinde	b005cda689	network: Move up defer block to cleanup network Move the defer for cleaning up network before the call to add network. This way, any change made by add network is reverted in case of failure. This is particularly important for physical network interfaces as with this step we make sure that driver for the physical interface is reverted back to the original host driver. Without this the physical network interface will remain bound to vfio. Fixes: #8646 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2024-02-20 06:06:42 -08:00
Ryan Savino	61ce7455c5	Merge pull request #9086 from niteeshkd/nd_snp_upm packaging: qemu-snp-experimental: support host kernel with gmem	2024-02-19 10:50:13 -06:00
Fabiano Fidêncio	79dc6e95d1	Merge pull request #9108 from fidencio/topic/ci-k8s-fix-wrong-logic-on-confidential-tests ci: k8s: Fix checks used to skip confidential tests	2024-02-19 12:49:57 +01:00
Xuewei Niu	f9307f6852	Merge pull request #9112 from ChengyuZhu6/vendor runtime: fix checksum mismatch error in `make vendor`	2024-02-19 10:54:38 +08:00
ChengyuZhu6	96c297cb37	runtime: fix checksum mismatch error in `make vendor` Fix checksum mismatch error in `make vendor`. Fixes: #9111 Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-02-18 22:22:38 +08:00
Fabiano Fidêncio	3468ac3b6e	ci: k8s: Fix checks used to skip confidential tests This has been introduced by `53bc4a432b`, where the condition was changed. The correct condition is: * If the list of supported tees does not contain the kata hypervisor and the list of supported non tees does not contain the kata hypervisor. The error is that we were checking whether kata-hypervisor would contain the list of supported tees, and that would almost always be false (unless in the case where the list had one and only one element). Fixes: #9055 -- part II Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-18 10:10:45 +01:00
Niteesh Dubey	0538bbfc49	packaging: qemu-snp-experimental: support host kernel with gmem This is required to allow creation of SNP coco on host kernel (e.g. https://github.com/AMDESE/linux ,branch:snp-host-latest) supporting guest private memory for SNP using gmem. Note: This qemu does not work if the host kernel does not support gmem/UPM. Fixes: #9092 Signed-off-by: Niteesh Dubey <niteesh@us.ibm.com>	2024-02-15 16:33:46 +00:00
Wainer Moschetta	db744aa8d2	Merge pull request #9023 from ldoktor/webhook-path tools.kata-webhook: Fix lib path	2024-02-15 12:34:01 -03:00
Fabiano Fidêncio	28b4e5ce51	Merge pull request #9099 from BbolroC/skip-k8s-sandbox-vcpus-allocation-s390x CI\|k8s: Skip vcpu allocation test for s390x	2024-02-15 16:05:18 +01:00
James O. D. Hunt	d1513b2030	Merge pull request #9091 from jodh-intel/packaging-add-kata-manager-script packaging: Add the kata manager script	2024-02-15 13:08:36 +00:00
Hyounggyu Choi	8b3f7f353d	CI\|k8s: Skip vcpu allocation test for s390x A test `vcpu allocation k8s test` exhibits different behavior on s390x For more details, please refer to issue #9093. This commit is to make the test skipped until the issue is resolved on the platform. Fixes: #9093 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-02-15 12:26:35 +01:00
Fabiano Fidêncio	9178541dfb	Merge pull request #9098 from fidencio/topic/runtime-update-runc-to-v1.1.12 runtime: Update runc to v1.1.12	2024-02-15 09:29:10 +01:00
Fabiano Fidêncio	eea4277fbf	runtime: Update runc to v1.1.12 Although we don't seem to be affected by https://nvd.nist.gov/vuln/detail/CVE-2024-21626, we vendor and use the runc package in a few different places of our code, and we better update the package to its latest release. Fixes: #9097 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-14 23:13:39 +01:00
James O. D. Hunt	8c51e02f55	packaging: Add the kata manager script Add `kata-manager.sh` to the release packages. Fixes: #9066. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2024-02-14 17:44:42 +00:00
James O. D. Hunt	e49aeec97f	packaging: Use variable for default binary permissions Create a variable for the default binary permissions. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2024-02-14 17:44:35 +00:00
James O. D. Hunt	cc2d96671f	packaging: Remove extraneous whitespace Remove some unnecessary whitespace from a couple of `kata-deploy` files. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2024-02-14 17:44:08 +00:00
Fabiano Fidêncio	c95c37d2ab	Merge pull request #9026 from fidencio/topic/packaging-remove-tee-specific-leftovers packaging: Remove leftovers from the transition from TEE specific kernel / initrd / image to the "confidential" ones	2024-02-13 22:14:26 +01:00
GabyCT	9cf343779f	Merge pull request #9062 from GabyCT/topic/nonteet tests: Add ability to run non-TEE environments	2024-02-13 14:28:07 -06:00
Fabiano Fidêncio	74c8d243ea	versions: Remove TEE specific kernels We've switched to using the confidential one, instead. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-13 19:07:33 +01:00
Fabiano Fidêncio	adbe24c283	versions: Remove non-used tdx / sev image and initrd entries We've switched to using the confidential ones, instead. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-13 19:07:33 +01:00
Fabiano Fidêncio	6c3338271b	packaging: kernel: Remove sev/snp/tdx specific stuff Now we're using a "confidential" image that has support for all of those. Fixes: #9010 -- part II #8982 -- part II #8978 -- part II Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-13 19:07:33 +01:00
Gabriela Cervantes	598c77409a	gha: docker: Pull docker image as part of the dependencies This PR pulls the docker image needed for the test as part of the dependencies in order to avoid timeout failures, which happen mainly because the image was not properly downloaded and the test is unable to find it. Fixes #9089 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-02-13 17:48:31 +00:00
Gabriela Cervantes	53bc4a432b	tests: Add ability to run non-TEE environments This PR adds the ability to run k8s confidential tests in a non-TEE environment. Fixes #9055 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-02-13 17:27:55 +00:00
Fabiano Fidêncio	14f4480f12	packaging: Remove specific TEEs image / initrd leftovers Let's remove the targets as those are not built anymore as part of our CI. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-13 18:03:12 +01:00
Fabiano Fidêncio	0c761f14b3	packaging: Remove specific TEEs kernel leftovers Let's remove the targets as those are not built anymore as part of our CI. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-13 18:03:11 +01:00
Fabiano Fidêncio	28488f0790	Merge pull request #9082 from fidencio/topic/cleanup-kata-deploy-leftovers-before-start-a-test tests: Remove kata-deploy-tdx test and ensure kata-deploy is always cleaned up before starting the tests	2024-02-13 18:01:16 +01:00
Gabriela Cervantes	54d1f34650	gha: nydus: Fix indentation in gha run script This PR fixes the indentation in gha run script for nydus. Fixes #9087 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-02-13 16:53:28 +00:00
Fabiano Fidêncio	a867e19da1	gha: tdx: Stop running kata-deploy tests on TDX We only have one TDX machine, let's not make it busier than needed. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-13 14:14:57 +01:00
Fabiano Fidêncio	3877a9f49a	ci: Clean up kata-deploy ds before starting the tests This will ensure no leftovers are in the node, which has been causing the TDX CI to fail every now and then. Fixes: #9081 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-13 14:10:44 +01:00
Fabiano Fidêncio	8fe7349d3e	Merge pull request #9080 from fidencio/topic/dont-add-the-pause-image-to-the-released-tarball release: Don't ship the pause-image / coco-guest-components as part of the release artefacts	2024-02-13 12:34:29 +01:00
Fabiano Fidêncio	443a5b8327	release: Don't ship the coco-guest-components In the same way that it doesn't make sense to ship the pause-image, it also doesn't make sense to ship the coco-guest-components itself as part of a release artefact. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-13 09:47:26 +01:00
Fabiano Fidêncio	0462b33a5b	release: Don't ship the pause-image It doesn't make sense to ship the pause-image itself as a release artefact. The reason we build it and cache it is in order to use it inside the rootfs, and that's it, there's no need to ship it as part of the release, at all. Fixes: #9032 -- part II Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-13 09:45:50 +01:00
GabyCT	00be9ae872	Merge pull request #9070 from microsoft/danmihai1/debug-containers tests: k8s: avoid deleting unrelated pods	2024-02-12 15:24:15 -06:00
Gabriela Cervantes	69b325a31c	docs: Update CI link into the README This PR updates the CI link into the README as currently we are using GHA workflows and they are now part of the kata containers repository. Fixes #9078 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-02-12 20:53:25 +00:00
Greg Kurz	532567bfe9	Merge pull request #8936 from fidencio/topic/fix-cri-o-ci tests: cri-o: Use packages from pkgs.k8s.io	2024-02-12 10:04:53 +01:00
Dan Mihai	42d13a0f33	Merge pull request #9068 from microsoft/danmihai1/dockerfile-linux-musl-gcc tools: avoid rootfs-image build "ln -s" error	2024-02-11 18:02:53 -08:00
Greg Kurz	d7afd31fd4	Merge pull request #8455 from BbolroC/runtime-rs-qemu-config runtime-rs: Add a new config option for QEMU	2024-02-10 08:48:23 +01:00
Dan Mihai	a21ca9b7c9	tests: k8s: avoid deleting unrelated pods Delete the debugger pod created during the test, rather than already existing debugger pods. Also, send the output of "kubectl delete" to stderr, just in case it's useful for debugging. Fixes: #9069 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-09 22:48:41 +00:00
Dan Mihai	a054462eb7	Merge pull request #9051 from microsoft/danmihai1/k8s-copy-file tests: k8s: k8s-copy-file auto-generated policy	2024-02-09 12:30:49 -08:00
Hyounggyu Choi	05c4c8055c	runtime-rs: Configure argument replacement for QEMU in Makefile Last but not least, all placeholders for argument replacement should be configured to generate a configuration file when `QEMUCMD` is defined. This enriches those variables. Additionally, this involves creating a symbolic link to `configuration-qemu.toml` if QEMU is defined as the default hypervisor. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-02-09 19:31:20 +01:00
Dan Mihai	fcd005774d	tools: avoid rootfs-image build "ln -s" error Avoid error when building for amd64 using: USE_CACHE=no AGENT_POLICY=yes DEBUG=1 \ tools/packaging/kata-deploy/local-build/kata-deploy-binaries.sh \ --build=rootfs-image Fixes: #9067 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-09 17:10:35 +00:00
GabyCT	b8f277676f	Merge pull request #9047 from GabyCT/topic/ukd docs: Remove jenkins reference in kernel documentation	2024-02-09 10:58:06 -06:00
Fabiano Fidêncio	e78a951e03	Merge pull request #8585 from ChengyuZhu6/dependencies-for-guest-pull gha: Setup nydus snapshotter for CoCo tests	2024-02-09 16:45:42 +01:00
Hyounggyu Choi	27cb30d8ce	runtime-rs: Adjust configuration template for runtime-rs There are some variables newly introduced to runtime-rs, such as: - runtime.name - runtime.hypervisor_name - runtime.agent_name - vm_rootfs_driver Additionally some of the placeholders for argument replacement are made hypervisor-specific based on the changes made for dragonball. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-02-09 16:26:59 +01:00
ChengyuZhu6	97fbf360cc	gha: Cleanup nydus snapshotter by the daemonset Cleanup nydus snapshotter by the daemonset. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-02-09 14:47:13 +01:00
ChengyuZhu6	43b04fd0c0	gha: Deploy nydus snapshotter by the daemonset We can use daemonset to deploy nydus snapshotter, which will decrease one manual step both for Kata Containers and Confidential Containers CI. Fixes: #8584 Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-02-09 14:47:09 +01:00
Julien Ropé	236c2c7650	tests: cri-o: Update critools version to 1.29 This will also update the version of crio used in kata-monitor tests. Signed-off-by: Julien Ropé <jrope@redhat.com>	2024-02-09 12:15:55 +01:00
Fabiano Fidêncio	344e0580ca	tests: cri-o: Use packages from pkgs.k8s.io CRI-O has moved, for a long time, towards pkgs.k8s.io, see: https://kubernetes.io/blog/2023/10/10/cri-o-community-package-infrastructure/ With this the OBS repo won't be used anymore. Fixes: #8935 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-09 12:15:55 +01:00
Fabiano Fidêncio	03f7cfd429	Merge pull request #9061 from GabyCT/topic/csk tests:k8s: make add_kernel_initrd_annotations function generic	2024-02-09 10:05:58 +01:00
Fabiano Fidêncio	555784268d	Merge pull request #9031 from ChengyuZhu6/guest-pull-rootfs packaging/osbuilder: allow to pull and unpack pause image	2024-02-08 22:21:44 +01:00
Gabriela Cervantes	0b508f301b	tests:k8s: make add_kernel_initrd_annotations function generic This PR makes the add_kernel_initrd_annotations_to_yaml function more generic so that it can later be used for other components. Fixes #9054 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-02-08 19:30:43 +00:00
Dan Mihai	f139c7dc60	tests: k8s: k8s-copy-file auto-generated policy Auto-generate policy for k8s-copy-file.bats. Fixes: #9050 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-08 13:26:05 +00:00
Dan Mihai	1179306afa	tests: k8s: additional policy testing utilities 1. add_requests_to_policy_settings allows one or more ttrpc requests from the Host to the Guest. Example: add_requests_to_policy_settings "${policy_settings_dir}" \ "ReadStreamRequest" "WriteStreamRequest" 2. add_copy_from_host_to_policy_settings allows executing on the Guest the commands initiated behind the scenes by "kubectl cp" from the Host to the Guest. Example: add_copy_from_host_to_policy_settings "${policy_settings_dir}" 3. add_copy_from_guest_to_policy_settings allows executing on the Guest the commands initiated behind the scenes by "kubectl cp" from the Guest to the Host. Example: add_copy_from_guest_to_policy_settings "${policy_settings_dir}" \ "/tmp/file.txt" Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-08 13:25:41 +00:00
Steve Horsman	b99f574522	Merge pull request #9037 from niteeshkd/nd_SevSnpGuest runtime: fix creation of SEV confidential container on SNP enabled host.	2024-02-08 09:29:20 +00:00
ChengyuZhu6	a43edd0c30	rootfs: Install pause image into rootfs Install the pause image into the confidential rootfs image and initrd. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-02-08 16:49:56 +08:00
Greg Kurz	6ead48ec06	Merge pull request #8986 from pmores/drop-shim-v2-address-value-validation runtime-rs: fix interoperability issues between runtime-rs and cri-o	2024-02-08 09:44:12 +01:00
ChengyuZhu6	42ef6bdcae	osbuilder:rootfs: support to unpack pause image to rootfs This env var will serve us to pass the pause image tarball to the rootfs builder, which will then just unpack the content into the rootfs. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com> Co-authored-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Co-authored-by: Wang, Arron <arron.wang@intel.com> Co-authored-by: stevenhorsman <steven@uk.ibm.com> Co-authored-by: Jakob Naucke <jakob.naucke@ibm.com>	2024-02-08 16:29:36 +08:00
ChengyuZhu6	53183cba31	workflow: Enable to build pause image in ci Enable building the pause image static tarball for confidential containers cases in the CI environment. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-02-08 11:23:23 +08:00
ChengyuZhu6	70a84eca9e	packaging: allow to pull and unpack pause image For the Confidential Containers stack, the pause image is managed by the host side, which may then configure a malicious pause image, so we need to package a pause image inside the rootfs and not use the pause image from the host. But the installation of skopeo is not included in the 20.04 release, so we cannot directly install skopeo in the rootfs and pull the pause image. So I plan to make this task a static build step, which would not be influenced by the system version in the rootfs. And the pause image will be part of the Kata Containers rootfs that's used by the Confidential Containers usecase. This commit enables the component to be built both locally and in our CI environment with the command: make pause-image-tarball. Fixes: #9032 Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com> Co-authored-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Co-authored-by: Wang, Arron <arron.wang@intel.com> Co-authored-by: stevenhorsman <steven@uk.ibm.com> Co-authored-by: Jakob Naucke <jakob.naucke@ibm.com>	2024-02-08 11:23:23 +08:00
Dan Mihai	9a780aa98f	genpolicy: improve logging from ExecProcessRequest Additional logging from the ExecProcessRequest rules, for easier debugging. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-08 02:21:58 +00:00
Dan Mihai	dab567bdfa	genpolicy: add easy way to allow CloseStdinRequest For example, Kata CI's k8s-copy-file.bats transfers files between the Host and the Guest using "kubectl exec", and that results in CloseStdinRequest being called from the Host. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-08 02:21:58 +00:00
Dan Mihai	8401adb113	genpolicy: update default values 1. Remove PullImageRequest because that is not used in the main branch. It was used in the CCv0 branch. 2. Add default false values for the remaining Kata Agent ttrpc requests. These changes don't change the functionality of the auto generated Policy, but they make the Policy text and the logging from the Rego rules easier to understand. Fixes: #9049 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-08 02:21:58 +00:00
Dan Mihai	535db6b29c	Merge pull request #9043 from ChengyuZhu6/assert runtime-rs: fix assert error in `make check`	2024-02-07 18:19:18 -08:00
Dan Mihai	2bb91c9d8f	Merge pull request #8922 from microsoft/danmihai1/k8s-attach-handlers tests: k8s-attach-handlers auto-generated policy	2024-02-07 13:29:50 -08:00
Dan Mihai	01745689e1	Merge pull request #9029 from microsoft/danmihai1/k8s-empty-dirs genpolicy: mount source for non-confidential guest	2024-02-07 11:26:16 -08:00
Dan Mihai	6b5e57f7c7	tests: k8s: address PR review feedback 1. Rename install_kata_common to install_kata_core. 2. Add TODO for better way to install the Kata tools. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-07 18:51:56 +00:00
Steve Horsman	934d8dca0f	Merge pull request #9045 from ChengyuZhu6/nydus-version nydus: Bump nydus snapshotter version to v0.13.7	2024-02-07 17:20:21 +00:00
Pavel Mores	6346e04cf7	runtime-rs: fix handling of TTRPC_ADDRESS Since cri-o doesn't seem to use address for event publishing as mentioned in the previous commit it will not send it. However, the exact way of not sending it is unfortunately different from what is assumed by runtime-rs. Due to an implementation detail of cri-o which uses containerd libraries for some low-level tasks, TTRPC_ADDRESS will not be missing from environment as assumed, instead it will be present with an empty value. This commit contains a small adjustment to account for that and use LogForwarder even if TTRPC_ADDRESS is present, but with an empty value. Fixes #8985 Signed-off-by: Pavel Mores <pmores@redhat.com>	2024-02-07 17:01:04 +01:00
Gabriela Cervantes	ff1ace1c74	docs: Remove jenkins reference in kernel documentation This PR removes the jenkins reference, which is no longer being used, from the kernel documentation. Fixes #9046 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-02-07 15:44:07 +00:00
ChengyuZhu6	d0b8e6d8f3	nydus: Bump nydus snapshotter version to v0.13.7 Bump nydus snapshotter version to v0.13.7. The new release name of nydus snapshotter is `nydus-snapshotter-v0.13.7-linux-amd64.tar.gz`, which differs from the version used by kata (`nydus-snapshotter-v0.12.0-x86_64.tgz`). Therefore, we need to update the script to obtain the correct nydus snapshotter name. Fixes: #9044 Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-02-07 22:17:05 +08:00
ChengyuZhu6	34c47e08b2	runtime-rs: fix assert error in test in `make check` Fix assert error: error: used `assert_eq!` with a literal bool --> crates/hypervisor/src/ch/inner.rs:218:9 \| 218 \| assert_eq!(state.jailed, false); \| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ \| = help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#bool_assert_comparison = note: `-D clippy::bool-assert-comparison` implied by `-D warnings` Fixes: #9042 Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-02-07 19:31:10 +08:00
Archana Shinde	d9ce88ada3	Merge pull request #8704 from amshinde/runtime-rs-clh-implement-persist runtime-rs: implement persist api for cloud-hypervisor	2024-02-07 02:29:33 -08:00
Dan Mihai	dd16bc393f	tests: k8s: k8s-attach-handlers generated policy Automatically generate the test policy for k8s-attach-handlers.bats, if AUTO_GENERATE_POLICY is enabled. Steps: - Create a temporary directory for the current test and copy the common genpolicy settings into this new directory. - Change genpolicy settings in the temp directory to allow the "kubectl exec" command that this test needs. (For CoCo, exec is blocked by the default policy settings) - Auto-generate the policy for the test YAML file. - Test as usual, using the YAML file. - Clean-up the temporary settings described above. Fixes: #8921 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-07 02:26:03 +00:00
Dan Mihai	0de407f8b7	tests: k8s: enable AUTO_GENERATE_POLICY Enable AUTO_GENERATE_POLICY for one of the Kata CI K8s test platforms. Additional platforms will be enabled after testing them. When AUTO_GENERATE_POLICY is enabled, create genpolicy settings that are common for all tests. Some of the tests will make temporary copies of these common settings and customize them as needed. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-07 02:25:54 +00:00
Dan Mihai	05b2e4f606	tests: k8s: install genpolicy Install the genpolicy app before starting test execution. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-07 02:25:42 +00:00
Dan Mihai	8aa8b70573	tests: k8s: add policy test utilities Add script functions useful for auto-generating and testing policy. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-07 02:24:06 +00:00
Dan Mihai	24a17a2e1b	tests: k8s: output the names of test files Output the names of test files, for easier search through logs. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-07 02:23:54 +00:00
Dan Mihai	bf533de31a	tests: k8s: add DEBUG support for test scripts Make these scripts easier to debug. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-07 02:23:46 +00:00
Dan Mihai	1b4ef672ef	tests: k8s: reduce namespace name duplication 1. Avoid repeating "kata-containers-k8s-tests". 2. Allow users to specify a different test namespace. 3. Introduce the TEST_CLUSTER_NAMESPACE variable, that will also be useful when auto-generating the Agent Policy for these tests. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-07 02:23:38 +00:00
Dan Mihai	8a5ba5fb34	tests: k8s: allow run_kubernetes_tests.sh exec Allow everyone to directly execute run_kubernetes_tests.sh, for easier local testing. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-07 02:23:30 +00:00
Fabiano Fidêncio	11ba90ebf2	Merge pull request #8958 from fidencio/topic/kata-manager-nerdctl-support kata-manager: Add support for nerdctl installation	2024-02-06 21:33:48 +01:00
GabyCT	d74b6e143f	Merge pull request #8951 from GabyCT/topic/udf metrics: Update packages for TensorFlow ResNet Int8 Dockerfile	2024-02-06 14:29:41 -06:00
GabyCT	6337f300a8	Merge pull request #8628 from GabyCT/topic/enablek8stclh tests: k8s: Enable tests for cloud hypervisor runtime-rs without devicemapper	2024-02-06 14:28:35 -06:00
Niteesh Dubey	3e383674f8	runtime: fix creation of SEV confidential container on SNP enabled host. This is needed to fix the bug that prevents creating an SEV container on an SNP enabled host. This is a regression that was introduced as part of the following commit: `de39fb7d38` Fixes: #9036 Signed-off-by: Niteesh Dubey <niteesh@us.ibm.com>	2024-02-06 19:01:30 +00:00
Hyounggyu Choi	462afcf829	runtime-rs: Copy configuration for QEMU from runtime It makes sense to reuse a configuration template for runtime-golang as a base. This is simply to copy it into the config directory. Fixes: #8441 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-02-06 19:35:44 +01:00
Fabiano Fidêncio	058f068d67	Merge pull request #9020 from BbolroC/ok-to-test-static-checks-but-x86 gha: Run static-checks on self-hosted runners conditionally	2024-02-06 19:30:21 +01:00
Gabriela Cervantes	cf049fc718	k8s: Skip k8s tests that are not working This PR skips the k8s tests that are not working with cloud hypervisor runtime-rs with its proper issue. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-02-06 16:52:02 +00:00
Pavel Mores	f0256fded5	runtime-rs: remove validation of shim v2 -address value It appears that under the shim v2 protocol, a shim has no use of its own for the -address value, it just passes it back to container runtime's (mostly containerd or cri-o) event-publishing binary. Since the -address value only flows through the shim, being passed to the shim by a container runtime and then essentially passed back by shim to the container runtime, it seems inappropriate for a shim to validate the value that is fully owned and only used by the container runtime. This commit removes such validation from runtime-rs. Doing so, it solves (part of) an interoperability problem between runtime-rs and cri-o. cri-o seems to intentionally choose not to implement the event-publishing part of the shim v2 protocol and thus it has no value it could pass to runtime-rs for -address. As a result, it sends an empty string which has been failing the excessive validation performed by runtime-rs so far. Signed-off-by: Pavel Mores <pmores@redhat.com>	2024-02-06 13:43:09 +01:00
Wainer Moschetta	f1ca5d1563	Merge pull request #8953 from ChengyuZhu6/ci-guest-pull gha: Enable nydus snapshotter in CoCo ci tests	2024-02-06 09:36:59 -03:00
Fabiano Fidêncio	1ccb850ee7	Merge pull request #9027 from fidencio/topic/add-libattest-tdx-into-the-confidential-rootfs rootfs: Add libattest-tdx into the confidential rootfs	2024-02-06 12:52:13 +01:00
Fabiano Fidêncio	ce82b5e3f5	rootfs: Add libtdx-attest into the confidential rootfs This is required as the tdx-attest-rs crate, which is used as part of the guest components, has a runtime dependency on libattest-tdx. Fixes: #9021 -- part II Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-06 09:13:49 +01:00
Xuewei Niu	67d9847fac	Merge pull request #9025 from wainersm/cri-containerd_fix_loop cri-containerd: fix loop in TestContainerMemoryUpdate()	2024-02-06 14:49:57 +08:00
Amulya Meka	354a3093fa	Merge pull request #9019 from Amulyam24/k8s-fix gha: add GOPATH env var to the ppc64le k8s workflow	2024-02-06 11:01:49 +05:30
Alex Lyn	1ab9a21492	Merge pull request #8552 from deagon/fix/missing-port-type runtime: missing port type in the DeviceInfo	2024-02-06 10:56:46 +08:00
Dan Mihai	473efc2149	genpolicy: mount source for non-confidential guest The emergent Kata CI tests for Policy use confidential_guest = false in genpolicy-settings.json. That value is inconsistent with the following mount settings: "emptyDir": { "mount_type": "local", "mount_source": "^$(cpath)/$(sandbox-id)/local/", "mount_point": "^$(cpath)/$(sandbox-id)/local/", "driver": "local", "source": "local", "fstype": "local", "options": [ "mode=0777" ] }, We need to keep those settings for confidential_guest = true, and change confidential_guest = false to use: "emptyDir": { "mount_type": "local", "mount_source": "^$(cpath)/$(sandbox-id)/rootfs/local/", "mount_point": "^$(cpath)/$(sandbox-id)/local/", "driver": "local", "source": "local", "fstype": "local", "options": [ "mode=0777" ] }, The value of the mount_source field is different. This change unblocks testing using Kata CI's pod-empty-dir.yaml: genpolicy -u -y pod-empty-dir.yaml kubectl apply -f pod-empty-dir.yaml k get pod sharevol-kata NAME READY STATUS RESTARTS AGE sharevol-kata 1/1 Running 0 53s Fixes: #8887 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-06 01:19:48 +00:00
Fabiano Fidêncio	ffa190831d	Merge pull request #9022 from fidencio/topic/add-guest-components-to-the-confidential-image-and-initrd rootfs: confidential: Install coco-guest-components	2024-02-05 18:56:48 +01:00
Hyounggyu Choi	40b2b2a43a	gha: Run static-checks on self-hosted runners conditionally Due to the restrictions on instance provisioning for self-hosted runners, performing static checks (36 jobs at the time of writing) on them each time a PR is updated could significantly burden them, consequently slowing down the entire CI system. To address this, the decision is to trigger these checks only when an 'ok-to-test' label is added. Meanwhile, the checks for x86_64, which are supported by GitHub-hosted runners, will remain unchanged. Fixes: #8998 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-02-05 15:24:21 +01:00
Wainer dos Santos Moschetta	106e1af497	cri-containerd: fix loop in TestContainerMemoryUpdate() The loop that generates test cases for virtio-mem enabled/disabled doesn't return the integers '1' and '0' as expected. Instead it returns the strings '{1,' and '0}'. Fixes #9024 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-02-05 10:59:39 -03:00
Fabiano Fidêncio	27e7974048	rootfs: confidential: Install coco-guest-components Let's install the coco-guest-components into the confidential rootfs image and initrd. Fixes: #9021 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-05 14:41:29 +01:00
Fabiano Fidêncio	f80dbcee0e	rootfs: Add logging about the coco guest components This will make our lives easier to figure out whether the components are being installed or not. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-05 14:41:29 +01:00
Fabiano Fidêncio	68b8186ec4	osbuilder: Expose COCOGUEST_COMPONENTS_TARBALL We need to pass this to the container where the rootfs is built, so it can actually be unpacked inside the rootfs. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-05 14:41:28 +01:00
Lukáš Doktor	3b0049b2a4	tools.kata-webhook: Fix lib path When moving the webhook we skipped the common.bash as a (close-enough) version is already in `/tests`, but we forgot to update the source path. This commit fixes it. Fixes: #8653 Signed-off-by: Lukáš Doktor <ldoktor@redhat.com>	2024-02-05 14:17:24 +01:00
Fabiano Fidêncio	64d09874c3	packaging: coco-guest-components: Pass DESTDIR to the build script As DESTDIR was not being passed, we've been installing the final binaries in a container path that was not exposed to the host, leading to creating an empty tarball with the guest components. Now, theoretically, guest-components should respect a PREFIX passed, but that's not the case and we're manually adding "/usr/local/bin" to the passed DESTDIR. Here's the result of the tarball: ```bash ⋊> kata-containers ≡ tar tf build/kata-static-coco-guest-components.tar.xz ./ ./usr/ ./usr/local/ ./usr/local/bin/ ./usr/local/bin/confidential-data-hub ./usr/local/bin/attestation-agent ./usr/local/bin/api-server-rest ``` Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-05 14:07:10 +01:00
ChengyuZhu6	a214bd8d13	gha: Enable nydus snapshotter in CoCo ci tests This PR is a split of #8585. This commit makes the changes to the GitHub workflows and adds the skeletons of deploy_snapshotter() and cleanup_snapshotter() in tests/integration/kubernetes/gha-run.sh. Merging this patch first triggers CI jobs for CoCo, which will begin executing the dummy functions deploy_snapshotter() and cleanup_snapshotter(); the implementation details for these functions remain in #8585. Our subsequent step involves transferring this logic to the PR #8484, enabling the PR to undergo CI testing prior to its merge. Fixes: #8997 Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-02-05 18:51:59 +08:00
Fabiano Fidêncio	1362918ff0	Merge pull request #9011 from fidencio/topic/switch-to-using-the-confidential-rootfs runtime: Replace TEE specific initrd / image for the confidential one	2024-02-05 10:43:12 +01:00
Guoqiang Ding	6068faf40b	runtime: failed to run in the case of ColdPlugVFIO Add the missing port type in the DeviceInfo. Fixes: #9014 Signed-off-by: Guoqiang Ding <dgq8211@gmail.com>	2024-02-05 17:30:11 +08:00
Fabiano Fidêncio	65013205ed	Merge pull request #9005 from ChengyuZhu6/clang static-checks: Install clang in the ci environments	2024-02-05 09:24:51 +01:00
Archana Shinde	b3c74411f6	runtime-rs: Add tests for persist api for clh Add tests to check clh struct is saved/restored correctly. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2024-02-04 22:03:57 -08:00
Archana Shinde	0b78296dca	runtime-rs: Store additional field for hypervisor state Implementing Persist API for cloud-hypervisor was done partially with initial support for cloud-hypervisor. Store and retrieve additional fields to/from the hypervisor state. Fixes: #6202 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2024-02-04 22:03:57 -08:00
Archana Shinde	a5f0b92bca	runtime-rs: Add guest protection to hypervisor state Store guest-protection used while storing the state of the hypervisor. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2024-02-04 22:03:54 -08:00
Alex Lyn	cf74166d75	Merge pull request #9015 from Apokleos/bugfix-exec-uds runtime: display accurate error msg to avoid misleading users.	2024-02-05 13:50:43 +08:00
Amulyam24	e59d005568	gha: add GOPATH env var to the ppc64le k8s workflow The filtering of testing cases installs/uses yq and expects GOPATH to be present. Hence, add it to the workflow. Fixes: #9018 Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2024-02-05 10:30:10 +05:30
Alex Lyn	51a82bec3c	Merge pull request #9012 from deagon/fix/monitor-agent-url kata-monitor: fix agentUrl from containerd shim	2024-02-05 10:41:56 +08:00
ChengyuZhu6	f354beb253	static-checks: Install clang in the ci environments To test PR #8484, the compilation process for the kata-agent relies on clang. Failures have been encountered on the ARM, s390x, and ppc64le architectures: ppc64le: https://github.com/kata-containers/kata-containers/actions/runs/7754082828/job/21146689026?pr=8484 s390x: https://github.com/kata-containers/kata-containers/actions/runs/7754082828/job/21146689401?pr=8484 arm: https://github.com/kata-containers/kata-containers/actions/runs/7754082828/job/21146689026?pr=8484 Fixes: #9004 Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-02-04 17:00:19 +08:00
Alex Lyn	c6830ceb89	runtime: display accurate error msg to avoid misleading users. The original handling method does not meet user expectations. When the ClientSocketAddress method stats the corresponding path of runtime-rs and does not find it, we should return an error message that includes the reason for the failure (an error indicating that both runtime-go and runtime-rs were not found), instead of simply displaying the corresponding path of runtime-rs as the final error message to users. It is also necessary to return the error promptly to the caller for further error handling. Fixes: #8999 Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2024-02-04 16:45:59 +08:00
Xuewei Niu	fa01a86334	Merge pull request #9007 from wainersm/aks_delete_rg gha: delete azure RG only if it exists	2024-02-04 16:34:17 +08:00
Guoqiang Ding	7bf1ebe16d	kata-monitor: fix agentUrl from containerd shim Fix the missing leading slash. Fixes: #9013 Signed-off-by: Guoqiang Ding <dgq8211@gmail.com>	2024-02-04 16:24:13 +08:00
Fabiano Fidêncio	d4a9856a84	gha: Remove SEV / SNP / TDX images / initrds We can remove this now that we're relying on the confidential one. Fixes: #9010 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-03 13:22:07 +01:00
Fabiano Fidêncio	e4258d8694	runtime: Use confidential image / initrd instead of TEE specific ones Now that we have a confidential image / initrd being built, instead of a specific one for each TEE, let's use it everywhere possible. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-03 13:20:14 +01:00
Fabiano Fidêncio	e0bb632053	Merge pull request #8983 from fidencio/topic/add-confidential-image packaging: Add confidential image / initrd	2024-02-03 12:30:16 +01:00
Fabiano Fidêncio	a9f8888c15	packaging: Add confidential image / initrd Let's use a single rootfs image / initrd for confidential workloads, instead of having those split for different TEEs. We can easily do this now as the soon-to-be-added guest-components can be built in a generic way. Fixes: #8982 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-03 00:58:52 +01:00
Fabiano Fidêncio	7ddb2e5999	Merge pull request #8978 from fidencio/topic/use-the-kernel-confidential-when-possible runtime: packaging: Use confidential kernel instead of the TDX one	2024-02-03 00:29:43 +01:00
Fabiano Fidêncio	e9de0ef6b3	packaging: rootfs: Depend on kernel-confidential tarball Now that we're using the kernel-confidential, let the rootfs depend on it, instead of depending on the TEE specific ones. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-02 21:13:41 +01:00
Fabiano Fidêncio	b58cfc765c	packaging: Ensure rootfs is rebuilt in case kernel changes We need to do this in order to ensure that the measured boot will be taking the latest kernel bits, as needed. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-02 21:13:06 +01:00
Fabiano Fidêncio	4394dacb88	packaging: Build the confidential kernel with MEASURED_ROOTFS support This is already done for the TDX kernel, and should have been done also for the confidential one. This action requires us to bump the kernel version as the resulting kernel will be different from the cached one. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-02 21:13:06 +01:00
Fabiano Fidêncio	c7680839f9	packaging: Fix modules tarball for nvidia-gpu-confidential The modules dir has an extra "-nvidia-gpu-confidential" string in its name. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-02 21:13:06 +01:00
Fabiano Fidêncio	dc027e39d6	gha: Remove TEE specific kernel build targets We're using the confidential kernel instead from now on. Fixes: #8981 -- part I Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-02 21:12:41 +01:00
Fabiano Fidêncio	3755c69165	runtime: makefile: remove SNP specific kernel references As this is not used anymore, we can go ahead and just remove it Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-02 21:12:21 +01:00
Fabiano Fidêncio	57b132f94c	runtime: makefile: remove SEV specific kernel references As this is not used anymore, we can go ahead and just remove it Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-02 21:12:21 +01:00
Fabiano Fidêncio	2562d23242	runtime: makefile: remove TDX specific kernel references As this is not used anymore, we can go ahead and just remove it. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-02 21:11:43 +01:00
Fabiano Fidêncio	f4e3c936d8	runtime: snp: config: Use the confidential kernel As we're building a single confidential kernel, we should rely on it rather than keep using the specific ones for TDX / SEV / SNP. However, for debuggability's sake, let's do this change TEE by TEE. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-02 21:11:36 +01:00
Fabiano Fidêncio	8731366d7b	runtime: sev: config: Use the confidential kernel As we're building a single confidential kernel, we should rely on it rather than keep using the specific ones for TDX / SEV / SNP. However, for debuggability's sake, let's do this change TEE by TEE. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-02 21:11:36 +01:00
Wainer dos Santos Moschetta	a04b215bcc	gha: delete azure RG only if it exists delete_cluster() has tried to delete the az resource group regardless of whether it exists. In some cases the result of that operation is ignored, e.g., when it fails because the resource group is not found, but the log messages get a little dirty. Let's delete the RG only if it exists then. Fixes #8989 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-02-02 16:57:20 -03:00
Gabriela Cervantes	eb5b7d3bf8	tests: k8s: Enable tests for cloud hypervisor runtime-rs This PR enables the k8s tests for cloud hypervisor runtime-rs. Fixes #8627 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-02-02 17:58:58 +00:00
Fabiano Fidêncio	6cbdba7268	runtime: tdx: config: Use the confidential kernel As we're building a single confidential kernel, we should rely on it rather than keep using the specific ones for TDX / SEV / SNP. However, for debuggability's sake, let's do this change TEE by TEE. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-02 17:13:06 +01:00
Fabiano Fidêncio	a618461d3a	runtime: Add confidential kernel to the makefile With this we can properly generate and use the `-confidential` kernel, which supports SEV / SNP / TDX, as part of our configuration files. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-02 17:13:05 +01:00
GabyCT	40d9a65601	Merge pull request #8996 from GabyCT/topic/addclhr gha: k8s: Add cloud-hypervisor (runtime-rs) support	2024-02-02 09:48:35 -06:00
Fabiano Fidêncio	741ed1c8bd	Merge pull request #9001 from fidencio/topic/fix-cache-for-confidential-kernel-part-III packaging: Don't build the confidential / sev kernel twice -- part III	2024-02-02 15:19:41 +01:00
Wainer Moschetta	424fbfe58f	Merge pull request #8654 from ldoktor/openshift-tests ci/openshift-ci: Move openshift-ci from the tests repo here	2024-02-02 10:40:30 -03:00
Fabiano Fidêncio	2ff3f0afc6	packaging: Remove trailing whitespace from extra_tarballs arg This was overlooked during the reviews. Fixes: #6415 -- part III Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-02 12:42:02 +01:00
Fabiano Fidêncio	228bc48c73	packaging: Fix kernel confidential name It should be "kernel-confidential" instead of "kernel". Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-02 12:42:02 +01:00
Fabiano Fidêncio	31b21093b0	packaging: Pass the kernel flavour to get_kernel_modules_dir I made this a required argument during the series and ended up forgetting to add that while calling the function. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-02 12:42:02 +01:00
Fabiano Fidêncio	51b1df2333	packaging: Fix typo to get the extra_tarballs path It should've been "${m#*:}" instead of "${m#&:}". Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-02 12:41:54 +01:00
Fabiano Fidêncio	53e8461db2	Merge pull request #9000 from fidencio/topic/fix-pushing-artefacts-to-registry packaging: Fix pushing artefacts to the registry	2024-02-02 10:21:40 +01:00
Fabiano Fidêncio	0b221b5618	packaging: Fix pushing artefacts to the registry This issue was introduced due to a typo not caught during reviews on `e5bca90274`. Fixes: #6415 -- part II Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-02 10:13:11 +01:00
Wenyuan Liu	cb888516c1	Merge pull request #8760 from fadecoder/reduce_go_runtime_mounts runtime: Reduce the mount points with namespace isolation	2024-02-02 16:54:44 +08:00
Greg Kurz	d1a26ead94	Merge pull request #8454 from BbolroC/compile-with-qemu-s390x runtime-rs: make compilation for QEMU on s390x	2024-02-02 09:29:32 +01:00
Fabiano Fidêncio	0520b272a3	Merge pull request #8987 from fidencio/topic/fix-cache-for-confidential-kernel packaging: cache: Fix caching kernels which rely on extra modules	2024-02-02 09:10:52 +01:00
Amulya Meka	e4252a3fe2	Merge pull request #8957 from Amulyam24/add-k8s-test-ppc64le gha: add kubernetes tests workflow for ppc64le	2024-02-02 10:22:00 +05:30
Fabiano Fidêncio	b2f1235e3c	Merge pull request #8994 from sprt/sprt/switch-aks-eastus ci: aks: switch from eastus2 to eastus region	2024-02-02 00:09:40 +01:00
Hyounggyu Choi	bb6f5073aa	runtime-rs: Allow compilation for s390x Until now, runtime-rs couldn't be compiled on s390x. We need to lift those restrictions in Makefile first. Fixes: #8446 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-02-01 23:48:15 +01:00
Dan Mihai	6f1062b5d6	Merge pull request #8966 from microsoft/danmihai1/k8s-sandbox-vcpus-allocation genpolicy: ignore empty YAML as input	2024-02-01 13:51:02 -08:00
Dan Mihai	8f9c92c0ee	Merge pull request #8977 from microsoft/danmihai1/default-namespace genpolicy: support non-default namespace name	2024-02-01 13:50:33 -08:00
Gabriela Cervantes	6771ca463b	gha: k8s: Add cloud-hypervisor (runtime-rs) support This PR adds the Cloud Hypervisor driver, integrated with the runtime-rs, as part of the kubernetes tests different with devmapper. Fixes #8995 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-02-01 21:22:56 +00:00
Aurélien Bombo	0ace31f041	ci: aks: switch from eastus2 to eastus region This addresses an internal AKS issue that intermittently prevents clusters from getting created. The fix has been rolled out to eastus but not yet eastus2, so we unblock the CI by switching. No downsides in general. This supersedes #8990. Fixes: #8989 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2024-02-01 19:22:42 +00:00
Hyounggyu Choi	8fcee6e6ec	runtime-rs: Use Persist::restore() of QEMU for VirtSandbox It fails to compile virt_container because Dragonball is only used in the implementation of the trait method Persist::restore(). As the hypervisor is not compiled on s390x and QEMU implements the trait method, this commit is to let the method use QEMU's. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-02-01 18:02:10 +01:00
Hyounggyu Choi	56aef3741d	runtime-rs: Exclude hypervisors plugins except QEMU for s390x Dragonball and cloud-hypervisor are not supported on s390x. We need to exclude the plugins for these hypervisors from compilation. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-02-01 18:02:10 +01:00
Fabiano Fidêncio	5d2906c36a	packaging: Bump the kata config kernel version Just to make sure we won't use cached components. Fixes: #6415 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-01 16:57:15 +01:00
Fabiano Fidêncio	d2ea11dbff	packaging: Use the cached kernel modules Till now we didn't have a logic to consume the kernel modules cached tarball. Let's make sure those are consumed as it'll save us a reasonable amount of build time. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-01 16:57:15 +01:00
Fabiano Fidêncio	e5bca90274	packaging: Cache the kernel modules This will save us a lot of time, as right now the CI is rebuilding the kernel for absolutely no reason. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-01 16:55:21 +01:00
Fabiano Fidêncio	f481f58659	packaging: Create the tarball for the kernel modules Let's start doing this for the confidential kernels (and also for SEV, till it gets removed). Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-01 16:55:20 +01:00
Fabiano Fidêncio	a58caca723	packaging: Take extra tarballs in install_cached_tarball_component() This allows us to add a map, in the format of: `"tarball1_name:tarball1_path tarball2_name:tarball2_path ..."` With this we have a base to start doing a better job when caching extra artefacts, like kernel modules. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-01 16:55:20 +01:00
Fabiano Fidêncio	33ac5468fe	packaging: Add function to get the kernel modules directory Right now this is just being added but not used yet. The idea is to use this to both cache and later on untar the kernel modules needed for some of the kernel targets we have (specifically looking at the confidential one). Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-01 16:55:20 +01:00
Zhigang Wang	9317e23df1	mount: Reduce the mount points with namespace isolation This patch can reduce load on systemd process, and increase the k8s deployment density when using go runtime. Fixes: #8758 Signed-off-by: Zhigang Wang <wangzhigang17@huawei.com> Signed-off-by: Liu Wenyuan <liuwenyuan9@huawei.com>	2024-02-01 18:34:24 +08:00
Fabiano Fidêncio	ed6816e29f	kata-manager: Add support for nerdctl installation As already done for docker, let's also add support for installing nerdctl + kata containers. At least for now, we are explicitly not allowing the combination of installing both docker and nerdctl in the same installation in order to reduce the script complexity. Also, nerdctl installation, for now, is limited to x86_64 and aarch64 as those are the only architectures that nerdctl releases a "full" package for. Fixes: #8358 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-02-01 09:19:35 +01:00
Xuewei Niu	2332552c8f	Merge pull request #7483 from frezcirno/passfd_io_feature runtime-rs: improving io performance using dragonball's vsock fd passthrough	2024-02-01 14:53:53 +08:00
Amulyam24	f8585db8d9	gha: add kubernetes tests workflow for ppc64le This PR adds workflow for running kubernetes test suite on ppc64le. It uses scripts to create and delete the cluster using kubeadm as none of the current cluster creation tools are supported on Power. Fixes: #7950 Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2024-02-01 12:23:11 +05:30
Alex Lyn	cf26c16017	Merge pull request #8931 from yaoyinnan/8930/feat/merge-ValidCgroupPath runtime: merged ValidCgroupPath method	2024-02-01 12:53:55 +08:00
Alex Lyn	a157fc3b74	Merge pull request #8974 from yaoyinnan/5240/fix/cgroup-parallel runtime: add SingleContainer when obtaining OCI Spec	2024-02-01 11:43:02 +08:00
Alex Lyn	1b8f3ce28a	Merge pull request #8929 from yaoyinnan/8838/fix/error-message runtime-rs: report error on missing or empty fields in configuration	2024-02-01 11:02:30 +08:00
Dan Mihai	09ea0eed9d	genpolicy: ignore empty YAML as input Kata CI's pod-sandbox-vcpus-allocation.yaml ends with "---", so the empty YAML document following that line should be ignored. To test this fix: genpolicy -u -y pod-sandbox-vcpus-allocation.yaml Fixes: #8895 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-02-01 02:22:21 +00:00
Dan Mihai	befef119ff	Merge pull request #8941 from malt3/genpolicy-flags genpolicy: allow separate paths for rules and settings files	2024-01-31 18:14:12 -08:00
GabyCT	6db1cd5f65	Merge pull request #8964 from GabyCT/topic/fixnerdcltt tests: Re-arranged nerdctl tests	2024-01-31 15:02:54 -06:00
Dan Mihai	21125baec3	Merge pull request #8962 from microsoft/danmihai1/config-map-optional2 genpolicy: ignore volume configMap optional field	2024-01-31 12:29:30 -08:00
Fabiano Fidêncio	39a64d1447	Merge pull request #8269 from wainersm/kata-deploy_deprecated kata-deploy: fix deprecations on kustomization files	2024-01-31 20:02:01 +01:00
Hyounggyu Choi	9c0312d466	Merge pull request #8956 from BbolroC/agent-build-fix-s390x-ppc64le packaging: Use Ubuntu 20.04 for building an agent	2024-01-31 18:23:16 +01:00
Greg Kurz	8b1dc06971	Merge pull request #8938 from pmores/log-qemus-stderr-in-shim-log runtime-rs: Log qemu's stderr in shim log	2024-01-31 18:04:28 +01:00
Dan Mihai	f0339a79a6	genpolicy: support non-default namespace name Allow users to specify in genpolicy-settings.json a default cluster namespace other than "default". For example, Kata CI uses as default namespace: "kata-containers-k8s-tests". Fixes: #8976 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-01-31 15:47:01 +00:00
Zixuan Tan	222de4f684	agent: Fix a race condition in passfd_io.rs There is a race condition in agent HVSOCK_STREAMS hashmap, where a stream may be taken before it is inserted into the hashmap. This patch adds simple retry logic to the stream consumer to alleviate this issue. Fixes: #6714 Signed-off-by: Zixuan Tan <tanzixuan.me@gmail.com>	2024-01-31 21:07:48 +08:00
Zixuan Tan	6e4d4c329a	agent,runtime-rs: Add license header to passfd_io.rs Fixes: #6714 Signed-off-by: Zixuan Tan <tanzixuan.me@gmail.com>	2024-01-31 21:07:48 +08:00
Zixuan Tan	1206de2c23	agent: Use pipes as stdout/stderr of container process Linux forbids opening an existing socket through /proc/<pid>/fd/<fd>, making some images relying on the special file /dev/stdout(stderr), /proc/self/fd/1(2) fail to boot in passfd io mode, where the stdout/stderr of a container process is a vsock socket. For backward compatibility, a pipe is introduced between the process and the socket, and its read end is set as stdout/stderr of the container process instead of the socket. The agent will do the forwarding between the pipe and the socket. Fixes: #6714 Signed-off-by: Zixuan Tan <tanzixuan.me@gmail.com>	2024-01-31 21:07:48 +08:00
Zixuan Tan	f6710610d1	agent,runtime-rs,runk: fix fmt and clippy warnings Fix rustfmt and clippy warnings detected by CI. Fixes: #6714 Signed-off-by: Zixuan Tan <tanzixuan.me@gmail.com>	2024-01-31 21:07:48 +08:00
Zixuan Tan	89be42a177	runtime-rs: open stdout and stderr fifos NONBLOCK This patch adds the O_NONBLOCK flag when opening the stdout and stderr FIFOs to avoid blocking. Fixes: #6714 Signed-off-by: Zixuan Tan <tanzixuan.me@gmail.com>	2024-01-31 21:07:48 +08:00
Zixuan Tan	3eb4bed957	agent: use biased select to avoid data loss This patch uses a biased select to avoid stdin data loss in case of CloseStdinRequest. Fixes: #6714 Signed-off-by: Zixuan Tan <tanzixuan.me@gmail.com>	2024-01-31 21:07:48 +08:00
Zixuan Tan	7874ef5fd2	agent: set stdout/err vsock stream as blocking before passing to child In passfd io mode, when not using a terminal, the stdout/stderr vsock streams are directly used as the stdout/stderr of the child process. These streams are non-blocking by default. The stdout/stderr of the process should be blocking, otherwise the process may encounter EAGAIN error when writing to stdout/stderr. Fixes: #6714 Signed-off-by: Zixuan Tan <tanzixuan.me@gmail.com>	2024-01-31 21:07:48 +08:00
Fupan Li	cfb262d02f	container: keep the io connection when pass fd to hybrid vsock We want the io connection to stay connected when containerd closes the io pipe, so that the io stream can be attached to again. Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>	2024-01-31 21:07:48 +08:00
Fupan Li	4a762fcfdd	dbs: hybrid stream support keep the connection when local closed Support the hybrid fd passthrough mode with a passed pipe fd, which can specify that the connection is kept even when the pipe peer is closed; the connection can then be regained by re-opening the pipe. Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>	2024-01-31 21:07:48 +08:00
Zixuan Tan	5536743361	agent,runtime-rs: fix container io detach and attach Partially fix some issues related to container io detach and attach. Fixes: #6714 Signed-off-by: Zixuan Tan <tanzixuan.me@gmail.com>	2024-01-31 21:07:48 +08:00
Zixuan Tan	657b17a86f	runtime-rs: open stdin fifo with RDWR\|NONBLOCK when pass vsock streams In linux, when a FIFO is opened and there are no writers, the reader will continuously receive the HUP event. This can be problematic when creating containers in detached mode, as the stdin FIFO writer is closed after the container is created, resulting in this situation. In passfd io mode, open stdin fifo with O_RDWR\|O_NONBLOCK to avoid the HUP event. Fixes: #6714 Signed-off-by: Zixuan Tan <tanzixuan.me@gmail.com>	2024-01-31 21:07:48 +08:00
Zixuan Tan	f1b33fd2e0	agent: clean up term master fd when container exits When container exits, the agent should clean up the term master fd, otherwise the fd will be leaked. Fixes: kata-containers#6714 Signed-off-by: Zixuan Tan <tanzixuan.me@gmail.com>	2024-01-31 21:07:48 +08:00
Zixuan Tan	b8632b4034	dragonball: vsock: properly handle EPOLLHUP/EPOLLERR events When one end of the connection closes, the epoll event will be triggered forever. We should close the connection and kill the connection. Fixes: #6714 Signed-off-by: Zixuan Tan <tanzixuan.me@gmail.com>	2024-01-31 21:07:48 +08:00
Zixuan Tan	442df71fe5	agent,runtime-rs: refactor process io using vsock fd passthrough feature Currently in the kata container, every io read/write operation requires an RPC request from the runtime to the agent. This process involves data copying into/from an RPC request/response, which has high overhead. To solve this issue, this commit utilizes the vsock fd passthrough, a newly introduced feature in the Dragonball hypervisor. This feature allows other host programs to pass a file descriptor to the Dragonball process, directly as the backend of an ordinary hybrid vsock connection. The runtime-rs now utilizes this feature for container process io. It opens the stdin/stdout/stderr fifo from containerd and passes them to Dragonball, after which it no longer handles process io, eliminating the need for an RPC for each io read/write operation. In passfd io mode, the agent uses the vsock connections as the child process's stdin/stdout/stderr, eliminating the need for a pipe to pump data (in non-tty mode). Fixes: #6714 Signed-off-by: Zixuan Tan <tanzixuan.me@gmail.com>	2024-01-31 21:07:48 +08:00
Zixuan Tan	eb6bb6fe0d	config: add two options to control vsock passthrough io feature Two toml options, `use_passfd_io` and `passfd_listener_port` are introduced to enable and configure dragonball's vsock fd passthrough io feature. This commit is a preparation for vsock fd passthrough io feature. Fixes: #6714 Signed-off-by: Zixuan Tan <tanzixuan.me@gmail.com>	2024-01-31 21:07:48 +08:00
Zixuan Tan	973b5ad1f4	runtime-rs: make Container::new async Fixes: #6714 Signed-off-by: Zixuan Tan <tanzixuan.me@gmail.com>	2024-01-31 21:07:48 +08:00
Xuewei Niu	5449173102	Merge pull request #8932 from kalil-pelissier/feature/issue-8586/fix-noop-method-call-warning dragonball: fix noop-method-call warning	2024-01-31 19:24:27 +08:00
Malte Poll	531a11159f	genpolicy: allow separate paths for rules and settings files Using custom input paths with -i is counter-intuitive. Simplify path handling with explicit flags for rules.rego and genpolicy-settings.json. Fixes: #8568 Signed-Off-By: Malte Poll <1780588+malt3@users.noreply.github.com>	2024-01-31 11:00:19 +01:00
Hyounggyu Choi	2e1d770fcf	packaging: Track files correctly when naming builder image for agent The necessary files for the agent builder image can be found in `tools/packaging/static-build/agent`, `ci/install_libseccomp.sh` and `tools/packaging/kata-deploy/local-build/kata-deploy-copy-libseccomp-installer.sh`. Identifying the correct files addresses the previously misreferenced path used to name the builder image. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-01-31 10:49:20 +01:00
yaoyinnan	9aa1ed805a	runtime: add SingleContainer when obtaining OCI Spec When creating a cgroup, add a SingleContainer when obtaining the OCI Spec to apply to ctr, podman, etc. Fixes: #5240 Signed-off-by: yaoyinnan <35447132+yaoyinnan@users.noreply.github.com>	2024-01-31 15:24:07 +08:00
yaoyinnan	b0b8523cea	runtime: modify ValidCgroupPath unit test Modify ValidCgroupPath unit test. Fixes: #8930 Signed-off-by: yaoyinnan <35447132+yaoyinnan@users.noreply.github.com>	2024-01-31 14:37:17 +08:00
yaoyinnan	feed5c8ff9	runtime: merged ValidCgroupPath method Merged ValidCgroupPath method to handle cgroupv1 and cgroupv2. Fixes: #8930 Signed-off-by: yaoyinnan <35447132+yaoyinnan@users.noreply.github.com>	2024-01-31 14:37:13 +08:00
yaoyinnan	864389c524	runtime-rs: report error on missing or empty fields in configuration Removed the setting of default values for runtime fields. Added explicit checks for missing or empty fields, reporting errors with clear messages. Fixes: #8838 Signed-off-by: yaoyinnan <35447132+yaoyinnan@users.noreply.github.com>	2024-01-31 12:46:17 +08:00
Wainer dos Santos Moschetta	abc2fcd88f	kata-deploy: fix deprecations on kustomization files Running `kustomize edit fix` on those files replaced the deprecated instructions ('bases' and 'patchesStrategicMerge') and added 'apiVersion' and 'kind'. Fixes #8268 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-01-30 18:41:03 -03:00
Lukáš Doktor	4876eadd2f	tools: Add reference to the kata webhook's README The newly added webhook is a new component and ought to be linked from the main README file. Signed-off-by: Lukáš Doktor <ldoktor@redhat.com>	2024-01-30 19:05:56 +01:00
Lukáš Doktor	b0b7748f30	ci/openshift-ci: Correct the lib location correct the lib file locations after the move from tests->kata-containers repo and add a minimized version of the ".ci/lib.sh" library into the "ci/openshift-ci" as we don't really utilize all of the features. Fixes: #8653 Signed-off-by: Lukáš Doktor <ldoktor@redhat.com>	2024-01-30 19:05:56 +01:00
Lukáš Doktor	4c58478536	ci/openshift-ci: Move openshift-ci from the tests repo Move the f15be37d9bef58a0128bcba006f8abb3ea13e8da version of scripts required for openshift-ci from "kata-containers/tests/.ci/openshift-ci" into "kata-containers/kata-containers/ci/openshift-ci" and required webhook+libs into "kata-containers/kata-containers/tools/testing" as is to simplify verification, the different location handling will be added in following commit. Signed-off-by: Lukáš Doktor <ldoktor@redhat.com>	2024-01-30 19:05:55 +01:00
Kvlil	3fd5628771	dragonball: fix noop-method-call warning The `noop-method-call` is a rustc lint that has existed since v1.52.0. This lint has been moved to the warn by default lint level since v1.73.0. Therefore build is failing with this version and above. This commit removes the unnecessary call to `<&T as Deref>::deref` on `T: !Deref`. Fixes: #8586 Signed-off-by: Kvlil <kalil.pelissier@gmail.com>	2024-01-30 17:16:49 +00:00
Wainer Moschetta	bf54a02e16	Merge pull request #8924 from microsoft/danmihai1/pod-nested-configmap-secret genpolicy: fix ConfigMap volume mount paths	2024-01-30 14:09:41 -03:00
Gabriela Cervantes	78b517ccc8	tests: Re-arranged nerdctl tests This PR re-arranges the nerdctl tests to avoid random failures. The tests now run first with RunC and then with the kata hypervisor. This PR tries to avoid the random failures that are happening with cloud-hypervisor and clh. Fixes #8963 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-01-30 16:07:12 +00:00
Dan Mihai	d12875ee66	genpolicy: ignore volume configMap optional field The auto-generated Policy already allows these volumes to be mounted, regardless if they are: - Present, or - Missing and optional Fixes: #8893 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-01-30 15:32:37 +00:00
Fabiano Fidêncio	7a83e6dc14	Merge pull request #8959 from fidencio/topic/crio-bump-runners-to-2204 gha: cri-o: Bump runners to 22.04	2024-01-30 14:27:40 +01:00
Fabiano Fidêncio	34d51b05f8	gha: cri-o: Bump runners to 22.04 This will not solve the CRI-O CI breakage but will give us an environment where we could get it to run locally. Fixes: #8935 -- part I Thanks to Julien Ropé for trying to reproduce the issues I faced on https://github.com/kata-containers/kata-containers/issues/8935 in an Ubuntu 22.04 system. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-01-30 14:17:06 +01:00
Xuewei Niu	7e10000b6f	Merge pull request #8928 from yaoyinnan/8927/fix/unused-DriverInfo runtime-rs: fix unused driverInfo error	2024-01-30 20:39:10 +08:00
Hyounggyu Choi	f3bc6e4155	packaging: Use Ubuntu 20.04 for building an agent This involves using Ubuntu 20.04 as a build environment for an agent to match with a runtime environment. Fixes: #8955 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-01-30 10:22:14 +01:00
Pavel Mores	d53edbd0a5	runtime-rs: collect qemu stderr and log it in shim log Qemu stderr monitoring runs in its own asynchronous green thread. For that, `stderr` is taken out of the Child representing the qemu child process to avoid partial move and make it possible for the main thread still to call functions on QemuInner::qemu_process (e.g. kill(), id()). Fixes #8937 Signed-off-by: Pavel Mores <pmores@redhat.com>	2024-01-30 09:09:05 +01:00
Pavel Mores	684d740122	runtime-rs: switch qemu child process management from std to tokio We'll want to capture qemu's stderr in parallel with normal runtime-rs execution. Tokio's primitives make this much easier than std's. This also makes child process management more consistent across runtime-rs (i.e. virtiofsd child process is already launched and managed using tokio). Some changes were necessary due to tokio functions being slightly different from their std counterparts. Child::kill() is now async and Child::id() now returns an Option. Signed-off-by: Pavel Mores <pmores@redhat.com>	2024-01-30 09:07:14 +01:00
Dan Mihai	6a8f46f3b8	Merge pull request #8918 from microsoft/danmihai1/metadata genpolicy: optional PodTemplateSpec metadata field	2024-01-29 12:36:30 -08:00
Dan Mihai	60ac3048e9	genpolicy: fix ConfigMap volume mount paths Allow Kata CI's pod-nested-configmap-secret.yaml to work with genpolicy and current cbl-mariner images: 1. Ignore the optional type field of Secret input YAML files. It's possible that CoCo will need a more sophisticated Policy for Secrets, but this change at least unblocks CI testing for already-existing genpolicy features. 2. Adapt the value of the settings field below to fit current CI images for testing on cbl-mariner Hosts: "kata_config": { "confidential_guest": false }, Switching this value from true to false instructs genpolicy to expect ConfigMap volume mounts similar to: "configMap": { "mount_type": "bind", "mount_source": "$(sfprefix)", "mount_point": "^$(cpath)/watchable/$(bundle-id)-[a-z0-9]{16}-", "driver": "watchable-bind", "fstype": "bind", "options": [ "rbind", "rprivate", "ro" ] }, instead of: "confidential_configMap": { "mount_type": "bind", "mount_source": "$(sfprefix)", "mount_point": "$(sfprefix)", "driver": "local", "fstype": "bind", "options": [ "rbind", "rprivate", "ro" ] } }, This settings change unblocks CI testing for ConfigMaps. Simple sanity testing for these changes: genpolicy -u -y pod-nested-configmap-secret.yaml kubectl apply -f pod-nested-configmap-secret.yaml kubectl get pods \| grep config nested-configmap-secret-pod 1/1 Running 0 26s Fixes: #8892 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-01-29 16:13:47 +00:00
Gabriela Cervantes	31813cf8d8	metrics: Update packages for TensorFlow ResNet Int8 Dockerfile This PR updates the required packages for the TensorFlow ResNet50 Int8 Dockerfile. Fixes #8950 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-01-29 16:11:09 +00:00
Fabiano Fidêncio	087856f26c	Merge pull request #8934 from microsoft/danmihai1/nodeName genpolicy: ignore the nodeName field	2024-01-29 16:57:59 +01:00
Greg Kurz	d687b601f1	Merge pull request #8933 from fidencio/topic/package-coco-guest-components packaging: Build coco-guest-components	2024-01-29 16:34:06 +01:00
Zvonko Kaiser	a9348fa35b	Merge pull request #8375 from zvonkok/opa-binary-fix arm64: agent_policy build always pulls amd64 opa binary	2024-01-29 15:10:10 +01:00
Fabiano Fidêncio	5ea6a29c37	Merge pull request #8947 from fidencio/topic/gha-pass-down-AZ_SUBSCRIPTION_ID gha: azure: Set the correct subscription to the account	2024-01-29 15:07:06 +01:00
Fabiano Fidêncio	448c0aaecb	gha: azure: Set the correct subscription to the account Due to the changes done in the CI, we need to set the correct subscription to be used with the account from now on, otherwise we'd end up using CoCo subscription. Fixes: #8946 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-01-29 15:00:38 +01:00
Pavel Mores	b52a398469	runtime-rs: move creation of VM path from start_vm() to prepare_vm() This fixes a flaw pointed out in review of PR #8185. Creation of the directory semantically fits better into VM preparation than VM launch. Signed-off-by: Pavel Mores <pmores@redhat.com>	2024-01-27 13:46:35 +01:00
Fabiano Fidêncio	98dc2d4c52	rootfs: agent: Initialise AGENT_SOURCE_BIN & AGENT_TARBALL Otherwise those would be unbound if not passed. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-01-26 19:58:41 +01:00
Fabiano Fidêncio	5e57e0235e	rootfs: agent: Fix build with AGENT_SOURCE_BIN We need to actually check that the env var is not empty. :-) This was introduced by `8307718842`. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-01-26 19:58:20 +01:00
Fabiano Fidêncio	fbfc880eb6	rootfs: Add COCO_GUEST_COMPONENTS_TARBALL env var This env var will serve us to pass the Confidential Containers guest-components tarball to the rootfs builder, which will then just unpack the content into the rootfs. Fixes: #8848 -- part I Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Signed-off-by: Linda Yu <linda.yu@intel.com> Co-authored-by: stevenhorsman <steven@uk.ibm.com> Co-authored-by: Jakob Naucke <jakob.naucke@ibm.com> Co-authored-by: Wang, Arron <arron.wang@intel.com> Co-authored-by: zhouliang121 <liang.a.zhou@linux.alibaba.com> Co-authored-by: Alex Carter <alex.carter@ibm.com> Co-authored-by: Suraj Deshmukh <suraj.deshmukh@microsoft.com> Co-authored-by: Xynnn007 <xynnn@linux.alibaba.com>	2024-01-26 19:58:19 +01:00
Fabiano Fidêncio	644abde35c	packaging: coco-guest-components: Allow building the project The Confidential Containers guest-components will, in the very near future, be part of the Kata Containers rootfs that's used by the Confidential Containers use case. This commit introduces the ability to build the component standalone, both locally and as part of our CI, by calling: `make coco-guest-components-tarball` Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Signed-off-by: Linda Yu <linda.yu@intel.com> Co-authored-by: stevenhorsman <steven@uk.ibm.com> Co-authored-by: Jakob Naucke <jakob.naucke@ibm.com> Co-authored-by: Wang, Arron <arron.wang@intel.com> Co-authored-by: zhouliang121 <liang.a.zhou@linux.alibaba.com> Co-authored-by: Alex Carter <alex.carter@ibm.com> Co-authored-by: Suraj Deshmukh <suraj.deshmukh@microsoft.com> Co-authored-by: Xynnn007 <xynnn@linux.alibaba.com>	2024-01-26 19:36:01 +01:00
Hyounggyu Choi	ee072e8a06	Merge pull request #8926 from fidencio/topic/cache-the-agent-for-non-x86_64 gha: Cache the agent for non-x86_64 arches	2024-01-26 18:04:33 +01:00
Dan Mihai	076869aa39	genpolicy: ignore the nodeName field Validating the node name is currently outside the scope of the CoCo policy. This change unblocks testing using Kata CI's test-pod-file-volume.yaml and pv-pod.yaml. Fixes: #8888 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-01-26 16:30:55 +00:00
Dan Mihai	ef1ee81f81	Merge pull request #8909 from microsoft/danmihai1/main-shareProcessNamespace genpolicy: add shareProcessNamespace support	2024-01-26 05:49:19 -08:00
yaoyinnan	9b7c5c69cf	runtime-rs: fix unused driverInfo error Remove the unused DriverInfo declaration or integrate it into the codebase where applicable. Fixes: #8927 Signed-off-by: yaoyinnan <35447132+yaoyinnan@users.noreply.github.com>	2024-01-26 19:59:52 +08:00
Greg Kurz	f41fa7557a	Merge pull request #8914 from BbolroC/basic-e2e-ibm-se tests: Add IBM SE to the basic confidential test	2024-01-26 12:32:32 +01:00
Fabiano Fidêncio	08a082ca47	gha: Cache the agent for non-x86_64 arches Those are not yet being cached for no reason, and they better be as it'll allow us to save a considerable amount of time building the rootfs. Fixes: #8917 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-01-26 12:02:26 +01:00
Fabiano Fidêncio	a7c68225aa	Merge pull request #8916 from fidencio/topic/packaging-reuse-already-built-agent packaging: Don't always build the kata-agent	2024-01-26 12:00:55 +01:00
Fabiano Fidêncio	95c569b0a6	packaging: Add safe.directory to the git config Otherwise building as root will not work, as demonstrated by the arm64 CI. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-01-26 09:44:43 +01:00
Hyounggyu Choi	ab462a4b89	tests: Add IBM SE to the basic confidential test The existing confidential basic test titled `Test unencrypted confidential container launch success and verify that we are running in a secure enclave` has been updated to incorporate IBM Secure Execution (`qemu-se`). Previously, a secure image was absent from kata-deploy, hindering the inclusion of IBM SE in the test. Thanks to the #6755 update, it is now possible to test the TEE. This modification extends the existing test by introducing `qemu-se`. The specific changes are outlined below: - Add an additional test `cc-se-e2e-tests` to s390x nightly - Expansion of `REMOTE_COMMAND_PER_HYPERVISOR` for `qemu-se` - Temporary exclusion of two test cases currently incompatible with IBM SE (`cpu-ns` is a common issue across all TEEs, while `inotify` will be addressed in a subsequent pull request). Fixes: #8913 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-01-26 06:04:39 +01:00
GabyCT	c13a63c8ba	Merge pull request #8905 from zvonkok/enable-tpm qemu: enable TPM	2024-01-25 14:52:00 -06:00
GabyCT	aa958adf90	Merge pull request #8904 from GabyCT/topic/buildbq tools: Use defined variable in build base qemu script	2024-01-25 13:51:44 -06:00
GabyCT	36fc2fd83f	Merge pull request #8876 from GabyCT/topic/dockerrestfp metrics: Update packages needed for ResNet50 FP32 Dockerfile	2024-01-25 13:51:16 -06:00
Dan Mihai	8ad5459beb	genpolicy: optional PodTemplateSpec metadata field Add metadata containing the Policy annotation if the user didn't provide any metadata in the input yaml file. For a simple sanity test using a Kata CI YAML file: genpolicy -u -y job.yaml kubectl apply -f job.yaml kubectl get pods \| grep job job-pi-test-64dxs 0/1 Completed 0 14s Fixes: #8891 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-01-25 19:06:59 +00:00
Fabiano Fidêncio	dd49479829	packaging: Don't build the agent if not needed Let's start relying on the already cached agent to be deployed inside the rootfs. By doing this we save a lot of time in our CI, and we have a better way, for developers, to play with changes in the agent. Fixes: #8915 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-01-25 19:41:33 +01:00
Fabiano Fidêncio	21fd7e6dfd	packaging: Fail in case oras can't find an artefact It just means the component is not cached, and that it must be built in the usual way. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-01-25 19:41:32 +01:00
Fabiano Fidêncio	eb7a33ee71	rootfs: Always strip the agent binary Let's always do this, regardless of where the agent is coming from. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-01-25 19:41:32 +01:00
Fabiano Fidêncio	f23451de01	rootfs: Add xz as a dep As we'll be untarring the agent tarball (and any other component that may be part of the rootfs) into the rootfs, we have to have xz installed. For debian and ubuntu the package is called xz-utils; for centos, alpine and cbl-mariner the package is called xz. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-01-25 19:41:32 +01:00
Fabiano Fidêncio	8307718842	rootfs: Add AGENT_TARBALL env var This env var will serve us to pass the agent tarball to the rootfs builder, which will then just unpack the content into the rootfs instead of building the agent again. AGENT_TARBALL and AGENT_SOURCE_BIN should never be used together. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-01-25 19:41:32 +01:00
Fabiano Fidêncio	5b0d0687e5	packaging: agent: Allow building in all arches We're moving away from alpine and using ubuntu in order to be able to build the agent for all the architectures we need. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-01-25 19:41:32 +01:00
Dan Mihai	535cf04edb	genpolicy: add shareProcessNamespace support Validate the sandbox_pidns field value for CreateSandbox and CreateContainer. Fixes: #8868 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-01-25 16:48:57 +00:00
Dan Mihai	1e24581c07	Merge pull request #8908 from microsoft/danmihai1/genpolicy-permissions tools: allow all users to execute genpolicy	2024-01-25 08:42:24 -08:00
Dan Mihai	295494c7dc	Merge pull request #8898 from microsoft/danmihai1/show-output-of-passing-tests tests: k8s: bats --show-output-of-passing-tests	2024-01-25 06:22:50 -08:00
Fabiano Fidêncio	1039641ab8	packaging: agent: Add the arch to the builder container This has been missed during reviews and is already a problem as we're trying to build the agent outside of the rootfs for other architectures than x86_64. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-01-25 14:11:14 +01:00
Fabiano Fidêncio	58874f9c3e	packaging: tools: Add the arch to the builder container This has been missed during reviews and will become a problem when the tools start to be built in different architectures. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-01-25 14:10:22 +01:00
Zvonko Kaiser	76efe25aed	Merge pull request #8901 from zvonkok/remove-gha-action gpu: remove GHA target first then remove the obsoleted Makefile targets	2024-01-25 13:40:03 +01:00
Chelsea Mafrica	24b33ae35b	Merge pull request #8884 from GabyCT/topic/ulib versions: Update libseccomp to version v2.5.5	2024-01-24 23:55:32 -08:00
Dan Mihai	723c76d945	tools: allow all users to execute genpolicy This tool can be useful for any users. Fixes: #8907 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-01-25 00:40:53 +00:00
Zvonko Kaiser	19ecdbca3b	qemu: enable TPM Several use-cases need a vTPM, so let's enable it for QEMU; a follow-up patch will introduce the runtime config. Fixes: #8902 Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2024-01-24 17:49:08 +00:00
Gabriela Cervantes	98b5a19b3a	tools: Use defined variable in build base qemu script This PR uses a variable that is already defined in the build base qemu script to have uniformity across the script as this variable is already used in the script. Fixes #8903 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-01-24 17:05:17 +00:00
Zvonko Kaiser	4b8d79c1f6	gpu: remove GHA target first then remove the obsoleted Makefile targets Let's remove the GHA target actions first so that the tests of the follow-up PR #8874 succeed. Fixes: #8900 Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2024-01-24 11:43:39 +00:00
Dan Mihai	66c012d052	tests: k8s: bats --show-output-of-passing-tests Add --show-output-of-passing-tests to the k8s integration tests. The output of a passing test can be helpful when investigating a failure of the same test. Fixes: #8885 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-01-24 03:04:28 +00:00
Hyounggyu Choi	f4290688bb	Merge pull request #7146 from BbolroC/ibm-se-howto-doc docs: provide a guide for how to use IBM Secure Execution	2024-01-23 22:48:05 +01:00
Hyounggyu Choi	25ecca91c6	docs: provide a guide for how to use IBM Secure Execution This PR is to add a document for how to run kata containers under IBM Secure Execution environment. Fixes: #7025 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-01-23 18:58:27 +01:00
Greg Kurz	0f67a26751	Merge pull request #8812 from kalil-pelissier/feature/issue-7720/drop-dead-code runtime: remove SharedVersions field dead code	2024-01-23 17:46:41 +01:00
Gabriela Cervantes	1b0d12ab78	versions: Update libseccomp to version v2.5.5 This PR updates the libseccomp version to v2.5.5 which includes the following changes: - Update the syscall table for Linux - Fix minor issues with binary tree testing and with empty binary trees Fixes #8883 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-01-23 16:31:25 +00:00
Zvonko Kaiser	ab597a4d5b	opa: Improve the download logic The versions.yaml has a default for the amd64 binary, but there is no code to actually build the arm64 binary, which seems to be an oversight. Let's simplify the OPA logic by removing the direct link to the binary, and construct that link as part of the checks we do to decide whether we need to build OPA or not. Fixes: #8373 Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com> Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-01-23 09:16:16 +00:00
Greg Kurz	4516f38165	Merge pull request #8872 from zvonkok/nvidia-gpu-confidential gpu: Add NVIDIA GPU Confidential kernel target	2024-01-23 09:22:27 +01:00
Dan Mihai	3d2ec5c919	Merge pull request #8857 from microsoft/danmihai1/k8s-gha gha: get ready to install genpolicy	2024-01-22 08:29:24 -08:00
Gabriela Cervantes	eb7e123de8	metrics: Update packages needed for ResNet50 FP32 Dockerfile This PR updates the packages necessary to build the ResNet50 fp32 Dockerfile so that the benchmark runs properly. Fixes #8875 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-01-22 16:15:36 +00:00
Zvonko Kaiser	4fc34323ae	gpu: Add NVIDIA GPU Confidential kernel target This is a follow up to the work of minimizing targets, unifying TDX,SNP builds for NVIDIA GPUs Fixes: #8828 Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2024-01-22 14:58:57 +00:00
Kvlil	a4b208a712	runtime: remove SharedVersions field dead code The SharedVersions field adds a versiontable property that isn't supported by upstream QEMU. This is dead code since virtcontainers isn't setting SharedVersions to true. Fixes: #7720 Signed-off-by: Kvlil <kalil.pelissier@gmail.com>	2024-01-22 12:18:42 +00:00
Dan Mihai	ea9c659d36	gha: get ready to install genpolicy The changes to install and test genpolicy must come later, after CI picks up these gha changes. Fixes: #8856 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-01-19 23:37:49 +00:00
GabyCT	bb1ada1a8b	Merge pull request #8855 from GabyCT/topic/updatefc versions: Update firecracker version	2024-01-19 16:25:50 -06:00
Fabiano Fidêncio	1e30fde8fa	Merge pull request #8862 from microsoft/danmihai1/genpolicy-dns genpolicy: ignore pod DNS settings	2024-01-19 23:08:26 +01:00
Dan Mihai	ca03d47634	genpolicy: ignore pod DNS settings Ignore pod DNS settings because policing the network traffic is currently outside the scope of the Agent Policy. Example from Kata CI: pod-custom-dns.yaml Fixes: #8832 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-01-19 16:42:35 +00:00
Alex.Lyn	826c751bf3	Merge pull request #8185 from pmores/add-qemu-cmdline-generation-framework Add qemu cmdline generation framework	2024-01-19 21:42:49 +08:00
Greg Kurz	b7d6b18768	Merge pull request #8485 from BbolroC/add-unit-test-s390x GHA: Enable static check for s390x, aarch64 and ppc64le	2024-01-19 11:49:16 +01:00
Pavel Mores	25c8d5db5d	runtime-rs: use qemu cmdline generation framework to launch VM Deploy the framework added by the previous commit to generate qemu command line and launch the VM. We now properly store the child process object which allows us to implement remaining Hypervisor functions necessary for a simple but successful VM lifecycle, get_vmm_master_tid() and stop_vm(). Fixes #8184 Signed-off-by: Pavel Mores <pmores@redhat.com>	2024-01-19 11:42:23 +01:00
Gabriela Cervantes	0696807384	versions: Update firecracker version This PR updates the firecracker version to v1.6.0 which includes the following features - Added support for per net device metrics. In addition to aggregate metrics net, each individual net device will emit metrics under the label "net_{iface_id}". E.g. the associated metrics for the endpoint "/network-interfaces/eth0" will be available under "net_eth0" in the metrics json object. - Added support for per block device metrics. In addition to aggregate metrics block, each individual block device will emit metrics under the label "block_{drive_id}". E.g. the associated metrics for the endpoint "/drives/{drive_id}" will be available under "block_drive_id" in the metrics json object. - Added a new vm-state subcommand to info-vmstate command in the snapshot-editor tool to print MicrovmState of vmstate snapshot file in a readable format. Also made the vcpu-states subcommand available on x86_64. - Added source-level instrumentation based tracing. See tracing for more details. - Added developer preview only (NOT for production use) support for vhost-user block devices. Firecracker implements a vhost-user frontend. Users are free to choose from existing open source backend solutions or their own implementation. Known limitation: snapshotting is not currently supported for microVMs containing vhost-user block devices. See the related doc page for details. The device emits metrics under the label "vhost_user_{device}_{drive_id}". Fixes #8854 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-01-18 15:50:30 +00:00
Amulyam24	f6fea5f2ca	agent: fix failing unit tests on ppc64le - test_volume_capacity_stats: verify the file block size against the fetched size via statfs() - test_reseed_rng: Correct the request codes for RNDADDTOENTCNT and RNDRESEEDCRNG when platform is ppc64le - test list_routes: Add the route only if destination is not empty - test_new_fs_manager: skip the test if cgroups v2 is used by default - skip test cases rpc::tests::test_do_write_stream, sandbox::tests::test_find_process, sandbox::tests::test_find_container_process and sandbox::tests::add_and_get_container on ppc64le as they are flaky Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2024-01-18 16:32:16 +01:00
Hyounggyu Choi	610f878894	dragonball: Fix compile error for aarch64 This is to fix a compile error raised for aarch64. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-01-18 16:32:15 +01:00
Amulyam24	376941cf69	kata-ctl: skip building kata-ctl on ppc64le kata-ctl currently fails to build on ppc64le. Skip it when running static checks; the failures will be tracked and fixed in a separate issue. Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2024-01-18 16:31:13 +01:00
Amulyam24	4ecd82a5df	runk: skip the test_init_container_create_launcher if not root on ppc64le This is to skip the test_init_container_create_launcher if not root on ppc64le. Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2024-01-18 16:31:13 +01:00
Amulyam24	a4b5447924	tools: fix makefile spacing This minor PR removes the extra space in the makefiles. Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2024-01-18 16:31:13 +01:00
Amulyam24	394777291d	runtime: fix failing unit tests on ppc64le A few CPU related test cases were failing as the version was being verified against Power8 while the CI machine is Power9. Fixes: #5531 Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2024-01-18 16:31:13 +01:00
Amulyam24	486b8a0538	dragonball: skip running static-checks for ppc64le Since dragonball is not currently supported on ppc64le, skip running the targets for static-checks. Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2024-01-18 16:31:13 +01:00
Amulyam24	14934c7b0d	github: run static checks on ppc64le This PR adds ppc64le runner to the static-checks workflow. Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2024-01-18 16:31:13 +01:00
Hyounggyu Choi	8061a49ca5	kata-ctl: Clean up a test leftover file explicitly It was observed that a temporary file `/tmp/kata_hybrid_vsock02.hvsock` for test_setup_hvsock_failed() is not removed from time to time. This leads to a failure of the same test on the next run due to the file permission on a self-hosted runner. This commit explicitly deletes the file before the check starts. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-01-18 16:31:13 +01:00
Hyounggyu Choi	290ecf4c46	Static-check: Exclude s390x from dragonball and runtime-rs At the moment, the projects `dragonball` and `runtime-rs` do not support s390x. During the enablement, some errors due to the misconfiguration of the Makefile for `make check` and `make vendor` were identified. This is to skip the build for the affected targets of these projects. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-01-18 16:31:13 +01:00
Hyounggyu Choi	c0f57c9e0a	Lint: Fix `cargo clippy` errors for s390x Some linting errors were identified during the enablement of `make check`. These have not been found by the Jenkins CI job because only `make test` was triggered. The errors for the `agent` occur in the s390x-specific tests, while the ones for `kata-ctl` are in the architecture-specific code. This commit fixes those errors. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-01-18 16:31:13 +01:00
Hyounggyu Choi	a1f288e5d3	CI: Use sudo if yq_path is not writable by USER If `yq_path` is set to `/usr/local/bin/yq`, there could be a situation where the `yq` cannot be installed without `sudo`. This commit handles the situation by putting `sudo` in front of `curl` and `chmod`, respectively. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-01-18 16:31:13 +01:00
Hyounggyu Choi	354cbede9c	GHA: Enable static check for s390x As part of the CI migration from Jenkins to GitHub Action, a CI job named `kata-containers-2.0-ubuntu-s390x-unit-PR` is covered by the static check. This commit is to enable the check for s390x by incorporating a runner `s390x` with the corresponding workflow. Fixes: #8482 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-01-18 16:31:13 +01:00
Jianyong Wu	ba74a624a8	runtime-rs: use pathBuf only for x86 PathBuf here is only used for x86. Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2024-01-18 16:31:13 +01:00
Jianyong Wu	a10779bf0b	GHA: enable static check on arm64 This is to add a runner for arm64 to the workflow. Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2024-01-18 16:31:11 +01:00
Dan Mihai	eeba459a6b	Merge pull request #8845 from microsoft/danmihai1/genpolicy-defaults tools: install genpolicy settings files	2024-01-17 15:08:49 -08:00
Chelsea Mafrica	32ad465663	Merge pull request #8710 from jodh-intel/runtime-rs-ch-get-thread-ids runtime-rs: ch: Implement minimal implementation for missing thread/pid APIs	2024-01-17 14:51:44 -08:00
Fabiano Fidêncio	147d5fd752	Merge pull request #8836 from microsoft/danmihai1/test-with-cbl-mariner genpolicy: use root path from cbl-mariner Guest VM	2024-01-17 17:51:44 +01:00
Pavel Mores	f550d9a325	runtime-rs: add basic implementation of qemu command line generation This current framework is enough to launch a VM with a simple container in it (e.g. busybox). Signed-off-by: Pavel Mores <pmores@redhat.com>	2024-01-17 12:55:00 +01:00
Pavel Mores	e8e13044da	runtime-rs: add simple impls to some of Qemu's Hypervisor functions The idea of most of these is just to prevent running into todo!()s where we can at the moment, while implementing the fundamental functionality of VM launch. Signed-off-by: Pavel Mores <pmores@redhat.com>	2024-01-17 12:55:00 +01:00
Dan Mihai	febabef08c	tools: install genpolicy settings files Install the default genpolicy OPA rules and settings JSON files, in addition to the genpolicy binary. Fixes: #8844 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-01-16 23:59:59 +00:00
David Esparza	e11c520ffa	Merge pull request #8808 from kata-containers/memory_usage_test_skip_virtiofs_when_req tests: Ignore virtiofs contribution to memory usage when it is disabled.	2024-01-16 16:50:06 -06:00
Dan Mihai	69557e5ad6	Merge pull request #8814 from microsoft/danmihai1/genpolicy-kata-deploy tools: genpolicy static checks	2024-01-16 07:33:42 -08:00
Dan Mihai	13f2398fe8	Merge pull request #8837 from microsoft/danmihai1/allow_storages genpolicy: temporarily disable allow_storages()	2024-01-16 07:10:49 -08:00
Alex.Lyn	17719f1ac5	Merge pull request #8708 from Apokleos/directvol-bugfix-blk-pci runtime-rs: bugfix for DirectVolume/rawblock when driver is blk	2024-01-16 14:25:16 +08:00
alex.lyn	99717371c1	runtime-rs: bugfix for DirectVolume/rawblock when driver is blk DirectVolume/Rawblock doesn't work well when device's block driver is virtio-blk-pci and the storage handler is DRIVER_BLK_PCI_TYPE. Fixes: #8707 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2024-01-16 10:35:08 +08:00
Dan Mihai	205dafd323	genpolicy: temporarily disable allow_storages() Temporarily disable the allow_storages() rules, because they are based on the tarfs snapshotter + container image integrity information that are not available yet in the main branch - see #8833. Fixes: #8834 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-01-15 23:55:27 +00:00
Dan Mihai	f4106a6107	genpolicy: use root path from cbl-mariner Guest VM Adjust genpolicy-settings.json to match the container root path from the main branch + cbl-mariner Guest VMs. This configuration might have to be adjusted again when other types of Guest VMs are tested during CI using genpolicy in the future. Also, improve logging from allow_root_path(), to make these issues easier to debug in the future. Fixes: #8835 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-01-15 23:33:28 +00:00
GabyCT	37a4049d0f	Merge pull request #8830 from GabyCT/topic/removeprotocol metrics: Remove iperf3 server protocol	2024-01-15 14:44:39 -06:00
Dan Mihai	201eec628a	tools: genpolicy static checks Package genpolicy and enable static checks for it. Fixes: #8813 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-01-15 16:49:58 +00:00
David Esparza	4b772d2480	tests: Ignore virtiofs contribution to memory usage when it is disabled. This PR removes the references to virtiofs from memory average calculation when the container uses a shared file system other than virtiofs. Fixes: #8807 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2024-01-15 08:07:06 -08:00
Gabriela Cervantes	dff800a8ff	metrics: Remove iperf3 server protocol This PR removes the iperf3 server protocol as this server definition is also used for the UDP iperf3 benchmarks to avoid duplication of the same yaml files. Fixes #8829 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-01-15 15:44:24 +00:00
Fabiano Fidêncio	0dc00ae373	Merge pull request #8822 from microsoft/danmihai1/cargo-clippy genpolicy: cargo clippy fixes	2024-01-15 14:59:04 +01:00
Fabiano Fidêncio	73cf31bd9e	Merge pull request #8827 from microsoft/danmihai1/disable-k8s-oom tests: cbl-mariner: disable k8s-oom.bats	2024-01-15 14:40:16 +01:00
Xuewei Niu	923bd65dff	Merge pull request #8819 from justxuewei/rm-protocol-backend dragonball: Remove unused definition	2024-01-15 10:09:46 +08:00
Dan Mihai	b7c31e3b98	tests: cbl-mariner: disable k8s-oom.bats Disable k8s-oom.bats on cbl-mariner until it passes more often. Fixes: #8824 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-01-14 17:39:25 +00:00
Dan Mihai	681cb1626a	genpolicy: cargo clippy fixes Clean up cargo clippy errors. Fixes: #8818 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-01-14 01:23:46 +00:00
Dan Mihai	3af713acd4	Merge pull request #8817 from microsoft/danmihai1/cargo-fmt genpolicy: "cargo fmt -- --check" clean-up	2024-01-13 16:22:27 -08:00
Xuewei Niu	f1fda3d6b0	dragonball: Remove unused definition `EndpointProtocolFlags::ProtocolBackend` is removed due to no reference. Fixes: #8745 Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2024-01-13 13:25:11 +08:00
Dan Mihai	dcaae54cf6	genpolicy: "cargo fmt -- --check" clean-up Also, update Cargo.lock Fixes: #8816 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-01-13 01:57:00 +00:00
GabyCT	a7114a35a8	Merge pull request #8792 from GabyCT/topic/updatenhwc metrics: Use a specific python version to run tensorflow benchmark	2024-01-12 11:24:54 -06:00
Alex.Lyn	ffcd95b6b4	Merge pull request #8737 from Apokleos/test-ci-dgb-cri-containerd ci: enable test dragonball stability and cri-containerd	2024-01-12 11:56:22 +08:00
Fabiano Fidêncio	a606401722	Merge pull request #8803 from jodh-intel/issues-8784-runtime-rs-ch-rm-todo-to-unbreak runtime-rs: ch: Unbreak CH driver	2024-01-11 19:37:13 -03:00
Gabriela Cervantes	12a41f89b1	metrics: Use a specific python version to run tensorflow benchmark This PR uses a specific python version to run tensorflow benchmark as it needs python 3.8 to run correctly and avoid failures. Fixes #8791 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-01-11 22:15:31 +00:00
GabyCT	2ffb161958	Merge pull request #8763 from stevenhorsman/fix-backport-check-hub Fix backport check hub	2024-01-11 15:15:12 -06:00
Fabiano Fidêncio	86a6d133e4	Merge pull request #8248 from microsoft/danmihai1/genpolicy-main tools: add policy generation tool	2024-01-11 17:02:54 -03:00
GabyCT	69be050ff9	Merge pull request #8657 from WenyuanLau/8656/Fix_StratoVirt_on_gha_metrics gha: Fix the failure of gha metrics for StratoVirt	2024-01-11 11:41:25 -06:00
James O. D. Hunt	29e0de4e4a	runtime-rs: ch: Implement minimal memory hotplug APIs Replace the `todo!()` calls with a minimal NOP implementation to return the CH driver to working order since the `todo!()`'s forcibly crash the driver at runtime. Full implementations for these APIs will be added on issues #8800, #8801, and #8802. Fixes: #8784. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2024-01-11 14:11:31 +00:00
James O. D. Hunt	1c0df670af	runtime-rs: ch: Add minimal implementation of hypervisor metrics method Remove the `todo!()` macro which would cause a runtime crash and replace it with an implementation that returns an error as a stop-gap until #8800 is implemented. Fixes: #8785. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2024-01-11 14:11:01 +00:00
alex.lyn	b97efc3139	CI: enable test container memory update for dragonball Fixes: #8746 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2024-01-11 19:07:33 +08:00
alex.lyn	6c85e95c34	CI: bugfix for dragonball when CI running with cri-containerd A wrong setting in the containerd runtime options caused it to fail. Correct it as below: ... [plugins.cri.containerd.runtimes.${runtime}.options] ConfigPath= "${KATA_CONFIG_PATH}" ... Fixes: #8746 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2024-01-11 17:35:33 +08:00
alex.lyn	cd59d31a15	CI: make CI work for dragonball to test stability and cri-containerd It needs to remove the skip setting, and make it work for dragonball. Fixes: #8746 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2024-01-11 17:35:13 +08:00
Hyounggyu Choi	f62ec0a7f5	Merge pull request #8693 from BbolroC/ibm-se-config-validation-fix runtime: Allow no initrd path for IBM Z Secure Execution	2024-01-11 09:53:51 +01:00
Xuewei Niu	70305fefc5	Merge pull request #8780 from justxuewei/containerd-events runtime-rs: Forward events to containerd via ttrpc	2024-01-11 14:58:14 +08:00
Xuewei Niu	6fd49f7604	runtime-rs: Forward events to containerd via ttrpc Forwarding events via the containerd CLI is rather heavy for runtime-rs compared with the ttrpc way. Plus, for runtimes that don't have this mechanism, e.g. CRI-O, we can't get those events anywhere. This patch introduces two types of forwarders: - `ContainerdForwarder`: Acquire ttrpc address from environment variables and forward events via ttrpc connection. - `LogForwarder`: Write event info into logs. Fixes: #7881 Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2024-01-11 10:32:50 +08:00
GabyCT	a8be3d0450	Merge pull request #8796 from GabyCT/topic/uruncv versions: Update runc version	2024-01-10 14:16:20 -06:00
Gabriela Cervantes	e69f7c07a7	versions: Update runc version This PR updates the runc version to 1.1.11 which includes the following improvements - Fix several issues with userns path handling. - Support memory.peak and memory.swap.peak in cgroups v2. Add swapOnlyUsage in MemoryStats. This field reports swap-only usage. For cgroupv1, Usage and Failcnt are set by subtracting memory usage from memory+swap usage. For cgroupv2, Usage, Limit, and MaxUsage are set. - build(deps): bump github.com/cyphar/filepath-securejoin. Fixes #8795 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-01-10 16:46:08 +00:00
Greg Kurz	0c37aec7dc	Merge pull request #8753 from fidencio/topic/add-confidential-artefacts TEEs: Introduce kernel-confidential	2024-01-10 16:59:57 +01:00
Alex.Lyn	695440a431	Merge pull request #8749 from Apokleos/fixup-dragonball-vfio runtime-rs: fixup vfio device in runtime-rs/dragonball	2024-01-10 15:20:34 +08:00
Dan Mihai	de61b4d4e2	Merge pull request #8772 from microsoft/danmihai1/wait-for-delete tests: list the current k8s pods	2024-01-09 13:45:55 -08:00
Fabiano Fidêncio	c3f6eaa267	build-kernel: Fix typo 'terball' -> 'tarball' SSIA. :-) Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-01-09 14:35:45 -03:00
Fabiano Fidêncio	8b2f43a2c2	build: Add "confidential" kernel We're using a Kernel based on v6.7, which should include all the patches needed for SEV / SNP / TDX. By doing this, later on, we'll be able to stop building the specific kernel for each one of the targets we have for the TEEs. Let's note that we've introduced the "confidential" target for the kernel builder script, while the TEE specific builds are being kept as they are -- at least for now. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-01-09 14:35:45 -03:00
Jianyong Wu	379e2f3da2	kernel: update some configs based on kernel 6.5 and 6.6 There are lots of configs removed from latest kernel. Update them here for convenience of next kernel upgrade. Remove CONFIG_SECURITY_SELINUX_CHECKREQPROT_VALUE [1] Remove CONFIG_IP_NF_TARGET_CLUSTERIP [2] Remove CONFIG_NET_SCH_CBQ [3] Remove CONFIG_AUTOFS4_FS [4] Remove CONFIG_EMBEDDED [5] [1] https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?h=v6.6&id=a7e4676e8e2cb158a4d24123de778087955e1b36 [2] https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?h=v6.6&id=9db5d918e2c07fa09fab18bc7addf3408da0c76f [3] https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?h=v6.6&id=051d442098421c28c7951625652f61b1e15c4bd5 [4] https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?h=v6.6&id=1f2190d6b7112d22d3f8dfeca16a2f6a2f51444e [5] https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?h=v6.6&id=ef815d2cba782e96b9aad9483523d474ed41c62a Fixes: #8408 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2024-01-09 14:35:45 -03:00
Fabiano Fidêncio	cf4835e3ae	packaging: qemu: Simplify "--disable-virtiofsd" logic As all the supported architectures are disabling the virtiofsd build, there's no need to keep the switch statement there. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-01-09 14:35:45 -03:00
Fabiano Fidêncio	bfc6fc7a85	build: Get rid of QEMU experimental We've not been building QEMU experimental for a very long time, and the entry there has only been serving the purpose to clutter the versions.yaml (in the best case scenario) or even confuse new contributors to the project. Mind that the machinery to build the QEMU experimental is not touched, and that's used to build the TEE-capable artefacts. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-01-09 14:35:45 -03:00
GabyCT	4ac5f13722	Merge pull request #8789 from GabyCT/topic/installimagestress tests: Add check images as part of install dependencies	2024-01-09 09:28:13 -06:00
GabyCT	393edf380a	Merge pull request #8778 from GabyCT/topic/fixin packaging: Fix indentation of build static stratovirt	2024-01-09 09:27:52 -06:00
Greg Kurz	e3611cf27d	Merge pull request #8326 from cheriL/8325/fix_method_param agent: use method params instead of const params in functions	2024-01-09 07:35:19 +01:00
Gabriela Cervantes	24fab19f6f	tests: Remove check images function from stressng test This PR removes the check images function from the stressng test, as it will now be part of the install dependencies function from the gha-run script. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-01-08 17:40:39 +00:00
Gabriela Cervantes	aceba94d95	tests: Add check images as part of install dependencies To avoid random failures while trying to build and install the stressng image, this PR moves that step as part of the install dependencies in order to move the stability tests and avoid timeouts. Fixes #8787 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-01-08 17:38:14 +00:00
Pavel Mores	0cfb2d2570	runtime-rs: add simple Persist implementation for Qemu This is not necessarily meant to work, just to stub out unimplemented functionality while focusing on more fundamental things. Signed-off-by: Pavel Mores <pmores@redhat.com>	2024-01-08 13:12:39 +01:00
Pavel Mores	45862aeec0	runtime-rs: add default rootfs type for qemu Make sure that rootfs type is known early on even if it's not set in configuration.toml. Signed-off-by: Pavel Mores <pmores@redhat.com>	2024-01-08 13:12:39 +01:00
Gabriela Cervantes	7d41c97f60	packaging: Fix indentation of build static stratovirt This PR fixes the indentation of the build static stratovirt script for kata containers. Fixes #8777 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-01-05 18:06:08 +00:00
Dan Mihai	90c782f928	tests: list the current k8s pods Log the list of the current pods between tests because these pods might be related to cluster nodes occasionally running out of memory. Fixes: #8769 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-01-05 16:41:43 +00:00
Xuewei Niu	192c6ee9c3	Merge pull request #8773 from justxuewei/dbs-k8s-fragile	2024-01-05 12:54:32 +08:00
Xuewei Niu	0e9d73fe30	agent: Fix an issue reporting OOM events by mistake The agent registers an event fd in `memory.oom_control`. An OOM event is forwarded to containerd when the event is emitted, regardless of the content in that file. I observed content indicating that events should not be forwarded, as shown below. When `oom_kill` is set to 0, it means no OOM has occurred. Therefore, it is important to check the content to avoid mistakenly forwarding OOM events. ``` oom_kill_disable 0 under_oom 0 oom_kill 0 ``` Fixes: #8715 Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2024-01-05 11:06:37 +08:00
Dan Mihai	b18f269ccf	Merge pull request #8735 from microsoft/danmihai1/set-policy agent: hold lock while setting new policy	2024-01-04 13:28:21 -08:00
GabyCT	5ea07c2b3e	Merge pull request #8776 from GabyCT/topic/addextraqemu tests: Add hypervisor component to kill kata components function	2024-01-04 14:29:52 -06:00
Gabriela Cervantes	4ad1971a0a	tests: Add hypervisor component to kill kata components function This PR adds the qemu-experimental hypervisor in the function to kill kata components. Fixes #8775 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-01-04 17:07:12 +00:00
stevenhorsman	6bac3323be	workflows: Update backport-label to use gh-utils.sh - hub is deprecated, so use the new gh-utils.sh script that wraps the github cli instead Fixes: #8125 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-01-04 16:48:34 +00:00
stevenhorsman	0d5d1c8c36	ci: Add gh-util.sh script - The hub tool is now deprecated, so introduce a new alternative to `hub-util.sh` https://github.com/kata-containers/.github/blob/main/scripts/hub-util.sh that works with it. Initially I've only started with the couple of commands that we use regularly, but we can extend it in future. - Expects jq to be installed and `gh` to be installed and set up (see [1]) - Now that we don't have lots of repos, I've moved it into `kata-containers` rather than `.github`, so it is more visible. Fixes: #8125 [1] https://docs.github.com/en/github-cli/github-cli/quickstart#prerequisites Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-01-04 16:48:34 +00:00
Dan Mihai	7d5336aca3	agent: hold lock while setting new policy Don't release the lock between is_allowed and set_policy calls, because the policy might change in between these calls. Also, move more policy code into policy.rs. Fixes: #8734 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-01-04 16:45:30 +00:00
GabyCT	f056ffe5ef	Merge pull request #8759 from fadecoder/update_docs_for_stratoVirt_VMM docs: Update docs for new StratoVirt VMM introduction	2024-01-04 10:39:37 -06:00
GabyCT	4f9ee7b31c	Merge pull request #8766 from GabyCT/topic/improvedeleteion metrics: Improve iperf3 cleanup	2024-01-04 10:38:33 -06:00
Xuewei Niu	b5a6e74cdf	Merge pull request #8744 from justxuewei/vhu-net-compile dragonball: Fix compilation issue without all net features	2024-01-04 19:02:55 +08:00
Xuewei Niu	db948f685d	Merge pull request #8757 from justxuewei/upgrade-containerd-shim-protos runtime-rs\|agent\|protocols\|agent-ctl: Bump ttrpc and containerd-shim-protos versions	2024-01-04 19:02:42 +08:00
soup	7c176a62fe	agent: use method params instead of const params in functions Fixes: #8325 Signed-off-by: soup <lqh348659137@outlook.com>	2024-01-04 09:29:29 +01:00
Xuewei Niu	f97f16a44a	agent-ctl: Bump ttrpc version - `ttrpc` from `0.7.1` to `0.8`. Fixes: #8757 Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2024-01-04 15:58:34 +08:00
Xuewei Niu	bf59c7b3d4	runtime-rs: Bump ttrpc and containerd-shim-protos versions - `ttrpc` from `0.7.1` to `0.8`. - `containerd-shim-protos` from `0.3.0` to `0.6.0`. Fixes: #8756 Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2024-01-04 15:58:34 +08:00
Xuewei Niu	cf9a0e21a1	protocols: Bump ttrpc version - `ttrpc` from `0.7.1` to `0.8`. Fixes: #8756 Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2024-01-04 15:58:34 +08:00
Xuewei Niu	91360e7ddb	agent: Bump ttrpc version - `ttrpc` from `0.7.1` to `0.8`. Fixes: #8756 Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2024-01-04 15:58:34 +08:00
Chao Wu	0f532175fe	Merge pull request #8771 from openanolis/chao/fix_ut dbs-pci: introduce Cargo.lock to prevent the influence from upstream	2024-01-04 15:14:22 +08:00
Zhigang Wang	44b5b88f4c	docs: Update docs for new StratoVirt VMM introduction As the StratoVirt VMM has been added, we can update the docs and add an introduction to StratoVirt, so that users can learn more about the hypervisor choices. Fixes: #8645 Signed-off-by: Zhigang Wang <wangzhigang17@huawei.com> Signed-off-by: Liu Wenyuan <liuwenyuan9@huawei.com>	2024-01-04 14:26:48 +08:00
Chao Wu	f1235ddba3	dbs_virtio_devices: add Cargo.lock In order to avoid rust-vmm upstream change breaks Dragonball compilation, we introduce Cargo.lock to dbs crates. fixes: #8770 Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2024-01-04 11:23:30 +08:00
Chao Wu	02cd726bfc	dbs-utils: add Cargo.lock In order to avoid rust-vmm upstream change breaks Dragonball compilation, we introduce Cargo.lock to dbs crates. fixes: #8770 Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2024-01-04 11:17:45 +08:00
Chao Wu	97bdc1529b	dbs-pci: introduce Cargo.lock As reported in #8767, we have found that the root cause is that rust-vmm's vmm-sys-utils introduced a new release 0.12.1, and dbs-pci relies on rust-vmm's vfio-ioctls, which uses >= to declare vmm-sys-utils, so it automatically upgrades vmm-sys-utils to 0.12.1. That's how two different versions of vmm-sys-utils are introduced, and this breaks the compilation. In order to fix this and also avoid future problems, we introduce a Cargo.lock file to the dbs crates. fixes: #8770 Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2024-01-04 11:11:56 +08:00
Gabriela Cervantes	4bc67dba08	metrics: Improve iperf3 cleanup This PR improves the iperf3 cleanup to ensure all the components are being deleted properly to avoid the random failures of leaving the iperf3 clients on the kata metrics CI. Fixes #8765 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-01-03 17:14:38 +00:00
alex.lyn	d2080fd221	runtime-rs: refactor getting the vfio device guest pci path Fixes: #8748 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2024-01-02 14:28:34 +08:00
alex.lyn	d795fcfc2f	runtime-rs: bridge the vfio device between runtime-rs and dragonball Previously, Dragonball did not support PCI device hot-plugging or VFIO device passthrough. Therefore, the runtime-rs support for Dragonball was incomplete. It is time to complete it so that users can use Dragonball's PCI hot-plugging and VFIO passthrough capabilities. Fixes: #8748 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2024-01-02 14:28:10 +08:00
Chao Wu	67b91c1eb3	Merge pull request #8740 from openanolis/upstream/pci-6-final Dragonball: add pci vfio passthrough, hot(un)plug support	2023-12-29 01:58:32 +08:00
Chao Wu	71c322c293	runtime-rs: fix ci complaints The vfio commits introduce quite a lot of changes in runtime-rs; this commit covers all the changes related to ci, including compilation errors and so on. Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2023-12-28 23:34:41 +08:00
Chao Wu	f9e0a4bd7e	upcall: introduce pci device add & del kernel patch Add the pci add and del guest kernel patch as an extension on the upcall device manager server side. Also, bump the config version to 120 since we need to add config for dragonball pci in upcall. fixes: #8741 Signed-off-by: Gerry Liu <gerry@linux.alibaba.com> Signed-off-by: Helin Guo <helinguo@linux.alibaba.com> Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2023-12-28 16:21:30 +08:00
Chao Wu	a3f7601f5a	dragonball: add pci hotplug / hot-unplug support Introduce two new vmm actions to implement pci hotplug and pci hot-unplug: PrepareRemoveHostDevice and RemoveHostDevice. PrepareRemoveHostDevice calls upcall to unregister the pci device in the guest kernel. RemoveHostDevice should be called after PrepareRemoveHostDevice; it is used to clean up the PCI resources on the Dragonball side. fixes: #8741 Signed-off-by: Gerry Liu <gerry@linux.alibaba.com> Signed-off-by: Zizheng Bian <zizheng.bian@linux.alibaba.com> Signed-off-by: Zha Bin <zhabin@linux.alibaba.com> Signed-off-by: Helin Guo <helinguo@linux.alibaba.com> Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2023-12-28 16:08:31 +08:00
Chao Wu	0f402a14f9	dragonball: add InsertHostDevice vmm action Introduce a new vmm action InsertHostDevice to passthrough host pci devices like NIC or GPU devices into guest so that users could have high performance usage of those devices. fixes: #8741 Signed-off-by: Gerry Liu <gerry@linux.alibaba.com> Signed-off-by: Zizheng Bian <zizheng.bian@linux.alibaba.com> Signed-off-by: Zha Bin <zhabin@linux.alibaba.com> Signed-off-by: Helin Guo <helinguo@linux.alibaba.com> Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2023-12-28 16:04:22 +08:00
Xuewei Niu	4c023e341c	dragonball: Fix compilation issue without all net features Combinations of network features were tested: - None - virtio-net - vhost-net - vhost-user-net - virtio-net,vhost-net - vhost-net,vhost-user-net - virtio-net,vhost-user-net - virtio-net,vhost-net,vhost-user-net Fixes: #8742 Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-12-28 11:37:26 +08:00
Alex.Lyn	990a3adf39	Merge pull request #8618 from Apokleos/csi-for-directvol runtime-rs: Add dedicated CSI driver for DirectVolume support in Kata	2023-12-27 21:27:29 +08:00
Chao Wu	cbd4481bc1	Merge pull request #7489 from Apokleos/pci_path runtime-rs: add pci topology for pci devices	2023-12-27 18:52:06 +08:00
alex.lyn	ea69c17008	runtime-rs: initialize pcie topology in Device Manager Add a pcie_topology field to DeviceManager and initialize pcie_topology when ResourceManager calls DeviceManager's new() with TopologyConfigInfo. Fixes: #7218 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-12-27 15:57:23 +08:00
alex.lyn	b42548b8e1	runtime-rs: do unregister device in Trait Device/detach Fixes: #7218 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-12-27 15:53:18 +08:00
alex.lyn	0f0b6d13c9	runtime-rs: do register/update device in Trait Device/attach Before calling the device driver to attach a device, register the device to PCIe topology and allocate a PciPath for it. However, for some hypervisor such as CLH, the allocation is invalid when plugging devices to VM, they have the ability to return DeviceInfo containing PciPath. It'll update the PciPath with the returned pci path in the PCIe topology for them to prevent the inferred pcipath from being different from the actual value returned. But the update will not be executed if the pcipath value doesn't change. Fixes: #7218 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-12-27 15:49:18 +08:00
alex.lyn	ce7d363695	runtime-rs: Introduce helper macros to simplify PCIe device ops Introduce helper macros to simplify PCIe device register/unregister and update, which provides a convenient way to handle devices in topology. Fixes: #7218 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-12-27 15:43:58 +08:00
alex.lyn	0d4992b24d	runtime-rs: add one more argument in Device attach/detach Add one more argument with type &mut Option<&mut PCIeTopology> in attach and detach to introduce methods within PCIe Topology. Fixes: #7218 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-12-27 15:40:01 +08:00
alex.lyn	b425de6105	runtime-rs: implement Trait PCIeDevice for pcie/pci device Implement Trait PCIeDevice register/unregister for pcie/pci device, such as vfio device which needs set/get device's pci path for kata agent's device handler. Fixes: #7218 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-12-27 15:33:08 +08:00
alex.lyn	87e39cd1f6	runtime-rs: introduce Trait PCIeDevice to do [un]register device Introduce Trait PCIeDevice with register/unregister, which are used to register or unregister pcie device within the PCIe topology. Fixes: #7218 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-12-27 15:29:35 +08:00
alex.lyn	6ebc4884fa	runtime-rs: introduce PCIe Topology framework for pcie/pci devices Due to different ways that different VMMs handle PCI devices, we expect to provide a general PCIe topology processing framework that is as compatible as possible with VMMs such as dragonball, qemu, clh (though it has its own management method, there is no conflict). Currently, it's mainly developed for kinds of PCIe/PCI devices in dragonball/clh which are attached on the pci/pcie root bus directly. More will be added when Qemu is ready in runtime-rs. Fixes: #7218 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-12-27 15:29:25 +08:00
alex.lyn	88839026b9	runtime-rs: introduce TopologyConfigInfo to initialize pcie topology A TopologyConfigInfo added to store device config info for PCIe/PCI devices in the VM from Hypervisor DeviceInfo. And TopologyConfigInfo::new will be the entry to initialize PCIe Topology for each VM. Fixes: #7218 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-12-27 15:21:53 +08:00
Fabiano Fidêncio	35f88dfc93	Merge pull request #8733 from fidencio/topic/fix-shim-check-for-snapshotter-configration kata-deploy: Fix shim check for snapshotter configuration	2023-12-27 03:30:53 -03:00
Chao Wu	8895cb82df	Merge pull request #8724 from openanolis/chao/add_vfio dragonball: introduce vfio support	2023-12-27 11:40:53 +08:00
Xuewei Niu	43a627c96f	Merge pull request #8632 from adamqqqplay/support-vhost-user-blk dragonball: introduce vhost-user-blk device	2023-12-27 09:54:21 +08:00
Chao Wu	2f797a6eb7	pci: rename 2 parameters to follow rust naming convention PciCapabilityID -> PciCapabilityId PciBarRegionType::IORegion -> PciBarRegionType::IoRegion Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2023-12-26 23:28:47 +08:00
Chao Wu	9c13b2c990	dragonball: introduce vfio support vfio mod collects lots of information related to the vfio operations, including VfioMsi and VfioMsix capability & state, vfio interrupt info, pci region info and vfio pci device info & state. fixes: #8722 Signed-off-by: Gerry Liu <gerry@linux.alibaba.com> Signed-off-by: Zizheng Bian <zizheng.bian@linux.alibaba.com> Signed-off-by: Shifang Feng <fengshifang@linux.alibaba.com> Signed-off-by: Yang Su <yang.su@linux.alibaba.com> Signed-off-by: Zha Bin <zhabin@linux.alibaba.com> Signed-off-by: Xin Lin <jingshan@linux.alibaba.com> Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2023-12-26 23:28:43 +08:00
alex.lyn	8779fe7dd5	runtime-rs: create a reference that directs users to kata csi doc Fixes: #8602 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-12-26 20:36:34 +08:00
alex.lyn	ba5437382a	runtime-rs: add examples about Kata pod with directvol by CSI. Fixes: #8602 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-12-26 20:36:34 +08:00
alex.lyn	c6d2a32146	runtime-rs: add support for directvol csi deploy scripts. Fixes: #8602 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-12-26 20:36:34 +08:00
alex.lyn	25d8e83e43	runtime-rs: Add dedicated CSI driver for DirectVolume support in Kata Bridge the gap between user requirements for direct block device access and the DirectVolume capabilities provided by Kata runtimes (kata-runtime/runtime-rs), and facilitate seamless integration with CSI to improve user experience. It aims to integrate DirectVolume CSI support into Kata, enabling users to benefit from its performance and flexibility advantages. Fixes: #8602 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-12-26 20:36:22 +08:00
Fabiano Fidêncio	6ee7fb5402	kata-deploy: Double quote the snapshotter name Otherwise `jq` will complain about: ```sh jq: error: nydus/0 is not defined at <top-level>, line 1: .plugins."io.containerd.grpc.v1.cri".containerd.runtimes."kata-clh".snapshotter=nydus jq: 1 compile error ``` Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-12-26 09:14:36 -03:00
Qinqi Qu	81ab174c16	dragonball: support vhost-user-blk in device manager This patch introduces a feature of supporting vhost-user-blk device. Fixes: #8631 Signed-off-by: Qinqi Qu <quqinqi@linux.alibaba.com>	2023-12-26 20:02:38 +08:00
Qinqi Qu	ef8dc3b0ce	dragonball: support vhost-user-blk This patch introduces a feature of supporting vhost-user-blk device. This device needs to be defined before the VM instance is started, which can be done through the dbs-cli tool with --virblks option: --virblks '{ "drive_id": "8623", "device_type": "Spdk", "path_on_host": "spdk:///var/tmp/vhost.sock", "is_root_device": false, "is_read_only": false, "is_direct": false, "no_drop": false, "num_queues": 1, "queue_size": 256 }' Fixes: #8631 Signed-off-by: Eric Ren <renzhen@linux.alibaba.com> Signed-off-by: fupan <fupan.lfp@antgroup.com> Signed-off-by: Liu Jiang <gerry@linux.alibaba.com> Signed-off-by: Qinqi Qu <quqinqi@linux.alibaba.com>	2023-12-26 20:02:32 +08:00
Fabiano Fidêncio	8332f3c684	kata-deploy: Fix the snapshotter config placement In the way the script is without this patch, we're trying to set ```toml [`$shim`] snapshotter = $snapshotter ``` However, what we actually want to set is the full runtime table instead of shim. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-12-26 08:26:38 -03:00
Fabiano Fidêncio	907f1ddb9e	kata-deploy: Fix shim check for snapshotter configuration We want to check whether the shim is part of the "plain text" shims passed to the daemonset (meaning, checking against `$SHIMS`). Before this fix we were checking against `$shims`, which is an array of shims instead of a string, resulting in a broken check. Fixes: #8732 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-12-26 07:42:36 -03:00
Tim Zhang	a4ad12a3d1	Merge pull request #8729 from liubin/fix/package-kata-monitor kata-monitor: fix Dockerfile to build image	2023-12-26 18:30:15 +08:00
alex.lyn	3b317e69e2	runtime-rs: add README and user guide to deploy directvol CSI Driver Fixes: #8602 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-12-26 18:00:35 +08:00
Bin Liu	23eb3042c7	kata-monitor: fix Dockerfile to build image move `SKIP_GO_VERSION_CHECK` after `make` command to skip checking golang version. And also upgrade golang to 1.19. Fixes: #8728 Signed-off-by: Bin Liu <bin@hyper.sh>	2023-12-26 15:11:13 +08:00
Xuewei Niu	1065ca6fa7	Merge pull request #8626 from justxuewei/vhost-user-endpoint	2023-12-26 12:52:21 +08:00
Xuewei Niu	36a4cbccf6	runtime-rs: Expand all DeviceType in match arms The compiler will give a warning if a developer forgets to add an arm for a newly defined variant. Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-12-26 10:18:59 +08:00
Xuewei Niu	f2d08bc00f	runtime-rs: Remove unused index from Endpoints The affected `Endpoint`s are `VhostUserEndpoint` and `TapEndpoint`. Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-12-26 10:18:59 +08:00
Xuewei Niu	60a42351e2	runtime-rs: DAN supports vhost-user-net device DAN reads vhost-user-net device from JSON config. It only supports VMM running as server right now. Fixes: #8625 Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-12-26 10:18:59 +08:00
Xuewei Niu	693a0cfbfd	dragonball: Make vhost-user-net ready for VhostUserEndpoint The changes involve: - Expose VhostUserConfig struct to runtime-rs. - Set a default value while num_queues or queue_size are 0. Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-12-26 10:18:59 +08:00
Xuewei Niu	54df832407	runtime-rs: Support VhostUserEndpoint This commit introduces VhostUserEndpoint and adds the related support for vhost-user-net devices to the device manager. For now, Dragonball is able to attach vhost-user-net devices. Fixes: #8625 Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-12-26 10:18:50 +08:00
Xuewei Niu	374c2f01aa	runtime-rs: Simplify VhostUserType enum Remove unused string parameter from each item. Fixes: #8625 Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-12-25 16:15:57 +08:00
Xuewei Niu	38eb4077a6	Merge pull request #8503 from justxuewei/vhost-user-net dragonball: Support vhost-user-net device	2023-12-25 13:47:51 +08:00
Xuewei Niu	4c5de72863	dragonball: Wrap config space into `set_config_space` Config space of network device is shared and accord with virtio 1.1 spec. It is a good way to abstract the common part into one function. `set_config_space()` implements this. Plus, this patch removes `vq_pairs` from vhost-net devices, since there is a possibility of data inconsistency. For example, some places read that from `self.vq_pairs`, others read from `queue_sizes.len() / 2`. Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-12-25 10:47:34 +08:00
Alex.Lyn	3a3f39aa2d	Merge pull request #8668 from Apokleos/pci-path-refactor runtime-rs: Refactor the code related to PCI paths and VFIO device driver initialize in DM.	2023-12-23 21:44:07 +08:00
Steve Horsman	1afce09858	Merge pull request #8721 from stevenhorsman/kata-deploy-typos kata-deploy: snapshotter typo fixes	2023-12-22 21:26:03 +00:00
stevenhorsman	4a95c0d07f	kata-deploy: snapshotter typo fixes - Add spaces so that the if statements are valid Fixes: #8720 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2023-12-22 16:32:02 +00:00
Dan Mihai	080541a0f2	genpolicy: add SPDX license header Add SPDX license header to rules.rego. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2023-12-22 15:35:05 +00:00
Saul Paredes	7f126be67e	genpolicy: Update oci_distribution to 0.10.0 Also support alternative media type and update samples Signed-off-by: Saul Paredes <saulparedes@microsoft.com> Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2023-12-22 15:35:05 +00:00
Dan Mihai	9eb6fd4c24	docs: add agent policy and genpolicy docs Add docs for the Agent Policy and for the genpolicy tool. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2023-12-22 15:35:05 +00:00
Dan Mihai	57f93195ef	genpolicy: add support for StatefulSet YAML input Generate policy for K8s StatefulSet YAML. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2023-12-22 15:35:05 +00:00
Dan Mihai	35958ec9cc	genpolicy: add support for ReplicationController Generate policy for K8s ReplicationController YAML. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2023-12-22 15:35:05 +00:00
Dan Mihai	7da17099f2	genpolicy: add support for ReplicaSet YAML input Generate policy for K8s ReplicaSet YAML. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2023-12-22 15:35:05 +00:00
Dan Mihai	d84300f1ee	genpolicy: add support for List YAML input Generate policy for K8s List YAML. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2023-12-22 15:35:05 +00:00
Dan Mihai	a03452637b	genpolicy: add support for Job YAML input Generate policy for K8s Job YAML. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2023-12-22 15:35:05 +00:00
Dan Mihai	2dbd01c80b	genpolicy: add support for Deployment YAML input Generate policy for K8s Deployment YAML. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2023-12-22 15:35:05 +00:00
Dan Mihai	a40a6003d0	genpolicy: add support for DaemonSet YAML input Generate policy for K8s DaemonSet YAML. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2023-12-22 15:35:05 +00:00
Dan Mihai	48829120b6	policy: initial genpolicy commit Add application that infers K8s user's intentions based on user's K8s YAML file, and generates a Rego/OPA based policy for that YAML. Just Pod YAML files are supported as input using this initial source code. Support for other types of YAML files will come with upcoming commits. Fixes: #7673 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2023-12-22 15:35:05 +00:00
Chao Wu	555136c1a5	Merge pull request #8662 from openanolis/pci/4-upstream dragonball: introduce pci msi/msix interrupt	2023-12-22 18:08:31 +08:00
Steve Horsman	c5f939cdc1	Merge pull request #8655 from fidencio/topic/kata-deploy-add-snapshotter-support kata-deploy: Allow setting up snapshotters per runtime handler	2023-12-22 09:16:07 +00:00
Chao Wu	8cf3bcefd8	dragonball: introduce pci msi/msix interrupt introduce msi/msix mod to maintain information for PCI Message Signalled Interrupt Extended Capability. It will be initialized when parsing pci configuration space and used when getting interrupt capabilities. fixes: #8661 Signed-off-by: Gerry Liu <gerry@linux.alibaba.com> Signed-off-by: Zizheng Bian <zizheng.bian@linux.alibaba.com> Signed-off-by: Shifang Feng <fengshifang@linux.alibaba.com> Signed-off-by: Yang Su <yang.su@linux.alibaba.com> Signed-off-by: Zha Bin <zhabin@linux.alibaba.com> Signed-off-by: Xin Lin <jingshan@linux.alibaba.com> Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2023-12-22 16:28:22 +08:00
Xuewei Niu	beadce54c5	dragonball: Support vhost-user-net devices This PR introduces vhost-user-net devices to Dragonball. The devices are allowed to run as server on the VMM side. Fixes: #8502 Signed-off-by: Eric Ren <renzhen@linux.alibaba.com> Signed-off-by: Liu Jiang <gerry@linux.alibaba.com> Signed-off-by: Zha Bin <zhabin@linux.alibaba.com> Signed-off-by: Chao Wu <chaowu@linux.alibaba.com> Signed-off-by: Zizheng Bian <zizheng.bian@linux.alibaba.com> Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-12-22 14:53:18 +08:00
Xuewei Niu	1f21d3cb2c	dragonball: Introduce address space for MmioV2DeviceState Vhost-user-net has a dependency on address space from `MmioV2DeviceState`. The addition of the address space is introduced in this patch. Plus, it makes sure all unit tests have the according parameter as well. Fixes: #8502 Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-12-22 14:53:18 +08:00
Fupan Li	dc9a0ac8ce	Merge pull request #8718 from justxuewei/enable-vhost tests: Load vhost modules explicitly while Kata installing	2023-12-22 14:52:49 +08:00
Xuewei Niu	206ed6d77d	tests: Load vhost modules explicitly while Kata installing The default network backend of runtime-rs with Dragonball is vhost-net after #8609 was merged. The tests might fail if vhost modules are not loaded. Fixes: #8717 Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-12-22 11:07:37 +08:00
alex.lyn	94c83cea84	runtime-rs: Refactor vfio driver implementation It's important to ensure that the tasks which set up vfio devices are completed before add_device. So move the vfio device setup code to a dedicated method called at device building time, which does not affect the behavior of other code. This change makes it easier to understand the difference between create and attach, and also makes the boundaries clearer. Fixes: #8665 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-12-22 10:37:40 +08:00
alex.lyn	82d3cfdeda	runtime-rs: Make VhostUserConfig's field pci_path type more specific Make VhostUserConfig pci_path's type more specific, change it from Option<String> to Option<PciPath>. Fixes: #8665 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-12-22 10:35:38 +08:00
alex.lyn	5cc2890a10	runtime-rs: refactor and re-implement pci path. Do refactor and re-implement to make the pci path more "rusty". Fixes: #8665 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-12-22 10:34:41 +08:00
Fabiano Fidêncio	32e1ba2525	Merge pull request #8714 from cmaf/libsh-update-loc tests: Use function from Kata repo	2023-12-21 12:30:31 -03:00
Fabiano Fidêncio	6cc6ca5a7f	kata-deploy: Allow setting up snapshotters per runtime handler Since containerd 1.7.0 we can easily set a specific snapshotter to be used with a runtime handler, and we should take advantage of this, mostly as it'll help setting up any runtime using devmapper or nydus snapshotters. This implementation here has a few caveats: * The format expected for the SNAPSHOTTER_HANDLER_MAPPING is: `shim:snapshotter,shim:snapshotter,...` * It only works with containerd 1.7 or newer * We never change the default containerd snapshotter * We don't do any check on our side to verify whether the snapshotter required is properly deployed * Users will have to add an annotation to their pods, in order to use the snapshotter set up per runtime handler * Example: ``` metadata: ... annotations: io.containerd.cri.runtime-handler: kata-fc ``` Fixes: #8615 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-12-21 07:20:10 -03:00
alex.lyn	1b5758c1f2	runtime-rs: Move the PciPath-related code to a dedicated file Move the pciPath code to a new file pci_path.rs and update the references. Fixes: #8665 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-12-21 11:35:18 +08:00
alex.lyn	275de453d5	runtime-rs: remove useless get_host_guest_map and its test case Fixes: #8665 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-12-21 11:07:56 +08:00
Chelsea Mafrica	9f394f6e18	tests: Use function from Kata repo Switch to use function from Kata repo in common.bash to reduce dependency on the tests repo. Fixes #8713 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2023-12-20 16:45:06 -08:00
Dan Mihai	d916da15dd	Merge pull request #8688 from microsoft/danmihai1/k8s-confidential tests: retry connection to pod SSH server	2023-12-20 15:01:26 -08:00
Fabiano Fidêncio	3482256340	Merge pull request #8709 from fidencio/topic/update-jq-for-kata-deploy kata-deploy: Update `jq` as part of the kata-deploy daemonset	2023-12-20 16:48:07 -03:00
James O. D. Hunt	7da6d0a845	runtime-rs: ch: Implement missing thread/pid APIs Add implementations for the following `Hypervisor` trait methods which simply return the same details as the `get_vmm_master_tid()` method: - `get_thread_ids()` - `get_pids()` Fixes: #6438. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-12-20 17:58:40 +00:00
Fabiano Fidêncio	c9e631dc0c	kata-deploy: Reapply "kata-deploy: Use tomlq to configure containerd" This reverts commit `ee5fa08a27`. This is perfectly fine to do as we narrowed down the issue to be on the version of `jq` provided by alpine, and we've already updated it in the previous commit (in this very same series). Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-12-20 12:52:41 -03:00
Fabiano Fidêncio	41320c586e	kata-deploy: Install jq from GitHub `jq` coming from alpine is in its 1.6 version, and that has a bug that hits us quite hard, as it changes a float to an int whenever the number is in the `x.0` format. One example is: ```bash / # jq --version jq-1.6 / # echo '{"foo": 1.0}' \| jq .foo 1 ``` With this in mind, let's switch, at least for now, to using the `jq` released directly on github, as it does address the issue we've been hitting. ```bash ⋊> Downloads ./jq-linux-amd64 --version jq-1.7 ⋊> Downloads echo '{"foo": 1.0}' \| jq .foo 1.0 ``` Fixes: #8678 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-12-20 12:52:41 -03:00
Greg Kurz	ce094ecdc2	Merge pull request #8679 from stevenhorsman/kata-deploy-containerd-config-fix gha: kata-deploy: Revert containerd config break	2023-12-20 12:58:56 +01:00
stevenhorsman	ee5fa08a27	Revert "kata-deploy: Use tomlq to configure containerd" This reverts commit `dd9f5b07b9`. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2023-12-20 09:10:43 +00:00
stevenhorsman	9e718b4e23	gha: kata-deploy: Add containerd status check After kata-deploy has been installed, check that the worker nodes are still in the Ready state and don't report a containerd://Unknown container runtime version, which would indicate that containerd isn't working. This ensures that we didn't corrupt the containerd config during kata-deploy's edits. Fixes: #8678 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2023-12-20 09:10:43 +00:00
Archana Shinde	7e5868a55f	Merge pull request #8588 from amshinde/runtime-rs-update-readme runtime-rs: Update readme to indicate cloud-hypervisor support	2023-12-19 22:09:14 -08:00
Dan Mihai	8aa390279e	tests: retry connection to pod SSH server To become more resilient against these kinds of errors: deployment.apps/confidential-unencrypted created pod/confidential-unencrypted-c5fdd6964-rrb6q condition met ssh: connect to host 10.42.0.109 port 22: Connection refused Fixes: #8687 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2023-12-20 02:48:05 +00:00
GabyCT	5504176e9a	Merge pull request #8699 from GabyCT/topic/fixconfidentialscript tests: k8s: Fix indentation in confidential common script	2023-12-19 16:01:28 -06:00
Dan Mihai	6cea8a5f2a	Merge pull request #8697 from microsoft/danmihai1/runk tests: additional run-runk logging	2023-12-19 11:27:29 -08:00
Dan Mihai	551a50cd72	tests: additional run-runk logging Add logging to run-runk, for debugging possible failures. Fixes: #8696 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2023-12-19 14:08:01 +00:00
Hyounggyu Choi	540a2a7fb1	runtime: Allow no initrd path for IBM Z Secure Execution This is to reintroduce a configuration rule for IBM Z Secure Execution, where no initrd path should be configured. For the TEE of interest, only a kernel image should be specified with `confidential_guest=true`. Fixes: #8692 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2023-12-19 11:21:16 +01:00
Xuewei Niu	ec30d5a9a8	Merge pull request #8700 from justxuewei/dbs-ut dragonball: Trigger unit tests of dbs_* subcrates by `make test`	2023-12-19 17:51:20 +08:00
Xuewei Niu	039fe7f391	dragonball: Trigger unit tests of dbs_* subcrates by `make test` `make SUPPORT_VIRTUALIZATION=1 test` iterates through all subcrates and runs their tests. Plus, this patch fixes some issues with unit tests: - Too many parameters were fed to `I8042Device::new()`. - Virtqueue checks have been introduced since `virtio-queue v0.7.0`. - GHA might have no access to `/var/tmp` dir on runner. Fixes: #8690 Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-12-19 16:22:37 +08:00
Hyounggyu Choi	ceea8882db	Merge pull request #8672 from BbolroC/introduce-vsock-device-init runtime-rs: Separate init_config() from new() for struct VsockDevice	2023-12-18 22:04:37 +01:00
Gabriela Cervantes	1469a5efca	tests: k8s: Fix indentation in confidential common script This PR fixes the indentation of the confidential common script for kubernetes tests. Fixes #8698 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-12-18 20:25:06 +00:00
Chelsea Mafrica	312475508a	Merge pull request #8682 from cmaf/static-checks-update-loc ci: Use static checks from kata repo for lib functions	2023-12-18 09:53:01 -08:00
Hyounggyu Choi	3cd0cc1388	runtime-rs: Separate init_config() from new() for struct VsockDevice As a follow-up for #8516, guest_cid and vhost_fd are not necessarily initialised via new(). Instead, the fields should be initialised later when they are really used to construct hypervisor's parameters. This commit is to separate init_config() from new() to initialise guest_cid and vhost_fd and leave only the assignment of id for the existing function. Fixes: #8671 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2023-12-18 16:36:09 +01:00
Greg Kurz	2987d3eeb5	Merge pull request #8341 from jongwu/fix_cpushares agent: correct CPUShares and CPUWeight value	2023-12-18 15:40:04 +01:00
James O. D. Hunt	3c49120d2f	Merge pull request #8641 from jodh-intel/kata-ctl-add-cfg-file-cli-option kata-ctl: Add option to dump config files	2023-12-18 11:54:19 +00:00
Greg Kurz	1cfcc80018	Merge pull request #8664 from amshinde/remove-ignore-paths-ga github-actions: Remove ignore paths for required CI checks	2023-12-18 12:49:21 +01:00
Chelsea Mafrica	b785ef96ec	docs: Change location of static checks script We now use the static checks script from the main kata containers repo and not the tests repo; update documentation to reflect this. Fixes #8681 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2023-12-15 17:13:02 -08:00
Chelsea Mafrica	bfb756199f	ci: Use static checks from kata repo for lib functions Change the two functions in lib.sh to use the static checks script from the kata containers repo instead of tests. Remove cloning the repo from these functions since we don't need it anymore. Leave these two functions because the document checking one may be used locally and the static checks one is called from the virtcontainers Makefile. Fixes #8681 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2023-12-15 17:08:33 -08:00
Archana Shinde	510bc36a77	github-actions: Remove ignore paths for required CI checks If a PR contains files from the ignore-paths, these actions do not run as intended. However, the actions are marked as required. And there does not seem to be a way to mark these as non-required in that case. As a result a PR containing the files from the ignore-paths remains stalled. Hence remove the ignore-paths until github provides a way to mark actions that are skipped due to ignore-paths as non-required/passed. Fixes: #8663 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-12-15 15:12:20 -08:00
Liu Wenyuan	61fe20cf9a	gha: Fix some of gha metrics failure for StratoVirt Update the Speed & Density metric tests baseline for StratoVirt and re-enable them, and skip other metric tests temporarily. Fixes: #8656 Signed-off-by: Liu Wenyuan <liuwenyuan9@huawei.com>	2023-12-15 17:45:01 +08:00
Zhongtao Hu	0f80dc636c	Merge pull request #6876 from openanolis/memory_hotlug runtime-rs: support Memory hotplug	2023-12-15 14:28:35 +08:00
Zhongtao Hu	9a37e77f2a	runtime-rs: check the update memory size check the update memory size greater than default max memory size Fixes: #6875 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2023-12-15 11:25:34 +08:00
Zhongtao Hu	6039417104	runtime-rs: add default_maxmemory in config file add default_maxmemory in config file Fixes: #6875 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2023-12-15 10:25:20 +08:00
Zhongtao Hu	8d9fd9c067	runtime-rs: support memory resize Fixes: #6875 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2023-12-15 10:25:13 +08:00
Zhongtao Hu	81e55c424a	runtime-rs: add resize_memory trait for hypervisor Fixes: #6875 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2023-12-15 10:25:03 +08:00
Zhongtao Hu	d428a3f9b9	runtime-rs: get guest memory details get memory block size and guest mem hotplug probe Fixes: #6356 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2023-12-15 10:22:37 +08:00
GabyCT	4a49dd73db	Merge pull request #8676 from GabyCT/topic/fixins tests: k8s: Fix indentation in setup script	2023-12-14 13:57:47 -06:00
GabyCT	7a606a19c4	Merge pull request #8659 from GabyCT/topic/improvecleanuplatency metrics: Improve latency network cleanup	2023-12-14 13:57:28 -06:00
GabyCT	0831529279	Merge pull request #8644 from GabyCT/topic/updadockerresint metrics: Update TensorFlow ResNet50 Int8 Dockerfile	2023-12-14 13:56:41 -06:00
Jianyong Wu	58e88d9469	agent: correct CPUShares and CPUWeight value If cgroup driver is systemd, CPUShares, for cgroup v1, should be at least 2 [1] and CPUWeight for cgroup v2, should be at least 1 [2]. Fixes: #8340 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com> [1] `d19434fbf8/src/basic/cgroup-util.h (L122)` [2] `d19434fbf8/src/basic/cgroup-util.h (L91)`	2023-12-15 02:04:31 +08:00
Steve Horsman	04de6eb4fd	Merge pull request #8674 from ChengyuZhu6/fix_statis_check static-checks: Add some dependencies to static checks for CoCo features	2023-12-14 16:47:01 +00:00
Greg Kurz	1bd9c1b4de	Merge pull request #8589 from wvell/patch-1 Remove warning for cgroupsv2 only operating systems	2023-12-14 17:37:59 +01:00
Gabriela Cervantes	c92b14da97	tests: k8s: Fix indentation in setup script This PR fixes the indentation of the kubernetes setup script. Fixes #8675 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-12-14 16:26:22 +00:00
Amulya Meka	ac7b3d4735	Merge pull request #8667 from Amulyam24/workflow gha: add a post cleanup script for cri-containerd ppc64le workflow	2023-12-14 21:52:54 +05:30
Alex.Lyn	c7c7632203	Merge pull request #8620 from Apokleos/enhance-directv-using-csi runtime-rs: Enhancement of DirectVolume when using a dedicated CSI	2023-12-14 22:59:09 +08:00
ChengyuZhu6	dfad0e6622	.github: fix the failure without devicemapper for host sharing fix error when running checks and tests: error: failed to run custom build command for `devicemapper-sys v0.1.5` fatal error: 'libdevmapper.h' file not found thread 'main' panicked at 'Could not generate dm.h bindings: ClangDiagnostic("dm.h:2:10: fatal error: 'libdevmapper.h' file not found\n")', /home/runner/.cargo/registry/src/index.crates.io-6f17d22bba15001f/devicemapper-sys-0.1.5/build.rs:24:10 stack backtrace: 0: rust_begin_unwind at /rustc/5680fa18feaa87f3ff04063800aec256c3d4b4be/library/std/src/panicking.rs:593:5 1: core::panicking::panic_fmt at /rustc/5680fa18feaa87f3ff04063800aec256c3d4b4be/library/core/src/panicking.rs:67:14 2: core::result::unwrap_failed at /rustc/5680fa18feaa87f3ff04063800aec256c3d4b4be/library/core/src/result.rs:1651:5 3: core::result::Result<T,E>::expect 4: build_script_build::main 5: core::ops::function::FnOnce::call_once note: Some details are omitted, run with `RUST_BACKTRACE=full` for a verbose backtrace. warning: build failed, waiting for other jobs to finish... make: *** [../../utils.mk:177: standard_rust_check] Error 101 Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2023-12-14 20:47:47 +08:00
ChengyuZhu6	983479748f	.github: fix error when making checks for CoCo guest pull Fix error when making checks: ``` error: failed to run custom build command for `image-rs v0.1.0 (https://github.com/confidential-containers/guest-components?tag=v0.8.0#e849dc89)` Caused by: process didn't exit successfully: `/home/runner/work/kata-containers/kata-containers/src/ agent/target/release/build/image-rs-fd932206d09362b7/build-script-build` (exit status: 101) --- stdout cargo:rerun-if-changed=./protos/getresource.proto cargo:rerun-if-changed=./protos --- stderr thread 'main' panicked at 'Could not find `protoc` installation and this build crate cannot proceed without this knowledge. If `protoc` is installed and this crate had trouble finding it, you can set the `PROTOC` environment variable with the specific path to your installed `protoc` binary.If you're on debian, try `apt-get install protobuf-compiler` or download it from https://github.com/protocolbuffers/protobuf/releases ``` Fixes #8673 Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2023-12-14 20:47:42 +08:00
alex.lyn	aa42f0a03f	runtime-rs: Enhancement of DirectVolume when using CSI. We use a matching direct-volume path to determine whether an OCI mount is a DirectVolume. However, we should handle the case where no match is found appropriately: the OCI mount is then treated as a non-DirectVolume type rather than causing a failure. Fixes: #8619 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-12-14 18:19:03 +08:00
alex.lyn	80d631ee84	runtime-rs: Add attribute serde rename to each field of DirectVolume. The DirectVolume structure in runtime-rs differs from the one in kata-runtime, so there is no unified handling method for DirectVolumeMountInfo and MountInfo. We should align the two by simply adding the attribute #[serde(rename="x")] to each field in DirectVolumeMountInfo. Fixes: #8619 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-12-14 18:18:40 +08:00
Xuewei Niu	7f611dfe84	Merge pull request #8609 from justxuewei/runtime-rs-vhost-net dragonball: Use vhost-net device by default	2023-12-14 16:33:29 +08:00
Amulyam24	0db820fa01	gha: add a post cleanup script for cri-containerd ppc64le workflow This PR identifies and adds an action to cleanup the ppc64le self hosted runner. Fixes: #8666 Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2023-12-14 13:46:47 +05:30
Hyounggyu Choi	fbc04460f6	Merge pull request #8649 from BbolroC/put-pre-action-gha-s390x GHA: Put all the preliminary steps into pre-action for s390x	2023-12-14 07:16:17 +01:00
Xuewei Niu	82fde4431e	dragonball: Set default queue config for vhost-net device Dragonball sets a default queue config in the case of `None`. The queue_size and num_queues of vhost-net are set to `Some(0)` by default. Therefore, we might get an invalid queue config. This patch fixes this issue. Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-12-14 11:18:33 +08:00
Xuewei Niu	c11b066728	runtime-rs: Use vhost-net device by default This patch sets vhost-net as the default networking backend. It allows users to set `disable_vhost_net` to `true` to re-enable the virtio-net backend. Plus, which backend to use is a matter for the hypervisor; runtime-rs no longer needs to know that. Fixes: #8608 Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-12-14 11:18:26 +08:00
Chelsea Mafrica	6c2e2a9120	Merge pull request #8635 from cmaf/migrate-static-checks-gha static-checks: Direct Makefile to use new static checks	2023-12-13 16:00:16 -08:00
Gabriela Cervantes	8151117f73	metrics: Improve latency network cleanup This PR improves the latency network cleanup by removing the pods even if the test fails. Fixes #8658 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-12-13 17:56:01 +00:00
Fabiano Fidêncio	a998e89bcf	Merge pull request #8639 from fidencio/topic/kata-deploy-use-tomlq-to-configure-containerd kata-deploy: Use `tomlq` to configure containerd	2023-12-13 14:11:45 +01:00
Hyounggyu Choi	05e278de5b	GHA: Put all the preliminary steps into pre-action for s390x This is to introduce a pre-action to all the workflows for building artifacts. The action could take care of tasks such as cleaning up files and reinstalling packages, which prevents a workflow from getting affected by the environment. This also includes the removal of the step `Adjust a permission for repo`, because it could be incorporated into the action. Fixes: #8648 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2023-12-13 13:24:40 +01:00
Chao Wu	dfaf006fcc	Merge pull request #8564 from openanolis/chao/add_pci_root_bus_device dragonball: add pci root bus and root device	2023-12-13 17:57:16 +08:00
Fabiano Fidêncio	7ad873cf29	kata-deploy: Simplify shim configuration We never have to add a configuration for the "default" case, as we're already creating the runtime class pointing to what should be the "default" handler. This helps to simplify the logic by quite a lot. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-12-13 10:52:54 +01:00
Fabiano Fidêncio	e618949937	kata-deploy: Remove useless comment from CRI-O drop-in The comment adds absolutely nothing to the runtime handler added, and it'd make our life slightly harder to properly say which VMM is being used when setting the default `kata` handler. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-12-13 10:49:52 +01:00
Fabiano Fidêncio	dd9f5b07b9	kata-deploy: Use tomlq to configure containerd This saves us a lot of trouble on properly sed'ing content that may or may not be in the containerd configuration file. Fixes: #8638 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-12-13 10:49:49 +01:00
Fabiano Fidêncio	4f01f294bb	kata-deploy: Install `tomlq` to the base image This will help us to have an easier time playing with the containerd configuration, instead of having to sed the **** out of it, which is super error prone. `tomlq` is a tool that comes from https://github.com/kislyuk/yq, and that depends on `jq` to do the toml parsing / editing. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-12-13 10:49:07 +01:00
James O. D. Hunt	d7c6219dfe	Merge pull request #8630 from jodh-intel/runtime-rs-ch-set-state-on-vm-stop runtime-rs: ch: Change state when VM stopped	2023-12-13 09:26:30 +00:00
Xuewei Niu	855adbc63b	Merge pull request #8634 from justxuewei/disable-packed-vq dragonball: Disable packed virtqueue for vhost-user devices	2023-12-13 17:03:05 +08:00
wvell	af4622fcc1	docs: Remove warning for cgroupsv2 only operating systems Removes warning for cgroupsv2 as it is not needed anymore according to #6259. Fixes #8650 Signed-off-by: wvell <w.vellema@slash2.nl>	2023-12-13 09:18:39 +01:00
Chelsea Mafrica	b46cb22270	static-checks: Direct Makefile to use new static checks Direct the Makefile to use the static checks script in the tests directory of the main Kata Containers repo so it is run in GHA. Fixes #8595 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2023-12-12 16:43:35 -08:00
Chelsea Mafrica	63636b869c	static-checks: Update copyright dates Some copyright dates were not updated with the most recent changes to code; update them. Fixes #8595 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2023-12-12 16:34:06 -08:00
Chelsea Mafrica	b11c772865	static-checks: Change dir for building tools Change directory for running make due to local errors when building with make -C. Fixes #8595 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2023-12-12 16:34:06 -08:00
James O. D. Hunt	2a518f0898	runtime-rs: ch: Change state when VM stopped Make the CH (Cloud Hypervisor) `stop_vm()` method check the VM state before attempting to stop the VM, and update the state once the VM has stopped. This avoids the method failing if called multiple times which will happen if the workload exits before the container manager requests that the container stop. This change ensures the CH driver finishes cleanly. Fixes: #8629. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-12-12 18:25:20 +00:00
Fabiano Fidêncio	39f5cea3b1	kata-deploy: Fix k0s cri notation comment We can safely assume we're using the newer notation, not the older one. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-12-12 18:20:18 +01:00
Gabriela Cervantes	23f76653e5	metrics: Update command to run the tensorflow int8 benchmark This PR updates the command to run the tensorflow resnet50 int8 benchmark. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-12-12 16:24:09 +00:00
Gabriela Cervantes	8fd5ef7fb7	metrics: Update TensorFlow ResNet50 Int8 Dockerfile This PR updates the TensorFlow ResNet50 Int8 Dockerfile to use the proper python version for kata metrics. Fixes #8643 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-12-12 16:20:56 +00:00
James O. D. Hunt	1195692d3c	runtime-rs: ch: Move state handling to top-level APIs Move the state setting to the `Hypervisor` trait calls. This makes the code clearer. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-12-12 15:25:27 +00:00
James O. D. Hunt	5637f11a8c	kata-ctl: Add option to dump config files Add a `--show-default-config-paths` command line option for parity with `kata-runtime`. Note that this requires the `KataCtlCli.command` to be optional so that the user can run simply: ```bash $ kata-ctl --show-default-config-paths ``` ... without also specifying a (sub-)command. Fixes: #8640. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-12-12 14:20:04 +00:00
Chelsea Mafrica	a9d360728e	static-checks: Fix directory for github labels Fix paths for yqdir (where the install_yq.sh script currently is) so that static checks can run without error. Fixes #8595 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2023-12-12 02:16:35 -08:00
Xuewei Niu	86918e91b3	dragonball: Disable packed virtqueue for vhost-user devices The layout of packed virtqueue isn't supported by `Endpoint::negotiate()`. Communication between device and driver will fail because the virtqueue cannot be parsed if we don't disable the packed feature. This patch fixes this issue. Fixes: #8633 Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-12-12 17:24:20 +08:00
Chao Wu	b079e1aabc	dragonball: add pci root bus and root device In order to follow up the PCI implementation in Dragonball, we need to add PCI root device and root bus support. root device is a pseudo PCI root device to manage accessing to PCI configuration space. root bus is mainly for emulating PCI root bridge and also create the PCI root bus with the given bus ID with the PCI root bridge. fixes: #8563 Signed-off-by: Gerry Liu <gerry@linux.alibaba.com> Signed-off-by: Zizheng Bian <zizheng.bian@linux.alibaba.com> Signed-off-by: Shifang Feng <fengshifang@linux.alibaba.com> Signed-off-by: Yang Su <yang.su@linux.alibaba.com> Signed-off-by: Zha Bin <zhabin@linux.alibaba.com> Signed-off-by: Xin Lin <jingshan@linux.alibaba.com> Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2023-12-12 11:43:14 +08:00
GabyCT	ee74fca92c	Merge pull request #8617 from GabyCT/topic/enabletestnerdctl tests: nerdctl: Enable nerdctl tests for cloud hypervisor runtime-rs	2023-12-11 14:09:58 -06:00
David Esparza	584a26dab0	Merge pull request #8542 from dborquez/metrics_fix_deployment_cleaning metrics: cleans k8s iperf deployment when the test finishes.	2023-12-11 13:14:39 -06:00
Chao Wu	198e4adcb1	Merge pull request #8599 from openanolis/chao/fix_cargo_fmt dragonball: add --all for fmt ci	2023-12-12 00:20:21 +08:00
GabyCT	43410e1918	Merge pull request #8560 from GabyCT/topic/enablek8srs gha: k8s: Add cloud-hypervisor (runtime-rs) support	2023-12-11 09:42:49 -06:00
Hyounggyu Choi	ea2a0dc69d	Merge pull request #7769 from BbolroC/opa-multiarch rootfs: build OPA binary from source for ppc64le and s390x	2023-12-11 15:25:33 +01:00
Chao Wu	52f7a40e4e	dragonball: add --all for fmt ci Right now, the cargo fmt check in Dragonball only tests with the default features, not all features. This leaves some code unchecked by the fmt tool. This PR adds the --all option for the Dragonball CI and also fixes some code that was missed by cargo fmt --all. fixes: #8598 Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2023-12-11 20:54:25 +08:00
Hyounggyu Choi	375c787e09	rootfs: build OPA binary from source for ppc64le and s390x This PR is to build a binary for OPA from source code for ppc64le and s390x. Fixes: #7616 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2023-12-11 12:59:48 +01:00
Hyounggyu Choi	16e2a50d17	Merge pull request #8624 from BbolroC/fix-runtime-class-check-qemu-se GHA: Fix kata-deploy-runtime-classes-check for kata-qemu-se	2023-12-11 12:58:00 +01:00
James O. D. Hunt	2a35541af7	Merge pull request #8592 from jodh-intel/static-checks-try-multiple-user-agents CI: static-checks: Try multiple user agents	2023-12-11 11:52:29 +00:00
Hyounggyu Choi	28c3e0e5f0	GHA: Fix kata-deploy-runtime-classes-check for kata-qemu-se This is to fix an error on kata-deploy-runtime-classes-check for kata-qemu-se. Fixes: #8623 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2023-12-11 10:30:00 +01:00
Hyounggyu Choi	b469dbf92f	Merge pull request #8622 from BbolroC/hotfix-k3s-kubectl-version GHA: Use --client=true for k3s kubectl version	2023-12-11 10:00:16 +01:00
Hyounggyu Choi	40f0c8fbb7	GHA: Use --client=true for k3s kubectl version This is to fix a broken usage for `k3s kubectl version` by switching an option `--short` to `--client=true`. Fixes: #8621 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2023-12-11 08:26:39 +01:00
Chao Wu	df7f416cb8	Merge pull request #8566 from liubogithub/liubo/dev/panic_fix runtime-rs: fix panic when hypervisor mismatches with configuration	2023-12-10 21:33:59 +08:00
Gabriela Cervantes	1662a3e859	common: Add cloud hypervisor in enabling hypervisor function This PR adds the cloud hypervisor in the enabling hypervisor function. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-12-08 21:32:00 +00:00
Chelsea Mafrica	1c42d94550	Merge pull request #6826 from gabevenberg/log-parser-rs kata-ctl: Moved log-parser-rs into kata-ctl	2023-12-08 11:33:09 -08:00
James O. D. Hunt	5d085a3042	CI: static-checks: Try multiple user agents Make the URL checker cycle through a list of user agent values until we hit one the remote server is happy with. This is required since, unfortunately, we really, really want to check these URLs, but some sites block clients based on their `User-Agent` (UA) request header value. And of course, each site is different and can change its behaviour at any time. Our strategy therefore is to try various UA's until we find one the server accepts: - No explicit UA (use `curl`'s default) - Explicitly no UA. - A blank UA. - Partial UA values for various CLI tools. - Partial UA values for various console web browsers. - Partial UA for Emacs's built-in browser. - The existing UA which is used as a "last ditch" attempt where the UA implies multiple platforms and browser. > Notes: > > - The "partial UA" values specify the UA "product" but not the > UA "product version": we specify `foo` and not `foo/1.2.3`. We do > this since most sites tested appear to not care about the version. > This is as expected given that the version is strictly optional (see `[]`). > > - We now log all errors and display an error summary if none of the UAs > worked, in addition to the simple list of the URLs we believe to be > invalid. This should make future debugging simpler. `[]` - https://www.rfc-editor.org/rfc/rfc9110#section-10.1.5 Fixes: #8553. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-12-08 18:02:41 +00:00
James O. D. Hunt	3174c18772	docs: Remove problematic URL Removed the Azure Portal URL (https://portal.azure.com) since this causes problems with our static checks script: that URL returns HTTP 403 ("Forbidden") when queried using command-line tools like `curl(1)`, which is used by the static check script. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-12-08 17:11:20 +00:00
James O. D. Hunt	3779261a99	docs: Fix whitespace Remove some extraneous whitespace. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-12-08 17:11:20 +00:00
James O. D. Hunt	613def0328	CI: static-checks: Move curl to a separate function Split the call to `curl` in the URL checker out into a new `run_url_check_cmd()` function to make `check_url()` slightly clearer. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-12-08 17:11:20 +00:00
James O. D. Hunt	6d859f97ee	CI: static-checks: Lint fixes Declare and then define a couple of variables separately. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-12-08 17:11:20 +00:00
James O. D. Hunt	efa8e6547c	CI: static-checks: Check params have a value Check that the `check_url()` parameters have a value. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-12-08 17:11:20 +00:00
James O. D. Hunt	563ea020b0	CI: static-checks: Fold long line Break up a long line a little to make it easier to read. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-12-08 17:11:20 +00:00
James O. D. Hunt	3ad43df946	CI: static-checks: Improve markdown checker test Only attempt to build the markdown checker if it doesn't already exist. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-12-08 17:11:20 +00:00
Liu Bo	bf97051f11	runtime-rs: fix panic when hypervisor mismatches with configuration If a wrong configuration.toml file is used accidentally, the runtime-rs binary could panic because of unwrap(). This fixes the panic by returning errors instead of calling unwrap(). fixes: #8565 Signed-off-by: Liu Bo <liub.liubo@gmail.com>	2023-12-08 08:56:23 -08:00
Zvonko Kaiser	9d38f01c2f	Merge pull request #8612 from BbolroC/introduce-secret-inheritance-s390x GHA: make secrets inherited for build-kata-static-tarball-s390x	2023-12-08 17:32:47 +01:00
Gabriela Cervantes	f3eeab10ab	tests: nerdctl: Enable nerdctl tests for cloud hypervisor runtime-rs This PR enables the nerdctl tests for cloud hypervisor runtime-rs. Fixes #8616 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-12-08 16:12:36 +00:00
Hyounggyu Choi	636eef8907	GHA: make secrets inherited for build-kata-static-tarball-s390x This is to make GHA secrets inherited for the workflow titled `build-kata-static-tarball-s390x` to configure an environment variable `CI_HKD_PATH` for a `build-asset-boot-image-se` step. Fixes: #8611 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2023-12-08 13:55:45 +01:00
Chao Wu	5054e59ccb	Merge pull request #8429 from adamqqqplay/support-vhost-user-fs dragonball: introduce vhost-user-fs device	2023-12-08 17:20:52 +08:00
Hyounggyu Choi	588f639a69	Merge pull request #6755 from BbolroC/add-se-artifacts-to-main packaging: Add IBM Z SE artifacts to main	2023-12-08 05:17:38 +01:00
Gabe Venberg	69fdd05ce5	kata-ctl: Moved log-parser-rs into kata-ctl Log-parser-rs was always intended to become a sub-functionality of kata-ctl, but it was useful to develop it and initially merge it as a standalone program, and migrate it to a subcommand later. Fixes #6797 Signed-off-by: Gabe Venberg <gabevenberg@gmail.com>	2023-12-07 21:35:28 -06:00
David Esparza	b2577000e7	metrics: Expose iperf3 pods over a k8s networks. A prerequisite for measuring kata network bandwidth is to run the iperf3 tool at the transport layer provided by a k8s service, which exposes a network that clients inside the cluster can use to contact Pods in the service. Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-12-07 18:07:05 -06:00
David Esparza	a062ba166b	metrics: cleans k8s iperf deployment when the test finishes. This PR fixes small issues like: 1. Cleaning up the k8s environment by removing the iperf test implementation even when the test fails. 2. Checks if the workload returned a result before generating an empty results json file as it was being done. 3. Removes the redundancy of calls to functions that process subtests and should compose the results json file only when all results are ready and not before. 4. The tcp service manifest was added to the server deployment which targets TCP port 5201. Fixes: #8534 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-12-07 18:02:39 -06:00
Archana Shinde	a5105b4227	Merge pull request #8582 from amshinde/runtime-rs-tryfrom-blkconfig Implement and use try_from for DiskConfig	2023-12-07 15:02:00 -08:00
Archana Shinde	458e91b289	runtime-rs: Update readme to indicate cloud-hypervisor support Since cloud-hypervisor is no longer built as an optional feature, lets mention cloud-hypervisor in the list of hypervisors supported by runtime-rs. Fixes: #8587 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-12-07 14:59:43 -08:00
GabyCT	0e0a7d9410	Merge pull request #8604 from GabyCT/topic/enablenerdctlrs gha: nerdctl: Enable cloud hypervisor runtime-rs for nerdctl CI	2023-12-07 14:35:26 -06:00
Hyounggyu Choi	3fab1690a4	local-build: make strip support for cross-compilation This is to adjust a name of the binary `strip` to a target architecture for cross-compilation. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2023-12-07 20:05:40 +01:00
Hyounggyu Choi	f38c7f14c5	gha: remove build redundancy of kernel and rootfs-initrd It is to remove the build redundancy of `kernel` and `rootfs-initrd` by making `boot-image-se` built based on them at the second build stage. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2023-12-07 20:05:40 +01:00
Hyounggyu Choi	31db56207b	local-build: add support for key verification for IBM Secure Execution This is to make `build_se_image.sh` incorporate the key verification originally supported by `genprotimg`. It can be achieved by specifying two environment variables called `SIGNING_KEY_CERT_PATH` and `INTERMEDIATE_CA_CERT_PATH`. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2023-12-07 20:05:40 +01:00
Hyounggyu Choi	52bdc87fe9	local-build: make kernel parameters configurable This is to make kernel parameters configurable during the secure image build by adding an environment variable SE_KERNEL_PARAMS. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2023-12-07 20:05:40 +01:00
Hyounggyu Choi	9ceb2c27e0	local-build: consider cross-compilation env This is to make a base builder image build genprotimg without a package manager under the cross-compilation environment. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2023-12-07 20:05:40 +01:00
David Esparza	298be4aa1c	Merge pull request #8594 from GabyCT/topic/updatedockerfilet metrics: Update TensorFlow ResNet FP32 dockerfile	2023-12-07 11:14:48 -06:00
Gabriela Cervantes	ce694b905b	tests: Fix indentation of gha-run script This PR fixes the indentation of gha run script. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-12-07 16:56:19 +00:00
Gabriela Cervantes	33b300431e	tests: Enable but do not run k8s tests for cloud hypervisor This PR enables but does not run k8s tests for cloud hypervisor for runtime-rs. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-12-07 16:39:15 +00:00
Gabriela Cervantes	acee3d8438	gha: k8s: Add cloud-hypervisor (runtime-rs) support This PR adds the Cloud Hypervisor driver, integrated with the runtime-rs, as part of the kubernetes tests. Fixes #8559 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-12-07 16:33:59 +00:00
Gabriela Cervantes	50a5fa9a65	tests: Enable but do not run the nerdctl tests for cloud hypervisor This PR enables but does not run the nerdctl tests for cloud hypervisor runtime-rs until we find out how stable they are. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-12-07 16:29:51 +00:00
Gabriela Cervantes	e70b2ea95d	gha: nerdctl: Enable cloud hypervisor runtime-rs for nerdctl CI This PR enables the cloud hypervisor runtime-rs for the nerdctl gha CI. Fixes #8603 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-12-07 16:24:36 +00:00
Hyounggyu Choi	ad6aab9918	Merge pull request #8601 from BbolroC/conflict-handling-for-self-hosted-runners GHA: remove GITHUB_WORKSPACE when workflow fails due to merge conflict	2023-12-07 12:17:31 +01:00
Hyounggyu Choi	0d5a970e54	GHA: remove GITHUB_WORKSPACE when workflow fails due to merge conflict It is to remove a GITHUB_WORKSPACE directory for self-hosted runners when a workflow fails due to the merge conflict. This will prevent the subsequent workflows from getting stuck in the same situation. Fixes: #8600 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2023-12-07 10:25:57 +01:00
Greg Kurz	501910d743	Merge pull request #8509 from zvonkok/stable-overlay deployment: Add stable overlay for kata-deploy.yaml	2023-12-07 09:43:41 +01:00
Huang Jianan	5629b7454f	dragonball: support vhost-user-fs in device manager This patch implements the virtio-fs device used for filesystem sharing and heavily based on the vhost-user protocol. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com> Signed-off-by: Eryu Guan <eguan@linux.alibaba.com> Signed-off-by: Huang Jianan <jnhuang@linux.alibaba.com> Signed-off-by: Qinqi Qu <quqinqi@linux.alibaba.com>	2023-12-07 11:59:07 +08:00
Archana Shinde	a661ac3a0e	runtime-rs: Implement and use try_from for DiskConfig Implement try_from trait function to convert runtime-rs BlockConfig to cloud-hypervisor DiskConfig. This can allow for code reuse in the future. Fixes: #8581 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-12-06 12:10:34 -08:00
Fabiano Fidêncio	c14e3096c8	Merge pull request #8580 from amshinde/runtime-rs-clh-network-hotplug runtime-rs: add network hotplug for clh	2023-12-06 20:50:04 +01:00
Gabriela Cervantes	56dddab04f	metrics: Update command to run tensorflow resnet fp32 benchmark This PR updates the command needed to run the tensorflow benchmark. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-12-06 17:02:10 +00:00
Gabriela Cervantes	62fdebeeb5	metrics: Update TensorFlow ResNet FP32 dockerfile This PR updates the python version for the TensorFlow ResNet FP32 dockerfile so the benchmark can run without issues. Fixes #8593 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-12-06 16:53:21 +00:00
GabyCT	3d149d3455	Merge pull request #8578 from GabyCT/topic/fixlinkconfig docs: Update config containerd url link	2023-12-06 10:40:29 -06:00
Zvonko Kaiser	16380558e0	deployment: Create a stable overlay for kata-deploy Fixes: #8508 Create a stable overlay for kata-deploy.yaml so we do not have to maintain two files, only one. Single source for both. This is also preparation for the helm-overlay Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-12-06 14:23:22 +00:00
Huang Jianan	2a1fc29e84	dragonball: add unit test for vhost-user-fs Add some test cases for vhost-user-fs function. Signed-off-by: Beiyue <beiyue@linux.alibaba.com> Signed-off-by: Huang Jianan <jnhuang@linux.alibaba.com>	2023-12-06 10:43:24 +08:00
Huang Jianan	d6cfbe9436	dragonball: support vhost-user-fs This patch implements the virtio-fs device used for filesystem sharing and heavily based on the vhost-user protocol. This vhost-user-fs device defines 5 parameters: - path: vhost-user socket path - tag: mount tag used from the guest to mount the filesystem - req_num_queues: number of request virtqueues - queue_size: depth of each virtqueue - cache_size: cache window size for dax This device needs to be defined before the VM instance is started, which can be done through the dbs-cli tool with --fs option: --fs '{ "sock_path":"/path/to/virtiofs.socket", "tag":"myfs", "num_queues":1, "queue_size":1024, "cache_size":0, "thread_pool_size":1, "cache_policy":"auto", "writeback_cache":true, "no_open":true, "xattr":true, "drop_sys_resource":false, "mode":"vhostuser", "fuse_killpriv_v2":true, "no_readdir":false, }' Fixes: #8428 Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com> Signed-off-by: Eryu Guan <eguan@linux.alibaba.com> Signed-off-by: Huang Jianan <jnhuang@linux.alibaba.com>	2023-12-06 10:43:17 +08:00
Archana Shinde	955dec06da	runtime-rs: add network hotplug for clh This is required for clh to work with nerdctl and docker. This fixes the issues seen with nerdctl while starting a container. However, container exit with docker is still broken due to an unrelated issue. Fixes: #8579 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-12-05 15:29:53 -08:00
Fabiano Fidêncio	b056683b7a	Merge pull request #8436 from Lu-Biao/main image-builder: bugfix incorrect partition location	2023-12-06 00:10:06 +01:00
Fabiano Fidêncio	2cd003156e	Merge pull request #8573 from fidencio/topic/gha-add-a-timeout-for-tests gha: basic-ci: Add a timeout for the tests	2023-12-05 22:20:49 +01:00
Fabiano Fidêncio	d149b9f9ca	Merge pull request #7231 from wainersm/measured_rootfs-improvements Build for measured rootfs improvements	2023-12-05 22:20:33 +01:00
Fabiano Fidêncio	f75f17c4ff	Merge pull request #8570 from fidencio/topic/gha-dragonball-enable-some-tests-but-do-not-run-them-yet gha: dragonball: Enable, but do not run, cri-containerd, stability, and devmapper tests	2023-12-05 20:00:24 +01:00
Jeremi Piotrowski	e2c6b8ae6e	Merge pull request #4743 from yuchen0cc/main mount: support checking multiple kinds of block device driver	2023-12-05 18:04:51 +01:00
Gabriela Cervantes	61b868692b	docs: Update config containerd url link This PR updates the config containerd url link in the containerd kata documentation. Fixes #8577 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-12-05 16:35:21 +00:00
Fabiano Fidêncio	05ce52d746	devmapper: dragonball: Enable, but do not run, the tests This will make the life easier for dragonball developers to properly enable the tests once the tests are ready. Fixes: #8569 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-12-05 15:29:23 +01:00
Fabiano Fidêncio	a8a156b1af	stability: dragonball: Enable, but do not run, the tests This will make the life easier for dragonball developers to properly enable the tests once the tests are ready. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-12-05 15:29:23 +01:00
Fabiano Fidêncio	16ad721eda	cri-containerd: dragonball: Enable, but do not run, the tests This will make the life easier for dragonball developers to properly enable the tests once the tests are ready. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-12-05 15:29:23 +01:00
James O. D. Hunt	d9daadf15c	Merge pull request #8558 from jodh-intel/load-config-improvement runtime-rs: Show config files attempted on config load failure	2023-12-05 11:48:42 +00:00
Greg Kurz	1650d02b91	Merge pull request #8516 from Apokleos/vsock-dev move vsock device into device manager	2023-12-05 11:28:37 +01:00
James O. D. Hunt	93c0fc2ad3	Merge pull request #8551 from amshinde/runtime-rs-setns-clh runtime-rs: Launch cloud-hypervisor in given netns	2023-12-05 10:18:34 +00:00
James O. D. Hunt	d627893975	runtime-rs: Show config files attempted on config load failure PR #8483 changed the location of the rust runtime config files to `/etc/kata-containers/runtime-rs/`. However, if you haven't updated your system to create that directory, attempting to create a container using the rust runtime was giving the following cryptic message (formatted for easier reading): ``` failed to handler message try init runtime instance Caused by: 0: load config 1: load toml config 2: entity not found ``` Now, the message is as follows (again, reformatted for easier reading): ``` failed to handle message try init runtime instance Caused by: 0: load config 1: load TOML config failed (tried [ \"/etc/kata-containers/runtime-rs/configuration.toml\", \"/usr/share/defaults/kata-containers/runtime-rs/configuration.toml\", \"/opt/kata/share/defaults/kata-containers/runtime-rs/configuration.toml\" ]) ``` Fixes: #8557. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-12-05 09:10:18 +00:00
James O. D. Hunt	45c0364d4c	runtime-rs: Fix typo in task service "failed to handler message" -> "failed to handle message". Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-12-05 09:10:18 +00:00
Fabiano Fidêncio	a14f2fc180	gha: runk: Fix typo in the test name tracing -> runk Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-12-05 09:44:42 +01:00
Fabiano Fidêncio	1a74142a16	gha: basic-ci: Add a timeout for the tests This will ensure no job will be stuck forever, as we've noticed with a few jobs already. Fixes: #8572 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-12-05 09:42:46 +01:00
GabyCT	e8b28fed2a	Merge pull request #8540 from GabyCT/topic/fixctrdoc docs: Update cri installation url link	2023-12-04 17:36:33 -06:00
Archana Shinde	2df8144cfe	runtime-rs: Launch cloud-hypervisor in given netns Launch cloud-hypervisor binary in the netns provided at the prepare_vm stage. Fixes: #6441 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-12-04 13:02:43 -08:00
Hyounggyu Choi	511dd5feac	local-build: add support to build IBM Z SE image This is to add an artifact for IBM Z SE(TEE) to main. Fixes: #6754 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2023-12-04 21:08:51 +01:00
Hyounggyu Choi	4de8ef3d18	local-build: add build target boot-image-se This is to add a build target boot-image-se for s390x. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2023-12-04 21:08:51 +01:00
Hyounggyu Choi	a63a6959d1	local-build: install s390-tools in Dockerfile This is to install s390-tools including genprotimg during the docker build. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2023-12-04 21:08:51 +01:00
Hyounggyu Choi	6d0dabd81e	gha: build secure image for s390x release This is to add a build target boot-image-se with a host-key-document config for s390x. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2023-12-04 21:08:51 +01:00
Hyounggyu Choi	bb1d4adaa9	config: add SE configuration This is to add SE configuration which is used by kata runtime. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2023-12-04 21:08:49 +01:00
Gabriela Cervantes	2b05029347	docs: Update cri installation url link This PR updates the cri installation url link for the containerd documentation. Fixes #8539 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-12-04 20:07:49 +00:00
Hyounggyu Choi	8de4241d3b	kata-deploy: add kata-qemu-se runtimeclass This is to increase resources for relaxing the limitation of hotplug for SE. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2023-12-04 21:06:53 +01:00
Hyounggyu Choi	9ede2bcd95	local-build: differentiate build targets based on architecture This is to rule out unnecessary build targets for s390x. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2023-12-04 21:06:53 +01:00
GabyCT	1c00a9a6a9	Merge pull request #8524 from GabyCT/topic/addiperfinfo docs: Update iperf3 network documentation	2023-12-04 14:03:30 -06:00
GabyCT	1b204cc3cb	Merge pull request #8550 from GabyCT/topic/enableclhstability gha: Add cloud runtime rs as part of the stability tests	2023-12-04 11:37:58 -06:00
Gabriela Cervantes	dfc07d1c72	gha: stability: Add cloud-hypervisor (runtime-rs) support This PR adds the Cloud Hypervisor driver, integrated with the runtime-rs, as part of the stability tests. Fixes #8462 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-12-04 15:32:29 +00:00
Fabiano Fidêncio	8d7e0f7721	Merge pull request #8556 from fidencio/topic/kernel-add-tdx-guest-driver kernel: Add CONFIG_TDX_GUEST_DRIVER to the tdx.conf	2023-12-04 15:13:57 +01:00
James O. D. Hunt	e4aebb4560	Merge pull request #8549 from jodh-intel/tdx-no-root libs: protection: x86_64: drop root requirement for querying	2023-12-04 13:03:10 +00:00
Chao Wu	1550ee6767	Merge pull request #8480 from openanolis/chao/add_dbs_pci dragonball: init dbs-pci lib with pci bus & pci conf	2023-12-04 18:08:40 +08:00
Fabiano Fidêncio	03c3f4275e	kernel: Add CONFIG_TDX_GUEST_DRIVER to the tdx.conf The driver enables the userspace interface to communicate with the TDX module to request the TDX guest details, like the attestation report. Fixes: #8555 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-12-04 10:25:59 +01:00
Biao Lu	b816dca3ed	image-builder: fix incorrect part start position The 'part_start' of image and dax_image should exactly specify the same location, according to the parted documentation, to exactly specify the location, the units of start and end should use MiB. https://www.gnu.org/software/parted/manual/parted.html#IEC-binary-units Fixes: #8435 Signed-off-by: Biao Lu <biao.lu@intel.com>	2023-12-04 17:20:26 +08:00
Chao Wu	52fd57e49a	Merge pull request #8301 from Apokleos/do-direct-volume runtime-rs: Enhancing DirectVolMount Handling with Patching Support	2023-12-04 16:49:46 +08:00
James O. D. Hunt	7beab11d9e	Merge pull request #8547 from jodh-intel/unbreak-logger libs:logging: Fix logger	2023-12-04 08:38:03 +00:00
alex.lyn	0fabfa336d	runtime-rs: bring support for legacy vsock device. Bring support for legacy vsock and add Vsock to the ResourceConfig enum type, and add the processing flow of the Vsock device to the prepare_before_start_vm function. Fixes: #8474 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-12-04 15:54:51 +08:00
alex.lyn	6c08cf35d5	runtime-rs: Introduce prepare_vm_socket_config to VirtSandbox. Introduce prepare_vm_socket_config to VirtSandbox for vm socket config, including Vsock and Hybrid Vsock. Use the capabilities() trait of the hypervisor to get the vm socket supported in VMM. Fixes: #8474 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-12-04 15:54:50 +08:00
alex.lyn	60f88da5e1	runtime-rs: add Capability of HybridVsockSupport for Hypervisor. Add Cap of HybridVsockSupport for hypervisors CLH and Dragonball which use hybrid-vsock, default for Qemu, which uses legacy vsock. Fixes: #8474 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-12-04 15:54:50 +08:00
alex.lyn	c5178dd258	runtime-rs: Introduce Capability of HybridVsockSupport. Introduce HybridVsock Cap to judge which kind of vm socket will be supported by the Hypervisor. Use `is_hybrid_vsock_supported` to tell if a hypervisor supports hybrid-vsock; if not, it supports legacy vsock. Fixes: #8474 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-12-04 15:54:29 +08:00
James O. D. Hunt	e1caca3e41	kata-ctl: Remove root requirement for "env" Remove the redundant `kata-ctl` `root` check when running the `env` command. This check duplicated the `GuestProtection` check, and that check is now no longer necessary anyway. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-12-01 15:55:45 +00:00
James O. D. Hunt	f05ada592f	libs: protection: x86_64: drop root requirement for querying It is no longer necessary to be `root` to query the guest protection (TDX) on `x86_64` systems, so drop the requirement. > Note: > > This change drops the `nix` `Uid` import required for the `root` check. > But at the same time it adds it for PPC64le since that implementation of > `available_guest_protection()` needs it and it was previously missing. Fixes: #8548. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-12-01 15:55:21 +00:00
Fabiano Fidêncio	852021e416	Merge pull request #8483 from fidencio/topic/move-rust-config-files-to-subdir-based-on-jodh-approach build/kata-deploy: Move rust runtime config files to runtime-rs directory -- based on #8445	2023-12-01 16:22:51 +01:00
James O. D. Hunt	f9f1d3a071	libs:logging: Fix logger PR #8311 inadvertently broke the logging since no log messages below the `Info` level are logged now, regardless of the requested log level. Resolve the issue by storing the requested log level in the `RuntimeComponentLevelFilter` and using that level in the `log()` function, rather than hard-coding `Info` as the default where no entry is found in the `FILTER_RULE` hashmap. Fixes: #8546. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-12-01 12:21:20 +00:00
yuchen.cc	1cd1558a92	mount: support checking multiple kinds of block device driver Device mapper is the only supported block device driver so far, which seems limiting. Kata Containers can work well with other block devices. It is necessary to enhance supporting of multiple kinds of host block device. Fixes #4714 Signed-off-by: yuchen.cc <yuchen.cc@alibaba-inc.com>	2023-12-01 11:59:30 +08:00
Chelsea Mafrica	818b8f93b1	Merge pull request #8288 from cmaf/migrate-static-checks Migrate static checks	2023-11-30 17:44:16 -08:00
Chelsea Mafrica	207a7fef90	Merge pull request #7815 from cmaf/runtime-rs-ch-vsock runtime-rs: Add Hybrid VSOCK device handling for CH	2023-11-30 12:22:36 -08:00
GabyCT	2bd21f7831	Merge pull request #8531 from GabyCT/topic/fixiperfli metrics: Fix iperf parallel bandwidth limit	2023-11-30 13:47:00 -06:00
Chao Wu	b3da71f21e	dragonball: init dbs-pci lib with pci bus & pci conf This commit inits dbs-pci lib for Dragonball to use. It contains several implementations now: 1. PCI configuration space 2. PCI bus More info on the design & behavior of those two features can be found in the README of dbs-pci. fixes: #8479 Signed-off-by: Gerry Liu <gerry@linux.alibaba.com> Signed-off-by: Zizheng Bian <zizheng.bian@linux.alibaba.com> Signed-off-by: Shifang Feng <fengshifang@linux.alibaba.com> Signed-off-by: Yang Su <yang.su@linux.alibaba.com> Signed-off-by: Zha Bin <zhabin@linux.alibaba.com> Signed-off-by: Xin Lin <jingshan@linux.alibaba.com> Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2023-11-30 23:40:26 +08:00
Dan Mihai	38f24c41c0	Merge pull request #8271 from microsoft/danmihai1/exec-test-failure tests: more k8s-exec-rejected debug output	2023-11-30 07:11:01 -08:00
Greg Kurz	48e5596186	Merge pull request #8456 from cheriL/8447/alpine_bash osbuilder: add pkg bash for alpine	2023-11-30 13:43:48 +01:00
Steve Horsman	c6110284d5	Merge pull request #8520 from stevenhorsman/hypervisor-ttrpc runtime: Update hypervisor generated code	2023-11-30 10:01:56 +00:00
Amulya Meka	3d5db65b2e	Merge pull request #8526 from Amulyam24/workflow-ppc gha: fix artefacts build on ppc64le	2023-11-30 15:00:06 +05:30
Fabiano Fidêncio	80fcc56cef	Merge pull request #8528 from fidencio/topic/stop-building-and-shipping-log-parser-rs tools: Stop building / shipping log-parser-rs	2023-11-30 09:14:10 +01:00
Fabiano Fidêncio	9b30d97885	Merge pull request #8533 from fidencio/topic/fix-invalid-cpu-topology-for-tdx Revert "runtime: confidential: Do not set the max_vcpu to cpu"	2023-11-30 09:06:45 +01:00
Amulyam24	6a922f0e37	gha: fix artefacts build on ppc64le Add step in the right place to prepare the runner for the builds/tests. Fixes: #8525 Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2023-11-30 09:50:47 +05:30
soup	811ec07359	osbuilder: add pkg bash for alpine The bash component is required in the guest for debug console to work properly. Fixes: #8447 Signed-off-by: soup <lqh348659137@outlook.com>	2023-11-30 09:42:39 +08:00
Fabiano Fidêncio	f15e16b692	Revert "runtime: confidential: Do not set the max_vcpu to cpu" This reverts commit `b0157ad73a`. ``` commit `b0157ad73a` Refs: 3.3.0-alpha0-124-gb0157ad73 Author: Fabiano Fidêncio <fabiano.fidencio@intel.com> AuthorDate: Fri Aug 11 14:55:11 2023 +0200 Commit: Fabiano Fidêncio <fabiano.fidencio@intel.com> CommitDate: Fri Nov 10 12:58:20 2023 +0100 runtime: confidential: Do not set the max_vcpu to cpu We don't have to do this since we're relying on the `static_sandbox_resource_mgmt` feature, which gives us the correct amount of memory and CPUs to be allocated. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> ``` This commit was removing a requirement that was made previously, but due to the SMP issue we're facing with the QEMU used for TDX (see commit d1b54ede290e95762099fff4e0bcdad10f816126), QEMU will fail to start due to: ``` Invalid CPU topology: product of the hierarchy must match maxcpus: sockets (1) dies (1) * cores (1) * threads (1) != maxcpus (240)" ``` This has no effect on the SEV / SNP workflow and hopefully we'll be able to re-revert this soon enough, when this gets solved on the QEMU side. Last but not least, this is not a "clean" revert as we're using conf.NumVCPUs() instead of conf.NumVCPUs, to ensure we're dealing with uint32. Fixes: #8532 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-11-30 00:41:27 +01:00
Fabiano Fidêncio	1284b4e80d	tools: Stop building / shipping log-parser-rs This is a commit that's a pre-req for #6826, as that PR will merge log-parser-rs into kata-ctl, but that will result in a CI breakage. So, let's deal with the CI changes here, thanks to GHA and our favourite `pull_request_target` event, unblocking that PR to be merged. Fixes: #6797 (not really, but related). Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-11-30 00:32:10 +01:00
Gabriela Cervantes	37633d3cc2	metrics: Fix iperf parallel bandwidth limit This PR fixes the iperf parallel bandwidth limit for the kata metrics CI. Fixes #8530 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-11-29 19:59:45 +00:00
Dan Mihai	96deea52f2	tests: more k8s-exec-rejected debug output Print more information useful for debugging. Also, use a separate YAML file for this test, instead of reusing someone else's file. Fixes: #8270 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2023-11-29 18:05:15 +00:00
stevenhorsman	47b8c3181f	runtime: remote hypervisor updates to ttrpc - Update the remote hypervisor code to match the re-genned code for the ttrpc Hypervisor Service Fixes: #8519 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2023-11-29 18:04:40 +00:00
stevenhorsman	613c75ba8c	runtime: Update hypervisor generated code Update to use ttrpc_out instead of grpc_out Fixes: #8519 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2023-11-29 18:04:40 +00:00
GabyCT	1f1e5377e5	Merge pull request #8497 from GabyCT/topic/removemetricsstratovirt gha: Disable stratovirt for gha metrics	2023-11-29 11:16:53 -06:00
Fabiano Fidêncio	8fd39d11c4	tests: Adapt `enable_hypervisor` to the runtime-rs config location change As the configurations for the runtime-rs based drivers are now placed in a different location than the golang ones, we should adapt this script accordingly. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-11-29 14:51:35 +01:00
Fabiano Fidêncio	38183acbcb	tests: Use `kata-ctl` instead of `kata-runtime` for runtime-rs `kata-ctl` is the tool for runtime-rs, and it should be used instead of `kata-runtime`. `kata-ctl` requires sudo, and that's the reason it's also been added as part of the calls. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-11-29 14:51:35 +01:00
Fabiano Fidêncio	a5a73a11cb	tests: Replace `kata-runtime kata-env` by `kata-runtime env` `kata-runtime env` is an alias for `kata-runtime kata-env`, and calling it with the `env` parameter allows us to easily extend the scripts to use `kata-ctl` instead of `kata-runtime` when dealing with runtime-rs. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-11-29 14:51:31 +01:00
Chelsea Mafrica	05efb23261	tests: update go.mod and go.sum Generate a go.sum file for tests. Fixes #8187 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2023-11-28 17:40:41 -08:00
Fabiano Fidêncio	30acb5a0c0	tests: nydus: Adapt the default config file for runtime-rs based drivers As we've done some changes in the runtime-rs based drivers to install their configuration into a different location, this should also be reflected as part of this test. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-11-28 20:37:59 +01:00
Chelsea Mafrica	6d9cb9325d	tests: update scripts for static checks migration Updates to scripts for static-checks.sh functionality, including common functions location, the move of several common functions to the existing common.bash, adding hadolint and xurls to the versions file, and changes to static checks for running in the main kata containers repo. The changes to the vendor check include searching for existing go.mod files but no other changes to expand the test. Fixes #8187 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2023-11-28 11:13:55 -08:00
Chelsea Mafrica	66f3944b52	tests: move github-labels to main repo Move tool as part of static checks migration. Fixes #8187 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com> Signed-off-by: Derek Lee <derlee@redhat.com> Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com> Signed-off-by: Graham Whaley <graham.whaley@intel.com> Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com> Signed-off-by: Marco Vedovati <mvedovati@suse.com> Signed-off-by: Peng Tao <bergwolf@hyper.sh> Signed-off-by: Shiming Zhang <wzshiming@foxmail.com> Signed-off-by: Snir Sheriber <ssheribe@redhat.com> Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2023-11-28 11:13:55 -08:00
Chelsea Mafrica	7f3c12f1dd	tests: move spell check tool to main repo Move tool as part of static checks migration. Fixes #8187 Signed-off-by: Bo Chen <chen.bo@intel.com> Signed-off-by: Carlos Venegas <jos.c.venegas.munoz@intel.com> Signed-off-by: Chao Wu <chaowu@linux.alibaba.com> Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com> Signed-off-by: Dan Middleton <dan.middleton@intel.com> Signed-off-by: Derek Lee <derlee@redhat.com> Signed-off-by: Eric Ernst <eric.ernst@intel.com> Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com> Signed-off-by: Graham Whaley <graham.whaley@intel.com> Signed-off-by: Hui Zhu <teawater@antfin.com> Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com> Signed-off-by: Jimmy Xu <xjmmyshcn@gmail.com> Signed-off-by: Liu Xiaodong <xiaodong.liu@intel.com> Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com> Signed-off-by: Shiming Zhang <wzshiming@foxmail.com> Signed-off-by: Snir Sheriber <ssheribe@redhat.com> Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2023-11-28 11:13:55 -08:00
Chelsea Mafrica	8ad433d4ad	tests: move markdown check tool to main repo Move the tool as a dependency for static checks migration. Fixes #8187 Signed-off-by: Bin Liu <bin@hyper.sh> Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com> Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com> Signed-off-by: Ganesh Maharaj Mahalingam <ganesh.mahalingam@intel.com> Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com> Signed-off-by: Julio Montes <julio.montes@intel.com>	2023-11-28 11:13:55 -08:00
Chelsea Mafrica	eaa6b1b274	tests: move static checks and dependencies from tests Move static checks scripts and dependencies from tests to kata-containers repo. Fixes #8187 Signed-off-by: Amulyam24 <amulmek1@in.ibm.com> Signed-off-by: Archana Shinde <archana.m.shinde@intel.com> Signed-off-by: Bin Liu <bin@hyper.sh> Signed-off-by: Carlos Venegas <jos.c.venegas.munoz@intel.com> Signed-off-by: Chao Wu <chaowu@linux.alibaba.com> Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com> Signed-off-by: Dan Middleton <dan.middleton@intel.com> Signed-off-by: David Gibson <david@gibson.dropbear.id.au> Signed-off-by: Derek Lee <derlee@redhat.com> Signed-off-by: Dov Murik <dovmurik@linux.ibm.com> Signed-off-by: Eric Ernst <eric_ernst@apple.com> Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Signed-off-by: Fupan Li <fupan.lfp@antgroup.com> Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com> Signed-off-by: Ganesh Maharaj Mahalingam <ganesh.mahalingam@intel.com> Signed-off-by: Graham Whaley <graham.whaley@intel.com> Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com> Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com> Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com> Signed-off-by: Jon Olson <jonolson@google.com> Signed-off-by: Jose Carlos Venegas Munoz <jose.carlos.venegas.munoz@intel.com> Signed-off-by: Julio Montes <julio.montes@intel.com> Signed-off-by: Liu Jiang <gerry@linux.alibaba.com> Signed-off-by: Manohar Castelino <manohar.r.castelino@intel.com> Signed-off-by: Marco Vedovati <mvedovati@suse.com> Signed-off-by: Nitesh Konkar <niteshkonkar@in.ibm.com> Signed-off-by: Peng Tao <bergwolf@gmail.com> Signed-off-by: Salvador Fuentes <salvador.fuentes@intel.com> Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com> Signed-off-by: Shiming Zhang <wzshiming@foxmail.com> Signed-off-by: Snir Sheriber <ssheribe@redhat.com> Signed-off-by: stevenhorsman <steven@uk.ibm.com> Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com> Signed-off-by: Xu Wang <xu@hyper.sh> Signed-off-by: Yang Bo <bo@hyper.sh> Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-11-28 11:13:55 -08:00
Fabiano Fidêncio	61aa84b158	Revert "tests: k8s: Allow passing rust-runtime env var to kata-deploy" This reverts commit `44899d4cdf`, as we've decided to keep both golang and rust runtime installable and usable at the same time. The decision of having both runtimes installable and usable will help users to test and easily catch any possible differences between those runtimes, helping us to get on par with both implementations. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-11-28 18:02:07 +01:00
James O. D. Hunt	158ca17ae7	kata-deploy: Add cloud-hypervisor Now that we have a separate Cloud Hypervisor configuration file for the rust runtime, add it to the kata-deploy. See: https://github.com/kata-containers/kata-containers/pull/8250 Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com> Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-11-28 18:02:06 +01:00
Fabiano Fidêncio	d4e00238ab	kata-deploy: Improve the logic for linking to the rust runtime This change for now doesn't do much, apart from making it easier to expand which runtimes should be linked to the runtime-rs containerd shim binary. Also, this matches the logic used for the config files. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-11-28 18:01:27 +01:00
James O. D. Hunt	fc28deee0e	kata-deploy: Use rust runtime config files in runtime-rs directory Update `kata-deploy` to modify the rust runtime configuration files in their new `runtime-rs/` directory. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-11-28 18:01:25 +01:00
Gabriela Cervantes	9166d0aabb	docs: Update iperf3 network documentation This PR updates the iperf3 network documentation to include the parallel bandwidth. Fixes #8523 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-11-28 15:59:38 +00:00
Wainer dos Santos Moschetta	48bdca4c49	tests/k8s: add k8s-measured-rootfs.bats Implements the following test case: Scenario: Check incorrect hash fails Given I have a version of kata installed that has a kernel with the initramfs built and config with rootfs_verity.scheme=dm-verity rootfs_verity.hash=<incorrect hash of rootfs> set in the kernel_params When I try and create a container a basic pod Then The pod doesn't run And Ideally we'd get a helpful message to indicate why Currently on CI only qemu-tdx is built with measured rootfs support in the kernel, so the test is restricted to that runtimeclass. Fixes #7415 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2023-11-28 11:21:54 -03:00
Wainer dos Santos Moschetta	1eae657b91	tests/k8s: add set_node() to lib.sh Use this new function to set the node where the pod should be scheduled to. Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2023-11-28 11:21:53 -03:00
Wainer dos Santos Moschetta	c6075c8627	tests/k8s: add setup common Bring the setup_common() from CCv0 branch test's integration/kubernetes/confidential/tests_common.sh. It should be used to reduce boilerplate in the setup() of the tests. Unlike the original code, this won't export the `test_start_time` variable as it wouldn't be accurate to grab logs from the worker nodes due to a date/time mismatch between the machine running the tests and the worker node. The function exports the `node` variable which holds the name of a random node which has kata installed. Apart from that, it exports the `node_start_time` which captures the date/time when the test started, relative to the `node`. Tests that should inspect the logs can schedule pods/resources to the `node` and use `node_start_time` as the value reference to grep the logs. Fixes #7590 Co-authored-by: stevenhorsman <steven@uk.ibm.com> Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2023-11-28 11:21:53 -03:00
Wainer dos Santos Moschetta	220a2d9a15	tests/k8s: add assert_logs_contain() to lib.sh Bring the assert_logs_contain() from CCv0 branch tests' integration/kubernetes/confidential/lib.sh. Introduced the print_node_journal() which uses `kubectl debug` to print the systemd's journal of a k8s's node. Fixes #7590 Co-authored-by: stevenhorsman <steven@uk.ibm.com> Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2023-11-28 11:21:53 -03:00
Wainer dos Santos Moschetta	9a9c7a5c6f	tests/k8s: add set_metadata_annotation() to lib.sh This new function allows adding annotations to the metadata section of a YAML configuration file. Co-authored-by: Ryan Savino <ryan.savino@amd.com> Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2023-11-28 11:21:53 -03:00
Wainer dos Santos Moschetta	a13eecf7f3	runtime(-rs): add clean-generated-files target The new clean-generated-files make target allows for removing the generated files (including the configuration.toml files). The tools/packaging/static-build/shim-v2/build.sh script now uses that target to always force the re-generation of those files. Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2023-11-28 11:21:53 -03:00
Wainer dos Santos Moschetta	36ea1b8ee7	tests/k8s: add new_pod_config() to lib.sh Copied the new_pod_config() and pod-config.yaml.in from CCv0 branch tests' integration/kubernetes/confidential/tests_common.sh and fixtures. Unlike the original version, new_pod_config() now gets the runtimeclass by parameter as the RUNTIMECLASS environment variable seems not broadly used on main branch's CI. The pod-config.yaml.in was changed as the diff shows below. In particular the imagePullSecrets was removed to avoid it throwing a warning on the pod's log. ``` --- a/tests/integration/kubernetes/runtimeclass_workloads/pod-config.yaml.in +++ b/tests/integration/kubernetes/runtimeclass_workloads/pod-config.yaml.in @@ -5,12 +5,10 @@ apiVersion: v1 kind: Pod metadata: - name: busybox-cc + name: test-e2e spec: runtimeClassName: $RUNTIMECLASS containers: - - name: nginx + - name: test_container image: $IMAGE - imagePullPolicy: Always - imagePullSecrets: - - name: cococred \ No newline at end of file + imagePullPolicy: Always \ No newline at end of file ``` Co-authored-by: Georgina Kinge <georgina.kinge@ibm.com> Co-authored-by: Megan Wright <Megan.Wright@ibm.com> Co-authored-by: stevenhorsman <steven@uk.ibm.com> Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2023-11-28 11:21:53 -03:00
Wainer dos Santos Moschetta	428daf9ebc	tests/k8s: add utilities functions for the tests The following functions were copied from CCv0's branch test's integration/kubernetes/confidential/lib.sh. I did just small refactorings (shortened their names and delinted shellcheck warnings): - k8s_delete_all_pods_if_any_exists() - k8s_wait_pod_be_ready() - k8s_create_pod() - assert_pod_fail() Co-authored-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Co-authored-by: Georgina Kinge <georgina.kinge@ibm.com> Co-authored-by: Jordan Jackson <jordan.jackson@ibm.com> Co-authored-by: Megan Wright <Megan.Wright@ibm.com> Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com> Co-authored-by: Wang, Arron <arron.wang@intel.com>	2023-11-28 11:21:53 -03:00
Wainer dos Santos Moschetta	ba4f806c30	initramfs: re-wrote devices checking on init.sh Re-wrote the logic of init.sh to follow the rules: * the root device MUST exist always because it will be either mounted or verified (then mounted) * if rootfs verifier is enabled then the hash device MUST exist. Avoid the case where dm-verity is set but the hash device does not exist and so the verification is silently skipped Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2023-11-28 11:21:53 -03:00
Wainer dos Santos Moschetta	72ef82368c	shim-v2: ensure root hash exist when measured rootfs When measured rootfs is enabled, the shim-v2 build should find the guest rootfs hash file; otherwise it might (silently) generate configuration files with an empty hash. Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2023-11-28 11:21:53 -03:00
Wainer dos Santos Moschetta	1465e58854	kernel: ensure initramfs exist when measured rootfs The KATA_BUILD_CC variable plus the existence (or not) of the initramfs were used to determine whether to build the kernel for measured rootfs or not. Currently the variable MEASURED_ROOTFS has been used to trigger the feature build and when it is activated it should expect the initramfs to exist. In other words, this changed the kernel build so that if `MEASURED_ROOTFS=yes` then the initramfs file must exist and be found. Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2023-11-28 11:21:53 -03:00
Wainer dos Santos Moschetta	4dbba5215f	shim-v2: moved measured rootfs logic to its builder Moved the measured rootfs logic from kata-deploy-binaries.sh to the shim-v2's builder script so that the former gets less bloated with component-specific code. Fixes #6674 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2023-11-28 11:21:53 -03:00
Wainer dos Santos Moschetta	34be78df19	kernel: moved measured rootfs logic to its builder Moved the measured rootfs logic from kata-deploy-binaries.sh to the kernel's builder script so that the former gets less bloated with component-specific code. Fixes #6674 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2023-11-28 11:21:53 -03:00
Wainer dos Santos Moschetta	3f16d29593	kernel: measured rootfs as argument to build-kernel.sh By convention the caller of tools/packaging/kernel/build-kernel.sh changes the script behavior by passing arguments, whereas for measured rootfs it has used an environment variable (MEASURED_ROOTFS). This refactors the script so that the caller now must pass the "-m" argument to enable the build of the kernel with measured rootfs support. Fixes #6674 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2023-11-28 11:21:51 -03:00
Fabiano Fidêncio	80860478bf	runtime-rs: Remove the golang config paths As the configuration files are different, we can safely remove those as any new installation of the binary should also bring in the new configurations. This makes things less error-prone in the future, as we're ensuring that the rust runtime will only be reading the rust configuration files. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-11-28 15:16:53 +01:00
James O. D. Hunt	b86ab5aa21	runtime-rs: Update list of config paths to check Update the `DEFAULT_RUNTIME_CONFIGURATIONS` list to include a number of rust runtime specific paths to try to load before checking the "traditional" (golang) runtime configuration paths. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-11-28 15:16:53 +01:00
James O. D. Hunt	89ef464b7c	build: Install rust config files to runtime-rs directory Install the rust runtime configuration files to a `runtime-rs/` directory to distinguish them from the golang config files (which may have a different syntax). The default values mean that the rust config files are now installed to `/opt/kata/share/defaults/kata-containers/runtime-rs/` rather than `/opt/kata/share/defaults/kata-containers/`. See: https://github.com/kata-containers/kata-containers/issues/6020 Fixes: #8444. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-11-28 15:16:53 +01:00
alex.lyn	fe68f25bea	runtime-rs: enhancement of vfio volume. Reimplement vfio volume into direct_volume and do alignment of rawblock/spdk volume. Fixes: #8300 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-11-28 10:08:05 +08:00
alex.lyn	e3fd403126	runtime-rs: enhancement of spdk volume. (1) Add enum DirectVolumeType for direct volumes. (2) Reimplement spdk volume into direct_volume and do alignment of rawblock volume. Fixes: #8300 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-11-28 10:08:05 +08:00
alex.lyn	f973729029	runtime-rs: Enhancing DirectVolMount Handling for current Infra. The current infra (K8S, CSI, CRI, Containerd) for Kata containers is unable to properly handle direct volumes, resulting in the need for workarounds like searching/comparison and then patching up the volume type. This commit reimplements the handling method to support raw block volumes whose backends may be a raw disk or a file in another format. Fixes: #8300 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-11-28 10:08:05 +08:00
alex.lyn	e3becea566	runtime-rs: add support kata/multi-containers sharing one vfio volume. Fixes: #8300 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-11-28 10:07:23 +08:00
Steve Horsman	891f488ee3	Merge pull request #8501 from Amulyam24/containerd-tests gha: add cri-containerd workflow for ppc64le	2023-11-27 17:22:59 +00:00
James O. D. Hunt	45cc417a4e	Merge pull request #8461 from jodh-intel/update-codeowners CODEOWNERS: Expand scope	2023-11-27 15:38:39 +00:00
Fabiano Fidêncio	bb4c51a5e0	Merge pull request #8494 from ChengyuZhu6/kata_virtual_volume runtime: Pass `KataVirtualVolume` to the guest as devices in go runtime	2023-11-27 16:02:28 +01:00
Steve Horsman	bee6fba5c7	Merge pull request #8459 from Amulyam24/workflow-1 github: add workflows for building and publishing kata artefacts on ppc64le	2023-11-27 14:31:20 +00:00
Amulyam24	754aec02c3	gha: add cri-containerd workflow for ppc64le This PR adds workflow to run containerd tests on Power as a part of CI migration. Fixes: #8500 Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2023-11-27 17:58:58 +05:30
alex.lyn	6af0592274	runtime-rs: Add vsock device in device manager. (1) Implement Device Trait for vsock device. (2) add vsock device in device manager. Fixes: #8474 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-11-27 15:23:18 +08:00
alex.lyn	1a6b45d3b7	runtime-rs: Reintroduce Vsock and add it to the DeviceType enum As the vsock device will be used in Qemu or other VMMs, Vsock is reintroduced to the DeviceType enum. Fixes: #8474 Signed-off-by: Pavel Mores <pmores@redhat.com> Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-11-27 15:12:44 +08:00
alex.lyn	e31dbc94a5	runtime-rs: remove vhost_fd from VsockConfig and make it cloneable. Currently encounters difficulty in utilizing the clone operation on VsockConfig due to the implicit management of the vhost fd within the runtime-rs. This responsibility should be delegated to the VMM(especially QEMU) child process, as it's not runtime-rs core responsibilities. We'll remove the member vhost_fd from VsockConfig and make the VsockConfig/VsockDevice Cloneable. Fixes: #8474 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-11-27 15:11:21 +08:00
alex.lyn	eb90962b27	runtime-rs: introduce a new function generate_vhost_vsock_cid. Introduce a new function generate_vhost_vsock_cid to generate a guest CID and set the guest CID for the vsock fd. This commit introduces no functional change; the code is just split out from the previous VsockDevice::new(). Fixes: #8474 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-11-27 15:06:58 +08:00
alex.lyn	b952c5c5ce	runtime-rs: add support kata/multi-containers sharing one spdk volume. Fixes: #8300 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-11-25 21:13:03 +08:00
alex.lyn	17d2d465d1	runtime-rs: re-organize the volumes with adding new direct_volumes. Add a new directory, direct_volumes, containing the spdk, rawblock and vfio volumes. Fixes: #8300 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-11-25 21:04:55 +08:00
alex.lyn	6731466b13	runtime-rs: set a standard NotFound when direct volume path not found. Fixes: #8300 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-11-25 19:51:12 +08:00
alex.lyn	d23867273f	runtime-rs: split the block volume into block and rawblock volume (1) rawblock volume is directvol mount type. (2) block volume is based on the bind mount type. Fixes: #8300 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-11-24 23:30:30 +08:00
Amulyam24	ae2c0c5696	github: add workflows for building and publishing kata artifacts on ppc64le Adds workflows for building kata static tarball and releasing it. Fixes: #8458 Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2023-11-24 15:53:38 +05:30
ChengyuZhu6	5318afe273	runtime: support to create VirtualVolume rootfs storages 1) Creating storage for all `io.katacontainers.volume=` messages in rootFs.Options, and then aggregates all storages into `containerStorages`. 2) Creating storage for other data volumes and push them into `volumeStorages`. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2023-11-23 23:22:55 +08:00
ChengyuZhu6	0b4f7c2ee7	runtime: redefine and add functions to handle VirtualVolume to storage 1) Extract function `handleBlockVolume` to create Storage only. 2) Add functions to handle KataVirtualVolume device and construct corresponding storages. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2023-11-23 23:07:32 +08:00
ChengyuZhu6	bd099fbda9	runtime: extend SharedFile to support multiple storage devices To enhance the construction and administration of `Katavirtualvolume` storages, this commit expands the 'sharedFile' structure to manage both rootfs storages (`containerStorages`), including `Katavirtualvolume`, and other data volume storages (`volumeStorages`). NOTE: `volumeStorages` is intended for future extensions to support Kubernetes data volumes. Currently, `KataVirtualVolume` is exclusively employed for container rootfs, hence only `containerStorages` is actively utilized. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2023-11-23 23:05:14 +08:00
ChengyuZhu6	e4f33ac141	runtime: add functions to create devices in KataVirtualVolume The snapshotter will place `KataVirtualVolume` information into 'rootfs.options' and commence with the prefix 'io.katacontainers.volume='. The purpose of this commit is to transform the encapsulated KataVirtualVolume data into device information. Fixes: #8495 Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com> Co-authored-by: Feng Wang <feng.wang@databricks.com> Co-authored-by: Samuel Ortiz <sameo@linux.intel.com> Co-authored-by: Wedson Almeida Filho <walmeida@microsoft.com>	2023-11-23 23:05:13 +08:00
Dan Mihai	756022787c	Merge pull request #8239 from Sumynwa/sumsharma/fix_configmap_update_propagation runtime: Fix configmap/secrets updates with FS sharing disabled	2023-11-23 06:50:53 -08:00
Chelsea Mafrica	98aa291c9e	runtime-rs: Add Hybrid VSOCK device handling for CH Update cloud hypervisor implementation to allow hybrid vsock device to be handled. Fixes #6692 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2023-11-22 14:42:09 -08:00
Gabriela Cervantes	8839ca93ba	gha: Disable stratovirt for gha metrics This PR disables the stratovirt for gha metrics. Fixes #8496 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-11-22 16:17:31 +00:00
briwan01	231b9dfd9d	runtime-rs/clh: Fix unable to boot container In the case of Cloud Hypervisor running on arm64 architecture, only arm AMBA UART (pl011) is supported as the TTY. Consequently, when enabling Hypervisor debug mode, it's essential to configure the console as "ttyAMA0" rather than "ttyS0". Fixes: #8381 Signed-off-by: briwan01 <brian.wang@arm.com>	2023-11-22 17:52:11 +08:00
GabyCT	358f32e8bb	Merge pull request #8467 from GabyCT/topic/fixresult metrics: Fix result finding in tensorflow benchmark	2023-11-21 13:41:46 -06:00
Fabiano Fidêncio	45a41c3431	Merge pull request #8481 from ChengyuZhu6/guest-kernel kernel: backport erofs patch to 6.1.52 guest kernel	2023-11-21 12:22:24 +01:00
Fabiano Fidêncio	8425c78c91	Merge pull request #8476 from fidencio/topic/gha-pass-rust-runtime-to-kata-deploy tests: k8s: Allow passing rust-runtime env var to kata-deploy	2023-11-21 11:09:01 +01:00
Chao Wu	6a6c3c53b5	Merge pull request #8450 from adamqqqplay/vhost-user-general dragonball: add vhost-user connection management logic	2023-11-21 16:05:17 +08:00
ChengyuZhu6	6de01eacfd	kernel: backport erofs patch to 6.1.52 guest kernel Backport the erofs patch from linux kernel to solve the error #8083 Fixes: #8083 Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com> Co-authored-by: Gao Xiang <hsiangkao@linux.alibaba.com>	2023-11-21 15:22:40 +08:00
Amulyam24	d8a8cc4491	tools: install oras from source on ppc64le Since the release is not yet out for ppc64le, build oras from source and use it. Fixes: #8458 Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2023-11-21 11:38:20 +05:30
Amulyam24	08f3603123	tools: fix static build of qemu and shimv2 on ppc64le - statically linked qemu requires slof.bin to run, hence remove it from blacklist - By default, initrd is used for Power, modify the configuration.toml accordingly Fixes: #8458 Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2023-11-21 11:38:20 +05:30
Alex.Lyn	4fd2914a33	Merge pull request #7932 from Apokleos/wrap-virtiofs-in-dm runtime-rs: bringing virtio-fs device in device-manager	2023-11-21 13:48:15 +08:00
Huang Jianan	a9571398a6	dragonball: add test utils for vhost-user The test utils will be used by the upcoming feature tests: vhost-user-net, vhost-user-blk and vhost-user-fs. Signed-off-by: Beiyue <beiyue@linux.alibaba.com> Signed-off-by: Huang Jianan <jnhuang@linux.alibaba.com>	2023-11-21 09:51:56 +08:00
Qinqi Qu	a6a399d5bc	dragonball: add vhost-user connection management logic The vhost-user connection management logic will be used by the upcoming features: vhost-user-net, vhost-user-blk and vhost-user-fs. Fixes: #8448 Signed-off-by: Liu Jiang <gerry@linux.alibaba.com> Signed-off-by: Qinqi Qu <quqinqi@linux.alibaba.com> Signed-off-by: Huang Jianan <jnhuang@linux.alibaba.com>	2023-11-21 09:51:48 +08:00
Fabiano Fidêncio	9445a967b6	Merge pull request #8471 from ChengyuZhu6/kata-virtual-volume runtime: Introduce `KataVirtualVolume` structure into go runtime	2023-11-20 21:58:27 +01:00
Fabiano Fidêncio	8002de895a	Merge pull request #8439 from fidencio/topic/kata-manager-install-a-given-kata-tarball utils: kata-manager: Allow installing kata from a given tarball	2023-11-20 20:02:25 +01:00
Wainer Moschetta	728565d1e4	Merge pull request #7046 from stevenhorsman/remote-hypervisor-cherry-picks CC: Remote hypervisor merge to main	2023-11-20 15:22:37 -03:00
Chao Wu	5ee8829700	Merge pull request #8451 from openanolis/chao/pci	2023-11-21 00:29:22 +08:00
Fabiano Fidêncio	41f3f6f93e	Merge pull request #8465 from justxuewei/rename-virtio dragonball: Uniform the spelling of Virtio	2023-11-20 16:31:33 +01:00
Hyounggyu Choi	506b127df8	Merge pull request #8478 from BbolroC/set-default-allowed_hypervisor_annotations kata-deploy: Set a default value for ALLOWED_HYPERVISOR_ANNOTATIONS	2023-11-20 15:39:56 +01:00
alex.lyn	fe62e656a7	runtime-rs: Name the ShareFs Mount Option type more accurately Fixes: #7915 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-11-20 20:05:50 +08:00
alex.lyn	856315ff87	runtime-rs: bringing virtio-fs device in device-manager It mainly focus on the two parts: (1) redesign the ShareFsConfig with ShareFsMountConfig The device mount operation must depend on the fact that sharefs device exists, and re-design the structure of SharesFsConfig and move the ShareFsMountConfig into it with Option type, which is to describe the relation between ShareFsConfig and ShareFsMountConfig. (2) move virtiofs into device manager Currently, virtio-fs is still outside of the device manager. To do Enhancement of device manager, it will bring virtio-fs device in device-manager for unified management Fixes: #7915 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-11-20 20:04:47 +08:00
Chao Wu	b3318e59eb	Merge pull request #8332 from Apokleos/bugfix-directvol-multicontainers runtime-rs/bugfix: kata pod with multi-containers sharing one direct volume	2023-11-20 19:37:58 +08:00
Hyounggyu Choi	c489f1f504	kata-deploy: Set a default value for ALLOWED_HYPERVISOR_ANNOTATIONS As a follow-up PR for #8404, this is to set a default value for an environment variable `ALLOWED_HYPERVISOR_ANNOTATIONS`. This will prevent a pod launching without an explicit configuration for the variable from getting into a `CrashLoop` state. Fixes: #8477 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2023-11-20 12:33:34 +01:00
Chao Wu	ee55897827	fmt: refactor in pci & balloon 1. merge hashmap get logic according to Xuewei suggestion. 2. do cargo fmt Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2023-11-20 17:53:51 +08:00
Chao Wu	baf3db9e6e	Dragonball: add PCI bus and PCI interrupt support in mptable Spec In order to support PCI VFIO functionality in Dragonball, we should first add PCI bus and PCI device interrupt information to the Dragonball mptable setup process. This patch adds: 1. pci_legacy_irqs transferred to the setup_mptable function. 2. pci bus support in mptable mem 3. pci interrupt support in mptable mem fixes: #8449 Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2023-11-20 17:53:51 +08:00
Xuewei Niu	c305634b4e	dragonball: Uniform the spelling of Virtio The changes are: - VirtIoError -> VirtioError - VirtIoResult -> VirtioResult - VirtIoDevice -> VirtioDevice Fixes: #8464 Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-11-20 17:00:58 +08:00
Fabiano Fidêncio	44899d4cdf	tests: k8s: Allow passing rust-runtime env var to kata-deploy This will be used for selecting the correct runtimes and runtimeclasses to be deployed with kata-deploy. Fixes: #8475 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-11-20 09:13:05 +01:00
ChengyuZhu6	1353b14e6c	runtime: Add KataVirtualVolume struct in runtime Add the corresponding data structure in the runtime part according to kata-containers/kata-containers/pull/7698. Fixes: #8472 Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2023-11-19 13:30:32 +08:00
Greg Kurz	110574353d	Merge pull request #8345 from beraldoleal/issues/8343 Fixes make check errors	2023-11-17 17:38:29 +01:00
Gabriela Cervantes	37916e7a58	metrics: Fix result finding This PR fixes the result finding for the general throughput for the tensorflow benchmark. Fixes #8466 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-11-17 15:59:51 +00:00
stevenhorsman	ebf9d2725a	kata-deploy: Add remote shim - Add remote to the list of shims in kata-deploy and kata-cleanup Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2023-11-17 13:38:49 +00:00
Fabiano Fidêncio	d5cf169adf	kata-deploy: Add missing kata-remote runtimeclass It's CCv0 specific for now, and it's needed as the Operator is now delegating the runtimeclass creation to the kata-deploy daemonset. Fixes: #7550 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `2df6cb7609`)	2023-11-17 13:34:40 +00:00
Pradipta Banerjee	39e8c84269	runtime: Add support for key annotations to remote hyp In order to support different pod VM instance type via remote hypervisor implementation (cloud-api-adaptor), we need to pass machine_type, default_vcpus and default_memory annotations to cloud-api-adaptor. The cloud-api-adaptor then uses these annotations to spin up the appropriate cloud instance. Reference PR for cloud-api-adaptor https://github.com/confidential-containers/cloud-api-adaptor/pull/1088 Fixes: #7140 Signed-off-by: Pradipta Banerjee <pradipta.banerjee@gmail.com> (based on commit `004f07f076`)	2023-11-17 13:33:27 +00:00
Yohei Ueda	2910e333a8	runtime: Use static resource in remote hypervisor This patch updates the template configuration file for the remote hypervisor to set static_sandbox_resource_mgmt to be true. The remote hypervisor uses the peer pod config to determine the sandbox size, so requires this to be set to true by default. Fixes: #6616 Signed-off-by: Yohei Ueda <yohei@jp.ibm.com> (based on commit `938447803b`)	2023-11-17 13:33:27 +00:00
stevenhorsman	26d56678a9	config: Add initial remote hypervisor config - Remote hypervisor template config - Add annotation enablement for machine_type, default_memory and default_vcpus for flexible instance types Fixes: #6349 Signed-off-by: stevenhorsman <steven@uk.ibm.com> (based on commits `7c9a791d67` and `335a456425`)	2023-11-17 13:33:24 +00:00
stevenhorsman	ad63439a3e	runtime: Update the remote hypervisor config Add the SELinux setting to ensure it is passed through to the remote hypervisor Fixes: #5936 Signed-off-by: stevenhorsman <steven@uk.ibm.com> (based on commit `3ef2fd1784`)	2023-11-17 13:32:52 +00:00
Lei Li	50e0d43dad	runtime: Support privileged containers in peer pod VM This patch fixes the issue of running containers with privileged as true. See the discussion at this URL for the details. https://github.com/confidential-containers/cloud-api-adaptor/issues/111 Signed-off-by: Lei Li <cdlleili@cn.ibm.com> (based on commit `c3e6b66051`)	2023-11-17 13:32:52 +00:00
Yohei Ueda	57d4dd8e57	runtime: Support the remote hypervisor type This patch adds support for the remote hypervisor type. The shim opens a Unix domain socket specified in the config file, and sends ttRPC requests to an external process to control sandbox VMs. Fixes #4482 Co-authored-by: Pradipta Banerjee <pradipta.banerjee@gmail.com> Co-authored-by: stevenhorsman <steven@uk.ibm.com> Signed-off-by: Yohei Ueda <yohei@jp.ibm.com> (based on commit `f9278f22c3`)	2023-11-17 13:32:49 +00:00
Yohei Ueda	8ac9a22097	runtime: Add hypervisor proto to support peer pod VMs This patch adds a protobuf definition of the remote hypervisor type. Signed-off-by: Yohei Ueda <yohei@jp.ibm.com> Co-authored-by: stevenhorsman <steven@uk.ibm.com> (based on commit `150e8aba6d`)	2023-11-17 13:31:09 +00:00
Fabiano Fidêncio	f8322ffad2	Merge pull request #7796 from WenyuanLau/7794/StratoVirt_VMM_support StratoVirt: add support for a lightweight VMM StratoVirt in Kata	2023-11-17 10:53:17 +01:00
Fabiano Fidêncio	d6d9b45007	Merge pull request #7931 from BbolroC/migrate-to-gha-s390x tests\|gha: add containerd and k8s tests for s390x	2023-11-17 10:24:14 +01:00
Sumedh Alok Sharma	4aaf54bdad	runtime: Fix configmap/secrets update propagation with FS sharing disabled This PR fixes update propagation for k8s configmaps, secrets, etc. when filesystem sharing is disabled. The commit introduces the changes below, with some limitations: - creates new timestamped directory in guest - updates the '..data' symlink - creates user visible symlinks to newly created secrets. - Limitation: The older timestamped directory and stale user visible symlinks exist in guest due to missing DELETE api in agent. Fixes: #7398 Signed-off-by: Sumedh Alok Sharma <sumsharma@microsoft.com>	2023-11-17 13:01:23 +05:30
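The three steps listed in the commit above (new timestamped directory, repointed `..data` symlink, user-visible symlinks) can be illustrated with a small local sketch. It is not the Kata runtime or agent code: the function name `publish_update`, the temporary link name `..data_tmp`, and the nanosecond-based directory name are all assumptions made for the example, and it runs on a local directory rather than in a guest. Like the commit, it leaves older timestamped directories in place.

```python
import os
import time
from typing import Dict


def publish_update(volume_dir: str, files: Dict[str, str]) -> str:
    """Write `files` into a fresh timestamped dir and repoint '..data' at it."""
    # 1. Create a new timestamped directory holding the updated payload.
    ts_name = f"..{time.time_ns()}"
    ts_dir = os.path.join(volume_dir, ts_name)
    os.mkdir(ts_dir)
    for name, content in files.items():
        with open(os.path.join(ts_dir, name), "w") as fh:
            fh.write(content)

    # 2. Repoint '..data': build a temporary symlink, then rename it over
    #    the old one so readers never observe a missing link.
    tmp_link = os.path.join(volume_dir, "..data_tmp")
    if os.path.lexists(tmp_link):
        os.remove(tmp_link)
    os.symlink(ts_name, tmp_link)
    os.replace(tmp_link, os.path.join(volume_dir, "..data"))

    # 3. Create user-visible symlinks for any newly added entries.
    for name in files:
        link = os.path.join(volume_dir, name)
        if not os.path.lexists(link):
            os.symlink(os.path.join("..data", name), link)

    # Older timestamped directories are deliberately left behind here,
    # matching the limitation noted in the commit message.
    return ts_name
```

Because the user-visible links point through `..data`, a reader that opens `volume_dir/<name>` sees the new content as soon as step 2 completes.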
Hyounggyu Choi	0c7aa1f307	gha: Set nightly test for s390x to 5 UTC This is to push back the time for the s390x nightly test to 5 a.m. UTC. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2023-11-17 05:47:44 +01:00
Hyounggyu Choi	ffe1ea52cf	tests\|gha: add containerd and k8s tests for s390x As part of the CI migration, this PR is to add workflows for containerd and k8s for s390x. Fixes: #7930 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2023-11-16 18:14:26 +01:00
GabyCT	8586308dcd	Merge pull request #8453 from GabyCT/topic/udpreadme metrics: Add iperf udp information to README	2023-11-16 10:38:56 -06:00
GabyCT	494174a98e	Merge pull request #8421 from GabyCT/topic/enablestressng tests: Enable stressng scalability test	2023-11-16 10:25:05 -06:00
James O. D. Hunt	4a4fc9c648	CODEOWNERS: Expand scope Improve the `CODEOWNERS` file by specifying more groups. Since GitHub automatically checks the `CODEOWNERS` file when a PR is created and adds all matching groups as reviewers for the PR, this may help reduce the PR backlog since the right people will be alerted and requested to review the PR. That should improve the quality of reviews (and thus the quality of the landed code). It may also have a positive effect on PR velocity. > Note: > > This PR combines the other `CODEOWNERS` files so we have > a single, visible, top-level file. See: https://github.com/kata-containers/community/issues/253 Fixes: #3804. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-11-16 16:09:20 +00:00
Fabiano Fidêncio	10996f3bbb	Merge pull request #8460 from ldoktor/artifacts gha: Keep kata tarballs for 15 days	2023-11-16 13:56:25 +01:00
Liu Wenyuan	c77e990c3e	tests: Enable tests for StratoVirt hypervisor This commit enables the StratoVirt hypervisor to be tested in kata GHA, including k8s, metrics, cri-containerd, nydus and so on. It also adds some unit tests for StratoVirt to make sure it works. Fixes: #7794 Signed-off-by: Liu Wenyuan <liuwenyuan9@huawei.com>	2023-11-16 20:47:26 +08:00
Liu Wenyuan	14d8790d83	kata-deploy: Add StratoVirt support to deploy process Allow kata-deploy process to pull StratoVirt from release binaries, and add them as a part of kata release. Fixes: #7794 Signed-off-by: Liu Wenyuan <liuwenyuan9@huawei.com>	2023-11-16 20:47:26 +08:00
Liu Wenyuan	9542211e71	configuration: add configuration for StratoVirt hypervisor. Add configuration-stratovirt.toml.in to generate the StratoVirt configuration, and parser to deliver config to StratoVirt. Fixes: #7794 Signed-off-by: Liu Wenyuan <liuwenyuan9@huawei.com>	2023-11-16 20:47:26 +08:00
Liu Wenyuan	561c85be54	build: Makefile for StratoVirt hypervisor Add support for building StratoVirt hypervisor, including x86_64 and arm64. Fixes: #7794 Signed-off-by: Liu Wenyuan <liuwenyuan9@huawei.com>	2023-11-16 20:47:26 +08:00
Liu Wenyuan	26966c8469	virtcontainers: Add StratoVirt as a supported hypervisor Initial support of the MicroVM machine type of StratoVirt hypervisor for the kata go runtime. Fixes: #7794 Signed-off-by: Liu Wenyuan <liuwenyuan9@huawei.com>	2023-11-16 20:47:24 +08:00
Fabiano Fidêncio	edb791315e	Merge pull request #7987 from BbolroC/nightly-ci-s390x tests\|gha: add nightly tests for s390x	2023-11-16 11:45:32 +01:00
Lukáš Doktor	8959e3ca05	gha: Keep kata tarballs for 15 days these tarballs are useful for debugging and re-running jobs, keep them for 15 days. Fixes: #8000 Signed-off-by: Lukáš Doktor <ldoktor@redhat.com>	2023-11-16 10:35:20 +01:00
Gabriela Cervantes	9cc6908b09	stability: Update stressng to run on the gha This PR updates the stressng test to run on the gha for kata CI. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-11-15 19:34:36 +00:00
Gabriela Cervantes	9d8eb298c3	metrics: Add iperf udp information to README This PR adds the iperf udp information to the network README for the kata metrics CI. Fixes #8452 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-11-15 15:22:06 +00:00
Gabriela Cervantes	4b7854b668	stability: Add missing dependencies This PR adds missing dependencies to run stability tests. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-11-15 14:51:14 +00:00
Gabriela Cervantes	79177bb9cb	tests: Enable stressng scalability test This PR enables the stressng scalability test for kata CI. Fixes #8420 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-11-15 14:51:14 +00:00
Xuewei Niu	f18794d880	Merge pull request #8426 from justxuewei/vhost-rm-virtio-net dragonball: Remove vhost-net dependency on virtio-net	2023-11-15 10:39:27 +08:00
alex.lyn	ba632ba825	runtime-rs: kata with multi-containers sharing one direct volume When multiple containers in a kata pod share one direct volume, it's important to make sure that the corresponding block device is only mounted once in the guest. This means that there should be only one mount entry for the device in the mount information. Fixes: #8328 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-11-15 10:37:01 +08:00
alex.lyn	d7594d830c	runtime-rs: correct the path from cid to device_id. When a direct volume is used by multiple containers in Kata, generating many shared paths with cids causes IO errors, because the one direct volume is mounted more than once. To correct it, use the device_id instead of the cid, which ensures that the guest only mounts the FS once. Fixes: #8328 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-11-15 10:30:39 +08:00
Fabiano Fidêncio	906f6b7380	Merge pull request #8431 from UiPath/fix-vsock-packets-drop kernel: Fix vsock packets drop when the driver initializes	2023-11-14 18:52:53 +01:00
Fabiano Fidêncio	1699b84f13	utils: kata-manager: Remove $enable_debug from the install_kata call This was added as part of `d4d65bed38`, but install_kata has never actually used the passed enable_debug var. With this in mind, let's just remove it. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-11-14 17:34:03 +01:00
Fabiano Fidêncio	38d2edd83b	utils: kata-manager: Allow installing kata from a given tarball With this change, we give the users the chance to try kata-containers with their own pre-built tarball. This will become very useful in the CI context, as we won't be downloading a specific version of kata-containers, but rather installing whatever was built in previous steps of the CI pipeline. Fixes: #8438 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-11-14 17:34:01 +01:00
Fabiano Fidêncio	fd9b6d6837	Merge pull request #7623 from fidencio/topic/runtime-improve-vcpu-allocation-on-host-side runtime: Improve vCPU allocation for the VMMs	2023-11-14 14:10:54 +01:00
Alexandru Matei	bfd1ce30e1	kernel: Fix vsock packets drop when the vsock driver starts The virtio vsock driver has a small window during initialization where it can silently drop replies to connection requests. Because no reply is sent, kata waits for 10 seconds and in the end it generates a connection timeout error in HybridVSockDialer. Fixes: #8291 Signed-off-by: Alexandru Matei <alexandru.matei@uipath.com>	2023-11-14 11:02:52 +02:00
Xuewei Niu	49c2e6e23c	dragonball: Remove vhost-net dependency on virtio-net This patch removes the vhost-net dependency on virtio-net for the dbs-virtio-devices crate. The vhost-net feature can then be enabled without also enabling the virtio-net device, error, etc. Fixes: #8423 Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-11-14 15:35:10 +08:00
Fabiano Fidêncio	dffc6f611c	Merge pull request #8432 from justxuewei/rm-ci-docker-and-nerdctl gha: Remove docker and nerdctl tests from ci.yaml	2023-11-14 08:34:18 +01:00
alex.lyn	4d65c2e8a2	runtime-rs: introduce `update_device` in trait Hypervisor Introduce the `update_device` trait in Hypervisor to enable device updates for VMMs. This trait will initially be utilized for virtiofs Mount operations. Fixes: #7915 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-11-14 11:56:36 +08:00
Xuewei Niu	481486c6d5	gha: Remove docker and nerdctl tests from CI Two workflows, run-nerdctl-tests-on-garm.yaml and run-docker-tests-on-garm.yaml, are removed from commit `b481d39`. However, they are referenced by CI workflow. It leads to the CI not working properly. This patch is to remove those files from ci.yaml. Fixes: #8433 Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-11-14 10:44:14 +08:00
Fabiano Fidêncio	c858ea1460	Merge pull request #8174 from fidencio/topic/re-revert-8115 ci: Re-add tracing tests and move docker/nerdctl to the basic-ci-amd64.yaml file	2023-11-13 18:19:40 +01:00
James O. D. Hunt	a781ce33b0	Merge pull request #8383 from jodh-intel/kata-manager-add-list-option utils: kata-manager: Add option to list versions	2023-11-13 16:18:36 +00:00
David Esparza	98ec34b04c	Merge pull request #8338 from dborquez/improve_metrics_init_environment metrics: Fix function that completely stops kata containers before running a test	2023-11-13 09:35:27 -06:00
Fabiano Fidêncio	b481d396fc	gha: Move docker / nerdctl content to the basic-ci-amd64 file There's no need to keep those as separate files, and having those in the basic-ci-amd64.yaml file actually helps us to avoid the undocumented GHA limitation on the number of files imported. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-11-13 15:34:00 +01:00
Fabiano Fidêncio	3c735c236d	ci: tracing: Adapt to basic-ci-amd64.yaml Peng Tao made this move as part of `1280f85343`, and here we're simply adjusting to the move. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-11-13 15:27:39 +01:00
Fabiano Fidêncio	ee17fe9d20	Revert "gha: ci: Revert tracing test PR to unbreak CI" This reverts commit `e9bd852113`.	2023-11-13 15:27:39 +01:00
James O. D. Hunt	4d5b23b73a	Merge pull request #8419 from jodh-intel/2023-11-10-fix-tdx runtime-rs: ch: Fix TDX	2023-11-13 11:58:16 +00:00
James O. D. Hunt	7f666f783d	runtime-rs: ch: Fix TDX PR #8311 inadvertently broke the runtime-rs / Cloud Hypervisor TDX handling. It also introduced unrecoverable failure scenarios. Hence, replace slow, fallible regex matching in logging fast path with single pass non-failing multi-string log level matching. Also, added a unit test for `parse_ch_log_level()`. Fixes: #8418. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-11-13 08:49:47 +00:00
Xuewei Niu	0a9125e629	Merge pull request #7675 from justxuewei/vhost-net	2023-11-12 20:38:18 +08:00
Xuewei Niu	d1deaf0538	dragonball: Minor changes for a comment from Bian - Add feature control for InsertNetworkDevice. Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-11-12 14:14:10 +08:00
Xuewei Niu	e4f83e27c4	dragonball: vhost-net set_offload with acked features set_offload() for tap devices depends on acked features. Signed-off-by: Helin Guo <helinguo@linux.alibaba.com> Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-11-12 14:10:39 +08:00
Xuewei Niu	6cd572dbbb	dragonball: Minor changes for Chao's comments - Remove two panic statements from InsertNetworkDevice test. - Rename `NUM_QUEUES` to `DEFAULT_NUM_QUEUES`, `QUEUE_SIZE` to `DEFAULT_QUEUE_SIZE` for vhost-net and virtio-net. Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-11-12 14:10:39 +08:00
Xuewei Niu	dcdf3c6556	runtime-rs: Supply missing fields of NetworkConfig `test_networkconfig_to_netconfig` from clh depends on `NetworkConfig` which has some new fields in this PR. Therefore, this commit gives the test missing fields. Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-11-12 14:10:39 +08:00
Xuewei Niu	58e9709c1f	dragonball: Changes for ZizhengBian's comments - Dragonball's vhost-net feature does not depend on the virtio-net feature. - Remove `TapError` from dbs-virtio-devices's Error, and add two fields, `VirtioNet` and `VhostNet`. - Downgrade visibility of two fields of `VhostNetDeviceMgr` from `pub(crate)`. - File an issue to record a todo for network rate limiter. - Print internal errors with `{0:?}`. Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-11-12 14:10:33 +08:00
Fabiano Fidêncio	849253e55c	tests: Add a simple test to check the VMM vcpu allocation As we've done some changes in the VMM vcpu allocation, let's introduce basic tests to make sure that we're getting the expected behaviour. The test consists in checking 3 scenarios: * default_vcpus = 0 \| no limits set * this should allocate 1 vcpu * default_vcpus = 0.75 \| limits set to 0.25 * this should allocate 1 vcpu * default_vcpus = 0.75 \| limits set to 1.2 * this should allocate 2 vcpus The tests are very basic, but they do ensure we're rounding things up to what the new logic is supposed to do. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-11-10 18:26:01 +01:00
Fabiano Fidêncio	5e9cf75937	vc: utils: Rename CalculateMilliCPUs() to CalculateCPUsF() With the change done in the last commit, instead of calculating milli cpus, we're actually converting the CPUs to a fraction number, a float. Let's update the function name (and associated vars) to represent that change. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-11-10 18:26:01 +01:00
Fabiano Fidêncio	e477ed0e86	runtime: Improve vCPU allocation for the VMMs First of all, this is a controversial piece, and I know that. In this commit we're trying to take a less greedy approach regarding the amount of vCPUs we allocate for the VMM, which will be advantageous mainly when using the `static_sandbox_resource_mgmt` feature, which is used by the confidential guests. The current approach we have basically does: * Gets the amount of vCPUs set in the config (an integer) * Gets the amount of vCPUs set as limit (an integer) * Sum those up * Starts / Updates the VMM to use that total amount of vCPUs The fact we're dealing with integers is logical, as we cannot request 500m vCPUs to the VMMs. However, it leads us to, in several cases, be wasting one vCPU. Let's take the example that we know the VMM requires 500m vCPUs to be running, and the workload sets 250m vCPUs as a resource limit. In that case, we'd do: * Gets the amount of vCPUs set in the config: 1 * Gets the amount of vCPUs set as limit: ceil(0.25) * 1 + ceil(0.25) = 1 + 1 = 2 vCPUs * Starts / Updates the VMM to use 2 vCPUs With the logic changed here, what we're doing is considering everything as float till just before we start / update the VMM. So, the flow described above would be: * Gets the amount of vCPUs set in the config: 0.5 * Gets the amount of vCPUs set as limit: 0.25 * ceil(0.5 + 0.25) = 1 vCPU * Starts / Updates the VMM to use 1 vCPU In the way I've written this patch we introduce zero regressions, as the default values set are still the same, and those will only be changed for the TEE use cases (although I can see firecracker, or any other user of `static_sandbox_resource_mgmt=true` taking advantage of this). There's, though, an implicit assumption in this patch that we'd need to make explicit, and that's that the default_vcpus / default_memory is the amount of vcpus / memory required by the VMM, and absolutely nothing else.
Also, the amount set there should be reflected in the podOverhead for the specific runtime class. One other possible approach, which I am not that much in favour of taking as I think it's less clear, is that we could actually get the podOverhead amount, subtract it from the default_vcpus (treating the result as a float), then sum up what the user set as limit (as a float), and finally ceil the result. It could work, but IMHO this is less clear, and less explicit on what we're actually doing, and how the default_vcpus / default_memory should be used. Fixes: #6909 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Signed-off-by: Christophe de Dinechin <dinechin@redhat.com>	2023-11-10 18:25:57 +01:00
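The arithmetic in the vCPU allocation commit above can be checked with a short sketch. This is not the runtime's Go code; the function names `vcpus_old` and `vcpus_new` are hypothetical and only mirror the two flows the commit message describes.

```python
import math


def vcpus_old(config_vcpus: float, limit_vcpus: float) -> int:
    """Old flow: each value is rounded up to an integer, then summed."""
    return math.ceil(config_vcpus) + math.ceil(limit_vcpus)


def vcpus_new(config_vcpus: float, limit_vcpus: float) -> int:
    """New flow: keep the fractions and round up once, just before the VMM call."""
    return math.ceil(config_vcpus + limit_vcpus)


# The example from the commit message: the VMM needs 500m, the workload limit is 250m.
print(vcpus_old(0.5, 0.25))  # 2
print(vcpus_new(0.5, 0.25))  # 1
```

With integer defaults (for instance 1 vCPU in the config and a 1 vCPU limit) both flows give the same result, which matches the commit's claim that the default values do not regress.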
Fabiano Fidêncio	8d958b8c47	Merge pull request #8406 from microsoft/danmihai1/policy-doc docs: add agent policy documentation	2023-11-10 17:19:04 +01:00
James O. D. Hunt	f588d31324	Merge pull request #8374 from jodh-intel/kata-manager-check-dl-url-count utils: kata-manager: Ensure only one download URL	2023-11-10 13:19:07 +00:00
Fabiano Fidêncio	b0157ad73a	runtime: confidential: Do not set the max_vcpu to cpu We don't have to do this since we're relying on the `static_sandbox_resource_mgmt` feature, which gives us the correct amount of memory and CPUs to be allocated. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-11-10 12:58:20 +01:00
Steve Horsman	b23952c852	Merge pull request #8309 from gkurz/update-release-process-doc Update release process documentation	2023-11-10 09:44:18 +00:00
James O. D. Hunt	0ead018d0a	utils: kata-manager: Add Docker details to list output Add Docker version details to the output of the list versions CLI option. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-11-10 09:19:56 +00:00
James O. D. Hunt	be3044fd01	utils: kata-manager: Add option to list versions Add a command-line option to list the installed and available versions of Kata and containerd. Fixes: #8355. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-11-10 09:19:56 +00:00
James O. D. Hunt	9969f5a94a	utils: kata-manager: Make test container name more unique Rather than creating a container called `test-kata`, prefix with the script name to make it a bit "more unique" and less likely for users to have an existing container with the test container name. The new test container name is `kata-manager-sh-test-kata`. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-11-10 09:19:56 +00:00
James O. D. Hunt	436d7d1275	utils: kata-manager: Improve usage message Update the usage to show that the latest Kata version can also be queried using `kata-ctl`. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-11-10 08:29:14 +00:00
James O. D. Hunt	1625a5ce48	utils: kata-manager: Improve version check Update `github_get_latest_release()` to use `sort -V` rather than sub-sorting on the major, minor and patch level version number elements. The new approach is safer and more accurate. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-11-10 08:29:14 +00:00
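A small sketch of why a version-aware sort matters for picking the latest release. It is written in Python rather than the script's shell, handles only plain `x.y.z` strings (GNU `sort -V` copes with more shapes), and the helper name `latest` and the sample versions are made up for illustration.

```python
def latest(versions: list[str]) -> str:
    """Return the highest x.y.z version, comparing each element numerically."""
    return max(versions, key=lambda v: tuple(int(part) for part in v.split(".")))


versions = ["3.2.0", "3.10.0", "3.9.1"]

# A plain string sort compares character by character, so "3.10.0" sorts first.
print(sorted(versions)[-1])  # 3.9.1
print(latest(versions))      # 3.10.0
```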
James O. D. Hunt	c72a27e219	utils: kata-manager: Ensure only one download URL Add an extra sanity check to ensure that only a single download URL is found for the specified release version. Fixes: #8364. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-11-10 08:27:23 +00:00
James O. D. Hunt	839f6c3d44	utils: kata-manager: Improve info messages Improve some of the information messages a little by adding more detail and quoting file names. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-11-10 08:27:20 +00:00
Archana Shinde	21e45bebc8	Merge pull request #8376 from fidencio/topic/kata-manager-add-support-for-docker-installation kata-manager: Add support for Docker CLI installation	2023-11-09 22:11:50 -08:00
Chao Wu	a62fb83c91	Merge pull request #8169 from openanolis/chao/fix_typo_shm runtime-rs: fix a typo in shm	2023-11-10 14:00:11 +08:00
Chao Wu	820b578aa3	Merge pull request #8370 from gaohuatao-1/bugfix agent: update AGENT_THREADS metrics value	2023-11-10 13:16:29 +08:00
gaohuatao	78df1bb851	agent: update AGENT_THREADS metrics value Fixes: #8369 Signed-off-by: gaohuatao <gaohuatao@bytedance.com>	2023-11-10 10:39:57 +08:00
Chao Wu	afb002c25c	runtime-rs: fix a typo in shm is_shim_volume should be is_shm_volume in shm_volume mod. fixes: #8168 Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2023-11-10 10:36:58 +08:00
Fabiano Fidêncio	2b937400fe	Merge pull request #8404 from fidencio/topic/kata-deploy-allow-users-to-enable-hypervisor-annotations kata-deploy: Allow users to set hypervisor annotations	2023-11-09 17:44:52 +01:00
Dan Mihai	bc49c553ef	docs: add agent policy documentation Add initial agent policy documentation. Fixes: #7671 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2023-11-09 16:43:00 +00:00
Fabiano Fidêncio	5d10aed9ba	kata-manager: Make containerd_config a global var As "/etc/containerd/config.toml" is used from more than one place, let's just make it a global var. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-11-09 13:47:52 +01:00
Fabiano Fidêncio	66d1b2c173	kata-manager: Add support for docker installation Add support for also installing the Docker CLI, giving users the chance to try Kata Containers with docker in the same way we provide users the chance to try Kata Containers with `ctr`. Fixes: #8357 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-11-09 13:47:52 +01:00
Fabiano Fidêncio	1a81989d20	tests: k8s: Use the "ALLOWED_HYPERVISOR_ANNOTATIONS" The current kata-deploy code has been doing a `sed` to add allowed hypervisor annotations, so CBL mariner can be tested with their own kernel and initrd. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-11-09 13:42:31 +01:00
Fabiano Fidêncio	023c4a17cf	kata-deploy: Allow users to set hypervisor annotations Currently the only way one can specify allowed hypervisor annotations is during build time, which is a big issue for users grabbing kata-deploy as we provide. Fixes: #8403 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-11-09 13:42:31 +01:00
Fabiano Fidêncio	0352f1e029	kata-manager: Allow passing a specific tool to test_installation Right now we're only testing with `ctr` and there's no change in behaviour with this commit. However, allowing to pass a tool to run the tests with gives us an easier time when expanding kata-manager to support, for instance, docker and nerdctl. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-11-09 11:24:37 +01:00
Fabiano Fidêncio	50df1129ea	Merge pull request #8411 from fidencio/topic/fix-k3s-deployment gha: Fix regex used to get kubectl version from the k3s version	2023-11-09 10:44:34 +01:00
Fabiano Fidêncio	455b7bf776	gha: k3s: Avoid unnecessary escape There's no reason to escape the first + on the +k3s[0-9]\+ regex, as shown here: ```sh ubuntu@k3s:~$ /usr/local/bin/k3s kubectl version --short 2>/dev/null | \ grep "Client Version" | \ sed \ -e 's/Client Version: //' \ -e 's/+k3s[0-9]\+//' v1.27.7 ``` Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-11-09 08:42:25 +01:00
Fabiano Fidêncio	e7890ee8f6	gha: Fix regex used to get kubectl version from the k3s version It seems that with the new k3s release, they've bumped their kubectl version from x.y.z+k3s1 to x.y.z+k3s2. Let's ensure our regexp is more generic and future proof for such changes. Fixes: #8410 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-11-09 07:08:02 +01:00
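The two k3s commits above make the suffix match generic (`+k3s1`, `+k3s2`, and so on). A rough Python equivalent of that `sed` expression is sketched below; the function name `kubectl_version` is hypothetical. Note one dialect difference: in the `sed` basic regex the first `+` is a literal and `\+` means "one or more", while in Python's `re` the literal plus has to be escaped and a bare `+` is the quantifier.

```python
import re


def kubectl_version(k3s_client_version: str) -> str:
    """Strip a "+k3s<N>" suffix, whatever N is, from a k3s client version."""
    return re.sub(r"\+k3s[0-9]+", "", k3s_client_version)


print(kubectl_version("v1.27.7+k3s1"))  # v1.27.7
print(kubectl_version("v1.27.7+k3s2"))  # v1.27.7
```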
Archana Shinde	1611723465	Merge pull request #8379 from likebreath/1103/clh_v36.0 Upgrade to Cloud Hypervisor v36.0	2023-11-08 21:10:41 -08:00
Archana Shinde	268d4d622f	Merge pull request #8389 from justxuewei/vm-capable-test runtime: Fix TestCheckHostIsVMContainerCapable instability issue	2023-11-08 12:14:04 -08:00
Archana Shinde	92a517156c	Merge pull request #8367 from amshinde/add-nerdctl-ipvlan-test network: Fix network hotplug for ipvlan and macvlan endpoints for qemu and add tests	2023-11-08 11:45:13 -08:00
Chelsea Mafrica	83e731328f	Merge pull request #8023 from cmaf/runtime-rs-ch-pause-resume runtime-rs: Update status for pause and resume	2023-11-08 11:34:47 -08:00
Hyounggyu Choi	84b5618733	tests|gha: add internal nightly tests for s390x This is to add a workflow for internal nightly tests for s390x in Jenkins. Fixes: #7986 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2023-11-08 16:07:41 +01:00
Xuewei Niu	acd9057c7b	runtime: Fix TestCheckHostIsVMContainerCapable instability issue TestCheckHostIsVMContainerCapable removes sysModuleDir to simulate the case where the kernel modules are not loaded. However, checkKernelModules() executes modprobe <module> if a module is not found in that directory. Loading those modules must be temporarily denied. Fixes: #8390 Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-11-08 22:40:08 +08:00
Fupan Li	100a73d2fd	Merge pull request #7531 from justxuewei/device-cgroup agent: Restrict device access at upper node of container's cgroup	2023-11-08 22:01:48 +08:00
Chao Wu	4435c1efd7	Merge pull request #8386 from jodh-intel/runtime-rs-ch-tidy-up runtime-rs: ch: Simplify VSOCK error handling	2023-11-08 17:31:40 +08:00
Xuewei Niu	023d8dc01e	agent: Changes according to Pan's comments - Disable device cgroup restriction while pod cgroup is not available. - Remove blacklist-related names and change whitelist-related names to allowed_all. Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-11-08 09:39:08 +08:00
Xuewei Niu	136fb76222	tests: Add an integration test for device cgroup `TestDeviceCgroup` is added to cri-containerd's integration tests. The test launches two containers, each with a block device, and checks the validity of the device cgroup. Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-11-08 09:39:07 +08:00
Xuewei Niu	b5f3a8cb39	agent: Fix container launching failure with systemd cgroup The FSManager of the systemd cgroup manager is responsible for setting up the cgroup path. Container launch fails if the FSManager is in read-only mode. Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-11-08 09:39:07 +08:00
Xuewei Niu	6477825195	agent: Minor changes according to Zhou's comments The changes include: - Change to debug logging level for resources after they are processed. - Remove a todo for pod cgroup cleanup. - Add an anyhow context to `get_paths_and_mounts()`. - Remove code which denies access to VMROOTFS since it won't take effect. If blacklist mode is in use, the VMROOTFS will be denied by default. Otherwise, device cgroups won't be updated in whitelist mode. - Add a unit test for `default_allowed_devices()`. Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-11-08 09:39:07 +08:00
Xuewei Niu	cec8044744	agent: Make devcg_info optional for LinuxContainer::new() runk is a standard OCI runtime that isn't aware of the concept of a sandbox. Therefore, the `devcg_info` argument of `LinuxContainer::new()` does not need to be provided. Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-11-08 09:39:07 +08:00
Xuewei Niu	ef4c3844a3	agent: Restrict device access at upper node of container's cgroup The goal is to guarantee that containers cannot escape to access extra devices, such as the VM rootfs. Assume that there is a cgroup such as `/A/B`, where `B` is the container cgroup and `A` is what we call the pod cgroup. No matter what permissions are set for the container (`B`), the permission of `A` is always `a : rwm`. As a result, containers could acquire permission to access other devices in the VM that do not belong to them. In order to set the devices cgroup properly, the pod cgroup is set first and the container cgroup after it. The `Sandbox` has a new field, `devcg_info`, to save cgroup states. To avoid setting the container cgroup too early, initialization must be done carefully. `inited`, one of the states, is a boolean that indicates whether the pod cgroup is initialized. If it is not, the pod cgroup is created first and given default permissions. After that, the pause container cgroup is created and inherits the permissions from the pod cgroup. If whitelist mode, which allows containers to access all devices in the VM, is enabled, then device resources from the OCI spec are ignored. This feature does not support systemd cgroup and cgroup v2, since: - The systemd cgroup implemented in the Agent doesn't support the devices subsystem so far, see: https://github.com/kata-containers/kata-containers/issues/7506. - Cgroup v2's device controller depends on eBPF programs, which is out of scope of cgroup. Fixes: #7507 Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-11-08 09:39:07 +08:00
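The motivation in the commit above rests on the cgroup v1 devices rule that a child cgroup can never hold more access than its parent. The sketch below is a deliberately simplified model of that rule, not the agent's Rust code: devices are plain path strings, `ALL` stands in for the `a : rwm` rule, and the device names and the `grantable` helper are hypothetical.

```python
ALL = None  # stands in for the "a : rwm" rule (every device allowed)


def grantable(pod_allowed, container_requested):
    """Devices a container cgroup can end up with, bounded by its pod cgroup."""
    if pod_allowed is ALL:
        # An unrestricted pod cgroup puts no upper bound on the container.
        return set(container_requested)
    return set(container_requested) & set(pod_allowed)


vm_rootfs = "/dev/vda"  # hypothetical name for the VM rootfs device
data_disk = "/dev/vdb"  # hypothetical device that belongs to the container

# Before the change: the pod cgroup allows everything, so a request that
# includes the VM rootfs is not clipped at the pod level.
print(sorted(grantable(ALL, {vm_rootfs, data_disk})))          # ['/dev/vda', '/dev/vdb']

# After the change: the pod cgroup is restricted first, so the same request
# can no longer reach the VM rootfs.
print(sorted(grantable({data_disk}, {vm_rootfs, data_disk})))  # ['/dev/vdb']
```

In the real controller a write that exceeds the parent's access is rejected rather than silently clipped; the intersection here only shows the resulting upper bound.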
Archana Shinde	c075fa6817	tests: Add test with nerdctl to verify macvlan support Add test to verify kata supports macvlan networks. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-11-07 10:13:51 -08:00
Archana Shinde	07db673eb9	tests: Add test with nerdctl to verify ipvlan support Add test to verify kata supports ipvlan networks. This test can be bit tricky as it requires knowledge about host interfaces to be used as a master for the ipvlan network. However, with github actions, we can assume interface called eth0 to be present on the host and functioning. Fixes: #8366 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-11-07 10:13:51 -08:00
Archana Shinde	a6272733e7	network: Fix network hotplug for ipvlan and macvlan endpoints. Since moving from network coldplug to hotplug, the only case verified was veth endpoints. Support for network hotplug for ipvlan and macvlan was broken/not added. Fix it. Fixes: #8391 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-11-07 10:13:51 -08:00
James O. D. Hunt	59d0d4caff	runtime-rs: ch: Simplify VSOCK error handling Remove the redundant `VmConfigError::EmptyVsockSocketPath` error from the Cloud Hypervisor config crate since this scenario is already handled by the `VsockConfigError::NoVsockSocketPath` error. Fixes: #8385. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-11-07 17:45:38 +00:00
James O. D. Hunt	bdb83f8282	runtime-rs: ch: Remove unused function Remove the redundant `parse_mac()` function: this was never used and we already have an implementation in `crates/resource/src/network/utils/mod.rs`. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-11-07 17:45:38 +00:00
Wainer Moschetta	949ac4d810	Merge pull request #8217 from beraldoleal/issues/8216 tests: fixes permission denied when running test	2023-11-07 12:25:23 -03:00
Wainer Moschetta	7f5d70f48b	Merge pull request #8061 from beraldoleal/gogo-removal-v3 Updating containerd to a GogoProtobuf free version	2023-11-07 12:18:50 -03:00
Xuewei Niu	8ea87405ed	runtime-rs: Remove virtio config from Backend Virtio-net and vhost-net share a common virtio config, and vhost-user-net uses another config, named `VhostUserConfig`. Thus, the virtio config could be added into `NetworkConfig` instead of `Backend`. Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-11-07 19:35:02 +08:00
Xuewei Niu	ad66378bf5	runtime-rs: Move Dragonball stuff out of device drivers Move Dragonball struct conversions out of the device drivers to keep the drivers neutral. The conversions include `NetworkBackend` to `DragonballNetworkBackend` and `NetworkConfig` to `DragonballNetworkConfig`. Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-11-07 19:35:02 +08:00
Xuewei Niu	3e0614cdf0	dragonball: Minor changes to comments Changes include: - Merge `VhostNetDeviceError` import item. - Replace if with match in `add_vhost_net_device()` Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-11-07 19:35:02 +08:00
Xuewei Niu	a047331a34	runtime-rs: Network config distinguishes backends Network backends determine the virtio dataplane implementations. Common protocols include virtio-net, vhost-net and vhost-user-net, etc. Network config has a new field named `backend` to specify which protocol to use. Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-11-07 19:35:02 +08:00
Xuewei Niu	9203371833	dragonball: Introduce vhost-net device PLEASE NOTE THAT this pull request just implements vhost-net support for Dragonball, and the adaptation for Runtime-rs. This pull request DOESN'T provide an item to configure which backend to use. To sum up, virtio-net as the default backend is the only choice for the user so far. This pull request introduces a vhost-net device for Dragonball. In addition, this pull request includes changes to Runtime-rs to improve network configuration abilities. The Dragonball part implements a vhost-net device and a vhost-net device manager, named `VhostNetDeviceMgr`, to manage the vhost-net device. `NetworkInterfaceConfig` is introduced as a high-level abstraction for network config. Dragonball is then able to distinguish network backends, e.g. virtio-net, vhost-net, vhost-user-net (WIP), etc. The Runtime-rs part adds support for multiple network backends as well. `NetworkConfig` has a couple of new fields, like `backend`, `use_shared_irq`, etc. Dragonball's network config structs implement the `From` trait, which allows them to be converted conveniently from the Runtime-rs network config. Fixes: #7674 Signed-off-by: Eric Ren <renzhen@linux.alibaba.com> Signed-off-by: Zizheng Bian <zizheng.bian@linux.alibaba.com> Signed-off-by: wllenyj <wllenyj@linux.alibaba.com> Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-11-07 19:35:02 +08:00
Greg Kurz	b27b4ce104	doc: No longer release the test repository Now that most of the test repository got migrated to the main Kata repository, it is no longer needed to tag the test repository when doing a release. Update the documentation accordingly by dropping all references to the test repository and only mention the Kata repository. Fixes #8302 Signed-off-by: Greg Kurz <groug@kaod.org>	2023-11-07 10:28:43 +01:00
Greg Kurz	af2d897fb1	doc: Release now uses the official GitHub CLI The hub tool is deprecated. Releases are now based on the official gh CLI. A notable improvement: when properly set up (see [1]), gh allows using HTTPS directly with one's GitHub credentials, instead of having to set up proper SSH access for pushes to the repo. Adjust the documentation accordingly. Fixes #8302 [1] https://docs.github.com/en/github-cli/github-cli/quickstart#prerequisites Signed-off-by: Greg Kurz <groug@kaod.org>	2023-11-07 10:22:54 +01:00
Greg Kurz	2af9419fa4	doc: No longer run kata-deploy test when releasing This is already tested by CI for every PR. Drop this step from the release process documentation. Fixes #8302 Signed-off-by: Greg Kurz <groug@kaod.org>	2023-11-07 10:19:32 +01:00
Beraldo Leal	dd530ba8ee	tests: fixes AMD errors TestCheckHostIsVMContainerCapable is failing on AMD machines. kata-check_amd64_test.go:96 has no AMD modules, also getCPUType is missing. Fixes #8384. Signed-off-by: Beraldo Leal <bleal@redhat.com>	2023-11-06 16:49:59 +00:00
Beraldo Leal	7641c19f74	runtime: bump containerd for gogo deprecation This update includes necessary changes due to the version bump of containerd and its dependencies. It's part of a broader initiative to phase out gogo protobuf, which has been deprecated, and to align with the current supported libraries. Fixes #7420. Signed-off-by: Beraldo Leal <bleal@redhat.com>	2023-11-06 16:49:59 +00:00
Beraldo Leal	16fa2c39e6	protocols: replace gogo/types.Empty and Any by Google versions. Signed-off-by: Beraldo Leal <bleal@redhat.com>	2023-11-06 16:49:58 +00:00
Beraldo Leal	c61f4a8592	protocols: remove unused fieldpath option The +fieldpath option, specific to gogoprotobuf, enabled dynamic field access in protobuf messages, allowing nested fields to be accessed via string paths. This change is part of a larger effort to transition to the official Go protobuf library for better maintainability and community support. Upon review, no instances of dynamic field access were found in the codebase, confirming that the feature is not in use. By removing this unused feature, we simplify the build process and make it easier to complete the transition away from gogoprotobuf. Signed-off-by: Beraldo Leal <bleal@redhat.com>	2023-11-06 16:49:58 +00:00
Beraldo Leal	c87bc60ea0	protocols: removing unused mappings Those mappings are not used by our .proto files and there is no difference between .pb.go files generated. Signed-off-by: Beraldo Leal <bleal@redhat.com>	2023-11-06 16:49:58 +00:00
Beraldo Leal	c5d845b30a	agent: updating Cargo.lock files Probably previous changes missed updating Cargo.lock. Signed-off-by: Beraldo Leal <bleal@redhat.com>	2023-11-06 16:49:58 +00:00
Beraldo Leal	5d88c78a6e	protocols: generating agent.pb.go `a3b003c345` modified agent but agent.pb.go was not updated. Signed-off-by: Beraldo Leal <bleal@redhat.com>	2023-11-06 16:49:58 +00:00
David Esparza	28e7b3467b	metrics: improving stop and remove running containers This PR changes to using the SIGKILL signal instead of SIGTERM to force stop each kata component before running any metrics test. Fixes: #8336 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-11-06 09:54:32 -06:00
Archana Shinde	3b2fb6a604	Merge pull request #8284 from amshinde/runtime-rs-update-device-pci-info runtime-rs: update device pci info for vfio and virtio-blk devices	2023-11-06 01:09:20 -08:00
Archana Shinde	036b7787dd	runtime-rs: Use PCI path from hypervisor for vfio devices Remove earlier functionality that tries to assign PCI path to vfio devices from the host assuming pci slots to start from 1. Get this from the hypervisor instead. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-11-05 21:59:44 -08:00
Archana Shinde	c3ce6a1d15	runtime-rs: Provide PCI path to the agent for virtio-block If PCI path for block device is not empty for a block device, use that as identifier for agent instead of virt path which is valid only for mmio devices. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-11-05 21:59:44 -08:00
Archana Shinde	a2bbbad711	runtime-rs: change hypervisor add_device trait to return device copy Block(virtio-blk) and vfio devices are currently not handled correctly by the agent as the agent is not provided with correct PCI paths for these devices. The PCI paths for these devices can be inferred from the PCI information provided by the hypervisor when the device is added. Hence changing the add_device trait function to return a device copy with PCI info potentially provided by the hypervisor. This can then be provided to the agent to correctly detect devices within the VM. This commit includes implementation for PCI info update for cloud-hypervisor for virtio-blk devices with stubs provided for other hypervisors. Removing Vsock from the DeviceType enum as Vsock currently does not implement the Device Trait, it has no attach and detach trait functions among others. Part of the reason is because these functions require Vsock to implement Clone trait as these functions need cloned copies to be passed down the hypervisor. The change introduced for returning a device copy from the add_device hypervisor trait explicitly requires a device to implement Copy trait. Hence removing Vsock from the DeviceType enum for now, as its implementation is incomplete and not currently used. Note, one of the blockers for adding the Clone trait to Vsock is that it currently includes a file handle which cannot be cloned. For Clone and Device Traits to be implemented for Vsock, it requires an implementation change in the future for it to be cloneable. Fixes: #8283 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-11-05 21:59:44 -08:00
Bo Chen	071667f1ca	runtime: clh: Re-generate the client code This patch re-generates the client code for Cloud Hypervisor v35.0. Note: The client code of cloud-hypervisor's OpenAPI is automatically generated by openapi-generator. Fixes: #8378 Signed-off-by: Bo Chen <chen.bo@intel.com>	2023-11-03 10:47:06 -07:00
Bo Chen	d1163141b9	versions: Upgrade to Cloud Hypervisor v36.0 Details of this release can be found in our roadmap project as iteration v36.0: https://github.com/orgs/cloud-hypervisor/projects/6. Fixes: #8378 Signed-off-by: Bo Chen <chen.bo@intel.com>	2023-11-03 10:46:56 -07:00
Fabiano Fidêncio	0aac3c76ee	Merge pull request #8365 from fidencio/topic/kata-manager-restrict-containerd-versions-to-be-used kata-manager: Accept only "lts" or "active" as containerd versions	2023-11-03 11:54:05 +01:00
Fabiano Fidêncio	8b4fc847d7	kata-manager: Accept only "lts" or "active" as containerd versions kata-manager is a very nice tool, but we shouldn't be trying to take care of "everything" in "all possible scenarios", and we should focus on installing Kata Containers dependencies that are supported. With this in mind, let's limit a little bit the scope of which versions of containerd can be installed, limiting to "active" and "lts", which will then install the latest version of those "flavours". The default value will always be "lts" as that's supposed to be the stable one. NOTE: This is a breaking change, as it changes the behaviour of what the script takes in its `-c` parameter. I'm assuming here we're safe to do so as the majority of the users should / would only be using the full installation by default. Fixes: #8356 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-11-03 10:30:37 +01:00
Fabiano Fidêncio	d395ae8198	Merge pull request #8368 from fidencio/topic/gha-stale-fixes gha: stale: Fix typo and allow manually triggering it	2023-11-03 10:07:56 +01:00
Fabiano Fidêncio	994615ca28	gha: stale: Allow manually triggering it This will help us to avoid waiting till the next time cron would trigger the action to test Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-11-03 08:17:48 +01:00
Fabiano Fidêncio	6abcf03611	gha: stale: Fix typo action -> actions This is causing the following error: ``` Unable to resolve action action/stale, repository not found ``` Fixes: #8347 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-11-03 08:15:18 +01:00
Steve Horsman	a7a14e33d8	Merge pull request #8285 from sazzy4o/patch-1 Docs: Fix Dragonball link	2023-11-02 17:54:47 +00:00
Fabiano Fidêncio	37233622da	kata-manager: Ensure we run apt-get update before apt-get install As that's an operation that can easily fail, and it's quite simple / cheap for us to run it, let's just do it and avoid the failure. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-11-02 14:14:32 +01:00
Fabiano Fidêncio	d547798284	Merge pull request #7057 from brianwang12/kata-manager-fix kata-manager: Fix deployment of containerd on architectures other than amd64.	2023-11-02 14:14:18 +01:00
Fabiano Fidêncio	8905286767	Merge pull request #8348 from fidencio/topic/gha-add-stale-action-for-PRs gha: Add workflow to close stale PRs	2023-11-02 11:34:35 +01:00
Fabiano Fidêncio	abec287058	gha: Add workflow to close stale PRs Our goal, as discussed in the Architecture Committee meeting held on October 31st, 2023, is to take a more aggressive action on issues and PRs that have been opened for a long time. This commit is the very first step, and it's only targeting PRs. What this action will do is: * Mark all the PRs that have no activity for more than 180 days, starting from May 1st, 2023, as stale. * A message will be added, letting the contributor know that they can simply comment on the PR in order to make it "not stale". * If there's no activity on the PR for 7 days, the PR will be automatically closed. Fixes: #8347 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-11-02 09:19:44 +01:00
briwan.wang	437db15916	kata-manager: Fix Multi-Arch deployment for containerd Fix: Kata-Manager fails to retrieve the correct Containerd string name for architectures other than amd64. Update the 'github_get_release_file_url()' function to make it compatible with different architecture expressions, e.g. aarch64/arm64 or x86_64/amd64, allowing it to acquire the correct URL addresses. Fixes: #7071 Signed-off-by: briwan.wang <briwan.wang@arm.com>	2023-11-02 06:12:04 +00:00
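The architecture aliasing described in the commit above can be sketched as follows. This is not the shell `github_get_release_file_url()` from kata-manager: the alias table, the `pick_release_url` helper and the example URLs are all hypothetical. The sketch also rejects anything other than exactly one match, in the spirit of the "Ensure only one download URL" commit listed earlier in this log.

```python
# Hypothetical alias table: the name the kernel reports vs. the name used in
# release file names.
ARCH_ALIASES = {
    "x86_64": {"x86_64", "amd64"},
    "aarch64": {"aarch64", "arm64"},
}


def pick_release_url(urls: list[str], arch: str) -> str:
    """Return the single URL whose file name mentions any alias of arch."""
    names = ARCH_ALIASES.get(arch, {arch})
    matches = [url for url in urls if any(name in url for name in names)]
    if len(matches) != 1:
        raise ValueError(f"expected exactly one URL for {arch}, found {len(matches)}")
    return matches[0]


urls = [
    "https://example.com/containerd-1.7.0-linux-amd64.tar.gz",
    "https://example.com/containerd-1.7.0-linux-arm64.tar.gz",
]
print(pick_release_url(urls, "aarch64"))
```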
Archana Shinde	004646162e	Merge pull request #8308 from gkurz/fully-drop-hub release: Fully migrate from hub to gh	2023-11-01 22:46:44 -07:00
Peng Tao	b3dbd4f1c7	Merge pull request #8351 from amshinde/update-agent-cargo-lock cargo: Agent cargo.lock updated	2023-11-02 11:31:24 +08:00
Archana Shinde	58b4d1a264	cargo: Agent cargo.lock updated The Cargo.lock for agent needs to be updated to include "safe-path" dependency. Fixes: #8350 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-11-01 11:54:33 -07:00
Fabiano Fidêncio	40cc397218	Merge pull request #8255 from cmaf/migrate-checks-fixes-links docs: Fix broken links	2023-11-01 14:46:30 +01:00
Beraldo Leal	afec54799e	libs: fixes dereferenced reference make check is giving us the following error: error: this expression creates a reference which is immediately dereferenced by the compiler. Fixes #8344 Signed-off-by: Beraldo Leal <bleal@redhat.com>	2023-10-31 15:55:32 -04:00
Beraldo Leal	c57df607ad	libs: fixes comparison to empty slice Make check gives us an "error: comparison to empty slice". Fixes #8343 Signed-off-by: Beraldo Leal <bleal@redhat.com>	2023-10-31 15:51:03 -04:00
Greg Kurz	d20b7381f0	release: Drop obsolete comment in workflow file This comment belongs to the hub tool that got sunset by `710eb8ab9d`. Just drop it. Signed-off-by: Greg Kurz <groug@kaod.org>	2023-10-31 16:03:12 +01:00
Greg Kurz	6236fa4617	release: Drop build_hub helper Not used anymore. Signed-off-by: Greg Kurz <groug@kaod.org>	2023-10-31 15:28:57 +01:00
Greg Kurz	bc4c66caaf	release: Migrate tag_repos.sh to GitHub CLI The hub tool is deprecated. Convert this script to use the official GitHub CLI gh instead of hub. A typical gh setup is able to access repos using HTTPS along with GitHub credentials. It is only needed to patch the remote url when using SSH. Signed-off-by: Greg Kurz <groug@kaod.org>	2023-10-31 15:11:28 +01:00
Greg Kurz	e331102ba3	release: Migrate update-repository-version.sh to GitHub CLI The hub tool is deprecated. Convert this script to use the official GitHub CLI gh instead of hub. A couple of adjustments had to be made: - the notes.md temporary file is moved to ${tmp_dir} in order to silence gh, otherwise it complains about an untracked file, - title of a PR no longer goes to the notes.md file since gh requires the title to be passed with a dedicated --title option. Fixes #8303 Signed-off-by: Greg Kurz <groug@kaod.org>	2023-10-31 15:10:50 +01:00
Greg Kurz	b83a7149ee	release: Introduce helper to get GitHub CLI If gh isn't installed already, download it from GitHub. Signed-off-by: Greg Kurz <groug@kaod.org>	2023-10-31 15:09:24 +01:00
Fabiano Fidêncio	53cda12a71	Merge pull request #8311 from TimePrinciple/log-system-enhancement runtime-rs: Log system enhancement	2023-10-31 10:14:41 +01:00
Greg Kurz	ceeabe3714	release: Allow to test release scripts with an alternate repo We don't want to mess with the official repo when testing a change in the release scripts. Adapt `update-repository-version.sh` to be able to use an alternate repo just like `tag_repos.sh` already does. This means that the following command : $ OWNER="$SOME_ORG" ./update-repository-version.sh -p "$NEW_VERSION" "$BRANCH" will only create a PR in this repo : http://github.com/$SOME_ORG/kata-containers.git Signed-off-by: Greg Kurz <groug@kaod.org>	2023-10-31 09:49:27 +01:00
Archana Shinde	148c565b2f	Merge pull request #8289 from BbolroC/skip-create-tmpfs-s390x agent: Skip flaky create_tmpfs on s390x	2023-10-30 22:26:28 -07:00
Ruoqing He	4ad2cfe0c2	runtime-rs: Log system enhancement By modifying RuntimeLevelFilter drain to improve logging control, enabling isolation of change effect of the loggers between components, tuning clh logs to be logged according to their log levels given by cloud-hypervisor. Fixes: #8310 Signed-off-by: Ruoqing He <linuxwatcher@outlook.com>	2023-10-31 04:57:46 +00:00
David Esparza	2a17d3889e	Merge pull request #8334 from amshinde/ipvlan-nerdctl-fix network: Fix network attach for ipvlan and macvlan	2023-10-30 16:00:32 -06:00
David Esparza	5573705800	Merge pull request #8202 from dborquez/enable_fio_checkmetrics Enable fio checkmetrics	2023-10-30 15:55:37 -06:00
David Esparza	c232869af9	metrics: removes double-quotes in checkmetrics when parsing results This PR removes double quotes in jq output to return raw strings as input of checkmetrics tool. Fixes: #8331 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-10-30 09:43:03 -06:00
David Esparza	c42a2f2eda	metrics: increase the number of attempts to stop kata This PR increases the number of attempts to stop kata components when it is required usually before starting a metrics test. Fixes: #8307 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-10-30 09:43:03 -06:00
David Esparza	1626253d9e	metrics: FIO ci test enablement This PR enables the new FIO test based on the containerd client which is used to track the I/O metrics in the kata-ci environment. Additionally this PR fixes the parsing of results. Fixes: #8199 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-10-30 09:42:54 -06:00
David Esparza	873386a349	metrics: update iodepth and job size fio parameters to improve workload This PR updates the values of the fio parameters for iodepth requests and for the number of jobs, in order to increase the number of sequential operations. Additionally, it adds the list of packages needed to parse the results. Fixes: #8198 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-10-30 08:43:06 -06:00
James O. D. Hunt	d93275224b	Merge pull request #8323 from jodh-intel/utils-kata-manager-fix-version-checks utils: kata manager: Fix version checks	2023-10-30 12:25:51 +00:00
Chao Wu	7d26604061	Merge pull request #7831 from lisongqian/feat/dragonball_trace dragonball: add tracing feature for dragonball	2023-10-30 17:27:30 +08:00
James O. D. Hunt	d7e410ad2b	Merge pull request #8314 from jodh-intel/kata-ctl-show-confidential-guest kata-runtime/kata-ctl: Add security details to output	2023-10-30 07:41:22 +00:00
Songqian Li	2f533c3003	dragonball: add tracing feature for dragonball This PR adds the tracing capability for dragonball and it depends on the tracing::Subscriber of the upper layer. Fixes: #7249 Signed-off-by: Songqian Li <mail@lisongqian.cn>	2023-10-28 19:52:24 +08:00
Chao Wu	f1f4410537	Merge pull request #7695 from lisongqian/feat/legacy_metrics dragonball: add metrics support for legacy device	2023-10-28 16:48:57 +08:00
Archana Shinde	f53f86884f	network: Fix network attach for ipvlan and macvlan We used the approach of cold-plugging network interface for pre-shimv2 support for docker. Since the hotplug approach was not required, we never really got to implementing hotplug support for certain network endpoints, ipvlan and macvlan being among them. Since moving to shimv2 interface as the default for runtime, we switched to hotplugging the network interface for supporting docker and nerdctl. This was done for veth endpoints only. Implement the hot-attach apis for ipvlan and macvlan as well to support ipvlan and macvlan networks with docker and nerdctl. Fixes: #8333 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-10-27 21:42:37 -07:00
Peng Tao	52a014d9cd	Merge pull request #8033 from h56983577/6715/shared-mount agent: use open_tree()/move_mount() to set up bind mounts between containers directly.	2023-10-28 10:57:34 +08:00
Songqian Li	da77b19449	dragonball: output legacy device metrics to runtime Legacy device manager adds device metrics to METRICS when a device is created and removes metrics when a device is dropped. Fixes: #7248 Signed-off-by: Songqian Li <mail@lisongqian.cn>	2023-10-27 14:09:42 +08:00
Songqian Li	65213e9fbe	dragonball: unify the metric interface of legacy device Fixes: #7248 Signed-off-by: Songqian Li <mail@lisongqian.cn>	2023-10-27 14:09:42 +08:00
Chao Wu	b508091305	Merge pull request #8322 from wainersm/git_helper-fix tests/git-helper: cancel any previous rebase left halfway	2023-10-27 14:07:16 +08:00
Spencer von der Ohe	fee97e219c	docs: Fix Dragonball link Update dragonball link to be the current repo (from archived repo) Fixes #8324 Signed-off-by: Spencer von der Ohe <s.vonderohe40@gmail.com>	2023-10-26 21:12:31 -06:00
Archana Shinde	f5c17f89a3	Merge pull request #8250 from amshinde/runtime-rs-clh-config runtime-rs: Add default configuration file for cloud-hypervisor	2023-10-26 14:54:47 -07:00
Chelsea Mafrica	0608e20a01	docs: Fix broken links Update broken links so that static checks pass. Fixes #8254 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2023-10-26 10:17:01 -07:00
Chelsea Mafrica	4ede63fa4d	Merge pull request #8317 from cmaf/gha-spellcheck-reqs gha: add dependencies for spell checker	2023-10-26 10:11:26 -07:00
James O. D. Hunt	ae3ea1421d	utils: kata-manager: Fix containerd version check Containerd release files include the version number without a "v" prefix. However, the tag for the equivalent release does include it so handle this distinction and also tighten up the Kata check by specifying an explicit version number in the regex. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-10-26 16:34:56 +01:00
James O. D. Hunt	346f195532	utils: kata-manager: Fix whitespace Use tabs consistently. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-10-26 16:06:51 +01:00
Wainer dos Santos Moschetta	0ce0abffa6	tests/git-helper: cancel any previous rebase left halfway On bare-metal machines the git tree might get into an unstable state with the previous rebase left halfway. So let's attempt to abort any rebase before. Fixes #8318 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2023-10-26 11:50:12 -03:00
James O. D. Hunt	2ac7ac1dd2	utils: kata-manager: Fix "Cannot determine download URL" issue The archive names for x86_64 [Kata releases](https://github.com/kata-containers/kata-containers/releases) used to include the tag `x86_64`, but that has now been changed to `amd64`, which unfortunately broke `kata-manager.sh`: ``` kata-static-3.1.3-x86_64.tar.xz ~~~~~~ expected kata-static-3.2.0-alpha3-x86_64.tar.xz ~~~~~~ expected kata-static-3.2.0-alpha4-amd64.tar.xz ~~~~~ changed ``` Fixes: #8321. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-10-26 15:27:37 +01:00
James O. D. Hunt	59bd534827	utils: kata-manager: Lint fixes Improve the code by fixing some lint issues: - defining variables before using them. - Using `grep -E` rather than `egrep`. - Quoting variables. - Adding a check for invalid CLI arguments. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-10-26 15:24:46 +01:00
HanZiyao	a3b003c345	agent: support bind mounts between containers This feature supports creating bind mounts directly between containers through annotations. Fixes: #6715 Signed-off-by: HanZiyao <h56983577@126.com>	2023-10-26 16:34:50 +08:00
Archana Shinde	1b8ec08278	Merge pull request #8281 from amshinde/add-clh-config-kata-manager kata-manager: Add clh config to containerd config file	2023-10-25 13:44:53 -07:00
Chelsea Mafrica	c20aadd7a8	gha: add dependencies for spell checker In the migration from the tests repo to the kata containers repo we missed two hunspell dictionaries for static checks; add them. Fixes #8315 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2023-10-25 12:49:09 -07:00
James O. D. Hunt	d707fa2c0d	kata-runtime/kata-ctl: Add security details to output Add the hypervisor security details to the output of the `kata-runtime env` and `kata-ctl env` commands so the user can see, amongst other things, the value of `confidential_guest`. Fixes: #8313. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-10-25 16:34:42 +01:00
Chao Wu	29d863350f	Merge pull request #7697 from lisongqian/feat/balloon_metrics dragonball: add metrics support for balloon device	2023-10-25 02:42:14 -05:00
Fabiano Fidêncio	328ba0da99	Merge pull request #7647 from jongwu/use_pcie_virt AArch64: runtime: use pcie root port to do pci/pcie device hotplug	2023-10-25 09:17:13 +02:00
Archana Shinde	f99de4d5a1	runtime-rs: Make default kernel params as empty The default kernel params passed to any hypervisor except dragonball is empty. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-10-24 15:50:12 -07:00
Archana Shinde	a813012785	runtime-rs: Add default configuration file for cloud-hypervisor The config template file for clh is in the new format for runtime-rs. It is a result of merging the new format file and options supported by cloud-hypervisor. Some config options from the golang runtime are missing as they may not be currently supported by the rust runtime. An example of this is the selinux options, rate limiting options as these are not currently supported or verified with the rust runtime. Fixes: #8249 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-10-24 15:17:24 -07:00
Chao Wu	43675bd485	Merge pull request #8294 from ZizhengBian/jason/for-master runtime-rs: fix a typo in device manager	2023-10-24 04:52:04 -05:00
Songqian Li	dce365d5b4	dragonball: add conditional compilation for BalloonDeviceMetrics Fixes: #7248 Signed-off-by: Songqian Li <mail@lisongqian.cn>	2023-10-24 13:33:39 +08:00
GabyCT	4c3a664358	Merge pull request #8278 from GabyCT/topic/udpparallel metrics: Add parallel udp iperf3 benchmark	2023-10-23 10:30:53 -06:00
Fabiano Fidêncio	a001021721	Merge pull request #8292 from fidencio/topic/release-ensure-gh-is-used-from-a-git-repo release: Always use actions/checkout to ensure we're in a git repo	2023-10-23 15:16:12 +02:00
Songqian Li	3819f0ee6f	dragonball: output balloon device metrics to runtime Balloon device manager adds balloon device metrics to METRICS when a device is created and removes metrics when a device is dropped. Fixes: #7248 Signed-off-by: Songqian Li <mail@lisongqian.cn>	2023-10-23 21:15:22 +08:00
Zizheng Bian	7d7c25c1d6	runtime-rs: fix a typo in device manager Fixes: #8293 Signed-off-by: Zizheng Bian <zizheng.bian@linux.alibaba.com>	2023-10-23 20:33:47 +08:00
Fabiano Fidêncio	c5cfad7023	actions: Move all the checkout actions to v4 It's been released for a while now, and we need to keep consistency between what we used. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-10-23 14:01:53 +02:00
Fabiano Fidêncio	b32c6bf805	release: Always use actions/checkout to ensure we're in a git repo Otherwise we'll face issues like: ``` Run tag=$(echo $GITHUB_REF \| cut -d/ -f3-) tag=$(echo $GITHUB_REF \| cut -d/ -f3-) tarball="kata-static-$tag-amd64.tar.xz" mv kata-static.tar.xz "$GITHUB_WORKSPACE/${tarball}" pushd $GITHUB_WORKSPACE echo "uploading asset '${tarball}' for tag: ${tag}" GITHUB_TOKEN=*** gh release upload "${tag}" "${tarball}" popd shell: /usr/bin/bash -e {0} ~/work/kata-containers/kata-containers ~/work/kata-containers/kata-containers uploading asset 'kata-static-3.3.0-alpha0-amd64.tar.xz' for tag: 3.3.0-alpha0 failed to run git: fatal: not a git repository (or any of the parent directories): .git ``` Fixes: #8286 (or better, just a follow up of that) Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-10-23 14:00:39 +02:00
Fabiano Fidêncio	8fe88696c0	Merge pull request #8287 from fidencio/topic/release-use-gh-cli-instead-of-hub actions: release: Use GH cli instead of hub	2023-10-23 12:40:22 +02:00
Hyounggyu Choi	a0746c8d7b	agent: Skip flaky create_tmpfs on s390x This is to skip a flaky test `create_tmpfs()` on s390x until a root cause is identified and fixed. Fixes: #4248 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2023-10-23 11:22:14 +02:00
Fabiano Fidêncio	710eb8ab9d	actions: release: Use GH cli instead of hub hub is now deprecated, which has been causing issues with our release process. Let's move to the GH cli (https://cli.github.com/manual), and unblock this release. NOTE: This commit is purposefully not touching anywhere else hub is used, as that would require more time and investigation to do the switch, and right now we just want to unblock the release. Fixes: #8286 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-10-23 08:49:55 +02:00
Fabiano Fidêncio	74d4865189	Merge pull request #8275 from fidencio/topic/ci-adapt-kata-deploy-regex-on-repo-version-update release: Adapt the CIs using the kata-deploy image	2023-10-23 00:37:19 +02:00
Archana Shinde	d3250dff34	kata-manager: Add clh config to containerd config file kata-manager currently adds default config which currently is qemu. Add config for clh as well to containerd configuration. This should allow new users to get started with clh using kata-manager. Also add config related to enabling privileged_without_host_devices. Always good to have this config enabled when users try to run privileged containers so that devices from host are not inadvertently passed to the guest. Fixes: #8280 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-10-20 18:16:16 -07:00
Gabriela Cervantes	2d0518cbe6	metrics: Add parallel udp iperf3 benchmark This PR adds the parallel udp iperf3 benchmark for network metrics. Fixes #8277 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-10-20 19:54:06 +00:00
Dan Mihai	732fe163f3	Merge pull request #8229 from microsoft/danmihai1/no-config-toml-endpoints agent: no endpoint blocking from agent-config.toml	2023-10-20 11:30:43 -07:00
Fabiano Fidêncio	026f6a1a4c	release: Adapt the CIs using the kata-deploy image This is needed in order to properly run the CIs in branches that are not the main one, as the kata-deploy.yaml file on those branches do not have the `latest` tag, but rather the latest stable release. Fixes: #8274 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-10-20 18:59:14 +02:00
Fabiano Fidêncio	124f498830	Merge pull request #8266 from fidencio/3.3.0-alpha0-branch-bump # Kata Containers 3.3.0-alpha0	2023-10-20 17:40:44 +02:00
GabyCT	8486283012	Merge pull request #8247 from GabyCT/topic/iperfudp metrics: Add iperf udp benchmark	2023-10-20 09:21:37 -06:00
Fabiano Fidêncio	0fb69ddf6a	release: Kata Containers 3.3.0-alpha0 - kata-deploy-stable: Switch to using the ubuntu based payload - libs: protection: Fix typo in TDX output - ci: k8s: Fix bogus firecracker check in k8s-credentials-secrets.bat - tests: Enable agent stability test - docs: Fix paths to build kernel in SNP VMs documentation - runtime-rs: ch: Add TDX CH features check - runtime: Validate hypervisor section name in config file - tests: query data from the OPA service - release: tag_repos: Stop tagging the `tests` repo - metrics: fixes common.sh function to always return true - Memory footprint test removing trailing commas to make json results file valid - policy: allow access to ReseedRandomDev - runtime/kata-ctl: update dependencies - runtime-rs : fix Nydus support for runtime-rs + Dragonball - metrics: removal of reference in the documentation to the fio dax subtest. - runtime-rs: ch: Detect Intel TDX version - runitme-rs: use the same base64 as kata-runtime/direct-volume does - tests: Enable scability test for stability CI - runtime-rs: Add support for adding vfio device for cloud-hypervisor - tests: Enable soak parallel stability test - dragonball: vcpu metrics change to be recorded per vcpu - ci: k8s: adapt gha-run.sh to run locally - metrics: removes kata components and k8s deployment when test finishes - GHA: fix up referenced yaml exceeding 20 limit problem - gha: ci: Revert tracing test PR to unbreak CI - runtime-rs: ch: Enable feature - gha: ci: Port runk tests over - ci: gha: Port tracing tests over - Enable fio test using containerd client - gha: Add stability tests workflow for gha - gha: arm64: Ensure the builder is arm64-builder - kata-deploy: Build kata-agent as we build all the other components - versions: migrate out of k8s.gcr.io - doc: Update crictl pod-config - gha: Fix k0s deployment - tests: Add stability test for kata CI - docs: Update url in kata vra document - gpu: Adding CDI support for cold and hot-plug of VFIO devices - 
kata-deploy: build & ship the rust components from src/tools/ - metrics: Add latency value limits for kata CI - runtime: fix reading cgroup stats of sandboxes - Upgrade to Cloud Hypervisor v35.0 - ci: Port kata-monitor tests from Jenkins to GHA - metrics: Fix latency yamls path - metrics: Fix metrics README - metrics: Fix C-Ray documentation - runtime-rs: ch: Enable Intel TDX - ci: k8s: crio: Follow up patches to have CRI-O also working as part of our CI - metrics: Enable latency test in gha run script - local-build: Fix .docker ownership before build-payload - runtime-rs: Add network support for cloud-hypervisor - osbuild: Reduce guest components binary size with strip - gha: Add pandoc as a dependency for static checks - ci: rootfs-image build-asset is failing - feat(runtime-rs): introduce huge page mode to select VM RAM's backend - clh: Direct IO support for block devices - gha: Install hunspell for static checks - ci: Trigger payload-after-push on workflow_dispatch - ci: Actually enable the CRI-O tests - protocol: remove gogoprotobuff tests - ci: k8s: Also run tests with CRI-O - runtime: support kernel params including spaces - ci: kata-deploy: Fix runner name - metrics: Enable parallel bandwidth iperf limit - ci: kata-deploy: Enable all k8s flavours that we support - ci: Create clusters in individual resource groups - versions: Bump virtiofsd to v1.8.0 - clh: arm: Use static_sandbox_resource_mgmt=true - Bump nydus versions and update nydus tests - runtime/qemu: Rework QMP/HMP support - clh:arm64: use arm AMBA UART for hypervisor debug - ci: Use variable size of VMs depending on the tests running - ci: Rework static checks - runtime: incorrect handling of non-empty []Endpoint parameter in Remo… - ci: cache: Check the sha256sum of the components & fix ovmf-sev cache usage - ci: cache: Use the artefacts stored in ghcr.io/kata-containers/cached-artefacts/${component} - ci: Run some of the GARM tests in smaller instances - ci: Reduce the size of the AKS VMs - ci: 
cache: Allow pushing our artefacts to an OCI registry - metrics: Add iperf value for cpu utilization - ci: cache: Export env vars needed to use ORAS - gha: vfio: Import test script - tests: fix kernel and initrd annotations - metrics: Add iperf bandwidth value for kata metrics - metrics: Add Cassandra Metrics documentation - metrics: Remove warning from metrics documentation - ci: docker: nerdctl: Switch to tcp port 80 ping - runtime: Naming conflict of network devices - Remove gogoproto.nullable extension - metrics: Ensure docker is running in init_env - metrics: this PR skips the FIO test temprarily to fix issues - ci: Add a very basic nerdctl sanity test - runtime-rs: hypervisor: Remove debug kernel options - versions: Bump rust version - ci: Add a very basic docker sanity test - dragonball: fix for non-deterministic builds - runtime-rs: bring hybrid vsock devices in manager. - ci: use github.ref_name instead of $GITHUB_REF_NAME - ci: Add more target-branch related fixes - ci: Fix target-branch usage - agent: optimize the code of systemd cgroup manager - gha: Manually rebase PR atop of the target branch before testing - Update kernel to the latest LTS release (v6.1.52) and bring in erofs patches needed for the CC work - kata-deploy: Fix aarch64 image build - runtime: Fix more virtiofs args - kata-deploy: Switch to an alpine image - metrics: Use TensorFlow optimized image - metrics: fix FIO test initialization - ci: k8s: Add clean-up-garm argument for gha-run.sh - ci: k8s: Second round of fix-ups with the devmapper CI - metrics: re-enable memory-usage initialization step - Dragonball: optimize the placement of dbs-upcall features - ci: k8s: Fix typo in run-k8s-tests-on-garm.yaml - ci: k8s: Add k8s devmapper tests (part 0) - kata-deploy: Create kata-static.tar with correct ownership - runtime: run prestart hooks before starting VM for FC - metrics: Add write 95 percentile FIO value - runtime: Allow virtio_fs_extra_args annotation - packaging: do not install 
docker-compose-plugin for s390x\|ppc64le - runtime-rs: Fix volumes and rootfs cleanup issues - metrics: Enable iperf benchmark on gha for kata metrics - CI: switch static-checks-dragonball CI machines to Azure - metrics: Add README for kata metrics report - osbuilder: Remove chcon operation for guest SELinux - kata-sys-util: protection: Update TDX checks - Improve the way to clean up storage devices for sandbox - agent: avoid possible leakage of storage device - tests: add policy to existing tests - gha: Rebase PR atop of the target branch before testing - versions: Update alpine to its 3.18 version - runtime: Fix data race in ioCopy - metrics: Add grabdata script for metrics report - Fixes tests on AMD machines - metrics: Enable FIO limits for kata metrics - metrics: Add metrics report script - metrics: Fix memory inside limits for kata metrics - metrics: fix parsing issue on memory-usage test - dragonball: vsock add fifo/pipe stream support for passed fd hybridSt… - tests: Add confidential test - tdx: Update the components needed for using the 6.2 kernel stack - tests: delete k8s deployment at the test's end - tests: use unique test name - runtime-rs: check peer close in log_forwarder - gha: Avoid "fail-fast" in tests that are known to be flaky - Refine storage device management for kata-agent - metrics: Remove unused variable in tensorflow nhwc script - kata-deploy: Don't try to remove /opt/kata - metrics: Add TensorFlow ResNet50 FP32 benchmark - gha: vfio: Run on Ubuntu 23.04 runner - kata-agent: use default filemode for block device when it is set to 0 - kata-types: introduce KataVirtualVolume to support nydus, direct volume and image pull - libs,tests: fix typo disable_guest_seccomp in configuration-anno-1.toml - local-build: Remove GID before creating group - kata-deploy: Avoid failing on content removal - runtime: fix image and initrd assets handling - metrics: Add disk link to README - metrics: Fix FIO path - gha: capture additional kata-deploy output - 
metrics: Use function from metrics common in pytorch script - metrics: Enable kata runtime in K8s for FIO test. - metrics: Fix README for pytorch - metrics: Remove unused variable in tensorflow mobilenet script - rootfs: agent: Policy support with AGENT_INIT=yes - gha: k8s: kata-deploy: Move kata-deploy specific tests from integration/kubernetes to functional/kata-deploy - metrics: Fix check results for tensorflow benchmark - metrics: Add Tensorflow ResNet50 int8 benchmark - kata-deploy: Properly create default runtime class - agent: simplify error handling - metrics: Fix MobileNet help me description - gha: ci: Start running kata-deploy tests - runk: Modify kill command's error message for containerd tests - runtime-rs: add driver option - gha: cri-containerd: Enable tests - metrics: Rename tensorflow scripts - gha: tests: Add kata-deploy functional tests -- Part 1 - agent: runtime: add Agent Policy feature - runk: Support without pid ns - metrics: Add Cassandra Kubernetes benchmark for kata metrics - metrics: Add common functions to the common script - metrics: fix the loop used to stop kata components - docs: Remove installation step in virtcontainers doc - Propogate secrets, config maps etc into guest if sharedFS not available - kata-deploy: Preliminary k0s support - gha: static-checks: Move to the Azure instances - versions: Update firecracker version to 1.4.0 - agent: Allow clippy::redundant_clone in the unit tests - agent: avoid creating new `Vec` instances when easily avoidable - metrics: compute tensorflow statistics - metrics: Add network nginx benchmark - metrics: install kata once and run multiple checks - ci: unencrypted-image: Fix build context - ci: create-confidential-image: Add dependent actions - Follow up fixes for https://github.com/kata-containers/kata-containers/pull/7596 - tests: Create image that will be used in the unencrypted confidential tests - kata-deploy: Ensure we cover SHIMS / DEFAULT_SHIM as part of our tests - tests: upgrade bats 
version - Fix mimor bugs and improve coding stype of agent rpc/sandbox/mount - deps: Bump dependent crate versions - fix number of queues handling in dragonball share fs device - runtime-rs: Introduce directly attachable network - metrics: General improvements to mobilenet tensorflow test - gha: Add iperf network metrics - docs: Use control-plane term instead of master - agent: avoid unnecessary calls to `Arc::clone` - metrics: Add network latency test - Image pulling on the host - Use version 0.10.4 of `fuse-backend-rs` - kata-deploy: Use host's systemctl - release: Revert kata-deploy changes after 3.2.0-rc0 release - metrics: stop kata components before start a metric test. - runtime-rs: Add block device handling for cloud hypervisor `a93fdb014` kata-deploy-stable: Adapt to what we're using in the stable branch `36109da93` ci: k8s: Fix bogus firecracker check in k8s-credentials-secrets.bat `d01daf749` tests: Adjust timeout for agent stability test `9b14dda14` libs: protection: Fix typo in TDX output `0e0867f15` runtime-rs: ch: Add TDX CH features check `409eadddb` runtime-rs: ch: Improve readability of guest protection checks `82a0814fc` tests: Enable agent stability test `32be8e3a8` tests: query data from the OPA service `b81c0a669` tests: encode policy file during test `4f9681b41` metrics: fixes common.sh function to always return true `2ef2b2a6d` docs: Fix paths to build kernel in SNP VMs documentation `408b59c02` runtime-rs: fix bugs to support Nydus v5 `157caea9f` Revert "nydus: Temporarily skip tests on dragonball" `678fe3cd3` Dragonball: fix Nydus config serde problem `b6ec62138` policy: allow access to ReseedRandomDev `908519db9` metrics: skips docker restart when it is not installed or is masked. `c2763120a` metrics: removing trailing comma characters from json file. 
`3e8cf6959` runtime: Validate hypervisor section name in config file `ef6388e81` tests: Remove unused function from scability test `fbc8f8f46` scripts: Use install_yq from the `kata-containers` repo `65b1a2d27` release: tag_repos: Stop tagging / updating the `tests` repo `87b760f56` runtime-rs: ch: Detect Intel TDX version `73e81f5e3` runitme-rs: unify base64 encoding for direct-volume `c6463cb5a` tests: Fix path for versions yaml for soak parallel test `89c9454fc` metrics: removal of reference in the documentation to the dax test. `30ff58904` tests: Enable scability test for stability CI `8d6f7b909` runtime-rs: Add support for handling vfio device for cloud-hypervisor `e786b2b01` gha: Add install dependencies for stability tests `dbfe6512f` dragonball: vcpu metrics change to be recorded per vcpu `fa60fbe02` dragonball: METRICS is refactored to RwLock<DragonballMetrics> `500d1c5ce` kata-ctl: update rustls-webpki/webpki dependency `d7660d82a` runtime: unify gopkg.in/yaml.v3 to v3.0.1 `fc9a107e8` runtime: unify swag and testify dependency `79ebb959c` runtime: update runc dependency to v1.1.9 `7f3e8bd65` runtime: unify golang.org/x/text to v0.7.0 `df325ae37` runtime: update golang.org/x/net to v0.7.0 `bba34910d` metrics: stops kata components and k8s deployment when test finishes `84e3d884e` gha: Add general dependencies to stability tests `dec3951ca` tests: Add soak parallel stability test `0f04d527d` tests: Enable soak parallel test `e669282c2` ci: k8s: set KUBERNETES default value `c30c3ff18` tests: run k8s-volume on a given node `666993da8` tests: run k8s-file-volume on a given node `3a00fc910` tests: exec_host() now gets the node name `61c9c17bf` tests: add get_one_kata_node() to tests_common.sh `68f083c4d` ci: k8s: set KATA_HYPERVISOR default value `6677a61fe` ci: k8s: configurable deploy kata timeout `200e54292` ci: k8s: shellcheck fixes to gha-run.sh `4af78be13` kata-deploy: re-format kata-[deploy\|cleanup].yaml `d54e6d9cd` ci: k8s: run_tests() for kcli 
`c2ef1f0fb` ci: k8s: add deploy-kata-kcli() to gh-run.sh `d2be8eef1` ci: k8s: add cleanup-kcli() to gha-run.sh `cbb9aa15b` ci: k8s: set default image for deploy_kata() `89bef7d03` ci: k8s: create k8s clusters with kcli `954d40cce` gha: combine coco jobs into a single yaml `b60e0a9b5` gha: combine basic amd64 jobs into a single yaml `e9bd85211` gha: ci: Revert tracing test PR to unbreak CI `b8a46a4b8` runtime-rs: ch: Enable feature `0f2dc8c67` gha: Add containerd stability tests to ci yaml `da91c9df8` ci: Port runk tests to this repo `7f2377276` ci: Add placeholder for runk tests `9205acc3d` ci: Move tracing tests here `85d290a04` gha: Add stability gha run script `54f0c8f88` gha: Add stability tests workflow for gha `3bb2923e5` ci: Add placeholder for tracing tests `2c3bf406d` ci: Create a function to install docker `119f03de2` gha: arm64: Ensure the builder is arm64-builder `8c498ef5e` metrics: Use jq tool to pretty-print json metrics output `a2159a636` metrics: Enables FIO test for kata containers `70e7ec3e2` gha: Fix k0s deployment `560bbffb5` packaging: tools: Remove `set -x` leftover `18fa483d9` packaging: release: Mention newly added images `ca3b88837` packaging: tools: Fix container image env var name `5ca66795c` packaging: Allow passing the TOOLS_CONTAINER_BUILDER `02acef957` gha: Build the kata-agent as part of our workflows `5208386ab` packaging: Build the kata-agent `1727487ee` agent: Allow specifying DESTDIR and AGENT_POLICY via env vars `45c118883` packaging: Add get_agent_image_name() `0db8fb8f9` versions: migrate out of k8s.gcr.io `a1a054367` doc: Fix spelling `6339605a1` tests: Add general stability fixes `59ae24444` doc: Update crictl pod-config `fd19f4082` tests: Add agent stability test `215577032` tests: Add cassandra stress in stability tests `f2d3ea988` tests: Add stressng dockerfile for stability tests `6493aa309` tests: Add stressor CPU test for stability tests `ef68a3a36` metrics: Add stability test for kata CI `7c934dc7d` gpu: Fix 
cold-plug of VFIO devices `8d66ef518` metrics: Increase qemu jitter value `5600e28b5` metrics: Increase jitter value for clh `a6b1f5e21` ci: Build src/tools components as part of our tests / releases `501a168a8` kata-deploy: Build components from src/tools `6ef42db5e` static-build: Add scripts to build content from src/tools `4d08ec29b` packaging: Add get_tools_image_name() `98097c96d` packaging: Use git abbreviated hash `489caf1ad` ci: kata-monitor: Move tests over `a3fb067f1` ci: Add placeholder for kata-monitor tests `57cb4ce20` ci: Make install_kata aware of container engines `de1eeee33` ci: Create a generic install_crio function `64a200085` ci: Add install_cni_plugins helper `8132fe15c` ci: Modify containerd default config `8cb7df1be` metrics: Add checkmetrics for latency test `e90440ae2` metrics: Add qemu latency value limit `a74a8f8a9` metrics: Add latency value limits for kata CI `d7def8317` metrics: Fix general check static warnings `928553d1b` docs: Update url in kata vra document `b0a3293d5` runtime-rs: ch: Enable Intel TDX `523399c32` runtime-rs: ch: Add more consts `dea806581` runtime-rs: ch: Remove unused function `995f2c015` runtime-rs: ch: Only handle particular pending device types `b1b96a5c4` runtime-rs: ch: Remove erroneous "virtio-blk-mmio" check `9ac29b8d3` metrics: Add init_env function to latency test `dfd0c9fa9` runtime: clh: Re-generate the client code `8f9f087e3` versions: Upgrade to Cloud Hypervisor v35.0 `81c8babca` metrics: Fix latency yamls path `481573682` metrics: Fix C-Ray documentation `ef63d67c4` ci: crio: Trail '\r' from exec_host() output `74c12b292` ci: crio: Enable default capabilities `358dc2f56` kata-deploy: Fix CRI-O detection `ebaa4fa4c` ci: crio: Pass `-y` to apt `97e73b223` metrics: Fix spelling warnings `36c8cd6f1` metrics: Fix metrics README `15425a2b8` local-build: Fix .docker ownership before build-payload `13ca7d9f9` gha: Add pandoc as a dependency for static checks `08bc8e4db` metrics: Add latency benchmark for gha 
`6776b55d7` metrics: Enable latency test in gha run script `94e2ccc2d` runtime: fix reading cgroup stats of sandboxes `d507d189b` fc: Add support for noflush cache option `2ca781518` clh: Direct IO support for block devices `0c95697cc` ci: Trigger payload-after-push on workflow_dispatch `28cbc3b51` ci: rootfs-image build-asset is failing Fixes: #8027 `87a861648` gha: Install hunspell for static checks `8c3c50ca8` ci: Actually enable the CRI-O tests `3a6510ad6` osbuild: Reduce guest components binary size with strip `07a6e63a6` ci: k8s: rke2: Use sudo to call systemd `03b82e848` ci: k8s: Add a CRI-O test `d7105cf7a` ci: k8s: Add a method to install CRI-O `54c0a471b` ci: k8s: k0s: Allow passing parameters to the k0s installer `730ef5169` deps: updating dependencies `3a2c83d69` ci: kata-deploy: Fix runner name `82ff2db46` runtime: support kernel params including spaces `604a9dd67` protocol: remove gogoprotobuff tests `f7fa7f602` ci: Enable kata-deploy tests for all the supported k8s flavours `2c908b598` ci: kata-deploy: Add the ability to deploy rke2 `eaf616491` ci: kata-deploy: Add the ability to deploy k0s `001525763` ci: kata-deploy: Add deploy-k8s argument to gha-run.sh `bf2cb0228` ci: kata-deploy: Expland tests to run on k0s / rke2 `b12b9e188` ci: kata-deploy: Add placeholder for tests on GARM `9e1fb8a96` ci: kata-deploy: Export KUBERNETES env var `09cc0ed43` ci: Move deploy_k8s() to gha-run-k8s-common.sh `486fe14c9` ci: Properly set K8S_TEST_UNION `d9ef1352a` ci: Add first letter of the K8S_TEST_HOST_TYPE to resource group name `68267a399` ci: Create clusters in individual resource groups `9aa8d1c91` metrics: Add parallel bandwidth limit for qemu `44c7c082d` versions: Bump virtiofsd to v1.8.0 `af59d4bf4` metrics: Enable parallel bandwidth iperf limit `aba36ab18` nydus: Temporarily skip tests on dragonball `b8a8dfcd1` nydus: Use `kata-${KATA_HYPERVISOR}` instead of `kata` `f6df3d6ef` static-build: Fix arch error on nydus build `2f9c9e2e6` tests: nydus: Update 
nydus tests `c9a4e7e46` versions: Bump nydus and nydus-snapshotter to its latest release `b73bde320` gha: nydus: Populate run() `b3904a1a3` gha: nydus: Populate install_dependencies() `d2b3b67f5` gha: nydus: Actually install kata when `install-kata` is called `0ec00ad42` gha: nydus: Get rid of nydus{,-snapshotter} install from nydus_test.sh `568439c77` tests: nydus: Add timeout to the crictl calls `5ac3b76eb` tests: nydus: Add uid / namespace to the nydus container / sandbox `376574a16` tests: nydus: Decorate some calls with `sudo` `4290fd4b6` tests: nydus: Adapt "source ..." to GHA `a84efa3e8` tests: nydus: Adapt check to "clh" instead "cloud-hypervisor" `56a14b395` tests: common: Add install_nydus_snapshotter() `b6563783e` tests: common: Add install_nydus() `72599f191` clh: arm: Use static_sandbox_resource_mgmt=true `1f16b6627` runtime/qemu: Rework QMP/HMP support `8b1e9b0c7` ci: static-checks: Clean up static-checks job `2c5ca2eaf` ci: static-checks: Run tests depending on KVM `509c309ab` ci: static-checks: Move "sudo make test" to the new test matrix `4e963cedf` ci: static-checks: Move "make test" to the new test matrix `08f2e5ae0` runtime-rs: Ensure static-checks-build is a dep of `make test` `2bc3a616a` kata-ctl: Use `loop` instead of `kvm` module in tests `46daddc50` kata-ctl: Ensure GENERATED_CODE is a dep of `make test` `ec826f328` agent: Ensure GENERATED_CODE is a dep of `make test` `1d32410a8` ci: install_libseccomp: Do not depend on the tests repo `bf888b9a5` ci: static-checks: Move "make check" to the new test matrix `473ec8780` kata-ctl: Add `kata-types` to the Cargo.lock file `ea19549a9` kata-ctl: Ensure GENERATED_CODE is a dep of `make check` `e12577586` tests: install_rust: Also install clippy `e2c61a152` ci: static-checks: Move vendor check to its own job `6794d4c84` tests: Move install_rust.sh from the tests repo `e64508c30` tests: install_go: Remove tests repo dependency `11dff731b` tests: Move functions from kata_arch script here `75c974c80` 
ci: static-checks: Move kernel config check to its own job `9c233bb9e` test: Add test to verify try_from for clh Netconfig `c69a1e33b` ci: Use variable size of VMs depending on the tests running `9049d311d` runtime-rs: Add network support for cloud-hypervisor `eecd5bf2a` ci: cache: Fix ovmf-sev cache `86c41074b` ci: cache: Check the sha256sum of the component `460988c5f` ci: cache: Remove the script used to cache artefacts on Jenkins `4533a7a41` ci: cache: Also store the ${component} sha256sum `eccc76df6` ci: cache: Use the cached artefacts from ORAS `7f5e77bcb` kernel: enable Arm pl011 support `241c355e0` clh:arm64: use arm AMBA uart for hypervisor debug `094b6b2cf` ci: k8s: Temporarily disable tests that require a bigger VM instance `d0c257b3a` ci: cache: Push cached artefacts to ghcr.io `108f1b60d` kata-deploy: Generate latest_{artefact,image_builder} files `be2eb7b37` ci: cache: Install ORAS in the kata-deploy binaries builder container `fb24fb0dc` ci: k8s: devmapper: Use a smaller / cheaper VM instance `1daf02f5d` ci: nydus: Use a smaller / cheaper VM instance `e60d81f55` ci: nerdctl: Use a smaller / cheaper VM instance `4db416997` ci: docker: Use a smaller / cheaper VM instance `32841827b` ci: cri-containerd: Use a smaller / cheaper VM instance `92fff129f` ci: k8s: Don't set cpu limit request for k8s-inotofy test `faf98c062` ci: Reduce the size of the AKS VMs `adc18ecdb` ci: cache: For consistency, read all used env vars `c7a851efd` ci: cache: Pass the exposed env vars to the kata-deploy binaries in docker `6bd15a85d` ci: cache: Export env vars needed to use ORAS `cd4fd1292` metrics: Add iperf cpu utilization limit for qemu `df5cd10ea` metrics: Add iperf value for cpu utilization `a96050a7a` tests: Apply timeout to 'ctr t kill' `9d9303678` tests/vfio: Bump VM image to Fedora 38 `faee59b52` tests/vfio: Accept single device in vfio group for CLH `df3dc1105` tests/vfio: Get rid of sync's `7211c3dcc` gha: vfio: Set test timeout to 15m `1b02f89e4` packaging: 
kernel: Enable VIRTIO_IOMMU on x86_64 `3a1db7a86` runtime: clh: Support enabling iommu `9f1a42c6c` tests/vfio: Give commands 30s to execute `b46b0ecf8` tests/vfio: Configure a value for 'hot_plug_vfio' for both vmms `bfc93927f` runtime: Remove redundant check in checkPCIeConfig `7c4e73b60` runtime: Add test cases for checkPCIeConfig `fc51e4b9e` runtime: Check config for supported CLH (cold\|hot)_plug_vfio values `509771e6f` runtime: clh: Add hot_plug_vfio entry to config `5f6475a28` tests/vfio: Gather debug info and disable tdp_mmu `8fffdc81c` tests/vfio: Capture journal from vm `df815087e` tests/vfio: Change to get the test working in GHA `a92ddeea1` tests/vfio: Move dependency installation to gha-run.sh `5a551a85b` gha: vfio: Import jobs scripts from tests repo `49e2fa189` metrics: Increase jitter value for qemu `49234433a` metrics: Increase value limit for jitter in clh `813bfdec0` ci: docker: nerdtl: Use io.containerd.kata-${KATA_HYPERVISOR}.io `46bc0b1c0` ci: nerdctl: Create the containerd config `13968aa7f` ci: nerdctl: Switch to tcp port 80 ping `e0c811678` ci: docker: Switch to tcp port 80 ping `1636abbe1` runtime: issue with non-empty []Endpoint in RemoveEndpoints `0aa073967` metrics: Add iperf bandwidth value for qemu `c0ad91476` tests: fix kernel and initrd annotations `615c1cbf1` metrics: Add iperf bandwidth value for kata metrics `d53eb73ee` metrics: Ensure docker is running in init_env `ad08321b8` metrics: Add Cassandra Metrics documentation `a58ea6659` metrics: this PR skips the FIO test temprarily to fix issues `f536ef5ce` ci: docker: Also run the smoke test with runc `c83f167c5` ci: docker: Run the tests after the kata-static is created `12d833d07` ci: Add a very basic nerdctl sanity test `348b8644d` ci: Add a very basic docker sanity test `a75fd5eb8` runk: Fix rust unecessary mut error `a31c14517` kata-ctl: useless-vec warning `c8419fc3b` kata-ctl: Resolve non-minimal-cfg warning `3eaf68d95` agent-ctl: Allow clippy lint `1d8b78959` runtime-rs: Fix 
useless-vec warning `99f3d69e9` runtime-rs: Remove mut `16fbc27b0` dragonball: Allow ambiguous-glob-reexports `bbf191951` dragonball: Resolve non-minimal-cfg warning `75cfdd5d5` agent: config: Allow clippy lint `f3a0fd590` agent: config: Fix useles-vec warning `9e423bd3d` libs: Fix clippy unnecesary hashes error `444395050` versions: Bump rust version `a16b0962b` chore(cargo): update cargo lock `ca4b6b051` runtime: Naming conflict of network devices `202049f35` feat(runtime-rs): introduce huge page type to select VM RAM's backend `f811b064c` ci: use github.ref_name instead of $GITHUB_REF_NAME `6d795c089` ci: Add more target-branch related fixes `8509c3187` ci: Fix target-branch usage `060499dca` metrics: Remove warning from metrics documentation `c0f697fcc` runtime: Allow kernel_params annotation `b03e49794` dragonball: fix for non-deterministic builds `976d10150` runtime-rs: hypervisor: Remove debug kernel options `fde34610c` kernel: Add erofs patches needed for CC related work `dc6a4588a` versions: Bump kernel to the latest LTS release (6.1.52) `52f6449b7` kata-manager: Remove initcall_debug kernel option `8b4a0b368` kata-deploy: Remove curl after it's used `139c7f03a` kata-deploy: Fix aarch64 image build `470d06541` agent: optimize the code of systemd cgroup manager `bd24afcf7` gha: Manually rebase PR atop of the target branch before testing `72c510d05` runtime/virtiofsd: Drop all references to "--cache=none" `ead724bec` protocol: removing gogo.nullable feature `d8e4bb985` protocol: remove unused PROTO_FILE env `5e1106a77` protocol: remove unused import_path `87accaaec` protocol: use workdir during build `711a7ed96` protocol: remove mapping definitions `8db84c1bd` protocol: force GOPATH to be set `68156d77a` protocol: breaking lines to improve readability `670a8e9c7` kata-deploy: Switch to an alpine image `9d74b7ccc` k8s: ci: Skip "Pod quota" test with firecracker `f6cd3930c` ci: k8s: Remove useless skip statement from tests `3cc20b47a` ci: k8s: Also check for 
"fc" (for firecracker) `b5bad3cb0` ci: k8s: Add clean-up-garm argument for gha-run.sh `aaec5a09f` ci: k8s: devmapper tests should be using ubuntu 20.04 `27fa7d828` ci: k8s: Add a kata-deploy-garm target `fa62a4c01` ci: k8s: Export KUBERNETES env var `8c9380a79` ci: k8s: Install bats on GARM runners `3de23034f` ci: k8s: Wait some time after restarting k3s `adfea55b8` metrics: fix FIO test initialization `2df183fd9` ci: k8s: Append, instead of overwrite, the devmapper config `369a8af8f` ci: k8s: Decrease k3s sleep from 4 to 2 minutes `ada65b988` ci: k8s: Use vanilla kubectl with k3s `ad45ab5d3` ci: k8s: Ensure k3s is deploy with --write-kubeconfig-mode=644 `028a97e0d` ci: k8s: Use the proper command for sleep `3a427795e` metrics: Use TensorFlow optimized image `8d99972a8` ci: k8s: Fix typo in run-k8s-tests-on-garm.yaml `deed1b927` Dragonball: optimize the placement of dbs-upcall features `0e8bd50cb` ci: k8s: Add k8s devmapper tests (part 0) `b28b54df0` ci: k8s: Add a function to configure devmapper for containerd `54f711721` ci: k8s: Add a function to deploy k3s `81536f21a` runtime/qemu: Pass "--xattr" to virtiofsd instead of "-o xattr" `b1dd09a4d` runtime: Allow virtio_fs_extra_args annotation `2efda20c7` packaging: do not install docker-compose-plugin for s390x\|ppc64le `438fbf966` metrics: Add write 95 percentile for FIO for qemu `024b4d2ff` metrics: Add write 95 percentile FIO value `e98e5cdea` metrics: Add checkmetrics to gha run script `c1edfe551` metrics: Add checkmetrics value for qemu for iperf `6a79ecedf` metrics: Add jitter value for clh `f609a9a75` metrics: Add test selector to iperf metrics `5b8db3042` metrics: Enable iperf benchmark on gha for kata metrics `60f733d30` CI: switch static-checks-dragonball CI machines to Azure `7870b33a2` runtime-rs: bring hybridVsock devices in manager. 
`18c94ebbe` kata-deploy: Create kata-static.tar with correct ownership `57e7bf14a` agent: refine StorageDeviceGeneric::cleanup() `53edb1937` agent: implement StorageDeviceGeneric::cleanup() `0c63453e2` types: make StorageDevice::cleanup() return possible error code `3a3d77b3b` agent: move StorageDeviceGeneric from kata-types into agent `b151cfd14` metrics: re-enable memory-usage initialization step `f3e1a6a94` osbuilder: alpine: Change mirror `ac612aef5` osbuilder: alpine: Match the version on versions.yaml `9cd706d1c` agent: avoid possible leakage of storage device `bf21411e9` tests: add policy to k8s tests `d0e061067` runtime: config: use the SEV initrd for SNP `67fed26f1` runtime: Use TDX image with in the qemu-tdx config `ac939c458` gha: Rebase atop of the target branch `82cd14ba3` versions: Update alpine to its 3.18 version `666882575` metrics: Add grabdata script for metrics report `c290eaed8` kata-sys-util: protection: Update TDX checks `d7a996c68` gha: Update to checkout@v3 action `c2ba29c15` runtime: Fix data race in ioCopy `211de08d9` osbuilder: Remove chcon operation for guest SELinux `9f21fa9b3` metrics: Add report generator link to general documentation `c0ed5ea0a` metrics: Add README for kata metrics report `a7b59a5bf` metrics: Add limit for 90 percentile for qemu value `99db6568e` metrics: Add limit for write 90 percentile value for clh `6e06392c5` metrics: Enable FIO limits for kata metrics `2e4c87472` runtime/vc: runPrestartHooks should ignore GetHypervisorPid failure `21204caf2` runtime: fail early when starting docker container with FC `32fd01371` runtime: run prestart hooks before starting VM for FC `00e7ffd98` tests: check vmx only on Intel machines `c8dd3c073` metrics: Fix memory footprint qemu limit `8877ec62f` metrics: Fix memory inside limits for kata metrics `80146f207` tests: Fixes cpuType check on AMD machines `7e364716d` metrics: Add test setup details to metrics report `17dc1b976` metrics: Add boot lifecycle times to metrics report 
`3b0d6538f` metrics: Add memory inside container to metrics report `79fbb9d24` metrics: Add scaling system footprint in metrics report `8e6d4e6f3` metrics: Add metrics reportgen `139ffd4f7` metrics: Add report file titles `878d1a2e7` metrics: Generate PNGs alongside the PDF report `fce248797` metrics: Add metrics report R files `08812074d` metrics: Add report dockerfile `69781fc02` metrics: Add metrics report script `e286e842c` tests: Expand confidential test to support TDX `e31f099be` tests: Expand confidential test to support SNP `c3b9d4945` tests: Add confidential test for SEV `538c965c2` metrics: fix parsing issue on memory-usage test `3818bf331` local-build: Remove $HOME/.docker/buildx/activity/default `d1b54ede2` qemu: tdx: Workaround SMP issue with TDX 1.5 `1e34220c4` qemu: tdx: Adapt to the TDX 1.5 stack `8115a0522` versions: tdx: Update Kernel to 6.2 + TDX `ec18180f3` versions: tdx: Update TDVF to the "edk2-stable202302" `9803b2428` versions: tdx: Update QEMU to v7.2 + TDX v1.10 `dffc16e5b` runtime-rs: check peer close in log_forwarder `aaa5ab126` agent: simplify storage device by removing StorageDeviceObject `fb49d5d7c` gha: Avoid "fail-fast" in tests that are known to be flaky `183f51d6f` tests: use unique test name `6a974679f` tests: delete k8s deployment at the test's end `32a778b6d` metrics: Remove unused variable in tensorflow nhwc script `d8f3ce649` kata-deploy: Don't try to remove /opt/kata `936e8091a` gha: vfio: Run on Ubuntu 23.04 runner `0e7248264` agent: move storage device related code into dedicated files `268e84655` runtime-rs: Fix volumes and rootfs cleanup issues `8f49ee33b` agent: refine storage related code a bit `60ca12ccb` agent: switch to new storage subsystem `fcbda0b41` kata-types: introduce StorageDevice and StorageHandlerManager `b03b1f613` agent: simplify the way to manage storage object `8392c71bf` sys-util: support more mount flags in parse_mount_options() `c00d8f3d4` agent: use create_mount_destination() from kata-sys-util 
`5e867f053` types: add more mount related constants `880e6c9a7` agent: use function from kata-sys-utils to reduce code `3b881fbc0` local-build: Remove GID before creating group `959ca4944` metrics: Add TensorFlow ResNet50 fp32 Dockerfile `4b7d72c4a` metrics: Add TensorFlow ResNet50 FP32 benchmark `5cba38c17` kata-deploy: Avoid failing on content removal `18d42da21` runtime/fc: fix image/initrd annotation handling `9fda7059a` runtime/clh: fix image/initrd annotation handling `1a0092d63` runtime/qemu: fix image/initrd annotation handling `22d8f335d` libs,tests: fix typo disable_guest_seccomp in configuration-anno-1.toml `8afd158ce` metrics: Add disk link to README `40914b25d` kata-agent: use default filemode for block device when it is set to 0 `eee2ee6ee` metrics: Fix FIO path `39bc3488f` metrics: Use function from metrics common in pytorch script `400eb8874` gha: capture additional kata-deploy output `4aee3eade` kata-types: implement serde methods for KataVirtualVolume `b875e3932` kata-types: validate KataVirtualVolume object `fa2fdc105` kata-types: implement two conversion helpers for KataVirtualVolume `6326af20e` kata-types: introduce KataVirtualVolume `c8b43f8b3` metrics: Fix README for pytorch `fb571f8be` metrics: Enable kata runtime in K8s for FIO test. 
`cb056f8cb` rootfs: agent: Policy support with AGENT_INIT=yes `85c02828e` metrics: Update tensorflow name in gha run script `e8a511934` metrics: Fix check results for tensorflow benchmark `2d896ad12` gha: kata-deploy: Do the runtime class cleanup as part of the cleanup `4ffc2c86f` gha: kata-deploy: Add the first kata-deploy test `8616c050a` metrics: Remove unused variable in tensorflow mobilenet script `285e616b5` tests: common: Ensure test_type is used as part of the cluster's name `790bd3548` tests: commob: Don't fail if yq is not part of the cache `ce6adecd0` gha: kata-deploy: Add run-kata-deploy-tests.sh `cfc29c11a` gha: k8s: Stop running kata-deploy tests as part of the k8s suite `f4dd15286` tests: k8s: Call ensure_yq() in setup.sh `339569b69` kata-deploy: Properly create default runtime class `2a491e9b1` metrics: Fix MobileNet help me description `d19a75e80` gha: ci: Start running kata-deploy tests `d90f7ac68` runtime-rs: add unit test for block driver `e44919f0d` runtime-rs: add load_test_config for unit test `7f48a6937` runtime-rs: add driver option `bade6a5c3` docs: Fix TensorFlow word across the document `1a1b20776` docs: Add Tensorflow Resnet50 documentation `24baededc` metrics: Add Dockerfile for ResNet50 int8 `6d971ba8d` metrics: Add Tensorflow ResNet50 int8 benchmark `25d151bd1` runk: Modify kill command's error message for containerd tests `b3592ab25` gha: cri-containerd: Enable tests `84dd02e0f` gha: cri-containerd: Add timeout to the crictl calls on testContainerStop `b29782984` gha: cri-containerd: Show pod before deleting it `ae0930824` gha: cri-containerd: Print kata logs in case of error `6c8b2ffa6` gha: cri-containerd: Group containerd logs `9e898701f` gha: cri-containerd: Ensure RUNTIME takes KATA_HYPERVISOR into account `76dac8f22` agent: simplify error handling `18a7fd8e4` metrics: Rename tensorflow scripts `e55fa93db` tests: kata-deploy: Add placeholder for kata-deploy-tests-on-tdx `d9ee17aae` tests: kata-deploy: Add placeholder for 
kata-deploy-tests-on-aks `ab829d103` agent: runtime: add the Agent Policy feature `831e73ff9` tests: kata-deploy: Add functional/kata-deploy/gha-run.sh placeholder `af1b46bbf` tests: Add gha-run-k8s-common.sh `416445e7e` docs: Remove installation step in virtcontainers doc `72cbcf040` kata-deploy: Add k0s support `767434d50` metrics: fix the loop used to stop kata components #7629 `5d0f0d43c` metrics: Add cassandra statefulset yaml `c1dcc1396` metrics: Add cassandra service yaml `2297a0d1c` metrics: Add block loop pvc yaml for cassandra `e3d511946` metrics: Add block loop pv yaml for cassandra test `989027159` metrics: Add block loop pvc for cassandra test `349b89969` metrics: Add Cassandra Kubernetes benchmark for kata metrics `c52d09052` gha: static-checks: Move to the Azure instances `8815ed066` runtime: Remove config warnings `afe1a6ac5` agent: support copying of directories and symlinks `ab13ef87e` runtime: propagate configmap/secrets etc changes for remote-hyp `c074ec4df` runtime: Copy shared files recursively `fdcd52ff7` metrics: Add check containers are running in tensorflow mobilenet `36337ee14` metrics: Add check containers are up in tensorflow script `f700f9b0b` metrics: Remove unused variable in tensorflow script `833cf7a68` metrics: Add check containers are running function `918c78308` metrics: Add check containers are up in tensorflow mobilenet script `9d57a1fab` metrics: Use check containers are up in tensorflow script `1c84680d8` metrics: Add check containers are up in common script `d3e57cf45` metrics: Use collect_results function in tensorflow mobilenet test `286de046a` metrics: Remove collect results function definition `9879709aa` metrics: Add common functions to the common script `4746fa3da` docs: Specify supported Firecracker version using `versions.yaml` `cc922be5e` versions: Update firecracker version to 1.4.0 `39e67b06e` dragonball: vsock add fifo/pipe stream support for passed fd hybridStream `473b0d3a3` metrics: compute tensorflow 
statistics `03d1fa67b` ci: unencrypted-image: Fix build context `eb463b38e` ci: unencrypted-image: Don't fail to build on s390x `a2d731ad2` ci: create-confidential-image: Add dependent actions `d1a629622` metrics: Add nginx documentation to network README `498f7c054` metrics: Add nginx kubernetes yaml `f8a5255cf` metrics: Add network nginx benchmark `43fe5d1b9` ci: k8s: tees: Ensure PR_NUMBER is exported `54f6a7850` ci: {{ pr-number }} should be {{ inputs.pr-number }} `034d7aab8` tests: k8s: Ensure the runtime classes are properly created `fac8ccf5c` ci: Add build-and-publish-tee-confidential-unencrypted-image `ab5f603ff` ci: k8s: Add the image used for unencrypted confidential tests `1e8fe131b` k8s: tests: Take advantage of `SHIMS` and `DEFAULT_SHIM` env vars `729b2dd61` agent: avoid creating new `Vec` instances when easily avoidable `aeaec9dae` tests: upgrade bats version `e66496986` metrics: install kata once and run multiple checks `baabfa9f1` agent: refine implementation of mount related code `98ba211a3` agent: fix a bug in update_ephemeral_mounts() `5333618d7` agent: make add_storage() take &[Storage] instead of Vec<Storage> `37f34781d` agent: simplify function online_cpu_memory() `d3c542237` agent: refine style of code related to sandbox `71a9f6778` agent: avoid unwrap() in function do_remove_container() `84badd89d` agent: avoid clone objects when possible `b23c5ed15` deps: Bump dependent crate versions `863283716` metrics: General improvements to mobilenet tensorflow test `3c319d8d4` metrics: Add iperf to gha run script `5b5caf890` gha: Add iperf network metrics `66db5b535` metrics: Add latency test to network README `c36572418` agent: avoid unnecessary calls to `Arc::clone` `4fbe0a3a5` runtime: bind-mount mounted block device into container `7e1b1949d` runtime: add support for kata overlays `6c867d9e8` agent: add io.katacontainers.fs-opt.overlay-rw option `6163c3565` agent: skip mount options that start with "io.katacontainers." 
`b2ff97aa0` dragonball: use version 0.10.4 of `fuse-backend-rs` `845eeb4d7` agent: Allow clippy::redundant_clone in the unit tests `1163fc9de` release: Revert kata-deploy changes after 3.2.0-rc0 release `3958a39d0` runtime-rs: Introduce directly attachable network `1e15369e5` metrics: Improve naming testing containers in launch times test `5dbe88330` metrics: Clean kata components before start a metric test. `3b45060b6` metrics: Add latency server yaml `9bb8451df` metrics: Add latency client yaml `64fdb9870` metrics: Add network latency test `a81ad3b58` runtime-rs: Add block device handling in cloud hypervisor `3230dec95` kata-deploy: Use host's systemctl `1b21a4624` docs: Use control-plane term instead of master `28e5e9c86` runtime-rs: fix number of queues handling in dragonball share fs device `f1d8de9be` runk: Allow runk to launch a container without pid namespace Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-10-20 14:44:50 +02:00
Fabiano Fidêncio	f6e20ac230	Merge pull request #7195 from fidencio/topic/adapt-kata-deploy-stable-to-using-ubuntu kata-deploy-stable: Switch to using the ubuntu based payload	2023-10-20 14:42:04 +02:00
Fabiano Fidêncio	a93fdb014b	kata-deploy-stable: Adapt to what we're using in the stable branch This is basically to make sure that folks trying to use the kata-deploy script from the main branch, to deploy stable kata-deploy images, do not have a hard time. Fixes: #7194 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-10-20 12:58:42 +02:00
James O. D. Hunt	79ed501a20	Merge pull request #8258 from jodh-intel/protection-fix-tdx-typo libs: protection: Fix typo in TDX output	2023-10-20 08:36:22 +01:00
Dan Mihai	52aaf10759	agent: no endpoint blocking from agent-config.toml Remove the ability to block access to kata agent endpoints by using agent-config.toml. That functionality is now implemented using the Agent Policy feature (#7573). The CCv0 branch relied on blocking endpoints using agent-config.toml but will set-up an equivalent default policy file instead (#8219). Fixes: #8228 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2023-10-20 02:26:54 +00:00
Fabiano Fidêncio	468a3e4b53	Merge pull request #8260 from gkurz/fix-8259 ci: k8s: Fix bogus firecracker check in k8s-credentials-secrets.bat	2023-10-19 23:58:22 +02:00
GabyCT	5d6bdbd0a1	Merge pull request #8241 from GabyCT/topic/enableagenttest tests: Enable agent stability test	2023-10-19 14:12:49 -06:00
Greg Kurz	36109da93f	ci: k8s: Fix bogus firecracker check in k8s-credentials-secrets.bat Fixes #8259 Signed-off-by: Greg Kurz <groug@kaod.org>	2023-10-19 21:53:23 +02:00
GabyCT	dc295600b8	Merge pull request #8157 from GabyCT/topic/fixsevdoc docs: Fix paths to build kernel in SNP VMs documentation	2023-10-19 11:42:03 -06:00
Gabriela Cervantes	d01daf749b	tests: Adjust timeout for agent stability test This PR adjusts the timeout for the agent stability test to run on the gha. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-10-19 16:55:23 +00:00
James O. D. Hunt	9b14dda147	libs: protection: Fix typo in TDX output Add the missing closing bracket to the output of the TDX details, so rather than: ```bash $ sudo kata-ctl env 2>/dev/null \| grep available_guest_protection available_guest_protection = "tdx (major_version: 1, minor_version: 0" : ^ : Missing ')' ! ``` ... we now have: ```bash $ sudo kata-ctl env 2>/dev/null \| grep available_guest_protection available_guest_protection = "tdx (major_version: 1, minor_version: 0)" : ^ : Aha! ``` Added a unit test for this scenario. Fixes: #8257. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-10-19 16:06:08 +01:00
James O. D. Hunt	9336e2e492	Merge pull request #8155 from jodh-intel/runtime-rs-check-ch-tdx-build-feature runtime-rs: ch: Add TDX CH features check	2023-10-19 14:13:08 +01:00
James O. D. Hunt	048cc70654	Merge pull request #8213 from jodh-intel/validate-hypervisor-cfg-name runtime: Validate hypervisor section name in config file	2023-10-19 07:40:58 +01:00
Dan Mihai	99db6dff24	Merge pull request #8230 from microsoft/danmihai1/opa-data tests: query data from the OPA service	2023-10-18 15:32:23 -07:00
James O. D. Hunt	0e0867f15d	runtime-rs: ch: Add TDX CH features check If you attempt to create a container (a TD) on a TDX system using a custom build of Cloud Hypervisor (CH) that was not built with the `tdx` CH feature, Kata will report the following, somewhat cryptic, CH error: ``` ApiError(VmBoot(InvalidPayload)) ``` Newer versions of CH now report their build-time features in the ping API response message so we now use that, if available, to detect this scenario and generate a user-friendly error message instead. This change improves the readability of `handle_guest_protection()` and adds a couple of additional tests for that method. Fixes: #8152. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-10-18 18:07:39 +01:00
James O. D. Hunt	409eadddb2	runtime-rs: ch: Improve readability of guest protection checks Improve the way `handle_guest_protection()` is structured by inverting the logic and checking the value of the `confidential_guest` setting before checking the guest protection. This makes the code easier to understand. > Notes: > > - This change also unconditionally saves the available guest protection > (where previously it was only saved when `confidential_guest=true`). > This explains the minor unit test fix. > > - This change also errors if the CH driver finds an unexpected > protection (since only Intel TDX is currently tested). Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-10-18 18:06:02 +01:00
Greg Kurz	9863805752	Merge pull request #8201 from fidencio/topic/release-tag-repo-stop-tagging-the-tests-repo release: tag_repos: Stop tagging the `tests` repo	2023-10-18 18:10:39 +02:00
Gabriela Cervantes	a58afe70b8	metrics: Add iperf udp benchmark This PR adds the iperf udp benchmark for bandwidth measurement for network metrics. Fixes #8246 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-10-18 15:52:03 +00:00
Jianyong Wu	f9c9d8f645	runtime: QemuVirt: hotadd virtio-mem dev to pcie root port Hotplug virtio-mem device to pcie root port for Qemu Virt. Fixes: #7646 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2023-10-18 06:35:57 +00:00
Jianyong Wu	ef18c9550c	runtime:qemuvirt: hotadd net dev to pcie root port Hotplug network device to pcie root port as this is the only way on QemuVirt. Fixes: #7646 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2023-10-18 06:35:57 +00:00
Jianyong Wu	f1aec98f9d	qemu/virt: use pcie_root_port to do device hotplug for virt ACPI PCI device hotplug on qemu virt is not supported. The only way to hotplug a PCI device is the PCIe native way, so we need to create PCIe root ports by default. The number of PCIe root ports depends on the following: 1. one reserved for the network device by default; 2. the virtio-mem dev; 3. enough ports for the vhost user blk devs; Fixes: #7646 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2023-10-18 06:35:57 +00:00
Jianyong Wu	28a41e1d16	runtime: add a new API for Network interface Add GetEndpointsNum API for Network Interface to get the number of network endpoints. This is used to calculate the number of pcie root ports for QemuVirt. Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2023-10-18 06:35:57 +00:00
Songqian Li	09d46450f1	dragonball: add metrics support for balloon device Fixes: #7248 Signed-off-by: Songqian Li <mail@lisongqian.cn>	2023-10-18 14:02:56 +08:00
Gabriela Cervantes	82a0814fc2	tests: Enable agent stability test This PR enables the agent stability test for stability gha CI. Fixes #8240 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-10-17 15:16:06 +00:00
Dan Mihai	32be8e3a87	tests: query data from the OPA service Add example for querying json data from the OPA service. Fixes: #8231 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2023-10-17 13:31:43 +00:00
David Esparza	d90d1c5c10	Merge pull request #8243 from dborquez/fix_systemctl_masked_query metrics: fixes common.sh function to always return true	2023-10-16 20:17:24 -06:00
Dan Mihai	b81c0a6693	tests: encode policy file during test Encode policy file during test - easier to understand than hard-coding the encoded file contents. Fixes: #8214 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2023-10-16 15:58:12 -07:00
David Esparza	4f9681b411	metrics: fixes common.sh function to always return true This PR corrects the init_env() helper function so that systemctl always returns true when enumerating masked services, preventing the test from failing. Fixes: #8242 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-10-16 15:57:57 -06:00
David Esparza	59e8b1d5a7	Merge pull request #8206 from dborquez/memory_footprint_test_removing_trailing_commas_to_make_json_results_file_valid Memory footprint test removing trailing commas to make json results file valid	2023-10-16 14:31:28 -06:00
Gabriela Cervantes	2ef2b2a6dc	docs: Fix paths to build kernel in SNP VMs documentation This PR fixes the paths used to set up, build and install the kernel for SNP. Fixes #8156 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-10-16 20:09:02 +00:00
Fabiano Fidêncio	db37692f36	Merge pull request #8226 from microsoft/danmihai1/policy-typo policy: allow access to ReseedRandomDev	2023-10-16 19:17:31 +02:00
Peng Tao	45e82b6581	Merge pull request #8192 from bergwolf/github/deps runtime/kata-ctl: update dependencies	2023-10-16 16:39:17 +08:00
Chao Wu	44e602d69a	Merge pull request #8014 from openanolis/chao/fix_nydus_break runtime-rs : fix Nydus support for runtime-rs + Dragonball	2023-10-16 01:30:22 -05:00
Chao Wu	408b59c02c	runtime-rs: fix bugs to support Nydus v5 1. enable virtio-fs-pro in Dragonball to have the ability to process nydus backend registry 2. change passthrough for rw layer's readonly config to false to have the accurate read write ability. Fixes:#8013 Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2023-10-16 10:22:21 +08:00
Chao Wu	157caea9fe	Revert "nydus: Temporarily skip tests on dragonball" This reverts commit `aba36ab188`. Fixes: #8013 Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2023-10-16 10:22:21 +08:00
Chao Wu	678fe3cd31	Dragonball: fix Nydus config serde problem Since Nydus snapshotter has been updated in previous commits, there is a problem that the config passthrough to Dragonball during mount_rafs is RafsConfig instead of ConfigV2, but Dragonball could only serde ConfigV2 so it will panic. We need to add the support for RafsConfig Fixes:#8013 Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2023-10-16 10:22:21 +08:00
Dan Mihai	b6ec621389	policy: allow access to ReseedRandomDev Allow access to the ReseedRandomDev endpoint by default. Using false for ReseedRandomDevRequest was unintended. Fixes: #8225 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2023-10-13 21:18:27 +00:00
David Esparza	908519db9d	metrics: skips docker restart when it is not installed or is masked. To avoid errors when initializing the test environment, the kill_processes_before_start() helper function needs to verify that docker is installed before attempting to stop it. Fixes: #8218 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-10-13 18:02:00 +00:00
David Esparza	c2763120aa	metrics: removing trailing comma characters from json file. This PR removes trailing commas so that the json results file is valid. This PR also changes the way data results are collected by iterating through the array of memory values to calculate their average. Fixes: #8204 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-10-13 18:00:57 +00:00
Beraldo Leal	5ef691528d	tests: fixes permission denied when running test After running cri-containerd/integration-tests twice we receive permission denied during containerd clean. Fixes: #8216 Signed-off-by: Beraldo Leal <bleal@redhat.com>	2023-10-12 19:23:40 +00:00
GabyCT	1974d13122	Merge pull request #8188 from dborquez/metrics_add_fio_readme.md metrics: removal of reference in the documentation to the fio dax subtest.	2023-10-12 10:53:55 -06:00
James O. D. Hunt	3e8cf6959c	runtime: Validate hypervisor section name in config file Previously, if you accidentally modified the name of the hypervisor section in the config file, the default golang runtime gives a cryptic error message ("`VM memory cannot be zero`"). This can be demonstrated using the `kata-runtime` utility program which uses the same golang config package as the actual runtime (`containerd-shim-kata-v2`): ```bash $ kata-runtime env >/dev/null; echo $? 0 $ sudo sed -i 's!^\[hypervisor\.qemu\]!\[hypervisor\.foo\]!g' /etc/kata-containers/configuration.toml $ kata-runtime env >/dev/null; echo $? VM memory cannot be zero 1 ``` The hypervisor name is now validated so that the behaviour becomes: ```bash $ kata-runtime env >/dev/null; echo $? 0 $ sudo sed -i 's!^\[hypervisor\.qemu\]!\[hypervisor\.foo\]!g' /etc/kata-containers/configuration.toml $ ./kata-runtime env >/dev/null; echo $? /etc/kata-containers/configuration.toml: configuration file contains invalid hypervisor section: "foo" 1 ``` Fixes: #8212. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-10-12 13:53:37 +01:00
James O. D. Hunt	45d28998d9	Merge pull request #8149 from jodh-intel/runtime-rs-ch-detect-tdx-version runtime-rs: ch: Detect Intel TDX version	2023-10-12 10:09:42 +01:00
QuanweiZhou	f904e64155	Merge pull request #8179 from Apokleos/directvol-urlEncode runitme-rs: use the same base64 as kata-runtime/direct-volume does	2023-10-12 09:04:11 +08:00
GabyCT	bc6eadf4f6	Merge pull request #8197 from GabyCT/topic/enablescability tests: Enable scability test for stability CI	2023-10-11 16:41:46 -06:00
Archana Shinde	f814b1a0a2	Merge pull request #8073 from amshinde/runtime-rs-vfio-clh runtime-rs: Add support for adding vfio device for cloud-hypervisor	2023-10-11 15:01:55 -07:00
Gabriela Cervantes	ef6388e815	tests: Remove unused function from scability test This PR removes an unused function from scability test. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-10-11 19:44:21 +00:00
Fabiano Fidêncio	fbc8f8f466	scripts: Use install_yq from the `kata-containers` repo As the file is already part of the kata-containers repo, and the tests repo is about to become read-only, we're good to drop the tests references from here and use everything coming from the `kata-containers` repo instead. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-10-11 12:52:55 +02:00
Fabiano Fidêncio	65b1a2d277	release: tag_repos: Stop tagging / updating the `tests` repo As we've moved all the tests to the `kata-containers` repo, the `tests` repo will become a read-only repo. Fixes: #8200 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-10-11 11:45:27 +02:00
James O. D. Hunt	87b760f569	runtime-rs: ch: Detect Intel TDX version Improve the `GuestProtection` handling to detect the version of Intel TDX available. The TDX version is now logged by the Cloud Hypervisor driver. Fixes: #8147. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-10-11 09:38:00 +01:00
alex.lyn	73e81f5e39	runitme-rs: unify base64 encoding for direct-volume Direct-volume needs to use the same base64 character set as kata-runtime/direct-volume does. Fixes: #8175 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-10-11 14:00:13 +08:00
Gabriela Cervantes	c6463cb5ae	tests: Fix path for versions yaml for soak parallel test This PR fixes the path for versions yaml for soak parallel test. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-10-10 22:29:20 +00:00
David Esparza	89c9454fca	metrics: removal of reference in the documentation to the dax test. This PR removes the reference in the documentation to the DAX subtest of the FIO benchmark, because this metric is currently WIP. Fixes: #8159 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-10-10 15:55:59 -06:00
Gabriela Cervantes	30ff58904e	tests: Enable scability test for stability CI This PR enables the scability test for stability CI gha. Fixes #8196 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-10-10 19:59:57 +00:00
GabyCT	538131ab44	Merge pull request #8154 from GabyCT/topic/addstability tests: Enable soak parallel stability test	2023-10-10 13:53:14 -06:00
Archana Shinde	8d6f7b9096	runtime-rs: Add support for handling vfio device for cloud-hypervisor This change adds support for adding and removing vfio devices for cloud-hypervisor. Fixes: #6691 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-10-10 12:25:44 -07:00
Gabriela Cervantes	e786b2b019	gha: Add install dependencies for stability tests This PR adds the install dependencies for stability tests. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-10-10 16:05:48 +00:00
Chao Wu	936553ae79	Merge pull request #7505 from lisongqian/feat/dragonball_metrics dragonball: vcpu metrics change to be recorded per vcpu	2023-10-10 10:52:40 -05:00
Wainer Moschetta	d311c3dd04	Merge pull request #7621 from wainersm/gha-run-local ci: k8s: adapt gha-run.sh to run locally	2023-10-10 11:19:19 -03:00
David Esparza	93fef543e0	Merge pull request #8127 from dborquez/fix_iperf_check_kata_processes_issue metrics: removes kata components and k8s deployment when test finishes	2023-10-10 07:05:24 -06:00
lisongqian	dbfe6512fc	dragonball: vcpu metrics change to be recorded per vcpu In this commit, the vcpu metrics in Dragonball will be changed to record per-vcpu. Fixes: #7248 Signed-off-by: lisongqian <mail@lisongqian.cn>	2023-10-10 16:22:40 +08:00
lisongqian	fa60fbe023	dragonball: METRICS is refactored to RwLock<DragonballMetrics> In this commit, the METRICS is refactored to RwLock<DragonballMetrics>. Fixes: #7248 Signed-off-by: lisongqian <mail@lisongqian.cn>	2023-10-10 16:22:40 +08:00
Peng Tao	500d1c5cee	kata-ctl: update rustls-webpki/webpki dependency The old ones have security issues. ref: https://github.com/briansmith/webpki/issues/69 Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2023-10-10 03:56:45 +00:00
Peng Tao	d7660d82a0	runtime: unify gopkg.in/yaml.v3 to v3.0.1 The older versions have Denial of Service issues. Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2023-10-10 03:56:45 +00:00
Peng Tao	fc9a107e8e	runtime: unify swag and testify dependency So that we don't need to depend on that many versions of them. Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2023-10-10 03:56:45 +00:00
Peng Tao	79ebb959c5	runtime: update runc dependency to v1.1.9 To pick up security fixes. Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2023-10-10 03:56:45 +00:00
Peng Tao	7f3e8bd65e	runtime: unify golang.org/x/text to v0.7.0 The older versions contain security issues. Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2023-10-10 03:56:45 +00:00
Peng Tao	df325ae371	runtime: update golang.org/x/net to v0.7.0 To pick up fix for the following issue: A maliciously crafted HTTP/2 stream could cause excessive CPU consumption in the HPACK decoder, sufficient to cause a denial of service from a small number of small requests. Fixes: #8190 Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2023-10-10 03:56:39 +00:00
David Esparza	bba34910df	metrics: stops kata components and k8s deployment when test finishes This PR adds a trap so that whenever the script exits, it deletes the iperf k8s deployment and k8s services, and deletes the kata components. This way, when the script finishes, it verifies that there are indeed no kata components still running. Fixes: #8126 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-10-09 13:41:43 -06:00
Gabriela Cervantes	84e3d884e4	gha: Add general dependencies to stability tests This PR adds the general dependencies to stability tests. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-10-09 17:02:49 +00:00
Gabriela Cervantes	dec3951ca5	tests: Add soak parallel stability test This PR adds the soak parallel stability test. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-10-09 17:02:49 +00:00
Gabriela Cervantes	0f04d527d9	tests: Enable soak parallel test This PR enables the soak parallel test for stability test. Fixes #8153 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-10-09 17:02:49 +00:00
Wainer dos Santos Moschetta	e669282c25	ci: k8s: set KUBERNETES default value The KUBERNETES variable is mostly used by kata-deploy to decide whether to apply k3s specific deployments or not. It is used to select the type of kubernetes to be installed (k3s, k0s, rancher...etc) and it is always set on CI. When running the script locally we want to set a value by default to avoid `KUBERNETES: unbound variable` errors. Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2023-10-09 11:08:48 -03:00
Wainer dos Santos Moschetta	c30c3ff185	tests: run k8s-volume on a given node This test can give false positives on a multi-node cluster. Changed it to use the new get_one_kata_node() and the modified exec_host() to run the setup commands on a given node (that has kata installed) and ensure the test pod is scheduled on that same node. Fixes #7619 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2023-10-09 11:08:48 -03:00
Wainer dos Santos Moschetta	666993da8d	tests: run k8s-file-volume on a given node This test can give false positives on a multi-node cluster. Changed it to use the new get_one_kata_node() and the modified exec_host() to run the setup commands on a given node (that has kata installed) and ensure the test pod is scheduled on that same node. Fixes #7619 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2023-10-09 11:08:48 -03:00
Wainer dos Santos Moschetta	3a00fc9101	tests: exec_host() now gets the node name The exec_host() simply fails on clusters with multiple nodes because `kubectl get node -o name` will return a list of names. Moreover, it will return control node names, which usually don't have kata installed. Fixes #7619 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2023-10-09 11:05:40 -03:00
Wainer dos Santos Moschetta	61c9c17bff	tests: add get_one_kata_node() to tests_common.sh The introduced get_one_kata_node() returns the first node that has the kata-runtime=true label, i.e., supposedly a node with kata installed. This is useful for tests that should run on a determined worker node on a multi-nodes cluster. Fixes #7619 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2023-10-09 11:05:40 -03:00
Wainer dos Santos Moschetta	68f083c4d0	ci: k8s: set KATA_HYPERVISOR default value Let KATA_HYPERVISOR be qemu by default in gha-run.sh as this variable is required to tweak some configurations of kata-deploy. Fixes #7620 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2023-10-09 11:05:40 -03:00
Wainer dos Santos Moschetta	6677a61fe4	ci: k8s: configurable deploy kata timeout The deploy-kata() of gha-run.sh will wait for 10 minutes for the kata deploy installation to finish. This allows users of the script to overwrite that value by exporting the KATA_DEPLOY_WAIT_TIMEOUT environment variable. Fixes #7620 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2023-10-09 11:05:40 -03:00
Wainer dos Santos Moschetta	200e542921	ci: k8s: shellcheck fixes to gha-run.sh Fixed a couple of warnings shellcheck emitted and disabled others: * SC2154 (var is referenced but not assigned) * SC2086 (Double quote to prevent globbing and word splitting) Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2023-10-09 11:05:40 -03:00
Wainer dos Santos Moschetta	4af78be13a	kata-deploy: re-format kata-[deploy\|cleanup].yaml The .tests/integration/kubernetes/gha-run.sh script runs `yq write` a couple of times to edit the kata-[deploy\|cleanup].yaml, resulting in the file being formatted again. This is annoying because it leaves the git tree dirty. Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2023-10-09 11:05:40 -03:00
Wainer dos Santos Moschetta	d54e6d9cda	ci: k8s: run_tests() for kcli The only difference to the other platforms is that it needs to export KUBECONFIG. Fixes #7620 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2023-10-09 11:05:40 -03:00
Wainer dos Santos Moschetta	c2ef1f0fb0	ci: k8s: add deploy-kata-kcli() to gh-run.sh The deploy-kata-kcli() behaves like other deploy kata for bare-metal (e.g. sev, tdx...etc) except that KUBECONFIG should be exported. Fixes #7620 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2023-10-09 11:05:40 -03:00
Wainer dos Santos Moschetta	d2be8eef1a	ci: k8s: add cleanup-kcli() to gha-run.sh The cleanup-kcli() behaves like other clean up for bare-metal (e.g. sev, tdx...etc) except that KUBECONFIG should be exported. Fixes #7620 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2023-10-09 11:05:40 -03:00
Wainer dos Santos Moschetta	cbb9aa15b6	ci: k8s: set default image for deploy_kata() On CI workflows the variables DOCKER_REGISTRY, DOCKER_REPO and DOCKER_TAG are exported to match the built image. However, when running the script outside of CI context, a developer might just use the latest image which in this case will be `quay.io/kata-containers/kata-deploy-ci:kata-containers-latest`. Fixes #7620 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2023-10-09 11:05:40 -03:00
Wainer dos Santos Moschetta	89bef7d036	ci: k8s: create k8s clusters with kcli Adapted the gha-run.sh script to create a Kubernetes cluster locally using the kcli tool. Use `./gha-run.sh create-cluster-kcli` to create it, and `./gha-run.sh delete-cluster-kcli` to delete. Fixes #7620 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2023-10-09 11:05:40 -03:00
Fabiano Fidêncio	1280f85343	Merge pull request #8171 from bergwolf/github/fix-up-gha GHA: fix up referenced yaml exceeding 20 limit problem	2023-10-09 09:37:03 +02:00
Peng Tao	954d40cce5	gha: combine coco jobs into a single yaml So that we don't risk exceeding the GHA limit of 20 referenced yaml files that easily. Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2023-10-08 14:22:01 +00:00
Peng Tao	b60e0a9b57	gha: combine basic amd64 jobs into a single yaml GHA has an undocumented limitation that there can be at most 20 referenced yamls in a single yaml file. We work around it by combining multiple jobs into a single yaml file. Fixes: #8161 Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2023-10-08 13:55:01 +00:00
Fabiano Fidêncio	108db0a721	Merge pull request #8162 from sprt/sprt/unbreak-ci gha: ci: Revert tracing test PR to unbreak CI	2023-10-08 10:13:46 +02:00
Aurélien Bombo	e9bd852113	gha: ci: Revert tracing test PR to unbreak CI Revert "Merge pull request #8115 from fidencio/topic/ci-add-tracing-tests" This unbreaks CI as seen in https://github.com/kata-containers/kata-containers/actions/runs/6434757133 Fixes: #8161 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-10-06 14:13:17 -07:00
James O. D. Hunt	16fe81f27c	Merge pull request #8124 from jodh-intel/ch-enable-feature runtime-rs: ch: Enable feature	2023-10-06 13:02:08 +01:00
Fabiano Fidêncio	fa6786d1d7	Merge pull request #8117 from fidencio/topic/ci-add-runk-tests gha: ci: Port runk tests over	2023-10-06 11:19:55 +02:00
Fabiano Fidêncio	8fec654716	Merge pull request #8115 from fidencio/topic/ci-add-tracing-tests ci: gha: Port tracing tests over	2023-10-06 10:06:57 +02:00
GabyCT	265f53e594	Merge pull request #8082 from dborquez/enable_fio_on_ctr Enable fio test using containerd client	2023-10-05 17:26:22 -06:00
GabyCT	c8b9ec1cb5	Merge pull request #8108 from GabyCT/topic/ghastability gha: Add stability tests workflow for gha	2023-10-05 17:10:10 -06:00
James O. D. Hunt	b8a46a4b85	runtime-rs: ch: Enable feature Enable the Cloud Hypervisor driver (the `cloud-hypervisor` build feature) for the rust runtime. Fixes: #6264. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-10-05 17:58:39 +01:00
Gabriela Cervantes	0f2dc8c675	gha: Add containerd stability tests to ci yaml This PR adds containerd stability tests to ci yaml. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-10-05 15:21:24 +00:00
Fabiano Fidêncio	89f73e658d	Merge pull request #8110 from fidencio/topic/gha-be-more-specific-about-the-arm-runners gha: arm64: Ensure the builder is arm64-builder	2023-10-04 21:20:08 +02:00
Fabiano Fidêncio	da91c9df88	ci: Port runk tests to this repo I'm basically moving the runk tests from the tests repo to this one, and I'm adding the "Signed-off-by:" of every single contributor to the tests. Fixes: #8116 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Signed-off-by: Chen Yiyang <cyyzero@qq.com> Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-10-04 20:41:29 +02:00
Fabiano Fidêncio	7f23772763	ci: Add placeholder for runk tests The runk test has been executed as part of the former "ubuntu" jenkins CI. We're porting it to GHA and running it against LTS containerd. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-10-04 20:40:32 +02:00
Fabiano Fidêncio	9205acc3d2	ci: Move tracing tests here I'm basically moving the tracing tests from the tests repo to this one, and I'm adding the "Signed-off-by:" of every single contributor to the tests. Fixes: #8114 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Signed-off-by: Alexandru Matei <alexandru.matei@uipath.com> Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com> Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com> Signed-off-by: Salvador Fuentes <salvador.fuentes@intel.com> Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com> Signed-off-by: yaoyinnan <yaoyinnan@foxmail.com>	2023-10-04 20:02:27 +02:00
Gabriela Cervantes	85d290a048	gha: Add stability gha run script This PR adds the stability gha run script. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-10-04 17:45:45 +00:00
Gabriela Cervantes	54f0c8f88e	gha: Add stability tests workflow for gha This PR adds the stability test workflow for gha for the kata CI. Fixes #8107 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-10-04 16:32:13 +00:00
Fabiano Fidêncio	3bb2923e5d	ci: Add placeholder for tracing tests The tracing tests are currently running as part of the Jenkins CI with the following setups: * Container Engines: containerd * VMMs: QEMU \| Cloud Hypervisor * Snapshotters: overlayfs \| devmapper We'll be restricting those tests to be running on the LTS version of containerd, without devmapper. As it's known due to our GHA limitation, this is just a placeholder and the tests will actually be added in the next iterations. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-10-04 18:02:02 +02:00
Fabiano Fidêncio	2c3bf406dc	ci: Create a function to install docker This will be re-used in other tests as well. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-10-04 15:01:51 +02:00
Fabiano Fidêncio	c2cce12de5	Merge pull request #8100 from fidencio/topic/kata-deploy-build-agent kata-deploy: Build kata-agent as we build all the other components	2023-10-04 11:56:03 +02:00
Steve Horsman	c430cc3707	Merge pull request #8098 from stevenhorsman/k8s-registry-suite versions: migrate out of k8s.gcr.io	2023-10-04 10:51:39 +01:00
Fabiano Fidêncio	119f03de26	gha: arm64: Ensure the builder is arm64-builder Otherwise we'll use any arm64 machine that's added as a runner, and whenever new machines are added those may end up being only used for running some specific set of the tests. Fixes: #8109 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-10-04 11:08:11 +02:00
Fabiano Fidêncio	59b9380d1c	Merge pull request #8093 from stevenhorsman/crictl-pod-config-update doc: Update crictl pod-config	2023-10-04 10:49:04 +02:00
David Esparza	8c498ef5ee	metrics: Use jq tool to pretty-print json metrics output This PR enables the use of jq pretty-print feature to improve the formatting of metric results json files. Fixes: #8081 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-10-03 23:33:19 -06:00
David Esparza	a2159a6361	metrics: Enables FIO test for kata containers FIO benchmark is enabled to measure IO in Kata at different latencies using containerd client, in order to complement the CI metrics testing set. This PR also deprecates the previous Fio bench based on k8s. Fixes: #8080 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-10-03 23:32:38 -06:00
Fabiano Fidêncio	f337315952	Merge pull request #8106 from fidencio/topic/gha-fix-k0s-related-cis gha: Fix k0s deployment	2023-10-03 21:47:40 +02:00
GabyCT	d1d9af5de2	Merge pull request #8085 from GabyCT/topic/stabilitytests tests: Add stability test for kata CI	2023-10-03 11:28:49 -06:00
Fabiano Fidêncio	70e7ec3e23	gha: Fix k0s deployment The tests are failing when setting up k0s, and that happens because we download a kubectl binary matching the kubernetes version k0s is using, and we do that by: ``` sudo k0s kubectl version --short 2>/dev/null \| ... ``` With kubectl 1.28, which is now the default on k0s, `kubectl version --short` has been removed, leading us to an empty string, which then causes the error in the CI. Fixes: #8105 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-10-03 17:21:40 +02:00
Fabiano Fidêncio	560bbffb57	packaging: tools: Remove `set -x` leftover This was used for debugging, and ended up being merged with that. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-10-03 15:33:55 +02:00
Fabiano Fidêncio	18fa483d90	packaging: release: Mention newly added images We've added two new containerd builder images recently, one for the components under `src/tools` and another one for the Kata Containers agent. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-10-03 15:33:55 +02:00
Fabiano Fidêncio	ca3b888371	packaging: tools: Fix container image env var name This should be TOOLS_CONTAINER_BUILDER instead of VIRTIOFSD_CONTAINER_BUILDER. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-10-03 15:33:55 +02:00
Fabiano Fidêncio	5ca66795c7	packaging: Allow passing the TOOLS_CONTAINER_BUILDER This follows what we've been doing for all the components we're building, but was missed as part of #8077. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-10-03 15:33:55 +02:00
Fabiano Fidêncio	02acef9575	gha: Build the kata-agent as part of our workflows The kata-agent binary won't be released, just built so it can be used, later on, as part of our tests and as part of the rootfs build. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-10-03 15:33:55 +02:00
Fabiano Fidêncio	5208386ab1	packaging: Build the kata-agent Let's add the needed functions to start building the kata-agent, with or without the OPA support. For now this build is not used as part of the rootfs build, but later on it will be (not as part of this series, though). Fixes: #8099 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-10-03 15:33:55 +02:00
Fabiano Fidêncio	1727487eef	agent: Allow specifying DESTDIR and AGENT_POLICY via env vars This will help to build the agent binary as part of the kata-deploy localbuild, as we need to pass the DESTDIR to where the agent will be installed, and also whether we're building the agent with policy support enabled or not. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-10-03 14:18:45 +02:00
Fabiano Fidêncio	45c1188839	packaging: Add get_agent_image_name() This will be used for building the kata-agent. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-10-03 14:17:38 +02:00
Wainer dos Santos Moschetta	0db8fb8f98	versions: migrate out of k8s.gcr.io The k8s.gcr.io registry has been deprecated for a while now and has been redirected to registry.k8s.io. However, on some bare-metal machines in our testing pools that redirection is not working, so let's just replace the registries. Fixes #8098 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com> (cherry picked from commit b2c3bca558c38deff2117d5909d9071c23c05590)	2023-10-03 11:52:59 +01:00
stevenhorsman	a1a0543671	doc: Fix spelling Spell check failed with: ``` [kata-spell-check.sh:275] WARNING: Word 'overcommitment': did you mean one of the following?: over commitment, over-commitment, commitment ``` So update this to pass the static checks Fixes: # Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2023-10-03 10:17:38 +01:00
Gabriela Cervantes	6339605a14	tests: Add general stability fixes This PR adds general stability fixes. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-10-02 19:42:46 +00:00
stevenhorsman	59ae244442	doc: Update crictl pod-config - Ensure that our documented crictl pod config file contents have uid and namespace fields for compatibility with crictl 1.24+ This avoids a user potentially hitting the error: ``` getting sandbox status of pod "d3af2db414ce8": metadata.Name, metadata.Namespace or metadata.Uid is not in metadata "&PodSandboxMetadata{Name:nydus-sandbox,Uid:,Namespace:default,Attempt:1,}" getting sandbox status of pod "-A": rpc error: code = NotFound desc = an error occurred when try to find sandbox: not found ``` Fixes: #8092 Signed-off-by: stevenhorsman <steven@uk.ibm.com> (cherry picked from commit `8f8c2215`)	2023-10-02 14:53:46 +01:00
Gabriela Cervantes	fd19f4082f	tests: Add agent stability test This PR adds the agent stability test to stability test. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-28 22:37:02 +00:00
Gabriela Cervantes	215577032f	tests: Add cassandra stress in stability tests This PR adds the cassandra stress at the stability tests. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-28 22:34:45 +00:00
GabyCT	a890ad3a16	Merge pull request #8066 from GabyCT/topic/urlvra docs: Update url in kata vra document	2023-09-28 14:59:34 -06:00
Zvonko Kaiser	79e33c211c	Merge pull request #7325 from zvonkok/vfio-sandbox-id-debug gpu: Adding CDI support for cold and hot-plug of VFIO devices	2023-09-28 21:31:12 +02:00
Gabriela Cervantes	f2d3ea988d	tests: Add stressng dockerfile for stability tests This PR adds the stressng dockerfile for stability tests. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-28 16:35:22 +00:00
Gabriela Cervantes	6493aa309e	tests: Add stressor CPU test for stability tests This PR adds the stressor CPU test for stability tests. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-28 16:33:08 +00:00
Gabriela Cervantes	ef68a3a36b	metrics: Add stability test for kata CI This PR adds the stability test for kata containers repository. Fixes #8084 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-28 16:23:36 +00:00
David Esparza	f7ef45b167	Merge pull request #8077 from fidencio/topic/kata-deploy-ship-the-tools kata-deploy: build & ship the rust components from src/tools/	2023-09-28 09:59:19 -06:00
Zvonko Kaiser	7c934dc7da	gpu: Fix cold-plug of VFIO devices We need to do proper sandbox sizing when we're doing cold-plug. Introduce CDI, the de-facto standard for enabling devices in containers. containerd will pass through annotations for accumulated CPU, memory and now CDI devices. With that information sandbox sizing can be derived correctly. Fixes: #7331 Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-09-28 09:49:13 +00:00
GabyCT	fcc755fc3b	Merge pull request #8068 from GabyCT/topic/limitlatency metrics: Add latency value limits for kata CI	2023-09-27 13:28:41 -06:00
Greg Kurz	defbb64ac8	Merge pull request #8036 from rye-stripe/bugfix/overhead-metrics runtime: fix reading cgroup stats of sandboxes	2023-09-27 19:39:55 +02:00
Archana Shinde	95455e6fe8	Merge pull request #8058 from likebreath/0925/clh_v35.0 Upgrade to Cloud Hypervisor v35.0	2023-09-27 10:39:32 -07:00
Gabriela Cervantes	8d66ef5185	metrics: Increase qemu jitter value This PR increases qemu jitter value. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-27 17:31:07 +00:00
Gabriela Cervantes	5600e28b54	metrics: Increase jitter value for clh This PR increases jitter value for clh. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-27 17:30:19 +00:00
Fabiano Fidêncio	a6b1f5e21b	ci: Build src/tools components as part of our tests / releases Build those as part of our CI and release workflows. Fixes #5520 #5348 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-27 18:50:25 +02:00
Fabiano Fidêncio	501a168a81	kata-deploy: Build components from src/tools Let's add targets and actually enable users and ourselves to build those components in the same way we build the rest of the project. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-27 18:49:02 +02:00
Fabiano Fidêncio	6ef42db5ec	static-build: Add scripts to build content from src/tools As we'd like to ship the content from src/tools, we need to build them in the very same way we build the other components, and the first step is providing scripts that can build those inside a container. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-27 18:48:56 +02:00
Fabiano Fidêncio	4d08ec29bc	packaging: Add get_tools_image_name() This will be used for building all the (rust) components from src/tools. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-27 18:48:35 +02:00
Fabiano Fidêncio	98097c96de	packaging: Use git abbreviated hash This will make it easier to build images that rely on several directories hashes. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-27 18:48:30 +02:00
Fabiano Fidêncio	8b25e90027	Merge pull request #8075 from fidencio/topic/ci-add-kata-monitor-tests ci: Port kata-monitor tests from Jenkins to GHA	2023-09-27 15:48:46 +02:00
Fabiano Fidêncio	489caf1ad0	ci: kata-monitor: Move tests over Let's move, adapt, and use the kata-monitor tests from the tests repo. In this PR I'm keeping the SoB of every single contributor who touched those tests in the past. Fixes: #8074 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Signed-off-by: yaoyinnan <yaoyinnan@foxmail.com> Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com> Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-09-27 11:40:31 +02:00
Fabiano Fidêncio	a3fb067f1b	ci: Add placeholder for kata-monitor tests The kata-monitor tests are currently running as part of the Jenkins CI with the following setups: * Container Engines: CRI-O \| containerd * VMMs: QEMU When using containerd, we're testing it with: * Snapshotter: overlayfs \| devmapper We will stop running those tests on devmapper / overlayfs as that hardly would get us a functionality issue. Also, we're restricting this to run with the LTS version of containerd, when containerd is used. As it's known due to our GHA limitation, this is just a placeholder and the tests will actually be added in the next iterations. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-27 11:31:17 +02:00
Fabiano Fidêncio	57cb4ce204	ci: Make install_kata aware of container engines This will help us when running tests using CRI-O. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-27 11:31:17 +02:00
Fabiano Fidêncio	de1eeee334	ci: Create a generic install_crio function This will serve us quite well in the upcoming tests addition, which will also have to be executed using CRI-O. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-27 11:26:13 +02:00
Fabiano Fidêncio	64a2000859	ci: Add install_cni_plugins helper This will become handy when doing tests with CRI-O, as CRI-O doesn't install the CNI plugins for us. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-27 11:26:13 +02:00
Fabiano Fidêncio	8132fe15c9	ci: Modify containerd default config Let's ensure we have runc running with `SystemdCgroups = false`, otherwise we'll face failures when running tests depending on runc on Ubuntu 22.04, with LTS containerd. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-27 11:16:12 +02:00
Chelsea Mafrica	a49bc68374	runtime-rs: Update status for pause and resume Pause and resume task do not currently update the status of the container to paused or running, so fix this. This is specifically for pausing the task and not the VM. Fixes #6434 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2023-09-26 17:22:47 -07:00
Gabriela Cervantes	8cb7df1bed	metrics: Add checkmetrics for latency test This PR adds the checkmetrics for latency test. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-26 19:11:08 +00:00
Gabriela Cervantes	e90440ae24	metrics: Add qemu latency value limit This PR adds the qemu latency value limit for kata metrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-26 17:30:09 +00:00
Gabriela Cervantes	a74a8f8a9d	metrics: Add latency value limits for kata CI This PR adds latency value limits for kata CI. Fixes #8067 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-26 17:29:07 +00:00
Gabriela Cervantes	d7def8317a	metrics: Fix general check static warnings This PR fixes general check static warnings. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-26 16:30:59 +00:00
GabyCT	309103169d	Merge pull request #8056 from GabyCT/topic/fixlatencypath metrics: Fix latency yamls path	2023-09-26 10:16:55 -06:00
Gabriela Cervantes	928553d1ba	docs: Update url in kata vra document This PR updates the url in kata vra document. Fixes #8065 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-26 16:13:12 +00:00
GabyCT	5c0afaacf4	Merge pull request #8018 from GabyCT/topic/fixreadme metrics: Fix metrics README	2023-09-26 09:51:47 -06:00
David Esparza	83326f89b3	Merge pull request #8054 from GabyCT/topic/fixcrdoc metrics: Fix C-Ray documentation	2023-09-26 09:50:19 -06:00
James O. D. Hunt	31478b9c33	Merge pull request #7944 from jodh-intel/runtime-rs-ch-enable-tdx runtime-rs: ch: Enable Intel TDX	2023-09-26 14:11:12 +01:00
James O. D. Hunt	b0a3293d53	runtime-rs: ch: Enable Intel TDX Allow Cloud Hypervisor to create a confidential guest (a TD or "Trust Domain") rather than a VM (Virtual Machine) on Intel systems that provide TDX functionality. > Notes: > > - At least currently, when built with the `tdx` feature, Cloud Hypervisor > cannot create a standard VM on a TDX capable system: it can only create > a TD. This implies that on TDX capable systems, the Kata Configuration > option `confidential_guest=` must be set to `true`. If it is not, Kata > will detect this and display the following error: > > ``` > TDX guest protection available and must be used with Cloud Hypervisor (set 'confidential_guest=true') > ``` > > - This change expands the scope of the protection code, changing > Intel TDX specific booleans to more generic "available guest protection" > code that could be "none" or "TDX", or some other form of guest > protection. Fixes: #6448. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-09-26 10:55:25 +01:00
James O. D. Hunt	523399c329	runtime-rs: ch: Add more consts Introduce a few new constants (for PCI segment count and FS queues) and move the disk queue constants to `convert.rs` to allow them to be used there too. > Note: > > This change gives the `ShareFs` code its own set of values rather > than relying on the disk queue constants. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-09-26 08:41:32 +01:00
James O. D. Hunt	dea8065811	runtime-rs: ch: Remove unused function Delete the `handle_pending_devices_after_boot()` function which is no longer required. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-09-26 08:41:32 +01:00
James O. D. Hunt	995f2c015f	runtime-rs: ch: Only handle particular pending device types Modify the Cloud Hypervisor `add_device()` method to add `ShareFs` and `Network` devices to the list of pending devices since only these two device types need to be cached before VM startup. Full details in the comments. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-09-26 08:41:32 +01:00
James O. D. Hunt	b1b96a5c49	runtime-rs: ch: Remove erroneous "virtio-blk-mmio" check Remove the `VIRTIO_BLK_MMIO` check which appears to have been added erroneously in the first place. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-09-26 08:41:32 +01:00
Gabriela Cervantes	9ac29b8d38	metrics: Add init_env function to latency test This PR adds the init_env function to latency test. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-25 22:06:00 +00:00
Bo Chen	dfd0c9fa9a	runtime: clh: Re-generate the client code This patch re-generates the client code for Cloud Hypervisor v35.0. Note: The client code of cloud-hypervisor's OpenAPI is automatically generated by openapi-generator. Fixes: #8057 Signed-off-by: Bo Chen <chen.bo@intel.com>	2023-09-25 12:22:37 -07:00
Bo Chen	8f9f087e35	versions: Upgrade to Cloud Hypervisor v35.0 Details of this release can be found in our roadmap project as iteration v35.0: https://github.com/orgs/cloud-hypervisor/projects/6. Fixes: #8057 Signed-off-by: Bo Chen <chen.bo@intel.com>	2023-09-25 12:22:01 -07:00
Fabiano Fidêncio	a4daa86535	Merge pull request #8028 from fidencio/topic/ci-test-with-crio-part-2 ci: k8s: crio: Follow up patches to have CRI-O also working as part of our CI	2023-09-25 18:40:42 +02:00
Gabriela Cervantes	81c8babca9	metrics: Fix latency yamls path This PR fixes the latency yamls path for the latency test for kata metrics. Fixes #8055 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-25 15:52:24 +00:00
Gabriela Cervantes	4815736820	metrics: Fix C-Ray documentation This PR fixes the C-Ray documentation for kata metrics. Fixes #8052 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-25 15:27:58 +00:00
Fabiano Fidêncio	ef63d67c41	ci: crio: Trail '\r' from exec_host() output We've faced this as part of the CI, only happening with the CRI-O tests: ``` not ok 1 Test readonly volume for pods # (from function `exec_host' in file tests_common.sh, line 51, # in test file k8s-file-volume.bats, line 25) # `exec_host "echo "$file_body" > $tmp_file"' failed with status 127 # [bats-exec-test:38] INFO: k8s configured to use runtimeclass # bash: line 1: $'\r': command not found # # Error from server (NotFound): pods "test-file-volume" not found ``` I must say I didn't dig into figuring out why this is happening, but we may be safe enough to just trail the '\r', as long as all the tests keep passing on containerd. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-25 16:42:18 +02:00
Fabiano Fidêncio	74c12b2927	ci: crio: Enable default capabilities We need the default capabilities to be enabled, especially `SYS_CHROOT`, in order to have tests accessing the host to pass. A huge thanks to Greg Kurz for spotting this and suggesting the fix. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Signed-off-by: Greg Kurz <groug@kaod.org>	2023-09-25 14:56:15 +02:00
Fabiano Fidêncio	358dc2f569	kata-deploy: Fix CRI-O detection Some of the "k8s distros" allow using CRI-O in a non-official way, and if that's done we cannot simply assume they're on containerd, otherwise kata-deploy will simply not work. In order to avoid such an issue, let's check for `cri-o` as the container engine first, and only proceed with the checks for the "k8s distros" after we rule out that CRI-O is being used. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-25 14:56:15 +02:00
Fabiano Fidêncio	ebaa4fa4c1	ci: crio: Pass `-y` to apt That was something overlooked during my tests. :-/ Fixes: #8005 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-25 14:56:15 +02:00
GabyCT	11cf0e2d28	Merge pull request #8038 from GabyCT/topic/latency metrics: Enable latency test in gha run script	2023-09-22 16:57:53 -06:00
GabyCT	3ef57b335e	Merge pull request #8045 from jepio/fix-docker-ownership local-build: Fix .docker ownership before build-payload	2023-09-22 14:43:38 -06:00
Archana Shinde	9bb9a3e7a4	Merge pull request #7966 from amshinde/runtime-rs-network-clh runtime-rs: Add network support for cloud-hypervisor	2023-09-22 13:08:09 -07:00
Gabriela Cervantes	97e73b2234	metrics: Fix spelling warnings This PR fixes general spelling warnings detected by the spelling check. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-22 15:50:51 +00:00
Gabriela Cervantes	36c8cd6f1f	metrics: Fix metrics README This PR fixes the network metrics section at the README by leaving the current tests that we have in our kata metrics. Fixes #8017 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-22 15:28:58 +00:00
Fabiano Fidêncio	c5a5a0c95e	Merge pull request #8012 from arronwy/strip osbuild: Reduce guest components binary size with strip	2023-09-22 15:45:38 +02:00
Fabiano Fidêncio	9d190f2390	Merge pull request #8042 from GabyCT/topic/pandoc gha: Add pandoc as a dependency for static checks	2023-09-22 15:31:18 +02:00
Jeremi Piotrowski	15425a2b80	local-build: Fix .docker ownership before build-payload The permissions on .docker/buildx/activity/default are regularly broken by us passing docker.sock + $HOME/.docker to a container running as root and then using buildx inside. Fixup ownership before executing docker commands. Fixes: #8027 Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-09-22 13:44:53 +02:00
Jeremi Piotrowski	a5338e885e	Merge pull request #8030 from portersrc/8027-ci-rootfs-image-build-asset-is-failing-oras ci: rootfs-image build-asset is failing	2023-09-22 11:07:50 +02:00
Chao Wu	6f98fbafde	Merge pull request #6706 from guixiongwei/feat/thp feat(runtime-rs): introduce huge page mode to select VM RAM's backend	2023-09-22 15:27:06 +08:00
Gabriela Cervantes	13ca7d9f97	gha: Add pandoc as a dependency for static checks To avoid the failure of not finding pandoc command this PR adds that package as a dependency for static checks. Fixes #8041 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-21 20:14:41 +00:00
Jeremi Piotrowski	28dd5ae91e	Merge pull request #7799 from UiPath/clh-directio-support clh: Direct IO support for block devices	2023-09-21 19:16:08 +02:00
David Esparza	6de9f39895	Merge pull request #8020 from GabyCT/topic/fixhunspell gha: Install hunspell for static checks	2023-09-21 10:58:40 -06:00
Gabriela Cervantes	08bc8e4db4	metrics: Add latency benchmark for gha This PR adds the latency benchmark for gha for kata metrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-21 16:14:39 +00:00
Gabriela Cervantes	6776b55d7e	metrics: Enable latency test in gha run script This PR enables the latency test for gha run script for kata metrics. Fixes #8037 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-21 16:11:58 +00:00
Peteris Rudzusiks	94e2ccc2d5	runtime: fix reading cgroup stats of sandboxes The cgroup stats come from resourcecontrol package in the form of pointers to structs. The sandbox Stat() method incorrectly was expecting structs. This caused the cpu and memory stats to always be 0, which in turn caused incorrect pod overhead metrics. Fixes #8035 Signed-off-by: Peteris Rudzusiks <rye@stripe.com>	2023-09-21 17:00:53 +02:00
Alexandru Matei	d507d189bb	fc: Add support for noflush cache option Firecracker supports noflush semantics via the Unsafe cache type. There is no support for direct I/O, so remove it from the config file. Fixes: #7823 Signed-off-by: Alexandru Matei <alexandru.matei@uipath.com>	2023-09-21 14:48:24 +03:00
Alexandru Matei	2ca781518a	clh: Direct IO support for block devices Clh supports direct I/O for disks. It doesn't offer any support for noflush, so stop passing that option to the cloud-hypervisor internal config. Fixes: #7798 Signed-off-by: Alexandru Matei <alexandru.matei@uipath.com>	2023-09-21 14:48:24 +03:00
Fabiano Fidêncio	dd27912f31	Merge pull request #8032 from fidencio/topic/ci-make-push-after-build-be-trigger-by-workflow-dispatch ci: Trigger payload-after-push on workflow_dispatch	2023-09-21 10:25:24 +02:00
Fabiano Fidêncio	0c95697cc4	ci: Trigger payload-after-push on workflow_dispatch This will allow us to easily test failures and fixes on that workflows. Fixes: #8031 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-21 09:24:13 +02:00
Chris Porter	28cbc3b51c	ci: rootfs-image build-asset is failing Fixes: #8027 Signed-off-by: Chris Porter <porter@ibm.com>	2023-09-21 00:58:42 -05:00
Fabiano Fidêncio	21f6f9a173	Merge pull request #8016 from fidencio/topic/ci-test-with-crio-part-1 ci: Actually enable the CRI-O tests	2023-09-21 07:42:27 +02:00
Wainer Moschetta	87e64a07ed	Merge pull request #7979 from beraldoleal/gogo-removal protocol: remove gogoprotobuff tests	2023-09-20 22:38:10 -03:00
Gabriela Cervantes	87a8616488	gha: Install hunspell for static checks Seems like the static checks are failing due to the missing hunspell package; this PR fixes that. Fixes #8019 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-20 16:58:10 +00:00
Fabiano Fidêncio	8c3c50ca8a	ci: Actually enable the CRI-O tests The test has been added to the repo, but we have to also add it to the list of jobs to be executed. Fixes: #8005 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-20 18:01:25 +02:00
David Esparza	03554c799a	Merge pull request #8006 from fidencio/topic/ci-test-with-crio-part-0 ci: k8s: Also run tests with CRI-O	2023-09-20 07:45:17 -06:00
Fabiano Fidêncio	c6a9e50c37	Merge pull request #8004 from microsoft/danmihai1/quoted-spaces runtime: support kernel params including spaces	2023-09-20 12:10:51 +02:00
Wang, Arron	3a6510ad61	osbuild: Reduce guest components binary size with strip opa_linux_amd64_static 38M => 27M kata-agent 30M => 23M ls -alh opa_linux_amd64_static -rw-rw-r-- 1 arron arron 38M Jul 28 01:59 opa_linux_amd64_static ➜ kata-containers git:(main) ✗ strip opa_linux_amd64_static ➜ kata-containers git:(main) ✗ ls -alh opa_linux_amd64_static -rw-rw-r-- 1 arron arron 27M Sep 20 16:12 opa_linux_amd64_static ls -alh ./usr/bin/kata-agent -rwxr-xr-x. 1 root root 30M Jul 30 23:41 ./usr/bin/kata-agent ls -alh ./usr/bin/kata-agent -rwxr-xr-x. 1 root root 23M Sep 20 16:13 ./usr/bin/kata-agent Fixes: #8011 Signed-off-by: Wang, Arron <arron.wang@intel.com>	2023-09-20 16:23:17 +08:00
Fabiano Fidêncio	07a6e63a6b	ci: k8s: rke2: Use sudo to call systemd Otherwise we'll face the following error: ``` Failed to enable unit: Interactive authentication required. ``` Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-20 08:48:29 +02:00
Fabiano Fidêncio	03b82e8484	ci: k8s: Add a CRI-O test Let's make sure we'll also be testing k8s using CRI-O. For now, we'll only be running the CRI-O test with QEMU. Once it becomes stable we can expand this to other Hypervisors as well. Fixes: #8005 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-20 00:59:09 +02:00
Fabiano Fidêncio	d7105cf7a4	ci: k8s: Add a method to install CRI-O This is based on the official CRI-O documentation[0] and right now we're making this specific to Ubuntu as that's what we have as runners. We may want to expand this in the future, but we're good for now. [0]: https://github.com/cri-o/cri-o/blob/main/install.md#apt-based-operating-systems Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-20 00:59:09 +02:00
Fabiano Fidêncio	54c0a471b1	ci: k8s: k0s: Allow passing parameters to the k0s installer We'll need this in order to setup k0s with a different container engine. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-20 00:59:09 +02:00
Fabiano Fidêncio	31ef64606c	Merge pull request #8007 from fidencio/topic/ci-kata-deploy-fix-garm-runner-name ci: kata-deploy: Fix runner name	2023-09-20 00:58:33 +02:00
Beraldo Leal	730ef51693	deps: updating dependencies Updating dependencies after make check, make test. Signed-off-by: Beraldo Leal <bleal@redhat.com>	2023-09-19 16:54:35 -04:00
GabyCT	6111ef6fb6	Merge pull request #7990 from GabyCT/topic/parallelbandwidth metrics: Enable parallel bandwidth iperf limit	2023-09-19 14:52:21 -06:00
Fabiano Fidêncio	3a2c83d69b	ci: kata-deploy: Fix runner name It should be garm-ubuntu-2004-smaller instead of garm-ubuntu-2004-small. Fixes: #7890 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-19 22:34:37 +02:00
Dan Mihai	82ff2db460	runtime: support kernel params including spaces Support quoted kernel command line parameters that include space characters. Example: dm-mod.create="dm-verity,,,ro,0 736328 verity 1 /dev/vda1 /dev/vda2 4096 4096 92041 0 sha256 f211b9f1921ef726d57a72bf82be23a510076639fa8549ade10f85e214e0ddb4 065c13dfb5b4e0af034685aa5442bddda47b17c182ee44ba55a373835d18a038" Fixes: #8003 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2023-09-19 20:26:38 +00:00
Beraldo Leal	604a9dd673	protocol: remove gogoprotobuff tests This is part of a bigger effort to drop gogoprotobuff from our code base. IIUC, those options are basically used by *pb_test.go, and since we are dropping gogoprotobuff and those are auto generated tests, let's just remove it. Fixes #7978. Signed-off-by: Beraldo Leal <bleal@redhat.com>	2023-09-19 12:55:42 -04:00
Fabiano Fidêncio	5560e72024	Merge pull request #7896 from fidencio/topic/ground-work-for-testing-all-k8s-flavours-we-support ci: kata-deploy: Enable all k8s flavours that we support	2023-09-19 17:44:34 +02:00
Fabiano Fidêncio	f7fa7f602a	ci: Enable kata-deploy tests for all the supported k8s flavours Let's ensure we test kata-deploy on RKE2 and k0s as well. Fixes: #7890 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-19 13:38:10 +02:00
Fabiano Fidêncio	2c908b598c	ci: kata-deploy: Add the ability to deploy rke2 This will be very useful in the near future, when we start testing kata-deploy with rke2 as well. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-19 13:38:10 +02:00
Fabiano Fidêncio	eaf6164916	ci: kata-deploy: Add the ability to deploy k0s This will be very useful in the near future, when we start testing kata-deploy with k0s as well. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-19 13:38:10 +02:00
Fabiano Fidêncio	0015257636	ci: kata-deploy: Add deploy-k8s argument to gha-run.sh We'll be using exactly the same code used for the k8s tests, which are already deploying k3s on GARM. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-19 13:38:10 +02:00
Fabiano Fidêncio	bf2cb02283	ci: kata-deploy: Expand tests to run on k0s / rke2 We just need to make sure the correct overlay is applied, following what we already have been doing for k3s. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-19 13:38:10 +02:00
Fabiano Fidêncio	6d5d844e5c	Merge pull request #7983 from sprt/resource-group-naming ci: Create clusters in individual resource groups	2023-09-19 12:54:21 +02:00
Fabiano Fidêncio	b12b9e1886	ci: kata-deploy: Add placeholder for tests on GARM We'll be testing kata-deploy with different kubernetes flavours as part of our GARM tests, and this is a place-holder for this. Once enabled, we'll do nothing, just `return 0`, so we can then properly add the tests after this commit gets merged. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-19 12:42:02 +02:00
Fabiano Fidêncio	9e1fb8a966	ci: kata-deploy: Export KUBERNETES env var So we have better control over which flavour of kubernetes kata-deploy is expected to be targeting. This was also done as part of `fa62a4c01b`, for the k8s tests. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-19 12:37:56 +02:00
Fabiano Fidêncio	09cc0ed438	ci: Move deploy_k8s() to gha-run-k8s-common.sh This will allow us to re-use the function in the kata-deploy tests, which will come soon. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-19 12:37:56 +02:00
Fabiano Fidêncio	1829f5c049	Merge pull request #7992 from skaegi/virtiofsd-1.8.0 versions: Bump virtiofsd to v1.8.0	2023-09-19 11:52:49 +02:00
Fabiano Fidêncio	486fe14c99	ci: Properly set K8S_TEST_UNION Otherwise only the first test will be executed Signed-off-by: Aurélien Bombo <abombo@microsoft.com> Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-19 10:23:58 +02:00
Aurélien Bombo	d9ef1352af	ci: Add first letter of the K8S_TEST_HOST_TYPE to resource group name Ideally we'd add the instance_type or the full K8S_TEST_HOST_TYPE but that exceeds the maximum number of characters allowed for the cluster name. With this in mind, let's use the first letter of K8S_TEST_HOST_TYPE instead. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-09-19 10:23:58 +02:00
Aurélien Bombo	68267a3996	ci: Create clusters in individual resource groups This makes it so that each AKS cluster is created in its own individual resource group, rather than using the "kataCI" resource group for all test clusters. This is to accommodate a tool that we recently introduced in our Azure subscription which automatically deletes resource groups after a set amount of time, in order to keep spending under control. The tool will automatically delete any resource group, unless it has a tag SkipAutoDeleteTill = YYYY-MM-DD. When this tag is present, the resource group will be retained until the specified date. Note that I tagged all current resource groups in our subscription with SkipAutoDeleteTill = 2043-01-01 so that we don't lose any existing resources. Fixes: #7982 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-09-19 10:23:55 +02:00
Fabiano Fidêncio	84c0d59d23	Merge pull request #7985 from fidencio/topic/clh-use-static_sandbox_resource_mgmt-as-default-on-arm clh: arm: Use static_sandbox_resource_mgmt=true	2023-09-19 09:25:34 +02:00
Gabriela Cervantes	9aa8d1c917	metrics: Add parallel bandwidth limit for qemu This PR adds the parallel bandwidth limit for qemu for kata metrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-18 21:08:54 +00:00
Simon Kaegi	44c7c082d9	versions: Bump virtiofsd to v1.8.0 https://gitlab.com/virtio-fs/virtiofsd/-/releases/v1.8.0 was released two weeks ago. We have fully tested and are using this version. Also bumps toolchain version to match what virtiofsd used. Fixes: #7960 Signed-off-by: Simon Kaegi <simon.kaegi@gmail.com>	2023-09-18 15:21:15 -04:00
Fabiano Fidêncio	5f8e210d3b	Merge pull request #7961 from ChengyuZhu6/update_nydus Bump nydus versions and update nydus tests	2023-09-18 21:02:20 +02:00
Fabiano Fidêncio	c3ee913bf6	Merge pull request #7953 from gkurz/extra-monitor-socket runtime/qemu: Rework QMP/HMP support	2023-09-18 19:04:14 +02:00
Gabriela Cervantes	af59d4bf4a	metrics: Enable parallel bandwidth iperf limit This PR enables the parallel bandwidth iperf limit for kata metrics. Fixes #7989 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-18 16:32:11 +00:00
Fabiano Fidêncio	aba36ab188	nydus: Temporarily skip tests on dragonball We're hitting a specific issue after updating, which will require some work on dragonball before it can be re-added here. The issue: ``` ... 3: failed to do rafs mount\\n 4: fail to attach rafs \\\"/var/lib/containerd-nydus/snapshots/2/fs/image/image.boot\\\"\\n 5: add share fs mount\\n 6: Mount rafs at /rafs/197ef3db03c86b91bf3045ff59183ce8b5750941ad1d3484f4a8301a70f5109f/rootfs_lower error: Failed to Mount backend ... Caused by: vmm action error: FsDevice(AttachBackendFailed(\\\"attach/detach a backend filesystem failed:: missing field `version` at line 1 column 489\\\"))\"): unknown" ``` Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-18 17:40:06 +02:00
Fabiano Fidêncio	b8a8dfcd15	nydus: Use `kata-${KATA_HYPERVISOR}` instead of `kata` This will ensure we're testing with the correct runtime, instead of using the `default` one. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-18 17:40:06 +02:00
ChengyuZhu6	f6df3d6efb	static-build: Fix arch error on nydus build Fix the arch error when downloading the nydus tarball. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com> Signed-off-by: Steven Horsman <steven@uk.ibm.com>	2023-09-18 17:40:06 +02:00
ChengyuZhu6	2f9c9e2e63	tests: nydus: Update nydus tests To support the v0.12.0 nydus-snapshotter, we need to update the config files and the commandline to start nydus-snapshotter. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2023-09-18 17:40:06 +02:00
Fabiano Fidêncio	c9a4e7e46d	versions: Bump nydus and nydus-snapshotter to its latest release As we need https://github.com/containerd/nydus-snapshotter/pull/530 in. Fixes #7984 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2023-09-18 17:40:06 +02:00
Fabiano Fidêncio	b73bde320d	gha: nydus: Populate run() And with this we finally enable the nydus tests to run as part of our GHA CI. Fixes: #6543 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-18 17:40:06 +02:00
Fabiano Fidêncio	b3904a1a30	gha: nydus: Populate install_dependencies() Let's have all the dependencies needed for running the nydus tests installed. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-18 17:40:06 +02:00
Fabiano Fidêncio	d2b3b67f5d	gha: nydus: Actually install kata when `install-kata` is called We've been simply doing nothing whenever `install-kata` was called, and that was the intent when we added the placeholder calls. Now, let's install kata, as expected. :-) Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-18 17:40:06 +02:00
Fabiano Fidêncio	0ec00ad42e	gha: nydus: Get rid of nydus{,-snapshotter} install from nydus_test.sh As we've added install_nydus() and install_nydus_snapshotter(), which do conform with the pattern we're following on GHA, let's rely on them rather than relying on the bits coming from nydus_test.sh. Later on we'll have install_nydus() and install_nydus_snapshotter() as part of the dependencies install in our `gha-run.sh`. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-18 17:40:06 +02:00
Fabiano Fidêncio	568439c77b	tests: nydus: Add timeout to the crictl calls Similarly to what's been done for the cri-containerd tests, as part of `84dd02e0f9`, we need to add the timeout here for the crictl calls. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-18 17:40:06 +02:00
Fabiano Fidêncio	5ac3b76eb1	tests: nydus: Add uid / namespace to the nydus container / sandbox Otherwise we may face errors like: ``` getting sandbox status of pod "d3af2db414ce8": metadata.Name, metadata.Namespace or metadata.Uid is not in metadata "&PodSandboxMetadata{Name:nydus-sandbox,Uid:,Namespace:default,Attempt:1,}" getting sandbox status of pod "-A": rpc error: code = NotFound desc = an error occurred when try to find sandbox: not found ``` Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-18 17:40:06 +02:00
Fabiano Fidêncio	376574a16c	tests: nydus: Decorate some calls with `sudo` Otherwise we cannot properly start the nydus snapshotter, nor properly kill it after it's been started. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-18 17:40:06 +02:00
Fabiano Fidêncio	4290fd4b67	tests: nydus: Adapt "source ..." to GHA The "source ..." we've been doing was not changed since those tests were part of the Jenkins tests, and we need to adapt them, either setting the correct path or entirely removing the ones that are not relevant to us anymore. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-18 17:40:06 +02:00
Fabiano Fidêncio	a84efa3e87	tests: nydus: Adapt check to "clh" instead of "cloud-hypervisor" As that's what we've been using as part of the GHA. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-18 17:40:06 +02:00
Fabiano Fidêncio	56a14b3950	tests: common: Add install_nydus_snapshotter() This function will be used to download and install the nydus-snapshotter, and it follows the same pattern we already have introduced for downloading and installing other dependencies from GitHub. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-18 17:40:06 +02:00
Fabiano Fidêncio	b6563783e2	tests: common: Add install_nydus() This function will be used to download and install nydus, and it follows the same pattern we already have introduced for downloading and installing other dependencies from GitHub. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-18 17:40:06 +02:00
Fabiano Fidêncio	72599f1911	clh: arm: Use static_sandbox_resource_mgmt=true Users have noticed that this is needed, as CLH does not yet implement a way to hotplug resources on aarch64. With this patch, when building for x86_64, I can see that this is the resulting config: ``` $ ARCH=amd64 make ... $ cat config/configuration-clh.toml \| grep static_sandbox_resource_mgmt static_sandbox_resource_mgmt=false ``` And when building for aarch64: ``` $ ARCH=arm64 make ... $ cat config/configuration-clh.toml \| grep static_sandbox_resource_mgmt static_sandbox_resource_mgmt=true ``` Fixes: #7941 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-18 14:14:10 +02:00
Jeremi Piotrowski	dfa6af54df	Merge pull request #7806 from jongwu/clh_serial clh:arm64: use arm AMBA UART for hypervisor debug	2023-09-18 12:29:07 +02:00
Greg Kurz	1f16b6627b	runtime/qemu: Rework QMP/HMP support PR #6146 added the possibility to control QEMU with an extra HMP socket as an aid for debugging. This is great for development or bug chasing but this raises some concerns in production. The HMP monitor allows tampering with the VM state in a variety of ways. This could be intentionally or mistakenly used to inject subtle bugs in the VM that would be extremely hard if not even impossible to debug. We definitely don't want that to be enabled by default. The feature is currently wired to the `enable_debug` setting in the `[hypervisor.qemu]` section of the configuration file. This setting has historically been used to control "debug output" and it is used as such by some downstream users (e.g. Openshift). Forcing people to have the extra HMP backdoor at the same time is abusive and dangerous. A new `extra_monitor_socket` is added to `[hypervisor.qemu]` to give fine control on whether the HMP socket is wanted or not. This setting is still gated by `enable_debug = true` to make it clear it is for debug only. The default is to not have the HMP socket though. This isn't backward compatible with #6416 but it is for the sake of "better safe than sorry". An extra monitor socket makes the QEMU instance untrusted. A warning is thus logged to the journal when one is requested. While here, also allow the user to choose between HMP and QMP for the extra monitor socket. Motivation is that QMP offers way more options to control or introspect the VM than HMP does. Users can also ask for pretty json formatting well suited for human reading. This will improve the debugging experience. This feature is only made visible in the base and GPU configurations of QEMU for now. Fixes #7952 Signed-off-by: Greg Kurz <groug@kaod.org>	2023-09-18 12:13:01 +02:00
Greg Kurz	cab46c9e23	Merge pull request #7973 from fidencio/topic/ci-use-bigger-machine-sizes-for-the-needed-tests-part-0 ci: Use variable size of VMs depending on the tests running	2023-09-18 12:06:44 +02:00
Fabiano Fidêncio	0e3bfac3b3	Merge pull request #7976 from fidencio/topic/ci-static-checks-rework-part-0 ci: Rework static checks	2023-09-18 11:01:18 +02:00
Peng Tao	6eedd9b0b9	Merge pull request #7738 from Xuanqing-Shi/7732/handle-non-empty-endpoints-in-RemoveEndpoints runtime: incorrect handling of non-empty []Endpoint parameter in Remo…	2023-09-18 10:58:28 +08:00
Fabiano Fidêncio	8b1e9b0c75	ci: static-checks: Clean up static-checks job Now that the static-checks job only takes care of running the static-checks, let's clean it up, remove all the unneeded steps, make sure that we're using the actions in their latest version, and have it running in a cost free runner. At some point I'd like to see those tests done in parallel, in the same way that I've organised the build-checks, but that's something for someone else, at some other time. Fixes: #7974 -- part 0 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-16 14:23:02 +02:00
Fabiano Fidêncio	2c5ca2eaf8	ci: static-checks: Run tests depending on KVM With this we're removing the dragonball static-checks CI, as the test is running here now. :-) Fixes: #7974 -- part 0 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-16 14:22:38 +02:00
Fabiano Fidêncio	509c309ab2	ci: static-checks: Move "sudo make test" to the new test matrix We're moving it out of the previous "static-checks" confusing matrix, and adding it to the matrix that was currently being used for the `make vendor` and `make check` checks. This will allow us to have one job per component, and with that we can easily run those in parallel and on the zero cost runners. Fixes: #7974 -- part 0 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-16 12:53:23 +02:00
Fabiano Fidêncio	4e963cedf4	ci: static-checks: Move "make test" to the new test matrix We're moving it out of the previous "static-checks" confusing matrix, and adding it to the matrix that was currently being used for the `make vendor` and `make check` checks. This will allow us to have one job per component, and with that we can easily run those in parallel and on the zero cost runners. Fixes: #7974 -- part 0 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-16 12:53:17 +02:00
Fabiano Fidêncio	08f2e5ae0b	runtime-rs: Ensure static-checks-build is a dep of `make test` Otherwise `make test` will simply fail with: ``` error[E0583]: file not found for module `config` ``` Fixes: #7974 -- part 0 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-16 12:53:13 +02:00
Fabiano Fidêncio	2bc3a616ae	kata-ctl: Use `loop` instead of `kvm` module in tests This makes it possible to run the tests in the cost free runners, which are not KVM capable. Fixes: #7974 -- part 0 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-16 12:53:08 +02:00
Fabiano Fidêncio	46daddc500	kata-ctl: Ensure GENERATED_CODE is a dep of `make test` Otherwise `make test` will simply fail with: ``` error[E0583]: file not found for module `version` ``` Fixes: #7974 -- part 0 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-16 12:53:01 +02:00
Fabiano Fidêncio	ec826f328f	agent: Ensure GENERATED_CODE is a dep of `make test` Otherwise `make test` will fail with: ``` error[E0583]: file not found for module `version` ``` Fixes: #7974 -- part 0 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-16 12:52:57 +02:00
Fabiano Fidêncio	1d32410a83	ci: install_libseccomp: Do not depend on the tests repo It makes things way simpler, waaaaay simpler. Fixes: #7974 -- part 0 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-16 12:52:49 +02:00
Fabiano Fidêncio	bf888b9a5e	ci: static-checks: Move "make check" to the new test matrix We're moving it out of the previous "static-checks" confusing matrix, and adding it to the matrix that was currently being used for the `make vendor` checks. This will allow us to have one job per component, and with that we can easily run those in parallel and on the zero cost runners. Fixes: #7974 -- part 0 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-16 12:52:45 +02:00
Fabiano Fidêncio	473ec87806	kata-ctl: Add `kata-types` to the Cargo.lock file Commit message covered everything. :-) Fixes: #7974 -- part 0 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-16 12:52:40 +02:00
Fabiano Fidêncio	ea19549a99	kata-ctl: Ensure GENERATED_CODE is a dep of `make check` Otherwise `make check` would fail with: ``` Error writing files: failed to resolve mod `version`: /home/runner/work/kata-containers/kata-containers/src/tools/kata-ctl/src/ops/version.rs does not exist make: *** [../../../utils.mk:176: standard_rust_check] Error 1 ``` Fixes: #7974 -- part 0 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-16 12:52:36 +02:00
Fabiano Fidêncio	e125775863	tests: install_rust: Also install clippy clippy is used as part of our tests, so it's useful to have it installed while we're already installing rust. In case of developers, they also better be using it. :-) Fixes: #7974 -- part 0 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-16 12:52:31 +02:00
Fabiano Fidêncio	e2c61a152c	ci: static-checks: Move vendor check to its own job Similarly to the static-check jobs, those jobs can be run on the zero cost runners. Fixes: #7974 -- part 0 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-16 12:52:30 +02:00
Fabiano Fidêncio	6794d4c843	tests: Move install_rust.sh from the tests repo We'll use it as part of the refactoring we're doing in the static check tests. I can see a lot of other uses of this, but changing all of them to this one is out of the scope for this PR. Fixes: #7974 -- part 0 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-16 12:52:29 +02:00
Fabiano Fidêncio	e64508c308	tests: install_go: Remove tests repo dependency We can rely on the functions that are now part of the common.bash. Fixes: #7974 -- part 0 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-16 12:52:28 +02:00
Fabiano Fidêncio	11dff731b7	tests: Move functions from kata_arch script here We can use this a lot as part of our CI, but right now I'm just moving those here with the intent to use later on in this series. Fixes: #7974 -- part 0 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-16 12:52:28 +02:00
Fabiano Fidêncio	75c974c802	ci: static-checks: Move kernel config check to its own job It doesn't make sense to run this for all the bits of the matrix, nor is it demanding enough to require running this in one of our Azure sponsored runners. Fixes: #7974 -- part 0 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-16 12:52:25 +02:00
Archana Shinde	9c233bb9e0	test: Add test to verify try_from for clh Netconfig Add tests to verify conversion from runtime NetworkConfig to clh specific config. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-09-16 00:24:14 -07:00
Fabiano Fidêncio	c69a1e33bd	ci: Use variable size of VMs depending on the tests running Let me start with a fair warning that this commit is hard to split into different parts that could be easily tested (or not tested, just ignored) without breaking pieces. Now, about the commit itself, as we're on the run to reduce costs related to our sponsorship on Azure, we can split the k8s tests we run in 2 simple groups: * Tests that can be run in the smaller Azure instance (D2s_v5) * Tests that require the normal Azure instance (D4s_v5) With this in mind, we're now passing to the tests which type of host we're using, which allows us to select to run either one of the two types of tests, or even both in case of running the tests on a baremetal system. Fixes: #7972 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-16 09:13:54 +02:00
Archana Shinde	9049d311df	runtime-rs: Add network support for cloud-hypervisor This PR adds support for adding a network device before starting the cloud-hypervisor VM. Support for adding and removing network devices is not really added to the resource manager, so supporting this for cloud-hypervisor is not scoped in this PR. This also changes "pending_devices" for clh implementation from an Option of vector to simply a vector. This simplifies the structure a bit as we can simply iterate over the pending devices instead of having to check for a "Some" value as this is not really required. Fixes: #6333 Signed-off-by: Shuaiyi Zhang <zhang_syi@qq.com> Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-09-15 23:25:20 -07:00
Greg Kurz	79c494eb4e	Merge pull request #7969 from fidencio/topic/ci-cache-using-oras-part-3 ci: cache: Check the sha256sum of the components & fix ovmf-sev cache usage	2023-09-15 16:30:22 +02:00
Fabiano Fidêncio	eecd5bf2aa	ci: cache: Fix ovmf-sev cache The cached tarball is relying on the component name, thus it's important to set it correctly, otherwise we'll end up always building it. With this patch applied: ``` ≡ ⨯ make ovmf-sev-tarball make ovmf-sev-tarball-build make[1]: Entering directory '/home/ffidenci/src/upstream/kata-containers/kata-containers' /home/ffidenci/src/upstream/kata-containers/kata-containers/tools/packaging/kata-deploy/local-build//kata-deploy-binaries-in-docker.sh --build=ovmf-sev sha256:67cc94e393dc1d5bfc2b77a77e83c9b1c0833d0fbbebaa9e9e36f938bb841fcc Build kata version 3.2.0-rc0: ovmf-sev INFO: DESTDIR /home/ffidenci/src/upstream/kata-containers/kata-containers/tools/packaging/kata-deploy/local-build/build/ovmf-sev/destdir Downloading a76f5522493f ovmf-sev-builder-image-version Downloading 7e98c854bd94 kata-static-ovmf-sev.tar.xz Downloading 559311973ff8 ovmf-sev-version Downloaded a76f5522493f ovmf-sev-builder-image-version Downloading 353b655c2297 ovmf-sev-sha256sum Downloaded 559311973ff8 ovmf-sev-version Downloaded 353b655c2297 ovmf-sev-sha256sum Downloaded 7e98c854bd94 kata-static-ovmf-sev.tar.xz Pulled [registry] ghcr.io/kata-containers/cached-artefacts/ovmf-sev:latest-main-x86_64 Digest: sha256:933236c2c79e53be3ca7acc0b966d0ddac9c0335edcb1e8cad8b9bb3aaf508ce kata-static-ovmf-sev.tar.xz: OK INFO: Using cached tarball of ovmf-sev drwxr-xr-x runner/runner 0 2023-09-15 10:34 ./ drwxr-xr-x runner/runner 0 2023-09-15 10:34 ./opt/ drwxr-xr-x runner/runner 0 2023-09-15 10:34 ./opt/kata/ drwxr-xr-x runner/runner 0 2023-09-15 10:34 ./opt/kata/share/ drwxr-xr-x runner/runner 0 2023-09-15 10:34 ./opt/kata/share/ovmf/ -rwxr-xr-x runner/runner 4194304 2023-09-15 10:34 ./opt/kata/share/ovmf/AMDSEV.fd ~/src/upstream/kata-containers/kata-containers/tools/packaging/kata-deploy/local-build/build ~/src/upstream/kata-containers/kata-containers/tools/packaging/kata-deploy/local-build/build/ovmf-sev/builddir 
~/src/upstream/kata-containers/kata-containers/tools/packaging/kata-deploy/local-build/build/ovmf-sev/builddir make[1]: Leaving directory '/home/ffidenci/src/upstream/kata-containers/kata-containers' ``` Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-15 12:39:22 +02:00
Fabiano Fidêncio	86c41074b4	ci: cache: Check the sha256sum of the component We've removed this in the part 2 of this effort, as we were not caching the sha256sum of the component. Now that this part has been merged, let's get back to checking it. Fixes: #7834 -- part 3 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-15 12:34:30 +02:00
Fabiano Fidêncio	f5e52d02d3	Merge pull request #7964 from fidencio/topic/ci-cache-using-oras-part-2 ci: cache: Use the artefacts stored in ghcr.io/kata-containers/cached-artefacts/${component}	2023-09-15 12:29:28 +02:00
Fabiano Fidêncio	2fe0b494da	Merge pull request #7959 from fidencio/topic/ci-run-on-smaller-garm-instances ci: Run some of the GARM tests in smaller instances	2023-09-15 11:30:13 +02:00
Fabiano Fidêncio	460988c5f7	ci: cache: Remove the script used to cache artefacts on Jenkins That's not needed anymore, as we've switched to using ORAS and an OCI registry to cache the artefacts. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-15 10:27:55 +02:00
Fabiano Fidêncio	4533a7a416	ci: cache: Also store the ${component} sha256sum This is something that was done by our Jenkins jobs, but that I ended up missing when writing `d0c257b3a7`. Now, let's also add the sha256sum to the cached artefact, and in a coming up PR (after this one is merged) we will also start checking for that. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-15 10:25:26 +02:00
Fabiano Fidêncio	eccc76df63	ci: cache: Use the cached artefacts from ORAS In the previous series related to the artefacts we build, we've switched from storing the artefacts on Jenkins, to storing those in the ghcr.io/kata-containers/cached-artefacts/${artefact_name}. Now, let's take advantage of that and actually use the artefacts coming from that "package" (as GitHub calls it). NOTE: One thing that I've noticed that we're missing, is storing and checking the sha256sum of the artefact. The storing part will be done in a different commit, and the checking the sha256sum will be done in a different PR, as we need to ensure those were pushed to the registry before actually taking the bullet to check for them. Fixes: #7834 -- part 2 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-15 10:13:47 +02:00
Jeremi Piotrowski	6f30d00ae7	Merge pull request #7956 from fidencio/topic/ci-reduce-the-machine-size-used ci: Reduce the size of the AKS VMs	2023-09-15 08:49:08 +02:00
Steve Horsman	1b8f3fa9ae	Merge pull request #7957 from fidencio/topic/ci-cache-using-oras-part-1 ci: cache: Allow pushing our artefacts to an OCI registry	2023-09-15 07:45:24 +01:00
Jianyong Wu	7f5e77bcb8	kernel: enable Arm pl011 support Enable pl011 (ttyAMA0) support in kernel for aarch64. Fixes: #5080 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2023-09-15 01:45:16 +00:00
Jianyong Wu	241c355e07	clh:arm64: use arm AMBA uart for hypervisor debug cloud hypervisor on arm64 only supports arm AMBA UART(pl011) as tty. So, the console should be set to "ttyAMA0" instead of "ttyS0" when enabling hypervisor debug mode. Fixes: #5080 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2023-09-15 01:44:23 +00:00
Fabiano Fidêncio	094b6b2cf8	ci: k8s: Temporarily disable tests that require a bigger VM instance The list of tests which require a bigger VM instance is: * k8s-number-cpus.bats -- failing on all CIs * k8s-parallel.bats -- only failing on the cbl-mariner CI * k8s-scale-nginx.bats -- only failing on the cbl-mariner CI We'll keep those disabled while we re-work the logic to only run those in a bigger (and more expensive) VM instance. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-15 01:33:19 +02:00
GabyCT	6fe5cd3bd5	Merge pull request #7937 from GabyCT/topic/iperfbandwidth metrics: Add iperf value for cpu utilization	2023-09-14 16:47:19 -06:00
Fabiano Fidêncio	d0c257b3a7	ci: cache: Push cached artefacts to ghcr.io Let's push the artefacts to ghcr.io and stop relying on jenkins for that. Fixes: #7834 -- part 1 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-15 00:39:57 +02:00
Fabiano Fidêncio	108f1b60dd	kata-deploy: Generate latest_{artefact,image_builder} files Right now this is not used, but it'll be used when we start caching the artefacts using ORAS. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-15 00:39:57 +02:00
Fabiano Fidêncio	be2eb7b378	ci: cache: Install ORAS in the kata-deploy binaries builder container ORAS is the tool which will help us to deal with our artefacts being pushed to and pulled from a container registry. As both the push to and the pull from will be done inside the kata-deploy binaries builder container, we need it installed there. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-15 00:39:57 +02:00
Fabiano Fidêncio	fb24fb0dc1	ci: k8s: devmapper: Use a smaller / cheaper VM instance We don't need to run on a D4s_v5, as those tests are not CPU / memory intensive. With this in mind, let's use a smaller version of the instance, the D2s_v5 one. Fixes: #7958 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-15 00:27:05 +02:00
Fabiano Fidêncio	1daf02f5d4	ci: nydus: Use a smaller / cheaper VM instance We don't need to run on a D4s_v5, as those tests are not CPU / memory intensive. With this in mind, let's use a smaller version of the instance, the D2s_v5 one. Fixes: #7958 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-15 00:25:41 +02:00
Fabiano Fidêncio	e60d81f554	ci: nerdctl: Use a smaller / cheaper VM instance We don't need to run on a D4s_v5, as those tests are not CPU / memory intensive. With this in mind, let's use a smaller version of the instance, the D2s_v5 one. Fixes: #7958 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-15 00:25:41 +02:00
Fabiano Fidêncio	4db416997c	ci: docker: Use a smaller / cheaper VM instance We don't need to run on a D4s_v5, as those tests are not CPU / memory intensive. With this in mind, let's use a smaller version of the instance, the D2s_v5 one. Fixes: #7958 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-15 00:25:41 +02:00
Fabiano Fidêncio	32841827b8	ci: cri-containerd: Use a smaller / cheaper VM instance We don't need to run on a D4s_v5, as those tests are not CPU / memory intensive. With this in mind, let's use a smaller version of the instance, the D2s_v5 one. Fixes: #7958 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-15 00:25:35 +02:00
Fabiano Fidêncio	92fff129fd	ci: k8s: Don't set cpu limit request for k8s-inotify test Without setting the cpu limit / request to 1, we can make this test run in a smaller VM instance without any issue. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-14 22:03:16 +02:00
Fabiano Fidêncio	faf98c0623	ci: Reduce the size of the AKS VMs We do not need a very powerful machine for our tests, as we're not building anything there. The instance we switched to (Standard_D2s_v5) still has nested virt available, as shown here[0], but has half of the amount of vCPUs / Memory, which should be fine only for running the tests, costing us basically half of the price[1]. [0]: https://learn.microsoft.com/en-us/azure/virtual-machines/dv5-dsv5-series [1]: https://azure.microsoft.com/en-us/pricing/details/virtual-machines/linux/#pricing Fixes: #7955 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-14 22:03:16 +02:00
Fabiano Fidêncio	adc18ecdb1	ci: cache: For consistency, read all used env vars Instead of having some of them only being considered if explicitly passed to the script. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-14 20:24:48 +02:00
Fabiano Fidêncio	c7a851efd7	ci: cache: Pass the exposed env vars to the kata-deploy binaries in docker As the environment variables are now being passed down from the GitHub Actions, let's make sure they're exposed to the container used to build the kata-deploy binaries, and during the build process we'll be able to use those to log in and push the artefacts to the OCI registry, using ORAS. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-14 20:24:48 +02:00
Fabiano Fidêncio	2e8b41f39c	Merge pull request #7954 from fidencio/topic/ci-cache-using-oras-part-0 ci: cache: Export env vars needed to use ORAS	2023-09-14 20:23:55 +02:00
Fabiano Fidêncio	6bd15a85d5	ci: cache: Export env vars needed to use ORAS We do the build of our artefacts inside a container image, and we need to expose some env vars to the container so ORAS can be used there to push the artefacts we want to cache to ghcr.io. The env vars we're exposing are: * ARTEFACT_REGISTRY: The registry where we're going to save the artefacts. * ARTEFACT_REGISTRY_USERNAME: The username to log in to the registry, as ORAS does not use the same json file used by docker. * ARTEFACT_REGISTRY_PASSWORD: The password to log in to the registry, as ORAS does not use the same json file used by docker. * TARGET_BRANCH: The target branch, which will be part of the tag of the artefact, as we may end up caching the artefacts for both main and stable branches. Fixes: #7834 -- part 0 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-14 19:36:33 +02:00
Gabriela Cervantes	cd4fd1292a	metrics: Add iperf cpu utilization limit for qemu This PR adds the iperf cpu utilization limit for qemu for kata metrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-14 17:17:47 +00:00
Gabriela Cervantes	df5cd10ea0	metrics: Add iperf value for cpu utilization This PR adds the iperf value for cpu utilization for kata metrics. Fixes #7936 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-14 16:06:49 +00:00
Jeremi Piotrowski	b54dd8cdf4	Merge pull request #7704 from jepio/vfio-part-1 gha: vfio: Import test script	2023-09-14 16:45:31 +02:00
Jeremi Piotrowski	a96050a7ad	tests: Apply timeout to 'ctr t kill' This task has been observed to hang at times. Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-09-14 14:23:28 +02:00
Jeremi Piotrowski	9d93036783	tests/vfio: Bump VM image to Fedora 38 We need a very recent L2 guest kernel to fix all the bugs that occur in nested virtualization. Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-09-14 14:23:28 +02:00
Jeremi Piotrowski	faee59b520	tests/vfio: Accept single device in vfio group for CLH cloud hypervisor does not emulate pcie switches or pci bridges, so we need to accept a lonely device. Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-09-14 14:23:28 +02:00
Jeremi Piotrowski	df3dc1105c	tests/vfio: Get rid of sync's It is fine to start a VM with the disk image without syncing it as we now run the test in an ephemeral Azure instance. Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-09-14 14:23:28 +02:00
Jeremi Piotrowski	7211c3dccc	gha: vfio: Set test timeout to 15m Sometimes the test gets stuck running commands in the container - need to investigate why later. Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-09-14 14:23:28 +02:00
Jeremi Piotrowski	1b02f89e4f	packaging: kernel: Enable VIRTIO_IOMMU on x86_64 Cloud Hypervisor exposes a VIRTIO_IOMMU device to the VM when IOMMU support is enabled. We need to add it to the whitelist because dragonball uses kernel v5.10 which restricted VIRTIO_IOMMU to ARM64 only. Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-09-14 14:23:28 +02:00
Jeremi Piotrowski	3a1db7a86b	runtime: clh: Support enabling iommu by enabling IOMMU on the default PCI segment. For hotplug to work we need a virtualized iommu and clh exposes one if there is some device or PCI segment that requests it. I would have preferred to add a separate PCI segment for hotplugging vfio devices but unfortunately kata assumes there is only one segment all over the place. See create_pci_root_bus_path(), split_vfio_pci_option() and grep for '0000'. Enabling the IOMMU on the default PCI segment requires enabling IOMMU on every device that is attached to it, which is why it is sprinkled all over the place. CLH does not support IOMMU for VirtioFs, so I've added a non IOMMU segment for that device. Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-09-14 14:23:28 +02:00
Jeremi Piotrowski	9f1a42c6cc	tests/vfio: Give commands 30s to execute This is to catch the case of the guest getting stuck. Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-09-14 14:23:28 +02:00
Jeremi Piotrowski	b46b0ecf8b	tests/vfio: Configure a value for 'hot_plug_vfio' for both vmms This shouldn't be hiding behind only a qemu check, we need this for clh as well. Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-09-14 14:23:28 +02:00
Jeremi Piotrowski	bfc93927fb	runtime: Remove redundant check in checkPCIeConfig There is no way for this branch to be hit, as port is only set when it is different than config.NoPort. Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-09-14 14:23:28 +02:00
Jeremi Piotrowski	7c4e73b609	runtime: Add test cases for checkPCIeConfig These test cases show which options are valid for CLH/Qemu, and test that we correctly catch unsupported combinations. Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-09-14 14:23:28 +02:00
Jeremi Piotrowski	fc51e4b9eb	runtime: Check config for supported CLH (cold\|hot)_plug_vfio values The only supported options are hot_plug_vfio=root-port or no-port. cold_plug_vfio is not supported yet. Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-09-14 14:23:28 +02:00
Jeremi Piotrowski	509771e6f5	runtime: clh: Add hot_plug_vfio entry to config hot_plug_vfio needs to be set to root-port, otherwise attaching vfio devices to CLH VMs fails. Either cold_plug_vfio or hot_plug_vfio is required, and we have not implemented support for cold_plug_vfio in CLH yet. Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-09-14 14:23:28 +02:00
Jeremi Piotrowski	5f6475a28a	tests/vfio: Gather debug info and disable tdp_mmu tdp_mmu had some issues up until around Linux v6.3 that make it work particularly badly when running nested on Hyper-V. Reload the module at the start of the test and disable the tdp_mmu param. Gather debug info at the end of the test to make it easier to figure out what went wrong. This uses github actions group syntax so that each section can be collapsed. Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-09-14 14:23:28 +02:00
Jeremi Piotrowski	8fffdc81c5	tests/vfio: Capture journal from vm For debugging (though this doesn't get exposed yet). Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-09-14 14:23:28 +02:00
Jeremi Piotrowski	df815087e7	tests/vfio: Change to get the test working in GHA - reduce memory and cpu usage to fit in a D4s_v5 - source correct lib - mount workspace from 9p - disable cpu mitigations for speed - drop unused commands and variables - install containerd - install kata from built artifacts Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-09-14 14:23:28 +02:00
Jeremi Piotrowski	a92ddeea15	tests/vfio: Move dependency installation to gha-run.sh To match the flow of other github actions workflows. Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-09-14 14:23:28 +02:00
Jeremi Piotrowski	5a551a85b1	gha: vfio: Import jobs scripts from tests repo This imports the vfio test scripts from github.com/kata-containers/tests. The test case doesn't work yet but doing the changes in a separate commit will make it easier to track the changes. The only change in this commit is renaming vfio_jenkins_job_build.sh -> vfio_fedora_vm_wrapper.sh Fixes: #6555 Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-09-14 14:23:28 +02:00
Fabiano Fidêncio	a1e3fa7ac4	Merge pull request #7905 from microsoft/danmihai1/mariner-annotations tests: fix kernel and initrd annotations	2023-09-14 10:37:42 +02:00
GabyCT	1d331124ad	Merge pull request #7925 from GabyCT/topic/bandwidthlimit metrics: Add iperf bandwidth value for kata metrics	2023-09-13 17:43:55 -06:00
Gabriela Cervantes	49e2fa189c	metrics: Increase jitter value for qemu This PR increases the jitter value for qemu for kata metrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-13 22:36:09 +00:00
Gabriela Cervantes	49234433a7	metrics: Increase value limit for jitter in clh This PR increases the value limit for jitter in clh. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-13 21:27:08 +00:00
David Esparza	0a24d3f718	Merge pull request #7923 from GabyCT/topic/addcassandradoc metrics: Add Cassandra Metrics documentation	2023-09-13 10:17:00 -06:00
GabyCT	c565053bac	Merge pull request #7895 from GabyCT/topic/removewarning metrics: Remove warning from metrics documentation	2023-09-13 10:16:38 -06:00
Fabiano Fidêncio	8b9df1d32e	Merge pull request #7929 from fidencio/topic/use-tcp-port-ping-on-docker-nerdctl-tests ci: docker: nerdctl: Switch to tcp port 80 ping	2023-09-13 15:46:31 +02:00
Peng Tao	55ca7e8aec	Merge pull request #7907 from Xuanqing-Shi/7876/network-devices-naming-conflict runtime: Naming conflict of network devices	2023-09-13 19:29:41 +08:00
Fabiano Fidêncio	813bfdec01	ci: docker: nerdctl: Use io.containerd.kata-${KATA_HYPERVISOR}.io This will ensure that we're calling the correct binary for the hypervisor. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-13 13:10:14 +02:00
Fabiano Fidêncio	46bc0b1c01	ci: nerdctl: Create the containerd config Otherwise we'll fail to configure kata-containers in the `install-kata` step. This is mostly needed because the nerdctl-full tarball doesn't provide a containerd configuration, just the binary, as containerd does not actually require a configuration file to run with the default config. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-13 13:00:57 +02:00
Fabiano Fidêncio	13968aa7f6	ci: nerdctl: Switch to tcp port 80 ping TIL that the Azure VMs we use are created without an explicit outbound connectivity defined. This leads us to issues using `ping ...` as part of our tests, and when consulting Jeremi Piotrowski about the issue he pointed me to two interesting links: * https://learn.microsoft.com/en-us/azure/virtual-network/ip-services/default-outbound-access * https://learn.microsoft.com/en-us/archive/blogs/mast/use-port-pings-instead-of-icmp-to-test-azure-vm-connectivity For your own sanity, do not read the comments, after all this is the internet. :-) Anyways, the suggestion is to use nping instead, which is provided by the nmap package, so we can explicitly switch to using the tcp port 80 for the ping. With this in mind, I'm switching the image we use for the test and using one that provides nping as a possible entry point, and from now on (this part of) the tests should work. Fixes: #7910 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-13 13:00:57 +02:00
Fabiano Fidêncio	e0c811678b	ci: docker: Switch to tcp port 80 ping TIL that the Azure VMs we use are created without an explicit outbound connectivity defined. This leads us to issues using `ping ...` as part of our tests, and when consulting Jeremi Piotrowski about the issue he pointed me to two interesting links: * https://learn.microsoft.com/en-us/azure/virtual-network/ip-services/default-outbound-access * https://learn.microsoft.com/en-us/archive/blogs/mast/use-port-pings-instead-of-icmp-to-test-azure-vm-connectivity For your own sanity, do not read the comments, after all this is the internet. :-) Anyways, the suggestion is to use nping instead, which is provided by the nmap package, so we can explicitly switch to using the tcp port 80 for the ping. With this in mind, I'm switching the image we use for the test and using one that provides nping as a possible entry point, and from now on (this part of) the tests should work. Fixes: #7910 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-13 13:00:57 +02:00
shixuanqing	1636abbe1c	runtime: issue with non-empty []Endpoint in RemoveEndpoints In the RemoveEndpoints(), when the endpoints parameter isn't empty, using idx may result in wrong endpoint removals. To improve, directly passing the endpoint parameter helps locate the correct elements within n.eps. Fixes: #7732 Signed-off-by: shixuanqing <1356292400@qq.com> Update src/runtime/virtcontainers/network_linux.go Co-authored-by: Xuewei Niu <justxuewei@apache.org>	2023-09-13 09:47:18 +00:00
Peng Tao	9766f9090c	Merge pull request #7719 from beraldoleal/nullable Remove gogoproto.nullable extension	2023-09-13 15:11:56 +08:00
David Esparza	c2b2a00ad9	Merge pull request #7899 from GabyCT/topic/startdocker metrics: Ensure docker is running in init_env	2023-09-12 23:01:26 -06:00
Gabriela Cervantes	0aa073967d	metrics: Add iperf bandwidth value for qemu This PR adds the iperf bandwidth value for qemu for kata metrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-12 20:57:14 +00:00
Dan Mihai	c0ad914766	tests: fix kernel and initrd annotations Fix kernel and initrd annotations in the k8s tests on Mariner. These annotations must be applied to the spec.template for Deployment, Job and ReplicationController resources. Fixes: #7764 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2023-09-12 20:15:25 +00:00
Gabriela Cervantes	615c1cbf19	metrics: Add iperf bandwidth value for kata metrics This PR adds the iperf bandwidth value for kata metrics. Fixes #7924 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-12 19:30:24 +00:00
Gabriela Cervantes	d53eb73eec	metrics: Ensure docker is running in init_env This PR ensures that docker is running as part of the init_env function in kata metrics, to avoid failures where docker is not running and the kata metrics CI fails. Fixes #7898 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-12 19:13:09 +00:00
GabyCT	c0d502493e	Merge pull request #7921 from dborquez/metrics_disable_fio_test metrics: this PR skips the FIO test temporarily to fix issues	2023-09-12 12:08:48 -06:00
Gabriela Cervantes	ad08321b83	metrics: Add Cassandra Metrics documentation This PR adds the Cassandra Metrics documentation for kata metrics. Fixes #7922 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-12 16:30:35 +00:00
David Esparza	a58ea66592	metrics: this PR skips the FIO test temporarily to fix issues FIO test is showing ongoing issues when running in k8s. Working on running FIO on the ctr client which has been shown to be stable. Fixes: #7920 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-09-12 10:23:57 -06:00
Fabiano Fidêncio	2d8447fc6b	Merge pull request #7916 from fidencio/topic/add-functional-nerdctl-tests ci: Add a very basic nerdctl sanity test	2023-09-12 17:47:08 +02:00
James O. D. Hunt	7feb8de9dc	Merge pull request #7887 from jodh-intel/hypervisor-remove-debug-kernel-options runtime-rs: hypervisor: Remove debug kernel options	2023-09-12 16:31:48 +01:00
Fabiano Fidêncio	f536ef5ce1	ci: docker: Also run the smoke test with runc This will help us to make sure that the failure is actually related to Kata Containers. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-12 16:54:02 +02:00
Fabiano Fidêncio	c83f167c59	ci: docker: Run the tests after the kata-static is created There's no reason to wait till the payload is created to run the tests, as we rely on the tarball, not on the kata-deploy payload. That was a mistake on my side, and that's already fixed for the nerdctl tests. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-12 16:53:47 +02:00
Fabiano Fidêncio	12d833d07d	ci: Add a very basic nerdctl sanity test Let's add a very basic sanity test to check that we can spawn a container using nerdctl + Kata Containers. This will ensure that, at least, we don't regress to the point where this feature doesn't work at all. In the future, we should also test all the VMMs with devmapper, but that's for a follow-up PR after this test is working as expected. Fixes: #7911 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-12 16:52:55 +02:00
Greg Kurz	be71a0ab4e	Merge pull request #7811 from stevenhorsman/bump-rust-to-1.72 versions: Bump rust version	2023-09-12 15:30:35 +02:00
Fabiano Fidêncio	b020912629	Merge pull request #7913 from fidencio/topic/add-functional-docker-tests ci: Add a very basic docker sanity test	2023-09-12 15:28:49 +02:00
Fabiano Fidêncio	348b8644d6	ci: Add a very basic docker sanity test Let's add a very basic sanity test to check that we can spawn a container using docker + Kata Containers. This will ensure that, at least, we don't regress to the point where this feature doesn't work at all. For now we're running this test against Cloud Hypervisor and QEMU only, due to an already reported issue with dragonball: https://github.com/kata-containers/kata-containers/issues/7912 In the future, we should also test all the VMMs with devmapper, but that's for a follow-up PR after this test is working as expected. Fixes: #7910 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-12 15:15:26 +02:00
stevenhorsman	a75fd5eb81	runk: Fix rust unnecessary mut error - Fix `error: variable does not need to be mutable` in rust 1.72 Fixes: #7902 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2023-09-12 11:31:49 +01:00
stevenhorsman	a31c145172	kata-ctl: useless-vec warning - Fix clippy::useless-vec warning Fixes: #7902 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2023-09-12 11:31:49 +01:00
stevenhorsman	c8419fc3bb	kata-ctl: Resolve non-minimal-cfg warning - In rust 1.72, clippy warned clippy::non-minimal-cfg as the cfg has only one condition, so doesn't need to be wrapped in the any combinator. Fixes: #7902 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2023-09-12 11:31:49 +01:00
stevenhorsman	3eaf68d954	agent-ctl: Allow clippy lint - Allow `clippy::redundant-closure-call` which has issues with the guard function passed into the `run_if_auto_values` macro Fixes: #7902 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2023-09-12 11:31:49 +01:00
stevenhorsman	1d8b78959d	runtime-rs: Fix useless-vec warning Fix clippy::useless-vec warning Fixes: #7902 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2023-09-12 11:31:49 +01:00
stevenhorsman	99f3d69e94	runtime-rs: Remove mut Fix `error: variable does not need to be mutable` Fixes: #7902 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2023-09-12 11:31:49 +01:00
stevenhorsman	16fbc27b09	dragonball: Allow ambiguous-glob-reexports The bindgen generated code is triggering lots of ambiguous-glob-reexports warnings in rust 1.70+ Fixes: #7902 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2023-09-12 11:31:49 +01:00
stevenhorsman	bbf1919516	dragonball: Resolve non-minimal-cfg warning - In rust 1.72, clippy warned clippy::non-minimal-cfg as the cfg has only one condition, so doesn't need to be wrapped in the all combinator. Fixes: #7902 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2023-09-12 11:31:49 +01:00
stevenhorsman	75cfdd5d59	agent: config: Allow clippy lint - Allow `clippy::redundant-closure-call` in `from_cmdline` which has issues with the guard function passed into the `parse_cmdline_param` macro Fixes: #7902 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2023-09-12 11:31:49 +01:00
stevenhorsman	f3a0fd5907	agent: config: Fix useless-vec warning Fix clippy::useless-vec warning Fixes: #7902 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2023-09-12 11:31:49 +01:00
stevenhorsman	9e423bd3d6	libs: Fix clippy unnecessary hashes error - Fix error: unnecessary hashes around raw string literal Fixes: #7902 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2023-09-12 11:31:49 +01:00
stevenhorsman	444395050a	versions: Bump rust version Bump rust to 1.72.0 to test what extra warnings/issues we get Fixes: #7902 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2023-09-12 11:31:49 +01:00
Yipeng Yin	a16b0962b5	chore(cargo): update cargo lock Update cargo lock for runtime-rs, agent and kata-ctl. Signed-off-by: Yipeng Yin <yinyipeng@bytedance.com>	2023-09-12 15:27:38 +08:00
Chao Wu	c800d0739f	Merge pull request #7889 from UiPath/fix-dragonball-build dragonball: fix for non-deterministic builds	2023-09-12 14:06:18 +08:00
shixuanqing	ca4b6b051d	runtime: Naming conflict of network devices When creating a new endpoint, we check existing endpoint names and automatically adjust the naming of the new endpoint to ensure uniqueness. Fixes: #7876 Signed-off-by: shixuanqing <1356292400@qq.com>	2023-09-12 04:29:51 +00:00
Guixiong Wei	202049f35e	feat(runtime-rs): introduce huge page type to select VM RAM's backend This commit allows us to specify the huge page backend when enabling huge page. Currently, we support two backends: thp and hugetlbfs, the default is hugetlbfs. To ensure backward compatibility, we introduce another configuration item "hugepage_type" to select the memory backend, which is available only when "enable_hugepages" is true. Besides, we add an annotation "io.katacontainers.config.hypervisor.hugepage_type" to configure huge page type per pod. Fixes: #6703 Signed-off-by: Guixiong Wei <weiguixiong@bytedance.com> Signed-off-by: Yipeng Yin <yinyipeng@bytedance.com>	2023-09-12 11:28:27 +08:00
Zhongtao Hu	e1f54f96d0	Merge pull request #7766 from Apokleos/wrap-vsock-virtiofs runtime-rs: bring hybrid vsock devices in manager.	2023-09-12 09:27:34 +08:00
GabyCT	af29eeb8b1	Merge pull request #7901 from fidencio/topic/ci-target-branch-fixes-follow-up-3 ci: use github.ref_name instead of $GITHUB_REF_NAME	2023-09-11 15:31:29 -06:00
Fabiano Fidêncio	f811b064ca	ci: use github.ref_name instead of $GITHUB_REF_NAME As, regardless of what's mentioned in the documentation, it seems that $GITHUB_REF_NAME is passed down as a literal string. Fixes: #7414 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-11 22:14:55 +02:00
Fabiano Fidêncio	dc0b350e49	Merge pull request #7900 from fidencio/topic/ci-target-branch-fixes-follow-up-2 ci: Add more target-branch related fixes	2023-09-11 21:26:26 +02:00
Fabiano Fidêncio	6d795c089e	ci: Add more target-branch related fixes The ones for payload-after-push.yaml and ci-nightly.yaml are not that important right now, but they're needed for when we start running those on stable branches as well. The other ones were missed during `bd24afcf73`. Fixes: #7414 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-11 20:42:57 +02:00
Fabiano Fidêncio	07d0ad0ad7	Merge pull request #7897 from fidencio/topic/ci-devmapper-do-the-rebase-as-well ci: Fix target-branch usage	2023-09-11 20:30:53 +02:00
Fabiano Fidêncio	d7f991d139	Merge pull request #7151 from Yuan-Zhuo/fix-systemd-cgroup agent: optimize the code of systemd cgroup manager	2023-09-11 20:15:51 +02:00
Fabiano Fidêncio	8509c31870	ci: Fix target-branch usage We missed those as part of `bd24afcf73`. Fixes: #7414 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-11 20:10:27 +02:00
Gabriela Cervantes	060499dcae	metrics: Remove warning from metrics documentation Now that the metrics migration from the tests to kata containers has been completed, this PR removes the warning from the main metrics documentation. Fixes #7894 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-11 16:41:48 +00:00
GabyCT	b384757ac7	Merge pull request #7874 from fidencio/topic/manually-rebase-branches-atop-of-the-target-one gha: Manually rebase PR atop of the target branch before testing	2023-09-11 10:35:01 -06:00
Fabiano Fidêncio	46e73cf7a2	Merge pull request #7884 from fidencio/topic/update-kernel-to-the-latest-lts-plus-bring-in-erofs-patches Update kernel to the latest LTS release (v6.1.52) and bring in erofs patches needed for the CC work	2023-09-11 13:58:43 +02:00
James O. D. Hunt	c0f697fcc5	runtime: Allow kernel_params annotation To support the removal of the `initcall_debug` and `earlyprintk=` options from the default guest kernel cmdline, add `kernel_params` to the list of enabled annotations to allow those kernel options (or others) to be set using `kata-deploy` for either runtime. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-09-11 12:12:12 +01:00
Alexandru Matei	b03e49794e	dragonball: fix for non-deterministic builds Fixes: #7888 Signed-off-by: Alexandru Matei <alexandru.matei@uipath.com>	2023-09-11 14:07:10 +03:00
Fabiano Fidêncio	93bad13769	Merge pull request #7875 from fidencio/topic/kata-deploy-fix-arm64-image-build kata-deploy: Fix aarch64 image build	2023-09-11 11:36:52 +02:00
James O. D. Hunt	976d10150c	runtime-rs: hypervisor: Remove debug kernel options Removed the following kernel command line options: - `earlyprintk=ttyS0` - `initcall_debug` Both these options are only useful when debugging a guest kernel failure which is not a common occurrence. Further, the `earlyprintk=` option can have a large negative performance impact (it can increase the VM boot time significantly). If the user wishes to use either of these options, they can add them to the `kernel_params=` setting in the Kata configuration file's hypervisor stanza. Fixes: #7886. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-09-11 09:43:39 +01:00
Fabiano Fidêncio	fde34610cd	kernel: Add erofs patches needed for CC related work All the patches have already been merged upstream and they've just been cherry-picked to this branch. Fixes: #7885 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-11 10:39:37 +02:00
Fabiano Fidêncio	dc6a4588a2	versions: Bump kernel to the latest LTS release (6.1.52) We're bumping here in order to make our lives easier backporting EROFS patches needed for the CC related work. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-11 10:32:16 +02:00
James O. D. Hunt	52f6449b70	kata-manager: Remove initcall_debug kernel option Removed the addition of the `initcall_debug` kernel option when agent debugging enabled. This option has nothing to do with the agent. If the user wishes to use this option, they can add it to the `kernel_params=` setting in the Kata configuration file's hypervisor stanza. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-09-11 09:31:44 +01:00
Fabiano Fidêncio	6cd5d83a37	Merge pull request #7865 from gkurz/fix-more-virtiofs-args runtime: Fix more virtiofs args	2023-09-09 21:30:16 +02:00
Fabiano Fidêncio	8b4a0b368f	kata-deploy: Remove curl after it's used There's no need to keep curl there after the kubectl binary has already been downloaded. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-09 10:52:05 +02:00
Fabiano Fidêncio	139c7f03ab	kata-deploy: Fix aarch64 image build Similarly to what's been done for x86_64 -> amd64, we need to do an aarch64 -> arm64 change in order to be able to download the kubectl binary. Fixes: #7861 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-09 10:51:52 +02:00
Fabiano Fidêncio	94f5a69346	Merge pull request #7862 from fidencio/topic/kata-deploy-use-alpine-as-base-image kata-deploy: Switch to an alpine image	2023-09-09 09:02:13 +02:00
Yuan-Zhuo	470d065415	agent: optimize the code of systemd cgroup manager 1. Directly support CgroupManager::freeze through systemd API. 2. Avoid always passing unit_name by storing it into DBusClient. 3. Realize CgroupManager::destroy more accurately by killing systemd unit rather than stop it. 4. Ignore no such unit error when destroying systemd unit. 5. Update zbus version and corresponding interface file. Acknowledgement: error handling for no such systemd unit error refers to Fixes: #7080, #7142, #7143, #7166 Signed-off-by: Yuan-Zhuo <yuanzhuo0118@outlook.com> Signed-off-by: Yohei Ueda <yohei@jp.ibm.com>	2023-09-09 13:56:43 +08:00
GabyCT	fa818bfad1	Merge pull request #7867 from GabyCT/topic/optimizedimage metrics: Use TensorFlow optimized image	2023-09-08 11:34:21 -06:00
Fabiano Fidêncio	bd24afcf73	gha: Manually rebase PR atop of the target branch before testing We're changing what's been done as part of `ac939c458c`, as we've noticed issues using `github.event.pull_request.merge_commit_sha`. Basically, whenever a force-push would happen, the reference of merge_commit_sha wouldn't be updated, leading us to test PRs with the old code. :-/ In order to get the rebase properly working, we need to ensure we pull the hash of the commit as part of checkout action, and ensure fetch-depth is set to 0. Fixes: #7414 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-08 18:56:31 +02:00
GabyCT	dc7414f5c1	Merge pull request #7870 from dborquez/metrics_fio_fix_clean_env_order metrics: fix FIO test initialization	2023-09-08 10:28:10 -06:00
Greg Kurz	72c510d057	runtime/virtiofsd: Drop all references to "--cache=none" This syntax belongs to the legacy C virtiofsd implementation that we don't support anymore since kata-containers 3.1.3 because of other API breaking changes. People have been warned to switch from "none" to "never" since kata-containers 2.5.2. Let's officially do that. The compat code that would convert "none" to "never" isn't needed anymore. Just drop it. Fixes #7864 Signed-off-by: Greg Kurz <groug@kaod.org>	2023-09-08 17:57:30 +02:00
Beraldo Leal	ead724bec1	protocol: removing gogo.nullable feature gogo.nullable is the main gogo.protobuf feature used here. Since we are trying to remove gogo.protobuf, the first reasonable step seems to be removing this feature. This is a core update, and it will change how the structs are defined. I could spot only a few places using those structs, based on make check/build. Fixes #7723. Signed-off-by: Beraldo Leal <bleal@redhat.com>	2023-09-08 11:49:01 -04:00
Beraldo Leal	d8e4bb9859	protocol: remove unused PROTO_FILE env There is no reference to PROTO_FILE and this is not working. Also we are not inside a Makefile, so makes sense to adapt the usage to reflect the script instead of a make command. Signed-off-by: Beraldo Leal <bleal@redhat.com>	2023-09-08 11:49:01 -04:00
Beraldo Leal	5e1106a770	protocol: remove unused import_path import_path is used as the default package when no input files specify go_package. However, all the files we are currently building already have a go_package definition, making this behavior both redundant and error-prone. Additionally, one of our files (types.pb.go) resides outside the grpc directory, indicating that it's indeed ignored but also inconsistent. Signed-off-by: Beraldo Leal <bleal@redhat.com>	2023-09-08 11:49:01 -04:00
Beraldo Leal	87accaaecb	protocol: use workdir during build Currently, the script searches for .proto files within $GOPATH/. Consequently, modifications to a definition file in the current working directory won't influence the output .pb.go if the directory is outside of $GOPATH. For developers, it's more intuitive to alter the local codebase than the version stored in $GOPATH. With this modification, the generated .pb.go files will be relative to the current working directory, removing the need to clone this project under $GOPATH/src/github.com/kata-containers. Signed-off-by: Beraldo Leal <bleal@redhat.com>	2023-09-08 11:49:01 -04:00
Beraldo Leal	711a7ed965	protocol: remove mapping definitions The definitions are already specified in the .proto files using the go_package option. Centralizing them in one location reduces the potential for errors and simplifies the script. Signed-off-by: Beraldo Leal <bleal@redhat.com>	2023-09-08 11:49:01 -04:00
Beraldo Leal	8db84c1bd2	protocol: force GOPATH to be set Currently, if GOPATH is not set, errors will be raised since protoc is using GOPATH to find packages. Signed-off-by: Beraldo Leal <bleal@redhat.com>	2023-09-08 11:49:01 -04:00
Beraldo Leal	68156d77ac	protocol: breaking lines to improve readability Just a small change to improve the readability of modules before the actual changes. Signed-off-by: Beraldo Leal <bleal@redhat.com>	2023-09-08 11:49:01 -04:00
Fabiano Fidêncio	670a8e9c73	kata-deploy: Switch to an alpine image This will make our image smaller, and still ensure its multi-arch support. Fixes: #7861 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-08 17:39:51 +02:00
Fabiano Fidêncio	0b26a5d053	Merge pull request #7871 from fidencio/topic/ci-add-k8s-devmapper-tests-follow-up-3 ci: k8s: Add clean-up-garm argument for gha-run.sh	2023-09-08 17:27:57 +02:00
Fabiano Fidêncio	9d74b7ccc9	k8s: ci: Skip "Pod quota" test with firecracker The test is failing, and an issue has been opened to track it. For now, let's skip it. Issue: https://github.com/kata-containers/kata-containers/issues/7873 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-08 15:51:46 +02:00
Fabiano Fidêncio	f6cd3930c5	ci: k8s: Remove useless skip statement from tests There's absolutely no need to have the skip check as part of the test itself when it's already done as part of the setup function. We're only touching the files here that were touched in the previous commit. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-08 14:25:29 +02:00
Fabiano Fidêncio	3cc20b47a6	ci: k8s: Also check for "fc" (for firecracker) Let's keep both checks for now, but in the future we'll be able to remove the check for "firecracker", as the hypervisor name used as part of the GitHub Actions has to match what's used as part of the kata-deploy stuff, which is `fc` (as in `kata-fc` for the runtime class) instead of `firecracker`. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-08 14:25:24 +02:00
Fabiano Fidêncio	b5bad3cb0f	ci: k8s: Add clean-up-garm argument for gha-run.sh The tests are failing to finish as the argument is invalid. Fixes: #6542 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-08 14:04:50 +02:00
Fabiano Fidêncio	05e2e7636e	Merge pull request #7868 from fidencio/topic/ci-add-k8s-devmapper-tests-follow-up-2 ci: k8s: Second round of fix-ups with the devmapper CI	2023-09-08 11:02:20 +02:00
Fabiano Fidêncio	aaec5a09f3	ci: k8s: devmapper tests should be using ubuntu 20.04 That's what we've been using as part of Jenkins, so let's ensure things will work as they did before, and only after that consider upgrading the base OS used for the tests. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-08 10:09:04 +02:00
Fabiano Fidêncio	27fa7d828d	ci: k8s: Add a kata-deploy-garm target We've been using the `kata-deploy-tdx` target as that also uses k3s as base, but it's better to just have a specific garm target. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-08 10:09:04 +02:00
Fabiano Fidêncio	fa62a4c01b	ci: k8s: Export KUBERNETES env var So we have better control over which flavour of kubernetes kata-deploy is expected to be targeting. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-08 10:09:04 +02:00
Fabiano Fidêncio	8c9380a798	ci: k8s: Install bats on GARM runners GARM runners do not come with the whole set of tools we need, or are used to when it comes to the GHA runners, so we need to manually install bats on those. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-08 10:09:04 +02:00
Fabiano Fidêncio	3de23034f8	ci: k8s: Wait some time after restarting k3s Let's put a 1 minute sleep, just to make sure everything is back up again. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-07 23:46:58 +02:00
David Esparza	adfea55b8f	metrics: fix FIO test initialization This PR changes the order in which the FIO test first cleans the environment and then checks if the environment is indeed clean. Fixes: #7869 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-09-07 15:41:59 -06:00
Fabiano Fidêncio	2df183fd99	ci: k8s: Append, instead of overwrite, the devmapper config As we were using `tee` without the `-a` (or `--append`) option, the containerd config would be overwritten, leading to a NotReady state of the Node. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-07 23:12:55 +02:00
Fabiano Fidêncio	369a8af8f7	ci: k8s: Decrease k3s sleep from 4 to 2 minutes It should be plenty, and worked well in local tests. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-07 23:12:55 +02:00
Fabiano Fidêncio	ada65b988a	ci: k8s: Use vanilla kubectl with k3s Let's download the vanilla kubectl binary into `/usr/bin/`, as we need to avoid hitting issues like: ```sh error: open /etc/rancher/k3s/k3s.yaml.lock: permission denied ``` The issue basically happens because k3s links `/usr/local/bin/kubectl` to `/usr/local/bin/k3s`, and that does extra stuff that vanilla `kubectl` doesn't do. Also, in order to properly use the k3s.yaml config with the vanilla kubectl, we're copying it to ~/.kube/config. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-07 23:12:55 +02:00
Fabiano Fidêncio	ad45ab5d33	ci: k8s: Ensure k3s is deployed with --write-kubeconfig-mode=644 Otherwise the /etc/rancher/k3s/k3s.yaml is not readable by other users than root. As --write-kubeconfig-mode is being passed, and that's an option that has to be passed to the `server`, -s is also added to the command line. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-07 23:12:55 +02:00
Fabiano Fidêncio	028a97e0d5	ci: k8s: Use the proper command for sleep `wait` waits for a job to complete, not a number of seconds. Not sure how I got that wrong in the first place, but it is what it is. Fixes: #6542 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-07 23:12:55 +02:00
David Esparza	34f580901f	Merge pull request #7824 from dborquez/fix_memory_usage_initialization metrics: re-enable memory-usage initialization step	2023-09-07 14:24:27 -06:00
Gabriela Cervantes	3a427795ea	metrics: Use TensorFlow optimized image This PR replaces the ubuntu image for one which has TensorFlow optimized for kata metrics. Fixes #7866 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-07 15:38:51 +00:00
Chao Wu	cd8c217ee1	Merge pull request #6879 from openanolis/chao/update_upstream_upcall_feature Dragonball: optimize the placement of dbs-upcall features	2023-09-07 18:07:53 +08:00
Fabiano Fidêncio	dfa1cce916	Merge pull request #7860 from fidencio/topic/ci-add-k8s-devmapper-tests-follow-up-1 ci: k8s: Fix typo in run-k8s-tests-on-garm.yaml	2023-09-07 11:48:30 +02:00
Fabiano Fidêncio	8d99972a8a	ci: k8s: Fix typo in run-k8s-tests-on-garm.yaml integrations -> integration integrtion -> integration Fixes: #6542 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-07 11:31:30 +02:00
Fabiano Fidêncio	0483d3d16d	Merge pull request #7841 from fidencio/topic/ci-add-k8s-devmapper-tests ci: k8s: Add k8s devmapper tests (part 0)	2023-09-07 10:53:09 +02:00
Jeremi Piotrowski	f6cc01d77c	Merge pull request #7833 from jepio/kata-static-fix-ownership kata-deploy: Create kata-static.tar with correct ownership	2023-09-07 10:16:23 +02:00
Peng Tao	435e890cd9	Merge pull request #7703 from bergwolf/github/nerdctl-fc runtime: run prestart hooks before starting VM for FC	2023-09-07 10:55:31 +08:00
Chao Wu	deed1b927d	Dragonball: optimize the placement of dbs-upcall features Currently, the dbs-upcall features have 2 problems that need to be fixed: there are redundant dbs-upcall features that need to be removed, and some places that should be controlled by dbs-upcall are not. This commit fixes both problems. fixes: #6878 Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2023-09-07 10:27:29 +08:00
Fabiano Fidêncio	0e8bd50cbb	ci: k8s: Add k8s devmapper tests (part 0) Let's enable the devmapper kubernetes tests to match exactly what's been tested as part of the Jenkins CI. Fixes: #6542 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-06 23:08:38 +02:00
Fabiano Fidêncio	b28b54df04	ci: k8s: Add a function to configure devmapper for containerd This function right now is completely based on what's part of the tests repo[0], and that's the reason I'm keeping the `Signed-off-by` of all the contributors to that file. This is not perfect, though, as it changes the default snapshotter to devmapper, instead of only doing so for the Kata Containers specific runtime handlers. OTOH, this is exactly what we've always been doing as part of the tests. We'll improve it, soon enough, when we get to also add a way for kata-deploy to set up different snapshotters for different handlers. But, for now, this is as good (or as bad) as it's always been. It's important to note that the devmapper setup doesn't take into consideration a BM machine, and this is not suitable for that. We're really only targeting GHA runners which will be thrown away after the run is over. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Signed-off-by: Shiming Zhang <wzshiming@foxmail.com> Signed-off-by: Marcel Apfelbaum <marcel@redhat.com> Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-06 23:08:17 +02:00
Fabiano Fidêncio	54f7117212	ci: k8s: Add a function to deploy k3s One can use different kubernetes flavours for getting a kubernetes cluster up and running. As part of our CI, though, I really would like to avoid contributors spending time maintaining and updating kubernetes dependencies, as done with the tests repo, and which has been proven to be really good on getting things rotten. With this in mind, I'm taking the bullet and using "k3s" as the way to deploy kubernetes for the devmapper related tests, and that's the reason I'm adding a function to do so, and this will be used later on as part of this series. It's important to note that the k3s setup doesn't take into consideration a BM machine, and this is not suitable for that. We're really only targeting GHA runners which will be thrown away after the run is over. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-06 23:07:41 +02:00
David Esparza	cf258090aa	Merge pull request #7843 from GabyCT/topic/ffiolimit metrics: Add write 95 percentile FIO value	2023-09-06 14:52:00 -06:00
Fabiano Fidêncio	c5e1e7ddc3	Merge pull request #7854 from fidencio/topic/runtime-allow-virtio_fs_extra_args-annotation runtime: Allow virtio_fs_extra_args annotation	2023-09-06 19:20:40 +02:00
Greg Kurz	81536f21af	runtime/qemu: Pass "--xattr" to virtiofsd instead of "-o xattr" The "-o" syntax belongs to the legacy C virtiofsd. It is deprecated with the rust implementation. Signed-off-by: Greg Kurz <groug@kaod.org>	2023-09-06 17:50:35 +02:00
Fabiano Fidêncio	b1dd09a4d3	runtime: Allow virtio_fs_extra_args annotation Some use cases may just require passing extra arguments to virtiofsd, and having this disabled by default makes it impossible to set when using kata-deploy, as changes in the configuration file would be overwritten by the daemon-set. With this in mind, let's allow users to pass whatever they need (and here I'm specifically looking at `--xattr`) as a virtio_fs_extra_arg. Fixes: #7853 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-06 17:11:16 +02:00
Hyounggyu Choi	d27fe18167	Merge pull request #7849 from BbolroC/hot-fix-dockerbuild packaging: do not install docker-compose-plugin for s390x\|ppc64le	2023-09-06 13:13:25 +02:00
Hyounggyu Choi	2efda20c77	packaging: do not install docker-compose-plugin for s390x\|ppc64le This PR is to skip installing docker-compose-plugin while building a `build-kata-deploy` image for s390x\|ppc64le. It is a temporary solution to fix current CI failures for s390x regarding `hash sum mismatch`. Fixes: #7848 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2023-09-06 11:12:03 +02:00
Zhongtao Hu	aa85e0b3ec	Merge pull request #7714 from justxuewei/volumes-cleanup runtime-rs: Fix volumes and rootfs cleanup issues	2023-09-06 10:13:55 +08:00
Gabriela Cervantes	438fbf9669	metrics: Add write 95 percentile for FIO for qemu This PR adds the write 95 percentile for FIO for qemu for checkmetrics for kata metrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-05 22:50:31 +00:00
Gabriela Cervantes	024b4d2ffe	metrics: Add write 95 percentile FIO value This PR adds the write 95 percentile FIO value for checkmetrics for kata metrics. Fixes #7842 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-05 21:00:05 +00:00
GabyCT	3e3a91fd2c	Merge pull request #7577 from GabyCT/topic/enableiperfm metrics: Enable iperf benchmark on gha for kata metrics	2023-09-05 14:53:47 -06:00
Gabriela Cervantes	e98e5cdea2	metrics: Add checkmetrics to gha run script This PR adds the checkmetrics to gha run script. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-05 17:05:03 +00:00
Gabriela Cervantes	c1edfe5511	metrics: Add checkmetrics value for qemu for iperf This PR adds the checkmetrics value for qemu for iperf benchmark. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-05 16:04:52 +00:00
Gabriela Cervantes	6a79ecedf9	metrics: Add jitter value for clh This PR adds jitter value for clh for iperf metrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-05 16:04:52 +00:00
Gabriela Cervantes	f609a9a754	metrics: Add test selector to iperf metrics This PR adds test selector to iperf metrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-05 16:04:52 +00:00
Gabriela Cervantes	5b8db30422	metrics: Enable iperf benchmark on gha for kata metrics This PR enables the iperf benchmark to run on the gha for kata metrics. Fixes #7575 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-09-05 16:04:52 +00:00
Jeremi Piotrowski	cf46b056fd	Merge pull request #7839 from openanolis/chao/switch_to_azure CI: switch static-checks-dragonball CI machines to Azure	2023-09-05 10:59:02 +02:00
Chao Wu	60f733d301	CI: switch static-checks-dragonball CI machines to Azure Previously, static-checks-dragonball is using machines from Alibaba Cloud to run all the CI jobs. Currently, we are going through an internal process to apply for the new machines for Dragonball CI. Before the internal process is over, we will temporarily use Azure VM to run static-checks-dragonball jobs. fixes: #7838 Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2023-09-05 15:19:07 +08:00
alex.lyn	7870b33a2d	runtime-rs: bring hybridVsock devices in manager. Currently, virtio_vsock are still outside of the device manager. This causes some management issues, such as the inability to unify PCI address management. Just do some work for hybrid vsock. Fixes: #7655 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-09-05 08:46:56 +08:00
Jeremi Piotrowski	18c94ebbe3	kata-deploy: Create kata-static.tar with correct ownership Pass --owner and --group to the tar invocation to prevent github runner user from leaking into release artifacts. Fixes: #7832 Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-09-04 17:24:00 +02:00
Fabiano Fidêncio	b663ec21ac	Merge pull request #7803 from GabyCT/topic/readmereportdoc metrics: Add README for kata metrics report	2023-09-03 21:57:13 +02:00
Fabiano Fidêncio	e490b0bc76	Merge pull request #7808 from ManaSugi/fix/remove-manual-chcon osbuilder: Remove chcon operation for guest SELinux	2023-09-03 21:55:02 +02:00
Fabiano Fidêncio	27dab249a0	Merge pull request #7800 from jodh-intel/kata-sys-util-update-tdx-protection-checks kata-sys-util: protection: Update TDX checks	2023-09-02 14:47:51 +02:00
Jiang Liu	d5729e818c	Merge pull request #7819 from jiangliu/storage-cleanup Improve the way to clean up storage devices for sandbox	2023-09-02 17:02:51 +08:00
Jiang Liu	57e7bf14a6	agent: refine StorageDeviceGeneric::cleanup() Refine StorageDeviceGeneric::cleanup() to improve safety. Signed-off-by: Jiang Liu <gerry@linux.alibaba.com>	2023-09-02 14:22:21 +08:00
Jiang Liu	53edb19374	agent: implement StorageDeviceGeneric::cleanup() Refactor cleanup_sandbox_storage as StorageDeviceGeneric::cleanup(). Signed-off-by: Jiang Liu <gerry@linux.alibaba.com>	2023-09-02 14:00:26 +08:00
Jiang Liu	0c63453e28	types: make StorageDevice::cleanup() return possible error code Make StorageDevice::cleanup() return possible error code. Fixes: #7818 Signed-off-by: Jiang Liu <gerry@linux.alibaba.com>	2023-09-02 13:27:06 +08:00
Jiang Liu	3a3d77b3b5	agent: move StorageDeviceGeneric from kata-types into agent Move StorageDeviceGeneric from kata-types into agent, so we can refactor code later. Signed-off-by: Jiang Liu <gerry@linux.alibaba.com>	2023-09-02 13:12:17 +08:00
Jiang Liu	d848126b61	Merge pull request #7821 from jiangliu/storage-leak agent: avoid possible leakage of storage device	2023-09-02 12:40:40 +08:00
Fabiano Fidêncio	4f92e6df90	Merge pull request #7683 from microsoft/danmihai1/policy-tests tests: add policy to existing tests	2023-09-01 23:52:15 +02:00
David Esparza	b151cfd140	metrics: re-enable memory-usage initialization step This PR re-enables the initialization step disabled on `538c965c2b`. Fixes: #7804 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-09-01 14:29:34 -06:00
Fabiano Fidêncio	f3e1a6a94f	osbuilder: alpine: Change mirror As we're hitting a lot of: ``` ERROR: https://dl-5.alpinelinux.org/alpine/v3.18/main: operation timed out ``` Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-01 16:01:42 +00:00
Fabiano Fidêncio	ac612aef5e	osbuilder: alpine: Match the version on versions.yaml We've switched to 3.18 as part of `82cd14ba39`. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-01 16:01:33 +00:00
Jiang Liu	9cd706d1c9	agent: avoid possible leakage of storage device When a storage device is used by more than one container, the second and subsequent instances will cause storage device reference count leakage, thus causing storage device leakage. The reason is: add_storages() will increase reference count of existing storage device, but forgets to add the device to the `mount_list` array, thus leaking the reference count. Fixes: #7820 Signed-off-by: Jiang Liu <gerry@linux.alibaba.com>	2023-09-01 22:52:42 +08:00
Dan Mihai	bf21411e90	tests: add policy to k8s tests Use AGENT_POLICY=yes when building the Guest images, and add a permissive test policy to the k8s tests for: - CBL-Mariner - SEV - SNP - TDX Also, add an example of policy rejecting ExecProcessRequest. Fixes: #7667 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2023-09-01 14:28:08 +00:00
Dan Mihai	d0e0610679	runtime: config: use the SEV initrd for SNP Thanks Unmesh Deodhar! Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2023-09-01 14:28:08 +00:00
Fabiano Fidêncio	67fed26f18	runtime: Use TDX image with in the qemu-tdx config Let's make sure we use the TDX image as part of the QEMU TDX configuration, which will help us to have the policies tested here. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-01 14:28:08 +00:00
Fabiano Fidêncio	f65ffb23da	Merge pull request #7814 from fidencio/topic/gha-rebase-prs-atop-of-main-for-the-tests gha: Rebase PR atop of the target branch before testing	2023-09-01 16:26:32 +02:00
Fabiano Fidêncio	ef70aeb6b8	Merge pull request #7817 from fidencio/topic/update-alpine-to-its-latest-release versions: Update alpine to its 3.18 version	2023-09-01 14:51:58 +02:00
Fabiano Fidêncio	ac939c458c	gha: Rebase atop of the target branch There are two scenarios we care about here: a job triggered by a `pull_request` event and one triggered by a `pull_request_target` event. `pull_request` event: When using the checkout action, it'll already provide a "rebased atop of main" repo for us, nothing else is needed, and that's basically what we already have as part of the jobs in our CI. `pull_request_target` event: This one is a little bit tricky, as the checkout action, unless passing a specific repo, gives us the PR checked out rebased atop of the HEAD of the PR branch. Jeremi Piotrowski nicely pointed out that we could use github.event.pull_request.merge_commit_sha instead, which is the result of the PR's branch with the official repo target branch. Now, the only case where the contributor's rebase would still be needed is when the action itself has been changed. Fixes: #7414 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-01 11:23:31 +02:00
Jeremi Piotrowski	bde06758b1	Merge pull request #7761 from jepio/iocopy-fix-race runtime: Fix data race in ioCopy	2023-09-01 09:30:54 +02:00
Fabiano Fidêncio	82cd14ba39	versions: Update alpine to its 3.18 version 3.15 will reach end of life in 2 months from now. Fixes: #7816 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-31 23:02:54 +02:00
GabyCT	d75c7b5f9c	Merge pull request #7813 from GabyCT/topic/genreport metrics: Add grabdata script for metrics report	2023-08-31 13:33:38 -06:00
Gabriela Cervantes	6668825752	metrics: Add grabdata script for metrics report This PR adds the grabdata script so it can be used for the metrics report for kata metrics. Fixes #7812 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-31 16:17:29 +00:00
James O. D. Hunt	c290eaed8c	kata-sys-util: protection: Update TDX checks Update the protection checking code to detect newer versions of Intel TDX (whose userland interface has now stabilised). > Note: that we don't need to retain the existing behaviour since: > > - We haven't yet landed the TDX feature (#6448). > - Systems wishing to use TDX will need to use the latest available > system components (such as firmware and host kernel). Also added an explicit TDX unit test. Fixes: #7384. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-08-31 16:15:15 +01:00
Fabiano Fidêncio	d7a996c686	gha: Update to checkout@v3 action At this point we should always be using the latest checkout action. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-31 16:02:31 +02:00
Jeremi Piotrowski	d7612440b8	Merge pull request #7789 from beraldoleal/tests/amd Fixes tests on AMD machines	2023-08-31 11:23:51 +02:00
Jeremi Piotrowski	c2ba29c15b	runtime: Fix data race in ioCopy IoCopy is a tricky function (I don't claim to fully understand its contract), but here is what I see: The goroutine that runs it spawns 3 goroutines - one for each stream to handle (stdin/stdout/stderr). The goroutine then waits for the stream goroutines to exit. The idea is that when the process exits and is closed, the stdout goroutine will be unblocked and close stdin - this should unblock the stdin goroutine. The stderr goroutine will exit at the same time as the stdout goroutine. The iocopy routine then closes all tty.io streams. The problem is that the stdout goroutine decrements the WaitGroup before closing the stdin stream, which causes the iocopy goroutine to race to close the streams. Move the wg.Done() of the stdout routine past the close so that this race becomes impossible. I can't guarantee that this doesn't affect some unspecified behavior. Fixes: #5031 Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-08-31 10:17:38 +02:00
Manabu Sugimoto	211de08d9e	osbuilder: Remove chcon operation for guest SELinux Remove the `chcon` operation which adds `container_runtime_exec_t` label to the `kata-agent` binary because the container-selinux package including the `39f83cc74d` commit has been released officially. Ref. https://centos.pkgs.org/9-stream/centos-appstream-x86_64/container-selinux-2.221.0-1.el9.noarch.rpm.html The container-selinux package is installed in a guest rootfs when we create it with `SELinux = yes`, and `restorecon` sets `container_runtime_exec_t` to the `kata-agent`. Fixes: #7807 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2023-08-31 16:44:32 +09:00
GabyCT	b467f2ef68	Merge pull request #7772 from GabyCT/topic/fiolimit metrics: Enable FIO limits for kata metrics	2023-08-30 14:49:04 -06:00
Gabriela Cervantes	9f21fa9b39	metrics: Add report generator link to general documentation This PR adds the report generator link to general documentation. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-30 16:55:14 +00:00
Gabriela Cervantes	c0ed5ea0ad	metrics: Add README for kata metrics report This PR adds the README for kata metrics report. Fixes #7802 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-30 16:36:08 +00:00
Fabiano Fidêncio	aa2b51a831	Merge pull request #7783 from GabyCT/topic/makereport metrics: Add metrics report script	2023-08-30 17:11:39 +02:00
Gabriela Cervantes	a7b59a5bf9	metrics: Add limit for 90 percentile for qemu value This PR adds the limit for 90 percentile for qemu value for FIO kata metrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-30 13:53:38 +00:00
Gabriela Cervantes	99db6568e9	metrics: Add limit for write 90 percentile value for clh This PR adds the limit for write 90 percentile value for clh for FIO metrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-30 13:53:38 +00:00
Gabriela Cervantes	6e06392c55	metrics: Enable FIO limits for kata metrics This PR enables the FIO limits for kata metrics. Fixes #7771 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-30 13:53:38 +00:00
David Esparza	924d06a7f5	Merge pull request #7787 from GabyCT/topic/fixmemoryinsidelimit metrics: Fix memory inside limits for kata metrics	2023-08-30 07:45:17 -06:00
Peng Tao	2e4c874726	runtime/vc: runPrestartHooks should ignore GetHypervisorPid failure If we are running FC hypervisor, it is not started when prestart hooks are executed. So we should just ignore such error and just go ahead and run the hooks. Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2023-08-30 03:06:11 +00:00
Peng Tao	21204caf20	runtime: fail early when starting docker container with FC FC does not support network device hotplug. Let's add a check to fail early when starting containers created by docker. Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2023-08-30 02:52:01 +00:00
Peng Tao	32fd013716	runtime: run prestart hooks before starting VM for FC Add a new hypervisor capability to tell if it supports device hotplug. If not, we should run prestart hooks before starting new VMs as nerdctl is using the prestart hooks to set up netns. To make nerdctl + FC to work, we need to run the prestart hooks before starting new VMs. Fixes: #6384 Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2023-08-30 02:52:01 +00:00
Beraldo Leal	00e7ffd988	tests: check vmx only on Intel machines When running on amd machines, those tests will fail because there is no vmx flag. Following other tests that check for cpuType, let's adapt them to restrict vmx only on Intel machines. Fixes #7788. Related #5066 Signed-off-by: Beraldo Leal <bleal@redhat.com>	2023-08-29 20:04:31 -04:00
Gabriela Cervantes	c8dd3c0737	metrics: Fix memory footprint qemu limit This PR fixes the memory footprint qemu limit for kata metrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-29 22:51:21 +00:00
Gabriela Cervantes	8877ec62fb	metrics: Fix memory inside limits for kata metrics This PR fixes the memory inside limit for clh for kata metrics due to the recent changes that we had in the script which impacted the performance measurement. Fixes #7786 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-29 21:38:18 +00:00
Beraldo Leal	80146f2078	tests: Fixes cpuType check on AMD machines cpuType is not initialized yet, so it gets 0 (Intel) by default, failing on AMD machines. Fixes #7785 Signed-off-by: Beraldo Leal <bleal@redhat.com>	2023-08-29 17:04:07 -04:00
Gabriela Cervantes	7e364716dd	metrics: Add test setup details to metrics report This PR adds test setup details to metrics report. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-29 17:56:53 +00:00
Gabriela Cervantes	17dc1b9760	metrics: Add boot lifecycle times to metrics report This PR adds the boot lifecycle times to metrics report. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-29 17:55:44 +00:00
Gabriela Cervantes	3b0d6538f2	metrics: Add memory inside container to metrics report This PR adds memory inside container to metrics report. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-29 17:53:17 +00:00
Gabriela Cervantes	79fbb9d243	metrics: Add scaling system footprint in metrics report This PR adds scaling system footprint in metrics report. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-29 17:51:27 +00:00
Gabriela Cervantes	8e6d4e6f3d	metrics: Add metrics reportgen This PR adds metrics reportgen for kata metrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-29 17:45:36 +00:00
Gabriela Cervantes	139ffd4f75	metrics: Add report file titles This PR adds report file titles for kata metrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-29 17:43:06 +00:00
GabyCT	8f2dae7b53	Merge pull request #7775 from dborquez/fix_memory_usage_parsing_results metrics: fix parsing issue on memory-usage test	2023-08-29 11:26:13 -06:00
Gabriela Cervantes	878d1a2e7d	metrics: Generate PNGs alongside the PDF report This PR generates the PNGs for the kata metrics PDF report. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-29 16:50:32 +00:00
Gabriela Cervantes	fce2487971	metrics: Add metrics report R files This PR adds the metrics report R files. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-29 16:45:22 +00:00
Gabriela Cervantes	08812074d1	metrics: Add report dockerfile This PR adds the report dockerfile for kata metrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-29 16:28:32 +00:00
Gabriela Cervantes	69781fc027	metrics: Add metrics report script This PR adds metrics report script for kata metrics. Fixes #7782 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-29 16:25:14 +00:00
Chao Wu	e4fb20c74a	Merge pull request #7585 from lifupan/main dragonball: vsock add fifo/pipe stream support for passed fd hybridSt…	2023-08-29 23:39:21 +08:00
Fabiano Fidêncio	50e51bcafe	Merge pull request #7185 from UnmeshDeodhar/add-cc-sev-test tests: Add confidential test	2023-08-29 15:32:25 +02:00
Fabiano Fidêncio	e286e842c1	tests: Expand confidential test to support TDX Let's expand the confidential test to also support TDX. The main difference on the test, though, is that we're not grepping for a string in the `dmesg` output, but rather relying on `cpuid` to detect a TDX guest. Fixes: #7184 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-29 14:10:47 +02:00
Unmesh Deodhar	e31f099be1	tests: Expand confidential test to support SNP Let's expand the confidential test to also support SNP. Fixes: #7184 Signed-off-by: Unmesh Deodhar <udeodhar@amd.com>	2023-08-29 14:10:47 +02:00
Unmesh Deodhar	c3b9d4945e	tests: Add confidential test for SEV Add a test case for the launch of unencrypted confidential container, verifying that we are running inside a TEE. Right now the test only works with SEV, but it'll be expanded in the coming commits, as part of this very same series. Fixes: #7184 Signed-Off-By: Unmesh Deodhar <udeodhar@amd.com> Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-29 14:10:34 +02:00
David Esparza	538c965c2b	metrics: fix parsing issue on memory-usage test This PR fixes an issue in the parsing results stage, by collecting just the n-results from the n-running containers, discarding irrelevant data. Fixes: #7774 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-08-28 23:39:46 -06:00
Fabiano Fidêncio	708b0a3052	Merge pull request #7768 from fidencio/topic/update-tdx-to-the-6.2-kernel-based-stack tdx: Update the components needed for using the 6.2 kernel stack	2023-08-28 19:27:15 +02:00
Fabiano Fidêncio	3818bf3311	local-build: Remove $HOME/.docker/buildx/activity/default The file can be removed between builds without causing any issue, and leaving it around has been causing us some headache due to: ``` ERROR: open /home/runner/.docker/buildx/activity/default: permission denied ``` Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-28 13:41:36 +02:00
Fabiano Fidêncio	d1b54ede29	qemu: tdx: Workaround SMP issue with TDX 1.5 `...,sockets=1,cores=numvcpus,threads=1,...` must be used. Fixes: #7770 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-28 13:41:36 +02:00
Archana Shinde	1e34220c41	qemu: tdx: Adapt to the TDX 1.5 stack QEMU for TDX 1.5 makes use of private memory map/unmap. Make changes to govmm to support this. Support for private backing fd for memory is added as knob to the qemu config. Userspace's map/unmap operations are done by fallocate() ioctl on the backing store fd. Reference: https://lore.kernel.org/linux-mm/20220519153713.819591-1-chao.p.peng@linux.intel.com/ Fixes: #7770 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com> Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-28 13:41:36 +02:00
Fabiano Fidêncio	8115a0522d	versions: tdx: Update Kernel to 6.2 + TDX This is the version that's been used and tested inside Intel, and it matches with https://github.com/intel/tdx-tools/releases/tag/2023ww15. Fixes: #7770 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-28 13:11:34 +02:00
Fabiano Fidêncio	ec18180f34	versions: tdx: Update TDVF to the "edk2-stable202302" This is the version that's been used and tested inside Intel, and it matches with https://github.com/intel/tdx-tools/releases/tag/2023ww15. Fixes: #7770 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-28 13:11:34 +02:00
Fabiano Fidêncio	9803b24286	versions: tdx: Update QEMU to v7.2 + TDX v1.10 This is the version that's been used and tested inside Intel, and it matches with https://github.com/intel/tdx-tools/releases/tag/2023ww15. Fixes: #7770 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-28 13:11:27 +02:00
Fabiano Fidêncio	02a08c956b	Merge pull request #7754 from microsoft/danmihai1/pod-quota-deployment tests: delete k8s deployment at the test's end	2023-08-27 17:52:00 +02:00
Fabiano Fidêncio	98037ced52	Merge pull request #7755 from microsoft/danmihai1/unique-test-name tests: use unique test name	2023-08-27 17:27:40 +02:00
Zhongtao Hu	f0440a9cfe	Merge pull request #7742 from frezcirno/fix-log-forwarder-loop runtime-rs: check peer close in log_forwarder	2023-08-26 10:44:09 +08:00
Fabiano Fidêncio	16a610d788	Merge pull request #7758 from fidencio/topic/gha-avoid-fail-fast-till-everything-is-ultra-stable gha: Avoid "fail-fast" in tests that are known to be flaky	2023-08-25 16:49:26 +02:00
Jiang Liu	91db888d83	Merge pull request #7602 from jiangliu/agent-storage Refine storage device management for kata-agent	2023-08-25 22:20:18 +08:00
Zixuan Tan	dffc16e5b3	runtime-rs: check peer close in log_forwarder The log_forwarder task does not check if the peer has closed, causing a meaningless loop during the period of “kata vm exit”, when the peer closed, and “ShutdownContainer RPC received” that aborts the log forwarder. This patch fixes the problem. Fixes: #7741 Signed-off-by: Zixuan Tan <tanzixuan.me@gmail.com>	2023-08-25 19:00:07 +08:00
Jiang Liu	aaa5ab1264	agent: simplify storage device by removing StorageDeviceObject Simplify storage device implementation by removing StorageDeviceObject. Signed-off-by: Jiang Liu <gerry@linux.alibaba.com>	2023-08-25 17:23:16 +08:00
Fabiano Fidêncio	fb49d5d7ce	gha: Avoid "fail-fast" in tests that are known to be flaky Otherwise we'll have to re-run all the tests due to a flaky behaviour in one of the parts. Fixes: #7757 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-25 10:00:17 +02:00
Dan Mihai	183f51d6f6	tests: use unique test name k8s-pid-ns.bats was already using the test name from k8s-kill-all-process-in-container.bats - probably a copy/paste bug. Fixes: #7753 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2023-08-25 03:41:06 +00:00
Dan Mihai	6a974679f2	tests: delete k8s deployment at the test's end At the end of k8s-kill-all-process-in-container.bats, delete the deployment it created. Fixes: #7752 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2023-08-25 03:34:37 +00:00
David Esparza	686eb3878b	Merge pull request #7751 from GabyCT/topic/unusednhwc metrics: Remove unused variable in tensorflow nhwc script	2023-08-24 18:34:06 -06:00
Fabiano Fidêncio	f1d8e1f513	Merge pull request #7747 from fidencio/topic/kata-deploy-dont-try-to-remove-opt-kata kata-deploy: Don't try to remove /opt/kata	2023-08-24 18:56:52 +02:00
Gabriela Cervantes	32a778b6da	metrics: Remove unused variable in tensorflow nhwc script This PR removes unused variable in tensorflow nhwc script. Fixes #7750 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-24 15:54:27 +00:00
David Esparza	875a85ee14	Merge pull request #7736 from GabyCT/topic/tensorflowfp32 metrics: Add TensorFlow ResNet50 FP32 benchmark	2023-08-24 08:56:24 -06:00
Fabiano Fidêncio	d8f3ce6497	kata-deploy: Don't try to remove /opt/kata The directory is a host path mount and cannot be removed from within the container. What we actually want to remove is whatever is inside that directory. This may raise errors like: ``` rm: cannot remove '/opt/kata/': Device or resource busy ``` Fixes: #7746 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-24 13:57:36 +02:00
Jeremi Piotrowski	71c90b994a	Merge pull request #7745 from jepio/vfio-part-0 gha: vfio: Run on Ubuntu 23.04 runner	2023-08-24 12:15:19 +02:00
Greg Kurz	9991772b26	Merge pull request #7718 from littlejawa/fix_filemode_when_zero kata-agent: use default filemode for block device when it is set to 0	2023-08-24 11:40:28 +02:00
Jeremi Piotrowski	936e8091a7	gha: vfio: Run on Ubuntu 23.04 runner The vfio test requires nested-nested virtualization: L0 Azure host -> L1 Ubuntu VM -> L2 Fedora VM -> L3 Kata This hits a kernel bug on v5.15 but works quite nicely on the v6.2 kernel included in Ubuntu 23.04. We can switch back to Ubuntu 22.04 when they roll out v6.2. Fixes: #6555 Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-08-24 10:10:02 +02:00
Jiang Liu	0e7248264d	agent: move storage device related code into dedicated files Move storage device related code into dedicated files. Signed-off-by: Jiang Liu <gerry@linux.alibaba.com>	2023-08-24 13:48:51 +08:00
Xuewei Niu	268e846558	runtime-rs: Fix volumes and rootfs cleanup issues There are several processes for container exit: - Non-detach mode: `Wait` request is sent by containerd, then `wait_process()` will be called eventually. - Detach mode: `Wait` request is not sent, the `wait_process()` won’t be called. - Killed by ctr: For example, a container runs `tail -f /dev/null`, and is killed by `sudo ctr t kill -a -s SIGTERM <CID>`. Kill request is sent, then `kill_process()` will be called. User executes `sudo ctr c rm <CID>`, `Delete` request is sent, then `delete_process()` will be called. - Exited on its own: For example, a container runs `sleep 1s`. The container’s state goes to `Stopped` after 1 second. User executes the delete command as below. Where do we do container cleanup things? - `wait_process()`: No, because it won’t be called in detach mode. - `delete_process()`: No, because it depends on when the user executes the delete command. - `run_io_wait()`: Yes. A container is considered exited once its IO ended. And this will always be called once a container is launched. Fixes: #7713 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com> Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-08-24 13:23:47 +08:00
Jiang Liu	8f49ee33b2	agent: refine storage related code a bit Refine storage related code by: - remove the STORAGE_HANDLER_LIST - define type alias - move code near to its caller Signed-off-by: Jiang Liu <gerry@linux.alibaba.com>	2023-08-24 13:09:10 +08:00
Jiang Liu	60ca12ccb0	agent: switch to new storage subsystem Switch to new storage subsystem to create a StorageDevice for each storage object. Fixes: #7614 Signed-off-by: Jiang Liu <gerry@linux.alibaba.com>	2023-08-24 13:09:09 +08:00
Jiang Liu	fcbda0b419	kata-types: introduce StorageDevice and StorageHandlerManager Introduce StorageDevice and StorageHandlerManager, which will be used to refine storage device management for kata-agent. Signed-off-by: Jiang Liu <gerry@linux.alibaba.com>	2023-08-24 13:08:55 +08:00
Jiang Liu	b03b1f6134	agent: simplify the way to manage storage object Simplify the way to manage storage objects, and introduce StorageStateCommon structures for coming extensions. Signed-off-by: Jiang Liu <gerry@linux.alibaba.com>	2023-08-24 12:58:24 +08:00
Jiang Liu	8392c71bf2	sys-util: support more mount flags in parse_mount_options() Support more mount flags in parse_mount_options(). Signed-off-by: Jiang Liu <gerry@linux.alibaba.com>	2023-08-24 12:17:39 +08:00
Jiang Liu	c00d8f3d48	agent: use create_mount_destination() from kata-sys-util Use create_mount_destination() from kata-sys-util crate to reduce redundant code. Signed-off-by: Jiang Liu <gerry@linux.alibaba.com>	2023-08-24 12:17:38 +08:00
Jiang Liu	5e867f0538	types: add more mount related constants Add more mount related constants. Signed-off-by: Jiang Liu <gerry@linux.alibaba.com>	2023-08-24 12:17:36 +08:00
Jiang Liu	880e6c9a76	agent: use function from kata-sys-utils to reduce code Use function get_linux_mount_info() from kata-sys-util crate to share common code. Signed-off-by: Jiang Liu <gerry@linux.alibaba.com>	2023-08-24 12:17:34 +08:00
QuanweiZhou	a6921dd837	Merge pull request #7698 from jiangliu/virtual-volume kata-types: introduce KataVirtualVolume to support nydus, direct volume and image pull	2023-08-24 11:50:39 +08:00
Fabiano Fidêncio	7705c5962e	Merge pull request #7728 from ManaSugi/fix/typo-test-toml libs,tests: fix typo disable_guest_seccomp in configuration-anno-1.toml	2023-08-23 23:55:41 +02:00
GabyCT	c1712e1930	Merge pull request #7737 from jepio/fix-local-build local-build: Remove GID before creating group	2023-08-23 12:26:39 -06:00
Jeremi Piotrowski	3b881fbc0e	local-build: Remove GID before creating group docker install now creates a group with gid 999 which happens to match what we need to get docker-in-docker to work. Remove the group first as we don't need it. Fixes: #7726 Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-08-23 18:58:38 +02:00
David Esparza	ebce5d25a9	Merge pull request #7734 from fidencio/topic/kata-deploy-fix-removal kata-deploy: Avoid failing on content removal	2023-08-23 10:29:57 -06:00
Gabriela Cervantes	959ca49447	metrics: Add TensorFlow ResNet50 fp32 Dockerfile This PR adds the TensorFlow ResNet50 fp32 Dockerfile for kata metrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-23 16:24:58 +00:00
Gabriela Cervantes	4b7d72c4a8	metrics: Add TensorFlow ResNet50 FP32 benchmark This PR adds TensorFlow ResNet50 FP32 benchmark for kata metrics. Fixes #7735 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-23 16:21:09 +00:00
Fabiano Fidêncio	e7e4cc2182	Merge pull request #7716 from bergwolf/github/image-initrd-assets runtime: fix image and initrd assets handling	2023-08-23 18:02:15 +02:00
Fabiano Fidêncio	5cba38c175	kata-deploy: Avoid failing on content removal We can simply use `rm -f` all over the place and avoid the container returning any error. Fixes: #7733 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-23 16:49:26 +02:00
Peng Tao	18d42da21e	runtime/fc: fix image/initrd annotation handling Right now if we configure an image annotation and have a config file setting initrd, the initrd config would override the image annotation. Make sure annotations are preferred over config options in image and initrd path handling. Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2023-08-23 03:47:28 +00:00
Peng Tao	9fda7059a5	runtime/clh: fix image/initrd annotation handling We should make sure annotations are preferred over config options in image and initrd path handling. Fixes: #7705 Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2023-08-23 03:47:28 +00:00
Peng Tao	1a0092d631	runtime/qemu: fix image/initrd annotation handling Right now if we configure an image annotation and have a config file setting initrd, the initrd config would override the image annotation. Add a helper function ImageOrInitrdAssetPath to make sure annotations are preferred over config options in image and initrd path handling. Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2023-08-23 03:47:27 +00:00
Manabu Sugimoto	22d8f335d6	libs,tests: fix typo disable_guest_seccomp in configuration-anno-1.toml Change `pdisable_guest_seccomp` to `disable_guest_seccomp` Fixes: #7727 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2023-08-23 12:08:18 +09:00
GabyCT	b8990c0490	Merge pull request #7722 from GabyCT/topic/adddiskreadme metrics: Add disk link to README	2023-08-22 12:29:54 -06:00
GabyCT	514d3d42b8	Merge pull request #7712 from GabyCT/topic/fixfiopath metrics: Fix FIO path	2023-08-22 12:28:28 -06:00
Gabriela Cervantes	8afd158cef	metrics: Add disk link to README This PR adds disk link to README documentation for kata metrics. Fixes #7721 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-22 16:20:31 +00:00
Julien Ropé	40914b25d4	kata-agent: use default filemode for block device when it is set to 0 When the FileMode field for the device is unset (0), use a default value instead to allow the use of the device from the container. This behaviour is seen from cri-o typically. Note: this is what runc is doing, which is why regular containers don't have an issue. This change makes sure kata behaves the same as runc. Fixes: #7717 Signed-off-by: Julien Ropé <jrope@redhat.com>	2023-08-22 16:08:14 +02:00
Fabiano Fidêncio	8032797418	Merge pull request #7708 from microsoft/danmihai1/kata-deploy-log gha: capture additional kata-deploy output	2023-08-21 23:43:51 +02:00
David Esparza	d2c130ea69	Merge pull request #7710 from GabyCT/topic/fixpytorch1 metrics: Use function from metrics common in pytorch script	2023-08-21 15:31:24 -06:00
Gabriela Cervantes	eee2ee6eeb	metrics: Fix FIO path This PR fixes the FIO path for the FIO files. Fixes #7711 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-21 21:06:04 +00:00
David Esparza	9347051592	Merge pull request #7666 from dborquez/metrics_improve_fio_test metrics: Enable kata runtime in K8s for FIO test.	2023-08-21 13:51:57 -06:00
Gabriela Cervantes	39bc3488f5	metrics: Use function from metrics common in pytorch script This PR uses a common function in the pytorch script. Fixes #7709 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-21 16:12:35 +00:00
Dan Mihai	400eb88743	gha: capture additional kata-deploy output 10 lines can be insufficient for diagnostics. Fixes: #7707 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2023-08-21 15:58:57 +00:00
GabyCT	700759232f	Merge pull request #7690 from GabyCT/topic/fixpytorch metrics: Fix README for pytorch	2023-08-21 09:50:14 -06:00
Jiang Liu	6e038e66e4	Merge pull request #7680 from GabyCT/topic/removetime metrics: Remove unused variable in tensorflow mobilenet script	2023-08-21 23:39:07 +08:00
Jiang Liu	4aee3eade0	kata-types: implement serde methods for KataVirtualVolume Implement serialization/deserialization methods for KataVirtualVolume. Signed-off-by: Jiang Liu <gerry@linux.alibaba.com>	2023-08-21 16:46:56 +08:00
Jiang Liu	b875e39323	kata-types: validate KataVirtualVolume object Implement method validate() for KataVirtualVolume to validate message format. Signed-off-by: Jiang Liu <gerry@linux.alibaba.com>	2023-08-21 16:42:07 +08:00
Jiang Liu	fa2fdc1057	kata-types: implement two conversion helpers for KataVirtualVolume Enable conversions from NydusExtraOptions/DirectVolumeMountInfo to KataVirtualVolume. Signed-off-by: Jiang Liu <gerry@linux.alibaba.com>	2023-08-21 16:35:26 +08:00
Jiang Liu	6326af20e3	kata-types: introduce KataVirtualVolume Introduce structure KataVirtualVolume to encapsulate information for extra mount options and direct volumes, so we could build a common infrastructure to handle these cases. Fixes: #7699 Signed-off-by: Jiang Liu <gerry@linux.alibaba.com>	2023-08-21 16:19:47 +08:00
Gabriela Cervantes	c8b43f8b3e	metrics: Fix README for pytorch This PR fixes the pytorch reference in the README file. Fixes #7689 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-18 20:14:49 +00:00
Aurélien	fa34d61805	Merge pull request #7664 from microsoft/danmihai1/agent-init-policy rootfs: agent: Policy support with AGENT_INIT=yes	2023-08-18 10:51:55 -07:00
Fabiano Fidêncio	7e66d1f6b5	Merge pull request #7649 from fidencio/topic/k8s-tests-remove-kata-deploy-tests gha: k8s: kata-deploy: Move kata-deploy specific tests from integration/kubernetes to functional/kata-deploy	2023-08-18 07:47:26 +02:00
David Esparza	fb571f8be9	metrics: Enable kata runtime in K8s for FIO test. This PR configures the corresponding kata runtime in K8s based on the tested hypervisor. This PR also enables FIO metrics test in the kata metrics-ci. Fixes: #7665 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-08-17 17:11:27 -06:00
Dan Mihai	cb056f8cb3	rootfs: agent: Policy support with AGENT_INIT=yes When building with AGENT_POLICY=yes and AGENT_INIT=yes: 1. Include OPA and the Policy settings in rootfs. 2. Start OPA from the kata agent. Before these changes, building with both AGENT_POLICY=yes and AGENT_INIT=yes was unsupported. Starting OPA from systemd (when AGENT_INIT=no) was already supported. Fixes: #7615 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2023-08-17 22:37:58 +00:00
GabyCT	c358056a3f	Merge pull request #7685 from GabyCT/topic/changename metrics: Fix check results for tensorflow benchmark	2023-08-17 15:39:43 -06:00
Gabriela Cervantes	85c02828e1	metrics: Update tensorflow name in gha run script This PR updates the tensorflow name in the gha run script. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-17 20:17:48 +00:00
Gabriela Cervantes	e8a5119343	metrics: Fix check results for tensorflow benchmark This PR fixes the check results for tensorflow benchmark now that we change the name of the test. Fixes #7684 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-17 19:52:45 +00:00
Fabiano Fidêncio	2d896ad12f	gha: kata-deploy: Do the runtime class cleanup as part of the cleanup Instead of doing this as part of the test itself, let's ensure it's done before running the tests and during the tests cleanup. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-17 18:54:46 +02:00
Fabiano Fidêncio	4ffc2c86f3	gha: kata-deploy: Add the first kata-deploy test This test, at least for now, only checks whether the runtimeclasses have been properly created. This is just a migration from a test we had as part of the k8s suite. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-17 18:54:46 +02:00
GabyCT	4ba684e6e4	Merge pull request #7653 from GabyCT/topic/tensorflowfp32 metrics: Add Tensorflow ResNet50 int8 benchmark	2023-08-17 10:44:25 -06:00
Gabriela Cervantes	8616c050ae	metrics: Remove unused variable in tensorflow mobilenet script This PR removes unused variable in tensorflow mobilenet script. Fixes #7679 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-17 16:04:18 +00:00
Fabiano Fidêncio	285e616b5e	tests: common: Ensure test_type is used as part of the cluster's name By doing this we can make sure there won't be any clash on the cluster name created for either the k8s or the kata-deploy tests. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-17 14:22:16 +02:00
Fabiano Fidêncio	790bd3548d	tests: common: Don't fail if yq is not part of the cache This may happen on external runners. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-17 14:22:14 +02:00
Fabiano Fidêncio	ce6adecd0a	gha: kata-deploy: Add run-kata-deploy-tests.sh This will have the same function as run-k8s-tests.sh has, but for kata-deploy. Right now it doesn't have any tests, and the command to actually run the tests is commented out, but right now this is just a placeholder that will be populated sooner than later. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-17 09:49:03 +02:00
Fabiano Fidêncio	cfc29c11a3	gha: k8s: Stop running kata-deploy tests as part of the k8s suite In a follow-up series, we'll add a whole suite for the kata-deploy tests. With this in mind, let's already get rid of this one and avoid more kata-deploy tests to land here. Fixes: #7642 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-17 09:48:54 +02:00
Fabiano Fidêncio	e470a650e0	Merge pull request #7654 from sprt/ci-fixes kata-deploy: Properly create default runtime class	2023-08-17 09:43:34 +02:00
Wedson Almeida Filho	962378606e	Merge pull request #7627 from wedsonaf/error-conv agent: simplify error handling	2023-08-16 21:02:38 -03:00
Aurélien Bombo	f4dd152863	tests: k8s: Call ensure_yq() in setup.sh It wasn't the `common.bash` import in `run_kubernetes_tests.sh` causing the yq error so let's try this instead. Reference: https://github.com/kata-containers/kata-containers/actions/runs/5674941359/job/15379797568#step:10:341 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-08-16 14:13:56 -07:00
GabyCT	3d0cfc88c9	Merge pull request #7662 from GabyCT/topic/fixhelptensorflow metrics: Fix MobileNet help me description	2023-08-16 14:13:39 -06:00
Aurélien Bombo	339569b69c	kata-deploy: Properly create default runtime class The default `kata` runtime class would get created with the `kata` handler instead of `kata-$KATA_HYPERVISOR`. This made Kata use the wrong hypervisor and broke CI. Fixes: #7663 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-08-16 11:04:44 -07:00
Gabriela Cervantes	2a491e9b1f	metrics: Fix MobileNet help me description This PR fixes MobileNet help me description in the tensorflow script. Fixes #7661 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-16 15:25:39 +00:00
Fabiano Fidêncio	606e419fac	Merge pull request #7660 from fidencio/topic/add-kata-deploy-tests-as-part-of-the-ci gha: ci: Start running kata-deploy tests	2023-08-16 16:44:08 +02:00
Fabiano Fidêncio	d19a75e80c	gha: ci: Start running kata-deploy tests Let's add the tests as part of the ci.yaml, so they can be triggered as part of each PR. For this PR those tests won't be triggered, courtesy of the `pull_request_target` event we rely on. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-16 16:08:05 +02:00
Fabiano Fidêncio	4adcf2192e	Merge pull request #7651 from ManaSugi/runk/containerd-test runk: Modify kill command's error message for containerd tests	2023-08-16 15:37:48 +02:00
Zhongtao Hu	5c8a61a4c8	Merge pull request #7558 from openanolis/fix/driver_option runtime-rs: add driver option	2023-08-16 13:56:29 +08:00
Zhongtao Hu	d90f7ac689	runtime-rs: add unit test for block driver add unit test for block driver Fixes:#7539 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2023-08-16 11:45:27 +08:00
Zhongtao Hu	e44919f0da	runtime-rs: add load_test_config for unit test add load_test_config for unit test Fixes:#7539 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2023-08-16 11:32:56 +08:00
Zhongtao Hu	7f48a69379	runtime-rs: add driver option add driver option when handle linux devices Fixes:#7539 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2023-08-16 11:32:49 +08:00
Gabriela Cervantes	bade6a5c3b	docs: Fix TensorFlow word across the document This PR fixes the TensorFlow word across the document to have uniformity across all the document. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-15 20:13:05 +00:00
Fabiano Fidêncio	0bc48eab60	Merge pull request #7640 from fidencio/topic/gha-cri-containerd-enable-tests gha: cri-containerd: Enable tests	2023-08-15 21:18:28 +02:00
Gabriela Cervantes	1a1b207760	docs: Add Tensorflow Resnet50 documentation This PR adds the Tensorflow Resnet50 documentation. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-15 17:46:44 +00:00
Gabriela Cervantes	24baededc0	metrics: Add Dockerfile for ResNet50 int8 This PR adds the dockerfile for ResNet50 int8 benchmark. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-15 17:38:26 +00:00
Gabriela Cervantes	6d971ba8df	metrics: Add Tensorflow ResNet50 int8 benchmark This PR adds the Tensorflow ResNet50 int8 script for kata metrics. Fixes #7652 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-15 17:30:22 +00:00
Manabu Sugimoto	25d151bd1b	runk: Modify kill command's error message for containerd tests The error message when the kill command is executed with the container's state == Stopped should be "container not running" because the containerd tests expect that OCI runtimes return the error message and compare it. If the error message is different from the expected one, the tests fail. Fixes: #7650 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2023-08-16 00:39:50 +09:00
GabyCT	0bbabeaaf8	Merge pull request #7644 from GabyCT/topic/renametensorflow metrics: Rename tensorflow scripts	2023-08-15 09:23:24 -06:00
Fabiano Fidêncio	46d25d908d	Merge pull request #7643 from fidencio/topic/add-functional-kata-deploy-tests gha: tests: Add kata-deploy functional tests -- Part 1	2023-08-15 15:23:48 +02:00
Fabiano Fidêncio	b3592ab25c	gha: cri-containerd: Enable tests As the cri-containerd tests have been fully migrated to GHA, let's make sure we get them running. Fixes: #6543 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-15 14:32:42 +02:00
Fabiano Fidêncio	84dd02e0f9	gha: cri-containerd: Add timeout to the crictl calls on testContainerStop As part of the runners, we're hitting a timeout that I cannot reproduce, at all, when allocating the same instance and running the tests manually. The default timeout to connect to the server is 2s when using `crictl`. Let's increase this to 20s. It's fairly important to mention that in the first tests I used a timeout of 10s, and that helped but we still hit issues every now and then. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-15 14:31:54 +02:00
Fabiano Fidêncio	b29782984a	gha: cri-containerd: Show pod before deleting it It'll help us to debug failures with the pod stop / pod delete. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-15 14:31:54 +02:00
Fabiano Fidêncio	ae0930824a	gha: cri-containerd: Print kata logs in case of error We need this to fully understand what are the issues we're facing. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-15 14:31:54 +02:00
Fabiano Fidêncio	6c8b2ffa60	gha: cri-containerd: Group containerd logs This improves readability in case of failures by a lot. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-15 14:31:54 +02:00
Fabiano Fidêncio	9e898701f5	gha: cri-containerd: Ensure RUNTIME takes KATA_HYPERVISOR into account Short commit log says it all. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-15 14:31:54 +02:00
Wedson Almeida Filho	76dac8f22c	agent: simplify error handling We extend the `Result` and `Option` types with associated types that allows converting a `Result<T, E>` and `Option<T>` into `ttrpc::Result<T>`. This allows the elimination of many `match` statements in favor of calling the map function plus the `?` operator. This transformation simplifies the code. Fixes: #7624 Signed-off-by: Wedson Almeida Filho <walmeida@microsoft.com>	2023-08-15 06:55:27 -03:00
Fabiano Fidêncio	e107d1d94e	Merge pull request #7574 from microsoft/danmihai1/policy agent: runtime: add Agent Policy feature	2023-08-15 11:29:13 +02:00
Bin Liu	ea81eb6c2e	Merge pull request #7169 from chethanah/runk/support-no-pid-ns runk: Support without pid ns	2023-08-15 13:00:40 +08:00
Gabriela Cervantes	18a7fd8e4e	metrics: Rename tensorflow scripts This PR renames the tensorflow scripts to include the data format that is being used as we will have multiple tests with different data and model formats for tensorflow so this will help us to distinguish them. Fixes #7645 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-14 20:40:35 +00:00
GabyCT	a740c80251	Merge pull request #7626 from GabyCT/topic/cassandrak metrics: Add Cassandra Kubernetes benchmark for kata metrics	2023-08-14 14:22:52 -06:00
GabyCT	4e5e39e8b3	Merge pull request #7618 from GabyCT/topic/addfunctionscommon metrics: Add common functions to the common script	2023-08-14 14:22:30 -06:00
GabyCT	a19d471c01	Merge pull request #7629 from dborquez/metrics_improve_stopping_kata_components metrics: fix the loop used to stop kata components	2023-08-14 14:22:06 -06:00
Fabiano Fidêncio	e55fa93db9	tests: kata-deploy: Add placeholder for kata-deploy-tests-on-tdx This will not be tested as part of the PR, thanks to the `pull_request_target` event, but we want it to be added so we can build atop of that in a coming up series. Fixes: #7642 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-14 21:38:00 +02:00
Fabiano Fidêncio	d9ee17aaec	tests: kata-deploy: Add placeholder for kata-deploy-tests-on-aks This will not be tested as part of the PR, thanks to the `pull_request_target` event, but we want it to be added so we can build atop of that in a coming up series. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-14 21:37:52 +02:00
Chelsea Mafrica	22465d22f0	Merge pull request #7638 from ManaSugi/fix/virtcontainers-doc docs: Remove installation step in virtcontainers doc	2023-08-14 10:21:57 -07:00
Dan Mihai	ab829d1038	agent: runtime: add the Agent Policy feature Fixes: #7573 To enable this feature, build your rootfs using AGENT_POLICY=yes. The default is AGENT_POLICY=no. Building rootfs using AGENT_POLICY=yes has the following effects: 1. The kata-opa service gets included in the Guest image. 2. The agent gets built using AGENT_POLICY=yes. After this patch, the shim calls SetPolicy if and only if a Policy annotation is attached to the sandbox/pod. When creating a sandbox/pod that doesn't have an attached Policy annotation: 1. If the agent was built using AGENT_POLICY=yes, the new sandbox uses the default agent settings, that might include a default Policy too. 2. If the agent was built using AGENT_POLICY=no, the new sandbox is executed the same way as before this patch. Any SetPolicy calls from the shim to the agent fail if the agent was built using AGENT_POLICY=no. If the agent was built using AGENT_POLICY=yes: 1. The agent reads the contents of a default policy file during sandbox start-up. 2. The agent then connects to the OPA service on localhost and sends the default policy to OPA. 3. If the shim calls SetPolicy: a. The agent checks if SetPolicy is allowed by the current policy (the current policy is typically the default policy mentioned above). b. If SetPolicy is allowed, the agent deletes the current policy from OPA and replaces it with the new policy it received from the shim. A typical new policy from the shim doesn't allow any future SetPolicy calls. 4. For every agent rpc API call, the agent asks OPA if that call should be allowed. OPA allows or not a call based on the current policy, the name of the agent API, and the API call's inputs. The agent rejects any calls that are rejected by OPA. When building using AGENT_POLICY_DEBUG=yes, additional Policy logging gets enabled in the agent. In particular, information about the inputs for agent rpc API calls is logged in /tmp/policy.txt, on the Guest VM. 
These inputs can be useful for investigating API calls that might have been rejected by the Policy. Examples: 1. Load a failing policy file test1.rego on a different machine: opa run --server --addr 127.0.0.1:8181 test1.rego 2. Collect the API inputs from Guest's /tmp/policy.txt and test on the machine where the failing policy has been loaded: curl -X POST http://localhost:8181/v1/data/agent_policy/CreateContainerRequest \ --data-binary @test1-inputs.json Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2023-08-14 17:07:35 +00:00
Fabiano Fidêncio	831e73ff91	tests: kata-deploy: Add functional/kata-deploy/gha-run.sh placeholder Right now this file does nothing, as it's not even called by any GHA. However, it'll be populated later on as part of a different series, where we'll have kata-deploy specific tests running here. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-14 17:46:10 +02:00
Fabiano Fidêncio	af1b46bbf2	tests: Add gha-run-k8s-common.sh Let's split a good portion of `tests/integration/kubernetes/gha-run.sh` out, and put it in a place where it can be used by the soon-to-come kata-deploy specific tests. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-14 17:45:58 +02:00
Jeremi Piotrowski	a57e7ffe14	Merge pull request #7211 from stevenhorsman/propogate-secrets Propogate secrets, config maps etc into guest if sharedFS not available	2023-08-14 11:24:47 +02:00
Manabu Sugimoto	416445e7eb	docs: Remove installation step in virtcontainers doc Remove the installation step in the virtcontainers doc because the virtcontainers install/uninstall targets have been removed by `86723b51ae` and they are not used anymore. Fixes: #7637 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2023-08-14 15:15:24 +09:00
Fabiano Fidêncio	b975c27793	Merge pull request #7547 from stevefan1999-personal/patch-k0s kata-deploy: Preliminary k0s support	2023-08-12 14:28:13 +02:00
Fabiano Fidêncio	6ed57d1e9a	Merge pull request #7447 from fidencio/topic/gha-move-static-jenkins-to-azure-instances gha: static-checks: Move to the Azure instances	2023-08-12 13:31:54 +02:00
Steve Fan	72cbcf040b	kata-deploy: Add k0s support Add k0s support to kata-deploy, in the very same way kata-containers already supports k3s, and rke2. k0s support requires v1.27.1, which is noted as part of the kata-deploy documentation, as it's the way to use dynamic configuration on containerd CRI runtimes. This support will only be part of the `main` branch, as it's not a bug fix that can be backported to the `stable-3.2` branch, and this is also noted as part of the documentation. Fixes: #7548 Signed-off-by: Steve Fan <29133953+stevefan1999-personal@users.noreply.github.com>	2023-08-11 21:17:23 +02:00
David Esparza	767434d50a	metrics: fix the loop used to stop kata components #7629 This PR fixed the loop that stops the kata-shim and the hypervisors used in metrics checks. Fixes: #7628 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-08-11 12:32:41 -06:00
Gabriela Cervantes	5d0f0d43c7	metrics: Add cassandra statefulset yaml This PR adds cassandra statefulset yaml for kata metrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-11 17:22:39 +00:00
Gabriela Cervantes	c1dcc1396f	metrics: Add cassandra service yaml This PR adds the cassandra service yaml for the benchmark. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-11 17:22:36 +00:00
Gabriela Cervantes	2297a0d1c5	metrics: Add block loop pvc yaml for cassandra This PR adds block loop pvc yaml for cassandra test. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-11 17:22:33 +00:00
Gabriela Cervantes	e3d511946f	metrics: Add block loop pv yaml for cassandra test This PR adds the block loop pv yaml for cassandra test. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-11 17:22:29 +00:00
Gabriela Cervantes	9890271594	metrics: Add block loop pvc for cassandra test This PR adds the block loop pvc for cassandra test. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-11 17:22:19 +00:00
Gabriela Cervantes	349b89969a	metrics: Add Cassandra Kubernetes benchmark for kata metrics This PR adds Cassandra Kubernetes benchmark for kata metrics tests. Fixes #7625 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-11 17:21:48 +00:00
Fabiano Fidêncio	c52d090522	gha: static-checks: Move to the Azure instances The GHA runners are not exactly powerful, which makes the static-checks take way too long (almost an hour). Let's give a try and move those to the same size of Azure instances used as part of our CI, and probably have this time reduced. Fixes: #7446 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-11 18:47:47 +02:00
stevenhorsman	8815ed0665	runtime: Remove config warnings Remove configuration file shared_fs = none warnings now that there is a solution to updating configMaps, secrets etc Fixes: #7210 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2023-08-11 16:31:08 +01:00
Yohei Ueda	afe1a6ac5a	agent: support copying of directories and symlinks This patch allows copying of directories and symlinks when static file copying is used between host and guest. This change is necessary to support recursive file copying between shim and agent. Signed-off-by: Yohei Ueda <yohei@jp.ibm.com> (cherry picked from commit `de232b8030`)	2023-08-11 16:31:08 +01:00
Pradipta Banerjee	ab13ef87ee	runtime: propagate configmap/secrets etc changes for remote-hyp For remote hypervisor, the configmap, secrets, downward-api or project-volumes are copied from host to guest. This patch watches for changes to the host files and copies the changes to the guest. Note that configmap updates take significantly longer than updates via downward-api. This is similar across runc and Kata runtimes. Fixes: #7210 Signed-off-by: Pradipta Banerjee <pradipta.banerjee@gmail.com> Signed-off-by: Julien Ropé <jrope@redhat.com> (cherry picked from commit `3081cd5f8e`) (cherry picked from commit 68ec673bc4d9cd853eee51b21a0e91fcec149aad)	2023-08-11 16:31:08 +01:00
Yohei Ueda	c074ec4df1	runtime: Copy shared files recursively This patch enables recursive file copying when filesystem sharing is not used. Signed-off-by: Yohei Ueda <yohei@jp.ibm.com> Co-authored-by: stevenhorsman <steven@uk.ibm.com> (cherry picked from commit `5422a056f2`) (cherry picked from commit 16055ce040bbd724be2916bc518d89b69c9e0ca5) Fixes: #7210	2023-08-11 16:16:52 +01:00
Peng Tao	a39fd6c066	Merge pull request #7611 from ManaSugi/fix/fc-version versions: Update firecracker version to 1.4.0	2023-08-11 16:43:37 +08:00
Chao Wu	7031b5db07	Merge pull request #7535 from ManaSugi/fix/allow-redundant-clone agent: Allow clippy::redundant_clone in the unit tests	2023-08-11 14:17:56 +08:00
Gabriela Cervantes	fdcd52ff78	metrics: Add check containers are running in tensorflow mobilenet This PR adds check containers are running in tensorflow mobilenet that is being defined in common script. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-10 20:17:20 +00:00
Gabriela Cervantes	36337ee146	metrics: Add check containers are up in tensorflow script This PR adds the check containers are up function from common in tensorflow script. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-10 20:15:18 +00:00
Gabriela Cervantes	f700f9b0ba	metrics: Remove unused variable in tensorflow script This PR removes an unused variable in tensorflow script. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-10 20:13:37 +00:00
Gabriela Cervantes	833cf7a684	metrics: Add check containers are running function This PR adds the check containers are running function to the common metrics script. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-10 20:12:22 +00:00
Gabriela Cervantes	918c783084	metrics: Add check containers are up in tensorflow mobilenet script This PR adds the check containers are up in the common script in the tensorflow mobilenet script. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-10 20:06:40 +00:00
Gabriela Cervantes	9d57a1fab4	metrics: Use check containers are up in tensorflow script This PR uses the check containers are up from the common script in the tensorflow script. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-10 17:42:09 +00:00
Gabriela Cervantes	1c84680d8c	metrics: Add check containers are up in common script This PR adds check containers are up in common script for kata metrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-10 17:39:24 +00:00
Gabriela Cervantes	d3e57cf454	metrics: Use collect_results function in tensorflow mobilenet test This PR uses the collect results function defined in common for the tensorflow mobilenet test. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-10 17:34:30 +00:00
Gabriela Cervantes	286de046af	metrics: Remove collect results function definition This PR removes the collect results function from tensorflow script as it is going to be referenced in the common metrics script. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-10 17:31:23 +00:00
Gabriela Cervantes	9879709aae	metrics: Add common functions to the common script This PR adds the collect results function to the common metrics script. Fixes #7617 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-10 17:27:11 +00:00
Fabiano Fidêncio	a89c9cd620	Merge pull request #7557 from wedsonaf/no-new-vecs agent: avoid creating new `Vec` instances when easily avoidable	2023-08-10 18:43:46 +02:00
Manabu Sugimoto	4746fa3daa	docs: Specify supported Firecracker version using `versions.yaml` Specify the supported version of Firecracker using our `versions.yaml` to improve the maintainability of the documentation. Fixes: #7610 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2023-08-10 16:49:45 +09:00
Manabu Sugimoto	cc922be5ec	versions: Update firecracker version to 1.4.0 This patch upgrades Firecracker version from v1.1.0 to v1.4.0. * Generate swagger models for v1.4.0 (from `firecracker.yaml`) - The version of go-swagger used is v0.30.0 * The firecracker v1.4.0 includes the following changes. - Added * Added support for custom CPU templates allowing users to adjust vCPU features exposed to the guest via CPUID, MSRs and ARM registers. * Introduced V1N1 static CPU template for ARM to represent Neoverse V1 CPU as Neoverse N1. * Added support for the virtio-rng entropy device. The device is optional. A single device can be enabled per VM using the /entropy endpoint. * Added a cpu-template-helper tool for assisting with creating and managing custom CPU templates. - Changed * Set FDP_EXCPTN_ONLY bit (CPUID.7h.0:EBX[6]) and ZERO_FCS_FDS bit (CPUID.7h.0:EBX[13]) in Intel's CPUID normalization process. - Fixed * Fixed feature flags in T2S CPU template on Intel Ice Lake. * Fixed CPUID leaf 0xb to be exposed to guests running on AMD host. * Fixed a performance regression in the jailer logic for closing open file descriptors. * A race condition that has been identified between the API thread and the VMM thread due to a misconfiguration of the api_event_fd. * Fixed CPUID leaf 0x1 to disable perfmon and debug feature on x86 host. * Fixed passing through cache information from host in CPUID leaf 0x80000006. * Fixed the T2S CPU template to set the RRSBA bit of the IA32_ARCH_CAPABILITIES MSR to 1 in accordance with an Intel microcode update. * Fixed the T2CL CPU template to pass through the RSBA and RRSBA bits of the IA32_ARCH_CAPABILITIES MSR from the host in accordance with an Intel microcode update. * Fixed passing through cache information from host in CPUID leaf 0x80000005. * Fixed the T2A CPU template to disable SVM (nested virtualization). 
* Fixed the T2A CPU template to set EferLmsleUnsupported bit (CPUID.80000008h:EBX[20]), which indicates that EFER[LMSLE] is not supported. Fixes: #7610 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2023-08-10 16:48:13 +09:00
Fupan Li	39e67b06e9	dragonball: vsock add fifo/pipe stream support for passed fd hybridStream Since the passed fd through unix socket would be any stream fd such as pipe/fifo fd or any other socket fd, thus we should deal with it as a normal hybrid stream instead of a unix stream. Fixes:#7584 Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>	2023-08-10 11:07:10 +08:00
David Esparza	7bf994827d	Merge pull request #7609 from dborquez/tensorflow_check_completion metrics: compute tensorflow statistics	2023-08-09 18:47:47 -06:00
David Esparza	dcdb3b067f	Merge pull request #7606 from GabyCT/topic/nginx metrics: Add network nginx benchmark	2023-08-09 16:14:13 -06:00
David Esparza	2defdcc598	Merge pull request #7579 from dborquez/simplify_gha_metrics_workflow metrics: install kata once and run multiple checks	2023-08-09 14:45:09 -06:00
David Esparza	473b0d3a31	metrics: compute tensorflow statistics This PR computes average results for TF bench. Additionally, it improves the data parsing from all running containers. Fixes: #7603 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-08-09 14:42:30 -06:00
Fabiano Fidêncio	0a8208c670	Merge pull request #7608 from fidencio/topic/create-image-to-be-used-by-the-confidential-tests-follow-up-3 ci: unencrypted-image: Fix build context	2023-08-09 21:00:46 +02:00
Fabiano Fidêncio	03d1fa67b1	ci: unencrypted-image: Fix build context The build context should be the folder where the Dockerfile is present, otherwise the files copied into the image won't be found. Fixes: #7595 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-09 20:32:36 +02:00
Fabiano Fidêncio	eb463b38ec	ci: unencrypted-image: Don't fail to build on s390x Let's make sure that we don't fail in case we're building non x86_64. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-09 20:32:36 +02:00
Fabiano Fidêncio	ebc86091d1	Merge pull request #7607 from fidencio/topic/create-image-to-be-used-by-the-confidential-tests-follow-up-2 ci: create-confidential-image: Add dependent actions	2023-08-09 19:53:49 +02:00
Fabiano Fidêncio	a2d731ad26	ci: create-confidential-image: Add dependent actions Following the example on https://github.com/docker/build-push-action, it's clear that the actions to "Set up QEMU" and "Set up Docker Buildx" are missing. Let's add them, and also take the advantage to bump the build-push-action to its v4, which, by the way, had a typo on its name (build-and-push-action does NOT exist, build-push-action does). Fixes: #7595 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-09 18:36:51 +02:00
Gabriela Cervantes	d1a6296221	metrics: Add nginx documentation to network README This PR adds nginx documentation to network README for kata metrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-09 16:17:46 +00:00
Gabriela Cervantes	498f7c0549	metrics: Add nginx kubernetes yaml This PR adds the nginx kubernetes yaml. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-09 16:14:04 +00:00
Gabriela Cervantes	f8a5255cf7	metrics: Add network nginx benchmark This PR adds the network nginx benchmark for kata metrics. Fixes #7605 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-09 16:12:21 +00:00
Fabiano Fidêncio	86f705d98b	Merge pull request #7604 from fidencio/topic/create-image-to-be-used-by-the-confidential-tests-follow-up-1 Follow up fixes for https://github.com/kata-containers/kata-containers/pull/7596	2023-08-09 18:05:46 +02:00
Fabiano Fidêncio	43fe5d1b90	ci: k8s: tees: Ensure PR_NUMBER is exported Right now this is not being used, but it will be, as the images generated for the confidential tests have it as part of their tag. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-09 17:45:42 +02:00
Fabiano Fidêncio	54f6a78500	ci: {{ pr-number }} should be {{ inputs.pr-number }} One of the joys of relying on `pull_request_target` is only being able to catch these issues after they are merged. Fixes: #7595 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-09 17:41:07 +02:00
Fabiano Fidêncio	5cdf981a2b	Merge pull request #7596 from fidencio/topic/create-image-to-be-used-by-the-confidential-tests tests: Create image that will be used in the unencrypted confidential tests	2023-08-09 17:06:07 +02:00
Fabiano Fidêncio	c932369f42	Merge pull request #7492 from fidencio/topic/adapt-tests-to-the-new-kata-deploy-env-vars kata-deploy: Ensure we cover SHIMS / DEFAULT_SHIM as part of our tests	2023-08-09 12:55:03 +02:00
Fabiano Fidêncio	034d7aab87	tests: k8s: Ensure the runtime classes are properly created With these 2 simple checks we can ensure that we do not regress on the behaviour of allowing the runtime classes / default runtime class to be created by the kata-deploy payload. Fixes: #7491 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-09 11:46:04 +02:00
Fabiano Fidêncio	fac8ccf5cd	ci: Add build-and-publish-tee-confidential-unencrypted-image This will be done before running TEE tests, and it's a hard dependency for them. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-09 11:36:10 +02:00
Fabiano Fidêncio	ab5f603ffa	ci: k8s: Add the image used for unencrypted confidential tests Let's add here the image we'll be using for unencrypted confidential tests. Later on, we'll make sure to build and use this image as part of our CI. The image can easily be built as a multi-arch image, and has `cpuid` installed in case of `x86_64` build, so it can be used to detect whether we're running on a TEE guest without having to rely on `dmesg \| grep ...`. Fixes: #7595 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-09 11:33:18 +02:00
Fabiano Fidêncio	36d53dd2af	Merge pull request #7598 from UnmeshDeodhar/upgrade-bats-version tests: upgrade bats version	2023-08-09 11:18:56 +02:00
Fabiano Fidêncio	1e8fe131bd	k8s: tests: Take advantage of `SHIMS` and `DEFAULT_SHIM` env vars We don't have to do any sed to replace the runtimeclass being used once we start taking advantage of the `DEFAULT_SHIM` environment variable exposed in the previous commits. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-09 11:15:34 +02:00
Wedson Almeida Filho	729b2dd611	agent: avoid creating new `Vec` instances when easily avoidable There are many places where the code currently creates new `Vec` instances when it's not really needed. The result is a perf hit because it allocates memory, copies all elements, then frees the memory; in some cases, copying elements also involves extra allocations (e.g., when elements are strings, or structs containing strings). This patch addresses a number of these cases. Fixes: #7203 Signed-off-by: Wedson Almeida Filho <walmeida@microsoft.com>	2023-08-09 02:38:36 -03:00
Jiang Liu	311671abb5	Merge pull request #7552 from jiangliu/agent-r1 Fix mimor bugs and improve coding stype of agent rpc/sandbox/mount	2023-08-09 13:19:02 +08:00
Unmesh Deodhar	aeaec9dae9	tests: upgrade bats version Instead of using the package manager to install bats, build it from source. This gives us an updated version of bats, which supports functions such as setup_file and teardown_file. We can use these functions in our current tests. Fixes: #7597 Signed-off-by: Unmesh Deodhar <udeodhar@amd.com>	2023-08-08 18:16:39 -05:00
David Esparza	e664969862	metrics: install kata once and run multiple checks This PR changes the metrics workflow in order to just install kata once, and run the checks for multiple hypervisor variations. In this way we save time avoiding installing kata for each hypervisor to be tested. Fixes: #7578 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-08-08 10:25:13 -06:00
Jiang Liu	baabfa9f1f	agent: refine implementation of mount related code Refine implementation of mount by: - log message with `path.display()` instead of `{:?}` - add prefix "_" to unused variables - pass by reference instead of by value to avoid creating redundant array - exactly matching prefix "fsgid=" instead of "fsgid" - avoid redundant clone() operations Signed-off-by: Jiang Liu <gerry@linux.alibaba.com>	2023-08-08 18:03:03 +08:00
Jiang Liu	98ba211a34	agent: fix a bug in update_ephemeral_mounts() There's a bug in function update_ephemeral_mounts() which only handles the first storage object and ignores all other storage objects. Fixes: #7551 Signed-off-by: Jiang Liu <gerry@linux.alibaba.com>	2023-08-08 18:03:02 +08:00
Jiang Liu	5333618d70	agent: make add_storage() take &[Storage] instead of Vec<Storage> Simplify add_storage() by taking &[Storage] instead of Vec<Storage>. Signed-off-by: Jiang Liu <gerry@linux.alibaba.com>	2023-08-08 18:03:01 +08:00
Jiang Liu	37f34781d1	agent: simplify function online_cpu_memory() Simplify function online_cpu_memory() by only calling update_cpuset_path() for containers with cpuset configured. Signed-off-by: Jiang Liu <gerry@linux.alibaba.com>	2023-08-08 18:03:00 +08:00
Jiang Liu	d3c5422379	agent: refine style of code related to sandbox Refine style of code related to sandbox by: - remove unnecessary comments telling the caller to take the lock, as we have already taken `&mut self`. - change "count < 1 " to "count == 0", `count` is type of u32. - make remove_sandbox_storage() take `&mut self` instead of `&self`. - group related functions together - avoid searching the map twice in function find_process() - avoid unwrap() in function run_oom_event_monitor() - avoid unwrap() in online_resources() Signed-off-by: Jiang Liu <gerry@linux.alibaba.com>	2023-08-08 18:02:59 +08:00
Jiang Liu	71a9f67781	agent: avoid unwrap() in function do_remove_container() Avoid unwrap() in function do_remove_container(), and also make the implementation symmetric for both timeout and non-timeout cases. Signed-off-by: Jiang Liu <gerry@linux.alibaba.com>	2023-08-08 18:02:58 +08:00
Jiang Liu	84badd89d7	agent: avoid clone objects when possible Optimize agent rpc implementation by: - avoid cloning objects when possible - avoid unwrap() when possible - explicitly drop objects to ensure order Signed-off-by: Jiang Liu <gerry@linux.alibaba.com>	2023-08-08 18:02:56 +08:00
Chao Wu	b098960442	Merge pull request #7581 from justxuewei/bump-versions deps: Bump dependent crate versions	2023-08-08 15:16:57 +08:00
Chao Wu	24bf637835	Merge pull request #7500 from pmores/fix-queue-num-in-dragonball-share-fs fix number of queues handling in dragonball share fs device	2023-08-08 12:07:25 +08:00
Xuewei Niu	b23c5ed155	deps: Bump dependent crate versions This pull request is mainly for updating vm-memory and vmm-sys-util. The affected crates include: - vm-memory: from 0.9.0 to 0.10.0 - vmm-sys-util: from 0.10.0 to 0.11.0 - virtio-queue: from 0.6.0 to 0.7.0 - fuse-backend-rs: from 0.10.4 to 0.10.5 - linux-loader: from 0.6.0 to 0.8.0 - nydus-api: from 0.3.0 to 0.3.1 - nydus-rafs: from 0.3.1 to 0.3.2 - nydus-storage: from 0.6.3 to 0.6.4 Fixes: #0000 Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-08-08 11:54:09 +08:00
Fupan Li	5a20d8dcaf	Merge pull request #7383 from justxuewei/dan runtime-rs: Introduce directly attachable network	2023-08-08 09:54:28 +08:00
Chelsea Mafrica	553fd79ea9	Merge pull request #7572 from GabyCT/topic/resnet50fp32 metrics: General improvements to mobilenet tensorflow test	2023-08-07 13:33:28 -07:00
GabyCT	194120b679	Merge pull request #7540 from GabyCT/topic/enableiperf gha: Add iperf network metrics	2023-08-07 13:40:02 -06:00
Gabriela Cervantes	863283716d	metrics: General improvements to mobilenet tensorflow test This PR renames the mobilenet tensorflow test to have a more specific tensorflow name mainly because tensorflow has different configurations and we will add more tensorflow tests so we want to distinguish each tensorflow test. Fixes #7571 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-07 16:50:00 +00:00
Gabriela Cervantes	3c319d8d4c	metrics: Add iperf to gha run script This PR adds iperf to gha run script. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-07 16:20:00 +00:00
Gabriela Cervantes	5b5caf8908	gha: Add iperf network metrics This PR adds the iperf network metrics to the github actions for kata metrics. Fixes #7535 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-07 16:20:00 +00:00
Chelsea Mafrica	4559caf619	Merge pull request #7467 from ManaSugi/doc/use-k8-control-plane docs: Use control-plane term instead of master	2023-08-06 23:40:51 -07:00
Fabiano Fidêncio	b365bef570	Merge pull request #7191 from wedsonaf/avoid-clones agent: avoid unnecessary calls to `Arc::clone`	2023-08-06 15:34:07 +02:00
GabyCT	7144acb2a5	Merge pull request #7527 from GabyCT/topic/latency metrics: Add network latency test	2023-08-04 15:54:07 -06:00
Gabriela Cervantes	66db5b5350	metrics: Add latency test to network README This PR adds latency test to network README for kata metrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-04 20:27:27 +00:00
Wedson Almeida Filho	c36572418f	agent: avoid unnecessary calls to `Arc::clone` These calls cause two extra atomic instructions each time they're used, one to increment and another one to decrement the refcount. Since we don't need them because the referred value is guaranteed to outlive the function, remove the calls. Fixes: #7190 Signed-off-by: Wedson Almeida Filho <walmeida@microsoft.com>	2023-08-03 20:53:05 -03:00
Fabiano Fidêncio	8c03deac3a	Merge pull request #7106 from wedsonaf/image-pulling Image pulling on the host	2023-08-04 01:08:42 +02:00
Wedson Almeida Filho	4fbe0a3a53	runtime: bind-mount mounted block device into container When the mounted block device isn't a layer, we want to mount it into containers, but since it's already mounted with the correct fs (e.g., tar, ext4, etc.) in the pod, we just bind-mount it into the container. Fixes: #7536 Signed-off-by: Wedson Almeida Filho <walmeida@microsoft.com>	2023-08-03 17:58:39 -03:00
Wedson Almeida Filho	7e1b1949d4	runtime: add support for kata overlays When at least one `io.katacontainers.fs-opt.layer` option is added to the rootfs, it gets inserted into the VM as a layer, and the file system is mounted as an overlay of all layers using the overlayfs driver. Additionally, if the `io.katacontainers.fs-opt.block_device=file` option is present in a layer, it is mounted as a block device backed by a file on the host. Fixes: #7536 Signed-off-by: Wedson Almeida Filho <walmeida@microsoft.com>	2023-08-03 17:58:39 -03:00
Wedson Almeida Filho	6c867d9e86	agent: add io.katacontainers.fs-opt.overlay-rw option This causes the overlay-fs driver to add the `upperdir` and `workdir` options to an overlay-fs mount so that the mount becomes writable using a discardable directory under the container id. Fixes: #7536 Signed-off-by: Wedson Almeida Filho <walmeida@microsoft.com>	2023-08-03 17:58:39 -03:00
Wedson Almeida Filho	6163c35657	agent: skip mount options that start with "io.katacontainers." This is so that file systems don't fail when we pass kata-specific options from the snapshotter to kata. Fixes: #7536 Signed-off-by: Wedson Almeida Filho <walmeida@microsoft.com>	2023-08-03 17:58:39 -03:00
Fabiano Fidêncio	fa35afa982	Merge pull request #7542 from wedsonaf/ci-fix Use version 0.10.4 of `fuse-backend-rs`	2023-08-03 22:50:11 +02:00
Wedson Almeida Filho	b2ff97aa01	dragonball: use version 0.10.4 of `fuse-backend-rs` Version 0.10.5, which was just released, breaks `nydus-storage`. This is a workaround to fix the CI which is blocking other PRs. Fixes: #7541 Signed-off-by: Wedson Almeida Filho <walmeida@microsoft.com>	2023-08-03 14:15:17 -03:00
Fabiano Fidêncio	ebdae7cfdf	Merge pull request #7520 from jepio/host-systemctl kata-deploy: Use host's systemctl	2023-08-03 13:53:28 +02:00
Manabu Sugimoto	845eeb4d7b	agent: Allow clippy::redundant_clone in the unit tests Allow `clippy::redundant_clone` in the agent's unit tests because rustc>=1.70 reports errors that are false positives. These `clone()` calls are required because the subsequent code refers to the variable, but clippy, using a conservative and limited approach, analyzes them incorrectly. Ref. https://rust-lang.github.io/rust-clippy/master/index.html#/redundant_clone Fixes: #7534 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2023-08-03 19:07:40 +09:00
Fabiano Fidêncio	e2755a47b8	Merge pull request #7524 from fidencio/revert-kata-deploy-changes-after-3.2.0-rc0-release release: Revert kata-deploy changes after 3.2.0-rc0 release	2023-08-03 11:28:43 +02:00
Fabiano Fidêncio	1163fc9de2	release: Revert kata-deploy changes after 3.2.0-rc0 release As 3.2.0-rc0 has been released, let's switch the kata-deploy / kata-cleanup tags back to "latest", and re-add the kata-deploy-stable and the kata-cleanup-stable files. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-03 10:08:20 +02:00
Xuewei Niu	3958a39d07	runtime-rs: Introduce directly attachable network Kata containers, as VM-based containers, are allowed to run in the host netns. That is, the network can be isolated at L2. Network performance benefits from this architecture, which eliminates as many hops as possible. We call it a Directly Attachable Network (DAN for short). The network devices are placed in the host netns by the CNI plugins. The configs are saved at {dan_conf}/{sandbox_id}.json in JSON format, including device name, type, and network info. At this initial stage, DAN only supports host tap devices. More devices, like DPDK, will be supported in later versions. The file format looks like this: ```json { "netns": "/path/to/netns", "devices": [{ "name": "eth0", "guest_mac": "xx:xx:xx:xx:xx", "device": { "type": "vhost-user", "path": "/tmp/test", "queue_num": 1, "queue_size": 1 }, "network_info": { "interface": { "ip_addresses": ["192.168.0.1/24"], "mtu": 1500, "ntype": "tuntap", "flags": 0 }, "routes": [{ "dest": "172.18.0.0/16", "source": "172.18.0.1", "gateway": "172.18.31.1", "scope": 0, "flags": 0 }], "neighbors": [{ "ip_address": "192.168.0.3/16", "device": "", "state": 0, "flags": 0, "hardware_addr": "xx:xx:xx:xx:xx" }] } }] } ``` Fixes: #1922 Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-08-03 15:33:34 +08:00
David Esparza	7d1c48c881	Merge pull request #7530 from dborquez/fix_check_running_processes metrics: stop kata components before start a metric test.	2023-08-02 23:51:27 -06:00
Zhongtao Hu	e719423262	Merge pull request #7127 from cmaf/runtime-rs-ch-blk-2 runtime-rs: Add block device handling for cloud hypervisor	2023-08-03 09:46:32 +08:00
David Esparza	1e15369e59	metrics: Improve naming testing containers in launch times test This commit provides a new way to name the containers used in the launch-times-test in this form: 'kata_launch_times_RANDOM_NUMBER', where RANDOM_NUMBER is in the 0-1000 range. Fixes: #7529 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-08-02 17:04:55 -06:00
David Esparza	5dbe88330f	metrics: Clean kata components before start a metric test. This PR kills all kata components before starting a new metric test. Fixes: #7528 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-08-02 17:04:51 -06:00
Fabiano Fidêncio	d424f3c595	Merge pull request #7523 from fidencio/3.2.0-rc0-branch-bump # Kata Containers 3.2.0-rc0	2023-08-02 20:04:37 +02:00
Zvonko Kaiser	cf8899f260	Merge pull request #7494 from zvonkok/vfio-mode vfio: Fix vfio device ordering	2023-08-02 19:45:22 +02:00
Gabriela Cervantes	3b45060b61	metrics: Add latency server yaml This PR adds latency server yaml for kubernetes test. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-02 16:52:17 +00:00
Gabriela Cervantes	9bb8451df5	metrics: Add latency client yaml This PR adds latency client yaml for the kubernetes test. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-02 16:50:51 +00:00
Gabriela Cervantes	64fdb98704	metrics: Add network latency test This PR adds network latency test for kata metrics. Fixes #7526 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-02 16:46:48 +00:00
Chelsea Mafrica	a81ad3b587	runtime-rs: Add block device handling in cloud hypervisor Add functions for adding a block device to a container for CH. Fixes #6690 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2023-08-02 09:18:48 -07:00
David Esparza	542012c8be	Merge pull request #7503 from GabyCT/topic/ghafio metrics: Add FIO test to gha for kata metrics CI	2023-08-02 10:05:09 -06:00
David Esparza	5979f3790b	Merge pull request #7516 from GabyCT/topic/addiperf metrics: Add iperf3 network test	2023-08-02 10:04:51 -06:00
Fabiano Fidêncio	006ecce49a	release: Kata Containers 3.2.0-rc0 - ci-on-push: Make the CI also run for the stable-* branches - ci: k8s: Do not fail when gathering info on AKS nodes - kata-deploy: enable cross build for non-x86 - runtime-rs: add support for gather metrics in runtime-rs - kata-ctl: add monitor subcommand for runtime-rs - release: release-note.sh: Fix typos and reference to images - metrics: Add sysbench performance test - Simplify implementation of runtime-rs/service `6ad16d497` release: Adapt kata-deploy for 3.2.0-rc0 `025596b28` ci-on-push: Make the CI also run for the stable-* branches `7ffc0c122` static-build: enable cross build for qemu `35d6d86ab` static-build: enable cross-build for image build `2205fb9d0` static-build: enable cross build for virtiofsd `11631c681` static-build: enable cross build for shim-v2 `7923de899` static-build: cross build kernel `e2c31fce2` kata-deploy: enable cross build for kata deploy script `2fc5f0e2e` kata-depoly: prepare env for cross build in lib.sh `f5e9985af` release: release-note.sh: Fix typos and reference to images `f910c66d6` ci: k8s: Do not fail when gathering info on AKS nodes `632818176` metrics: Add k8s sysbench documentation `b3901c46d` runtime-rs: ignore errors during clean up sandbox resources `5a1b5d367` metrics: Add sysbench pod yaml `ad413d164` metrics: Add sysbench dockerfile `151256011` metrics: Add sysbench performance test `62e328ca5` runtime-rs: refine implementation of TaskService `458e1bc71` runtime-rs: make send_message() as an method of ServiceManager `1cc1c81c9` runtime-rs: fix possibe bug in ServiceManager::run() `1a5f90dc3` runtime-rs: simplify implementation of service crate `731e7c763` kata-ctl: add monitor subcommand for runtime-rs The previous kata-monitor in golang could not communicate with runtime-rs to gather metrics due to different sandbox addresses. This PR adds the subcommand monitor in kata-ctl to gather metrics from runtime-rs and monitor itself. 
`d74639d8c` kata-ctl: provide the global TIMEOUT for creating MgmtClient `02cc4fe9d` runtime-rs: add support for gather metrics in runtime-rs Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-02 16:59:41 +02:00
Fabiano Fidêncio	6ad16d4977	release: Adapt kata-deploy for 3.2.0-rc0 kata-deploy files must be adapted to a new release. The cases where it happens are when the release goes from -> to: * main -> stable: * kata-deploy-stable / kata-cleanup-stable: are removed * stable -> stable: * kata-deploy / kata-cleanup: bump the release to the new one. There are no changes when doing an alpha release, as the files on the "main" branch always point to the "latest" and "stable" tags. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-02 16:59:41 +02:00
Fabiano Fidêncio	4e812009f5	Merge pull request #7519 from fidencio/topic/gha-ci-run-on-stable-branches ci-on-push: Make the CI also run for the stable-* branches	2023-08-02 16:13:06 +02:00
Jeremi Piotrowski	3230dec950	kata-deploy: Use host's systemctl when interacting with systemd. We have occasionally faced issues with compatibility between the systemctl version used inside the kata-deploy container and the systemd version on the host. Instead of using a containerized systemctl with bind mounted sockets, nsenter the host and run systemctl from there. This provides less coupling between the kata-deploy container and the host. Fixes: #7511 Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-08-02 15:32:01 +02:00
Fabiano Fidêncio	29855ed0c6	Merge pull request #7510 from fidencio/topic/ci-k8s-aks-do-not-fail-gathering-info ci: k8s: Do not fail when gathering info on AKS nodes	2023-08-02 09:44:19 +02:00
Fabiano Fidêncio	025596b289	ci-on-push: Make the CI also run for the stable-* branches As we only support one stable branch, it'll be used as part of the stable-3.2 and onwards. Fixes: #7518 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-02 09:26:24 +02:00
Fabiano Fidêncio	e1a69c0c92	Merge pull request #6586 from jongwu/cross_build kata-deploy: enable cross build for non-x86	2023-08-02 09:11:56 +02:00
Fupan Li	1a6b27bf6a	Merge pull request #5797 from Yuan-Zhuo/add-metrics-for-runtime-rs runtime-rs: add support for gather metrics in runtime-rs	2023-08-02 13:40:22 +08:00
Fupan Li	a536d4a7bf	Merge pull request #6672 from Yuan-Zhuo/add-monitor-in-kata-ctl kata-ctl: add monitor subcommand for runtime-rs	2023-08-02 13:39:02 +08:00
Gabriela Cervantes	ad6e53c399	metrics: Modify boot time values This PR modifies boot time values limit. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-01 23:34:15 +00:00
Jianyong Wu	7ffc0c1225	static-build: enable cross build for qemu Relying on the multiarch feature of Ubuntu, we can set up a cross build environment easily and achieve build performance as good as a native build. Fixes: #6557 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2023-08-01 23:28:52 +02:00
Jianyong Wu	35d6d86ab5	static-build: enable cross-build for image build Cross building the agent with docker buildx takes too long, so we cross build the rootfs in a container with a gcc and rust cross compile toolchain with musl libc. This gives us a build as fast as a native build. The rootfs initrd cross build is disabled, as no cross compile toolchain for rust with musl libc is found for alpine, and building with docker buildx takes too long. Fixes: #6557 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2023-08-01 23:28:52 +02:00
Gabriela Cervantes	f764248095	gha: Add FIO test to run metrics yaml This PR adds FIO test to run metrics yaml. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-01 20:29:16 +00:00
Jianyong Wu	2205fb9d05	static-build: enable cross build for virtiofsd Use messense/rust-musl-cross, which offers a musl libc cross build environment, to cross compile virtiofsd. Fixes: #6557 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2023-08-01 22:10:46 +02:00
Jianyong Wu	11631c681a	static-build: enable cross build for shim-v2 shim-v2 has go and rust code. For the rust code, we use messense/rust-musl-cross to speed up the build, as it doesn't depend on qemu emulation. The go code is built with docker buildx, as it doesn't support cross build now. Fixes: #6557 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2023-08-01 22:10:46 +02:00
Jianyong Wu	7923de8999	static-build: cross build kernel Prepare cross build environment based on current Dockerfile. Fixes: #6557 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2023-08-01 22:10:46 +02:00
Jianyong Wu	e2c31fce23	kata-deploy: enable cross build for kata deploy script kata-deploy-binaries-in-docker.sh is the entry point to build kata components. Set some environment variables to facilitate the following cross build work. Fixes: #6557 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2023-08-01 22:10:46 +02:00
Jianyong Wu	2fc5f0e2e0	kata-depoly: prepare env for cross build in lib.sh We leverage three environment variables: TARGET_ARCH is the build target tuple; ARCH has nearly the same meaning as TARGET_ARCH but has been widely used in kata; CROSS_BUILD indicates whether to cross compile. Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2023-08-01 22:10:46 +02:00
Fabiano Fidêncio	c0171ea0a7	Merge pull request #7508 from fidencio/topic/fix-release-notes-typos-and-references release: release-note.sh: Fix typos and reference to images	2023-08-01 22:05:32 +02:00
Gabriela Cervantes	58f9a57c20	metrics: Add network reference to general README metrics This PR adds network reference to the general metrics README. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-01 16:54:00 +00:00
Gabriela Cervantes	07694ef3ae	metrics: Add Kata Containers network metrics README This PR adds the Kata Containers network metrics README. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-01 16:49:09 +00:00
Gabriela Cervantes	d8439dba89	metrics: Add iperf3 deployment yaml This PR adds the iperf3 deployment yaml. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-01 16:45:01 +00:00
Gabriela Cervantes	bda83cee5d	metrics: Add iperf3 daemonset for k8s This PR adds the iperf3 daemonset for k8s. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-01 16:42:15 +00:00
Gabriela Cervantes	badff23c71	metrics: Add iperf3 service yaml for k8s This PR adds the iperf3 service yaml for k8s. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-01 16:37:19 +00:00
Gabriela Cervantes	27c02367f9	metrics: Add iperf3 network test This PR adds the iperf3 benchmark test for kata metrics. Fixes #7515 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-08-01 16:30:46 +00:00
GabyCT	a0a524efc2	Merge pull request #7486 from kata-containers/topic/addsysbench metrics: Add sysbench performance test	2023-08-01 10:17:48 -06:00
Fabiano Fidêncio	f5e9985afe	release: release-note.sh: Fix typos and reference to images diferent -> different And also let's make sure we escape the backticks around the kata-deploy environment variables, otherwise bash will try to interpret those. Fixes: #7497 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-01 12:42:03 +02:00
Fabiano Fidêncio	f910c66d6f	ci: k8s: Do not fail when gathering info on AKS nodes Otherwise the VM deletion may not happen, leaving us with several machines behind. Fixes: #7509 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-08-01 12:36:33 +02:00
Manabu Sugimoto	1b21a46246	docs: Use control-plane term instead of master Replace `master` with `control-plane` in the context of K8s because `master` is a legacy term and is no longer used. Ref. https://github.com/kubernetes/enhancements/tree/master/keps/sig-cluster-lifecycle/kubeadm/2067-rename-master-label-taint Fixes: #7466 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2023-08-01 17:41:40 +09:00
Chao Wu	1a94aad44f	Merge pull request #7480 from jiangliu/rt-service Simplify implementation of runtime-rs/service	2023-08-01 16:05:33 +08:00
Chao Wu	2d13e2d71c	Merge pull request #7504 from fidencio/topic/gha-release-fix-upload-versions-yaml release: Fix upload-versions-yaml	2023-08-01 13:58:07 +08:00
GabyCT	b77d69aeee	Merge pull request #7396 from GabyCT/topic/addghatensorflow metrics: Enable Tensorflow metrics for kata CI	2023-07-31 17:13:24 -06:00
Fabiano Fidêncio	743291c6c4	release: Fix upload-versions-yaml This requires the GITHUB_UPLOAD_TOKEN. While we're here, let's also fix the name of the action and remove the "-tarball" suffix, as it's not really a tarball. Fixes: #7497 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-31 23:57:33 +02:00
Fabiano Fidêncio	a71d35c764	Merge pull request #7499 from fidencio/topic/gha-release-ensure-stage-is-defined-for-amr64-s300x gha: release: `stage` must be defined for arm64 / s390x yamls	2023-07-31 22:55:54 +02:00
Gabriela Cervantes	6328181762	metrics: Add k8s sysbench documentation This PR adds k8s sysbench documentation at general density documentation. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-31 20:28:37 +00:00
Chelsea Mafrica	f74b7aba18	Merge pull request #7488 from cmaf/docs-k8s-links docs: Update links for pods and kubelet	2023-07-31 12:44:24 -07:00
Gabriela Cervantes	8933d54428	metrics: Add FIO to gha run script This PR adds FIO to gha run script. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-31 17:51:11 +00:00
Gabriela Cervantes	8a584589ff	metrics: Add DAX FIO README This PR adds DAX FIO README information. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-31 17:42:44 +00:00
Gabriela Cervantes	21f5b65233	metrics: Add FIO information in storage general README This PR adds FIO information in storage general README. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-31 17:33:39 +00:00
Gabriela Cervantes	69f05cf9e6	metrics: Add FIO general README This PR adds FIO general README information. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-31 17:30:05 +00:00
Gabriela Cervantes	87d41b3dfa	metrics: Add FIO test to gha for kata metrics CI This PR adds FIO test to gha for kata metrics CI. Fixes #7502 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-31 16:50:16 +00:00
Pavel Mores	28e5e9c86e	runtime-rs: fix number of queues handling in dragonball share fs device Looks like a copy/paste error... Fixes #7501 Signed-off-by: Pavel Mores <pmores@redhat.com>	2023-07-31 17:25:47 +02:00
Fabiano Fidêncio	ff8d7e7e41	Merge pull request #7496 from fidencio/topic/topic/kata-deploy-take-nfd-into-consideration-pre-work k8s: Rely on the USING_NFD environment variable passed by the jobs	2023-07-31 14:56:15 +02:00
Fabiano Fidêncio	1b111a9aab	gha: release: `stage` must be defined for arm64 / s390x yamls `stage` has been added, but only hooked up to the amd64 logic, leaving arm64 and s390x behind. Let's fix this right now, and make sure no error occurs when passing this down to the yaml files. Fixes: #7497 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-31 14:41:35 +02:00
Fabiano Fidêncio	684a6e1a55	Revert "gha: release: `stage` must be a string" This reverts commit `7c857d38c1`. I've misunderstood the error given by github action, let's fix this in the next commit. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-31 14:37:52 +02:00
Fabiano Fidêncio	99711f107f	Merge pull request #7498 from fidencio/topic/gha-release-stage-must-be-a-string gha: release: `stage` must be a string	2023-07-31 14:32:47 +02:00
Fabiano Fidêncio	7c857d38c1	gha: release: `stage` must be a string Otherwise we'll face the following error as part of our GHA: ``` The workflow is not valid. kata-containers/kata-containers/.github/workflows/release-$foo.yaml (Line: 13, Col: 14): Invalid input, stage is not defined in the referenced workflow. ``` Fixes: #7497 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-31 13:39:13 +02:00
Fabiano Fidêncio	28e171bf73	Merge pull request #7490 from fidencio/3.2.0-alpha4-branch-bump # Kata Containers 3.2.0-alpha4	2023-07-31 13:34:15 +02:00
Fabiano Fidêncio	91e1e612c3	k8s: Rely on the USING_NFD environment variable passed by the jobs Let's make sure we can rely on the tests passing down whether they want to be tested using Node Feature Discovery or not. Right now, only the TDX job has this option set to "true", all the other jobs have this option set to "false". We can and have to merge this one before merging the NFD related patches as: 1) It causes no harm to export this environment variable without having it used 2) It will allow us to test the NFD after this one is merged, as changes in the yaml file, in the case of the pull_request_target event, are not taken into consideration before they're merged Fixes: #7495 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-31 13:30:18 +02:00
Zvonko Kaiser	cddcde1d40	vfio: Fix vfio device ordering If modeVFIO is enabled, we need to attach the VFIO control group device /dev/vfio/vfio first and the actual device(s) afterwards. Sort the devices so that device #1 is the VFIO control group device, followed by the actual device(s) /dev/vfio/<group>. Fixes: #7493 Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-07-31 11:26:27 +00:00
Fabiano Fidêncio	7edc7172c0	release: Kata Containers 3.2.0-alpha4 - tests: Add `k8s-volume` and `k8s-file-volume` tests to GHA CI - metrics: Update boot time for kata metrics - metrics: Add FIO report files for kata metrics - kata-deploy: Allow runtimeclasses to be created by the daemonset - runtime-rs: change block index to 0 - agent: fix typo in constant - metrics: Add FIO benchmark for metrics tests - gha: dragonball: Run only on the dragonball labeled machine - tests: Fix `k8s-job` test - agent,libs: Remove unused 'mut' keywords - runtime-rs: remove unneeded 'mut' keywords - tests: QoL improvements for running tests locally - agent: exclude symlinks from recursive ownership change - cache: kernel: Fix kernel caching - runk: Add Docker guide to README - metrics: General improvements to json.bash script - kata-deploy: Allow shim creation based on what's passed to the daemonset - gha: ci: Add skeleton of vfio job - s390x: Fixing device.Bus assignment - release: Mention the container images used to build the project - kata-deploy-binaries: kernel_cache: Take module_dir into account - ci: nydus: Fix typo in "source" - gha: ci: Add no-op nydus tests to our CI - Dragonball: migrate dragonball-sandbox crates to Kata - ci: gha: Add cri-containerd tests (but still do not enable them) - packaging/tools: Add kata-debug and use it as part of our CI - cache: kernel: Consider changes in tools/packaging/kernel - kata-deploy: Properly get the path of the versions.yaml file - kata-deploy: Add VERSION and versions.yaml to the final tarball - metrics: Add C-Ray performance test - metrics: enable TensorFlow benchmark to be run on gha - metrics: Add function to memory inside container script - Revert "metrics: Replace backslashes used to escape double quoted key in jq expr" - versions: Bump virtiofsd to v1.7.0 - metrics: stop hypervirsor and shim at init_env stage - ci: k8s: Adapt "source ..." 
to the new location of gha-run.sh - ci: Move `tests/integration/gha-run.sh` to `tests/integration/kuberentes/` ... and also remove KUBECONFIG from the tdx envs - versions: Update kernel to version v6.1.x - agent: Fix exec hang issues with a backgroud process - agent: Ignore already mounted dev/fs/pseudo-fs - ci: k8s: Bring TDX tests back - metrics: Update machine learning documentation - gha: ci: cri-containerd: Fix KATA_HYPERVSIOR typo - tests: Add MobileNet Tensorflow performance benchmark - metrics: replace backslashes used to escape double quoted jq key expr. - runtime-rs: enhancement of Device Manager for network endpoints. - feat(Tracing): tracing in Rust runtime - runtime-rs: ignore unconfigured network interfaces - metrics: Stop running kata-env before kata is properly installed. - metrics: use rm -f to remove the oldest continerd config file. - kernel: Update kernel config name - kata-deploy: Add a debug option to kata-deploy (and also use it as part of our CI) - runtime-rs: add parameter for propagation of (u)mount events - kata-ctl: Move GuestProtection code to kata-sys-util - tests: Add function before function name in common.bash for metrics - tests: Add metrics storage documentation - metrics: Fix metrics ts generator to treat numbers as decimals - gha: ci: Add cri-containerd tests skeleton -- follow up 1 - dragonball/agent: Add some optimization for Makefile and bugfixes of unit tests on aarch64 - metrics: Enable blogbench test - tests: Add machine learning performance tests - tests: gha: ci: Add cri-containerd tests skeleton - metrics: Enable memory inside container metrics - tools: Use a consistent target name when building mariner initrd - gha: ci: Gather info about the node / pods - runtime-rs: Do not scan network if network model is "none" - gha: k8s: tdx: Temporarily disable TDX tests - metrics: Update memory usage script - gha: Cancel previous jobs if a PR is updated - gha: nightly: Fix long name of AKS clusters issue and make the CI easier to 
test - README: Add badge for our Nightly CI - gha: Do not run all the tests if only docs are updated - bugfix: plus default_memory when calculating mem size - gha: ci: Use github.sha to get the last commit reference - dragonball: Don't fail if a request asks for more CPUs than allowed - gha: ci: Fix refernce passed to checkout@v3 - gha: ci: Avoid using env also in the ci-nightly and payload-after-push - gha: k8s: Ensure cluster doesn't exist before creating it - gha: ci: More follow up fixes after adding a nightly CI - tests: Enable running k8s tests on Mariner - gha: ci: Avoid using env unless it's really needed - gha: ci: Follow up fixes for the nightly jobs - tests: Enable memory usage metrics tests - gha: Add nightly jobs - metrics: storing metrics workflow artifacts - gha: k8s: Ensure tests are running on a specific namespace - metrics: Adds blogbench and webtool metrics tests - gha: dragonball: Correctly propagate PATH update - versions: Upgrade to Cloud Hypervisor v33.0 - Convert `is_allowed`, `ttrpc_error` and `sl` to functions - gha: release: Use a specific release of hub - metrics: Add checkmetrics to gha-run.sh for metrics CI - packaging: Fix indentation of build.sh script at ovmf - doc: Add documentation for the virtualization reference architecture - gpu: Update kernel building to the latest changes - runtime: fix PCIe topology for GPUDirect use-case - metrics: Add memory footprint tests - runtime: Add "none" as a shared_fs option - metrics: Uniformity across function names in gha-run.sh - runtime-rs: support physical endpoint using device manager - runtime-rs: bugfix for direct volume path's validation. - metrics: Fix retrieving hypervisor version on metrics - runtime-rs: fix build error on AArch64 - checkmetrics: Add checkmetrics makefile and documentation - docs: Add boot time metrics documentation - runtime-rs: add support spdk/vhost-user based volume. 
- static-build: Remove kata-version parameter - dragonball: avoid obtaining lock twice in create_stdio_console - metrics: Add checkmetrics for kata metrics CI - metrics: enable launch-times test on gha-run metrics script - docs: Add general metrics documentation - add support vfio device manager - gha: Don't automatically trigger CI - kata-ctl: Check for vm capability - docs: fix spelling of "crate" - packaging: Fix indentation in init.sh script - gha: Fix gha actions - metrics: install kata and launch-times test - tests: Move tests helper script to this repo - tests: Add json script for metrics tests - Cherry pick initramfs caching updates from CCv0 - gha: Fix format for run launchtimes metrics yaml - tests: Add tests lib common script - Fix deprecated virtiofsd args (go shim only) - gha: Add base branch on SHA on pull requst - gha: ci-on-push: Run metrics tests - docs: Update Developer Guide - runtime-rs: Enhance flexibility of virtio-fs config - versions: Update firecracker version to 1.3.3 - tools: Fix no-op builds - runtime-rs: update Cargo.lock - gha: Fix `stage` definition in matrix - feat(runtime): vcpu resize capability - packaging: Remove snap package - gha: Add new build targets for Mariner - Dragonball: support resize memory - Port Measured rootfs feature from CCv0 branch to main - add support direct volume and refactor device manager - gha: Fix gha-run.sh and unbreak CI - kata-ctl: Switch to slog logging; add --log-level and --json-logging arguments - log-parser: Update log parser link at README - gha: aks: Extract `run` commands to a script - runtime-rs: handle copy files when share_fs is not available - agent-ctl: fix the compile error - agent: fix the issue of exec hang with a backgroud process - runtime-rs: bugfix: update Cargo.lock - gha: aks: Use short SHA in cluster name - README: Display badge for the "Publish Artefacts" job and update the Kata Containers logo - kata-deploy: Change how we get the Ubuntu k8s key - gha: aks: Ensure host_os is 
used everywhere needed - kubernetes: add agnhost command in pod yaml - main \| release: Standardize kata static file name - packaging: make BUILDER_REGISTRY configurable - gha: aks: Add the host_os as part of the aks cluster's name - kernel: Modify build-kernel.sh to accomodate for changes in version.yaml - gha: Fix Mariner cluster creation - gha: Unbreak CI and fix cluster creation step - Dragonball: support vcpu hotplug on aarch64 - runtime-rs/sandbox_bindmounts: add support for sandbox bindmounts - runtime-rs/kata-ctl: Enhancement of DirectVolumeMount. - gha: Create Mariner host as part of k8s tests - netlink: Fix the issue of update_interface - gha: Increase timeout for AKS jobs and give more time to start running the tests - runtime: sending SIGKILL to qemu - dragonball: convert BlockDeviceMgr and VirtioNetDeviceMgr functions to methods - dragonball: Remove virtio-net and vsock devices gracefully - kata-deploy: Improve shim backup / restore - doc: Update git commands - kata-deploy: Fix indentation on kata deploy merge script `8353aae41` ci: k8s: Rework get_nodes_and_pods_info() `6ad5d7112` ci: k8s: Do not gather node info before running the tests `5261e3a60` ci: k8s: Group messages to improve readability `9cc6b5f46` ci: k8s: Get logs from kata-deploy `9d285c622` ci: k8s: Let kata-deploy take care of the runtimeclasses `87568ed98` gha: Test split out runtimeclasses are in sync with all-in-one file `39192c608` kata-deploy: Print variables passed to the script `0e157be6f` kata-deploy: Allow runtimeclasses to be created by the daemonset `a27433324` kata-deploy: Change default values of DEBUG `69535b808` kata-deploy: runtimeclass: Split out entries `9e1710674` kata-runtimeClasses: Alphabetically sort the enrties `6222bd910` tests: Add k8s-file-volume test `187a72d38` tests: Add k8s-volume test `0c8427035` metrics: Add boot time value for qemu `6520dfee3` metrics: Update boot time for kata metrics `ff2279061` metrics: Update runtime and configuration paths 
`a5d4e3388` metrics: Add compare virtiofsd dax script `5e937fa62` metrics: Update general FIO tests `b0bea47c5` metrics: Add makefile to report generator `73c57b9a1` metrics: Add FIO report files for kata metrics `c8fcd29d9` runtime-rs: use device manager to handle virtio-pmem `901c19225` runtime-rs: support configure vm_rootfs_driver `5d6199f9b` runtime-rs: use device manager to handle vm rootfs `20f1f62a2` runtime-rs: change block index to 0 `662f87539` metrics: Add general FIO makefile `c5a87eed2` tests: gha: Add timeout to cluster creation `6daeb08e6` tests: k8s: Clean up node debuggers after running `3aa6c77a0` gha: dragonball: Run only on the dragonball labeled machine `37641a543` metrics: Add example config for fio jobs `314aec73d` agent: fix typo in constant `4703434b1` tests: k8s: Allow using custom resource group `350f3f70b` tests: Import `common.bash` in `run_kubernetes_tests.sh` `d7f04a64a` tests: k8s: Leave `runtimeclass_workloads/` alone `bdde6aa94` tests: k8s: Split deployment and testing commands `91a0b3b40` tests: aks: Simply delete cluster when cleaning up `3c1044d9d` metrics: Update FIO paths for k8s runner `6177a0db3` metrics: Add env files for FIO `a45900324` metrics: Add fio exec `ea198fddc` metrics: Add FIO runner k8s `8f7ef41c1` metrics: Add FIO vendor code `6293c17bd` metrics: Add FIO benchmark for metrics tests `ff4cfcd8a` runk: Add Docker guide to README `c8ac56569` cache: kernel: Harmonize commit with fetching side `81775ab1b` cache: kernel: Fix SEV kernel caching `717f775f3` gha: ci: Add skeleton of vfio job `b9f100b39` agent,libs: Remove unused 'mut' keywords `a56f96bb2` kata-deploy: Allow shim creation based on what's passed to the daemonset `4a5ab38f1` metrics: General improvements to json.bash script `d4eba3698` kata-deploy-binaries: kernel_cache: Take module_dir into account `b7c9867d6` release: Mention the container images used to build the project `7c4b59781` ci: nydus: Fix typo in "source" `6a680e241` gha: ci: Add placeholder 
for the nydus tests as part of the CI `fb4f7a002` gha: nydus: Add a no-op GHA for nydus `4a207a16f` gha: nydus: Bring tests as they are from the tests repo `2c8f83424` runtime-rs: remove unneeded 'mut' keywords `1fc715bc6` s390x: Add AP Attach/Detach test `e91f5edba` ci: cri-containerd: Fix default typo for testContainerStart() `8b8aef09a` ci: cri-containerd: Temporarily disable TestContainerSwap `56767001c` ci: cri-containerd: Add namespace / uid to the pods `a84773652` ci: cri-containerd: Always use sudo to call crictl `99ba86a1b` ci: cri-containerd: Add /usr/local/go/bin to the PATH `7f3b30999` ci: cri-containerd: Add `function` before each function `fde22d6bc` ci: cri-containerd: Assume podman is always used `9465a0496` ci: cri-containerd: Adapt "source ..." to this repo `df8d14411` ci: cri-containerd: Remove CI variable `f90570aef` ci: cri-containerd: Remove unused runc_runtime_bin `c3637039f` ci: cri-containerd: Remove KILL_VMM_TEST env var `bc4919f9b` ci: cri-containerd: Always run shim-v2 tests `f9e332c6d` ci: cri-containerd: Stop cloning containerd `cfd662fee` ci: cri-containerd: Remove ununsed SNAP_CI var `d36c3395c` ci: cri-containerd: Update copyright `b5be8a4a8` ci: cri-containerd: Move integration-tests.sh as it was `f2e00c95c` ci: cri-containerd: Populate install_dependencies() `897955252` versions: Add "latest" field for cri-tools `1bbcbafa6` ci: Add clone_cri_container() `f66c68a2b` ci: Add install_cri_tools() `4dd828414` ci: Add install_cri_containerd() `ad47d1b9f` ci: Add download_github_project_tarball() `788c562a9` ci: Add get_latest_patch_release_from_a_github_project() `6742f3a89` ci: Use `function` before each install_go.sh function `5eacecffc` ci: Adjust paths for install_go.sh `8ed1595f9` ci: Update copyright for install_go.sh `6123d0db2` ci: Move install_go.sh as it was `8653be71b` ci: Do not take cross-build into consideration for kata-arch.sh `6a76bf92c` ci: Fix style / identation if kata-arch.sh `72743851c` ci: Add `function` before 
each kata-arch.sh function `9f6d4892c` ci: Update copyright for kata-arch.sh `6f73a7283` ci: Move kata-arch.sh as it was `3615d7343` ci: Add get_from_kata_deps() `34779491e` gha: kubernetes: Avoid declaring repo_root_dir `f3738beac` tests: Use $HOME/go as fallback for $GOPATH `b87ed2741` tests: Move `ensure_yq` to common.bash `124e39033` tests: common: Fix quoting when globbing `db77c9a43` tests: Make install_kata take care of the links `13715db1f` tests: Do not call `install_check_metrics` when installing kata `630634c5d` ci: k8s: Group logs to make them easier to read `228b30f31` ci: k8s: Gather node info during the cleanup `81f99543e` ci: k8s: Cleanup cluster before deleting it `38a7b5325` packaging/tools: Add kata-debug `ae6e8d2b3` kata-deploy: Properly get the path of the versions.yaml file `309e23255` cache: kernel: Consider changes in tools/packaging/kernel `59fdd69b8` kata-deploy: Add VERSION and versions.yaml to the final tarball `5dddd7c5d` release: Upload versions.yaml as part of the release `bad3ac84b` metrics: Rename C-Ray to cpu performance tests `87d99a71e` versions: Remove "kernel-experimental" `545de5042` vfio: Fix tests `62aa6750e` vfio: Added better handling of VFIO Control Devices `dd422ccb6` vfio: Remove obsolete HotplugVFIOonRootBus `114542e2b` s390x: Fixing device.Bus assignment `371a118ad` agent: exclude symlinks from recursive ownership change `e64edf41e` metrics: Add tensorflow function in gha-run script `67a6fff4f` metrics: Enable tensorflow benchmark on gha `01450deb6` Revert "metrics: Replace backslashes used to escape double quoted key in jq expr." `843006805` metrics: Add function to memory inside container script `bbd3c1b6a` Dragonball: migrate dragonball-sandbox crates to Kata `fad801d0f` ci: k8s: Adapt "source ..." 
to the new location of gha-run.sh `55e2f0955` metrics: stop hypervirsor and shim at init_env stage `556e663fc` metrics: Add disk link to general metrics README `98c121709` metrics: Add C-Ray README `8e7d9926e` metrics: Add C-Ray Dockerfile `e2ee76978` metrics: Add C-Ray performance test `2ee2cd307` ci: k8s: Move gha-run.sh to the kubernetes dir `88eaff533` ci: tdx: Adjust KUBECONFIG `c09e268a1` versions: Downgrade SEV(-SNP) kernel back to v5.19.x `6a7a32365` versions: Bump virtiofsd to v1.7.0 `ac5f5353b` ci: k8s: Bring TDX tests back `950b89ffa` versions: Update kernel to version v6.1.38 `8ccc1e5c9` metrics: Update machine learning documentation `f50d2b066` gha: ci: cri-containerd: Fix KATA_HYPERVSIOR typo `620b94597` metrics: Add Tensorflow Mobilenet documentation `6c91af0a2` agent: Fix exec hang issues with a backgroud process `59f4731bb` metrics: Stop running kata-env before kata is properly installed. `468f017e2` metrics: Replace backslashes used to escape double quoted key in jq expr. `64f013f3b` ci: k8s: Enable debug when running the tests `8f4b1df9c` kata-deploy: Give users the ability to run it on DEBUG mode `2c8dfde16` kernel: Update kernel config name `150e54d02` runtime-rs: ignore unconfigured network interfaces `3ae02f920` metrics: use rm -f to remove older continerd config file. `a864d0e34` tests: Add tensorflow mobilenet dockerfile `788d2a254` tests: Add tensorflow mobilenet performance test `3fed61e7a` tests: Add storage link to general metrics documentation `b34dda4ca` tests: Add storage blogbench metrics documentation `6787c6390` runtime-rs: add parameter for propagation of (u)mount events `6e5679bc4` tests: Add function before function name in common.bash for metrics `62080f83c` kata-sys-util: Fix compilation errors `02d99caf6` static-checks: Make cargo clippy pass. 
`982420682` agent: Make the static checks pass for agent `61e4032b0` kata-ctl: Remove all utility functions to get platform protection `a24dbdc78` kata-sys-util: Move utilities to get platform protection `dacdf7c28` kata-ctl: Remove cpu related functions from kata-ctl `f5d195717` kata-sys-util: Move additional functionality to cpu.rs `304b9d914` kata-sys-util: Move CPU info functions `7319cff77` ci: cri-containerd: Add LTS / Active versions for containerd `2a957d41c` ci: cri-containerd: Export GOPATH `75a294b74` ci: cri-containerd: Ensure deps are installed `6924d14df` metrics: Fix metrics ts generator to treat numbers as decimals `9e048c8ee` checkmetrics: Add blogbench read value for qemu `2935aeb7d` checkmetrics: Add blogbench write value for qemu `02031e29a` checkmetrics: Add blogbench read value for clh `107fae033` checkmetrics: Add blogbench write value for clh `8c75c2f4b` metrics: Update blogbench Dockerfile `49723a9ec` metrics: Add double quotes to variables `dc67d902e` metrics: Enable blogbench test `438fe3b82` gha: ci: Add cri-containerd tests skeleton `bd08d745f` tests: metrics: Move metrics specific function to metrics gha-run.sh `3ffd48bc1` tests: common: Move a few utility functions to common.bash `7f961461b` tests: Add machine learning README `bb2ef4ca3` tests: Add `function` before each function `063f7aa7c` tests: Add Pytorch Dockerfile `1af03b9b3` tests: Add Pytorch performance test `4cecd6237` tests: Add tensorflow Dockerfile `c4094f62c` tests: Add metrics machine learning performance tests `89b622dcb` gha: k8s: tdx: Temporarily disable TDX tests `8c9d08e87` gha: ci: Gather info about the node / pods `283f809dd` runtime-rs: Enhancing Device Manager for network endpoints. 
`a65291ad7` agent: rustjail: update test_mknod_dev `46b81dd7d` agent: clippy: fix cargo clippy warnings `c4771d9e8` agent: Makefile: enable set SECCOMP dynamically `a88212e2c` utils.mk: update BUILD_TYPE argument `883b4db38` dragonball: fix cargo test on aarch64 `6822029c8` runtime-rs: Do not scan network if network model is "none" `ce54e43eb` metrics: Update memory usage script `fbc2a91ab` gha: Cancel previous jobs if a PR is updated `307cfc8f7` tools: Use a consistent target name when building mariner initrd `d780cc08f` gha: nightly: Also use `workflow_dispatch` to trigger it `b99ff3026` gha: nightly: Fix name size limit for AKS `aedc586e1` dragonball: Makefile: add coverage target `310e069f7` checkmetrics: Enable checkmetrics for memory inside test `1363fbbf1` README: Add badge for our Nightly CI `1776b18fa` gha: Do not run all the tests if only docs are updated `28c29b248` bugfix: plus default_memory when calculating mem size `0c1cbd01d` gha: ci: after-push: Use github.sha to get the last commit reference `37a955678` gha: ci: nightly: Use github.sha to get the last commit reference `ed23b47c7` tracing: Add tracing to runtime-rs `96e9374d4` dragonball: Don't fail if a request asks for more CPUs than allowed `38f0aaa51` Revert "gha: k8s: dragonball: Skip k8s-number-cpus" `828a72183` gha: k8s: dragonball: Skip k8s-oom `a79505b66` gha: k8s: dragonball: Skip k8s-number-cpus `275c84e7b` Revert "agent: fix the issue of exec hang with a backgroud process" `2be342023` checkmetrics: Add memory usage inside container value for qemu `6ca34f949` checkmetrics: Add memory inside container value for clh `6c6892423` metrics: Enable memory inside container metrics `0ad298895` gha: ci: Fix refernce passed to checkout@v3 `86904909a` gha: ci: Avoid using env also in the ci-nightly and payload-after-push `f72cb2fc1` agent: Remove shadowed function, add slog-term `1d05b9cc7` gha: ci: Pass down secrets to ci-on-push / ci-nightly `c5b4164cb` gha: ci: Fix tarball-suffix passed to the 
metrics tests `07810bf71` agent: Ignore already mounted dev/fs/pseudo-fs `11e3ccfa4` gha: ci: Avoid using env unless it's really needed `c45f646b9` gha: k8s: Ensure cluster doesn't exist before creating it `1a7bbcd39` gha: ci: Fix typo pull_requesst -> pull_request `ddf4afb96` gha: ci: Fix set-fake-pr-number job `8a0a66655` gha: ci: schedule expects a list, not a map `5c0269dc5` gha: ci: Add pr-number input to the correct job `de83cd9de` gha: ci: Use $VAR instead of ${{ env.VAR }} `6acce83e1` metrics: Fix the call to check_metrics function `e067d1833` gha: Add a nightly CI job `7c0de8703` gha: k8s: Ensure tests are running on a specific namespace `106e30571` gha: Create a re-usable `ci.yaml` file `cc3993d86` gha: Pass event specific info from the caller workflow `4e396e728` metrics: Add function keyword to to helper metrics functions `1ca17c2f7` metrics: storing metrics workflow artifacts `5a61065ab` checkmetrics: Add checkmetrics value for memory usage in qemu `78086ed1f` checkmetrics: Add memory usage value for clh `1c3dbafbf` metrics: Fix function of how to retrieve multiple values `18968f428` metrics: Add function to have uniformity `35d096b60` metrics: Adds blogbench and webtool metrics tests `d8f90e89d` metrics: Rename function at memory usage script `b9d66e0d5` metrics: Fix double quotes variables in memory usage script `476a11194` tests: Enable memory usage metrics tests `b568c7f7d` tests/integration: Provide default value for KATA_HOST_OS `d6e96ea06` tests/integration: Use AzureLinux instead of Mariner `40c46c75e` tests/integration: Perform yq install in run_tests() `d8b8f7e94` metrics: Enable launch tests time metrics `72fd562bd` gha: release: Use a specific release of hub `0502354b4` checkmetrics: Add checkmetrics json for qemu `b481ef188` makefile: Add -buildvcs=false flag to go build `e94aaed3c` ci_worker: Add checkmetrics ci worker for cloud hypervisor `917576e6f` metrics: Add double quotes in all variables `cc8f0a24e` metrics: Add checkmetrics to 
gha-run.sh for metrics CI `477856c1e` gha: dragonball: Correctly propagate PATH update `1c211cd73` gha: Swap asset/release in build matrix `0152c9aba` tools: Introduce `USE_CACHE` environment variable `2b5975689` tests: Build CLH with glibc for Mariner `80c78eadc` tests: Use baked-in kernel with Mariner `532755ce3` tests: Build Mariner rootfs initrd `6a21e20c6` runtime: Add "none" as a shared_fs option `5681caad5` versions: Upgrade to Cloud Hypervisor v33.0 `b2ce8b4d6` metrics: Add memory footprint tests to the CI `d035955ef` doc: Add documentation for the virtualization reference architecture `0f454d0c0` gpu: Fixing typos for PCIe topology changes `6bb2ea819` packaging: Fix indentation of build.sh script at ovmf `0504bd725` agent: convert the `sl` macros to functions `0860fbd41` agent: convert the `ttrpc_error` macro to a function `0e5d6ce6d` agent: convert the `is_allowed` macro to a function `f680fc52b` agent: change `AGENT_CONFIG`'s lazy type to just `AgentConfig` `beb706368` metrics: Uniformity across function names `1f3e837e4` runtime-rs: fix build error on AArch64 `6fd25968c` runtime-rs: bugfix for direct volume path's validation. `415578cf3` docs: Add general README `bff4672f7` runtime-rs: support physical endpoint using device manager `32cba7e44` metrics: Fix retrieving hypervisor version on metrics `aa7946de4` checkmetrics: Add general checkmetrics documentation `2fac2b72f` checkmetrics: Add checkmetrics makefile `e45899ae0` docs: Add time tests documentation reference `28130d3ce` docs: Add boot time metrics documentation `0df2fc270` runtime-rs: add support spdk/vhost-user based volume. 
`17198089e` vendor: Add vendor checkmetrics dependencies `f1dfea6e8` docs: Add metrics documentation reference `8330fb8ee` gpu: Update unit tests `859359424` metrics: enable launch-times test on gha-run metrics script `c4ee601bf` metrics: Add checkmetrics for kata metrics CI `e0d6475b4` gha: Don't automatically trigger CI `b535c7cbd` tests: Enable running k8s tests on Mariner `71071bdb6` docs: Add general metrics documentation `610f7986e` check: Relax the unrestricted_guest check when running in a VM `1b406b9d0` kata-ctl:Implement functionality to check host is capable of running VM `adf88eaa8` static-build: Remove kata-version parameter `09720babc` docs: fix spelling of "crate" `7185afc50` gha: Fix gha actions `21294b868` packaging: Fix indentation in init.sh script `fad3ac9f5` metrics: install kata and launch-times test `4bbfcfaf1` tests: Move tests helper script to this repo `f152f0e8c` metrics: Add launch-times to metrics tests `59510cfee` runtime-rs: add support vfio device based volume `1e3b372bb` runtime-rs: add support vfio device manager `6b0848930` gha: Fix format for run launchtimes metrics yaml `3cefa43e7` tests: Add json script for metrics tests `6a3710055` initramfs: Build dependencies as part of the Dockerfile `aa2380fdd` packaging: Add infra to push the initramfs builder image `1c7fcc6cb` packaging: Use existing image to build the initramfs `a43ea24df` virtiofsd: Convert legacy `-o` sub-options to their `--` replacement `8e00dc694` virtiofsd: Drop `-o no_posix_lock` `2a15ad978` virtiofsd: Stop using deprecated `-f` option `c3043a6c6` tests: Add tests lib common script `b16e0de73` gha: Add base branch on SHA on pull requst `72f2cb84e` gpu: Reset cold or hot plug after overriding `fbacc0964` gpu: PCIe topology, consider vhost-user-block in Virt `bc152b114` gha: ci-on-push: Run metrics tests `dad731d5c` docs: Update Developer Guide `b11246c3a` gpu: Various fixes for virt machine type `40101ea7d` vfio: Added annotation for hot(cold) plug `8f0d4e261` 
vfio: Cleanup of Cold and Hot Plug `b5c4677e0` vfio: Rearrange the bus assignemnt `b1aa8c8a2` gpu: Moved the PCIe configs to drivers `55a66eb7f` gpu: Add config to TOML `da42801c3` gpu: Add config settings tests for hot-plug `de39fb7d3` runtime: Add support for GPUDirect and GPUDirect RDMA PCIe topology `9318e022a` gpu: Add CC relates configs `b7932be4b` gpu: Add Arm64 Kernel Settings `211b0ab26` gpu: Update Kernel Config `5f103003d` gpu: Update kernel building to the latest changes `35e4938e8` tools: Fix no-op builds `347385b4e` runtime-rs: Enhance flexibility of virtio-fs config `21d227853` versions: Update firecracker version to 1.3.3 `0e2379909` gha: Fix `stage` definition in matrix `ae2cfa826` doc: add vcpu handlint doc for runtime-rs `7b1e67819` fix(clippy): fix clippy error `67972ec48` feat(runtime-rs): calculate initial size `aaa96c749` feat(runtime-rs): modify onlineCpuMemRequest `d66f7572d` feat(runtime-rs): clear cpuset in runtime side `a0385e138` feat(runtime-rs): update linux resource when stop_process `a39e1e6cd` feat(runtime-rs): merge the update_cgroups in update_linux_resources `fa6dff9f7` feat(runtime-rs): support vcpu resizing on runtime side `8cb4238b4` packaging: Remove snap package `213773998` runtime-rs: update Cargo.lock `56d2ea9b7` kata-ctl: Refactor kernel module check `9f7a45996` gha: Add `rootfs-initrd-mariner` build target `f28a62164` gha: Add `cloud-hypervisor-glibc` build target `8fb7ab751` dragonball: introduce virtio-balloon device `7ed949497` dragonball: introduce virtio-mem device `776a15e09` runtime-rs: add support direct volume. 
`a8e0f51c5` dragonball: extend DeviceOpContext `abae11404` runtime-rs: refactor device manager implementation `210a15794` dragonball: avoid obtaining lock twice in create_stdio_console `69668ce87` tests: gha-run: Use correct env variable for repo `f487199ed` gha: aks: Fix argument in call to gha-run.sh `f6afae9c7` packaging: Add rootfs-image-tdx-tarball target `f62b2670c` config: Add root hash value and measure config to kernel params `008058807` kernel: Integrate initramfs into Guest kernel `28b264562` initramfs: Add build script to generate initramfs `5cb02a806` image-build: generate root hash as an separate partition for rootfs `31c0ad207` packaging: Add cryptsetup support in Guest kernel and rootfs `980d084f4` log-parser: Update log parser link at README `410bc1814` agent-ctl: fix the compile error `77519fd12` kata-ctl: Switch to slog logging; add --log-level, --json-logging args `aab603096` gha: aks: Extract `run` commands to a script `e4eb664d2` runtime-rs: update rust to 1.69.0 `ed37715e0` runtime-rs: handle copy files when share_fs is not available `5f6fc3ed7` runtime-rs: bugfix: update Cargo.lock `1c6d22c80` gha: aks: Use short SHA in cluster name `3c1f6d36d` readme: Update Kata Containers logo `388684113` readme: Add status badge for the "Publish Artefacts" job `26f752038` kata-deploy: Change how we get the Ubuntu k8s key `aebd3b47d` gha: aks: Ensure host_os is used everywhere needed `0c8282c22` gha: aks: Add the host_os as part of the aks cluster's name `4b89a6bda` release: Standardize kata static file name `9228815ad` kernel: Modify build-kernel.sh to accomodate for changes in version.yaml `03027a739` gha: Fix Mariner cluster creation `43e73bdef` packaging: make BUILDER_REGISTRY configurable `ffe3157a4` dragonball: add arm64 patches for upcall `560442e6e` dragonball: add vcpu_boot_onlined vector `e31772cfe` dragonball: add support resize_vcpu on aarch64 `64c764c14` dragonball: update dbs-boot to v0.4.0 `fd9b41464` dragonball: update comment for 
init_microvm `af16d3fca` gha: Unbreak CI and fix cluster creation step `5ddc4f94c` runtime-rs/kata-ctl: Enhancement of DirectVolumeMount. `25d2fb0fd` agent: fix the issue of exec hang with a backgroud process `4af4ced1a` gha: Create Mariner host as part of k8s tests `eee7aae71` runtime-rs/sandbox_bindmounts: add support for sandbox bindmounts `557b84081` gha: aks: Wait longer to start running the tests `c04c872c4` gha: aks: Increase the timeout time `428041624` kata-deploy: Improve shim backup / restore `14c3f1e9f` kata-deploy: Fix indentation on kata deploy merge script `0e47cfc4c` runtime: sending SIGKILL to qemu `6a0035e41` doc: Update git commands `433b5add4` kubernetes: add agnhost command in pod yaml `c477ac551` dragonball: Convert VirtioNetDeviceMgr function to method `4659facb7` dragonball: Convert BlockDeviceMgr function to method `ee6deef09` dragonball: Remove virtio-net and vsock devices gracefully `2bda92fac` netlink: Fix the issue of update_interface Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-31 09:02:07 +02:00
Jiang Liu	b3901c46d6	runtime-rs: ignore errors during clean up sandbox resources Ignore errors during clean up sandbox resources as much as we can. Signed-off-by: Jiang Liu <gerry@linux.alibaba.com>	2023-07-31 13:07:43 +08:00
Chelsea Mafrica	8a2c201719	docs: Update links for pods and kubelet The links for pods and kubelets no longer work so update to new links with relevant info. Fixes #7487 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2023-07-29 00:38:35 +00:00
Gabriela Cervantes	5a1b5d3672	metrics: Add sysbench pod yaml This PR adds the sysbench pod yaml for the sysbench performance test. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-28 20:03:15 +00:00
Gabriela Cervantes	ad413d1646	metrics: Add sysbench dockerfile This PR adds sysbench dockerfile. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-28 19:58:10 +00:00
Gabriela Cervantes	1512560111	metrics: Add sysbench performance test This PR adds the sysbench performance test for kata CI. Fixes #7485 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-28 19:54:12 +00:00
Gabriela Cervantes	bee1a628bd	metrics: Fix json result for tensorflow This PR fixes the json result for tensorflow. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-28 17:02:16 +00:00
Jiang Liu	62e328ca5c	runtime-rs: refine implementation of TaskService Refine implementation of TaskService, making handler_message() a method. Fixes: #7479 Signed-off-by: Jiang Liu <gerry@linux.alibaba.com>	2023-07-29 00:47:33 +08:00
Jiang Liu	458e1bc712	runtime-rs: make send_message() a method of ServiceManager Simplify implementation by making send_message() a method of ServiceManager. Fixes: #7479 Signed-off-by: Jiang Liu <gerry@linux.alibaba.com>	2023-07-29 00:47:31 +08:00
Jiang Liu	1cc1c81c9a	runtime-rs: fix possible bug in ServiceManager::run() Multiple instances of task service may get registered by ServiceManager::run(); fix it by making the operation symmetric. Fixes: #7479 Signed-off-by: Jiang Liu <gerry@linux.alibaba.com>	2023-07-29 00:47:30 +08:00
Jiang Liu	1a5f90dc3f	runtime-rs: simplify implementation of service crate Simplify implementation of service crate. Fixes: #7479 Signed-off-by: Jiang Liu <gerry@linux.alibaba.com>	2023-07-29 00:47:28 +08:00
Gabriela Cervantes	51cd99c927	metrics: Round alexnet and resnet results This PR rounds the alexnet and resnet results in order to extract the result properly. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-28 16:15:22 +00:00
Gabriela Cervantes	3b883bf5a7	metrics: Fix atoi invalid syntax This PR avoids the strconv.atoi parsing error when retrieving the results from the json. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-28 16:15:22 +00:00
Gabriela Cervantes	f9dec11a8f	checkmetrics: Move checkmetrics to gha-run script This PR moves checkmetrics to the gha-run script to gather tensorflow information. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-28 16:15:22 +00:00
Gabriela Cervantes	53af71cfd0	checkmetrics: Add AlexNet value for qemu This PR adds AlexNet value for qemu for checkmetrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-28 16:15:22 +00:00
Gabriela Cervantes	a435d36fe1	checkmetrics: Add Resnet value for qemu This PR adds the Resnet value for qemu for checkmetrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-28 16:15:22 +00:00
Gabriela Cervantes	a79a3a8e1d	checkmetrics: Add alexnet value for clh This PR adds the AlexNet value for clh for checkmetrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-28 16:15:22 +00:00
Gabriela Cervantes	3c32875046	checkmetrics: Add Resnet value for clh This PR adds the checkmetrics Resnet value for clh. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-28 16:15:22 +00:00
Gabriela Cervantes	08dfaa97aa	metrics: General improvements to the tensorflow script This PR adds general improvements to the tensorflow script. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-28 16:15:22 +00:00
Gabriela Cervantes	63b8534b41	metrics: Enable Tensorflow metrics for kata CI This PR enables the Tensorflow benchmark metrics for kata CI. Fixes #7395 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-28 16:15:22 +00:00
Aurélien	e8f8641988	Merge pull request #7132 from sprt/aks-volume-tests tests: Add `k8s-volume` and `k8s-file-volume` tests to GHA CI	2023-07-28 08:58:03 -07:00
Fabiano Fidêncio	68b9acfd02	Merge pull request #7474 from GabyCT/topic/upboo metrics: Update boot time for kata metrics	2023-07-28 17:55:43 +02:00
David Esparza	f89abcbad8	Merge pull request #7473 from GabyCT/topic/addfioreport metrics: Add FIO report files for kata metrics	2023-07-28 09:37:21 -06:00
Fabiano Fidêncio	c9742d6fa9	Merge pull request #7411 from fidencio/topic/kata-deploy-create-runtime-classes kata-deploy: Allow runtimeclasses to be created by the daemonset	2023-07-28 16:05:49 +02:00
Yuan-Zhuo	731e7c763f	kata-ctl: add monitor subcommand for runtime-rs The previous kata-monitor in golang could not communicate with runtime-rs to gather metrics due to different sandbox addresses. This PR adds the subcommand monitor in kata-ctl to gather metrics from runtime-rs and monitor itself. Fixes: #5017 Signed-off-by: Yuan-Zhuo <yuanzhuo0118@outlook.com>	2023-07-28 17:30:08 +08:00
Yuan-Zhuo	d74639d8c6	kata-ctl: provide the global TIMEOUT for creating MgmtClient Several functions in kata-ctl need to establish a connection with runtime-rs through MgmtClient. This PR provides a global TIMEOUT to avoid multiple definitions. Fixes: #5017 Signed-off-by: Yuan-Zhuo <yuanzhuo0118@outlook.com>	2023-07-28 17:23:37 +08:00
Yuan-Zhuo	02cc4fe9db	runtime-rs: add support for gather metrics in runtime-rs 1. Implemented metrics collection for runtime-rs shim and dragonball hypervisor. 2. Described the currently supported metrics in runtime-rs (docs/design/kata-metrics-in-runtime-rs.md). Fixes: #5017 Signed-off-by: Yuan-Zhuo <yuanzhuo0118@outlook.com>	2023-07-28 17:16:51 +08:00
Fabiano Fidêncio	8353aae41a	ci: k8s: Rework get_nodes_and_pods_info() The amount of info we've added seemed unnecessary, and ends up making our lives even harder when trying to find errors. Let's just rely on the kata-debug container to collect the needed info for us. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-28 10:04:33 +02:00
Fabiano Fidêncio	6ad5d7112e	ci: k8s: Do not gather node info before running the tests It's been proven to not be useful, and ends up making things more confusing due to the amount of logs printed. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-28 10:04:33 +02:00
Fabiano Fidêncio	5261e3a60c	ci: k8s: Group messages to improve readability Right now it's getting way too easy to get lost in the logs. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-28 10:04:33 +02:00
Fabiano Fidêncio	9cc6b5f461	ci: k8s: Get logs from kata-deploy Let's make sure we can debug kata-deploy in case something goes wrong during its execution. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-28 10:04:33 +02:00
Fabiano Fidêncio	9d285c6226	ci: k8s: Let kata-deploy take care of the runtimeclasses By doing this we can test the change done for the daemonset. :-) Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-28 10:04:33 +02:00
Fabiano Fidêncio	87568ed985	gha: Test split out runtimeclasses are in sync with all-in-one file This is needed in order to not lose track of what's been created and what's been added here and there. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-28 10:04:33 +02:00
Fabiano Fidêncio	39192c6084	kata-deploy: Print variables passed to the script This will help folks to debug / understand what's been passed to the kata-deploy.sh script. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-28 10:04:33 +02:00
Fabiano Fidêncio	0e157be6f2	kata-deploy: Allow runtimeclasses to be created by the daemonset Let's allow the daemonset to create the runtimeclasses, which will decrease one manual step a user of kata-deploy should take, and also help us in the Confidential Containers land as the Operator can just delegate it to this script. Fixes: #7409 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-28 10:04:33 +02:00
Fabiano Fidêncio	a274333248	kata-deploy: Change default values of DEBUG This can be easily done as there was no official release with the previous values. The reason we're doing so is because when using `yq` to replace the value, even when forcing `--tag '!!str' "yes"`, the content is placed without quotes, causing errors in our CI. While here, we're also removing the fallback value for DEBUG, as it is always set in the kata-deploy.yaml file. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-28 09:50:39 +02:00
Fabiano Fidêncio	69535b8089	kata-deploy: runtimeclass: Split out entries This will make things simpler to only create the handlers defined by the kata-deploy user. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-28 09:43:45 +02:00
Fabiano Fidêncio	9e1710674a	kata-runtimeClasses: Alphabetically sort the entries This will come in handy in the near future, as we want to have separate entries for each file, while still keeping this one. Having the entries sorted will make it easier to test that those are always in sync. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-28 09:43:45 +02:00
Zhongtao Hu	61a8eabf8e	Merge pull request #7139 from openanolis/fix/devmanager runtime-rs: change block index to 0	2023-07-28 14:04:19 +08:00
Aurélien Bombo	6222bd9103	tests: Add k8s-file-volume test This imports the k8s-file-volume test from the tests repo and modifies it slightly to set up the host volume on the AKS host. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-07-27 14:07:55 -07:00
Aurélien Bombo	187a72d381	tests: Add k8s-volume test This imports the k8s-volume test from the tests repo and modifies it slightly to set up the host volume on the AKS host. Fixes: #6566 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-07-27 14:06:43 -07:00
Gabriela Cervantes	0c84270357	metrics: Add boot time value for qemu This PR adds the boot time value and limit for qemu. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-27 20:06:24 +00:00
Gabriela Cervantes	6520dfee37	metrics: Update boot time for kata metrics This PR updates the boot time limit for kata metrics. Fixes #7475 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-27 19:14:19 +00:00
Gabriela Cervantes	ff22790617	metrics: Update runtime and configuration paths This PR updates the runtime and configuration paths for kata containers. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-27 17:14:03 +00:00
Gabriela Cervantes	a5d4e33880	metrics: Add compare virtiofsd dax script This PR adds the compare virtiofsd dax script for kata metrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-27 16:53:50 +00:00
Gabriela Cervantes	5e937fa622	metrics: Update general FIO tests This PR updates general FIO tests by adding the recent date of a change. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-27 16:47:17 +00:00
Gabriela Cervantes	b0bea47c53	metrics: Add makefile to report generator This PR adds the makefile to report generator for the FIO test. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-27 16:42:11 +00:00
Gabriela Cervantes	73c57b9a19	metrics: Add FIO report files for kata metrics This PR adds FIO report files for kata metrics. Fixes #7472 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-27 16:39:35 +00:00
Chelsea Mafrica	e941b3a094	Merge pull request #7456 from alakesh/agent-fix-typo agent: fix typo in constant	2023-07-27 09:31:24 -07:00
David Esparza	ba8a8fcbf2	Merge pull request #7442 from GabyCT/topic/addgofilesfio metrics: Add FIO benchmark for metrics tests	2023-07-27 10:20:43 -06:00
Zhongtao Hu	c8fcd29d9b	runtime-rs: use device manager to handle virtio-pmem use device manager to handle virtio-pmem device Fixes: #7119 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2023-07-27 20:18:49 +08:00
Zhongtao Hu	901c192251	runtime-rs: support configuring vm_rootfs_driver Support configuring vm_rootfs_driver in the toml config. Fixes: #7119 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2023-07-27 20:12:53 +08:00
Zhongtao Hu	5d6199f9bc	runtime-rs: use device manager to handle vm rootfs Use the device manager to handle the vm rootfs; after attaching the block device of the vm rootfs, we need to increase the index number. Fixes: #7119 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com> Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com> Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-07-27 20:12:45 +08:00
James O. D. Hunt	20f1f62a2a	runtime-rs: change block index to 0 Change block index in SharedInfo to 0 for vda. Fixes #7119 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com> Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-07-27 20:11:44 +08:00
Chao Wu	ede1dae65d	Merge pull request #7465 from fidencio/topic/fix-dragonball-static-check-runner-selector gha: dragonball: Run only on the dragonball labeled machine	2023-07-27 10:19:26 +08:00
Gabriela Cervantes	662f87539e	metrics: Add general FIO makefile This PR adds a general FIO makefile for kata metrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-26 20:46:02 +00:00
Fabiano Fidêncio	f28af98ac6	Merge pull request #7453 from sprt/fix-ci-node-debugger tests: Fix `k8s-job` test	2023-07-26 22:27:21 +02:00
Fabiano Fidêncio	8a22b5f075	Merge pull request #7439 from ManaSugi/fix/remove-unused-mut agent,libs: Remove unused 'mut' keywords	2023-07-26 21:25:41 +02:00
Fabiano Fidêncio	9792ac49fe	Merge pull request #7425 from jongwu/remove_mut runtime-rs: remove unneeded 'mut' keywords	2023-07-26 21:24:40 +02:00
Fabiano Fidêncio	24564a8499	Merge pull request #7455 from sprt/local-tests tests: QoL improvements for running tests locally	2023-07-26 21:23:43 +02:00
Aurélien Bombo	c5a87eed29	tests: gha: Add timeout to cluster creation This has been intermittently taking a while lately so let's add a timeout. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-07-26 10:19:07 -07:00
Aurélien Bombo	6daeb08e69	tests: k8s: Clean up node debuggers after running This deletes node debugger pods after execution since their presence may affect tests that assume only test workloads pods are present. For example, in `k8s-job` we wait for any pod to be in the `Succeeded` state before proceeding, which causes failures. Fixes: #7452 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-07-26 10:19:07 -07:00
Fabiano Fidêncio	3aa6c77a01	gha: dragonball: Run only on the dragonball labeled machine Static checks for dragonball are landing on any of the self-hosted runners, and the reason for that is because "self-hosted" was the label selector used. Let's use "dragonball" instead, as the machine has that label as well. Fixes: #7464 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-26 18:15:04 +02:00
Gabriela Cervantes	37641a5430	metrics: Add example config for fio jobs This PR adds example config for fio jobs. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-26 16:03:12 +00:00
Alakesh Haloi	314aec73d4	agent: fix typo in constant It fixes a constant name to have the right spelling Fixes: #7457 Signed-off-by: Alakesh Haloi <a_haloi@apple.com>	2023-07-26 00:06:34 -05:00
Aurélien Bombo	4703434b12	tests: k8s: Allow using custom resource group This simply allows setting a custom resource group when debugging locally, so as to prevent name collisions and not pollute the namespace. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-07-25 15:45:44 -07:00
Aurélien Bombo	350f3f70b7	tests: Import `common.bash` in `run_kubernetes_tests.sh` Not sure why this works in GHA, but the `info` call on line 65 would fail locally. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-07-25 15:45:44 -07:00
Aurélien Bombo	d7f04a64a0	tests: k8s: Leave `runtimeclass_workloads/` alone Makes it so that `setup.sh` doesn't make changes in `runtimeclass_workloads/` directly. Instead we treat that as a template directory and we use the new directory `runtimeclass_workloads_work/` as a work dir. This has two advantages: * Allows rerunning tests without the assumption that `setup.sh` must be idempotent. E.g. the `set_runtime_class()` step would break. * Doesn't pollute your git environment with a bunch of changes when developing. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-07-25 15:45:44 -07:00
Aurélien Bombo	bdde6aa948	tests: k8s: Split deployment and testing commands This splits deploying Kata and running the tests into separate commands to make it possible to rerun tests locally without having to redeploy Kata each time. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-07-25 15:44:46 -07:00
Aurélien Bombo	91a0b3b406	tests: aks: Simply delete cluster when cleaning up If we're going to delete the cluster anyway, no need to call kata-cleanup. Fixes: #7454 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-07-25 15:44:46 -07:00
Gabriela Cervantes	3c1044d9d5	metrics: Update FIO paths for k8s runner This PR updates the FIO paths for k8s runner. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-25 20:50:03 +00:00
Eric Ernst	5385ddc560	Merge pull request #7365 from alakesh/symlink-fix agent: exclude symlinks from recursive ownership change	2023-07-25 11:27:48 -07:00
Gabriela Cervantes	6177a0db3e	metrics: Add env files for FIO This PR adds the env files for FIO for kata metrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-25 17:48:45 +00:00
Gabriela Cervantes	a45900324d	metrics: Add fio exec This PR adds fio exec for the FIO benchmark. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-25 17:36:08 +00:00
Gabriela Cervantes	ea198fddcc	metrics: Add FIO runner k8s Add program to execute FIO workloads using k8s. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-25 17:34:29 +00:00
Gabriela Cervantes	8f7ef41c14	metrics: Add FIO vendor code This PR adds the FIO vendor code. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-25 17:24:29 +00:00
Gabriela Cervantes	6293c17bde	metrics: Add FIO benchmark for metrics tests This PR adds the FIO benchmark scripts and resources for the metrics tests section. Fixes #7441 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-25 16:36:33 +00:00
Fabiano Fidêncio	cdf04e5018	Merge pull request #7437 from jepio/fix-sev-kernel-cache cache: kernel: Fix kernel caching	2023-07-25 18:10:03 +02:00
GabyCT	7a3b55ce67	Merge pull request #7432 from ManaSugi/runk/doc-docker runk: Add Docker guide to README	2023-07-25 09:56:02 -06:00
GabyCT	c1bd527163	Merge pull request #7430 from GabyCT/topic/fixjson metrics: General improvements to json.bash script	2023-07-25 09:45:53 -06:00
Fabiano Fidêncio	6efd684a46	Merge pull request #7408 from fidencio/topic/kata-deploy-add-SHIMS-and-SHIM_DEFAULT-as-env kata-deploy: Allow shim creation based on what's passed to the daemonset	2023-07-25 16:56:46 +02:00
Fabiano Fidêncio	5b82268d2c	Merge pull request #7436 from jepio/vfio-gha gha: ci: Add skeleton of vfio job	2023-07-25 14:44:04 +02:00
Manabu Sugimoto	ff4cfcd8a2	runk: Add Docker guide to README `runk` can launch containers using Docker, so add the guide to its README. ```sh $ sudo dockerd --experimental --add-runtime="runk=/usr/local/bin/runk" $ sudo docker run -it --rm --runtime runk busybox echo hello runk hello runk ``` Fixes: #7431 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2023-07-25 20:10:49 +09:00
Jeremi Piotrowski	c8ac56569a	cache: kernel: Harmonize commit with fetching side kata-deploy-binaries.sh uses the last commit in tools/packaging/static-build/kernel for its version check, while the cache generation uses tools/packaging/kernel. Use tools/packaging/static-build/kernel as $kata_config_version is already part of the version string and covers any changes to tools/packaging/kernel. Fixes: #7403 Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-07-25 12:23:05 +02:00
Jeremi Piotrowski	81775ab1b3	cache: kernel: Fix SEV kernel caching The SEV kernel cache calls create_cache_asset() twice, once for the kernel and once for modules. Both calls need to use the same version string, otherwise the second call overwrites the "latest" file of the first one and the cache is not used. Fixes: #7403 Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-07-25 11:58:19 +02:00
Jeremi Piotrowski	717f775f30	gha: ci: Add skeleton of vfio job This job will run on a nested virt capable Azure VM (improving test concurrency). This is just a placeholder while we adapt the test to GHA. Fixes: #6555 Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-07-25 11:13:04 +02:00
Manabu Sugimoto	b9f100b391	agent,libs: Remove unused 'mut' keywords Remove unused `mut` because the agent compilation fails when the rust compiler is >= 1.71. This is related to #7425 Fixes: #7438 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2023-07-25 17:41:08 +09:00
Fabiano Fidêncio	a56f96bb2b	kata-deploy: Allow shim creation based on what's passed to the daemonset Instead of hardcoding shims as part of the script, let's ensure we can allow them to be created based on environment variables passed to the daemonset. This change brings no functionality change as the default values in the daemonset are exactly what has been used as part of the scripts. Fixes: #7407 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-25 08:30:00 +02:00
Fabiano Fidêncio	5ce0b4743f	Merge pull request #7382 from zvonkok/vfio-ap-debug s390x: Fixing device.Bus assignment	2023-07-25 08:26:25 +02:00
David Esparza	b11d618a3f	Merge pull request #7413 from fidencio/topic/release-publish-builder-images release: Mention the container images used to build the project	2023-07-24 15:46:31 -06:00
Fabiano Fidêncio	56fdeb1247	Merge pull request #7417 from fidencio/topic/kata-deploy-binaries-cached-kernel-fix kata-deploy-binaries: kernel_cache: Take module_dir into account	2023-07-24 22:26:09 +02:00
Gabriela Cervantes	4a5ab38f16	metrics: General improvements to json.bash script This PR adds general improvements, such as putting `function` before each function name and declaring variables consistently, to have uniformity across the metrics scripts. Fixes #7429 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-24 16:51:38 +00:00
Fabiano Fidêncio	d4eba36980	kata-deploy-binaries: kernel_cache: Take module_dir into account `module_dir` has been passed to the function but was never assigned to a var, leading to errors when trying to use it. Fixes: #7416 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-24 18:19:13 +02:00
Fabiano Fidêncio	b7c9867d60	release: Mention the container images used to build the project This is a small step towards build reproducibility. Fixes: #7412 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-24 18:01:57 +02:00
Wainer Moschetta	2e9853c761	Merge pull request #7427 from fidencio/topic/gha-port-nydus-tests-follow-up-1 ci: nydus: Fix typo in "source"	2023-07-24 11:20:05 -03:00
Fabiano Fidêncio	7c4b597816	ci: nydus: Fix typo in "source" We should source from `nydus_dir`, instead of `cri_containerd_dir`, and that was a leftover from `fb4f7a002c`. Fixes: #6543 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-24 14:55:09 +02:00
Fabiano Fidêncio	589672d510	Merge pull request #7426 from fidencio/topic/gha-port-nydus-tests gha: ci: Add no-op nydus tests to our CI	2023-07-24 13:56:57 +02:00
Fabiano Fidêncio	6a680e241b	gha: ci: Add placeholder for the nydus tests as part of the CI This will trigger the nydus tests, but as they currently are they'll just return "okay" without actually executing. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-24 13:37:36 +02:00
Fabiano Fidêncio	fb4f7a002c	gha: nydus: Add a no-op GHA for nydus This newly added GHA does nothing, is not even triggered, and it's just a placeholder that we'll grow in the next commits / PRs, so we can actually start running the nydus tests as part of our CI. Fixes: #6543 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-24 13:37:33 +02:00
Fupan Li	0ae987973b	Merge pull request #7367 from openanolis/chao/migrate_dragonball_sandbox Dragonball: migrate dragonball-sandbox crates to Kata	2023-07-24 17:52:11 +08:00
Fabiano Fidêncio	4a207a16f9	gha: nydus: Bring tests as they are from the tests repo Let's bring the nydus tests, without any kind of modification, from the tests repo. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-24 10:56:41 +02:00
Jianyong Wu	2c8f83424d	runtime-rs: remove unneeded 'mut' keywords These unneeded 'mut' keywords block builds with rust 1.71.0. Remove them. Fixes: #7424 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2023-07-24 08:47:15 +00:00
Zvonko Kaiser	1fc715bc65	s390x: Add AP Attach/Detach test Now that we have proper AP device support, add a unit test for testing the correct Attach/Detach of AP devices. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-07-23 13:44:19 +00:00
Fabiano Fidêncio	e1a4040a6c	Merge pull request #7326 from fidencio/topic/gha-ci-add-cri-containerd-tests ci: gha: Add cri-containerd tests (but still do not enable them)	2023-07-21 19:29:38 +02:00
Fabiano Fidêncio	6a59e227b6	Merge pull request #7399 from fidencio/topic/add-kata-debug packaging/tools: Add kata-debug and use it as part of our CI	2023-07-21 17:05:27 +02:00
Fabiano Fidêncio	e91f5edba0	ci: cri-containerd: Fix default typo for testContainerStart() It must be {1:-0}, instead of {1-0}. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	8b8aef09af	ci: cri-containerd: Temporarily disable TestContainerSwap The test is currently failing with GHA, and I don't think it makes sense to block all the other tests from getting merged while that's happening. For now, let's disable it and re-enable it as soon as we have it passing. Reference: https://github.com/kata-containers/kata-containers/issues/7410 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	56767001cb	ci: cri-containerd: Add namespace / uid to the pods Otherwise crictl will fail to remove them with: ``` getting sandbox status of pod "$pod": metadata.Name, metadata.Namespace or metadata.Uid is not in metadata "..." ``` A huge shout out to Steven Horsman for helping to debug this one. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	a84773652c	ci: cri-containerd: Always use sudo to call crictl Otherwise we may get the following error: ``` time="2023-07-15T21:12:13Z" level=fatal msg="validate service connection: validate CRI v1 runtime API for endpoint \"unix:///run/containerd/containerd.sock\": rpc error: code = Unavailable desc = connection error: desc = \"transport: Error while dialing dial unix /run/containerd/containerd.sock: connect: permission denied\"" ``` Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	99ba86a1b2	ci: cri-containerd: Add /usr/local/go/bin to the PATH Otherwise go is not picked up. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	7f3b309997	ci: cri-containerd: Add `function` before each function We've been doing this for all files moved to this repo. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	fde22d6bce	ci: cri-containerd: Assume podman is always used For this set of tests, we'll always be using podman in order to avoid having containerd pulled in by docker. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	9465a04963	ci: cri-containerd: Adapt "source ..." to this repo Let's adapt what we "source" to the kata-containers repo. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	df8d144119	ci: cri-containerd: Remove CI variable We always want to run the tests using as much debug as possible. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	f90570aef0	ci: cri-containerd: Remove unused runc_runtime_bin The variable is not used anywhere in our tests. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	c3637039f4	ci: cri-containerd: Remove KILL_VMM_TEST env var We don't need the env var, we just need to restrict the test according to the KATA_HYPERVISOR used, as right now it's very specific to QEMU. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	bc4919f9b2	ci: cri-containerd: Always run shim-v2 tests We only have shim-v2 as the runtime type, so we always need to run tests using it. :-) We had to adjust the script in order to properly run the tests with the current logic. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	f9e332c6db	ci: cri-containerd: Stop cloning containerd It's already done as part of the install_dependencies() Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	cfd662fee9	ci: cri-containerd: Remove unused SNAP_CI var We don't support SNAP anymore, thus we can remove the var. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	d36c3395c0	ci: cri-containerd: Update copyright As we're touching the file already, let's update its Copyright info. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	b5be8a4a8f	ci: cri-containerd: Move integration-tests.sh as it was Let's move the `integration/containerd/cri/integration-tests.sh` file from the tests repo to this one. The file has been moved as it is, it's not used, and in the following commits we'll clean it up before actually using it. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	f2e00c95c0	ci: cri-containerd: Populate install_dependencies() Let's install all the dependencies needed for running the `cri-containerd` tests. The list of dependencies we have are: * From the system - build-essential - jq - podman-docker * From our own repo - yq - go * From GitHub projects - containerd - cri-tools Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	8979552527	versions: Add "latest" field for cri-tools As we don't want to disrupt what we have on the `tests` repo, let's create a "latest" entry and use that for the GitHub actions tests. Once we deprecate the `tests` repo we can decide whether we want to stick to using "latest" or switch back to "version". Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	1bbcbafa67	ci: Add clone_cri_container() This function will simply clone containerd repo, specifically on a tag we want to use to test. This can be expanded for different projects, and it will be the case as soon as we grow the tests. But, for now, let's keep it simple. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	f66c68a2bf	ci: Add install_cri_tools() This function will install cri-tools in the host, and soon enough (as part of this PR) we'll be using it to install cri-tools as part of the cri-containerd tests. I've decided to have this as part of the `common.bash` as other tests that will be added in the future will require cri-tools to be installed as well. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	4dd828414f	ci: Add install_cri_containerd() This function will install cri-containerd in the host, and soon enough (as part of this PR) we'll be using it to install cri-containerd as part of the cri-containerd tests. I've decided to have this as part of the `common.bash` as other tests that will be added in the future will require cri-containerd to be installed as well. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	ad47d1b9f8	ci: Add download_github_project_tarball() This function will help us to get the tarball, from a GitHub project, that we're going to use as part of our tests. Right now this is not used anywhere, but it'll soon enough (as part of this series) be used to download the cri-containerd / cri-tools / cni tarballs. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	788c562a95	ci: Add get_latest_patch_release_from_a_github_project() This function will help us to get the latest patch release from a GitHub project. The idea behind this function is that we don't have to keep updating versions.yaml that frequently (or worse, have it outdated as it currently is), and always test against the latest patch release of a given project's version that we care about. Although right now this is not used anywhere, this will be used with the coming cri-containerd tests, which will be part of this series. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	6742f3a898	ci: Use `function` before each install_go.sh function We've been doing this for all files moved to this repo. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	5eacecffc3	ci: Adjust paths for install_go.sh Let's adjust paths for what we source and the scripts we call, after moving from the tests repo to this one. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	8ed1595f96	ci: Update copyright for install_go.sh As we're touching the file already, let's update its Copyright info. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	6123d0db2c	ci: Move install_go.sh as it was Let's move `.ci/install_go.sh` file from the tests repo to this one. The file has been moved as it is, it's not used, and in the following commits we'll clean it up before actually using it. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	8653be71b2	ci: Do not take cross-build into consideration for kata-arch.sh Right now we'd need to import lib.sh just in order to get cross-build information for rust, and it seems a little bit premature to do so at this stage and only for rust. Let's skip it and keep this transition simple. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	6a76bf92cb	ci: Fix style / indentation of kata-arch.sh We've been using: ``` function foo() { } ``` instead of ``` function foo() { } ``` Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	72743851c1	ci: Add `function` before each kata-arch.sh function We've been doing this for all files moved to this repo. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	9f6d4892c8	ci: Update copyright for kata-arch.sh As we're touching the file already, let's update its Copyright info. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	6f73a72839	ci: Move kata-arch.sh as it was Let's move `.ci/kata-arch.sh` file from the tests repo to this one. The file has been moved as it is, it's not used, and in the following commits we'll clean it up before actually using it. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	3615d73433	ci: Add get_from_kata_deps() First of all, I'm 100% aware that I'm duplicating this function here as I've copied it from the packaging stuff, and I'm not exactly proud of that. However, right now it seems a little bit premature to combine that set of scripts with this set of scripts in a single one and make them used by both pieces of our project. Anyways, this function helps to get information from the `versions.yaml` file, and it'll be used as part of the cri-containerd tests and a few others in the future. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	34779491e0	gha: kubernetes: Avoid declaring repo_root_dir This is already declared as part of the `common.bash` file, so let's just make sure we use it from there. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	f3738beaca	tests: Use $HOME/go as fallback for $GOPATH Considering that someone may want to run the tests locally, we shouldn't rely on having GITHUB_WORKSPACE exported, and should fall back to $HOME/go if needed. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	b87ed27416	tests: Move `ensure_yq` to common.bash As this function will be used by different scripts, let's move it to a common place. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Jeremi Piotrowski	124e390333	tests: common: Fix quoting when globbing When the glob star is inside quotes, there is only one iteration of the loop and b holds all matches at once. Move the glob out of the quotes so that we actually iterate over matched paths. Fixes: #6543 Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	db77c9a438	tests: Make install_kata take care of the links It makes the kata-containers installation more complete. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	13715db1f8	tests: Do not call `install_check_metrics` when installing kata The `install_kata` function was moved from the metrics' `gha-run.sh` file to the `common.bash` in the commit `3ffd48bc16`, but I didn't notice that it brought with it a call to `install_check_metrics`, which is totally unrelated to installing Kata Containers. Let's remove the call so the function is a little bit less specific, and move the call to install_check_metrics to the metrics `gha-run.sh` file. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 16:54:27 +02:00
Fabiano Fidêncio	e149a3c783	Merge pull request #7404 from fidencio/topic/cache-consider-changes-in-the-scripts-used-to-build-the-kernel cache: kernel: Consider changes in tools/packaging/kernel	2023-07-21 15:05:01 +02:00
Fabiano Fidêncio	630634c5df	ci: k8s: Group logs to make them easier to read Otherwise it becomes really hard to find the info you're looking for. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 14:05:30 +02:00
Fabiano Fidêncio	228b30f31c	ci: k8s: Gather node info during the cleanup This will make our lives easier to debug issues with the CI. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 14:05:30 +02:00
Fabiano Fidêncio	81f99543ec	ci: k8s: Cleanup cluster before deleting it This will help us on two fronts: * catching possible issues related to kata-deploy cleanup * doing more (like, in the future, collecting logs) after the tests run Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 14:05:30 +02:00
Fabiano Fidêncio	38a7b5325f	packaging/tools: Add kata-debug kata-debug is a tool that is used as part of the Kata Containers CI to gather information from the node, in order to help debugging issues with Kata Containers. As one can imagine, this can be expanded and used outside of the CI context, and any contribution back to the script is very much welcome. The resulting container is stored at the [Kata Containers quay.io space](https://quay.io/repository/kata-containers/kata-debug) and can be used as shown below: ```sh kubectl debug $NODE_NAME -it --image=quay.io/kata-containers/kata-debug:latest ``` Fixes: #7397 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 14:05:30 +02:00
Fabiano Fidêncio	a0fd41fd37	Merge pull request #7406 from fidencio/topic/merge-tarball-fix-version-yaml-not-found kata-deploy: Properly get the path of the versions.yaml file	2023-07-21 14:04:18 +02:00
Fabiano Fidêncio	ae6e8d2b38	kata-deploy: Properly get the path of the versions.yaml file We need to correctly get the full path of the versions.yaml file as part of the merge-builds.sh script, as we do a `pushd` there and that leads to a failure when merging the artefacts, as the `versions.yaml` file does not exist in that path. Fixes: #7405 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 12:02:11 +02:00
Fabiano Fidêncio	309e232553	cache: kernel: Consider changes in tools/packaging/kernel Any change in the script used to build the kernel should invalidate the cache. Fixes: #7403 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-21 11:48:29 +02:00
GabyCT	f95a7896b1	Merge pull request #7394 from fidencio/topic/ship-VERSIOB-and-versions.yaml-as-part-of-release-tarball kata-deploy: Add VERSION and versions.yaml to the final tarball	2023-07-20 14:38:21 -06:00
GabyCT	14025baafe	Merge pull request #7376 from GabyCT/topic/addcray metrics: Add C-Ray performance test	2023-07-20 14:37:53 -06:00
GabyCT	b629f6a822	Merge pull request #7363 from GabyCT/topic/enabletensorflow metrics: enable TensorFlow benchmark to be run on gha	2023-07-20 13:36:55 -06:00
Fabiano Fidêncio	59fdd69b85	kata-deploy: Add VERSION and versions.yaml to the final tarball Let's make things simpler to figure out which version of Kata Containers has been deployed, and also which artefacts come with it. This will help us immensely in the future, for the TEEs use case, so we can easily know whether we can deploy a specific guest kernel for a specific host kernel. Fixes: #7394 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-20 18:33:14 +02:00
Fabiano Fidêncio	5dddd7c5d1	release: Upload versions.yaml as part of the release Although this file is far away from being a SBOM, it'll help folks to easily visualise which components are part of a release, and even have SBOMs generated from that. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-20 18:31:21 +02:00
Gabriela Cervantes	bad3ac84b0	metrics: Rename C-Ray to cpu performance tests This PR renames C-Ray tests to cpu category. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-20 15:56:02 +00:00
Fabiano Fidêncio	87d99a71ec	versions: Remove "kernel-experimental" We've not been using nor shipping this kernel for a very long time. Regardless, we're leaving behind the logic in the kernel scripts to build it, in case it becomes necessary in the future. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-20 17:14:22 +02:00
Zvonko Kaiser	545de5042a	vfio: Fix tests Now with more elaborate checking of cold|hot plug ports we needed to update some of the tests. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-07-20 13:42:44 +00:00
Zvonko Kaiser	62aa6750ec	vfio: Added better handling of VFIO Control Devices Depending on the vfio_mode we need to mount the VFIO control device additionally into the container. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-07-20 13:42:42 +00:00
Fabiano Fidêncio	fe07ac662d	Merge pull request #7387 from GabyCT/topic/fixmemoryinsidec metrics: Add function to memory inside container script	2023-07-20 10:06:15 +02:00
Zvonko Kaiser	dd422ccb69	vfio: Remove obsolete HotplugVFIOonRootBus Removing HotplugVFIOonRootBus, which is obsolete with the latest PCI topology changes; users can set cold_plug_vfio or hot_plug_vfio either in the configuration.toml or via annotations. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-07-20 07:25:40 +00:00
Zvonko Kaiser	114542e2ba	s390x: Fixing device.Bus assignment The device.Bus was reset if a specific combination of configuration parameters was not met. With the new PCIe topology this should not happen anymore. Fixes: #7381 Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-07-20 07:24:26 +00:00
Alakesh Haloi	371a118ad0	agent: exclude symlinks from recursive ownership change Currently, when fsGroup is used with direct-assign, the kata agent recursively changes ownership and permission for each file, including symlinks. However, the problem with symlinks is that the permission of the symlink itself may not be the same as that of the underlying file. So while doing recursive ownership and permission changes we should skip symlinks. Fixes: #7364 Signed-off-by: Alakesh Haloi <a_haloi@apple.com>	2023-07-19 20:42:55 -07:00
Gabriela Cervantes	e64edf41e5	metrics: Add tensorflow function in gha-run script This PR adds the tensorflow function in gha-run script in order to be triggered in the gha. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-19 21:31:51 +00:00
Gabriela Cervantes	67a6fff4f7	metrics: Enable tensorflow benchmark on gha This PR enables the TensorFlow benchmark on gha for the kata metrics CI. Fixes #7362 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-19 21:31:51 +00:00
GabyCT	c3f21c36f3	Merge pull request #7388 from dborquez/revert-commit-broke-checkmetrics-baseline-values Revert "metrics: Replace backslashes used to escape double quoted key in jq expr"	2023-07-19 14:36:16 -06:00
David Esparza	01450deb6a	Revert "metrics: Replace backslashes used to escape double quoted key in jq expr." This reverts commit `468f017e21`. Fixes: #7385 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-07-19 10:07:11 -06:00
Gabriela Cervantes	8430068058	metrics: Add function to memory inside container script This PR adds the `function` keyword before each function name in the memory inside container script in order to have uniformity across the script. Fixes #7386 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-19 16:00:53 +00:00
Chao Wu	bbd3c1b6ab	Dragonball: migrate dragonball-sandbox crates to Kata In order to make it easier for developers to contribute to Dragonball, we decide to migrate all dragonball-sandbox crates to Kata. fixes: #7262 Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2023-07-19 19:41:57 +08:00
Chao Wu	7153b51578	Merge pull request #7372 from fidencio/topic/bump-virtiofsd-to-v1.7.0 versions: Bump virtiofsd to v1.7.0	2023-07-19 10:51:49 +08:00
GabyCT	8c662916ab	Merge pull request #7377 from dborquez/add_verbosity_to_blogbench metrics: stop hypervisor and shim at init_env stage	2023-07-18 15:57:54 -06:00
Fabiano Fidêncio	5f7da301fd	Merge pull request #7378 from fidencio/topic/ci-k8s-fix-source-path ci: k8s: Adapt "source ..." to the new location of gha-run.sh	2023-07-18 22:30:55 +02:00
Fabiano Fidêncio	fad801d0fb	ci: k8s: Adapt "source ..." to the new location of gha-run.sh This is a follow up of `2ee2cd307b`, which changed the location of gha-run.sh Fixes: #7373 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-18 21:26:41 +02:00
David Esparza	55e2f0955b	metrics: stop hypervisor and shim at init_env stage This PR kills the hypervisor and the kata shim in the init_env stage prior to launching any metrics test. Additionally, this PR adds info messages in the main blocks of the blogbench test to help in debugging. Fixes: #7366 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-07-18 12:05:29 -06:00
Gabriela Cervantes	556e663fce	metrics: Add disk link to general metrics README This PR adds the disk link information to the general metrics README. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-18 16:42:35 +00:00
Gabriela Cervantes	98c1217093	metrics: Add C-Ray README This PR adds the C-Ray documentation at the README file. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-18 16:35:54 +00:00
Gabriela Cervantes	8e7d9926e4	metrics: Add C-Ray Dockerfile This PR adds the C-Ray Dockerfile for kata metrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-18 16:33:55 +00:00
Gabriela Cervantes	e2ee769783	metrics: Add C-Ray performance test This PR adds C-Ray performance test in order to be part of the kata metrics CI. Fixes #7375 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-18 16:32:23 +00:00
Fabiano Fidêncio	2011e3d72a	Merge pull request #7374 from fidencio/topic/ci-tdx-adjust-kubeconfig-path ci: Move `tests/integration/gha-run.sh` to `tests/integration/kubernetes/` ... and also remove KUBECONFIG from the tdx envs	2023-07-18 17:32:57 +02:00
Fabiano Fidêncio	8e09e04f48	Merge pull request #6788 from jepio/kernel-update-6.1-lts versions: Update kernel to version v6.1.x	2023-07-18 17:29:21 +02:00
Chao Wu	935432c36d	Merge pull request #7352 from justxuewei/exec-hang agent: Fix exec hang issues with a background process	2023-07-18 23:02:18 +08:00
Fabiano Fidêncio	2ee2cd307b	ci: k8s: Move gha-run.sh to the kubernetes dir The file belongs there, as it's only used for k8s related tests. Fixes: #7373 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-18 15:45:06 +02:00
Fabiano Fidêncio	88eaff5330	ci: tdx: Adjust KUBECONFIG We don't need to export KUBECONFIG there. Let's just make sure we have the server correctly setup and avoid doing that. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-18 15:39:52 +02:00
Jeremi Piotrowski	c09e268a1b	versions: Downgrade SEV(-SNP) kernel back to v5.19.x CC-GPU seems to have issues with v6.1, so downgrade the kernels used for SEV-SNP to a known-working version. It is worth mentioning that TDX is also still on 5.19. Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-07-18 15:29:46 +02:00
Fabiano Fidêncio	25d80fcec2	Merge pull request #6993 from zvonkok/kata-agent-init-mount agent: Ignore already mounted dev/fs/pseudo-fs	2023-07-18 14:11:44 +02:00
Fabiano Fidêncio	4687f2bf9d	Merge pull request #7369 from fidencio/topic/gha-ci-bring-tdx-back ci: k8s: Bring TDX tests back	2023-07-18 13:28:33 +02:00
Fabiano Fidêncio	6a7a323656	versions: Bump virtiofsd to v1.7.0 https://gitlab.com/virtio-fs/virtiofsd/-/releases/v1.7.0 was released today. Fixes: #7371 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-18 12:33:13 +02:00
Fabiano Fidêncio	ac5f5353ba	ci: k8s: Bring TDX tests back Now that we have a new TDX machine plugged into our CI, let's re-enable the TDX tests. Fixes: #7368 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-18 10:33:43 +02:00
Jeremi Piotrowski	950b89ffac	versions: Update kernel to version v6.1.38 Kernel v6.1.38 is the current latest LTS version, switch to it. No patches should be necessary. Some CONFIG options have been removed: - CONFIG_MEMCG_SWAP is covered by CONFIG_SWAP and CONFIG_MEMCG - CONFIG_ARCH_RANDOM is unconditionally compiled in - CONFIG_ARM64_CRYPTO is covered by CONFIG_CRYPTO and ARCH=arm64 Fixes: #6086 Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-07-18 10:04:21 +02:00
GabyCT	7729d82e6e	Merge pull request #7360 from GabyCT/topic/updategraldoc metrics: Update machine learning documentation	2023-07-17 15:30:13 -06:00
Fabiano Fidêncio	26d525fcf3	Merge pull request #7361 from fidencio/topic/gha-ci-add-cri-containerd-tests-skeleton-follow-up-2 gha: ci: cri-containerd: Fix KATA_HYPERVSIOR typo	2023-07-17 22:38:50 +02:00
GabyCT	b4852c8544	Merge pull request #7335 from kata-containers/topic/addmobilenet tests: Add MobileNet Tensorflow performance benchmark	2023-07-17 14:36:59 -06:00
Gabriela Cervantes	8ccc1e5c93	metrics: Update machine learning documentation This PR updates the machine learning documentation related with Tensorflow and Pytorch benchmarks. Fixes #7359 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-17 20:32:49 +00:00
Fabiano Fidêncio	f50d2b0664	gha: ci: cri-containerd: Fix KATA_HYPERVSIOR typo KATA_HYPERVSIOR should be KATA_HYPERVISOR Fixes: #6543 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-17 21:56:51 +02:00
David Esparza	687596ae41	Merge pull request #7320 from dborquez/fix_jq_checkmetrics_checkvar_expression metrics: replace backslashes used to escape double quoted jq key expr.	2023-07-17 13:50:18 -06:00
Gabriela Cervantes	620b945975	metrics: Add Tensorflow Mobilenet documentation This PR adds the Tensorflow Mobilenet documentation to the machine learning README. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-17 17:39:05 +00:00
Zhongtao Hu	d50f3888af	Merge pull request #7219 from Apokleos/network-refactor runtime-rs: enhancement of Device Manager for network endpoints.	2023-07-17 14:13:51 +08:00
QuanweiZhou	ce14f26d82	Merge pull request #5450 from openanolis/trace_rs feat(Tracing): tracing in Rust runtime	2023-07-17 09:27:13 +08:00
Manabu Sugimoto	f1d8de9be6	runk: Allow runk to launch a container without pid namespace Allow runk to launch a container even when users don't specify the pid namespace in `config.json`, because general container runtimes such as runc can also launch a container without the namespace. On the other hand, Kata Containers doesn't allow it due to security concerns, so this feature should be enabled only in runk. Fixes: #7168 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2023-07-16 23:31:14 +05:30
Zhongtao Hu	419f8a5db7	Merge pull request #7021 from cheriL/7020/ignore-unconfigured-netinterface runtime-rs: ignore unconfigured network interfaces	2023-07-16 10:11:15 +08:00
Xuewei Niu	6c91af0a26	agent: Fix exec hang issues with a background process Issue #4747 and pull request #4748 fix exec hang issues where the exec command hangs when a process's stdout is not closed. However, the PR might cause the exec command not to work as expected, leading to CI failure. The PR was reverted in #7042. This PR resolves the exec hang issues and has undergone 1000 rounds of testing to verify that it would not cause any CI failures. Fixes: #4747 Signed-off-by: Fupan Li <fupan.lfp@antgroup.com> Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-07-16 08:32:45 +08:00
David Esparza	5a9829996c	Merge pull request #7349 from dborquez/fix_extract_kata_env_for_metrics metrics: Stop running kata-env before kata is properly installed.	2023-07-14 15:20:52 -06:00
David Esparza	59f4731bb2	metrics: Stop running kata-env before kata is properly installed. This PR ensures kata-env is called only after some metrics have completed their workload. This fixes a bug that occurred when kata-env was called before kata was installed on the testing platform. Fixes: #7348 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-07-14 13:40:48 -06:00
David Esparza	468f017e21	metrics: Replace backslashes used to escape double quoted key in jq expr. This PR uses square brackets in a jq expression to access key values corresponding to metric results in JSON format. The values are the data inputs into the checkmetrics tool. Fixes: #7319 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-07-14 18:41:41 +00:00
GabyCT	b9535fb187	Merge pull request #7337 from dborquez/fix_remove_old_metrics_config metrics: use rm -f to remove the oldest containerd config file.	2023-07-14 09:19:41 -06:00
Fabiano Fidêncio	7a854507cc	Merge pull request #7333 from zvonkok/main kernel: Update kernel config name	2023-07-14 13:49:27 +02:00
Fabiano Fidêncio	cfc90fad84	Merge pull request #7344 from fidencio/topic/kata-deploy-add-a-debug-option kata-deploy: Add a debug option to kata-deploy (and also use it as part of our CI)	2023-07-14 13:16:55 +02:00
Fabiano Fidêncio	64f013f3bf	ci: k8s: Enable debug when running the tests This will help us to gather more information about Kata Containers in case of failure. Fixes: #7343 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-14 12:18:11 +02:00
Fabiano Fidêncio	8f4b1df9cf	kata-deploy: Give users the ability to run it in DEBUG mode The DEBUG env var introduced to the kata-deploy / kata-cleanup yaml file will be responsible for: * Setting up the CRI Engine to run with the log level set to debug * The default is usually info * Setting up Kata Containers to enable: * debug logs * debug console * agent logs This will greatly help folks trying to debug Kata Containers while using kata-deploy, and also help us to always run with DEBUG=yes as part of our CI. Fixes: #7342 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-14 12:18:08 +02:00
Chao Wu	9b3dc572ae	Merge pull request #7018 from nubificus/feat_bindmount_propagation runtime-rs: add parameter for propagation of (u)mount events	2023-07-14 15:21:41 +08:00
Zvonko Kaiser	2c8dfde168	kernel: Update kernel config name Fixes: #7294 When installing the kernel config adjust the name like the vmlinuz and vmlinux files so that any added suffixes are also reflected in the kernel config name. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-07-14 06:50:35 +00:00
Archana Shinde	b9b8ccca0c	Merge pull request #7236 from amshinde/move-guestprotection kata-ctl: Move GuestProtection code to kata-sys-util	2023-07-13 23:50:17 -07:00
soup	150e54d02b	runtime-rs: ignore unconfigured network interfaces Fixes: #7020 Signed-off-by: soup <lqh348659137@outlook.com>	2023-07-14 14:16:03 +08:00
David Esparza	3ae02f9202	metrics: use rm -f to remove older containerd config file. In order to run kata metrics we need to check that the containerd config file is properly set. When this is not the case, we need to remove that file, and generate a valid one. This PR runs rm -f in order to ignore errors in case the file to delete does not exist. Fixes: #7336 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-07-13 16:20:03 -06:00
David Esparza	22d4e4c5a6	Merge pull request #7328 from GabyCT/topic/updatecommon tests: Add function before function name in common.bash for metrics	2023-07-13 16:11:30 -06:00
Gabriela Cervantes	a864d0e349	tests: Add tensorflow mobilenet dockerfile This PR adds the tensorflow mobilenet dockerfile. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-13 21:24:40 +00:00
Gabriela Cervantes	788d2a254e	tests: Add tensorflow mobilenet performance test This PR adds tensorflow mobilenet performance test for kata metrics. Fixes #7334 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-13 21:18:25 +00:00
David Esparza	e8917d7321	Merge pull request #7330 from GabyCT/topic/storagedoc tests: Add metrics storage documentation	2023-07-13 15:10:53 -06:00
GabyCT	8db43eae44	Merge pull request #7318 from dborquez/fix_timestamp_generator_on_metrics metrics: Fix metrics ts generator to treat numbers as decimals	2023-07-13 11:21:09 -06:00
Gabriela Cervantes	3fed61e7a4	tests: Add storage link to general metrics documentation This PR adds storage link to general metrics README. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-13 16:03:49 +00:00
Gabriela Cervantes	b34dda4ca6	tests: Add storage blogbench metrics documentation This PR adds the storage metrics documentation for blogbench for kata metrics. Fixes #7329 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-13 16:00:14 +00:00
Anastassios Nanos	6787c63900	runtime-rs: add parameter for propagation of (u)mount events Add an extra parameter in `bind_mount_unchecked` to specify the propagation type: "shared" or "slave". Fixes: #7017 Signed-off-by: Anastassios Nanos <ananos@nubificus.co.uk>	2023-07-13 15:58:22 +00:00
Gabriela Cervantes	6e5679bc46	tests: Add function before function name in common.bash for metrics This PR adds function before the function name in common.bash script in order to have uniformity across all the script. Fixes #7327 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-13 15:48:47 +00:00
Archana Shinde	62080f83cb	kata-sys-util: Fix compilation errors Fix compilation errors for aarch64 and s390x Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-07-13 20:09:43 +05:30
Archana Shinde	02d99caf6d	static-checks: Make cargo clippy pass. Get rid of cargo clippy warnings. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-07-13 20:08:13 +05:30
Archana Shinde	9824206820	agent: Make the static checks pass for agent The static checks for the agent require Cargo.lock to be updated. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-07-13 20:08:13 +05:30
Archana Shinde	61e4032b08	kata-ctl: Remove all utility functions to get platform protection Since these have been added to kata-sys-util, remove these from kata-ctl. Change all invocations to get platform protection to make use of kata-sys-util. Fixes: #7144 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-07-13 20:08:13 +05:30
Archana Shinde	a24dbdc781	kata-sys-util: Move utilities to get platform protection Add utilities to get platform protection to kata-sys-util Fixes: #7144 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-07-13 20:08:13 +05:30
Archana Shinde	dacdf7c282	kata-ctl: Remove cpu related functions from kata-ctl Remove cpu related functions which have been moved to kata-sys-util. Change invocations in kata-ctl to make use of functions now moved to kata-sys-util. Signed-off-by: Nathan Whyte <nathanwhyte35@gmail.com> Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-07-13 20:08:13 +05:30
Archana Shinde	f5d1957174	kata-sys-util: Move additional functionality to cpu.rs Make certain imports architecture specific as these are not used on all architectures. Move additional constants and functionality to cpu.rs. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-07-13 20:08:13 +05:30
Nathan Whyte	304b9d9146	kata-sys-util: Move CPU info functions Move get_single_cpu_info and get_cpu_flags into kata-sys-util. Add new functions that get a list of flags and check if a flag exists in that list. Fixes #6383 Signed-off-by: Nathan Whyte <nathanwhyte35@gmail.com> Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-07-13 20:08:13 +05:30
Fabiano Fidêncio	eed3c7c046	Merge pull request #7322 from fidencio/topic/gha-ci-add-cri-containerd-tests-skeleton-follow-up gha: ci: Add cri-containerd tests skeleton -- follow up 1	2023-07-13 13:53:48 +02:00
Fabiano Fidêncio	7319cff77a	ci: cri-containerd: Add LTS / Active versions for containerd As we'll be testing against the LTS and the Active versions of containers, let's add those entries to the versions.yaml file and make sure we export what we want to use for the tests as an env var. The approach taken should not break the current way of getting the containerd version. LTS and Active versions of containerd can be found at: https://containerd.io/releases/#support-horizon Fixes: #6543 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-13 12:05:47 +02:00
Fabiano Fidêncio	2a957d41c8	ci: cri-containerd: Export GOPATH Let's make sure this is exported, as it'll be needed in order to install `yq`, which will be used to get the versions of the dependencies to be installed. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-13 12:05:47 +02:00
Fabiano Fidêncio	75a294b74b	ci: cri-containerd: Ensure deps are installed Let's make sure we install the needed dependencies for running the `cri-containerd` tests. Right now this commit is basically adding a placeholder, and later on, when we'll actually be able to test the job, we'll add the logic of installing the needed dependencies. The obvious dependencies we've spotted so far are: * From the OS * jq * curl (already present) * From our repo * yq (using the install_yq script) * From GitHub * cri-containerd * cri-tools * cni plugins We may need a few more packages, but we will only figure this out as part of the actual work. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-13 12:04:22 +02:00
Zhongtao Hu	b69cdb5c21	Merge pull request #7286 from xuejun-xj/xuejun/up-fix dragonball/agent: Add some optimization for Makefile and bugfixes of unit tests on aarch64	2023-07-13 09:39:23 +08:00
GabyCT	ee17097e88	Merge pull request #7282 from GabyCT/topic/enableblogbench metrics: Enable blogbench test	2023-07-12 16:35:52 -06:00
David Esparza	f63673838b	Merge pull request #7315 from GabyCT/topic/machinelearning tests: Add machine learning performance tests	2023-07-12 15:57:11 -06:00
David Esparza	6924d14df5	metrics: Fix metrics ts generator to treat numbers as decimals Use bc tool to perform math operations even when variables contain values with leading zero. Fixes: #7317 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-07-12 20:57:33 +00:00
Gabriela Cervantes	9e048c8ee0	checkmetrics: Add blogbench read value for qemu This PR adds the blogbench read value for qemu. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-12 20:38:27 +00:00
Gabriela Cervantes	2935aeb7d7	checkmetrics: Add blogbench write value for qemu This PR adds the blogbench write value for qemu limit. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-12 20:37:27 +00:00
Gabriela Cervantes	02031e29aa	checkmetrics: Add blogbench read value for clh This PR adds the blogbench read value for clh limit. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-12 20:37:27 +00:00
Gabriela Cervantes	107fae033b	checkmetrics: Add blogbench write value for clh This PR adds the blogbench write value limit for clh. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-12 20:37:27 +00:00
Gabriela Cervantes	8c75c2f4bd	metrics: Update blogbench Dockerfile This PR updates the blogbench dockerfile to have non interactive mode. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-12 20:37:27 +00:00
Gabriela Cervantes	49723a9ecf	metrics: Add double quotes to variables This PR adds double quotes to variables in the blogbench script to have uniformity across all the tests. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-12 20:37:27 +00:00
Gabriela Cervantes	dc67d902eb	metrics: Enable blogbench test This PR enables the blogbench performance test for the kata metrics CI. Fixes #7281 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-12 20:37:24 +00:00
Fabiano Fidêncio	3f38f75918	Merge pull request #7314 from fidencio/topic/gha-ci-add-cri-containerd-tests-skeleton tests: gha: ci: Add cri-containerd tests skeleton	2023-07-12 22:21:47 +02:00
Fabiano Fidêncio	438fe3b829	gha: ci: Add cri-containerd tests skeleton This PR builds the foundation for us to start migrating the cri-containerd tests from Jenkins to GitHub Actions. Right now the test does nothing and should always finish successfully. The coming PRs will actually introduce logic to the `gha-run.sh` script where we'll be able to run the tests and make sure those pass before having them actually merged. Fixes: #6543 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-12 20:57:39 +02:00
Fabiano Fidêncio	bd08d745f4	tests: metrics: Move metrics specific function to metrics gha-run.sh `compress_metrics_results_dir()` is only used by the metrics GHA. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-12 20:56:55 +02:00
Fabiano Fidêncio	3ffd48bc16	tests: common: Move a few utility functions to common.bash Those functions were originally introduced as part of the `metrics/gha-run.sh` file, but those will be very handy at the time we start adding more tests. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-12 20:55:05 +02:00
Gabriela Cervantes	7f961461bd	tests: Add machine learning README This PR adds machine learning README. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-12 16:37:15 +00:00
Fabiano Fidêncio	bb2ef4ca34	tests: Add `function` before each function Let's just keep this standardised. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-12 18:36:09 +02:00
Gabriela Cervantes	063f7aa7cb	tests: Add Pytorch Dockerfile This PR adds Pytorch Dockerfile for kata metrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-12 16:34:17 +00:00
Fabiano Fidêncio	b6282f7053	Merge pull request #7255 from GabyCT/topic/memoryinsideenabled metrics: Enable memory inside container metrics	2023-07-12 18:33:36 +02:00
Gabriela Cervantes	1af03b9b32	tests: Add Pytorch performance test This PR adds Pytorch performance test for kata metrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-12 16:33:02 +00:00
Gabriela Cervantes	4cecd62370	tests: Add tensorflow Dockerfile This PR adds the tensorflow Dockerfile. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-12 16:31:32 +00:00
Gabriela Cervantes	c4094f62c9	tests: Add metrics machine learning performance tests This PR adds metrics machine learning performance tests like Tensorflow and Pytorch. Fixes #7313 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-12 16:28:25 +00:00
Jeremi Piotrowski	b9a63d66a4	Merge pull request #7297 from jepio/fix-mariner-cache tools: Use a consistent target name when building mariner initrd	2023-07-12 13:43:47 +02:00
Fabiano Fidêncio	1ab99bd6bb	Merge pull request #7276 from fidencio/topic/gha-debug-gha-tests-start gha: ci: Gather info about the node / pods	2023-07-12 12:35:10 +02:00
Chao Wu	f6a51a8a78	Merge pull request #7306 from justxuewei/none-network-model runtime-rs: Do not scan network if network model is "none"	2023-07-12 14:53:52 +08:00
Zvonko Kaiser	4e352a73ee	Merge pull request #7308 from fidencio/topic/gha-temporarily-disable-tdx-runs gha: k8s: tdx: Temporarily disable TDX tests	2023-07-12 08:39:02 +02:00
Fabiano Fidêncio	89b622dcb8	gha: k8s: tdx: Temporarily disable TDX tests TDX tests need to be temporarily disabled as the current machine allocated for this will be off for some time, and a new machine only will be added next week. Fixes: #7307 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-12 08:26:10 +02:00
Fabiano Fidêncio	8c9d08e872	gha: ci: Gather info about the node / pods This is a very simple addition, that should be expanded by https://github.com/kata-containers/kata-containers/pull/7185, and it's targeting gathering more info that will help us to debug CI failures. Fixes: #7296 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-12 08:04:37 +02:00
alex.lyn	283f809dda	runtime-rs: Enhancing Device Manager for network endpoints. Currently, network endpoints are separate from the device manager and need to be included for proper management. In order to do so, we need to refactor the implementation of the network endpoints. The first step is to restructure the NetworkConfig and NetworkDevice structures. Next, we will implement the virtio-net driver and add the Network device to the Device Manager. Finally, we'll unify entries with do_handle_device for each endpoint. Fixes: #7215 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-07-12 11:27:12 +08:00
xuejun-xj	a65291ad72	agent: rustjail: update test_mknod_dev When running cargo test in container, test_mknod_dev may fail sometimes because of "Operation not permitted". Change the device path to "/dev/fifo-test" to avoid this case. Fixes: #7284 Signed-off-by: xuejun-xj <jiyunxue@linux.alibaba.com>	2023-07-12 11:22:32 +08:00
xuejun-xj	46b81dd7d2	agent: clippy: fix cargo clippy warnings Replace "if let Ok(_) = ..." with ".is_ok()" method. Fixes: #7284 Signed-off-by: xuejun-xj <jiyunxue@linux.alibaba.com>	2023-07-12 11:22:32 +08:00
xuejun-xj	c4771d9e89	agent: Makefile: enable set SECCOMP dynamically Change ":=" to "?=". Fixes: #7284 Signed-off-by: xuejun-xj <jiyunxue@linux.alibaba.com>	2023-07-12 11:22:32 +08:00
xuejun-xj	a88212e2c5	utils.mk: update BUILD_TYPE argument Enable to dynamically set BUILD_TYPE argument. Fixes: #7284 Signed-off-by: xuejun-xj <jiyunxue@linux.alibaba.com>	2023-07-12 11:22:32 +08:00
xuejun-xj	883b4db380	dragonball: fix cargo test on aarch64 1. Update memory end assert because address space layout differs between x86 and arm. 2. Set guest_addr for aarch64 in test_handler_insert_region case. Fixes: #7284 TODO: #7290 Signed-off-by: xuejun-xj <jiyunxue@linux.alibaba.com>	2023-07-12 11:22:31 +08:00
Xuewei Niu	6822029c81	runtime-rs: Do not scan network if network model is "none" Skip to scan network from netns if the network model is specified to "none". Fixes: #7305 Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-07-12 10:00:50 +08:00
Fabiano Fidêncio	ae55893deb	Merge pull request #7303 from GabyCT/topic/cleanupmemoryusage metrics: Update memory usage script	2023-07-11 23:52:05 +02:00
Gabriela Cervantes	ce54e43ebe	metrics: Update memory usage script This PR updates memory usage script by applying the clean_env_ctr at the main in order to avoid failures of leaving certain processes not removed. Fixes #7302 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-11 17:03:25 +00:00
Fabiano Fidêncio	ceb5c69ee8	Merge pull request #7299 from fidencio/topic/gha-stop-previous-workflows-if-a-pr-is-updated gha: Cancel previous jobs if a PR is updated	2023-07-11 16:22:47 +02:00
Fabiano Fidêncio	fbc2a91ab5	gha: Cancel previous jobs if a PR is updated Let's make sure we cancel previous runs, mainly as we have some of those that take a lot of time to run, whenever the PR is updated. This is based on the following stack overflow suggestion: https://stackoverflow.com/questions/66335225/how-to-cancel-previous-runs-in-the-pr-when-you-push-new-commitsupdate-the-curre This is very much needed as we don't want to wait for a long time to have access to a runner because other runners are still being used performing a task that's meaningless due to the PR update. Fixes: #7298 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-11 14:37:10 +02:00
Jeremi Piotrowski	307cfc8f7a	tools: Use a consistent target name when building mariner initrd Currently a mixture of cbl-mariner and mariner is used when creating the mariner initrd. The kata-static tarball has mariner in the name, but the jenkins url uses cbl-mariner. This breaks cache usage. Use mariner as the target name throughout the build, so that caching works. Fixes: #7292 Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-07-11 14:17:14 +02:00
Fabiano Fidêncio	aa484dc0e3	Merge pull request #7288 from fidencio/topic/add-nightly-jobs-follow-up-7 gha: nightly: Fix long name of AKS clusters issue and make the CI easier to test	2023-07-11 11:16:09 +02:00
Fabiano Fidêncio	d780cc08f4	gha: nightly: Also use `workflow_dispatch` to trigger it This is a very nice suggestion from Steve Horsman, as with that we can manually trigger the workflow anytime we need to test it, instead of waiting for a full day for it to be retriggered via the `schedule` event. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-11 10:42:40 +02:00
Fabiano Fidêncio	b99ff30267	gha: nightly: Fix name size limit for AKS Passing the commit hash as the "pr-number" has proven problematic as it would make the AKS cluster name longer than what's accepted by AKS. One easy way to solve this is just passing "nightly" as the PR number, as that's only used to create the cluster. Fixes: #7247 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-11 09:59:13 +02:00
xuejun-xj	aedc586e14	dragonball: Makefile: add coverage target Add "coverage" target to compute code coverage for dragonball. Fixes: #7284 Signed-off-by: xuejun-xj <jiyunxue@linux.alibaba.com>	2023-07-11 14:36:25 +08:00
Fabiano Fidêncio	52100bb3dd	Merge pull request #7280 from fidencio/topic/gha-add-badge-for-our-tests README: Add badge for our Nightly CI	2023-07-10 19:35:33 +02:00
Gabriela Cervantes	310e069f73	checkmetrics: Enable checkmetrics for memory inside test This PR enables the checkmetrics to include the memory inside container test. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-10 17:05:13 +00:00
Fabiano Fidêncio	b61b15aab6	Merge pull request #7259 from fidencio/topic/gha-restrict-job-run-according-to-files-touched gha: Do not run all the tests if only docs are updated	2023-07-10 18:12:29 +02:00
Fabiano Fidêncio	1363fbbf12	README: Add badge for our Nightly CI This will help folks to monitor the history of the failing tests, as we've done in Jenkins with the "Green Effort CI". Fixes: #7279 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-10 17:31:51 +02:00
Fabiano Fidêncio	9dc63fe338	Merge pull request #7273 from openanolis/runtime-rs-fix-mem-ci bugfix: plus default_memory when calculating mem size	2023-07-10 15:12:05 +02:00
Zvonko Kaiser	fab2e6a93f	Merge pull request #7277 from fidencio/topic/add-nightly-jobs-follow-up-6 gha: ci: Use github.sha to get the last commit reference	2023-07-10 13:36:31 +02:00
Fabiano Fidêncio	1776b18fa0	gha: Do not run all the tests if only docs are updated We should not go through the trouble of running all our tests on AKS / Azure / baremetal machines in case a PR only changes our documentation. Fixes: #7258 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-10 10:30:46 +02:00
Yushuo	28c29b248d	bugfix: plus default_memory when calculating mem size We've noticed this caused regressions with the k8s-oom tests, and then decided to take a step back and do this in the same way it was done before `67972ec48a`. Moreover, this step back is also more reasonable in terms of the controlling logic. And by doing this we can re-enable the k8s-oom.bats tests, which is done as part of this PR. Fixes: #7271 Depends-on: github.com/kata-containers/tests#5705 Signed-off-by: Yushuo <y-shuo@linux.alibaba.com>	2023-07-10 15:53:04 +08:00
Fabiano Fidêncio	0c1cbd01d8	gha: ci: after-push: Use github.sha to get the last commit reference As we need to pass down the commit sha to the jobs that will be triggered from the `push` event, we must be careful on what exactly we're using there. At first we were using ${{ github.ref }}, but this turns out to be the branch name, rather than the commit hash. In order to actually get the commit hash, let's use ${{ github.sha }} instead. Fixes: #7247 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-10 09:39:33 +02:00
Fabiano Fidêncio	37a9556789	gha: ci: nightly: Use github.sha to get the last commit reference As we need to pass down the commit sha to the jobs that will be triggered from the `schedule` event, we must be careful on what exactly we're using there. At first we were using ${{ github.ref }}, but this turns out to be the branch name, rather than the commit hash. In order to actually get the commit hash, let's use ${{ github.sha }} instead, as described by https://docs.github.com/en/actions/using-workflows/events-that-trigger-workflows# Fixes: #7247 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-10 09:39:26 +02:00
Fabiano Fidêncio	afbc1f94d7	Merge pull request #7272 from fidencio/topic/dragonball-k8s-number-cpus-fix dragonball: Don't fail if a request asks for more CPUs than allowed	2023-07-10 08:25:06 +02:00
Ji-Xinyou	ed23b47c71	tracing: Add tracing to runtime-rs Introduce tracing into runtime-rs, only some functions are instrumented. Fixes: #5239 Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com> Signed-off-by: Yushuo <y-shuo@linux.alibaba.com>	2023-07-09 22:09:43 +08:00
Fabiano Fidêncio	96e9374d4b	dragonball: Don't fail if a request asks for more CPUs than allowed Let's take the same approach as the go runtime and allocate the maximum allowed number of vcpus instead. Fixes: #7270 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-08 15:50:23 +02:00
Fabiano Fidêncio	38f0aaa516	Revert "gha: k8s: dragonball: Skip k8s-number-cpus" This reverts commit `a79505b667`. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-08 14:43:49 +02:00
Fabiano Fidêncio	828a721838	gha: k8s: dragonball: Skip k8s-oom Let's skip the k8s-oom, as the test is currently failing. We've an issue opened for that, and we'll be working on re-enabling it as soon as possible. Reference: https://github.com/kata-containers/kata-containers/issues/7271 Fixes: #7253 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-08 14:27:49 +02:00
Fabiano Fidêncio	a79505b667	gha: k8s: dragonball: Skip k8s-number-cpus Let's skip the k8s-number-cpus, as the test is currently failing. We've an issue opened for that, and we'll be working on re-enabling it as soon as possible. Reference: https://github.com/kata-containers/kata-containers/issues/7270 Fixes: #7253 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-08 14:27:42 +02:00
Fabiano Fidêncio	275c84e7b5	Revert "agent: fix the issue of exec hang with a backgroud process" This reverts commit `25d2fb0fde`. The reason we're reverting the commit is to check whether it's the cause of the regression on devmapper tests. Fixes: #7253 Depends-on: github.com/kata-containers/tests#5705 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-08 14:27:40 +02:00
Gabriela Cervantes	2be342023b	checkmetrics: Add memory usage inside container value for qemu This PR adds the memory usage inside container value for qemu. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-07 16:28:28 +00:00
Gabriela Cervantes	6ca34f949e	checkmetrics: Add memory inside container value for clh Add memory inside container value for clh. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-07 16:28:28 +00:00
Gabriela Cervantes	6c68924230	metrics: Enable memory inside container metrics This PR will enable the memory inside container metrics for the Kata CI. Fixes #7254 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-07 16:28:28 +00:00
Fabiano Fidêncio	b7c58320a5	Merge pull request #7267 from fidencio/topic/add-nightly-jobs-follow-up-5 gha: ci: Fix refernce passed to checkout@v3	2023-07-07 18:26:44 +02:00
Fabiano Fidêncio	0ad298895e	gha: ci: Fix refernce passed to checkout@v3 On `cc3993d860` we introduced a regression, where we started passing inputs.commit-hash, instead of github.event.pull_request.head.sha. However, we have been setting commit-hash to github.event.pull_request.sha, meaning that we're missing a `.head.` there. github.event.pull_request.sha is empty for the pull_request_target event, leading the CI to pull the content from `main` instead of the content from the PR. Fixes: #7247 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-07 17:55:11 +02:00
Fabiano Fidêncio	48d9f8769e	Merge pull request #7264 from fidencio/topic/add-nightly-jobs-follow-up-4 gha: ci: Avoid using env also in the ci-nightly and payload-after-push	2023-07-07 17:10:43 +02:00
Fabiano Fidêncio	86904909aa	gha: ci: Avoid using env also in the ci-nightly and payload-after-push The latter workflow is breaking as it doesn't recognise ${GITHUB_REF}, the former would most likely break as well, but it didn't get triggered yet. The error we're facing is: ``` Determining the checkout info /usr/bin/git branch --list --remote origin/${GITHUB_REF} /usr/bin/git tag --list ${GITHUB_REF} Error: A branch or tag with the name '${GITHUB_REF}' could not be found ``` Fixes: #7247 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-07 14:46:30 +02:00
Fabiano Fidêncio	48c3cec1f4	Merge pull request #7243 from sprt/ensure-cluster-no-exist gha: k8s: Ensure cluster doesn't exist before creating it	2023-07-07 14:03:41 +02:00
Fabiano Fidêncio	3e2b723487	Merge pull request #7263 from fidencio/topic/add-nightly-jobs-follow-up-3 gha: ci: More follow up fixes after adding a nightly CI	2023-07-07 13:58:26 +02:00
Fabiano Fidêncio	18bd2d6e4a	Merge pull request #6839 from sprt/sprt/mariner-ci-tests tests: Enable running k8s tests on Mariner	2023-07-07 13:36:28 +02:00
Zvonko Kaiser	f72cb2fc12	agent: Remove shadowed function, add slog-term Remove shadowed get_mounts(), added slog-term as a new crate, slog can directly log to stdout and we can capture output in the test-cases that are created in the function to be tested. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-07-07 11:28:14 +00:00
Fabiano Fidêncio	1d05b9cc71	gha: ci: Pass down secrets to ci-on-push / ci-nightly We have to do this, otherwise we cannot log into azure. This is a regression introduced by `106e305717`. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-07 12:00:33 +02:00
Fabiano Fidêncio	c5b4164cb1	gha: ci: Fix tarball-suffix passed to the metrics tests Instead of passing "-${{ inputs.tag }}-amd64", we must only pass "-${{ inputs.tag }}". This is a regression introduced by `106e305717`. Fixes: #7247 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-07 12:00:24 +02:00
Fabiano Fidêncio	fa0f9954a1	Merge pull request #7261 from fidencio/topic/add-nightly-jobs-follow-up-2 gha: ci: Avoid using env unless it's really needed	2023-07-07 10:13:25 +02:00
Zvonko Kaiser	07810bf71f	agent: Ignore already mounted dev/fs/pseudo-fs Using an initrd and setting KATA_INIT=yes means we're using the kata-agent as the init process, so we need to make sure that the agent is not segfaulting if mounts have already happened. Some workloads need to configure several things in the initrd before the kata-agent starts, which involves having /proc or /sys already mounted. Fixes: #6992 Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-07-07 07:36:04 +00:00
Fabiano Fidêncio	11e3ccfa4d	gha: ci: Avoid using env unless it's really needed `de83cd9de7` tried to solve an issue, but it clearly seems that I'm using env wrongly, as what ended up being passed as input was "$VAR", instead of the content of the VAR variable. As we can simply avoid using those here, let's do it and save us a headache. Fixes: #7247 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-07 07:31:10 +02:00
Aurélien Bombo	c45f646b9d	gha: k8s: Ensure cluster doesn't exist before creating it The cluster cleanup step will sometimes fail to run, meaning the next run would fail in the cluster creation step. This PR addresses that. Example: https://github.com/kata-containers/kata-containers/actions/runs/5349582743/jobs/9867845852 Fixes: #7242 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-07-06 15:06:30 -07:00
GabyCT	58e921eace	Merge pull request #7260 from fidencio/topic/add-nightly-jobs-follow-up-1 gha: ci: Follow up fixes for the nightly jobs	2023-07-06 15:45:13 -06:00
GabyCT	54da0d7c91	Merge pull request #7230 from GabyCT/topic/enabmemory tests: Enable memory usage metrics tests	2023-07-06 14:30:56 -06:00
Fabiano Fidêncio	1a7bbcd398	gha: ci: Fix typo pull_requesst -> pull_request Thanks David Esparza for pointing this one out. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-06 22:29:00 +02:00
Fabiano Fidêncio	ddf4afb961	gha: ci: Fix set-fake-pr-number job It has to have steps declared, and we need to make it a dependency for the nightly kata-containers-ci-on-push job. Fixes: #7247 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-06 22:02:08 +02:00
Fabiano Fidêncio	8a0a66655d	gha: ci: schedule expects a list, not a map And because of that we need to declare '- cron', instead of 'cron'. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-06 22:02:08 +02:00
Fabiano Fidêncio	5c0269dc5a	gha: ci: Add pr-number input to the correct job It must have been an input for the AKS jobs, not the SNP one. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-06 22:02:08 +02:00
Fabiano Fidêncio	de83cd9de7	gha: ci: Use $VAR instead of ${{ env.VAR }} Otherwise we'll get the following error from the workflow: ``` The workflow is not valid. .github/workflows/ci-on-push.yaml (Line: 24, Col: 20): Unrecognized named-value: 'env'. Located at position 1 within expression: env.COMMIT_HASH .github/workflows/ci-on-push.yaml (Line: 25, Col: 18): Unrecognized named-value: 'env'. Located at position 1 within expression: env.PR_NUMBER ``` Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-06 22:02:08 +02:00
Wainer Moschetta	1a4ae1ef47	Merge pull request #6953 from fidencio/topic/add-nightly-jobs gha: Add nightly jobs	2023-07-06 14:50:10 -03:00
Gabriela Cervantes	6acce83e12	metrics: Fix the call to check_metrics function This PR fixes the call to check_metrics function as KATA_HYPERVISOR is not needed to be passed. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-06 17:22:49 +00:00
David Esparza	0bd21c173a	Merge pull request #7240 from dborquez/storing_metrics_artifacts metrics: storing metrics workflow artifacts	2023-07-06 09:49:45 -06:00
Fabiano Fidêncio	152e2509ca	Merge pull request #7238 from fidencio/topic/gha-run-tests-on-specific-namespace gha: k8s: Ensure tests are running on a specific namespace	2023-07-06 17:25:00 +02:00
Fabiano Fidêncio	e067d18333	gha: Add a nightly CI job The idea is to mimic what's been done with Jenkins and the "Green CI" effort, but now using our GHA and the GHA infrastructure. Fixes: #7247 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-06 14:39:49 +02:00
Fabiano Fidêncio	7c0de8703c	gha: k8s: Ensure tests are running on a specific namespace Let's make sure we run our tests in a specific namespace, as in case of any kind of issue, we will just get rid of the namespace itself, which will take care of cleaning up any leftover from failing tests. One important thing to mention is why we can get rid of the `namespace: ${namespace}` on the tests that are already using it, and let's do it in parts: * namespace: default We can easily get rid of this as that's the default namespace where pods are created, so it was a no-op so far. * namespace: test-quota-ns My understanding is that we'd need this in order to get a clean namespace where we'd be setting a quota for. Doing this in the namespace that's only used for tests should not cause any side-effect on the tests, as we're running those in serial and there are no other pods running on the `kata-containers-k8s-tests` namespace Last but not least, we're not dynamically creating namespaces as the tests are not running in parallel, never, not in the case of having 2 tests being run at same time, neither in the case of having 2 jobs being scheduled to the same machine. Fixes: #6864 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-06 14:14:50 +02:00
Fabiano Fidêncio	106e305717	gha: Create a re-usable `ci.yaml` file This is based on the `ci-on-push.yaml` file, and it's called from there. The reason to split on a new file is that we can easily introduce a `ci-nightly.yaml` file and re-use the `ci.yaml` file there as well. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-06 13:07:59 +02:00
Fabiano Fidêncio	cc3993d860	gha: Pass event specific info from the caller workflow Let's ensure we're not relying, on any of the called workflows, on event specific information. Right now, the two pieces of information we've been relying on are: * PR number, coming from github.event.pull_request.number * Commit hash, coming from github.event.pull_request.head.sha As we want to, in the future, add nightly jobs, which will be triggered by a different event (thus, having different fields populated), we should ensure that those are not used unless it's in the "top action" that's triggered by the event. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-06 11:23:17 +02:00
David Esparza	4e396e7285	metrics: Add function keyword to helper metrics functions Use the 'function' keyword to prevent bash aliases from colliding with other function's name. Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-07-05 20:59:21 -06:00
David Esparza	1ca17c2f70	metrics: storing metrics workflow artifacts This PR enables storing metrics workflow artifacts in two separated flavours: clh and qemu. Fixes: #7239 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-07-05 20:57:10 -06:00
David Esparza	a3fc673121	Merge pull request #7181 from dborquez/add_blogbench_and_webtooling metrics: Adds blogbench and webtool metrics tests	2023-07-05 20:37:33 -06:00
Gabriela Cervantes	5a61065ab7	checkmetrics: Add checkmetrics value for memory usage in qemu This PR adds the checkmetrics value for memory usage in qemu. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-05 19:22:12 +00:00
Gabriela Cervantes	78086ed1fe	checkmetrics: Add memory usage value for clh This PR adds the memory usage value for clh. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-05 19:19:04 +00:00
Gabriela Cervantes	1c3dbafbf0	metrics: Fix function of how to retrieve multiple values This PR fixes the function of how to add multiple values of pss memory. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-05 18:19:36 +00:00
Gabriela Cervantes	18968f428f	metrics: Add function to have uniformity This PR adds the function name before the function to have uniformity across all the tests. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-05 18:15:31 +00:00
David Esparza	35d096b607	metrics: Adds blogbench and webtool metrics tests This PR adds blogbench and webtooling metrics checks to this repo. The function running the test intentionally returns zero, so the test will be enabled in another PR once the workflow is green. Fixes: #7069 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-07-04 14:38:52 -06:00
Gabriela Cervantes	d8f90e89d5	metrics: Rename function at memory usage script This PR renames the function name for the memory usage script. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-04 19:58:09 +00:00
Gabriela Cervantes	b9d66e0d53	metrics: Fix double quotes variables in memory usage script This PR uses double quotes in all the variables as well as general fixes to the memory usage script in order to have uniformity. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-04 19:51:36 +00:00
Gabriela Cervantes	476a11194a	tests: Enable memory usage metrics tests This PR enables the memory usage metrics tests for kata CI. Fixes #7229 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-04 16:11:54 +00:00
Fabiano Fidêncio	a25d5b9807	Merge pull request #7222 from jepio/fix-dragonball-check gha: dragonball: Correctly propagate PATH update	2023-07-04 15:59:13 +02:00
Jeremi Piotrowski	b568c7f7d8	tests/integration: Provide default value for KATA_HOST_OS Non AKS k8s tests (SEV/SNP/TDX) don't currently set KATA_HOST_OS, so provide a default empty value for the variable so that those tests can run. Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-07-04 14:28:29 +02:00
Fabiano Fidêncio	6d2e6ed7b6	Merge pull request #7217 from likebreath/0630/clh_v33.0 versions: Upgrade to Cloud Hypervisor v33.0	2023-07-04 12:52:26 +02:00
Jeremi Piotrowski	d6e96ea06d	tests/integration: Use AzureLinux instead of Mariner as OSSKU value, to get rid of this warning when creating the AKS cluster: WARNING: The osSKU "AzureLinux" should be used going forward instead of "CBLMariner" or "Mariner". The osSKUs "CBLMariner" and "Mariner" will eventually be deprecated. Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-07-04 12:49:07 +02:00
Jeremi Piotrowski	40c46c75ed	tests/integration: Perform yq install in run_tests() We only need to install in run_tests() so that the yq install is picked up by kubernets/setup.sh as well. We also need to either use (sudo && INSTALL_IN_GOPATH=false) \|\| (INSTALL_IN_GOPATH=true). Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-07-04 12:49:07 +02:00
Bin Liu	f214058b07	Merge pull request #7202 from wedsonaf/macros Convert `is_allowed`, `ttrpc_error` and `sl` to functions	2023-07-04 14:23:08 +08:00
Peng Tao	f5658c7833	Merge pull request #7224 from fidencio/topic/gha-release-fix-hub-download gha: release: Use a specific release of hub	2023-07-04 10:21:17 +08:00
GabyCT	5950df7d95	Merge pull request #7199 from GabyCT/topic/installchem metrics: Add checkmetrics to gha-run.sh for metrics CI	2023-07-03 17:49:18 -06:00
Gabriela Cervantes	d8b8f7e94d	metrics: Enable launch tests time metrics This PR enables the launch tests metrics for kata CI. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-03 22:38:04 +00:00
Fabiano Fidêncio	72fd562bd6	gha: release: Use a specific release of hub ideally we should never ever use hub again, and switch to a supported / release tool instead. However, in order to get v3.1.3 released, let's just stick to the last released version of hub, as trying to get its release is leading to: ``` curl -s "https://api.github.com/repos/github/hub/releases/latest" { "message": "Moved Permanently", "url": "https://api.github.com/repositories/401025/releases/latest", "documentation_url": "https://docs.github.com/v3/#http-redirects" } ``` And that breaks the release process. :-/ Fixes: #7223 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-03 22:00:55 +02:00
Fabiano Fidêncio	a7340a63a4	Merge pull request #7209 from GabyCT/topic/fixbuildovmf packaging: Fix indentation of build.sh script at ovmf	2023-07-03 20:06:29 +02:00
Gabriela Cervantes	0502354b42	checkmetrics: Add checkmetrics json for qemu This PR adds checkmetrics json file for qemu metrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-03 16:47:03 +00:00
Gabriela Cervantes	b481ef1883	makefile: Add -buildvcs=false flag to go build This PR adds the -buildvcs=false flag to the go build of checkmetrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-03 16:42:51 +00:00
Gabriela Cervantes	e94aaed3c7	ci_worker: Add checkmetrics ci worker for cloud hypervisor This PR adds the checkmetrics ci worker file for cloud hypervisor in order to check the boot times limit. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-03 16:42:51 +00:00
Gabriela Cervantes	917576e6fb	metrics: Add double quotes in all variables This PR adds double quotes in all variables to have uniformity across all the gha-run.sh script. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-03 16:42:50 +00:00
Gabriela Cervantes	cc8f0a24e4	metrics: Add checkmetrics to gha-run.sh for metrics CI This PR adds checkmetrics installation for gha-run.sh in order to compare results limits as part of the metrics CI. Fixes #7198 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-07-03 16:41:31 +00:00
Jeremi Piotrowski	477856c1e3	gha: dragonball: Correctly propagate PATH update cargo/rust is installed in one step, so we need to write the PATH update to GITHUB_ENV so that it becomes visible in the next steps. Fixes: #7221 Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-07-03 17:05:12 +02:00
Fupan Li	b6307c2744	Merge pull request #5444 from zvonkok/vra doc: Add documentation for the virtualization reference architecture	2023-07-03 10:14:20 +08:00
Peng Tao	c85aff7ef4	Merge pull request #6949 from zvonkok/kernel-fixes gpu: Update kernel building to the latest changes	2023-07-03 09:53:08 +08:00
Peng Tao	581be92b25	Merge pull request #4492 from zvonkok/pcie-topology runtime: fix PCIe topology for GPUDirect use-case	2023-07-03 09:17:12 +08:00
David Esparza	d01762dc35	Merge pull request #7174 from dborquez/add_memory_footprint_test metrics: Add memory footprint tests	2023-06-30 16:32:10 -06:00
Fabiano Fidêncio	00b0755e3e	Merge pull request #7200 from fidencio/topic/add-virtiofs-none-option runtime: Add "none" as a shared_fs option	2023-06-30 22:45:39 +02:00
Aurélien Bombo	1c211cd730	gha: Swap asset/release in build matrix This simply displays the asset name first in GH's UI, so that the release name (always "test") is truncated rather than the asset name. Makes things slightly easier to read. e.g. build-asset (cloud-hypervisor-glibc, te... instead of build-asset (test, cloud-hypervisor-gli... Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-06-30 12:51:40 -07:00
Aurélien Bombo	0152c9aba5	tools: Introduce `USE_CACHE` environment variable This allows setting `USE_CACHE=no` to test building e2e during development without having to comment code blocks and so forth. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-06-30 12:51:40 -07:00
Aurélien Bombo	2b59756894	tests: Build CLH with glibc for Mariner This enables building CLH with glibc and the mshv feature as required for Mariner. At test time, it also configures Kata to use that CLH flavor when running Mariner. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-06-30 12:51:40 -07:00
Aurélien Bombo	80c78eadce	tests: Use baked-in kernel with Mariner Mariner ships a bleeding-edge kernel that might be ahead of upstream, so we use that to guarantee compatibility with the host. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-06-30 12:51:40 -07:00
Aurélien Bombo	532755ce31	tests: Build Mariner rootfs initrd * Adds a new `rootfs-initrd-mariner` build target. * Sets the custom initrd path via annotation in `setup.sh` at test time. * Adapts versions.yaml to specify a `cbl-mariner` initrd variant. * Introduces env variable `HOST_OS` at deploy time to enable using a custom initrd. * Refactors the image builder so that its caller specifies the desired guest OS. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-06-30 12:51:40 -07:00
Fabiano Fidêncio	6a21e20c63	runtime: Add "none" as a shared_fs option Currently, even when using devmapper, if the VMM supports virtio-fs / virtio-9p, that's used to share a few files between the host and the guest. This is needed, as we need to share with the guest contents like secrets, certificates, and configurations, via Kubernetes objects like configMaps or secrets, and those are rotated and must be updated into the guest whenever the rotation happens. However, there are still use-cases where users can live with just copying those files into the guest at the pod creation time, and for those there's absolutely no need to have a shared filesystem process running with no extra obvious benefit, consuming memory and even increasing the attack surface used by Kata Containers. For the case mentioned above, we should allow users, making it very clear which limitations it'll bring, to run Kata Containers with devmapper without actually having to use a shared file system, which is already the approach taken when using Firecracker as the VMM. Fixes: #7207 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-06-30 20:45:00 +02:00
Bo Chen	5681caad5c	versions: Upgrade to Cloud Hypervisor v33.0 Details of this release can be found in our roadmap project as iteration v33.0: https://github.com/orgs/cloud-hypervisor/projects/6. Fixes: #7216 Signed-off-by: Bo Chen <chen.bo@intel.com>	2023-06-30 09:37:27 -07:00
David Esparza	b2ce8b4d61	metrics: Add memory footprint tests to the CI This PR adds memory footprint metrics to tests/metrics/density folder. Intentionally, each test exits w/ zero in all test cases to ensure that tests would be green when added, and will be enabled in a subsequent PR. A workflow matrix was added to define hypervisor variation on each job, in order to run them sequentially. The launch-times test was updated to make use of the matrix environment variables. Fixes: #7066 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-06-30 09:52:27 -06:00
David Esparza	5e3f617cb6	Merge pull request #7197 from GabyCT/topic/fixfunctionname metrics: Uniformity across function names in gha-run.sh	2023-06-30 09:37:15 -06:00
Zvonko Kaiser	d035955ef5	doc: Add documentation for the virtualization reference architecture Fixes: #4041 Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-06-30 12:30:37 +00:00
Zvonko Kaiser	0f454d0c04	gpu: Fixing typos for PCIe topology changes Some comments and functions had typos and wrong capitalization. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-06-30 08:42:55 +00:00
Gabriela Cervantes	6bb2ea8195	packaging: Fix indentation of build.sh script at ovmf This PR fixes the indentation of build.sh script at ovmf. Fixes #7208 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-06-29 15:46:54 +00:00
Fupan Li	4288b935e1	Merge pull request #7104 from openanolis/physical/endpoint runtime-rs: support physical endpoint using device manager	2023-06-29 14:43:44 +08:00
GabyCT	19890133e9	Merge pull request #7189 from Apokleos/direct-vol-bugfix runtime-rs: bugfix for direct volume path's validation.	2023-06-28 12:26:22 -06:00
Wedson Almeida Filho	0504bd7254	agent: convert the `sl` macros to functions There is nothing in them that requires them to be macros. Converting them to functions allows for better error messages. Fixes: #7201 Signed-off-by: Wedson Almeida Filho <walmeida@microsoft.com>	2023-06-28 14:05:32 -03:00
Wedson Almeida Filho	0860fbd410	agent: convert the `ttrpc_error` macro to a function There is nothing in it that requires it to be a macro. Converting it to a function allows for better error messages. Fixes: #7201 Signed-off-by: Wedson Almeida Filho <walmeida@microsoft.com>	2023-06-28 14:05:32 -03:00
Wedson Almeida Filho	0e5d6ce6d7	agent: convert the `is_allowed` macro to a function Having a function allows for better error messages from the type checker and it makes it clearer to callers what can happen. For example: is_allowed!(req); Gives no indication that it may result in an early return, and no simple way for callers to modify the behaviour. It also makes it look like ownership of `req` is being transferred. On the other hand, is_allowed(&req)?; Indicates that `req` is being borrowed (immutably) and may fail. The question mark indicates that the caller wants an early return on failure. Fixes: #7201 Signed-off-by: Wedson Almeida Filho <walmeida@microsoft.com>	2023-06-28 14:05:32 -03:00
Wedson Almeida Filho	f680fc52be	agent: change `AGENT_CONFIG`'s lazy type to just `AgentConfig` Since it is never modified, it doesn't really need a lock of any kind. Removing the `RwLock` wrapper allows us to remove all `.read().await` calls when accessing it. Additionally, `AGENT_CONFIG` already has a static lifetime, so there is no need to wrap it in a ref-counted heap allocation. Fixes: #5409 Signed-off-by: Wedson Almeida Filho <walmeida@microsoft.com>	2023-06-28 14:05:27 -03:00
GabyCT	3f87d0fbfe	Merge pull request #7180 from dborquez/run_ret_hypervisor_version_w_sudo metrics: Fix retrieving hypervisor version on metrics	2023-06-28 10:54:23 -06:00
Gabriela Cervantes	beb7063683	metrics: Uniformity across function names This PR adds the word function before the function names in order to have uniformity across the script as some are using this and some are not. Fixes #7196 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-06-28 16:09:19 +00:00
Fabiano Fidêncio	c8d33da8a4	Merge pull request #7188 from jongwu/fix_vfio runtime-rs: fix build error on AArch64	2023-06-28 15:43:14 +02:00
Jianyong Wu	1f3e837e4b	runtime-rs: fix build error on AArch64 VFIO support introduced a build error on AArch64. Removing the arch-related annotation avoids this error. Fixes: #7187 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2023-06-28 07:10:43 +00:00
alex.lyn	6fd25968c6	runtime-rs: bugfix for direct volume path's validation. The failure is mainly caused by the encoded volume path and the mount/src. The src is validated with stat, but it is encoded and not a full path, which causes the stat of the mount source to fail. Fixes: #7186 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-06-28 10:07:07 +08:00
GabyCT	3885ba4910	Merge pull request #7173 from GabyCT/topic/addcheckm checkmetrics: Add checkmetrics makefile and documentation	2023-06-27 16:30:44 -06:00
Gabriela Cervantes	415578cf3b	docs: Add general README This PR adds a link to the unreferenced docs in the cmd path to make them more discoverable. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-06-27 20:29:37 +00:00
Zhongtao Hu	c76583a08f	Merge pull request #7171 from GabyCT/topic/enabletimedoc docs: Add boot time metrics documentation	2023-06-27 10:28:56 +08:00
Zhongtao Hu	bff4672f7d	runtime-rs: support physical endpoint using device manager use device manager to attach physical endpoint Fixes: #7103 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2023-06-27 10:25:51 +08:00
David Esparza	32cba7e44a	metrics: Fix retrieving hypervisor version on metrics This PR makes use of sudo to retrieve the hypervisor version. Fixes: #7178 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-06-26 16:26:27 -06:00
Gabriela Cervantes	aa7946de47	checkmetrics: Add general checkmetrics documentation This PR adds the general checkmetrics documentation for kata metrics tests. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-06-26 17:07:57 +00:00
Gabriela Cervantes	2fac2b72fe	checkmetrics: Add checkmetrics makefile This PR adds checkmetrics makefile which is used to process the metrics json results files. Fixes #7172 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-06-26 16:31:55 +00:00
Gabriela Cervantes	e45899ae0e	docs: Add time tests documentation reference This PR adds time tests documentation reference in the general README for kata metrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-06-26 16:30:20 +00:00
Gabriela Cervantes	28130d3cef	docs: Add boot time metrics documentation This PR adds boot time metrics documentation for kata metrics tests. Fixes #7170 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-06-26 16:19:28 +00:00
Zhongtao Hu	ce8e3cc091	Merge pull request #7073 from Apokleos/spdk-vol runtime-rs: add support spdk/vhost-user based volume.	2023-06-26 11:34:44 +08:00
alex.lyn	0df2fc2702	runtime-rs: add support spdk/vhost-user based volume. Unlike the previous usage which requires creating /dev/xxx by mknod on the host, the new approach will fully utilize the DirectVolume-related usage method, and pass the spdk controller to vmm. A user guide on using the spdk volume when running kata-containers can be found in docs/how-to. Fixes: #6526 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-06-25 16:23:19 +08:00
GabyCT	4cf552c151	Merge pull request #7097 from stevenhorsman/remove-unecessary-kata-versions static-build: Remove kata-version parameter	2023-06-23 16:53:57 -06:00
GabyCT	388b55175e	Merge pull request #7056 from FuuuOverclocking/fuu/fix-console_manager dragonball: avoid obtaining lock twice in create_stdio_console	2023-06-23 16:47:00 -06:00
GabyCT	1a80fd66a2	Merge pull request #7161 from GabyCT/topic/enablemetricslimits metrics: Add checkmetrics for kata metrics CI	2023-06-23 16:45:16 -06:00
Gabriela Cervantes	17198089ee	vendor: Add vendor checkmetrics dependencies This PR adds the vendor for the checkmetrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-06-23 20:55:30 +00:00
David Esparza	cfd6da9467	Merge pull request #7159 from dborquez/enable_launchtimes_test metrics: enable launch-times test on gha-run metrics script	2023-06-23 12:59:46 -06:00
GabyCT	d6ff48f4e7	Merge pull request #7158 from GabyCT/topic/addmetricsreadme docs: Add general metrics documentation	2023-06-23 11:28:00 -06:00
Gabriela Cervantes	f1dfea6e87	docs: Add metrics documentation reference This PR adds the metrics documentation as a general reference in the main README for kata containers. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-06-23 16:26:34 +00:00
Zvonko Kaiser	8330fb8ee7	gpu: Update unit tests Some tests are now failing due to the changes how PCIe is handled. Update the test accordingly. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-06-23 11:16:25 +00:00
David Esparza	8593594247	metrics: enable launch-times test on gha-run metrics script This PR enables launch-times test on gha metrics workflow. Fixes: #7049 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-06-22 18:05:46 -06:00
Fupan Li	469c678425	Merge pull request #7058 from Apokleos/vfio-dev add support vfio device manager	2023-06-22 17:51:22 -06:00
Gabriela Cervantes	c4ee601bf4	metrics: Add checkmetrics for kata metrics CI This PR adds the checkmetrics scripts that will be used for the kata metrics CI. Fixes #7160 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-06-22 21:06:46 +00:00
Steve Horsman	267e97f9c0	Merge pull request #7162 from sprt/trusted-pr-authors gha: Don't automatically trigger CI	2023-06-22 20:55:10 +01:00
Aurélien Bombo	e0d6475b49	gha: Don't automatically trigger CI We have GH configured so that manual approval is required for CI runs triggered by outside contributors. However, because CI is triggered by the `pull_request_target` event, this setting isn't being honored (see [1]). This means that an attacker could trivially extract secrets by submitting a PR. This change aims to mitigate this issue by preventing PRs from triggering CI unless the `ok-to-test` label is set. Note: For further context, we use the `pull_request_target` event and manually check out the PR branch because it is the only way to both access secrets and test incoming code changes. Fixes: #7163 [1]: https://docs.github.com/en/actions/managing-workflow-runs/approving-workflow-runs-from-public-forks Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-06-22 11:05:53 -07:00
Aurélien Bombo	b535c7cbd8	tests: Enable running k8s tests on Mariner This removes the gate and lets CI run tests on Mariner. Fixes: #6840 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-06-22 10:30:52 -07:00
Archana Shinde	2d329125fd	Merge pull request #6800 from amshinde/check-vm-capability kata-ctl: Check for vm capability	2023-06-21 23:52:46 -07:00
Zhongtao Hu	4b793222ab	Merge pull request #7154 from cheriL/7153/fix_spellings docs: fix spelling of "crate"	2023-06-22 10:48:58 +08:00
Gabriela Cervantes	71071bdb63	docs: Add general metrics documentation This PR adds a general metrics introduction documentation for the kata CI. Fixes #7157 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-06-21 17:19:36 +00:00
Archana Shinde	610f7986e4	check: Relax the unrestricted_guest check when running in a VM When running on a VM, the kernel parameter "unrestricted_guest" for kernel module "kvm_intel" is not required. So, return success when running on a VM without checking value of this kernel parameter. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-06-21 07:30:35 -07:00
Archana Shinde	1b406b9d0c	kata-ctl: Implement functionality to check host is capable of running VM Implement functionality to add to the env output if the host is capable of running a VM. Fixes: #6727 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-06-21 07:30:22 -07:00
David Esparza	90408d66c0	Merge pull request #7148 from GabyCT/topic/fixtabsinitscript packaging: Fix indentation in init.sh script	2023-06-21 07:24:25 -06:00
stevenhorsman	adf88eaa89	static-build: Remove kata-version parameter - Remove the unnecessary kata-version passed as a second parameter Fixes: #7096 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2023-06-21 10:15:42 +01:00
soup	09720babc3	docs: fix spelling of "crate" Fixes: #7153 Signed-off-by: soup <lqh348659137@outlook.com>	2023-06-21 16:10:54 +08:00
David Esparza	84b214d9d2	Merge pull request #7150 from GabyCT/topic/fixworkflows gha: Fix gha actions	2023-06-20 18:08:23 -06:00
Gabriela Cervantes	7185afc50e	gha: Fix gha actions This PR removes an unrecognized value located in one of the yamls for the gha in order to make the CI work again. Fixes #7149 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-06-20 23:13:25 +00:00
Gabriela Cervantes	21294b868d	packaging: Fix indentation in init.sh script This PR replaces single spaces for tabs in order to fix the indentation in the init.sh script. Fixes #7147 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-06-20 22:06:52 +00:00
GabyCT	90e36f43ff	Merge pull request #7138 from dborquez/setup-kata-and-configure-launchtimes-test metrics: install kata and launch-times test	2023-06-20 16:00:38 -06:00
David Esparza	fad3ac9f58	metrics: install kata and launch-times test This PR installs kata static tarball on metrics runner and run launch-times tests. Fixes: #7049 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-06-20 13:58:09 -06:00
David Esparza	d071a87c7b	Merge pull request #7109 from dborquez/add_common_libs_for_metrics tests: Move tests helper script to this repo	2023-06-19 19:02:37 -06:00
David Esparza	4bbfcfaf15	tests: Move tests helper script to this repo The common.sh script includes helper functions used in our metrics tests, so we are gradually adding more metrics used in kata. Fixes: #7108 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-06-19 12:14:25 -06:00
David Esparza	f152f0e8c3	metrics: Add launch-times to metrics tests This test measures the duration of a workload that starts, and then immediately stops the container. Also measures the workload period, the time to quit period, and the time to kernel period. Fixes: #7049 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-06-19 10:40:16 -06:00
GabyCT	decbe77e28	Merge pull request #7129 from GabyCT/topic/metrlibjson tests: Add json script for metrics tests	2023-06-19 09:59:41 -06:00
Fabiano Fidêncio	ef8b360711	Merge pull request #7085 from stevenhorsman/cherry-pick-initramfs Cherry pick initramfs caching updates from CCv0	2023-06-19 11:59:00 +02:00
alex.lyn	59510cfee0	runtime-rs: add support vfio device based volume A new choice of using vfio device based volume for kata-containers. With the help of kata-ctl direct-volume, users are able to add a specified device which is BDF or IOMMU group ID. To help users use it smoothly, a how-to doc was added in docs/how-to/how-to-run-kata-containers-with-kinds-of-Block-Volumes. Fixes: #6525 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-06-18 14:07:05 +08:00
alex.lyn	1e3b372bbb	runtime-rs: add support vfio device manager Limitations: As no rust vmm vfio manager is ready yet, it only supports part of vfio in runtime-rs. The remaining part is to call the vmm interfaces related to vfio add/remove. So when the vmm/vfio manager is ready, a new PR will be pushed to narrow the gap. Fixes: #6525 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-06-18 14:05:59 +08:00
David Esparza	61e819ea8e	Merge pull request #7131 from GabyCT/topic/fixrunner gha: Fix format for run launchtimes metrics yaml	2023-06-16 18:30:57 -06:00
Gabriela Cervantes	6b08489301	gha: Fix format for run launchtimes metrics yaml This PR fixes the format for the run launchtimes metrics yaml which is causing the workflow to fail. Fixes #7130 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-06-16 22:00:36 +00:00
Gabriela Cervantes	3cefa43e75	tests: Add json script for metrics tests This PR adds the json script which allows us to save the metrics results into a json file which will be used in the kata containers metrics. Fixes #7128 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-06-16 19:45:26 +00:00
GabyCT	7976a0ac72	Merge pull request #7114 from GabyCT/topic/libcommontests tests: Add tests lib common script	2023-06-16 11:48:19 -06:00
Greg Kurz	27045798bf	Merge pull request #7112 from gkurz/fix-virtiofsd-args Fix deprecated virtiofsd args (go shim only)	2023-06-16 18:13:24 +02:00
Fabiano Fidêncio	6a3710055b	initramfs: Build dependencies as part of the Dockerfile This will help to not have to build those on every CI run, and rather take advantage of the cached image. Fixes: #7084 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `c720869eef`)	2023-06-16 10:58:12 +01:00
Fabiano Fidêncio	aa2380fdd6	packaging: Add infra to push the initramfs builder image Let's add the needed infra for only building and pushing the initramfs builder image to the Kata Containers' quay.io registry. Fixes: #7084 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `111ad87828`)	2023-06-16 10:58:12 +01:00
Fabiano Fidêncio	1c7fcc6cbb	packaging: Use existing image to build the initramfs Let's first try to pull a pre-existing image, instead of building our own, to be used as a builder for the initramfs. This will save us some CI time. Fixes: #7084 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `ebf6c83839`)	2023-06-16 10:58:12 +01:00
Greg Kurz	a43ea24dfc	virtiofsd: Convert legacy `-o` sub-options to their `--` replacement The `-o` option is the legacy way to configure virtiofsd, inherited from the C implementation. The rust implementation honours it for compatibility but it logs deprecation warnings. Let's use the replacement options in the go shim code. Also drop references to `-o` from the configuration TOML file. Fixes #7111 Signed-off-by: Greg Kurz <groug@kaod.org>	2023-06-16 11:42:54 +02:00
Greg Kurz	8e00dc6944	virtiofsd: Drop `-o no_posix_lock` The C implementation of virtiofsd had some kind of limited support for remote POSIX locks that was causing some workflows to fail with kata. Commit `432f9bea6e` hard coded `-o no_posix_lock` in order to enforce guest local POSIX locks and avoid the issues. We've switched to the rust implementation of virtiofsd since then, but it emits a warning about `-o` being deprecated. According to https://gitlab.com/virtio-fs/virtiofsd/-/issues/53 : The C implementation of the daemon has limited support for remote POSIX locks, restricted exclusively to non-blocking operations. We tried to implement the same level of functionality in #2, but we finally decided against it because, in practice most applications will fail if non-blocking operations aren't supported. Implementing support for non-blocking isn't trivial and will probably require extending the kernel interface before we can even start working on the daemon side. There is thus no justification to pass `-o no_posix_lock` anymore. Signed-off-by: Greg Kurz <groug@kaod.org>	2023-06-16 11:42:39 +02:00
Greg Kurz	2a15ad9788	virtiofsd: Stop using deprecated `-f` option The rust implementation of virtiofsd always runs foreground and spits a deprecation warning when `-f` is passed. Signed-off-by: Greg Kurz <groug@kaod.org>	2023-06-16 10:30:40 +02:00
David Esparza	b9d92f4577	Merge pull request #7117 from dborquez/add_checkout_metrics_workflow gha: Add base branch on SHA on pull request	2023-06-15 17:06:16 -06:00
Gabriela Cervantes	c3043a6c60	tests: Add tests lib common script This PR adds the test lib common script that is going to be used for kata containers metrics. Fixes #7113 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-06-15 21:23:00 +00:00
David Esparza	b16e0de734	gha: Add base branch on SHA on pull request The run-launchtimes-metrics workflow needs to get the commit ID for the last commit to the head branch of the PR. Fixes: #7116 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-06-15 13:11:33 -06:00
Zvonko Kaiser	72f2cb84e6	gpu: Reset cold or hot plug after overriding If we override the cold, hot plug with an annotation we need to reset the other plugging mechanism to NoPort otherwise both will be enabled. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-06-15 17:51:01 +00:00
Zvonko Kaiser	fbacc09646	gpu: PCIe topology, consider vhost-user-block in Virt In Virt the vhost-user-block is a PCIe device so we need to make sure to consider it as well. We're keeping track of vhost-user-block devices and deduce the correct amount of PCIe root ports. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-06-15 17:39:55 +00:00
GabyCT	0f24f427d7	Merge pull request #7101 from dborquez/add_initial_metrics_gh_workflow gha: ci-on-push: Run metrics tests	2023-06-15 10:08:56 -06:00
David Esparza	bc152b1141	gha: ci-on-push: Run metrics tests This gh-workflow prints a simple msg, but is the base for future PRs that will gradually add the jobs corresponding to the kata metrics test. Fixes: #7100 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-06-14 15:15:08 -06:00
GabyCT	a3180d0cb8	Merge pull request #7095 from GabyCT/topic/updatedebugconse docs: Update Developer Guide	2023-06-14 13:49:37 -06:00
Gabriela Cervantes	dad731d5c1	docs: Update Developer Guide This PR updates the developer guide at the connect to the debug console section. Fixes #7094 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-06-14 15:36:51 +00:00
Zhongtao Hu	11692a76e1	Merge pull request #7092 from Apokleos/virtiofs-enhancement runtime-rs: Enhance flexibility of virtio-fs config	2023-06-14 20:01:46 +08:00
Zvonko Kaiser	b11246c3aa	gpu: Various fixes for virt machine type The PCI qom path was not deduced correctly; added a regex for correct path walking. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-06-14 08:33:57 +00:00
Zvonko Kaiser	40101ea7db	vfio: Added annotation for hot(cold) plug Now it is possible to configure the PCIe topology via annotations. Also added a simple test, checking for Invalid and RootPort. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-06-14 08:20:24 +00:00
Zvonko Kaiser	8f0d4e2612	vfio: Cleanup of Cold and Hot Plug Removed the configuration of PCIeRootPort and PCIeSwitchPort, those values can be deduced in createPCIeTopology Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-06-14 08:20:24 +00:00
Zvonko Kaiser	b5c4677e0e	vfio: Rearrange the bus assignment Refactor the bus assignment so that the call to GetAllVFIODevicesFromIOMMUGroup can be used by any module without affecting the topology. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-06-14 08:20:24 +00:00
Zvonko Kaiser	b1aa8c8a24	gpu: Moved the PCIe configs to drivers The hypervisor_state file was the wrong location for the PCIe Port settings, moved everything under device umbrella, where it can be consumed more easily and we do not get into circular deps. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-06-14 08:20:24 +00:00
Zvonko Kaiser	55a66eb7fb	gpu: Add config to TOML Update cold-plug and hot-plug setting to include bridge, root and switch-port Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-06-14 08:20:24 +00:00
Zvonko Kaiser	da42801c38	gpu: Add config settings tests for hot-plug Updated all references and config settings for hot-plug to match cold-plug Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-06-14 08:20:24 +00:00
Zvonko Kaiser	de39fb7d38	runtime: Add support for GPUDirect and GPUDirect RDMA PCIe topology Fixes: #4491 Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-06-14 08:20:24 +00:00
Zvonko Kaiser	9318e022af	gpu: Add CC relates configs For the GPU CC use case we need to set several crypto algorithms. The driver relies on them in the CC case. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-06-14 07:56:53 +00:00
Zvonko Kaiser	b7932be4b6	gpu: Add Arm64 Kernel Settings For different archs we need different settings; use ${ARCH} to choose the right fragment Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-06-14 07:56:53 +00:00
Zvonko Kaiser	211b0ab268	gpu: Update Kernel Config Newer drivers need more symbols, so let's enable them Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-06-14 07:56:53 +00:00
Zvonko Kaiser	5f103003d6	gpu: Update kernel building to the latest changes Use now the sev.conf rather than the snp.conf. Devices can be presented in two different ways in the container: (1) as vfio devices /dev/vfio/<num> (2) the device is managed by whatever driver in the VM kernel claims it. Fixes: #6844 Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-06-14 07:56:53 +00:00
Fabiano Fidêncio	95bec479ca	Merge pull request #7090 from GabyCT/topic/ufcversion versions: Update firecracker version to 1.3.3	2023-06-14 01:24:02 +02:00
Fabiano Fidêncio	8aa4a87fae	Merge pull request #7099 from sprt/fix-new-targets tools: Fix no-op builds	2023-06-14 01:23:39 +02:00
Aurélien Bombo	35e4938e8c	tools: Fix no-op builds This fixes the builds of `cloud-hypervisor-glibc` and `rootfs-initrd-mariner` to properly create the `build/` directory. Fixes: #7098 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-06-13 10:56:49 -07:00
Zhongtao Hu	da8dde0c24	Merge pull request #7079 from HerlinCoder/herlincoder/vpa runtime-rs: update Cargo.lock	2023-06-13 21:44:45 +08:00
Fabiano Fidêncio	ff38937246	Merge pull request #7087 from sprt/fix-gha-stage gha: Fix `stage` definition in matrix	2023-06-13 12:17:25 +02:00
alex.lyn	347385b4ee	runtime-rs: Enhance flexibility of virtio-fs config support more and flexible options for inline virtiofs. Fixes: #7091 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-06-13 15:12:47 +08:00
Zhongtao Hu	355a24e0e1	Merge pull request #6289 from openanolis/runtime_vcpu_resize feat(runtime): vcpu resize capability	2023-06-13 10:54:11 +08:00
Chelsea Mafrica	1763b1f69f	Merge pull request #7082 from jodh-intel/remove-snap packaging: Remove snap package	2023-06-12 17:05:00 -07:00
Gabriela Cervantes	21d2278539	versions: Update firecracker version to 1.3.3 This PR updates the firecracker version to 1.3.3 which includes the following changes Fixed passing through cache information from host in CPUID leaf 0x80000006. A race condition that has been identified between the API thread and the VMM thread due to a misconfiguration of the api_event_fd. Fixes #7089 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-06-12 20:32:02 +00:00
Aurélien Bombo	0e2379909b	gha: Fix `stage` definition in matrix This defines `stage` as a list instead of a literal to fix the GHA CI. Fixes: #7086 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-06-12 11:24:45 -07:00
Fabiano Fidêncio	977309a281	Merge pull request #7027 from sprt/sprt/mariner-build-targets gha: Add new build targets for Mariner	2023-06-12 19:19:22 +02:00
Yushuo	ae2cfa8263	doc: add vcpu handling doc for runtime-rs Kubernetes and Containerd will help calculate the Sandbox Size and pass it to Kata Containers through annotations. In order to accommodate this favorable change and be compatible with the past, we have implemented the handling of the number of vCPUs in runtime-rs. This is slightly different from the original runtime-go design. This doc introduces how we handle vCPU size in runtime-rs. Fixes: #5030 Signed-off-by: Yushuo <y-shuo@linux.alibaba.com> Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2023-06-12 19:23:11 +08:00
Yushuo	7b1e67819c	fix(clippy): fix clippy error Fixes: #5030 Signed-off-by: Yushuo <y-shuo@linux.alibaba.com> Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2023-06-12 17:53:16 +08:00
Yushuo	67972ec48a	feat(runtime-rs): calculate initial size In this commit, we refactored the logic of static resource management. We defined the sandbox size calculated from PodSandbox's annotation and SingleContainer's spec as initial size, which will always be the sandbox size when booting the VM. The configuration static_sandbox_resource_mgmt controls whether we will modify the sandbox size in the following container operation. Signed-off-by: Yushuo <y-shuo@linux.alibaba.com> Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2023-06-12 17:53:16 +08:00
Yushuo	aaa96c749b	feat(runtime-rs): modify onlineCpuMemRequest Some vmms, such as dragonball, will actively help us perform online cpu operations when doing cpu hotplug. Under the old onlineCpuMem interface, it is difficult to adapt to this situation. So we modify the semantics of nb_cpus in onlineCpuMemRequest. In the original semantics, nb_cpus represents the number of newly added CPUs that need to be online. The modified semantics become that the number of online CPUs in the guest needs to be guaranteed. Fixes: #5030 Signed-off-by: Yushuo <y-shuo@linux.alibaba.com> Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2023-06-12 17:53:16 +08:00
Yushuo	d66f7572dd	feat(runtime-rs): clear cpuset in runtime side The declaration of the cpu number in the cpuset is greater than the actual number of vcpus, which will cause an error when updating the cgroup in the guest. This problem is difficult to solve, so we temporarily clean up the cpuset in the container spec before passing in the agent. Fixes: #5030 Signed-off-by: Yushuo <y-shuo@linux.alibaba.com> Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2023-06-12 17:53:16 +08:00
Yushuo	a0385e1383	feat(runtime-rs): update linux resource when stop_process Update the resources when deleting a container, which happens in stop_process in runtime-rs. Fixes: #5030 Signed-off-by: Yushuo <y-shuo@linux.alibaba.com> Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2023-06-12 17:53:16 +08:00
Yushuo	a39e1e6cd1	feat(runtime-rs): merge the update_cgroups in update_linux_resources Updating the vCPU and memory resources of the sandbox and updating cgroups on the host always happen together, and they are all updated based on the linux resources declarations of all the containers. So we merge update_cgroups into update_linux_resources, so we can better manage the resources allocated to one pod in the host. Fixes: #5030 Signed-off-by: Yushuo <y-shuo@linux.alibaba.com> Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2023-06-12 17:53:16 +08:00
Ji-Xinyou	fa6dff9f70	feat(runtime-rs): support vcpu resizing on runtime side Support vcpu resizing on runtime side: 1. Calculate vcpu numbers in resource_manager using all the containers' linux_resources in the spec. 2. Call the hypervisor(vmm) to do the vcpu resize. 3. Call the agent to online vcpus. Fixes: #5030 Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com> Signed-off-by: Yushuo <y-shuo@linux.alibaba.com>	2023-06-12 17:53:16 +08:00
James O. D. Hunt	8cb4238b46	packaging: Remove snap package Nobody has volunteered to maintain the (currently broken) snap build, so remove it. Fixes: #6769. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-06-12 09:24:09 +01:00
Helin Guo	2137739987	runtime-rs: update Cargo.lock After we support memory resize in Dragonball, we need to update Cargo.lock in runtime-rs. Fixes: #6719 Signed-off-by: Helin Guo <helinguo@linux.alibaba.com>	2023-06-12 11:25:59 +08:00
Chao Wu	2988553305	Merge pull request #6998 from HerlinCoder/herlincoder/vpa Dragonball: support resize memory	2023-06-11 17:21:12 +08:00
Archana Shinde	56d2ea9b78	kata-ctl: Refactor kernel module check Adding vhost and vhost-net to the kernel modules. These do not require any kernel module parameters to be checked. Currently, kernel params is a required field. Make this as optional. Could make this as <Option>, but making this a slice instead, as a module could have multiple kernel params. Refactor the function that checks are for kernel modules into two with one specifically checking if the module is loaded and other checking for module parameters. Refactor some of the tests to take into account these changes. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-06-09 14:10:31 -07:00
Aurélien Bombo	9f7a45996c	gha: Add `rootfs-initrd-mariner` build target This adds the Mariner guest image build target to the list of assets as preparation for #6839. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-06-09 11:36:42 -07:00
Aurélien Bombo	f28a62164a	gha: Add `cloud-hypervisor-glibc` build target This adds the glibc flavor of CLH to the list of assets as preparation for #6839. Mariner Kata is only tested with glibc. Fixes: #7026 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-06-09 11:35:50 -07:00
Fabiano Fidêncio	b50f62ce48	Merge pull request #6756 from arronwy/measured_rootfs Port Measured rootfs feature from CCv0 branch to main	2023-06-09 12:35:05 +02:00
Helin Guo	8fb7ab7518	dragonball: introduce virtio-balloon device We introduce virtio-balloon device to support memory resize. virtio-balloon device could reclaim memory from guest to host. Fixes: #6719 Signed-off-by: Helin Guo <helinguo@linux.alibaba.com>	2023-06-09 17:47:27 +08:00
Helin Guo	7ed9494973	dragonball: introduce virtio-mem device We introduce virtio-mem device to support memory resize. virtio-mem device could hot-plug more memory blocks to guest and could also hot-unplug them from guest. Fixes: #6719 Signed-off-by: Helin Guo <helinguo@linux.alibaba.com>	2023-06-09 17:47:21 +08:00
Chao Wu	c7c45626c9	Merge pull request #6973 from Apokleos/direct-vol add support direct volume and refactor device manager	2023-06-09 11:29:00 +08:00
alex.lyn	776a15e092	runtime-rs: add support direct volume. As block/direct volume use similar steps of device adding, so making full use of block volume code is a better way to handle direct volume. the only different point is that direct volume will use DirectVolume and get_volume_mount_info to parse mountinfo.json from the direct volume path. That's to say, direct volume needs the help of `kata-ctl direct-volume ...`. Details seen at Advanced Topics: [How to run Kata Containers with kinds of Block Volumes] docs/how-to/how-to-run-kata-containers-with-kinds-of-Block-Volumes.md Fixes: #5656 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-06-09 08:16:26 +08:00
Helin Guo	a8e0f51c52	dragonball: extend DeviceOpContext In order to support virtio-mem and virtio-balloon devices, we need to extend DeviceOpContext with VmConfigInfo and InstanceInfo. Fixes: #6719 Signed-off-by: Helin Guo <helinguo@linux.alibaba.com>	2023-06-08 22:04:31 +08:00
alex.lyn	abae114046	runtime-rs: refactor device manager implementation The key aspects of the DM implementation refactoring as below: 1. reduce duplicated code Many scenarios have similar steps when adding devices. so to reduce duplicated code, we should create a common method abstracted and use it in various scenarios. do_handle_device: (1) new_device with DeviceConfig and return device_id; (2) try_add_device with device_id and do really add device; (3) return device info of device's info; 2. return full info of Device Trait get_device_info replace the original type DeviceConfig with full info DeviceType. 3. refactor find_device method. Fixes: #5656 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-06-08 08:47:08 +08:00
Fabiano Fidêncio	08d10d38be	Merge pull request #7048 from sprt/sprt/fix-gha gha: Fix gha-run.sh and unbreak CI	2023-06-07 23:40:02 +02:00
James O. D. Hunt	452f286552	Merge pull request #6764 from byron-marohn/fix_5401 kata-ctl: Switch to slog logging; add --log-level and --json-logging arguments	2023-06-07 16:08:53 +01:00
Fuu	210a15794c	dragonball: avoid obtaining lock twice in create_stdio_console Fixes #7055 Signed-off-by: Fuu <fuu-open@linux.alibaba.com>	2023-06-07 16:12:22 +08:00
Aurélien Bombo	69668ce87f	tests: gha-run: Use correct env variable for repo s/DOCKER_IMAGE/DOCKER_REPO Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-06-06 11:54:43 -07:00
Aurélien Bombo	f487199edf	gha: aks: Fix argument in call to gha-run.sh Fixes: #7047 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-06-06 11:51:18 -07:00
GabyCT	5ad8aaf9df	Merge pull request #7035 from GabyCT/topic/logparserdoc log-parser: Update log parser link at README	2023-06-06 12:02:25 -06:00
Fabiano Fidêncio	de2e507483	Merge pull request #6972 from sprt/sprt/gha-run-script gha: aks: Extract `run` commands to a script	2023-06-06 14:54:03 +02:00
Wang, Arron	f6afae9c73	packaging: Add rootfs-image-tdx-tarball target Add rootfs-image-tdx target: ./tools/packaging/kata-deploy/local-build/kata-deploy-binaries.sh --build=rootfs-image-tdx ./opt/kata/share/kata-containers/kata-containers-tdx.img ./opt/kata/share/kata-containers/kata-ubuntu-latest-tdx.image Fixes: #6674 Signed-off-by: Wang, Arron <arron.wang@intel.com>	2023-06-06 12:34:20 +02:00
Wang, Arron	f62b2670c0	config: Add root hash value and measure config to kernel params After we have a guest kernel with builtin initramfs which provide the rootfs measurement capability and Kata rootfs image with hash device, we need set related root hash value and measure config to the kernel params in kata configuration file. Fixes: #6674 Signed-off-by: Wang, Arron <arron.wang@intel.com>	2023-06-06 12:34:13 +02:00
Wang, Arron	0080588075	kernel: Integrate initramfs into Guest kernel Integrate initramfs into guest kernel as one binary, which will be measured by the firmware together. Fixes: #6674 Signed-off-by: Wang, Arron <arron.wang@intel.com>	2023-06-06 12:33:41 +02:00
Wang, Arron	28b2645624	initramfs: Add build script to generate initramfs The init.sh in initramfs will parse the verity scheme, roothash, root device and setup the root device accordingly. Fixes: #6674 Signed-off-by: Wang, Arron <arron.wang@intel.com>	2023-06-06 12:33:28 +02:00
Wang, Arron	5cb02a8067	image-build: generate root hash as an separate partition for rootfs Generate rootfs hash data during creating the kata rootfs, current kata image only have one partition, we add another partition as hash device to save hash data of rootfs data blocks. Fixes: #6674 Signed-off-by: Wang, Arron <arron.wang@intel.com>	2023-06-06 12:31:14 +02:00
Arron Wang	31c0ad2076	packaging: Add cryptsetup support in Guest kernel and rootfs Add required kernel config for dm-crypt/dm-integrity/dm-verity and related crypto config. Add userspace command line tools for disk encryption support and ext4 file system utilities. Fixes: #6674 Signed-off-by: Arron Wang <arron.wang@intel.com>	2023-06-06 12:30:07 +02:00
Fabiano Fidêncio	eb1bfa922b	Merge pull request #6980 from nubificus/feat_sharefs_files runtime-rs: handle copy files when share_fs is not available	2023-06-06 12:26:55 +02:00
Chao Wu	b0c6cd05a2	Merge pull request #7033 from openanolis/fix-agent-ctl agent-ctl: fix the compile error	2023-06-06 11:55:15 +08:00
Gabriela Cervantes	980d084f47	log-parser: Update log parser link at README This PR updates the link to the correspondent Developer Guide at the enabling full containerd debug that we have for kata 2.0 documentation. Fixes #7034 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-06-05 15:59:52 +00:00
Yushuo	410bc18143	agent-ctl: fix the compile error When the version of libc is upgraded to 0.2.145, older getrandom could not adapt to the new API, and this makes agent-ctl fail to compile. We upgrade the version of `rand`, so the old version of getrandom is no longer needed. Fixes: #7032 Signed-off-by: Yushuo <y-shuo@linux.alibaba.com>	2023-06-05 21:48:36 +08:00
Jayant Singh	77519fd120	kata-ctl: Switch to slog logging; add --log-level, --json-logging args Fixes: #5401, #6654 - Switch kata-ctl from eprintln!()/println!() to structured logging via the logging library which uses slog. - Adds a new create_term_logger() library call which enables printing log messages to the terminal via a less verbose / more human readable terminal format with colors. - Adds --log-level argument to select the minimum log level of printed messages. - Adds --json-logging argument to switch to logging in JSON format. Co-authored-by: Byron Marohn <byron.marohn@intel.com> Co-authored-by: Luke Phillips <lucas.phillips@intel.com> Signed-off-by: Jayant Singh <jayant.singh@intel.com> Signed-off-by: Byron Marohn <byron.marohn@intel.com> Signed-off-by: Luke Phillips <lucas.phillips@intel.com> Signed-off-by: Kelby Madal-Hellmuth <kelby.madal-hellmuth@intel.com> Signed-off-by: Liz Lawrens <liz.lawrens@intel.com>	2023-06-02 20:13:22 +00:00
Aurélien Bombo	aab6030962	gha: aks: Extract `run` commands to a script GitHub Actions reads and runs workflow files from the main branch, rather than from the PR branch. This means that PRs that modify workflow files aren't being tested with the updated workflows coming from the PR, but rather with the old workflows from the main branch. AFAIK, this behavior isn't avoidable for workflow files (but is for other scripts). This makes it very hard to reliably test workflow changes before they're actually merged into main and leads to issues that we have to hotfix (see #6983, #6995). This PR aims to mitigate that by extracting the commands used in workflows to a separate script file. The way our CI is set up, those script files are read from the PR branch and thus changes would be reflected in the CI checks. Fixes: #6971 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-06-02 10:22:35 -07:00
Fupan Li	465f5a5ced	Merge pull request #4748 from lifupan/main_fix agent: fix the issue of exec hang with a background process	2023-06-02 10:46:43 +08:00
Chao Wu	2128fa2b4e	Merge pull request #7013 from xuejun-xj/xuejun/bugfix runtime-rs: bugfix: update Cargo.lock	2023-06-02 10:08:27 +08:00
Anastassios Nanos	e4eb664d27	runtime-rs: update rust to 1.69.0 We are probably hitting this: https://github.com/rust-lang/rust/issues/63033 Seems like it is worth a try to upgrade to 1.69.0 Signed-off-by: Anastassios Nanos <ananos@nubificus.co.uk>	2023-06-01 21:40:56 +00:00
Anastassios Nanos	ed37715e05	runtime-rs: handle copy files when share_fs is not available In hypervisors that do not support virtiofs we have to copy files in the VM sandbox to properly setup the network (resolv.conf, hosts, and hostname). To do that, we construct the volume as before, with the addition of an extra variable that designates the path where the file will reside in the sandbox. In this case, we issue a `copy_file` agent request and we patch the spec to account for this change. Fixes: #6978 Signed-off-by: Anastassios Nanos <ananos@nubificus.co.uk> Signed-off-by: George Pyrros <gpyrros@nubificus.co.uk>	2023-06-01 21:40:56 +00:00
Fabiano Fidêncio	18b1a019d4	Merge pull request #7011 from jepio/fix-aks-cluster-name gha: aks: Use short SHA in cluster name	2023-06-01 15:56:20 +02:00
Fabiano Fidêncio	5ab42d87fb	Merge pull request #7009 from fidencio/topic/display-badge-for-the-publish-artefacts-job README: Display badge for the "Publish Artefacts" job and update the Kata Containers logo	2023-06-01 15:13:41 +02:00
Fabiano Fidêncio	eb1f44f111	Merge pull request #7007 from fidencio/topic/try-to-fix-ubuntu-k8s-key-not-available kata-deploy: Change how we get the Ubuntu k8s key	2023-06-01 15:13:22 +02:00
xuejun-xj	5f6fc3ed76	runtime-rs: bugfix: update Cargo.lock When dragonball update dbs-boot crate in commit `64c764c147`, the Cargo.lock in runtime-rs should also be updated. Fixes: #6969 Signed-off-by: xuejun-xj <jiyunxue@linux.alibaba.com>	2023-06-01 20:25:35 +08:00
Jeremi Piotrowski	1c6d22c803	gha: aks: Use short SHA in cluster name Full SHA is 40 characters, while AKS cluster name has a limit of 63. Trim the SHA to 12 characters, which is widely considered to be unique enough and is short enough to be used in the cluster name Fixes: #7010 Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-06-01 14:03:53 +02:00
Fabiano Fidêncio	3c1f6d36dc	readme: Update Kata Containers logo Let's use the horizontal logo, as it makes better use of the space that we have. The logo comes from: https://openinfra.dev/brand/logos Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-06-01 12:25:13 +02:00
Fabiano Fidêncio	3886841131	readme: Add status badge for the "Publish Artefacts" job Let's start adding the status of our jobs as part of our main page, so folks monitoring those can easily check whether they're okay, or if someone has to be pinged about those. Fixes: #7008 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-06-01 12:25:01 +02:00
Fabiano Fidêncio	26f7520387	kata-deploy: Change how we get the Ubuntu k8s key The current method has been failing every now and then, and was reported on https://github.com/kubernetes/release/issues/2862. Ding poked me and suggested to do this change here, so here we go. :-) Fixes: #7006 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-06-01 12:10:30 +02:00
Fabiano Fidêncio	9ec2bca101	Merge pull request #7002 from fidencio/topic/follow-up-on-7000 gha: aks: Ensure host_os is used everywhere needed	2023-06-01 08:51:27 +02:00
Fabiano Fidêncio	8cbb80da66	Merge pull request #6929 from LindaYu17/dev kubernetes: add agnhost command in pod yaml	2023-06-01 08:39:58 +02:00
Fabiano Fidêncio	aebd3b47d9	gha: aks: Ensure host_os is used everywhere needed We added that to create the cluster name, but I forgot to add that to the part we get the k8s config file, or to the part where we delete the AKS cluster. Fixes: #6999 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-31 20:50:55 +02:00
Fabiano Fidêncio	e01f75723a	Merge pull request #6997 from singhwang/main main \| release: Standardize kata static file name	2023-05-31 15:22:30 +02:00
Fabiano Fidêncio	1ed917a079	Merge pull request #6989 from BbolroC/configurable-build-registry packaging: make BUILDER_REGISTRY configurable	2023-05-31 15:18:51 +02:00
Fabiano Fidêncio	de22783124	Merge pull request #7000 from fidencio/topic/use-a-different-name-for-the-ubuntu-and-mariner-aks-clusters gha: aks: Add the host_os as part of the aks cluster's name	2023-05-31 15:18:17 +02:00
Archana Shinde	141c26f307	Merge pull request #6985 from amshinde/kernel-tdx-build kernel: Modify build-kernel.sh to accommodate for changes in version.yaml	2023-05-31 01:57:20 -07:00
Fabiano Fidêncio	0c8282c224	gha: aks: Add the host_os as part of the aks cluster's name We need to do so, otherwise we'll create two clusters for testing Cloud Hypervisor with exactly the same name, one using Ubuntu, and one using Mariner. Fixes: #6999 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-31 05:20:04 +02:00
SinghWang	4b89a6bdac	release: Standardize kata static file name The string representing the architecture aarch64 and x86_64 need to be changed to arm64 and amd64 for the release. Fixes: #6986 Signed-off-by: SinghWang <wangxin_0611@126.com>	2023-05-31 10:24:45 +08:00
Fabiano Fidêncio	51e42a9972	Merge pull request #6995 from sprt/sprt/fix-mariner-ci gha: Fix Mariner cluster creation	2023-05-31 00:23:36 +02:00
Archana Shinde	9228815ad2	kernel: Modify build-kernel.sh to accommodate for changes in version.yaml There were recent changes for the tdx kernel in the version.yaml that are not currently accounted for in the build-kernel.sh script. Attempts to setup a tdx kernel to build local changes seemed to not download the tdx kernel. Instead the mainline kernel is downloaded which has no tdx-related changes. The version.yaml has a new entry for tdx kernel. Use that instead for setting up and downloading the tdx kernel. Fixes: #6984 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-05-30 13:44:58 -07:00
Aurélien Bombo	03027a7399	gha: Fix Mariner cluster creation While the Mariner Kata host is in preview, we need the `aks-preview` extension to enable the `--workload-runtime KataMshvVmIsolation` flag. Fixes: #6994 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-05-30 13:26:49 -07:00
Hyounggyu Choi	43e73bdef7	packaging: make BUILDER_REGISTRY configurable This PR is to make an environment variable `BUILDER_REGISTRY` configurable so that those who want to use their own registry for build can set up the registry. Fixes: #6988 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2023-05-30 14:40:02 +02:00
Fabiano Fidêncio	2e2d7243d2	Merge pull request #6983 from sprt/sprt/fix-gha-ci gha: Unbreak CI and fix cluster creation step	2023-05-30 12:58:10 +02:00
Zhongtao Hu	8b6cb2cd75	Merge pull request #6806 from xuejun-xj/xuejun/vcpuhotplug Dragonball: support vcpu hotplug on aarch64	2023-05-30 18:47:50 +08:00
xuejun-xj	ffe3157a46	dragonball: add arm64 patches for upcall The vcpu hotplug/hotunplug feature is implemented with upcall. This commit adds three patches to support the feature on aarch64. Patches: > 0005: add support of upcall on aarch64 > 0006: skip activate offline cpus' MSI interrupt > 0007: set the correct boot cpu number Fixes: #6010 Signed-off-by: xuejun-xj <jiyunxue@linux.alibaba.com>	2023-05-30 15:51:08 +08:00
xuejun-xj	560442e6ed	dragonball: add vcpu_boot_onlined vector This commit implements the vcpu_boot_onlined vector in get_fdt_vm_info. "boot_enabled" means whether this vcpu should be onlined at first boot. It will be used by fdt, which writes an attribute called boot_enabled, and will be handled by guest kernel to pass the correct cpu number to function "bringup_nonboot_cpus". Fixes: #6010 Signed-off-by: xuejun-xj <jiyunxue@linux.alibaba.com>	2023-05-30 15:51:08 +08:00
xuejun-xj	e31772cfea	dragonball: add support resize_vcpu on aarch64 This commit adds support of resize_vcpu on aarch64. As kvm will check whether vgic is initialized when calling KVM_CREATE_VCPU ioctl, all the vcpu fds should be created before vm is booted. To support resizing vcpu scenario, we use max_vcpu_count for create_vcpus and setup_interrupt_controller interfaces. The SetVmConfiguration API will ensure max_vcpu_count >= boot_vcpu_count. Fixes: #6010 Signed-off-by: xuejun-xj <jiyunxue@linux.alibaba.com>	2023-05-30 15:51:08 +08:00
xuejun-xj	64c764c147	dragonball: update dbs-boot to v0.4.0 dbs-boot-v0.4.0 refactors the create_fdt interface. It simplifies the parameters needed to be passed and abstracts them into three structs. By the way, it also reserves some interfaces for future features: numa passthrough and cache passthrough. Fixes: #6969 Signed-off-by: xuejun-xj <jiyunxue@linux.alibaba.com>	2023-05-30 15:51:08 +08:00
xuejun-xj	fd9b414646	dragonball: update comment for init_microvm Rewrite the comment of Vm::init_microvm method for aarch64. Fixes cargo test warnings on aarch64. Fixes: #6969 Signed-off-by: xuejun-xj <jiyunxue@linux.alibaba.com>	2023-05-30 15:51:08 +08:00
Aurélien Bombo	af16d3fca4	gha: Unbreak CI and fix cluster creation step This fixes the regression introduced by #6686 by properly injecting the `--os-sku mariner --workload-runtime KataMshvVmIsolation` flags. Error reference: https://github.com/kata-containers/kata-containers/actions/runs/5111460297/jobs/9188819103 Fixes: #6982 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-05-29 13:32:47 -07:00
Zhongtao Hu	099b4b0d0e	Merge pull request #6598 from Apokleos/sandbox_bind_mounts runtime-rs/sandbox_bindmounts: add support for sandbox bindmounts	2023-05-28 12:00:39 +08:00
Zhongtao Hu	cb962b0dc9	Merge pull request #6702 from Apokleos/directvol-common runtime-rs/kata-ctl: Enhancement of DirectVolumeMount.	2023-05-28 12:00:12 +08:00
Fabiano Fidêncio	44546a4a57	Merge pull request #6686 from sprt/sprt/mariner-ci gha: Create Mariner host as part of k8s tests	2023-05-27 05:34:28 +02:00
alex.lyn	5ddc4f94c5	runtime-rs/kata-ctl: Enhancement of DirectVolumeMount. Move the get_volume_mount_info to kata-types/src/mount.rs. If so, it becomes a common method of DirectVolumeMountInfo and reduces duplicated code. Fixes: #6701 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-05-26 11:18:29 +08:00
Fupan Li	25d2fb0fde	agent: fix the issue of exec hang with a background process When running an exec process in the background without a tty, the exec will hang and never terminate. For example: crictl -i <container id> sh -c 'nohup tail -f /dev/null &' Fixes: #4747 Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>	2023-05-26 10:56:46 +08:00
Tim Zhang	5231aff90f	Merge pull request #6860 from lifupan/main netlink: Fix the issue of update_interface	2023-05-26 10:54:07 +08:00
Aurélien Bombo	4af4ced1aa	gha: Create Mariner host as part of k8s tests The current testing setup only supports running Kata on top of an Ubuntu host. This adds Mariner to the matrix of testable hosts for k8s tests, with Cloud Hypervisor as a VMM. As preparation for the upcoming PR that will change only the actual test code (rather than workflow YAMLs), this also introduces a new file `setup.sh` that will be used to set host-specific parameters at test run-time. Fixes: #6961 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-05-25 14:29:46 -07:00
Fabiano Fidêncio	59cefa719c	Merge pull request #6965 from fidencio/topic/gha-increase-aks-creation-waiting-time gha: Increase timeout for AKS jobs and give more time to start running the tests	2023-05-25 17:23:17 +02:00
Greg Kurz	837f7a2fe6	Merge pull request #6959 from beraldoleal/issues/6757 runtime: sending SIGKILL to qemu	2023-05-25 16:24:37 +02:00
alex.lyn	eee7aae71d	runtime-rs/sandbox_bindmounts: add support for sandbox bindmounts sandbox_bind_mounts supports kinds of mount patterns, for example: (1) "/path/to", default readonly mode. (2) "/path/to:ro", same as (1). (3) "/path/to:rw", readwrite mode. Both support configuration and annotation: (1)[runtime] sandbox_bind_mounts=["/path/to", "/path/to:rw", "/mnt/to:ro"] (2) annotation will also be supported, restricted as below: io.katacontainers.config.runtime.sandbox_bind_mounts = "/path/to /path/to:rw /mnt/to:ro" Fixes: #6597 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-05-25 20:00:25 +08:00
Fupan Li	62b2838962	Merge pull request #6846 from ZhangShuaiyi/DeviceMgrMethod dragonball: convert BlockDeviceMgr and VirtioNetDeviceMgr functions to methods	2023-05-25 18:11:44 +08:00
QuanweiZhou	377b7735f5	Merge pull request #6872 from justxuewei/rm-virtio-devices dragonball: Remove virtio-net and vsock devices gracefully	2023-05-25 17:08:36 +08:00
Fabiano Fidêncio	3d5d6eb361	Merge pull request #6958 from fidencio/topic/kata-deploy-improve-backup-restore kata-deploy: Improve shim backup / restore	2023-05-25 10:54:06 +02:00
Fabiano Fidêncio	3f0735a7e8	Merge pull request #6952 from stevenhorsman/git-clone-doc-fix doc: Update git commands	2023-05-25 10:36:08 +02:00
Fabiano Fidêncio	557b840814	gha: aks: Wait longer to start running the tests We're still facing issues related to the time taken to deploy the kata-deploy daemonset and starting to run the tests. Ideally, we should solve this with a readiness probe, and that's the approach we want to take in the future. However, for now, let's just make sure those tests are not in the way of the community. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-25 10:13:19 +02:00
Fabiano Fidêncio	c04c872c42	gha: aks: Increase the timeout time We've seen tests being aborted close to the end of the run due to the timeout. Let's increase it, to avoid hitting such cases again. Fixes: #6964 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-25 10:13:08 +02:00
GabyCT	8d98484230	Merge pull request #6926 from GabyCT/topic/fixtabsmerge kata-deploy: Fix indentation on kata deploy merge script	2023-05-24 14:55:51 -06:00
Fabiano Fidêncio	428041624a	kata-deploy: Improve shim backup / restore We're currently backing up and restoring all the possible shim files except the default one ("containerd-shim-kata-v2"). Let's ensure this is also backed up and restored. Fixes: #6957 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-24 18:39:27 +02:00
Gabriela Cervantes	14c3f1e9f5	kata-deploy: Fix indentation on kata deploy merge script This PR fixes the indentation on the kata deploy merge script, which uses a tab instead of single spaces. Fixes #6925 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-05-24 16:01:10 +00:00
Beraldo Leal	0e47cfc4c7	runtime: sending SIGKILL to qemu There is a race condition when virtiofsd is killed without finishing all the clients. Because of that, when a pod is stopped, QEMU detects virtiofsd is gone, which is legitimate. Sending a SIGTERM first before killing could introduce some latency during the shutdown. Fixes #6757. Signed-off-by: Beraldo Leal <bleal@redhat.com>	2023-05-24 11:31:28 -04:00
stevenhorsman	6a0035e419	doc: Update git commands Fix bad migrations from `go get` to `git clone` and update the cloned directory path Fixes: #6951 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2023-05-24 13:16:48 +01:00
Fabiano Fidêncio	7c9faab523	Merge pull request #6947 from fidencio/topic/gha-release-fix-payload-tagging gha: release: Simplify the process for tagging the payload	2023-05-24 11:22:09 +02:00
Fabiano Fidêncio	f636c1f8a4	gha: release: Simplify the process for tagging the payload We previously were doing: * Create a new image on kata-deploy-ci using the commit hash of the latest tag * This was used to test on AKS, which is no longer needed as we test on AKS on every PR * Create a new image on kata-deploy using the release tag and "latest" or "stable", by tagging the kata-deploy-ci image accordingly As part of `cfe63527c5`, we broke the workflow described above, as in the first step we would save the PKG_SHA to be used in the second step, but that part ended up being removed. Anyway, this back and forth is not needed anymore and we can simplify the process by doing: * Create a new image on kata-deploy, using: - The tag received as ref from the event that triggered this workflow - "latest" or "stable" tag, depending on whether it's a stable release or not Fixes: #6946 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-24 08:54:43 +02:00
Fabiano Fidêncio	01827911f4	Merge pull request #6943 from fidencio/topic/gha-login-dont-specify-the-registry-if-using-docker-io gha: release: login-action: Don't specify docker.io registry	2023-05-24 07:33:12 +02:00
Fabiano Fidêncio	1c9ad4435a	Merge pull request #6939 from GabyCT/topic/updatenydus versions: Update nydus version to 2.2.1	2023-05-24 00:12:57 +02:00
Fabiano Fidêncio	d10c9be603	gha: release: login-action: Don't specify docker.io registry For some bizarre reason, the login-action will simply fail to authenticate to docker.io if it's specified as a registry. The way to proceed, instead, is to not specify any registry, as docker.io would then be used by default. Fixes: #6943 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-23 22:38:12 +02:00
Fabiano Fidêncio	9aae333343	Merge pull request #6871 from kmjohansen/bugfix/ptmx runtime: make debug console work with sandbox_cgroup_only	2023-05-23 22:24:51 +02:00
Fabiano Fidêncio	df77fefce8	Merge pull request #6941 from fidencio/3.2.0-alpha3-branch-bump # Kata Containers 3.2.0-alpha3	2023-05-23 22:21:03 +02:00
Fabiano Fidêncio	c54363114d	release: Kata Containers 3.2.0-alpha3 - release: Fix `docker/login-action` version `f3702268d` release: Fix `docker/login-action` version Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-23 18:39:16 +02:00
Fabiano Fidêncio	c7a77f980b	Merge pull request #6935 from fidencio/topic/release-fix-docker-login-action-version release: Fix `docker/login-action` version	2023-05-23 18:35:03 +02:00
Gabriela Cervantes	0b1c5ea5bb	versions: Update nydus version to 2.2.1 This PR updates the nydus version to 2.2.1. This change includes: nydus-image: fix a underflow issue in get_compressed_size() backport fix/feature to stable 2.2 [backport] contrib: upgrade runc to v1.1.5 service: add README for nydus-service nydus: fix a possible panic caused by SubCmdArgs::is_present Backports two bugfixes from master into stable/v2.2 [backport stable/v2.2] action: upgrade golangci-lint to v1.51.2 [backport] action: fix smoke test for branch pattern Fixes #6938 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-05-23 15:39:04 +00:00
Fabiano Fidêncio	f3702268d1	release: Fix `docker/login-action` version `docker/login-action@v3` does not exist and `docker/login-action@v2` should be used instead. Fixes: #6934 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-23 14:11:03 +02:00
Fabiano Fidêncio	c82ac57e30	Merge pull request #6930 from fidencio/3.2.0-alpha2-branch-bump # Kata Containers 3.2.0-alpha2	2023-05-23 13:50:58 +02:00
Linda Yu	433b5add4a	kubernetes: add agnhost command in pod yaml Fixes: #6928 Signed-off-by: Linda Yu <linda.yu@intel.com>	2023-05-23 18:11:45 +08:00
Fupan Li	170336517f	Merge pull request #5441 from openanolis/device_manager_dev runtime-rs: device manager for runtime-rs	2023-05-23 16:50:07 +08:00
Fabiano Fidêncio	fc09d0f5dd	release: Kata Containers 3.2.0-alpha2 - Fix cache for OVMF and rootfs-initrd (both x86_64) - Upgrade to Cloud Hypervisor v32.0 - osbuilder: Bump fedora image version - local-build: Standardise what's set for the local build scripts - gha: aks: Wait a little bit more before run the tests - docs: Update container network model url - gha: release: Fix s390x worklow - cache: Fix OVMF caching - gha: payload-after-push: Pass secrets down - tools: Fix arch bug `22154e0a3` cache: Fix OVMF tarball name for different flavours `b7341cd96` cache: Use "initrd" as `initrd_type` to build rootfs-initrd `b8ffcd1b9` osbuilder: Bump fedora image version `636539bf0` kata-deploy: Use apt-key.gpg from k8s.io `ae24dc73c` local-build: Standardise what's set for the local build scripts `35c3d7b4b` runtime: clh: Re-generate the client code `cfee99c57` versions: Upgrade to Cloud Hypervisor v32.0 `ad324adf1` gha: aks: Wait a little bit more before run the tests `191b6dd9d` gha: release: Fix s390x worklow `cfd8f4ff7` gha: payload-after-push: Pass secrets down `75330ab3f` cache: Fix OVMF caching `a89b44aab` tools: Fix arch bug `11a34a72e` docs: Update container network model url Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-23 09:06:44 +02:00
Fabiano Fidêncio	160d9aae4d	Merge pull request #6918 from fidencio/topic/fix-cache-x86_64-ovmf-rootfs-initrd Fix cache for OVMF and rootfs-initrd (both x86_64)	2023-05-22 21:34:56 +02:00
Zhongtao Hu	4719802c8d	runtime-rs: add virtio-blk-mmio add virtio-blk-mmio option for dragonball Fixes:#5375 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com> Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-05-23 00:58:10 +08:00
Zhongtao Hu	f9bded4484	runtime-rs: add devicetype enum Use the device type to store the config information for different kinds of devices. Fixes: #5375 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com> Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-05-23 00:55:35 +08:00
Zhongtao Hu	6800d30fdb	runtime-rs: remove device Support remove device after container stop Fixes:#5375 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com> Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-05-23 00:54:22 +08:00
Zhongtao Hu	f16012a1eb	runtime-rs: support linux device support linux device in runtime-rs Fixes:#5375 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com> Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-05-23 00:54:13 +08:00
Zhongtao Hu	fe9ec67644	runtime-rs: block volume support block volume in runtime-rs Fixes: #5375 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com> Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-05-23 00:54:04 +08:00
Zhongtao Hu	a8bfac90b1	runtime-rs: support block rootfs support devmapper for block rootfs Fixes: #5375 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com> Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-05-23 00:53:30 +08:00
Zhongtao Hu	b076d46db3	agent: handle hotplug virtio-mmio device As dragonball supports hotplugging virtio-mmio devices, we should handle it in the agent. Fixes: #5375 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com> Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-05-23 00:53:22 +08:00
Zhongtao Hu	6e273d6ccc	runtime-rs: implement trait for vhost-user device add the trait implementation for vhost-user device Fixes:#5375 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2023-05-23 00:53:16 +08:00
Zhongtao Hu	cc9c915384	runtime-rs: implement trait for vfio device Add the trait implementation for vfio device. Fixes: #5375 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com> Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-05-23 00:53:10 +08:00
Archana Shinde	2c9efbe04c	Merge pull request #6907 from likebreath/0519/clh_v32.0 Upgrade to Cloud Hypervisor v32.0	2023-05-22 09:53:05 -07:00
Zhongtao Hu	e4c5c74a75	runtime-rs: device manager Support device manager for runtime-rs, add block device handler for device manager Fixes:#5375 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com> Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-05-23 00:53:04 +08:00
Fabiano Fidêncio	22154e0a3b	cache: Fix OVMF tarball name for different flavours `75330ab3f9` tried to fix OVMF caching, but didn't consider that the "vanilla" OVMF tarball name is not "kata-static-ovmf-x86_64.tar.xz", but rather "kata-static-ovmf.tar.xz". Missing that led to the cache builds of OVMF failing, and to the need to build the component on every single PR. Fixes: #6917 (hopefully for good this time). Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-22 18:12:30 +02:00
Fabiano Fidêncio	b7341cd968	cache: Use "initrd" as `initrd_type` to build rootfs-initrd We've been defaulting to "", which would lead to a mismatch with the latest version from the cache, causing a miss, and finally having to build the rootfs-initrd as part of the tests, every single time. Fixes: #6917 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-22 18:12:30 +02:00
Fabiano Fidêncio	a28cefd538	Merge pull request #6924 from stevenhorsman/fedora-bump osbuilder: Bump fedora image version	2023-05-22 18:10:57 +02:00
Fabiano Fidêncio	7f350d3ec6	Merge pull request #6913 from fidencio/topic/gha-build-and-upload-payload-can-silently-fail local-build: Standardise what's set for the local build scripts	2023-05-22 18:04:51 +02:00
stevenhorsman	b8ffcd1b9b	osbuilder: Bump fedora image version - Swap out an EoL fedora image for the latest Fixes: #6923 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2023-05-22 13:48:00 +01:00
Fabiano Fidêncio	636539bf0c	kata-deploy: Use apt-key.gpg from k8s.io We're facing some issues downloading / using the public key provided by Google for installing Kubernetes as part of the kata-deploy image. ``` The following signatures couldn't be verified because the public key is not available: NO_PUBKEY B53DC80D13EDEF05 Reading package lists... Done W: GPG error: https://packages.cloud.google.com/apt kubernetes-xenial InRelease: The following signatures couldn't be verified because the public key is not available: NO_PUBKEY B53DC80D13EDEF05 E: The repository 'https://apt.kubernetes.io kubernetes-xenial InRelease' is not signed. N: Updating from such a repository can't be done securely, and is therefore disabled by default. N: See apt-secure(8) manpage for repository creation and user configuration details. ``` Let's work around this by following the suggestion made by @dims, at: https://github.com/kubernetes/k8s.io/pull/4837#issuecomment-1446426585 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-22 11:06:01 +02:00
Fabiano Fidêncio	ae24dc73c1	local-build: Standardise what's set for the local build scripts There's a discrepancy in what's set across the scripts used to build the Kata Containers artefacts locally. Some of those were missing a way to easily debug them in case a failure happens, but one specific one (build-and-upload-payload.sh) could actually silently fail. All of those have been changed as part of this commit. Fixes: #6908 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-22 08:36:01 +02:00
Steve Horsman	a2e69c5b66	Merge pull request #6906 from fidencio/topic/gh-aks-wait-a-little-more-before-start-the-tests gha: aks: Wait a little bit more before run the tests	2023-05-20 08:01:20 +01:00
GabyCT	6796af511b	Merge pull request #6890 from GabyCT/topic/fixurlvirt docs: Update container network model url	2023-05-19 15:10:26 -06:00
Bo Chen	35c3d7b4bc	runtime: clh: Re-generate the client code This patch re-generates the client code for Cloud Hypervisor v32.0. Note: The client code of cloud-hypervisor's OpenAPI is automatically generated by openapi-generator. Fixes: #6632 Signed-off-by: Bo Chen <chen.bo@intel.com>	2023-05-19 12:49:45 -07:00
Bo Chen	cfee99c577	versions: Upgrade to Cloud Hypervisor v32.0 Details of this release can be found in our roadmap project as iteration v32.0: https://github.com/orgs/cloud-hypervisor/projects/6. Fixes: #6682 Signed-off-by: Bo Chen <chen.bo@intel.com>	2023-05-19 12:11:13 -07:00
Steve Horsman	98fa436627	Merge pull request #6904 from fidencio/topic/gha-fix-s390x-release-workflow gha: release: Fix s390x worklow	2023-05-19 19:00:57 +01:00
Steve Horsman	d5355dee20	Merge pull request #6898 from fidencio/topic/fix-ovmf-caching cache: Fix OVMF caching	2023-05-19 18:24:51 +01:00
Fabiano Fidêncio	dfa9301eac	Merge pull request #6900 from fidencio/topic/gha-fix-payload-after-push gha: payload-after-push: Pass secrets down	2023-05-19 17:23:00 +02:00
Fabiano Fidêncio	ad324adf1d	gha: aks: Wait a little bit more before run the tests `fa832f4709` increased the timeout, which helped a lot, mainly in the TEE machines. However, we're still seeing some failures here and there with the AKS tests. Let's bump it yet again and, hopefully, those errors to start the tests will go away. Fixes: #6905 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-19 16:40:35 +02:00
Fabiano Fidêncio	191b6dd9dd	gha: release: Fix s390x worklow GitHub is warning us that: """ The workflow is not valid. In .github/workflows/release.yaml (Line: 21, Col: 11): Error from called workflow kata-containers/kata-containers/.github/workflows/release-s390x.yaml@d2e92c9ec993f56537044950a4673e50707369b5 (Line: 14, Col: 12): Job 'kata-deploy' depends on unknown job 'create-kata-tarball'. """ This is happening as we need to reference "build-kata-static-tarball-s390x" instead of "create-kata-tarball". Fixes: #6903 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-19 16:21:49 +02:00
Fabiano Fidêncio	cfd8f4ff76	gha: payload-after-push: Pass secrets down The "build-assets-${arch}" jobs need to have access to the secrets in order to log into the container registry in the cases where "push-to-registry", which is used to push the builder containers to quay.io, is set to "yes". Now that "build-assets-${arch}" pass the secrets down, we need to log into the container registry in the "build-kata-static-tarball-${arch}" files, in case "push-to-registry" is set to "yes". Fixes: #6899 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-19 15:00:06 +02:00
Fabiano Fidêncio	7abae8ee9c	Merge pull request #6896 from stevenhorsman/firecracker-arch-case tools: Fix arch bug	2023-05-19 14:26:14 +02:00
Fabiano Fidêncio	75330ab3f9	cache: Fix OVMF caching OVMF has been cached, but it's not been used from cache as the `version` set in the cached builds has always been empty. The reason for that is because we've been trying to look for `externals.ovmf.ovmf.version`, while we should be actually looking for `externals.ovmf.x86_64.version`. Setting `x86_64` as the OVMF_FLAVOUR would cause another bug, as the expected tarball name would then be `kata-static-x86_64.tar.xz`, instead of `kata-static-ovmf-x86_64.tar.xz`. With everything said, let's simplify the OVMF_FLAVOUR usage, by using it as it's passed, and only adapting the tarball name for the TDVF case, which is the abnormal one. Fixes: #6897 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-19 14:00:39 +02:00
Fabiano Fidêncio	d2e92c9ec9	Merge pull request #6892 from fidencio/3.2.0-alpha1-branch-bump # Kata Containers 3.2.0-alpha1	2023-05-19 12:31:33 +02:00
stevenhorsman	a89b44aabf	tools: Fix arch bug Fix mismatched case of `arch` Fixes: #6895 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2023-05-19 09:28:22 +01:00
Fabiano Fidêncio	f527f614c1	release: Kata Containers 3.2.0-alpha1 - runtime: Use static_sandbox_resource_mgmt=true for TEEs - update tokio dependency - resource-control: fix setting CPU affinities on Linux - runtime: use enable_vcpus_pinning from toml - gha: k8s: Make the tests more reliable - gha: Enable SEV-SNP tests on main - gha: tdx: Use the k3s overlay for kata-cleanup - runtime: Port sev package to main - gpu: Rename the last bits from `gpu` to `nvidia-gpu` - deploy: fix shell script error - ppc64le: switch virtiofsd from C to rust version - osbuilder: Fix indentation in rootfs.sh - virtcontainers/qemu_test.go: Improve coverage - agent: Add context to errors that may occur when AgentConfig file is … - virtcontainers/pkg/compatoci/: Improved coverage for for Kata 2.0 - kata-manager: Fix '-o' syntax and logic error - kata-ctl: Add the option to install kata-ctl to a user specified directory - runtime-rs: fix building instructions to use correct required Rust ve… - Dragonball: use LinuxBootConfigurator::write_bootparams - kata-deploy: Add http_proxy as part of the docker build - kata-deploy: Do not ship the kata tarball - kata-deploy: Build improvements - deploy: Fix arch in image tag - Revert "kata-deploy: Use readinessProbe to ensure everything is ready" - virtcontainers: Improved test coverage for fc.go from 4.6% to 18.5% - main \| release: Fix multi-arch publishing is not supported - cache: More fixes to nvidia-gpu kernels caching - runtime: remove overriding ARCH value by default for ppc64le - gha: Fix Body Line Length action flagging empty body commit messages - gha: Fix snap creation workflow - cache: Fix nvidia-gpu version - cache: Update the KERNEL_FLAVOUR list to include nvidia-gpu - packaging: Add SEV-SNP artifacts to main - docs: Mark snap installation method as unmaintained - packaging: Add sev artifacts to main - kata-ctl: add generic kvm check & unit test - Log-parser-rs - warning_fix: fix warnings when build with cargo-1.68.0 - cross-compile: 
Include documentation and configuration for cross-compile - runtime: Fix virtiofs fd leak - gpu: cold plug VFIO devices - pkg/signals: Improved test coverage 60% to 100% - virtcontainers/persist: Improved test coverage 65% to 87.5% - virtcontainers/clh_test.go: improve unit test coverage - virtcontainers/factory: Improved test coverage - gha: Also run k8s tests on qemu-snp - gha: sev: fix for kata-deploy error - gha: Also run k8s tests on qemu-sev - Implement the "kata-ctl env" command - runtime-rs: support keep_abnormal in toml config - gpu: Build and Ship an GPU enabled Kernel - kata-ctl: checks for kvm, kvm_intel modules loaded - osbuilder: Fix D-Bus enabling in the dracut case - snap: fix docker start fail issue - kata-manager: Fix containerd download - agent: Fix ut issue caused by fd double closed - Bump ttrpc to 0.7.2 and protobuf to 3.2.0 - gpu: Add GPU enabled confguration and runtime - gpu: Do not pass-through PCI (Host) Bridges - cache-components: Fix caching of TDVF and QEMU for TDX - gha: tdx: Ensure kata-deploy is removed after the tests run - versions: Upgrade to Cloud Hypervisor v31.0 - osbuilder: Enable dbus in the dracut case - runtime: Don't create socket file in /run/kata - nydus_rootfs/prefetch_files: add prefetch_files for RAFS - runtime-rs/virtio-fs: add support extra handler for cache mode. 
- runtime-rs: enable nerdctl to setup cni plugin - tdx: Add artefacts from the latest TDX tools release into main - runtime: support non-root for clh - gha: ci-on-push: Run k8s tests with dragonball - rustjail: Use CPUWeight with systemd and CgroupsV2 - gha: k8s-on-aks: {create,delete} AKS must be a coded-in step - docs: update the rust version from version.yaml - gha: k8s-on-aks: Set {create,delete}_aks as steps - gha: k8s-on-aks: Fix cluster name - gha: Also run k8s tests on AKS with dragonball - gha: Only push images to registry after merging a PR - gha: aks: Use D4s_v5 instance - tools: Avoid building the kernel twice - rustjail: Fix panic when cgroup manager fails - runtime: add filter metrics with specific names - gha: Use ghcr.io for the k8s CI - GHA \|Switch "kubernetes tests" from jenkins to GitHub actions - docs: Update CNM url in networking document - kata-ctl: add function to get platform protection. `f6e1b1152` agent: update tokio dependency `4cb83dc21` kata-ctl: update tokio dependency `df615ff25` runk: update tokio dependency `ca6892ddb` runtime-rs: update tokio dependency `ca1531fe9` runtime: Use static_sandbox_resource_mgmt=true for TEEs `fa832f470` gha: k8s: Make the tests more reliable `cbb9fe8b8` config: Use standard OVMF with SEV `724437efb` kata-deploy: add kata-qemu-sev runtimeclass `521dad2a4` Tests: skip CPU constraints test on SEV and SNP `72308ddb0` gha: ci-on-push: Don't skip tests for SEV `da0f92cef` gha: ci-on-push: Don't skip tests for SEV-SNP `12f43bea0` gha: tdx: Use the k3s overlay for kata-cleanup `1a3f8fc1a` deploy: fix shell script error `87cb98c01` osbuilder: Fix indentation in rootfs.sh `c5a59caca` ppc64le: switch virtiofsd from C to rust version `bfdf0144a` versions: Bump virtiofsd to 1.6.1 `dd7562522` runtime: pkg/sev: Add kbs utility package for SEV pre-attestation `05de7b260` runtime: Add sev package `3a9d3c72a` gpu: Rename the last bits from `gpu` to `nvidia-gpu` `4cde844f7` local-build: Fix kernel-nvidia-gpu target name 
`593840e07` kata-ctl: Allow INSTALL_PATH= to be specified `bdb75fb21` runtime: use enable_vcpus_pinning from toml `20cb87508` virtcontainers/qemu_test.go: Improve test coverage `b9a1db260` kata-deploy: Add http_proxy as part of the docker build `3e85bf5b1` resource-control: fix setting CPU affinities on Linux `5f3f844a1` runtime-rs: fix building instructions with respect to required Rust version `777c3dc8d` kata-deploy: Do not ship the kata tarball `50cc9c582` tests: Improve coverage for virtcontainers/pkg/compatoci/ for Kata 2.0 `136e2415d` static-build: Download firecracker instead of building it `3bf767cfc` static-build: Adjust ARCH for nydus `ac88d34e0` static-build: Use relased binary for CLH (aarch64) `73913c8eb` kata-manager: Fix '-o' syntax and logic error `2856d3f23` deploy: Fix arch in image tag `e8f81ee93` Revert "kata-deploy: Use readinessProbe to ensure everything is ready" `cfe63527c` release: Fix multi-arch publishing is not supported `197c33651` Dragonball: use LinuxBootConfigurator::write_bootparams to writes the boot parameters into guest memory. 
`4d17ea4a0` cache: Fix nvidia-snp caching version `a133fadbf` cache: Fix nvidia-gpu-tdx-experimental cache URL `b9990c201` cache: Fix nvidia-gpu version `c9bf7808b` cache: Update the KERNEL_FLAVOUR list to include nvidia-gpu `3665b4204` gpu: Rename `gpu` targets to `nvidia-gpu` `2c90cac75` local-build: fixup alphabetization `4da6eb588` kata-deploy: Add qemu-snp shim `14dd05375` kata-deploy: add kata-qemu-snp runtimeclass `0bb37bff7` config: Add SNP configuration `af7f2519b` versions: update SEV kernel description `dbcc3b5cc` local-build: fix default values for OVMF build `b8bbe6325` gha: build OVMF for tests and release `cf0ca265f` local-build: Add x86_64 OVMF target `db095ddeb` cache: add SNP flavor to comments `f4ee00576` gha: Build and ship QEMU for SNP `7a58a91fa` docs: update SNP guide `879333bfc` versions: update SNP QEMU version `38ce4a32a` local-build: add support to build QEMU for SEV-SNP `5f8008b69` kata-ctl: add unit test for kvm check `a085a6d7b` kata-ctl: add generic kvm check `772d4db26` gha: Build and ship SEV initrd `45fa36692` gha: Build and ship SEV OVMF `4770d3064` gha: Build and ship SEV kernel. 
`fb9c1fc36` runtime: Add qemu-sev config `813e4c576` runtimeClasses: add sev runtime class `af18806a8` static-build: Add caching support to sev ovmf `76ae7a3ab` packaging: adding caching capability for kernel `12c5ef902` packaging: add support to build OVMF for SEV `b87820ee8` packaging: add support to build initrd for sev `e1f3b871c` docs: Mark snap installation method as unmaintained `022a33de9` agent: Add context to errors when AgentConfig file is missing `b0e6a094b` packaging: Add sev kernel build capability `a4c0303d8` virtcontainers: Fixed static checks for improved test coverage for fc.go `8495f830b` cross-compile: Include documentation and configuration for cross-compile `13d7f39c7` gpu: Check for VFIO port assignments `6594a9329` tools: made log-parser-rs `03a8cd69c` virtcontainers: Improved test coverage for fc.go from 4.6% to 18.5% `9e2b7ff17` gha: sev: fix for kata-deploy error `5c9246db1` gha: Also run k8s tests on qemu-snp `c57a44436` gha: Add the ability to test qemu-snp `406419289` env: Utilize arch specific functionality to get cpu details `fb40c71a2` env: Check for root privileges `1016bc17b` config: Add api to fetch config from default config path `b908a780a` kata-env: Pass cmd option for file path `b1920198b` config: Workaround the way agent and hypervisor configs are fetched `f2b2621de` kata-env: Implement the kata-env command. `c849bdb0a` gha: Also run k8s tests on qemu-sev `6bf1fc605` virtcontainers/factory: Improved test coverage `0d49ceee0` gha: Fix snap creation workflow warnings `138ada049` gpu: Cold Plug VFIO toml setting `defb64334` runtime: remove overriding ARCH value by default for ppc64le `f7ad75cb1` gpu: Cold-plug extend the api.md `0fec2e698` gpu: Add cold-plug test `f2ebdd81c` utils: Get rid of spurious print statement left behind. 
`9a94f1f14` make: Export VERSION and COMMIT `2f81f48da` config: Add file under /opt as another location to look for the config `07f7d17db` config: Make the pipe_size field optional `68f635773` config: Make function to get the default conf file public `7565b3356` kata-ctl: Implement Display trait for GuestProtection enum `94a00f934` utils: Make certain constants in utils.rs public `572b338b3` gitignore: Ignore .swp and .swo editor backup files `376884b8a` cargo: Update version of clap to 4.1.13 `17daeb9dd` warning_fix: fix warnings when build with cargo-1.68.0 `521519d74` gha: Add the ability to test qemu-sev `205909fbe` runtime: Fix virtiofs fd leak `5226f15c8` gha: Fix Body Line Length action flagging empty body commit messages `0f45b0faa` virtcontainers/clh_test.go: improve unit test coverage `dded731db` gpu: Add OVMF setting for MMIO aperture `2a830177c` gpu: Add fwcfg helper function `131f056a1` gpu: Extract VFIO Functions to drivers `c8cf7ed3b` gpu: Add ColdPlug of VFIO devices with devManager `e2b5e7f73` gpu: Add Rawdevices to hypervisor `6107c32d7` gpu: Assign default value to cold-plug `377ebc2ad` gpu: Add configuration option for cold-plug VFIO `c18ceae10` gpu: Add new struct PCIePort `9c38204f1` virtcontainers/persist: Improved test coverage 65% to 87.5% `1c1ee8057` pkg/signals: Improved test coverage 60% to 100% `cc8ea3232` runtime-rs: support keep_abnormal in toml config `96e8470db` kata-manager: Fix containerd download `432d40744` kata-ctl: checks for kvm, kvm_intel modules loaded `b1730e4a6` gpu: Add new kernel build option to usage() `3e7b90226` osbuilder: Fix D-Bus enabling in the dracut case `53c749a9d` agent: Fix ut issue caused by fd double closed `2e3f19af9` agent: fix clippy warnings caused by protobuf3 `4849c56fa` agent: Fix unit test issue cuased by protobuf upgrade `0a582f781` trace-forwarder: remove unused crate protobuf `73253850e` kata-ctl: remove unused crate ttrpc `76d2e3054` agent-ctl: Bump ttrpc from 0.6.0 to 0.7.1 `eb3d20dcc` 
protocols: Add ut for Serde `59568c79d` protocols: add support for Serde `a6b4d92c8` runtime-rs: Bump ttrpc from 0.6.0 to 0.7.1 `ac7c63bc6` gpu: Add containerd shim for qemu-gpu `a0cc8a75f` gpu: Add a kube runtime class `a81fff706` gpu: Adding a GPU enabled configuration `8af6fc77c` agent: Bump ttrpc from 0.6.0 to 0.7.1 `009b42dbf` protocols: Fix unit test `392732e21` protocols: Bump ttrpc from 0.6.0 to 0.7.1 `f4f958d53` gpu: Do not pass-through PCI (Host) Bridges `825e76948` gpu: Add GPU support to default kernel without any TEE `e4ee07f7d` gpu: Add GPU TDX experimental kernel `a1272bcf1` gha: tdx: Fix typo overlay -> overlays `3fa0890e5` cache-components: Fix TDVF caching `80e3a2d40` cache-components: Fix TDX QEMU caching `87ea43cd4` gpu: Add configuration fragment `aca6ff728` gpu: Build and Ship an GPU enabled Kernel `dc662333d` runtime: Increase the dial_timeout `eb1762e81` osbuilder: Enable dbus in the dracut case `f478b9115` clh: tdx: Update timeouts for confidential guest `3b76abb36` kata-deploy: Ensure node is ready after CRI Engine restart `5ec9ae0f0` kata-deploy: Use readinessProbe to ensure everything is ready `ea386700f` kata-deploy: Update podOverhead for TDX `e31efc861` gha: tdx: Use the k3s overlay `542bb0f3f` gha: tdx: Set KUBECONFIG env at the job level `d7fdf19e9` gha: tdx: Delete kata-deploy after the tests finish `da35241a9` tests: k8s: Skip k8s-cpu-ns when testing TDX `db2cac34d` runtime: Don't create socket file in /run/kata `6d315719f` snap: fix docker start fail issue `e4b3b0887` gpu: Add proper CONFIG_LOCALVERSION depending on TEE `69ba2098f` runtime-rs: remove network entities and netns `b31f103d1` runtime-rs: enable nerdctl cni plugin `69d7a959c` gha: ci-on-push: Run tests on TDX `5a0727ecb` kata-deploy: Ship kata-qemu-tdx runtimeClass `98682805b` config: Add configuration for QEMU TDX `3e1580019` govmm: Directly pass the firmware using -bios with TDX `3c5ffb0c8` govmm: Set "sept-ve-disable=on" `ed145365e` runtime/qemu: Drop 
"kvm-type=tdx" `25b3cdd38` virtcontainers: Drop check for the `tdx` CPU flag `01bdacb4e` virtcontainers: Also check /sys/firmwares/tdx for TDX `9feec533c` cache: Add ability to cache OVMF `ce8d98251` gha: Build and ship the OVMF for TDX `39c3fab7b` local-build: Add support to build OVMF for TDX `054174d3e` versions: Bump OVMF for TDX `800fb49da` packaging: Add get_ovmf_image_name() helper `fbf03d7ac` cache: Document kernel-tdx-experimental `5d79e9696` cache: Add a space to ease the reading of the kernel flavours `6e4726e45` cache: Fix typos `fc22ed0a8` gha: Build and ship the Kernel for TDX `502844ced` local-build: Add support to build Kernel for TDX `b2585eecf` local-build: Avoid code duplication building the kernel `f33345c31` versions: Update Kernel TDX version `20ab2c242` versions: Move Kernel TDX to its own experimental entry `3d9ce3982` cache: Allow specifying the QEMU_FLAVOUR `33dc6c65a` gha: Build and ship QEMU for TDX `eceaae30a` local-build: Add support to build QEMU for TDX `f7b7c187e` static-build: Improve qemu-experimental build script `3018c9ad5` versions: Update QEMU TDX version `800ee5cd8` versions: Move QEMU TDX to its own experimental entry `1315bb45f` local-build: Add dragonball kernel to the `all` target `73e108136` local-build: Rename non vanilla kernel build functions `1d851b4be` local-build: Cosmetic changes in build targets `49ce685eb` gha: k8s-on-aks: Always delete the AKS cluster `e2a770df5` gha: ci-on-push: Run k8s tests with dragonball `d1f550bd1` docs: update the rust version from versions.yaml `f3595e48b` nydus_rootfs/prefetch_files: add prefetch_files for RAFS `3bfaafbf4` fix: oci hook `c1fbaae8d` rustjail: Use CPUWeight with systemd and CgroupsV2 `375187e04` versions: Upgrade to Cloud Hypervisor v31.0 `79f3047f0` gha: k8s-on-aks: {create,delete} AKS must be a coded-in step `2f35b4d4e` gha: ci-on-push: Only run on `main` branch `e7bd2545e` Revert "gha: ci-on-push: Depend on Commit Message Check" `0d96d4963` Revert "gha: ci-on-push: 
Adjust to using workflow_run" `c7ee45f7e` Revert "gha: ci-on-push: Adapt chained jobs to workflow_run" `5d4d72064` Revert "gha: k8s-on-aks: Fix cluster name" `13d857a56` gha: k8s-on-aks: Set {create,delete}_aks as steps `dc6569dbb` runtime-rs/virtio-fs: add support extra handler for cache mode. `85cc5bb53` gha: k8s-on-aks: Fix cluster name `1688e4f3f` gha: aks: Use D4s_v5 instance `108d80a86` gha: Add the ability to also test Dragonball `2550d4462` gha: build-kata-static-tarball: Only push to registry after merge `e81b8b8ee` local-build: build-and-upload-payload is not quay.io specific `13929fc61` gha: publish-kata-deploy-payload: Improve registry login `41026f003` gha: payload-after-push: Pass registry / repo as inputs `7855b4306` gha: ci-on-push: Adapt chained jobs to workflow_run `3a760a157` gha: ci-on-push: Adjust to using workflow_run `a159ffdba` gha: ci-on-push: Depend on Commit Message Check `8086c75f6` gha: Also run k8s tests on AKS with dragonball `fe86c08a6` tools: Avoid building the kernel twice `3215860a4` gha: Set ci-on-push to run on `pull_request_target` `d17dfe4cd` gha: Use ghcr.io for the k8s CI `b661e0cf3` rustjail: Add anyhow context for D-Bus connections `60c62c3b6` gha: Remove kata-deploy-test.yaml `43894e945` gha: Remove kata-deploy-push.yaml `cab9ca043` gha: Add a CI pipeline for Kata Containers `53b526b6b` gha: k8s: Add snippet to run k8s tests on aks clusters `c444c24bc` gha: aks: Add snippets to create / delete aks clusters `11e0099fb` tests: Move k8s tests to this repo `73be4bd3f` gha: Update actions for release.yaml `d38d7fbf1` gha: Remove code duplication from release.yaml `56331bd7b` gha: Split payload-after-push-*.yaml `a552a1953` docs: Update CNM url in networking document `7796e6ccc` rustjail: Fix minor grammatical error in function name `41fdda1d8` rustjail: Do not unwrap potential error with cgroup manager `a914283ce` kata-ctl: add function to get platform protection. 
`0f7351556` runtime: add filter metrics with specific names `cbe6ad903` runtime: support non-root for clh `d3bb25418` utils: Add function to check vhost-vsock Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-19 09:26:36 +02:00
Fabiano Fidêncio	0364620844	Merge pull request #6819 from fidencio/topic/use-static-sandbox-resource-mgmt-for-TEEs runtime: Use static_sandbox_resource_mgmt=true for TEEs	2023-05-18 22:38:31 +02:00
Fabiano Fidêncio	2ea8acaaa5	Merge pull request #6882 from bergwolf/github/tokio update tokio dependency	2023-05-18 20:35:16 +02:00
Krister Johansen	eff6ed2d5f	runtime: make debug console work with sandbox_cgroup_only If a hypervisor debug console is enabled and sandbox_cgroup_only is set, the hypervisor can fail to open /dev/ptmx, which prevents the sandbox from launching. This is caused by the absence of a device cgroup entry to allow access to /dev/ptmx. When sandbox_cgroup_only is not set, the hypervisor inherits the default unrestricted device cgroup, but with it enabled it runs into allow / deny list restrictions. Fix by adding an allowlist entry for /dev/ptmx when debug is enabled, sandbox_cgroup_only is true, and no /dev/ptmx is already in the list of devices. Fixes: #6870 Signed-off-by: Krister Johansen <kjlx@templeofstupid.com>	2023-05-18 10:36:24 -07:00
Gabriela Cervantes	11a34a72e2	docs: Update container network model url This PR updates the container network model url that is part of the virtcontainers documentation. Fixes #6889 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-05-18 15:08:08 +00:00
Peng Tao	f6e1b1152c	agent: update tokio dependency To 1.28.1 to bring in the latest fixes. Fixes: #6881 Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2023-05-18 09:36:06 +00:00
Shuaiyi Zhang	c477ac551f	dragonball: Convert VirtioNetDeviceMgr function to method Convert VirtioNetDeviceMgr::insert_device and VirtioNetDeviceMgr::update_device_ratelimiters to method. Fixes: #6880 Signed-off-by: Shuaiyi Zhang <zhang_syi@qq.com>	2023-05-18 16:57:01 +08:00
Shuaiyi Zhang	4659facb74	dragonball: Convert BlockDeviceMgr function to method Convert BlockDeviceMgr::insert_device, BlockDeviceMgr::remove_device and BlockDeviceMgr::update_device_ratelimiters to method. Fixes: #6880 Signed-off-by: Shuaiyi Zhang <zhang_syi@qq.com>	2023-05-18 16:56:49 +08:00
Peng Tao	4cb83dc219	kata-ctl: update tokio dependency Update to 1.28.1 To pick up the latest fixes. Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2023-05-18 08:25:13 +00:00
Peng Tao	df615ff252	runk: update tokio dependency Update to 1.28.1 to pick up latest fixes. Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2023-05-18 08:24:41 +00:00
Peng Tao	ca6892ddb1	runtime-rs: update tokio dependency Unify it to the latest 1.28.1 version. Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2023-05-18 08:18:22 +00:00
Fabiano Fidêncio	3a4b924226	Merge pull request #6833 from rye-stripe/bugfix/vcpu-pinning resource-control: fix setting CPU affinities on Linux	2023-05-18 08:12:39 +02:00
Xuewei Niu	ee6deef09d	dragonball: Remove virtio-net and vsock devices gracefully This MR implements removing virtio-net and virtio-vsock devices gracefully when shutting down VMM. Fixes: #6684 Signed-off-by: Zizheng Bian <zizheng.bian@linux.alibaba.com> Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2023-05-18 12:11:20 +08:00
Fabiano Fidêncio	e762f70920	Merge pull request #6838 from rye-stripe/bugfix/use-enable-vcpus-pinning-from-toml runtime: use enable_vcpus_pinning from toml	2023-05-17 21:30:44 +02:00
Fabiano Fidêncio	ca1531fe9d	runtime: Use static_sandbox_resource_mgmt=true for TEEs When this option is enabled the runtime will attempt to determine the appropriate sandbox size (memory, CPU) before booting the virtual machine. As TEEs do not support memory and CPU hotplug, this approach must be used. Fixes: #6818 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-17 19:21:52 +02:00
Fabiano Fidêncio	851b97fa51	Merge pull request #6866 from fidencio/topic/gha-improve-actions gha: k8s: Make the tests more reliable	2023-05-17 19:19:18 +02:00
Fabiano Fidêncio	8ce14e709a	Merge pull request #6810 from fitzthum/snp-enable gha: Enable SEV-SNP tests on main	2023-05-17 15:29:54 +02:00
Greg Kurz	206df04b99	Merge pull request #6858 from fidencio/topic/gha-tdx-fix-cleanup gha: tdx: Use the k3s overlay for kata-cleanup	2023-05-17 15:04:56 +02:00
Wainer Moschetta	259158f1c3	Merge pull request #6789 from dubek/add-sev-package runtime: Port sev package to main	2023-05-17 10:02:19 -03:00
Fabiano Fidêncio	fa832f4709	gha: k8s: Make the tests more reliable Whether we like it or not, every now and then we'll have to deal with flaky tests, and our tests using GHA are not exempt from that fact. With this simple commit, we're trying to improve the reliability of the tests on a few different fronts: * Giving enough time for the script used by kata-deploy to be executed * We've hit issues as the kata-deploy pod is considered "Ready" at the moment it starts running, not when it finishes the needed setup. We should also be looking into how to solve this on the kata-deploy side but, for now, let's ensure our tests do not break with the current kata-deploy behavior. * Merging the "Deploy kata-deploy" and "Run tests" steps * We've hit issues re-running tests and seeing even more failures than the ones we're trying to debug, as a step will simply be taken as succeeded as part of the re-run if it was successfully executed as part of the first run. This causes issues with the kata-deploy deployment, as the tests would start running before even having the node set up for running Kata Containers. Fixes: #6865 #6649 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-17 13:38:08 +02:00
Tobin Feldman-Fitzthum	cbb9fe8b81	config: Use standard OVMF with SEV The AmdSev firmware package should be used with measured direct boot. If the expected hashes are not injected into the firmware binary by the VMM, the guest will not boot. This is required for security. Currently the main branch does not have the extended shim support for SEV, which tells the VMM to inject the expected hashes. We ship the standard OVMF package to use with SNP, so let's switch SEV to that for now. This will need to be changed back when shim support for SEV(-ES) is added to main. Signed-off-by: Tobin Feldman-Fitzthum <tobin@ibm.com>	2023-05-17 11:36:04 +02:00
Tobin Feldman-Fitzthum	724437efb3	kata-deploy: add kata-qemu-sev runtimeclass In order to populate containerd config file with support for SEV, we need to add the qemu-sev shim to the kata-deploy script. Signed-off-by: Tobin Feldman-Fitzthum <tobin@ibm.com>	2023-05-17 11:36:02 +02:00
Tobin Feldman-Fitzthum	521dad2a47	Tests: skip CPU constraints test on SEV and SNP Currently Kata does not support memory / CPU hotplug for SEV or SEV-SNP so we need to skip tests that rely on it. Signed-off-by: Tobin Feldman-Fitzthum <tobin@ibm.com>	2023-05-17 11:35:13 +02:00
Tobin Feldman-Fitzthum	72308ddb07	gha: ci-on-push: Don't skip tests for SEV Now that SEV artifacts are built by GHA, remove conditional that skips tests when using qemu-sev. Signed-off-by: Tobin Feldman-Fitzthum <tobin@ibm.com>	2023-05-17 11:35:13 +02:00
Tobin Feldman-Fitzthum	da0f92cef8	gha: ci-on-push: Don't skip tests for SEV-SNP Now that we have SNP artifacts in place and they are built via gha, remove the condition that skips the tests for SNP. Fixes: #6809 Signed-off-by: Tobin Feldman-Fitzthum <tobin@ibm.com>	2023-05-17 11:35:13 +02:00
fupan	2bda92face	netlink: Fix the issue of update_interface When updating an interface, there may already be an existing interface whose name is the same as the requested new name, in which case the update would fail with an "interface name exists" error. Thus we should rename the existing interface to a temporary name and swap it with the previous interface name last. Fixes: #6842 Signed-off-by: fupan <fupan.lfp@antgroup.com>	2023-05-17 16:45:49 +08:00
Fabiano Fidêncio	12f43bea0f	gha: tdx: Use the k3s overlay for kata-cleanup As the TDX CI runs on k3s, we must ensure the cleanup, as already done for the deploy, used the k3s overlay. Fixes: #6857 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-17 09:50:29 +02:00
Fabiano Fidêncio	9630c13ac0	Merge pull request #6845 from fidencio/topic/yet-more-nvidia-gpu-naming-fixes gpu: Rename the last bits from `gpu` to `nvidia-gpu`	2023-05-17 09:05:12 +02:00
Steve Horsman	e4a458035c	Merge pull request #6852 from stevenhorsman/container-image-arch-consistency deploy: fix shell script error	2023-05-17 08:01:39 +01:00
Amulya Meka	3ccc29030d	Merge pull request #6780 from Amulyam24/rust-virtfs ppc64le: switch virtiofsd from C to rust version	2023-05-17 09:36:28 +05:30
GabyCT	e0e46de12d	Merge pull request #6849 from GabyCT/topic/fixtabs osbuilder: Fix indentation in rootfs.sh	2023-05-16 16:47:09 -06:00
stevenhorsman	1a3f8fc1a2	deploy: fix shell script error - Remove local introduced by bad copy-paste Fixes: #6814 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2023-05-16 19:30:32 +01:00
Salvador Fuentes	b76058c979	Merge pull request #6721 from nedsouza/virtcontainers-qemu-go-coverage virtcontainers/qemu_test.go: Improve coverage	2023-05-16 11:11:43 -06:00
Feng Wang	ebc8e8e2fd	Merge pull request #6773 from jepio/agent-config-error-context agent: Add context to errors that may occur when AgentConfig file is …	2023-05-16 09:21:34 -07:00
Gabriela Cervantes	87cb98c01d	osbuilder: Fix indentation in rootfs.sh This PR replaces single spaces with tabs in order to fix the indentation of the rootfs script. Fixes #6848 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-05-16 15:30:50 +00:00
James O. D. Hunt	a96fcfd5be	Merge pull request #6735 from nedsouza/258/tests-coverage-compatoci virtcontainers/pkg/compatoci/: Improved coverage for for Kata 2.0	2023-05-16 15:36:35 +01:00
Amulyam24	c5a59caca1	ppc64le: switch virtiofsd from C to rust version We have been using the C version of virtiofsd on ppc64le. Now that the issues with rust virtiofsd have been fixed, let's switch to it. Fixes: #4259 Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2023-05-16 14:46:19 +02:00
Amulyam24	bfdf0144aa	versions: Bump virtiofsd to 1.6.1 virtiofsd v1.6.1 has been released with the fixes required for running successfully on ppc64le. Fixes: #4259 Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2023-05-16 14:46:16 +02:00
Dov Murik	dd7562522a	runtime: pkg/sev: Add kbs utility package for SEV pre-attestation Supports both online and offline modes of interaction with simple-kbs for SEV/SEV-ES confidential guests. Fixes: #6795 Signed-off-by: Dov Murik <dovmurik@linux.ibm.com>	2023-05-16 15:27:32 +03:00
Dov Murik	05de7b2607	runtime: Add sev package The sev package provides utilities for launching AMD SEV and SEV-ES confidential guests. Fixes: #6795 Signed-off-by: Dov Murik <dovmurik@linux.ibm.com>	2023-05-16 15:27:32 +03:00
Fabiano Fidêncio	3a9d3c72aa	gpu: Rename the last bits from `gpu` to `nvidia-gpu` Let's specifically name the `gpu` runtime class as `nvidia-gpu`. By doing this we keep the door open and ease the life of the next vendor adding GPU support for Kata Containers. Fixes: #6553 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-16 13:47:52 +02:00
Fabiano Fidêncio	4cde844f70	local-build: Fix kernel-nvidia-gpu target name It must have `-tarball` as part of its name. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-16 13:34:52 +02:00
Archana Shinde	8d10d157b3	Merge pull request #6823 from jodh-intel/utils-kata-manager-containerd-fix kata-manager: Fix '-o' syntax and logic error	2023-05-15 21:44:35 -07:00
Bin Liu	47a02dcc7f	Merge pull request #6767 from ngpatel6/Issue-5403 kata-ctl: Add the option to install kata-ctl to a user specified directory	2023-05-16 10:43:40 +08:00
Chao Wu	911d8a5a7f	Merge pull request #6804 from pmores/fix-rust-version-in-docs runtime-rs: fix building instructions to use correct required Rust ve…	2023-05-16 10:14:05 +08:00
Bin Liu	2cd2d02d1f	Merge pull request #6812 from ZhangShuaiyi/dev/write_bootparams Dragonball: use LinuxBootConfigurator::write_bootparams	2023-05-16 09:54:41 +08:00
GabyCT	3d8185863d	Merge pull request #6835 from GabyCT/topic/buildkataproxy kata-deploy: Add http_proxy as part of the docker build	2023-05-15 16:15:27 -06:00
Narendra Patel	593840e075	kata-ctl: Allow INSTALL_PATH= to be specified Update the kata-ctl install rule to allow it to be installed to a given directory. The Makefile was updated to use an INSTALL_PATH variable to track where the kata-ctl binary should be installed. If the user doesn't specify anything, then it uses the default path that cargo uses. Otherwise, it will install it in the directory that the user specified. The README.md file was also updated to show how to use the new option. Fixes #5403 Co-authored-by: Cesar Tamayo <cesar.tamayo@intel.com> Co-authored-by: Kevin Mora Jimenez <kevin.mora.jimenez@intel.com> Co-authored-by: Narendra Patel <narendra.g.patel@intel.com> Co-authored-by: Ray Karrenbauer <ray.karrenbauer@intel.com> Co-authored-by: Srinath Duraisamy <srinath.duraisamy@intel.com> Signed-off-by: Narendra Patel <narendra.g.patel@intel.com>	2023-05-15 17:21:49 -04:00
Peteris Rudzusiks	bdb75fb21e	runtime: use enable_vcpus_pinning from toml Set the default value of runtime's EnableVCPUsPinning to value read from .toml. Fixes: #6836 Signed-off-by: Peteris Rudzusiks <rye@stripe.com>	2023-05-15 21:41:20 +02:00
Tamas K Lengyel	20cb875087	virtcontainers/qemu_test.go: Improve test coverage Rework TestQemuCreateVM routine to be a table driven test with various config variations passed to it. After CreateVM a handful of additional functions are exercised to improve code-coverage. Also add partial coverage for StartVM routine. Currently improving from 19.7% to 35.7%. Credit PR to Hackathon Team3. Fixes: #267 Signed-off-by: Tamas K Lengyel <tamas.lengyel@intel.com>	2023-05-15 15:26:35 -04:00
Fabiano Fidêncio	da877a603d	Merge pull request #6829 from fidencio/topic/kata-deploy-remove-tarball-from-payload-image kata-deploy: Do not ship the kata tarball	2023-05-15 19:01:14 +02:00
Gabriela Cervantes	b9a1db2601	kata-deploy: Add http_proxy as part of the docker build Add http_proxy and https_proxy as part of the docker build arguments in order to build properly when we are behind a proxy. Fixes #6834 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-05-15 15:57:29 +00:00
Peteris Rudzusiks	3e85bf5b17	resource-control: fix setting CPU affinities on Linux With this fix the vCPU pinning feature chooses the correct physical cores to pin the vCPU threads on rather than always using core 0. Fixes #6831 Signed-off-by: Peteris Rudzusiks <rye@stripe.com>	2023-05-15 16:46:36 +02:00
Pavel Mores	5f3f844a1e	runtime-rs: fix building instructions with respect to required Rust version Fixes: #6803 Signed-off-by: Pavel Mores <pmores@redhat.com>	2023-05-15 16:30:41 +02:00
Fabiano Fidêncio	9e83795fca	Merge pull request #6825 from fidencio/topic/kata-deploy-build-improvements kata-deploy: Build improvements	2023-05-15 13:49:15 +02:00
Fabiano Fidêncio	802cd2f673	Merge pull request #6821 from stevenhorsman/container-image-arch-consistency deploy: Fix arch in image tag	2023-05-15 11:16:01 +02:00
Fabiano Fidêncio	815b4e8dac	Merge pull request #6816 from fidencio/topic/kata-deploy-fixes Revert "kata-deploy: Use readinessProbe to ensure everything is ready"	2023-05-15 10:24:58 +02:00
Fabiano Fidêncio	777c3dc8d2	kata-deploy: Do not ship the kata tarball There's absolutely no reason to ship the kata-static tarball as part of the payload image, as: * The tarball is already part of the release process * The payload image already has uncompressed content of the tarball * The tarball itself is not used anywhere by the kata-deploy scripts Fixes: #6828 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-15 09:22:39 +02:00
LiuWeijie	50cc9c582f	tests: Improve coverage for virtcontainers/pkg/compatoci/ for Kata 2.0 Add test cases for ParseConfigJson function and GetContainerSpec function Fixes: #258 Signed-off-by: LiuWeijie <weijie.liu@intel.com>	2023-05-15 11:58:17 +08:00
Fabiano Fidêncio	136e2415da	static-build: Download firecracker instead of building it There's no reason for us to build firecracker instead of simply downloading the official released tarball, as tarballs are provided for the architectures we want to use them. Fixes: #6770 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-12 22:05:33 +02:00
Fabiano Fidêncio	3bf767cfcd	static-build: Adjust ARCH for nydus When building from aarch64, just use "arm64" as that's what's used in the name of the released nydus tarballs. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-12 22:05:33 +02:00
Fabiano Fidêncio	ac88d34e0c	static-build: Use released binary for CLH (aarch64) There's no need to build Cloud Hypervisor aarch64 as, for a few releases already, Cloud Hypervisor provides an official release binary for the architecture. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-12 22:05:01 +02:00
Archana Shinde	32b39ee347	Merge pull request #6763 from nedsouza/266/tests_coverage_virtcontainers_fc virtcontainers: Improved test coverage for fc.go from 4.6% to 18.5%	2023-05-12 11:53:27 -07:00
James O. D. Hunt	73913c8eb7	kata-manager: Fix '-o' syntax and logic error Fix the syntax and logic error that is only displayed if the user runs the script with `-o`. This option requests that "only" Kata Containers is installed and stops containerd from being installed. Fixes: #6822. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-05-12 16:44:24 +01:00
stevenhorsman	2856d3f23d	deploy: Fix arch in image tag `uname -m` produces `x86_64`, but container image convention is to use `amd64`, so update this in the tag Fixes: #6820 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2023-05-12 16:14:19 +01:00
Fabiano Fidêncio	42dce15b1f	Merge pull request #6450 from singhwang/main main \| release: Fix multi-arch publishing is not supported	2023-05-12 15:25:59 +02:00
Fabiano Fidêncio	e8f81ee93d	Revert "kata-deploy: Use readinessProbe to ensure everything is ready" This reverts commit `5ec9ae0f04`, for two main reasons: * The readinessProbe was misinterpreted by me when working on the original PR * It's actually causing issues, as the pod ends up marked as not healthy.	2023-05-12 14:28:23 +02:00
SinghWang	cfe63527c5	release: Fix multi-arch publishing is not supported When release is published, kata-deploy payload and kata-static package can support multi-arch publishing. Fixes: #6449 Signed-off-by: SinghWang <wangxin_0611@126.com> Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-12 13:36:44 +02:00
Shuaiyi Zhang	197c336516	Dragonball: use LinuxBootConfigurator::write_bootparams to write the boot parameters into guest memory. Fixes: #6813 Signed-off-by: Shuaiyi Zhang <zhang_syi@qq.com>	2023-05-12 16:07:44 +08:00
Fabiano Fidêncio	181017d1d8	Merge pull request #6811 from fidencio/topic/yet-more-fixes-for-nvidia-gpu-kernels cache: More fixes to nvidia-gpu kernels caching	2023-05-12 10:02:08 +02:00
Amulya Meka	76f975e5e6	Merge pull request #6742 from Amulyam24/agent-build runtime: remove overriding ARCH value by default for ppc64le	2023-05-12 12:34:50 +05:30
Archana Shinde	20ac3917ad	Merge pull request #6739 from byron-marohn/fix_5561 gha: Fix Body Line Length action flagging empty body commit messages	2023-05-11 15:17:07 -07:00
Archana Shinde	1ad442e656	Merge pull request #6748 from nedsouza/fix-snap gha: Fix snap creation workflow	2023-05-11 15:09:22 -07:00
Fabiano Fidêncio	4d17ea4a01	cache: Fix nvidia-snp caching version All the kernel-foo instances, such as "kernel-sev" or "kernel-snp", should be transformed into "kernel.foo" when looking at the versions.yaml file. This was already done for SEV, but missed on the SNP case. Fixes: #6777 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-11 21:26:58 +02:00
Fabiano Fidêncio	a133fadbfa	cache: Fix nvidia-gpu-tdx-experimental cache URL We were passing "kernel-nvidia-gpu-tdx", missing the "-experimental" part, leading to a non-valid URL. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-11 21:20:06 +02:00
Fabiano Fidêncio	a7dd6cbadd	Merge pull request #6807 from fidencio/topic/fix-nvidia-gpu-cache cache: Fix nvidia-gpu version	2023-05-11 17:40:41 +02:00
Fabiano Fidêncio	b9990c2017	cache: Fix nvidia-gpu version `c9bf7808b6` introduced the logic to properly get the version of nvidia-gpu kernels, but one important part was dropped during the rebase into main, which is actually getting the correct version of the kernel. Fixing this now, and using the old issue as reference. Fixes: #6777 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-11 13:55:14 +02:00
Fabiano Fidêncio	14939d00ad	Merge pull request #6778 from fidencio/topic/cache-gpu-related-kernels cache: Update the KERNEL_FLAVOUR list to include nvidia-gpu	2023-05-11 13:14:45 +02:00
Fabiano Fidêncio	c9bf7808b6	cache: Update the KERNEL_FLAVOUR list to include nvidia-gpu We need to make sure that, when caching a `-nvidia-gpu` kernel, we still look at the version of the base kernel used to build the nvidia-gpu drivers, as the ${vendor}-gpu kernels are based on already existing entries in the versions.yaml file and do not require a new entry to be added. Fixes: #6777 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-11 10:56:13 +02:00
Fabiano Fidêncio	3665b42045	gpu: Rename `gpu` targets to `nvidia-gpu` This will make it easier for other GPU vendors to add the needed bits in the future. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-11 10:55:55 +02:00
Fabiano Fidêncio	edfaae85cb	Merge pull request #6700 from fitzthum/snp-artifacts packaging: Add SEV-SNP artifacts to main	2023-05-11 10:47:10 +02:00
James O. D. Hunt	fe33015075	Merge pull request #6794 from jodh-intel/docs-mark-snap-as-unmaintained docs: Mark snap installation method as unmaintained	2023-05-11 09:14:25 +01:00
Fabiano Fidêncio	c937d0a5d4	Merge pull request #6591 from UnmeshDeodhar/add-sev-artifacts-to-main packaging: Add sev artifacts to main	2023-05-11 09:09:36 +02:00
Tobin Feldman-Fitzthum	2c90cac751	local-build: fixup alphabetization A few pieces of the local-build tooling are supposed to be alphabetized. Fixup a couple minor issues that have accumulated. Signed-off-by: Tobin Feldman-Fitzthum <tobin@ibm.com>	2023-05-10 21:23:38 +00:00
Tobin Feldman-Fitzthum	4da6eb588d	kata-deploy: Add qemu-snp shim Now that we have the SNP components in place, make sure that kata-deploy knows about the qemu-snp shim so that it will be added to containerd config. Fixes: #6575 Signed-off-by: Tobin Feldman-Fitzthum <tobin@ibm.com>	2023-05-10 20:55:36 +00:00
Tobin Feldman-Fitzthum	14dd053758	kata-deploy: add kata-qemu-snp runtimeclass Since SEV-SNP has limited hotplug support, increase the pod overhead to account for fixed resource usage. Signed-off-by: Tobin Feldman-Fitzthum <tobin@ibm.com>	2023-05-10 20:55:36 +00:00
Tobin Feldman-Fitzthum	0bb37bff78	config: Add SNP configuration SNP requires many specific configurations, so let's make a new SNP configuration file that we can use with the kata-qemu-snp runtime class. Signed-off-by: Tobin Feldman-Fitzthum <tobin@ibm.com> Signed-off-by: Alex Carter <Alex.Carter@ibm.com>	2023-05-10 20:55:36 +00:00
Chelsea Mafrica	13f9ba2298	Merge pull request #6379 from cmaf/kata-ctl-check-kvm-1 kata-ctl: add generic kvm check & unit test	2023-05-10 13:33:57 -07:00
Tobin Feldman-Fitzthum	af7f2519bf	versions: update SEV kernel description SNP and SEV will share a (guest) kernel. Update the description in versions.yaml to mention this. Signed-off-by: Tobin Feldman-Fitzthum <tobin@ibm.com>	2023-05-10 20:27:12 +00:00
Tobin Feldman-Fitzthum	dbcc3b5cc8	local-build: fix default values for OVMF build Existing value has wrong name and compression type leading to installation failure. Signed-off-by: Tobin Feldman-Fitzthum <tobin@ibm.com>	2023-05-10 20:27:12 +00:00
Tobin Feldman-Fitzthum	b8bbe6325f	gha: build OVMF for tests and release The x86_64 package of OVMF is required for deployments that don't use kernel hashes, which includes SEV-SNP in the short term. We should keep this in the bundle in the long term in case someone wants to disable kernel hashes. Signed-off-by: Tobin Feldman-Fitzthum <tobin@ibm.com>	2023-05-10 20:27:12 +00:00
Tobin Feldman-Fitzthum	cf0ca265f9	local-build: Add x86_64 OVMF target Add targets to build the "plain" x86_64 OVMF. This will be used by anyone who is using SEV or SNP without kernel hashes. The SNP QEMU does not yet support kernel hashes so the OvmfPkg will be used by default. Signed-off-by: Tobin Feldman-Fitzthum <tobin@ibm.com> Signed-off-by: Alex Carter <Alex.Carter@ibm.com>	2023-05-10 20:24:51 +00:00
Tobin Feldman-Fitzthum	db095ddeb4	cache: add SNP flavor to comments Update comments to include new SNP QEMU option Signed-off-by: Tobin Feldman-Fitzthum <tobin@ibm.com>	2023-05-10 20:19:56 +00:00
Tobin Feldman-Fitzthum	f4ee00576a	gha: Build and ship QEMU for SNP Now that we can build SNP QEMU, let's do that for tests and release. Signed-off-by: Tobin Feldman-Fitzthum <tobin@ibm.com>	2023-05-10 20:19:56 +00:00
Tobin Feldman-Fitzthum	7a58a91fa6	docs: update SNP guide Since we reshuffled versions.yaml, update the guide so that we can find the SNP QEMU info. Once runtime support is merged we should overhaul or remove this guide, but let's keep it for now. Signed-off-by: Tobin Feldman-Fitzthum <tobin@ibm.com>	2023-05-10 20:19:56 +00:00
Tobin Feldman-Fitzthum	879333bfc7	versions: update SNP QEMU version Refactor SNP QEMU entry in versions.yaml to match qemu-experimental and qemu-tdx-experimental. Also, update the version of QEMU to what we are using in CCv0. This is the non-UPM QEMU and it does not have kernel hashes support. Signed-off-by: Tobin Feldman-Fitzthum <tobin@ibm.com> Signed-off-by: Alex Carter <Alex.Carter@ibm.com>	2023-05-10 20:19:56 +00:00
Tobin Feldman-Fitzthum	38ce4a32af	local-build: add support to build QEMU for SEV-SNP Add Make targets and helper functions to build the QEMU needed for SEV-SNP. Signed-off-by: Tobin Feldman-Fitzthum <tobin@ibm.com> Signed-off-by: Alex Carter <Alex.Carter@ibm.com>	2023-05-10 20:19:56 +00:00
Chelsea Mafrica	5f8008b69c	kata-ctl: add unit test for kvm check Check that kvm test fails when run as non-root and when device specified is not /dev/kvm. Fixes #5338 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2023-05-10 10:29:20 -07:00
Chelsea Mafrica	a085a6d7b4	kata-ctl: add generic kvm check Add kvm check using ioctl macro to create a syscall that checks the kvm api version and if creation of a vm is successful. Fixes #5338 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2023-05-10 10:29:20 -07:00
Unmesh Deodhar	772d4db262	gha: Build and ship SEV initrd We have code that builds initrd for SEV. Thus, adding that to the test and release process. Fixes: #6572 Signed-off-by: Unmesh Deodhar <udeodhar@amd.com>	2023-05-10 12:19:56 -05:00
Unmesh Deodhar	45fa366926	gha: Build and ship SEV OVMF SEV requires special OVMF to work. Thus, building that for test and release. Fixes: #6572 Signed-Off-By: Unmesh Deodhar <udeodhar@amd.com>	2023-05-10 12:19:56 -05:00
Unmesh Deodhar	4770d3064a	gha: Build and ship SEV kernel. SEV requires custom kernel arguments when building. Thus, adding it to the test and release process. Fixes: #6572 Signed-off-by: Unmesh Deodhar <udeodhar@amd.com>	2023-05-10 12:19:56 -05:00
Unmesh Deodhar	fb9c1fc36e	runtime: Add qemu-sev config Adding config file that can be used with qemu-sev runtime class. Since SEV has limited hotplug support, increase the pod overhead to account for fixed resource usage. Fixes: #6572 Signed-off-by: Unmesh Deodhar <udeodhar@amd.com>	2023-05-10 12:19:56 -05:00
Unmesh Deodhar	813e4c576f	runtimeClasses: add sev runtime class Adding kata-qemu-sev runtime class. Fixes: #6572 Signed-off-by: Unmesh Deodhar <udeodhar@amd.com>	2023-05-10 12:19:56 -05:00
Unmesh Deodhar	af18806a8d	static-build: Add caching support to sev ovmf SEV requires special OVMF. Now that we have ability to build this custom OVMF, let's optimize it by caching so that we don't have to build it for every run. Fixes: sev: #6572 Signed-Off-By: Unmesh Deodhar <udeodhar@amd.com>	2023-05-10 12:19:55 -05:00
Unmesh Deodhar	76ae7a3abe	packaging: adding caching capability for kernel The SEV initrd build requires kernel modules. So, for SEV case, we need to cache kernel modules tarball in addition to kernel tarball. Fixes: #6572 Signed-Off-By: Unmesh Deodhar <udeodhar@amd.com>	2023-05-10 12:19:55 -05:00
Unmesh Deodhar	12c5ef9020	packaging: add support to build OVMF for SEV SEV requires special OVMF to work with kernel hashes. Thus, adding changes that build this custom OVMF for SEV. Fixes: #6572 Signed-Off-By: Unmesh Deodhar <udeodhar@amd.com>	2023-05-10 12:19:55 -05:00
Unmesh Deodhar	b87820ee8c	packaging: add support to build initrd for sev We need special initrd for SEV. The work on SEV initrd is based on Ubuntu. Thus, adding another entry in versions.yaml. This binary will have '-sev' suffix to distinguish it from the generic binary. Fixes: #6572 Signed-Off-By: Unmesh Deodhar <udeodhar@amd.com>	2023-05-10 12:19:55 -05:00
James O. D. Hunt	e1f3b871cd	docs: Mark snap installation method as unmaintained The snap package is no longer being maintained so update the docs to warn readers. We'll remove the snap installation docs in a few weeks. See: #6769. Fixes: #6793. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-05-10 18:02:46 +01:00
Jeremi Piotrowski	022a33de92	agent: Add context to errors when AgentConfig file is missing When the agent config file is missing, the panic message says "no such file or directory" but doesn't inform the user about which file was missing. Add context to the parsing (with filename) and to the from_config_file() calls (with information where the path is coming from). Fixes: #6771 Depends-on: github.com/kata-containers/tests#5627 Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-05-10 08:43:16 +02:00
Fabiano Fidêncio	6881b9558b	Merge pull request #6512 from gabevenberg/log-parser-rs Log-parser-rs	2023-05-10 08:22:59 +02:00
Chao Wu	7218229af0	Merge pull request #6594 from Apokleos/warning_fix_1.68.0 warning_fix: fix warnings when build with cargo-1.68.0	2023-05-10 09:51:45 +08:00
Unmesh Deodhar	b0e6a094be	packaging: Add sev kernel build capability Adding code that builds sev kernel. Fixes: #6572 Signed-off-by: Unmesh Deodhar <udeodhar@amd.com>	2023-05-09 13:47:22 -05:00
Tim Zhang	b0b5d7082e	Merge pull request #6753 from amshinde/add-cross-building-with-cross cross-compile: Include documentation and configuration for cross-compile	2023-05-09 16:31:40 +08:00
Feng Wang	4e0dce6802	Merge pull request #6738 from fengwang666/oss-fix-fd-leak runtime: Fix virtiofs fd leak	2023-05-08 10:52:36 -07:00
Eduardo Berrocal	a4c0303d89	virtcontainers: Fixed static checks for improved test coverage for fc.go Expanded tests on fc_test.go to cover more lines of code. Coverage went from 4.6% to 18.5%. Fixed very simple static check fail on line 202. Fixes: #266 Signed-off-by: Eduardo Berrocal <eduardo.berrocal@intel.com>	2023-05-07 00:17:36 -07:00
Peng Tao	65670e6b0a	Merge pull request #6699 from zvonkok/cold-plug-vfio gpu: cold plug VFIO devices	2023-05-05 10:04:29 +08:00
Archana Shinde	b86d32aba9	Merge pull request #6728 from nedsouza/256/tests_coverage_pkg_signals pkg/signals: Improved test coverage 60% to 100%	2023-05-04 16:19:12 -07:00
Archana Shinde	9443c4aea7	Merge pull request #6729 from nedsouza/259/tests_coverage_virtcontainers_persist virtcontainers/persist: Improved test coverage 65% to 87.5%	2023-05-04 16:18:55 -07:00
Archana Shinde	09134c30de	Merge pull request #6737 from nedsouza/265/virtcontainers-clh-go-coverage virtcontainers/clh_test.go: improve unit test coverage	2023-05-04 16:15:43 -07:00
Archana Shinde	8495f830b7	cross-compile: Include documentation and configuration for cross-compile `cross` is an open source tool that provides zero-setup cross compile for rust binaries. Add documentation on this tool for compiling kata-ctl tool and Cross.toml file that provides required configuration for installing dependencies for various targets. This is pretty useful for a developer to make sure code compiles and passes checks for various architectures. Fixes: #6765 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-05-04 14:13:00 -07:00
Bin Liu	e57ac2ae18	Merge pull request #6749 from nedsouza/260/tests_coverage_virtcontainers_factory virtcontainers/factory: Improved test coverage	2023-05-04 10:54:40 +08:00
Zvonko Kaiser	13d7f39c71	gpu: Check for VFIO port assignments Bailing out early if the port is wrong, allowed port settings are no-port, root-port, switch-port Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-05-03 12:32:33 +00:00
Gabe Venberg	6594a9329d	tools: made log-parser-rs Eventual replacement of kata-log-parser, but for now replicates its functionality for the new runtime-rs syntax. Takes in log files, parses, sorts by timestamp, spits them out in json, csv, xml, toml, and a few others. Fixes #5350 Signed-off-by: Gabe Venberg <gabevenberg@gmail.com>	2023-05-02 13:16:54 -05:00
Wainer Moschetta	f5ff975560	Merge pull request #6723 from ryansavino/gha-k8s-also-test-snp gha: Also run k8s tests on qemu-snp	2023-05-01 10:37:12 -03:00
Fabiano Fidêncio	b6e54676eb	Merge pull request #6759 from ryansavino/gha-sev-kata-deploy-fix gha: sev: fix for kata-deploy error	2023-05-01 11:42:16 +02:00
Eduardo Berrocal	03a8cd69c2	virtcontainers: Improved test coverage for fc.go from 4.6% to 18.5% Expanded tests on fc_test.go to cover more lines of code. Coverage went from 4.6% to 18.5%. Fixes: #266 Signed-off-by: Eduardo Berrocal <eduardo.berrocal@intel.com>	2023-04-28 15:40:45 -07:00
Ryan Savino	9e2b7ff177	gha: sev: fix for kata-deploy error kubectl commands need a '-f' instead of a '-k' Fixes: #6758 Signed-Off-By: Ryan Savino <ryan.savino@amd.com>	2023-04-28 14:54:36 -05:00
Ryan Savino	5c9246db19	gha: Also run k8s tests on qemu-snp Added the k8s tests for qemu-snp Fixes: #6722 Signed-Off-By: Ryan Savino <ryan.savino@amd.com>	2023-04-28 14:43:53 -05:00
Ryan Savino	c57a44436c	gha: Add the ability to test qemu-snp With the changes proposed as part of this PR, a qemu-snp cluster will be created but no tests will be performed. GitHub Actions will only run the tests using the workflows that are part of the target branch, instead of using the ones coming from the PR. No way to work around this for now. After this commit is merged, the tests (not the yaml files for the actions) will be altered in order for the checkout action to help in this case. Fixes: #6722 Signed-off-by: Ryan Savino <ryan.savino@amd.com>	2023-04-28 13:07:13 -05:00
Wainer Moschetta	29785a43d7	Merge pull request #6712 from ryansavino/gha-k8s-also-test-sev gha: Also run k8s tests on qemu-sev	2023-04-28 14:22:03 -03:00
Archana Shinde	65c61785fc	Merge pull request #6660 from amshinde/kata-ctl-cmd Implement the "kata-ctl env" command	2023-04-28 01:33:28 -07:00
Archana Shinde	4064192896	env: Utilize arch specific functionality to get cpu details Have kata-env call architecture specific function to get cpu details instead of generic function to get cpu details that works only for certain architectures. The functionality for cpu details has been fully implemented for x86_64 and arm architectures, but needs to be implemented for s390 and powerpc. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-04-27 16:45:41 -07:00
Archana Shinde	fb40c71a21	env: Check for root privileges Check for root privileges early on. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-04-27 16:45:41 -07:00
Archana Shinde	1016bc17b7	config: Add api to fetch config from default config path Add api to fetch config from default config path and use that in kata-ctl tool. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-04-27 16:45:41 -07:00
Archana Shinde	b908a780a0	kata-env: Pass cmd option for file path Add ability to write the environment information to a file or stdout if file path is absent. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-04-27 16:45:41 -07:00
Archana Shinde	b1920198be	config: Workaround the way agent and hypervisor configs are fetched This is essentially a workaround for the issue: https://github.com/kata-containers/kata-containers/issues/5954 runtime-rs changes the Kata config format adding agent_name and hypervisor_name which are then used as keys to fetch the agent and hypervisor configs. This will not work for older configs. So use the first entry in the hashmaps to fetch the configs as a workaround while the config change issue is resolved. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-04-27 16:45:41 -07:00
Archana Shinde	f2b2621dec	kata-env: Implement the kata-env command. Command implements functionality to get user environment settings. Fixes: #5339 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-04-27 16:45:41 -07:00
Ryan Savino	c849bdb0a5	gha: Also run k8s tests on qemu-sev Added the k8s tests for qemu-sev Fixes: #6711 Signed-Off-By: Ryan Savino <ryan.savino@amd.com>	2023-04-27 15:24:08 -05:00
Eduardo Berrocal	6bf1fc6051	virtcontainers/factory: Improved test coverage Expanded tests on factory_test.go to cover more lines of code. Coverage went from 34% to 41.5% in the case of user-mode run tests, and from 77.7% to 84% in the case of privileged-mode run tests. Fixes: #260 Signed-off-by: Eduardo Berrocal <eduardo.berrocal@intel.com>	2023-04-27 13:08:35 -07:00
Tamas K Lengyel	0d49ceee0b	gha: Fix snap creation workflow warnings Fix recurring issues of failing to install dependencies due to stale apt cache. Uprev actions/checkout to v3 to resolve issue "Node.js 12 actions are deprecated." Fixes: #5659 Signed-off-by: Tamas K Lengyel <tamas.lengyel@intel.com>	2023-04-27 18:40:02 +00:00
Zvonko Kaiser	138ada049c	gpu: Cold Plug VFIO toml setting Added the cold_plug_vfio setting to the qemu-toml.in with some explanation Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-04-27 11:04:45 +00:00
Amulyam24	defb643346	runtime: remove overriding ARCH value by default for ppc64le Currently, ARCH value is being set to powerpc64le by default. powerpc64le is only right in context of rust and any operation which might use this variable for a different purpose would fail on ppc64le. Fixes: #6741 Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2023-04-27 16:17:48 +05:30
Zvonko Kaiser	f7ad75cb12	gpu: Cold-plug extend the api.md Make the hypervisorconfig consistent in code and api.md Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-04-27 09:35:05 +00:00
Zvonko Kaiser	0fec2e6986	gpu: Add cold-plug test Cold plug setting is now correctly decoded in toml Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-04-27 09:30:24 +00:00
Archana Shinde	f2ebdd81c2	utils: Get rid of spurious print statement left behind. The print was used for debugging, get rid of it. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-04-26 22:12:30 -07:00
Archana Shinde	9a94f1f149	make: Export VERSION and COMMIT These will be consumed by kata-ctl, so export these so that they can be used to replace variables available to the rust binary. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-04-26 22:12:30 -07:00
Archana Shinde	2f81f48dae	config: Add file under /opt as another location to look for the config Most of kata installation tools use this path for installation, so add this to the paths to look for the configuration.toml file. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-04-26 22:12:30 -07:00
Archana Shinde	07f7d17db5	config: Make the pipe_size field optional Add the serde default attribute to the field so that parsing can continue if this field is not present. The agent assumes a default value for this, so it is not required by the user to provide a value here. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-04-26 22:12:30 -07:00
Archana Shinde	68f6357731	config: Make function to get the default conf file public This will be used by the kata-env command. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-04-26 22:12:30 -07:00
Archana Shinde	7565b33568	kata-ctl: Implement Display trait for GuestProtection enum Implement Display for enum to display in env output. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-04-26 22:12:30 -07:00
Archana Shinde	94a00f9346	utils: Make certain constants in utils.rs public These would be used outside of utils. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-04-26 22:12:30 -07:00
Archana Shinde	572b338b3b	gitignore: Ignore .swp and .swo editor backup files Ignore temporary files created by vim editor. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-04-26 22:12:30 -07:00
Archana Shinde	376884b8a4	cargo: Update version of clap to 4.1.13 This version includes macros related to using command options. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-04-26 22:12:30 -07:00
alex.lyn	17daeb9dd7	warning_fix: fix warnings when build with cargo-1.68.0 Fixes: #6593 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-04-27 10:29:50 +08:00
Ryan Savino	521519d745	gha: Add the ability to test qemu-sev With the changes proposed as part of this PR, a qemu-sev cluster will be created but no tests will be performed. GitHub Actions will only run the tests using the workflows that are part of the target branch, instead of using the ones coming from the PR. No way to work around this for now. After this commit is merged, the tests (not the yaml files for the actions) will be altered in order for the checkout action to help in this case. Fixes: #6711 Signed-off-by: Ryan Savino <ryan.savino@amd.com>	2023-04-26 17:56:28 -05:00
Feng Wang	205909fbed	runtime: Fix virtiofs fd leak The kata runtime invokes removeStaleVirtiofsShareMounts after a container is stopped to clean up the stale virtiofs file caches. Fixes: #6455 Signed-off-by: Feng Wang <fwang@confluent.io>	2023-04-26 15:53:39 -07:00
Byron Marohn	5226f15c84	gha: Fix Body Line Length action flagging empty body commit messages Change the Body Line Length workflow to not trigger when the commit message contains only a message without a body. Other workflows will flag the missing body sections, and it was confusing to have an error message that said 'Body line too long (max 150)' when this was not actually the case. Fixes: #5561 Co-authored-by: Jayant Singh <jayant.singh@intel.com> Co-authored-by: Luke Phillips <lucas.phillips@intel.com> Signed-off-by: Byron Marohn <byron.marohn@intel.com> Signed-off-by: Jayant Singh <jayant.singh@intel.com> Signed-off-by: Luke Phillips <lucas.phillips@intel.com> Signed-off-by: Kelby Madal-Hellmuth <kelby.madal-hellmuth@intel.com> Signed-off-by: Liz Lawrens <liz.lawrens@intel.com>	2023-04-26 17:29:16 -04:00
Tamas K Lengyel	0f45b0faa9	virtcontainers/clh_test.go: improve unit test coverage Credit PR to Hackathon Team3 Fixes: #265 Signed-off-by: Tamas K Lengyel <tamas.lengyel@intel.com>	2023-04-26 19:12:51 +00:00
Zvonko Kaiser	dded731db3	gpu: Add OVMF setting for MMIO aperture The default size of OVMF's aperture is too low to initialize PCIe devices with huge BARs Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-04-26 09:47:37 +00:00
Zvonko Kaiser	2a830177ca	gpu: Add fwcfg helper function Added driver util function for easier handling of VFIO devices outside of the VFIO module. At the sandbox level we may need to set options depending on whether we have a VFIO/PCIe device, like the fwCfg for confidential guests. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-04-26 09:47:37 +00:00
Zvonko Kaiser	131f056a12	gpu: Extract VFIO Functions to drivers Some functions may be used in modules other than the VFIO module, extract them and make them available to other layers like sandbox. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-04-26 09:47:37 +00:00
Zvonko Kaiser	c8cf7ed3bc	gpu: Add ColdPlug of VFIO devices with devManager If we have a VFIO device and cold-plug is enabled we mark each device as ColdPlug=true and let the VFIO module do the attaching. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-04-26 09:47:37 +00:00
Zvonko Kaiser	e2b5e7f73b	gpu: Add Rawdevices to hypervisor RawDevices are used to get PCIe device info early before the sandbox is started to make better PCIe topology decisions Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-04-26 09:47:37 +00:00
Zvonko Kaiser	6107c32d70	gpu: Assign default value to cold-plug Make sure the configuration is propagated to the right structs and the default value is assigned. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-04-26 09:47:37 +00:00
Zvonko Kaiser	377ebc2ad1	gpu: Add configuration option for cold-plug VFIO Users can set cold-plug="root-port" to cold plug a VFIO device in QEMU Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-04-26 09:47:37 +00:00
Zvonko Kaiser	c18ceae109	gpu: Add new struct PCIePort For the hypervisor to distinguish between PCIe components, adding a new enum that can be used for hot-plug and cold-plug of PCIe devices Fixes: #6687 Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-04-26 09:47:37 +00:00
Bin Liu	509bc8b6c8	Merge pull request #6718 from openanolis/mengze/keep_abnormal runtime-rs: support keep_abnormal in toml config	2023-04-26 12:36:52 +08:00
Bin Liu	b6d880510a	Merge pull request #6595 from zvonkok/gpu-snp-tdx-kernel gpu: Build and Ship an GPU enabled Kernel	2023-04-26 12:33:51 +08:00
Eduardo Berrocal	9c38204f13	virtcontainers/persist: Improved test coverage 65% to 87.5% Expanded tests on manager_test.go to cover more lines of code. Fixes: #259 Signed-off-by: Eduardo Berrocal <eduardo.berrocal@intel.com>	2023-04-25 23:53:46 +00:00
Eduardo Berrocal	1c1ee8057c	pkg/signals: Improved test coverage 60% to 100% Expanded tests on signals_test.go to cover more lines of code. 'go test' won't show 100% coverage (only 66.7%), because one test needs to spawn a new process (since it is testing a function that calls os.Exit(1)). Fixes: #256 Signed-off-by: Eduardo Berrocal <eduardo.berrocal@intel.com>	2023-04-25 23:34:13 +00:00
mengze	cc8ea3232e	runtime-rs: support keep_abnormal in toml config This patch adds keep_abnormal in runtime config. If keep_abnormal = true, it means that 1) if the runtime exits abnormally, the cleanup process will be skipped, and 2) the runtime will not exit even if the health check fails. This option is typically used to retain abnormal information for debugging and should NOT be enabled by default. Fixes: #6717 Signed-off-by: mengze <mengze@linux.alibaba.com> Signed-off-by: quanweiZhou <quanweiZhou@linux.alibaba.com>	2023-04-25 13:47:44 +08:00
David Esparza	7fdaab49bc	Merge pull request #6295 from dborquez/add_kernel_module_checks_kvm kata-ctl: checks for kvm, kvm_intel modules loaded	2023-04-24 13:33:18 -06:00
Greg Kurz	0ca6d3b726	Merge pull request #6681 from Vlad1mir-D/6677-fix-kata-agent-dbus-connection osbuilder: Fix D-Bus enabling in the dracut case	2023-04-24 17:31:13 +02:00
Bin Liu	3d8688f92e	Merge pull request #6620 from jongwu/docker_fail_start_snap snap: fix docker start fail issue	2023-04-24 10:53:16 +08:00
Archana Shinde	97291d88e9	Merge pull request #6696 from amshinde/kata-manager-containerd-fix kata-manager: Fix containerd download	2023-04-21 09:54:30 -07:00
Archana Shinde	96e8470dbe	kata-manager: Fix containerd download Newer containerd releases have an additional static package published. Because of this, download_url contains two urls causing curl to fail. To resolve this, pick the first url from the containerd releases to download containerd. Fixes: #6695 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-04-20 23:08:51 -07:00
David Esparza	432d407440	kata-ctl: checks for kvm, kvm_intel modules loaded Ensure that kvm and kvm_intel modules are loaded. Renames the get_cpu_info() function to read_file_contents() Fixes #5332 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-04-20 11:29:36 -06:00
Zvonko Kaiser	b1730e4a67	gpu: Add new kernel build option to usage() With each release make sure we ship a GPU enabled kernel Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-04-20 07:48:30 +00:00
Fupan Li	ceefd50bd0	Merge pull request #6680 from Tim-Zhang/fix-ut-bad-fd agent: Fix ut issue caused by fd double closed	2023-04-20 11:18:27 +08:00
Fupan Li	a7b4b69230	Merge pull request #6673 from Tim-Zhang/upgrade-ttrpc-protobuf Bump ttrpc to 0.7.2 and protobuf to 3.2.0	2023-04-20 10:13:43 +08:00
Fupan Li	a1568cd2f5	Merge pull request #6676 from zvonkok/gpu-runtime gpu: Add GPU enabled configuration and runtime	2023-04-19 13:01:49 +08:00
Vladimir	3e7b902265	osbuilder: Fix D-Bus enabling in the dracut case - D-Bus enabling now occurs only in setup_rootfs (instead of prepare_overlay and setup_rootfs) - Adjust permissions of / so dbus-broker will be able to traverse FS These changes enable kata-agent to successfully communicate with D-Bus. Fixes #6677 Signed-off-by: Vladimir <amigo.elite@gmail.com>	2023-04-18 23:17:34 +03:00
Tim Zhang	53c749a9de	agent: Fix ut issue caused by fd double closed Never ever try to close the same fd double times, even in a unit test. A file descriptor is a number which will be reused, so when you close the same number twice you may close another file descriptor in the second time and then there will be an error 'Bad file descriptor (os error 9)' while the wrongly closed fd is being used. Fixes: #6679 Signed-off-by: Tim Zhang <tim@hyper.sh>	2023-04-18 23:19:10 +08:00
Hyounggyu Choi	5c032c64ac	Merge pull request #6664 from zvonkok/vfio-fix gpu: Do not pass-through PCI (Host) Bridges	2023-04-18 19:50:15 +09:00
Tim Zhang	2e3f19af92	agent: fix clippy warnings caused by protobuf3 Fix warnings introduced by protobuf upgrade. Signed-off-by: Tim Zhang <tim@hyper.sh>	2023-04-17 20:15:49 +08:00
Tim Zhang	4849c56faa	agent: Fix unit test issue caused by protobuf upgrade Fixes: #6646 Signed-off-by: Tim Zhang <tim@hyper.sh>	2023-04-17 19:49:21 +08:00
Tim Zhang	0a582f7815	trace-forwarder: remove unused crate protobuf Remove unused crate protobuf. Signed-off-by: Tim Zhang <tim@hyper.sh>	2023-04-17 19:49:21 +08:00
Tim Zhang	73253850e6	kata-ctl: remove unused crate ttrpc Remove unused crate ttrpc. Signed-off-by: Tim Zhang <tim@hyper.sh>	2023-04-17 19:49:21 +08:00
Tim Zhang	76d2e30547	agent-ctl: Bump ttrpc from 0.6.0 to 0.7.1 Fixes: #6646 Signed-off-by: Tim Zhang <tim@hyper.sh>	2023-04-17 19:49:21 +08:00
Tim Zhang	eb3d20dccb	protocols: Add ut for Serde Fixes: #6646 Signed-off-by: Tim Zhang <tim@hyper.sh>	2023-04-17 19:49:21 +08:00
Tim Zhang	59568c79dd	protocols: add support for Serde rust-protobuf@3 does not support Serde natively anymore. So we need to do it by ourselves. Signed-off-by: Tim Zhang <tim@hyper.sh>	2023-04-17 19:49:21 +08:00
Tim Zhang	a6b4d92c84	runtime-rs: Bump ttrpc from 0.6.0 to 0.7.1 Fixes: #6646 Signed-off-by: Tim Zhang <tim@hyper.sh>	2023-04-17 19:49:20 +08:00
Zvonko Kaiser	ac7c63bc66	gpu: Add containerd shim for qemu-gpu Last but not least add the containerd shim configuration pointing to the correct configuration-<shim>.toml Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-04-17 10:45:04 +00:00
Zvonko Kaiser	a0cc8a75f2	gpu: Add a kube runtime class With the added configuration add the corresponding kube runtime class. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-04-17 10:42:04 +00:00
Zvonko Kaiser	a81fff706f	gpu: Adding a GPU enabled configuration We need to set hotplug on pci root port and enable at least one root port. Also set the guest-hooks-dir to the correct path Fixes: #6675 Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-04-17 10:40:09 +00:00
Tim Zhang	8af6fc77cd	agent: Bump ttrpc from 0.6.0 to 0.7.1 Fixes: #6646 Signed-off-by: Tim Zhang <tim@hyper.sh>	2023-04-17 18:31:41 +08:00
Tim Zhang	009b42dbff	protocols: Fix unit test Fixes: #6646 Signed-off-by: Tim Zhang <tim@hyper.sh>	2023-04-17 18:31:41 +08:00
Tim Zhang	392732e213	protocols: Bump ttrpc from 0.6.0 to 0.7.1 Fixes: #6646 Signed-off-by: Tim Zhang <tim@hyper.sh>	2023-04-17 18:31:35 +08:00
Zvonko Kaiser	f4f958d53c	gpu: Do not pass-through PCI (Host) Bridges On some systems a GPU is in a IOMMU group with a PCI Bridge and PCI Host Bridge. Per default no PCI Bridge needs to be passed-through. When scanning the IOMMU group, ignore devices with a 0x60 class ID prefix. Fixes: #6663 Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-04-17 10:08:23 +00:00
Zvonko Kaiser	825e769483	gpu: Add GPU support to default kernel without any TEE With each release make sure we ship a GPU enabled kernel Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-04-17 09:58:58 +00:00
Zvonko Kaiser	e4ee07f7d4	gpu: Add GPU TDX experimental kernel With each release make sure we ship a GPU and TEE enabled kernel This adds tdx-experimental kernel support Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-04-17 09:58:52 +00:00
Fabiano Fidêncio	243cb2e3af	Merge pull request #6670 from fidencio/topic/fix-caching-of-tdvf-and-tdx-qemu cache-components: Fix caching of TDVF and QEMU for TDX	2023-04-16 09:04:04 +02:00
Fabiano Fidêncio	a1272bcf1d	gha: tdx: Fix typo overlay -> overlays The beauty of GHA not allowing us to easily test changes in the yaml files as part of the PR has hit us again. :-/ The correct path for the k3s deployment is tools/packaging/kata-deploy/kata-deploy/overlays/k3s instead of tools/packaging/kata-deploy/kata-deploy/overlay/k3s. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-15 15:00:06 +02:00
Fabiano Fidêncio	3fa0890e5e	cache-components: Fix TDVF caching TDVF caching is not working as the tarball name is incorrect. The result expected is kata-static-tdvf.tar.xz, but it's looking for kata-static-tdx.tar.xz. This happens as a logic to convert tdx -> tdvf has been added as part of the building scripts, but I missed doing this as part of the caching scripts. Fixes: #6669 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-15 14:12:29 +02:00
Fabiano Fidêncio	80e3a2d408	cache-components: Fix TDX QEMU caching TDX QEMU caching is not working as expected, as we're checking for its version looking at "assets.hypervisor.${QEMU_FLAVOUR}.version", which is correct for standard QEMU. However, for TDX QEMU we should be checking for "assets.hypervisor.${QEMU_FLAVOUR}.tag" Fixes: #6668 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-15 14:12:26 +02:00
Fabiano Fidêncio	fffe2c6082	Merge pull request #6648 from fidencio/topic/gha-tdx-improvements-and-fixes gha: tdx: Ensure kata-deploy is removed after the tests run	2023-04-15 00:21:31 +02:00
Bo Chen	a819ce145f	Merge pull request #6633 from likebreath/0406/clh_v31.0 versions: Upgrade to Cloud Hypervisor v31.0	2023-04-14 13:52:19 -07:00
Zvonko Kaiser	87ea43cd4e	gpu: Add configuration fragment Adding configuration fragment for the kernel, depending on the TEE kernel update the LOCALVERSION Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-04-14 07:52:51 +00:00
Zvonko Kaiser	aca6ff7289	gpu: Build and Ship an GPU enabled Kernel With each release make sure we ship a GPU and TEE enabled kernel Fixes: #6553 Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-04-14 07:52:42 +00:00
Fabiano Fidêncio	dc662333df	runtime: Increase the dial_timeout When testing on AKS, we've been hitting the dial_timeout every now and then. Let's increase it to 45 seconds (instead of 30) for all the VMMs, and to 60 seconds in case of TEEs. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-13 22:42:52 +02:00
Greg Kurz	897c0bc67e	Merge pull request #6658 from gkurz/osbuilder-dracut-dbus osbuilder: Enable dbus in the dracut case	2023-04-13 19:03:15 +02:00
Greg Kurz	eb1762e813	osbuilder: Enable dbus in the dracut case The agent now offloads cgroup configuration to systemd when possible. This requires to enable D-Bus in order to communicate with systemd. Fixes #6657 Signed-off-by: Greg Kurz <groug@kaod.org>	2023-04-13 14:16:50 +02:00
Greg Kurz	f9a94f8fc5	Merge pull request #6623 from UiPath/fix-no-space-device runtime: Don't create socket file in /run/kata	2023-04-13 10:36:20 +02:00
Fabiano Fidêncio	f478b9115e	clh: tdx: Update timeouts for confidential guest Booting up TDX takes more time than booting up a normal VM. Those values are being already used as part of the CCv0 branch, and we're just bringing them to the `main` branch as well. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-13 10:18:07 +02:00
Fabiano Fidêncio	3b76abb366	kata-deploy: Ensure node is ready after CRI Engine restart Let's ensure the node is ready after the CRI Engine restart, otherwise we may proceed and scripts may simply fail if they try to deploy a pod while the CRI Engine is not yet restarted (and, consequently, the node is not Ready). Related: #6649 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-13 10:18:07 +02:00
Fabiano Fidêncio	5ec9ae0f04	kata-deploy: Use readinessProbe to ensure everything is ready readinessProbe will help us to only have the kata-deploy pod marked as Ready when it finishes all the needed configurations in the node. Related: #6649 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-13 10:18:07 +02:00
Fabiano Fidêncio	ea386700fe	kata-deploy: Update podOverhead for TDX As TEEs cannot hotplug memory / CPU, we must consider the default values for those as part of the podOverhead. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-13 10:18:07 +02:00
Fabiano Fidêncio	e31efc861c	gha: tdx: Use the k3s overlay As the TDX machine is using k3s, let's make sure we're deploying kata-deploy using the k3s overlay. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-13 10:18:07 +02:00
Fabiano Fidêncio	542bb0f3f3	gha: tdx: Set KUBECONFIG env at the job level By doing this we avoid having to set it up on every step. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-13 10:18:07 +02:00
Fabiano Fidêncio	d7fdf19e9b	gha: tdx: Delete kata-deploy after the tests finish We must ensure that no kata-deploy is left behind after the tests finish, otherwise it may interfere with the next run. Fixes: #6647 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-13 10:18:07 +02:00
Fabiano Fidêncio	da35241a91	tests: k8s: Skip k8s-cpu-ns when testing TDX TEEs do not support CPU / memory hotplug, thus this test must be skipped. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-13 10:18:07 +02:00
Alexandru Matei	db2cac34d8	runtime: Don't create socket file in /run/kata The socket file for shim management is created in /run/kata and it isn't deleted after the container is stopped. After running and stopping thousands of containers /run folder will run out of space. Fixes #6622 Signed-off-by: Alexandru Matei <alexandru.matei@uipath.com> Co-authored-by: Greg Kurz <groug@kaod.org>	2023-04-13 10:21:29 +03:00
Jianyong Wu	6d315719f0	snap: fix docker start fail issue In Arm baseline CI, docker fails to start with error: "no sockets found via socket activation: make sure the service was started by systemd". I found a solution in [1] to fix it. [1] https://forums.docker.com/t/failed-to-load-listeners-no-sockets-found-via-socket-activation-make-sure-the-service-was-started-by-systemd/62505 Fixes: #6619 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2023-04-13 09:35:40 +08:00
Zhongtao Hu	328793bb27	Merge pull request #6585 from Apokleos/nydus_prefetch_files nydus_rootfs/prefetch_files: add prefetch_files for RAFS	2023-04-12 19:58:36 +08:00
Zvonko Kaiser	e4b3b08871	gpu: Add proper CONFIG_LOCALVERSION depending on TEE If conf_guest is set we need to update the CONFIG_LOCALVERSION to match the suffix created in install_kata -nvidia-gpu-{snp|tdx}, the linux headers will be named the very same if built with make deb-pkg for TDX or SNP. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-04-12 11:30:59 +00:00
Zhongtao Hu	fef531f565	Merge pull request #6618 from Apokleos/virtiofs_extra_cache_mode runtime-rs/virtio-fs: add support extra handler for cache mode.	2023-04-12 14:40:05 +08:00
Bin Liu	9327bb0912	Merge pull request #6639 from openanolis/nerdctl runtime-rs: enable nerdctl to setup cni plugin	2023-04-12 12:04:37 +08:00
Zhongtao Hu	69ba2098f8	runtime-rs: remove network entities and netns remove network entities and netns Fixes:#4693 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2023-04-12 10:21:06 +08:00
Zhongtao Hu	b31f103d12	runtime-rs: enable nerdctl cni plugin 1. when we use nerdctl to setup network for kata, no netns is created by nerdctl, kata need to create netns by its own 2. after start VM, nerdctl will call cni plugin via oci hook, we need to rescan the netns after the interfaces have been created, and hotplug the network device into the VM Fixes:#4693 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2023-04-12 10:21:04 +08:00
Fabiano Fidêncio	3b3656d96d	Merge pull request #6522 from fidencio/topic/add-tdx-artefacts-from-2023ww01-to-main tdx: Add artefacts from the latest TDX tools release into main	2023-04-11 20:43:02 +02:00
Fabiano Fidêncio	50ce33b02d	Merge pull request #6205 from fengwang666/non-root-clh runtime: support non-root for clh	2023-04-11 19:34:00 +02:00
Fabiano Fidêncio	4751adbea1	Merge pull request #6610 from fidencio/topic/gha-run-dragonball-k8s-tests gha: ci-on-push: Run k8s tests with dragonball	2023-04-11 18:16:14 +02:00
Fabiano Fidêncio	69d7a959c8	gha: ci-on-push: Run tests on TDX Now that we've added a TDX capable external runner, let's make sure we also run the basic tests using TDX. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 16:10:35 +02:00
Fabiano Fidêncio	5a0727ecb4	kata-deploy: Ship kata-qemu-tdx runtimeClass Let's make sure we configure containerd for the kata-qemu-tdx handler and ship the kata-qemu-tdx runtime class for kubernetes. Fixes: #6537 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 16:10:35 +02:00
Fabiano Fidêncio	98682805be	config: Add configuration for QEMU TDX As the QEMU configuration for TDX differs quite a lot from the normal QEMU configuration, let's add a new configuration file for the QEMU TDX. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 16:10:35 +02:00
Fabiano Fidêncio	3e15800199	govmm: Directly pass the firmware using -bios with TDX Since TDX doesn't support readonly memslot, TDVF cannot be mapped as pflash device and it actually works as RAM. "-bios" option is chosen to load TDVF. OVMF is the opensource firmware that implements the TDVF support. Thus the command line to specify and load TDVF is ``-bios OVMF.fd`` Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 15:23:42 +02:00
Fabiano Fidêncio	3c5ffb0c85	govmm: Set "sept-ve-disable=on" This is needed since 22ww49. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 15:23:42 +02:00
Fabiano Fidêncio	ed145365ec	runtime/qemu: Drop "kvm-type=tdx" This is not supported since 22ww49. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 15:23:42 +02:00
Fabiano Fidêncio	25b3cdd38c	virtcontainers: Drop check for the `tdx` CPU flag In the recent kernels provided by Intel the `tdx` CPU flag is not present anymore. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 15:23:42 +02:00
Fabiano Fidêncio	01bdacb4e4	virtcontainers: Also check /sys/firmwares/tdx for TDX Let's make sure we also check /sys/firmwares/tdx for TDX guest protection, as the location may depend on whether TDX Seam is being used or not. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 15:23:42 +02:00
Fabiano Fidêncio	9feec533ce	cache: Add ability to cache OVMF Let's add the ability to cache OVMF, which right now we're only building and shipping it for TDX. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 15:23:42 +02:00
Fabiano Fidêncio	ce8d982512	gha: Build and ship the OVMF for TDX Let's build the OVMF with TDX support as part of our tests, and let's ship it as part of our releases. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 15:23:42 +02:00
Fabiano Fidêncio	39c3fab7b1	local-build: Add support to build OVMF for TDX Let's add the needed targets and modifications to be able to build OVMF for TDX as part of the local-build scripts. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 15:23:42 +02:00
Fabiano Fidêncio	054174d3e6	versions: Bump OVMF for TDX Let's update the OVMF for TDX version to what's the latest tested release of the Intel TDX tools with Kata Containers. This change requires a newer version of `nasm` than the one provided by the container used to build the project. This change will also be needed for SEV-SNP and was originally done by Alex Carter (thanks!). Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Signed-off-by: Alex Carter <Alex.Carter@ibm.com>	2023-04-11 15:23:42 +02:00
Fabiano Fidêncio	800fb49da1	packaging: Add get_ovmf_image_name() helper As we'll be using this from different places in the near future, let's create a helper function as part of the libs.sh. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 15:23:42 +02:00
Fabiano Fidêncio	fbf03d7aca	cache: Document kernel-tdx-experimental Let's make users aware of the cache_components_main.sh that they can also cache the kernel-tdx-experimental builds. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 15:23:42 +02:00
Fabiano Fidêncio	5d79e96966	cache: Add a space to ease the reading of the kernel flavours Right now it's quite hard to read those, let's improve it a little bit. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 15:23:42 +02:00
Fabiano Fidêncio	6e4726e454	cache: Fix typos Let's just fix a few simple typos: * kernek -> kernel * experimetnal -> experimental Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 15:23:42 +02:00
Fabiano Fidêncio	fc22ed0a8a	gha: Build and ship the Kernel for TDX Let's build the kernel with TDX support as part of our tests, and let's ship it as part of our releases. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 15:23:42 +02:00
Fabiano Fidêncio	502844ced9	local-build: Add support to build Kernel for TDX Let's add the needed targets and modifications to be able to build kernel-tdx-experimental as part of the local-build scripts. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 15:23:42 +02:00
Fabiano Fidêncio	b2585eecff	local-build: Avoid code duplication building the kernel Let's create a `install_kernel_helper()` function, as it was already done for QEMU, and rely on that when calling `install_kernel` and `install_kernel_dragonball_experimental`. This helps us to reduce the code duplication by a fair amount. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 15:23:42 +02:00
Fabiano Fidêncio	f33345c311	versions: Update Kernel TDX version Let's update the Kernel TDX version to what's the latest tested release of the Intel TDX tools with Kata Containers. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 15:23:42 +02:00
Fabiano Fidêncio	20ab2c2420	versions: Move Kernel TDX to its own experimental entry Although we've been providing users a way to build kernel with TDX support, this must be moved to its own experimental entry instead of how it currently is. The reason for that is because the patches are not yet merged into kernel, and this is still an experimental build of the project. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 15:23:42 +02:00
Fabiano Fidêncio	3d9ce3982b	cache: Allow specifying the QEMU_FLAVOUR Let's do what we already did when caching the kernel, and allow passing a FLAVOUR of the project to build. By doing this we can re-use the same function used to cache QEMU to also cache any kind of experimental QEMU that we may happen to have. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 15:23:42 +02:00
Fabiano Fidêncio	33dc6c65aa	gha: Build and ship QEMU for TDX Let's build QEMU TDX as part of our tests, and let's ship it as part of our releases. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 15:23:42 +02:00
Fabiano Fidêncio	eceaae30a5	local-build: Add support to build QEMU for TDX Let's add the needed targets and modifications to be able to build qemu-tdx-experimental as part of the local-build scripts. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 15:23:42 +02:00
Fabiano Fidêncio	f7b7c187ec	static-build: Improve qemu-experimental build script Let's make sure the `qemu_suffix` and `qemu_tarball_name` can be specified. With this we make it really easy to reuse this script for any additional flavour of an experimental QEMU that ends up having to be built (specifically looking at the ones for Confidential Containers here). Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 15:17:04 +02:00
Fabiano Fidêncio	3018c9ad51	versions: Update QEMU TDX version Let's update the QEMU TDX version to what's the latest tested release of the Intel TDX tools with Kata Containers. In order to do such update, we had to relax the checks on the QEMU version for some of the configuration options, as those were removed right after the window was open for the 7.1.0 development (thus the 7.0.50 check). Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 15:17:04 +02:00
Fabiano Fidêncio	800ee5cd88	versions: Move QEMU TDX to its own experimental entry Although we've been providing users a way to build QEMU with TDX support, this must be moved to its own experimental entry instead of how it currently is. The reason for that is because the patches are not yet merged into QEMU, and this is still an experimental build of the project. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 15:17:04 +02:00
Fabiano Fidêncio	1315bb45f9	local-build: Add dragonball kernel to the `all` target As the dragonball kernel is shipped as part of our releases, it must be added to the `all` target. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 15:17:04 +02:00
Fabiano Fidêncio	73e108136a	local-build: Rename non vanilla kernel build functions In order to make it easier to read, let's just rename the install_dragonball_experimental_kernel and install_experimental_kernel to install_kernel_dragonball_experimental and install_kernel_experimental, respectively. This allows us to quickly get to those functions when looking for `install_kernel`. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 15:17:04 +02:00
Fabiano Fidêncio	1d851b4be3	local-build: Cosmetic changes in build targets This is a simple cosmetic change, adding a space between the function call and the `;;`. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 15:17:04 +02:00
Fabiano Fidêncio	49ce685ebf	gha: k8s-on-aks: Always delete the AKS cluster Regardless of the tests succeeding or failing, the AKS cluster must be deleted. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 13:40:40 +02:00
Fabiano Fidêncio	e2a770df55	gha: ci-on-push: Run k8s tests with dragonball Now that the infra for running dragonball tests has been enabled, let's actually make sure to have them running on each PR. The tests skipped are: * `k8s-cpu-ns.bats`, as CPU resize doesn't seem to be yet properly supported on runtime-rs * https://github.com/kata-containers/kata-containers/issues/6621 Fixes: #6605 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 11:47:47 +02:00
Fabiano Fidêncio	aee6174a53	Merge pull request #6637 from gkurz/cpu-shares-to-weight rustjail: Use CPUWeight with systemd and CgroupsV2	2023-04-11 10:55:48 +02:00
GabyCT	dc74133e74	Merge pull request #6631 from fidencio/topic/gha-create-delete-aks-cannot-be-workflows gha: k8s-on-aks: {create,delete} AKS must be a coded-in step	2023-04-10 14:05:24 -06:00
Zhongtao Hu	8cdec5707e	Merge pull request #6540 from houstar/main docs: update the rust version from version.yaml	2023-04-10 16:53:21 +08:00
Qingyuan Hou	d1f550bd1e	docs: update the rust version from versions.yaml Fixes: #6539 Signed-off-by: Qingyuan Hou <lenohou@gmail.com>	2023-04-10 03:34:15 +00:00
alex.lyn	f3595e48b0	nydus_rootfs/prefetch_files: add prefetch_files for RAFS A sandbox annotation is used to specify the prefetch_files.list path for the container image being used, and the runtime will pass it to the hypervisor to search for the corresponding prefetch file. The format looks like: "io.katacontainers.config.hypervisor.prefetch_files.list" = /path/to/<uid>/xyz.com/fedora:36/prefetch_file.list Fixes: #6582 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-04-10 10:05:52 +08:00
Zhongtao Hu	3bfaafbf44	fix: oci hook 1. when do the deserialization for the oci hook, we should use camel case for createRuntime 2. we should pass the dir of bundle path instead of the path of config.json Fixes:#4693 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2023-04-10 09:53:43 +08:00
Greg Kurz	c1fbaae8d6	rustjail: Use CPUWeight with systemd and CgroupsV2 The CPU shares property belongs to CgroupsV1. CgroupsV2 uses CPU weight instead. The correct value is computed in the latter case but it is passed to systemd using the legacy property. Systemd rejects the request and the agent exits with the following error: Value specified in CPUShares is out of range: unknown Replace the "shares" wording with "weight" in the CgroupsV2 code to avoid confusion. Use the "CPUWeight" property since this is what systemd expects in this case. Fixes #6636 References: https://www.freedesktop.org/software/systemd/man/systemd.resource-control.html#CPUWeight=weight https://www.freedesktop.org/software/systemd/man/systemd.resource-control.html#systemd%20252 https://github.com/containers/crun/blob/main/crun.1.md#cpu-controller Signed-off-by: Greg Kurz <groug@kaod.org>	2023-04-07 17:57:26 +02:00
Bo Chen	375187e045	versions: Upgrade to Cloud Hypervisor v31.0 Details of this release can be found in our new roadmap project as iteration v31.0: https://github.com/orgs/cloud-hypervisor/projects/6. Fixes: #6632 Signed-off-by: Bo Chen <chen.bo@intel.com>	2023-04-06 14:35:26 -07:00
Fabiano Fidêncio	79f3047f06	gha: k8s-on-aks: {create,delete} AKS must be a coded-in step I should have seen this coming, but currently the "create" and "delete" AKS workflows cannot be imported and used as a job's step, resulting in an error trying to find the corresponding action.yaml file for those. Fixes: #6630 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-06 22:56:08 +02:00
Fabiano Fidêncio	ee5dda012b	Merge pull request #6629 from fidencio/topic/gha-refactor-run-k8s-tests-on-aks gha: k8s-on-aks: Set {create,delete}_aks as steps	2023-04-06 22:02:34 +02:00
Fabiano Fidêncio	2f35b4d4e5	gha: ci-on-push: Only run on `main` branch Let's ensure we're only running this workflow when PRs are opened against the main branch. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-06 19:11:24 +02:00
Fabiano Fidêncio	e7bd2545ef	Revert "gha: ci-on-push: Depend on Commit Message Check" This reverts commit `a159ffdba7`. Unfortunately we have to revert the PRs related to the switch done to using `workflow_run` instead of `pull_request_target`. The reason for that being that we can only mark jobs as required if they are targeting PRs. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-06 19:11:14 +02:00
Fabiano Fidêncio	0d96d49633	Revert "gha: ci-on-push: Adjust to using workflow_run" This reverts commit `3a760a157a`. Unfortunately we have to revert the PRs related to the switch done to using `workflow_run` instead of `pull_request_target`. The reason for that being that we can only mark jobs as required if they are targeting PRs. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-06 19:11:06 +02:00
Fabiano Fidêncio	c7ee45f7e5	Revert "gha: ci-on-push: Adapt chained jobs to workflow_run" This reverts commit `7855b43062`. Unfortunately we have to revert the PRs related to the switch done to using `workflow_run` instead of `pull_request_target`. The reason for that being that we can only mark jobs as required if they are targeting PRs. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-06 19:09:54 +02:00
Fabiano Fidêncio	5d4d720647	Revert "gha: k8s-on-aks: Fix cluster name" This reverts commit `85cc5bb534`. Unfortunately we have to revert the PRs related to the switch done to using `workflow_run` instead of `pull_request_target`. The reason for that being that we can only mark jobs as required if they are targeting PRs. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-06 19:07:04 +02:00
Fabiano Fidêncio	13d857a56d	gha: k8s-on-aks: Set {create,delete}_aks as steps We've been currently using {create,delete}_aks as jobs. However, it means that if the tests fail we'll end up deleting the AKS cluster (as expected), but not having a way to recreate the cluster without re-running all jobs, which is a waste of resources. Fixes: #6628 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-06 16:54:15 +02:00
Fabiano Fidêncio	abaf881f4a	Merge pull request #6612 from fidencio/topic/gha-k8s-on-aks-fix-cluster-name gha: k8s-on-aks: Fix cluster name	2023-04-06 10:48:38 +02:00
alex.lyn	dc6569dbbc	runtime-rs/virtio-fs: add support extra handler for cache mode. Add support for virtiofsd when virtio_fs_extra_args with "-o cache auto, ..." users specified. Fixes: #6615 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-04-06 16:31:02 +08:00
Fabiano Fidêncio	85cc5bb534	gha: k8s-on-aks: Fix cluster name This was missed from the last series, as GHA will use the "target branch" yaml file to start the workflow. Basically we changed the name of the cluster created to stop relying on the PR number, as that's not easily accessible on `workflow_run`. Fixes: #6611 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-06 08:50:07 +02:00
Fabiano Fidêncio	68cb5689f5	Merge pull request #6584 from fidencio/topic/gha-k8s-also-test-dragonball gha: Also run k8s tests on AKS with dragonball	2023-04-05 22:50:14 +02:00
Fabiano Fidêncio	ae488cc09f	Merge pull request #6596 from fidencio/topic/gha-only-push-to-registry-when-merging-content gha: Only push images to registry after merging a PR	2023-04-05 22:07:13 +02:00
Fabiano Fidêncio	2c38e17ef0	Merge pull request #6607 from fidencio/topic/gha-switch-to-using-a-D4_v5-instance gha: aks: Use D4s_v5 instance	2023-04-05 22:06:40 +02:00
Archana Shinde	6af52cef3a	Merge pull request #6590 from zvonkok/build-kernel-fix tools: Avoid building the kernel twice	2023-04-05 11:45:59 -07:00
Greg Kurz	a3e3b0591f	Merge pull request #6562 from c3d/issue/6561-unwrap-panic rustjail: Fix panic when cgroup manager fails	2023-04-05 16:58:13 +02:00
James O. D. Hunt	cbe6f04194	Merge pull request #6501 from shippomx/dev_metrics runtime: add filter metrics with specific names	2023-04-05 15:15:09 +01:00
Fabiano Fidêncio	1688e4f3f0	gha: aks: Use D4s_v5 instance It's been pointed out that D4s_v5 instances are more powerful than the D4s_v3 ones, and have the very same price. With this in mind, let's switch to the newer machines. Fixes: #6606 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-05 16:02:17 +02:00
Fabiano Fidêncio	108d80a86d	gha: Add the ability to also test Dragonball With the changes proposed as part of this PR, an AKS cluster will be created but no tests will be performed. The reason we have to do this is because GitHub Actions will only run the tests using the workflows that are part of the target branch, instead of using the ones coming from the PR, and we haven't yet found a way to work around this. Once this commit is in, we'll actually change the tests themselves (not the yaml files for the actions), as those will be the ones we want as the checkout action helps us in this case. Fixes: #6583 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-05 15:53:03 +02:00
Fabiano Fidêncio	2550d4462d	gha: build-kata-static-tarball: Only push to registry after merge `56331bd7bc` overlooked the fact that we mistakenly tried to push the build containers to the registry for a PR, rather than doing so only when the code is merged. As the workflow is now shared between different actions, let's introduce an input variable to specify which are the cases we actually need to perform a push to the registry. Fixes: #6592 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-05 13:57:26 +02:00
Fabiano Fidêncio	e81b8b8ee5	local-build: build-and-upload-payload is not quay.io specific Let's just print "to the registry" instead of printing "to quay.io", as the registry used is not tied to quay.io. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-05 12:54:44 +02:00
Fabiano Fidêncio	13929fc610	gha: publish-kata-deploy-payload: Improve registry login Let's only try to login to the registry that's being passed as an input argument. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-05 12:54:44 +02:00
Fabiano Fidêncio	41026f003e	gha: payload-after-push: Pass registry / repo as inputs We made registry / repo mandatory, but we only adapted that to the amd64 job. Let's fix it now and make sure this is also passed to the arm64 and s390x jobs. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-05 12:54:44 +02:00
Fabiano Fidêncio	7855b43062	gha: ci-on-push: Adapt chained jobs to workflow_run As we're using the `workflow_run` event, the checkout action would pull the current target branch instead of the PR one. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-05 12:54:44 +02:00
Fabiano Fidêncio	3a760a157a	gha: ci-on-push: Adjust to using workflow_run The way previously used to get the PR's commit sha can only be used with `pull_request*` kind of events. Let's adapt it to the `workflow_run` now that we're using it. With this change we ended up dropping the PR number from the tarball suffix, as that's not straightforward to get and, to be honest, not a unique differentiator that would justify the effort. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-05 12:54:44 +02:00
Fabiano Fidêncio	a159ffdba7	gha: ci-on-push: Depend on Commit Message Check Let's make this workflow dependent of the commit message check, and only start it if the commit message check one passes. As a side effect, this allows us to run this specific workflow using secrets, without having to rely on `pull_request_target`. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-05 12:54:40 +02:00
Fabiano Fidêncio	8086c75f61	gha: Also run k8s tests on AKS with dragonball As already done for Cloud Hypervisor and QEMU, let's make sure we can run the AKS tests using dragonball. Fixes: #6583 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-04 10:58:47 +02:00
Fabiano Fidêncio	1c6d7cb0f7	Merge pull request #6589 from fidencio/topic/gha-k8s-use-ghcr-instead-of-quay gha: Use ghcr.io for the k8s CI	2023-04-04 10:48:16 +02:00
Zvonko Kaiser	fe86c08a63	tools: Avoid building the kernel twice Two different kernel build targets (build,install) have both instructions to build the kernel, hence it was executed twice. Install should only do install and build should only do build. Fixes: #6588 Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-04-04 05:44:44 +00:00
Fabiano Fidêncio	3215860a47	gha: Set ci-on-push to run on `pull_request_target` This is less secure than running the PR on `pull_request`, and will require using an additional `ok-to-test` label to make sure someone deliberately ran the actions coming from a forked repo. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-03 20:50:36 +02:00
Fabiano Fidêncio	d17dfe4cdd	gha: Use ghcr.io for the k8s CI Let's switch to using the `ghcr.io` registry for the k8s CI, as this will save us some troubles on running the CI with PRs coming from forked repos. Fixes: #6587 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-03 15:52:33 +02:00
Fabiano Fidêncio	e1f972fb1d	Merge pull request #6568 from kata-containers/topic/add-k8s-tests-as-part-of-gha GHA \|Switch "kubernetes tests" from jenkins to GitHub actions	2023-04-03 14:25:35 +02:00
Christophe de Dinechin	b661e0cf3f	rustjail: Add anyhow context for D-Bus connections In cases where the D-Bus connection fails, add a little additional context about the origin of the error. Fixes: 6561 Signed-off-by: Christophe de Dinechin <dinechin@redhat.com> Suggested-by: Archana Shinde <archana.m.shinde@intel.com> Spell-checked-by: Greg Kurz <gkurz@redhat.com>	2023-04-03 14:09:34 +02:00
Fabiano Fidêncio	60c62c3b69	gha: Remove kata-deploy-test.yaml This workflow becomes redundant as we're already testing kubernetes using kata-deploy, and also testing it on AKS. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-03-31 21:55:41 +02:00
Fabiano Fidêncio	43894e9459	gha: Remove kata-deploy-push.yaml This becomes redundant now that its steps are covered as part of the `ci-on-push.yaml`. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-03-31 21:55:41 +02:00
Fabiano Fidêncio	cab9ca0436	gha: Add a CI pipeline for Kata Containers This is the very first step to replacing the Jenkins CI, and I've decided to start with an x86_64 approach only (although easily expansible for other arches as soon as they're ready to switch), and to start running our kubernetes tests (now running on AKS). Fixes: #6541 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-03-31 21:55:41 +02:00
Fabiano Fidêncio	53b526b6bd	gha: k8s: Add snippet to run k8s tests on aks clusters This will be shortly used as part of a newly created GitHub action which will replace our Jenkins CI. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-03-31 21:55:41 +02:00
Fabiano Fidêncio	c444c24bc5	gha: aks: Add snippets to create / delete aks clusters Those will be shortly used as part of a newly added GitHub action for testing k8s tests on Azure. They've been created using the secrets we already have exposed as part of our GitHub, and they follow a similar way to authenticate to Azure / create an AKS cluster as done in the `/test-kata-deploy` action. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-03-31 21:55:41 +02:00
Fabiano Fidêncio	11e0099fb5	tests: Move k8s tests to this repo The first part of simplifying things to have all our tests using GitHub actions is moving the k8s tests to this repo, as those will be the first vict^W targets to be migrated to GitHub actions. Those tests have been slightly adapted, mainly related to what they load / import, so they are more self-contained and do not require us bringing a lot of scripts from the tests repo here. A few scripts were also dropped along the way, as we no longer plan to deploy kubernetes as part of every single run, but rather assume there will always be k8s running whenever we land to run those tests. It's important to mention that a few tests were not added here: * k8s-block-volume: * k8s-file-volume: * k8s-volume: * k8s-ro-volume: These tests depend on some sort of volume being created on the kubernetes node where the test will run, and this won't fly as the tests will run from a GitHub runner, targeting a different machine where kubernetes will be running. * https://github.com/kata-containers/kata-containers/issues/6566 * k8s-hugepages: This test depends a whole lot on the host where it lands and right now we cannot assume anything about that anymore, as the tests will run from a GitHub runner, targeting a different machine where kubernetes will be running. * https://github.com/kata-containers/kata-containers/issues/6567 * k8s-expose-ip: This is simply hanging when running on AKS and has to be debugged in order to figure out the root cause of that, and then adapted to also work on AKS. * https://github.com/kata-containers/kata-containers/issues/6578 Till those issues are solved, we'll keep running a jenkins job with those tests to avoid any possible regression. Last but not least, I've decided to not keep the history when bringing those tests here, otherwise we'd end up polluting a lot the history of this repo, without any clear benefit on doing so. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-03-31 21:55:41 +02:00
David Esparza	5d89d08fc4	Merge pull request #6564 from GabyCT/topic/updateneturl docs: Update CNM url in networking document	2023-03-31 09:58:55 -06:00
Fabiano Fidêncio	73be4bd3f9	gha: Update actions for release.yaml checkout@v2 should not be used anymore, please, see: https://github.blog/changelog/2022-09-22-github-actions-all-actions-will-begin-running-on-node16-instead-of-node12/ Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-03-31 13:24:26 +02:00
Fabiano Fidêncio	d38d7fbf1a	gha: Remove code duplication from release.yaml We can easily re-use the newly added build-kata-static-tarball-*.yaml as part of the release.yaml file. By doing this we consolidate on how we build the components across our actions. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-03-31 13:24:26 +02:00
Fabiano Fidêncio	56331bd7bc	gha: Split payload-after-push-.yaml Let's split those actions into two different ones: * Build the kata-static tarball * Publish the kata-deploy payload We're doing this as, later in this series we'll start taking advantage of both pieces. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-03-31 13:24:26 +02:00
Gabriela Cervantes	a552a1953a	docs: Update CNM url in networking document This PR updates the url for the Container Network Model in the network document. Fixes #6563 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-03-30 16:20:33 +00:00
Christophe de Dinechin	7796e6ccc6	rustjail: Fix minor grammatical error in function name Rename `unit_exist` function to `unit_exists` to match English grammar rule. Fixes: #6561 Signed-off-by: Christophe de Dinechin <dinechin@redhat.com>	2023-03-30 16:13:37 +02:00
Christophe de Dinechin	41fdda1d84	rustjail: Do not unwrap potential error with cgroup manager There can be an error while connecting to the cgroups manager, for example an `ENOENT` if a file is not found. Make sure that this is reported through the proper channels instead of causing a `panic()` that does not provide much information. Fixes: #6561 Signed-off-by: Christophe de Dinechin <dinechin@redhat.com> Reported-by: Greg Kurz <gkurz@redhat.com>	2023-03-30 16:09:13 +02:00
Archana Shinde	07e49c63e1	Merge pull request #6257 from amshinde/kata-ctl-env kata-ctl: add function to get platform protection.	2023-03-29 11:55:07 -07:00
Archana Shinde	a914283ce0	kata-ctl: add function to get platform protection. This function checks for tdx, sev or snp protection on x86 platform. Fixes: #1000 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-03-28 15:40:25 -07:00
Fabiano Fidêncio	245ed2cecf	Merge pull request #6536 from gkurz/3.2.0-alpha0-branch-bump # Kata Containers 3.2.0-alpha0	2023-03-28 16:05:10 +02:00
Wainer Moschetta	d0f79e66b9	Merge pull request #6513 from fidencio/topic/use-kata-deploy-local-build-as-part-of-the-snap-stuff snap: Build the artefacts using kata-deploy	2023-03-28 09:59:31 -03:00
Miao Xia	0f73515561	runtime: add filter metrics with specific names The kata monitor metrics API returns a very large response when there are many containers or sandboxes, which makes it harder to focus on the metrics we need. Fixes: #6500 Signed-off-by: Miao Xia <xia.miao1@zte.com.cn>	2023-03-28 14:56:13 +08:00
Greg Kurz	4a246309ee	release: Kata Containers 3.2.0-alpha0 - nydus: upgrad to v2.2.0 - osbuilder: Add support for CBL-Mariner - kata-deploy: Fix bash semantics error - make only_kata work without -f - runtime-rs: ch: Implement confidential guest handling - qemu/arm64: disable image nvdimm once no firmware offered - static checks workflow improvements - A couple of kata-deploy fixes - agent: Bring in VFIO-AP device handling again - bugfix: set hostname in CreateSandboxRequest - packaging / kata-deploy builds: Add the ability to cache and consume cached components - versions: Update firecracker version - dependency: update cgroups-rs - Built-in Sandbox: add more unit tests for dragonball. Part 6 - runtime: add support for Hyper-V - runtime-rs: update load_config comment - Add support for ephemeral mounts to occupy entire sandbox's memory - runtime-rs: fix default kernel location and add more default config paths - Implement direct-volume commands handler for shim-mgmt - bugfix: modify tty_win info in runtime when handling ResizePtyRequest - bugfix: add get_ns_path API for Hypervisor - runtime-rs: add the missing default trait - packaging: Simplify get_last_modification() - utils: Make kata-manager.sh runs checks - dragonball: support pmu on aarch64 - docs: fix typo in key filename in AWS installation guide - backport rustjail systemd cgroup fix #6331 to 3.1 - main \| kata-deploy: Fix kata deploy arm64 image build error - workflows: Yet more fixes for publishing the kata-deploy payload after every PR merged - rustjail: fix cgroup handling in agent-init mode - runtime/Makefile: Fix install-containerd-shim-v2 dependency - fix wrong notes for func GetSandboxesStoragePathRust() - fix(runtime-rs): add exited state to ensure cleanup - runtime-rs: add oci hook support - utils: Remove kata-manager.sh cgroups v2 check - workflows: Fixes for the `payload-after-push` action - Dragonball: update dependencies - workflows: Do not install docker - workflows: Publish kata-deploy 
payload after a merge - src: Fixed typo mod.rs - actions: Use `git-diff` to get changes in kernel dir - agent: don't set permission of existing directory in copy_file - runtime: use filepath.Clean() to clean the mount path - Upgrade to Cloud Hypervisor v30.0 - feat(runtime): make static resource management consistent with 2.0 - osbuilder: Include minimal set of device nodes in ubuntu initrd - kata-ctl/exec: add new command exec to enter guest VM. - kernel: Add CONFIG_SEV_GUEST to SEV kernel config - runtime-rs: Improve Cloud Hypervisor config handling - virtiofsd: update to a valid path on ppc64le - runtime-rs: cleanup kata host share path - osbuilder: fix default build target in makefile - devguide: Add link to the contribution guidelines - kata-deploy: Ensure go binaries can run on Ubuntu 20.04 - dragonball: config_manager: preserve device when update - Revert "workflows: Push the builder image to quay.io" - Remove all remaining unsafe impl - kata-deploy: Fix building the kata static firecracker arm64 package occurred an error - shim-v2: Bump Ubuntu container image to 22.04 - packaging: Cache the container used to build the kata-deploy artefacts - utils: always check some dependencies. - versions: Use ubuntu as the default distro for the rootfs-image - github-action: Replace deprecated command with environment file - docs: Change the order of release step - runtime-rs: remove unnecessary Send/Sync trait implement - runtime-rs: Don't build on Power, don't break on Power. 
- runtime-rs: handle sys_dir bind volume - sandbox: set the dns for the sandbox - packaging/shim-v2: Only change the config if the file exists - runtime-rs: Add basic CH implementation - release: Revert kata-deploy changes after 3.1.0-rc0 release `8b008fc743` kata-deploy: fix bash semantics error `74ec38cf02` osbuilder: Add support for CBL-Mariner `ac58588682` runtime-rs: ch: Generate Cloud Hypervisor config for confidential guests `96555186b3` runtime-rs: ch: Honour debug setting `e3c2d727ba` runtime-rs: ch: clippy fix `ece5edc641` qemu/arm64: disable image nvdimm if no firmware offered `dd23f452ab` utils: renamed only_kata to skip_containerd `59c81ed2bb` utils: informed pre-check about only_kata `4f0887ce42` kata-deploy: fix install failing to chmod runtime-rs/bin/* `09c4828ac3` workflows: add missing artifacts on payload-after-push `fbf891fdff` packaging: Adapt `get_last_modification()` `82a04dbce1` local-build: Use cached VirtioFS when possible `3b99004897` local-build: Use cached shim v2 when possible `1b8c5474da` local-build: Use cached RootFS when possible `09ce4ab893` local-build: Use cached QEMU when possible `1e1c843b8b` local-build: Use cached Nydus when possible `64832ab65b` local-build: Use cached Kernel when possible `04fb52f6c9` local-build: Use cached Firecracker when possible `8a40f6f234` local-build: Use cached Cloud Hypervisor when possible `194d5dc8a6` tools: Add support for caching VirtioFS artefacts `a34272cf20` tools: Add support for caching shim v2 artefacts `7898db5f79` tools: Add support for caching RootFS artefacts `e90891059b` tools: Add support for caching QEMU artefacts `7aed8f8c80` tools: Add support for caching Nydus artefacts `cb4cbe2958` tools: Add support for caching Kernel artefacts `762f9f4c3e` tools: Add support for caching Firecracker artefacts `6b1b424fc7` tools: Add support for caching Cloud Hypervisor artefacts `08fe49f708` versions: Adjust kernel names to match kata-deploy build targets `99505c0f4f` versions: Update 
firecracker version `f4938c0d90` bugfix: set hostname `96baa83895` agent: Bring in VFIO-AP device handling again `f666f8e2df` agent: Add VFIO-AP device handling `b546eca26f` runtime: Generalize VFIO devices `4c527d00c7` agent: Rename VFIO handling to VFIO PCI handling `db89c88f4f` agent: Use cfg-if for s390x CCW `68a586e52c` agent: Use a constant for CCW root bus path `a8b55bf874` dependency: update cgroups-rs `97cdba97ea` runtime-rs: update load_config comment `974a5c22f0` runtime: add support for Hyper-V `40f4eef535` build: Use the correct kernel name `a6c67a161e` runtime: add support for ephemeral mounts to occupy entire sandbox memory `844bf053b2` runtime-rs: add the missing default trait `e7bca62c32` bugfix: modify tty_win info in runtime when handling ResizePtyRequest `30e235f0a1` runtime-rs: impl volume-resize trait for sandbox `e029988bc2` bugfix: add get_ns_path API for Hypervisor `42b8867148` runtime-rs: impl volume-stats trait for sandbox `462d4a1af2` workflows: static-checks: Free disk space before running checks `e68186d9af` workflows: static-checks: Set GOPATH only once `439ff9d4c4` tools/osbuilder/tests: Remove TRAVIS variable `43ce3f7588` packaging: Simplify get_last_modification() `33c5c49719` packaging: Move repo_root_dir to lib.sh `16e2c3cc55` agent: implement update_ephemeral_mounts api `3896c7a22b` protocol: add updateEphemeralMounts proto `23488312f5` agent: always use cgroupfs when running as init `8546387348` agent: determine value of use_systemd_cgroup before LinuxContainer::new() `736aae47a4` rustjail: print type of cgroup manager `dbae281924` workflows: Properly set the kata-tarball architecture `76b4591e2b` tools: Adjust the build-and-upload-payload.sh script `cd2aaeda2a` kata-deploy: Switch to using an ubuntu image `2d43e13102` docs: fix typo in AWS installation guide `760f78137d` dragonball: support pmu on aarch64 `9bc7bef3d6` kata-deploy: Fix path to the Dockerfile `78ba363f8e` kata-deploy: Use different images for s390x and aarch64 
`6267909501` kata-deploy: Allow passing BASE_IMAGE_{NAME,TAG} `3443f558a6` nydus: upgrad nydus to v2.2.0 `395645e1ce` runtime: hybrid-mode cause error in the latest nydusd `f8e44172f6` utils: Make kata-manager.sh runs checks `f31c79d210` workflows: static-checks: Remove TRAVIS_XXX variables `8030e469b2` fix(runtime-rs): add exited state to ensure cleanup `7d292d7fc3` workflows: Fix the path of imported workflows `e07162e79d` workflows: Fix action name `dd2713521e` Dragonball: update dependencies `bd1ed26c8d` workflows: Publish kata-deploy payload after a merge `fea7e8816f` runtime-rs: Fixed typo mod.rs `a9e2fc8678` runtime/Makefile: Fix install-containerd-shim-v2 dependency `b6880c60d3` logging: Correct the code notes `12cfad4858` runtime-rs: modify the transfer to oci::Hooks `828d467222` workflows: Do not install docker `4b8a5a1a3d` utils: Remove kata-manager.sh cgroups v2 check `2c4428ee02` runtime-rs: move pre-start hooks to sandbox_start `e80c9f7b74` runtime-rs: add StartContainer hook `977f281c5c` runtime-rs: add CreateContainer hook support `875f2db528` runtime-rs: add oci hook support `ecac3a9e10` docs: add design doc for Hooks `3ac6f29e95` runtime: clh: Re-generate the client code `262daaa2ef` versions: Upgrade to Cloud Hypervisor v30.0 `192df84588` agent: always use cgroupfs when running as init `b0691806f1` agent: determine value of use_systemd_cgroup before LinuxContainer::new() `dc86d6dac3` runtime: use filepath.Clean() to clean the mount path `c4ef5fd325` agent: don't set permission of existing directory `3483272bbd` runtime-rs: ch: Enable initrd usage `fbee6c820e` runtime-rs: Improve Cloud Hypervisor config handling `1bff1ca30a` kernel: Add CONFIG_SEV_GUEST to SEV kernel config Adding kernel config to sev case since it is needed for SNP and SNP will use the SEV kernel. 
Incrementing kernel config version to reflect changes `ad8968c8d9` rustjail: print type of cgroup manager `b4a1527aa6` kata-deploy: Fix static shim-v2 build on arm64 `2c4f8077fd` Revert "shim-v2: Bump Ubuntu container image to 22.04" `afaccf924d` Revert "workflows: Push the builder image to quay.io" `4c39c4ef9f` devguide: Add link to the contribution guidelines `76e926453a` osbuilder: Include minimal set of device nodes in ubuntu initrd `697ec8e578` kata-deploy: Fix kata static firecracker arm64 package build error `ced3c99895` dragonball: config_manager: preserve device when update `da8a6417aa` runtime-rs: remove all remaining unsafe impl `0301194851` dragonball: use crossbeam_channel in VmmService instead of mpsc::channel `9d78bf9086` shim-v2: Bump Ubuntu container image to 22.04 `3cfce5a709` utils: improved unsupported distro message. `919d19f415` feat(runtime): make static resource management consistent with 2.0 `b835c40bbd` workflows: Push the builder image to quay.io `781ed2986a` packaging: Allow passing a container builder to the scripts `45668fae15` packaging: Use existing image to build td-shim `e8c6bfbdeb` packaging: Use existing image to build td-shim `3fa24f7acc` packaging: Add infra to push the OVMF builder image `f076fa4c77` packaging: Use existing image to build OVMF `c7f515172d` packaging: Add infra to push the QEMU builder image `fb7b86b8e0` packaging: Use existing image to build QEMU `d0181bb262` packaging: Add infra to push the virtiofsd builder image `7c93428a18` packaging: Use existing image to build virtiofsd `8c227e2471` virtiofsd: Pass the expected toolchain to the build container `7ee00d8e57` packaging: Add infra to push the shim-v2 builder image `24767d82aa` packaging: Use existing image to build the shim-v2 `e84af6a620` virtiofsd: update to a valid path on ppc64le `6c3c771a52` packaging: Add infra to push the kernel builder image `b9b23112bf` packaging: Use existing image to build the kernel `869827d77f` packaging: Add push_to_registry() 
`e69a6f5749` packaging: Add get_last_modification() `6c05e5c67a` packaging: Add and export BUILDER_REGISTRY `1047840cf8` utils: always check some dependencies. `95e3364493` runtime-rs: remove unnecessary Send/Sync trait implement `a96ba99239` actions: Use `git-diff` to get changes in kernel dir `619ef54452` docs: Change the order of release step `a161d11920` versions: Use ubuntu as the default distro for the rootfs-image `be40683bc5` runtime-rs: Add a generic powerpc64le-options.mk `47c058599a` packaging/shim-v2: Install the target depending on the arch/libc `b582c0db86` kata-ctl/exec: add new command exec to enter guest VM. `07802a19dc` runtime-rs: handle sys_dir bind volume `04e930073c` sandbox: set the dns for the sandbox `32ebe1895b` agent: fix the issue of creating the dns file `44aaec9020` github-action: Replace deprecated command with environment file `a68c5004f8` packaging/shim-v2: Only change the config if the file exists `ee76b398b3` release: Revert kata-deploy changes after 3.1.0-rc0 release `bbc733d6c8` docs: runtime-rs: Add CH status details `37b594c0d2` runtime-rs: Add basic CH implementation `545151829d` kata-types: Add Cloud Hypervisor (CH) definitions `2dd2421ad0` runtime-rs: cleanup kata host share path `0a21ad78b1` osbuilder: fix default build target in makefile `9a01d4e446` dragonball: add more unit test for virtio-blk device. Signed-off-by: Greg Kurz <groug@kaod.org>	2023-03-28 08:40:06 +02:00
Bin Liu	75987aae72	Merge pull request #6408 from jongwu/nydus_rm_hybrid nydus: upgrad to v2.2.0	2023-03-28 11:07:56 +08:00
Fabiano Fidêncio	4a95375dc8	Merge pull request #6465 from dallasd1/mariner-rootfs osbuilder: Add support for CBL-Mariner	2023-03-27 22:18:31 +02:00
Fabiano Fidêncio	43dd4440f4	snap: Build the artefacts using kata-deploy Our CI and release process are currently taking advantage of the kata-deploy local build scripts to build the artefacts. Having snap doing the same is the next logical step, and it will also help to reduce, by a lot, the CI time as we only build the components that a PR is touching (otherwise we just pull the cached component). Fixes: #6514 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-03-27 17:34:43 +02:00
Fabiano Fidêncio	293119df78	Merge pull request #6515 from xyz-li/main kata-deploy: Fix bash semantics error	2023-03-24 13:18:10 +01:00
Chelsea Mafrica	bbc699ddd8	Merge pull request #6419 from gabevenberg/containerd-pre-check make only_kata work without -f	2023-03-23 10:02:32 -07:00
xyz-li	8b008fc743	kata-deploy: fix bash semantics error The argument of return must be numeric. Fixes: #6521 Signed-off-by: xyz-li <hui0787411@163.com>	2023-03-23 22:47:54 +08:00
James O. D. Hunt	da676872b1	Merge pull request #6439 from jodh-intel/runtime-rs-ch-confidential-guest runtime-rs: ch: Implement confidential guest handling	2023-03-23 13:01:47 +00:00
Dallas Delaney	74ec38cf02	osbuilder: Add support for CBL-Mariner Add osbuilder support to build a rootfs and image based on the CBL-Mariner Linux distro Fixes: #6462 Signed-off-by: Dallas Delaney <dadelan@microsoft.com>	2023-03-22 11:45:32 -07:00
James O. D. Hunt	ac58588682	runtime-rs: ch: Generate Cloud Hypervisor config for confidential guests This change provides a preliminary implementation for the Cloud Hypervisor (CH) feature ([currently disabled](https://github.com/kata-containers/kata-containers/pull/6201)) to allow it to generate the CH configuration for handling confidential guests. This change also introduces concrete errors using the `thiserror` crate (see `src/runtime-rs/crates/hypervisor/ch-config/src/errors.rs`) and a lot of unit tests for the conversion code that generates the CH configuration from the generic Hypervisor configuration. Fixes: #6430. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-03-22 14:38:38 +00:00
James O. D. Hunt	96555186b3	runtime-rs: ch: Honour debug setting Enable Cloud Hypervisor debug based on the specified configuration rather than hard-coding debug to be disabled. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-03-22 14:38:38 +00:00
James O. D. Hunt	e3c2d727ba	runtime-rs: ch: clippy fix Simplify the code to keep rust's `clippy` happy. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-03-22 14:38:38 +00:00
James O. D. Hunt	f06f72b5e9	Merge pull request #6467 from jongwu/qemu-uefi-path qemu/arm64: disable image nvdimm once no firmware offered	2023-03-22 08:43:01 +00:00
Steve Horsman	adaabd141a	Merge pull request #6406 from jepio/jepio/static-checks-workflow-improvements static checks workflow improvements	2023-03-20 17:12:54 +00:00
Wainer Moschetta	20da7f3ec8	Merge pull request #6495 from wainersm/fix-kata-deploy-ci A couple of kata-deploy fixes	2023-03-20 13:48:02 -03:00
Fabiano Fidêncio	2fe0733dcb	Merge pull request #4582 from BbolroC/vfio-ap agent: Bring in VFIO-AP device handling again	2023-03-20 11:43:13 +01:00
Jianyong Wu	ece5edc641	qemu/arm64: disable image nvdimm if no firmware offered For now, image nvdimm on qemu/arm64 depends on UEFI/ACPI, so if there is no firmware offered, it should be disabled. Fixes: #6468 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2023-03-20 18:03:05 +08:00
Zhongtao Hu	1e8005ff88	Merge pull request #6477 from openanolis/runtime-rs-hostname bugfix: set hostname in CreateSandboxRequest	2023-03-20 12:43:29 +08:00
Gabe Venberg	dd23f452ab	utils: renamed only_kata to skip_containerd Renamed for greater clarity as to what that flag does. Signed-off-by: Gabe Venberg <gabevenberg@gmail.com>	2023-03-17 16:09:45 -05:00
Gabe Venberg	59c81ed2bb	utils: informed pre-check about only_kata Passed the only_kata variable through to pre_check, so that only_kata does not abort the install when containerd is already installed. Fixes #6385 Signed-off-by: Gabe Venberg <gabevenberg@gmail.com>	2023-03-17 15:58:57 -05:00
Fabiano Fidêncio	96252db787	Merge pull request #6481 from fidencio/topic/cache-artefacts packaging / kata-deploy builds: Add the ability to cache and consume cached components	2023-03-17 20:54:42 +01:00
Wainer dos Santos Moschetta	4f0887ce42	kata-deploy: fix install failing to chmod runtime-rs/bin/* The kata-deploy install method tried to `chmod +x /opt/kata/runtime-rs/bin/*` but it isn't always true that /opt/kata/runtime-rs/bin/ exists. For example, the s390x payload does not build the kernel-dragonball-experimental artifacts. So let's ensure the dir exists before issuing the command. Fixes #6494 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2023-03-17 16:09:21 -03:00
Wainer dos Santos Moschetta	09c4828ac3	workflows: add missing artifacts on payload-after-push The kata-deploy-ci payloads for amd64 and arm64 were missing the shim-v2 and kernel-dragonball-experimental artifacts. Fixes #6493 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2023-03-17 15:31:21 -03:00
Fabiano Fidêncio	fbf891fdff	packaging: Adapt `get_last_modification()` The function is returning "" when called from the script used to cache the artefacts and one difference noted between this version and the already working one from the CCv0 is that we make sure to `pushd ${repo_root_dir}` in the CCv0 version. Let's give it a try here and see if it solves the issue. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-03-17 16:27:34 +01:00
Fabiano Fidêncio	82a04dbce1	local-build: Use cached VirtioFS when possible As we've added the support for caching components, let's use them whenever those are available. Fixes: #6480 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-03-17 16:27:34 +01:00
Fabiano Fidêncio	3b99004897	local-build: Use cached shim v2 when possible As we've added the support for caching components, let's use them whenever those are available. Fixes: #6480 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-03-17 16:27:34 +01:00
Fabiano Fidêncio	1b8c5474da	local-build: Use cached RootFS when possible As we've added the support for caching components, let's use them whenever those are available. Fixes: #6480 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-03-17 16:27:34 +01:00
Fabiano Fidêncio	09ce4ab893	local-build: Use cached QEMU when possible As we've added the support for caching components, let's use them whenever those are available. Fixes: #6480 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-03-17 16:27:34 +01:00
Fabiano Fidêncio	1e1c843b8b	local-build: Use cached Nydus when possible As we've added the support for caching components, let's use them whenever those are available. Fixes: #6480 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-03-17 16:27:34 +01:00
Fabiano Fidêncio	64832ab65b	local-build: Use cached Kernel when possible As we've added the support for caching components, let's use them whenever those are available. Fixes: #6480 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-03-17 16:27:34 +01:00
Fabiano Fidêncio	04fb52f6c9	local-build: Use cached Firecracker when possible As we've added the support for caching components, let's use them whenever those are available. Fixes: #6480 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-03-17 16:27:34 +01:00
Fabiano Fidêncio	8a40f6f234	local-build: Use cached Cloud Hypervisor when possible As we've added the support for caching components, let's use them whenever those are available. Fixes: #6480 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-03-17 16:27:34 +01:00
Fabiano Fidêncio	194d5dc8a6	tools: Add support for caching VirtioFS artefacts Let's add support for caching VirtioFS artefacts that are generated using the kata-deploy local-build scripts. Right now those are not used, but we'll switch to using them very soon as part of upcoming changes of how we build the components we test in our CI. Fixes: #6480 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-03-17 11:43:01 +01:00
Fabiano Fidêncio	a34272cf20	tools: Add support for caching shim v2 artefacts Let's add support for caching shim v2 artefacts that are generated using the kata-deploy local-build scripts. Right now those are not used, but we'll switch to using them very soon as part of upcoming changes of how we build the components we test in our CI. Fixes: #6480 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-03-17 11:43:01 +01:00
Fabiano Fidêncio	7898db5f79	tools: Add support for caching RootFS artefacts Let's add support for caching RootFS artefacts that are generated using the kata-deploy local-build scripts. Right now those are not used, but we'll switch to using them very soon as part of upcoming changes of how we build the components we test in our CI. Fixes: #6480 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-03-17 11:43:01 +01:00
Fabiano Fidêncio	e90891059b	tools: Add support for caching QEMU artefacts Let's add support for caching QEMU artefacts that are generated using the kata-deploy local-build scripts. Right now those are not used, but we'll switch to using them very soon as part of upcoming changes of how we build the components we test in our CI. Fixes: #6480 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-03-17 11:43:01 +01:00
Fabiano Fidêncio	7aed8f8c80	tools: Add support for caching Nydus artefacts Let's add support for caching Nydus artefacts that are generated using the kata-deploy local-build scripts. Right now those are not used, but we'll switch to using them very soon as part of upcoming changes of how we build the components we test in our CI. Fixes: #6480 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-03-17 11:43:01 +01:00
Fabiano Fidêncio	cb4cbe2958	tools: Add support for caching Kernel artefacts Let's add support for caching Kernel artefacts that are generated using the kata-deploy local-build scripts. Right now those are not used, but we'll switch to using them very soon as part of upcoming changes of how we build the components we test in our CI. Fixes: #6480 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-03-17 11:43:01 +01:00
Fabiano Fidêncio	762f9f4c3e	tools: Add support for caching Firecracker artefacts Let's add support for caching Firecracker artefacts that are generated using the kata-deploy local-build scripts. Right now those are not used, but we'll switch to using them very soon as part of upcoming changes of how we build the components we test in our CI. Fixes: #6480 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-03-17 11:28:56 +01:00
Fabiano Fidêncio	6b1b424fc7	tools: Add support for caching Cloud Hypervisor artefacts Let's add support for caching Cloud Hypervisor artefacts that are generated using the kata-deploy local-build scripts. Right now those are not used, but we'll switch to using them very soon as part of upcoming changes of how we build the components we test in our CI. Fixes: #6480 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-03-17 11:28:56 +01:00
Fabiano Fidêncio	08fe49f708	versions: Adjust kernel names to match kata-deploy build targets Let's adjust the kernel names in versions.yaml so those can match the names used as part of the kata-deploy local build scripts. Right now this doesn't bring any benefit nor drawback, but it'll make our life easier later on in this same series. Depends-on: github.com/kata-containers/tests#5534 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-03-17 11:28:56 +01:00
Fabiano Fidêncio	d281d1b90a	Merge pull request #6483 from GabyCT/topic/updatefcv versions: Update firecracker version	2023-03-17 10:37:22 +01:00
Gabriela Cervantes	99505c0f4f	versions: Update firecracker version This PR updates the firecracker version being used in kata containers versions.yaml. The changes in version 1.3.1 are: Added: Introduced T2CL (Intel) and T2A (AMD) CPU templates to provide instruction set feature parity between Intel and AMD CPUs when using these templates. Added Graviton3 support (c7g instance type). Changed: Improved error message when invalid network backend provided. Improved TCP throughput by between 5% and 15% (depending on CPU) by using scatter-gather I/O in the net device's TX path. Upgraded Rust toolchain from 1.64.0 to 1.66.0. Made seccompiler output bit-reproducible. Fixed: Fixed feature flags in T2 CPU template on Intel Ice Lake. Fixes #6482 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2023-03-16 17:34:33 +00:00
Yushuo	f4938c0d90	bugfix: set hostname Setting hostname according to the spec. Fixes: #6247 Signed-off-by: Yushuo <y-shuo@linux.alibaba.com>	2023-03-16 17:16:06 +08:00
Hyounggyu Choi	96baa83895	agent: Bring in VFIO-AP device handling again This PR continues the work from kata-containers#3679. It generalizes the previous VFIO device handling, which only focused on PCI, to include AP (IBM Z specific). Fixes: kata-containers#3678 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2023-03-16 18:14:12 +09:00
Greg Kurz	e6e719699f	Merge pull request #6471 from etrunko/main dependency: update cgroups-rs	2023-03-16 08:01:07 +01:00
QuanweiZhou	56c63a9b1c	Merge pull request #6186 from wllenyj/dragonball-ut-6 Built-in Sandbox: add more unit tests for dragonball. Part 6	2023-03-16 11:02:05 +08:00
Jakob Naucke	f666f8e2df	agent: Add VFIO-AP device handling Initial VFIO-AP support (#578) was simple, but somewhat hacky; a different code path would be chosen for performing the hotplug, and agent-side device handling was bound to knowing the assigned queue numbers (APQNs) through some other means; plus the code for awaiting them was written for the Go agent and never released. This code also artificially increased the hotplug timeout to wait for the (relatively expensive, thus limited to 5 seconds at the quickest) AP rescan, which is impractical for e.g. common k8s timeouts. Since then, the general handling logic was improved (#1190), but it assumed PCI in several places. In the runtime, introduce and parse AP devices. Annotate them as such when passing to the agent, and include information about the associated APQNs. The agent awaits the passed APQNs through uevents and triggers a rescan directly. Fixes: #3678 Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2023-03-16 10:07:48 +09:00
Jakob Naucke	b546eca26f	runtime: Generalize VFIO devices Generalize VFIO devices to allow for adding AP in the next patch. The logic for VFIOPciDeviceMediatedType() has been changed and IsAPVFIOMediatedDevice() has been removed. The rationale for the removal is: - VFIODeviceMediatedType is divided into 2 subtypes for AP and PCI - Logic of checking a subtype of mediated device is included in GetVFIODeviceType() - VFIOPciDeviceMediatedType() can simply fulfill the device addition based on a type categorized by GetVFIODeviceType() Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2023-03-16 10:06:37 +09:00
Jakob Naucke	4c527d00c7	agent: Rename VFIO handling to VFIO PCI handling e.g., split_vfio_option is PCI-specific and should instead be named split_vfio_pci_option. This mutually affects the runtime, most notably how the labels are named for the agent. Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2023-03-16 07:43:39 +09:00
Jakob Naucke	db89c88f4f	agent: Use cfg-if for s390x CCW Uses fewer lines in upcoming VFIO-AP support. Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2023-03-16 07:43:39 +09:00
Jakob Naucke	68a586e52c	agent: Use a constant for CCW root bus path The code previously used a function like PCI does, but this is not necessary. Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2023-03-16 07:43:39 +09:00
Fabiano Fidêncio	814d07af58	Merge pull request #6463 from sprt/sprt/mshv-compat runtime: add support for Hyper-V	2023-03-15 18:03:25 +01:00
Eduardo Lima (Etrunko)	a8b55bf874	dependency: update cgroups-rs Huge pages failure with cgroups v2. https://github.com/kata-containers/cgroups-rs/issues/112 Fixes: #6470 Signed-off-by: Eduardo Lima (Etrunko) <etrunko@redhat.com>	2023-03-15 12:21:12 -03:00
Chao Wu	530b2a7685	Merge pull request #6458 from openanolis/chao/update_comments runtime-rs: update load_config comment	2023-03-15 19:32:07 +08:00
Chao Wu	97cdba97ea	runtime-rs: update load_config comment Since the shimv2 create task option is already implemented, we need to update the corresponding comments. The ordering is also updated to match the code. Fixes: #3961 Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2023-03-15 14:44:47 +08:00
Eric Ernst	dc42f0a33b	Merge pull request #6411 from wlan0/empty-dir Add support for ephemeral mounts to occupy entire sandbox's memory	2023-03-13 20:07:27 -07:00
Henry Beberman	974a5c22f0	runtime: add support for Hyper-V This adds /dev/mshv to the list of sandbox devices so that VMMs can create Hyper-V VMs. In our testing, this also doesn't error out in case /dev/mshv isn't present. Fixes #6454. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-03-13 17:13:51 -07:00
Fabiano Fidêncio	ab0bd7a1ee	Merge pull request #6292 from fidencio/topic/runtime-rs-small-fixes runtime-rs: fix default kernel location and add more default config paths	2023-03-13 16:53:30 +01:00
Fabiano Fidêncio	40f4eef535	build: Use the correct kernel name When calling `MAKE_KERNEL_NAME` we're considering the default kernel name will be `vmlinux.container` or `vmlinuz.container`, which is not the case as the runtime-rs, when used with dragonball, relies on the `vmlinu[zx]-dragonball-experimental.container` kernel. Other hypervisors will have to introduce a similar `MAKE_KERNEL_NAME_${HYPERVISOR}` to adapt this to the kernel they want to use, similarly to what's already done for the go runtime. By doing this we also ensure that no changes in the configuration file will be required to run runtime-rs, with dragonball, as part of our CI or as part of kata-deploy. Fixes: #6290 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-03-13 13:47:20 +01:00
James O. D. Hunt	ae9be1d94b	Merge pull request #5840 from tzY15368/feat-runtimers-direct-vol Implement direct-volume commands handler for shim-mgmt	2023-03-13 07:58:40 +00:00
Chelsea Mafrica	4b877b0a3e	Merge pull request #6426 from openanolis/runtime-rs-resize-pty bugfix: modify tty_win info in runtime when handling ResizePtyRequest	2023-03-10 14:08:41 -08:00
Sidhartha Mani	a6c67a161e	runtime: add support for ephemeral mounts to occupy entire sandbox memory On hotplug of memory as containers are started, remount all ephemeral mounts with size option set to the total sandbox memory Fixes: #6417 Signed-off-by: Sidhartha Mani <sidhartha_mani@apple.com>	2023-03-10 13:36:02 -08:00
James O. D. Hunt	99a4eaa898	Merge pull request #6443 from openanolis/runtime-rs-get-netns bugfix: add get_ns_path API for Hypervisor	2023-03-10 20:16:22 +00:00
Fabiano Fidêncio	44bc222ca4	Merge pull request #5578 from Richardhongyu/main runtime-rs: add the missing default trait	2023-03-10 18:01:43 +01:00
Li Hongyu	844bf053b2	runtime-rs: add the missing default trait Some structs in the runtime-rs don't implement Default trait. This commit adds the missing Default. Fixes: #5463 Signed-off-by: Li Hongyu <lihongyu1999@bupt.edu.cn>	2023-03-10 08:19:56 +00:00
Yushuo	e7bca62c32	bugfix: modify tty_win info in runtime when handling ResizePtyRequest Currently, we only create the new exec process in the runtime, which causes an error when the following requests need to be handled: - Task: exec process - Task: resize process pty - ... The agent does not call do_exec_process when we handle ExecProcess, so no process information can be found in the guest when we handle ResizeProcessPty, and an error is reported. In this commit, the handling is changed as follows: * Modify the process tty_win information in the runtime * If the exec process is not running, just return; the actual pty resize happens at start_process Fixes: #6248 Signed-off-by: Yushuo <y-shuo@linux.alibaba.com>	2023-03-10 14:33:51 +08:00
Tingzhou Yuan	30e235f0a1	runtime-rs: impl volume-resize trait for sandbox Implements resize-volume handlers in shim-mgmt, trait for sandbox and add RPC calls to agent. Note the actual rpc handler for the resize request is currently not implemented, refer to issue #3694. Fixes #5369 Signed-off-by: Tingzhou Yuan <tzyuan15@bu.edu>	2023-03-10 01:27:06 -05:00
Yushuo	e029988bc2	bugfix: add get_ns_path API for Hypervisor For external hypervisors (qemu, cloud-hypervisor, ...), the namespace in which they launch the VM is different from that of the internal hypervisor (dragonball). The CreateContainer hook relies on the netns path, so we add a get_ns_path API. Fixes: #6442 Signed-off-by: Yushuo <y-shuo@linux.alibaba.com>	2023-03-10 13:57:00 +08:00
Tingzhou Yuan	42b8867148	runtime-rs: impl volume-stats trait for sandbox Implements get-volume-stats trait for sandbox, handler for shim-mgmt and add RPC calls to agent. Also added type conversions in trans.rs Fixes #5369 Signed-off-by: Tingzhou Yuan <tzyuan15@bu.edu>	2023-03-10 00:48:02 -05:00
Jeremi Piotrowski	462d4a1af2	workflows: static-checks: Free disk space before running checks We've been seeing the 'sudo make test' job occasionally run out of space in /tmp, which is part of the root filesystem. Removing dotnet and `AGENT_TOOLSDIRECTORY` frees around 10GB of space and in my tests the job still has 13GB of space left after running. Fixes: #6401 Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-03-09 13:30:09 +01:00
Jeremi Piotrowski	e68186d9af	workflows: static-checks: Set GOPATH only once {{ runner.workspace }}/kata-containers and {{ github.workspace }} resolve to the same value, but they're being used multiple times in the workflow. Remove multiple definitions and define the GOPATH var at job level once. Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-03-09 13:30:09 +01:00
Jeremi Piotrowski	439ff9d4c4	tools/osbuilder/tests: Remove TRAVIS variable The last remaining user of the TRAVIS variable in this repo is tools/osbuilder/tests and it is only used to skip spinning up VMs. Travis didn't support virtualization and the same is true for github actions hosted runners. Replace the variable with KVM_MISSING and determine availability of /dev/kvm at runtime. TRAVIS is also used by '.ci/setup.sh' in kata-containers/tests to reduce the set of dependencies that gets installed, but this is also in the process of being removed. Fixes: #3544 Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-03-09 13:29:49 +01:00
Christophe de Dinechin	7566a7eae4	Merge pull request #6432 from fidencio/topic/simplify-get-last-modification packaging: Simplify get_last_modification()	2023-03-09 10:57:58 +01:00
Fabiano Fidêncio	43ce3f7588	packaging: Simplify get_last_modification() There's no need to pass repo_root_dir to get_last_modification() as the variable used everywhere is exported from that very same file. Fixes: #6431 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-03-08 21:22:03 +01:00
Fabiano Fidêncio	33c5c49719	packaging: Move repo_root_dir to lib.sh This is used in several parts of the code, and can have a single declaration as part of the `lib.sh` file, which is already imported by all the places where it's used. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-03-08 21:10:53 +01:00
James O. D. Hunt	614d1817ce	Merge pull request #6410 from tg5788re/kata-manager-use-runtime-checks utils: Make kata-manager.sh runs checks	2023-03-08 09:55:03 +00:00
Chao Wu	fef268a7de	Merge pull request #6413 from xuejun-xj/xuejun/pmu dragonball: support pmu on aarch64	2023-03-08 14:24:31 +08:00
Steve Horsman	cc1821fb8b	Merge pull request #6409 from Sig00rd/patch-1 docs: fix typo in key filename in AWS installation guide	2023-03-07 15:19:46 +00:00
Fabiano Fidêncio	861552c305	Merge pull request #6414 from jepio/jepio/backport-3.1-rustjail-systemd-cgroup-fix-6331 backport rustjail systemd cgroup fix #6331 to 3.1	2023-03-07 12:51:08 +01:00
Sidhartha Mani	16e2c3cc55	agent: implement update_ephemeral_mounts api - implement update_ephemeral_mounts rpc - for each mountpoint passed in, remount it with new options Signed-off-by: Sidhartha Mani <sidhartha_mani@apple.com>	2023-03-06 13:44:14 -08:00
Sidhartha Mani	3896c7a22b	protocol: add updateEphemeralMounts proto - adds a new rpc call to the agent service named `updateEphemeralMounts` - this call takes a list of grpc.Storage objects Signed-off-by: Sidhartha Mani <sidhartha_mani@apple.com>	2023-03-06 13:43:47 -08:00
Jeremi Piotrowski	23488312f5	agent: always use cgroupfs when running as init The logic to decide which cgroup driver is used is currently based on the cgroup path that the host provides. This requires host and guest to use the same cgroup driver. If the guest uses kata-agent as init, then systemd can't be used as the cgroup driver. If the host requests a systemd cgroup, this currently results in a rustjail panic: thread 'tokio-runtime-worker' panicked at 'called `Result::unwrap()` on an `Err` value: I/O error: No such file or directory (os error 2) Caused by: No such file or directory (os error 2)', rustjail/src/cgroups/systemd/manager.rs:44:51 stack backtrace: 0: 0x7ff0fe77a793 - std::backtrace_rs::backtrace::libunwind::trace::h8c197fa9a679d134 at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/std/src/../../backtrace/src/backtrace/libunwind.rs:93:5 1: 0x7ff0fe77a793 - std::backtrace_rs::backtrace::trace_unsynchronized::h9ee19d58b6d5934a at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/std/src/../../backtrace/src/backtrace/mod.rs:66:5 2: 0x7ff0fe77a793 - std::sys_common::backtrace::_print_fmt::h4badc450600fc417 at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/std/src/sys_common/backtrace.rs:65:5 3: 0x7ff0fe77a793 - <std::sys_common::backtrace::_print::DisplayBacktrace as core::fmt::Display>::fmt::had334ddb529a2169 at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/std/src/sys_common/backtrace.rs:44:22 4: 0x7ff0fdce815e - core::fmt::write::h1aa7694f03e44db2 at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/core/src/fmt/mod.rs:1209:17 5: 0x7ff0fe74e0c4 - std::io::Write::write_fmt::h61b2bdc565be41b5 at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/std/src/io/mod.rs:1682:15 6: 0x7ff0fe77cd3f - std::sys_common::backtrace::_print::h4ec69798b72ff254 at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/std/src/sys_common/backtrace.rs:47:5 7: 0x7ff0fe77cd3f - std::sys_common::backtrace::print::h0e6c02048dec3c77 
at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/std/src/sys_common/backtrace.rs:34:9 8: 0x7ff0fe77c93f - std::panicking::default_hook::{{closure}}::hcdb7e705dc37ea6e at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/std/src/panicking.rs:267:22 9: 0x7ff0fe77d9b8 - std::panicking::default_hook::he03a933a0f01790f at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/std/src/panicking.rs:286:9 10: 0x7ff0fe77d9b8 - std::panicking::rust_panic_with_hook::he26b680bfd953008 at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/std/src/panicking.rs:688:13 11: 0x7ff0fe77d482 - std::panicking::begin_panic_handler::{{closure}}::h559120d2dd1c6180 at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/std/src/panicking.rs:579:13 12: 0x7ff0fe77d3ec - std::sys_common::backtrace::__rust_end_short_backtrace::h36db621fc93b005a at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/std/src/sys_common/backtrace.rs:137:18 13: 0x7ff0fe77d3c1 - rust_begin_unwind at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/std/src/panicking.rs:575:5 14: 0x7ff0fda52ee2 - core::panicking::panic_fmt::he7679b415d25c5f4 at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/core/src/panicking.rs:65:14 15: 0x7ff0fda53182 - core::result::unwrap_failed::hb71caff146724b6b at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/core/src/result.rs:1791:5 16: 0x7ff0fe5bd738 - <rustjail::cgroups::systemd::manager::Manager as rustjail::cgroups::Manager>::apply::hd46958d9d807d2ca 17: 0x7ff0fe606d80 - <rustjail::container::LinuxContainer as rustjail::container::BaseContainer>::start::{{closure}}::h1de806d91fcb878f 18: 0x7ff0fe604a76 - <core::future::from_generator::GenFuture<T> as core::future::future::Future>::poll::h1749c148adcc235f 19: 0x7ff0fdc0c992 - kata_agent::rpc::AgentService::do_create_container::{{closure}}::{{closure}}::hc1b87a15dfdf2f64 20: 0x7ff0fdb80ae4 - <core::future::from_generator::GenFuture<T> as 
core::future::future::Future>::poll::h846a8c9e4fb67707 21: 0x7ff0fe3bb816 - <core::future::from_generator::GenFuture<T> as core::future::future::Future>::poll::h53de16ff66ed3972 22: 0x7ff0fdb519cb - <core::future::from_generator::GenFuture<T> as core::future::future::Future>::poll::h1cbece980286c0f4 23: 0x7ff0fdf4019c - <tokio::future::poll_fn::PollFn<F> as core::future::future::Future>::poll::hc8e72d155feb8d1f 24: 0x7ff0fdfa5fd8 - tokio::loom::std::unsafe_cell::UnsafeCell<T>::with_mut::h0a407ffe2559449a 25: 0x7ff0fdf033a1 - tokio::runtime::task::raw::poll::h1045d9f1db9742de 26: 0x7ff0fe7a8ce2 - tokio::runtime::scheduler::multi_thread::worker::Context::run_task::h4924ae3464af7fbd 27: 0x7ff0fe7afb85 - tokio::runtime::task::raw::poll::h5c843be39646b833 28: 0x7ff0fe7a05ee - std::sys_common::backtrace::__rust_begin_short_backtrace::ha7777c55b98a9bd1 29: 0x7ff0fe7a9bdb - core::ops::function::FnOnce::call_once{{vtable.shim}}::h27ec83c953360cdd 30: 0x7ff0fe7801d5 - <alloc::boxed::Box<F,A> as core::ops::function::FnOnce<Args>>::call_once::hed812350c5aef7a8 at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/alloc/src/boxed.rs:1987:9 31: 0x7ff0fe7801d5 - <alloc::boxed::Box<F,A> as core::ops::function::FnOnce<Args>>::call_once::hc7df8e435a658960 at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/alloc/src/boxed.rs:1987:9 32: 0x7ff0fe7801d5 - std::sys::unix::thread::Thread::new::thread_start::h575491a8a17dbb33 at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/std/src/sys/unix/thread.rs:108:17 Forward the value of "init_mode" to AgentService, so that we can force cgroupfs when systemd is unavailable. Fixes: #5779 Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-03-06 20:34:21 +01:00
Jeremi Piotrowski	8546387348	agent: determine value of use_systemd_cgroup before LinuxContainer::new() Right now LinuxContainer::new() gets passed a CreateOpts struct, but then modifies the use_systemd_cgroup field inside that struct. Pull the cgroups path parsing logic into do_create_container, so that CreateOpts can be immutable in LinuxContainer::new. This is just moving things around, there should be no functional changes. Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-03-06 20:34:21 +01:00
Jeremi Piotrowski	736aae47a4	rustjail: print type of cgroup manager Since the cgroup manager is wrapped in a dyn now, the print in LinuxContainer::new has been useless and just says "CgroupManager". Extend the Debug trait for 'dyn Manager' to print the type of the cgroup manager so that it's easier to debug issues. Fixes: #5779 Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-03-06 20:34:21 +01:00
Fabiano Fidêncio	0749657c73	Merge pull request #6359 from singhwang/main main \| kata-deploy: Fix kata deploy arm64 image build error	2023-03-06 16:48:03 +01:00
Fabiano Fidêncio	dbae281924	workflows: Properly set the kata-tarball architecture Let's make sure the kata-tarball architecture upload / downloaded / used is exactly the same one that we need as part of the architecture we're using to generate the image. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-03-06 13:18:51 +01:00
Fabiano Fidêncio	76b4591e2b	tools: Adjust the build-and-upload-payload.sh script Now that we've switched the base container image to using Ubuntu instead of CentOS, we don't need any kind of extra logic to correctly build the image for different architectures, as Ubuntu is a multi-arch image that supports all the architectures we're targeting. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-03-06 13:18:51 +01:00
SinghWang	cd2aaeda2a	kata-deploy: Switch to using an ubuntu image Let's make sure we use a multi-arch image for building kata-deploy. A few changes were also added in order to get systemd working inside the kata-deploy image, due to the switch from CentOS to Ubuntu. Fixes: #6358 Signed-off-by: SinghWang <wangxin_0611@126.com>	2023-03-06 13:18:51 +01:00
Szymon Fugas	2d43e13102	docs: fix typo in AWS installation guide Fixes referring to previously created key file with .pen extension instead of .pem. Fixes: #6412 Signed-off-by: Sig00rd <sfugas@virtuslab.com>	2023-03-06 13:18:08 +01:00
xuejun-xj	760f78137d	dragonball: support pmu on aarch64 This commit adds support for pmu virtualization on aarch64. The initialization of pmu is in the following order: 1. Receive pmu parameter(vpmu_feature) from runtime-rs to determine the VpmuFeatureLevel. 2. Judge whether to initialize pmu devices and add pmu device node into fdt on aarch64, according to VpmuFeatureLevel. Fixes: #6168 Signed-off-by: xuejun-xj <jiyunxue@linux.alibaba.com>	2023-03-06 18:55:13 +08:00
Fabiano Fidêncio	93a40cb35e	Merge pull request #6402 from fidencio/topic/yet-more-fixes-for-the-publish-kata-deploy-payload-work workflows: Yet more fixes for publishing the kata-deploy payload after every PR merged	2023-03-06 10:43:32 +01:00
Fabiano Fidêncio	df35f8f885	Merge pull request #6331 from jepio/jepio/fix-agent-init-cgroups rustjail: fix cgroup handling in agent-init mode	2023-03-05 20:29:40 +01:00
Fabiano Fidêncio	98d611623f	Merge pull request #6361 from etrunko/main runtime/Makefile: Fix install-containerd-shim-v2 dependency	2023-03-04 13:47:11 +01:00
Fabiano Fidêncio	9bc7bef3d6	kata-deploy: Fix path to the Dockerfile As part of `bd1ed26c8d`, we've pointed to the Dockerfile that's used in the CC branch, which is wrong. For what we're doing on main, we should be pointing to the one under the `kata-deploy` folder, and not the one under the non-existent `kata-deploy-cc` one. Fixes: #6343 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-03-04 12:18:38 +01:00
Fabiano Fidêncio	78ba363f8e	kata-deploy: Use different images for s390x and aarch64 As the image provided as part of registry.centos.org is not a multi-arch one, at least not for CentOS 7, we need to expand the script used to build the image to pass images that are known to work for s390x (ClefOS) and aarch64 (CentOS, but coming from dockerhub). Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-03-04 12:18:32 +01:00
Fabiano Fidêncio	6267909501	kata-deploy: Allow passing BASE_IMAGE_{NAME,TAG} Let's break the IMAGE build parameter into BASE_IMAGE_NAME and BASE_IMAGE_TAG, as it makes it easier to replace the default CentOS image by something else. Spoiler alert, the default CentOS image is not multi-arch, and we do want to support at least aarch64 and s390x in the near term future. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-03-04 12:16:41 +01:00
Jianyong Wu	3443f558a6	nydus: upgrade nydus to v2.2.0 Use the latest nydus, which may let nydus work on arm64. Fixes: #6407 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2023-03-04 12:58:48 +08:00
Jianyong Wu	395645e1ce	runtime: hybrid-mode causes an error in the latest nydusd When updating nydusd to 2.2, the argument "--hybrid-mode" causes the following error: thread 'main' panicked at 'ArgAction::SetTrue / ArgAction::SetFalse is defaulted' Maybe we should remove it in order to upgrade nydusd Fixes: #6407 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2023-03-04 12:58:48 +08:00
tg5788re	f8e44172f6	utils: Make kata-manager.sh runs checks Updated the `kata-manager.sh` script to make it run all the checks on the host system before attempting to create a container. If any checks fail, they will indicate to the user what the problem is in a clearer manner than those reported by the container manager. Fixes: #6281. Signed-off-by: tg5788re <jfokugas@gmail.com>	2023-03-03 09:56:12 -06:00
Chelsea Mafrica	ebe916b372	Merge pull request #6355 from yanggangtony/fix-wrong-notes fix wrong notes for func GetSandboxesStoragePathRust()	2023-03-03 07:55:54 -08:00
Jeremi Piotrowski	f31c79d210	workflows: static-checks: Remove TRAVIS_XXX variables These variables are unused since we don't use travis CI. This also allows removing two steps: - 'Setup GOPATH' only printed variables - 'Setup travis reference' modified some shell local variables that don't have any influence on the rest of the steps The TRAVIS var is still used by tools/osbuilder/tests to determine if virtualization is available. Fixes: #3544 Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-03-03 11:38:34 +01:00
Zhongtao Hu	60bb9d114a	Merge pull request #6399 from yipengyin/fix-cleanup fix(runtime-rs): add exited state to ensure cleanup	2023-03-03 17:41:16 +08:00
Chao Wu	6fc4c8b099	Merge pull request #5788 from openanolis/runtime-rs-ocihook runtime-rs: add oci hook support	2023-03-03 01:06:21 +08:00
James O. D. Hunt	4a7a859592	Merge pull request #6377 from pembek01/remove-cgroupsv2-check utils: Remove kata-manager.sh cgroups v2 check	2023-03-02 17:00:46 +00:00
Fabiano Fidêncio	b20d5289cb	Merge pull request #6400 from fidencio/topic/fixes-for-generating-the-kata-deploy-payload workflows: Fixes for the `payload-after-push` action	2023-03-02 14:20:24 +01:00
Yipeng Yin	8030e469b2	fix(runtime-rs): add exited state to ensure cleanup Set the process status to exited at the end of io wait. This indicates only that the process has exited; stopping the process has not finished yet. Otherwise, cleanup_container will be skipped. Fixes: #6393 Signed-off-by: Yipeng Yin <yinyipeng@bytedance.com>	2023-03-02 18:14:20 +08:00
Fabiano Fidêncio	7d292d7fc3	workflows: Fix the path of imported workflows In `payload-after-push.yaml` we ended up mentioning cc-*.yaml workflows, which are non existent in the main branch. Let's adapt the name to the correct ones. Fixes: #6343 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-03-02 10:18:10 +01:00
Fabiano Fidêncio	e07162e79d	workflows: Fix action name We have a few actions in the `payload-after-push.*.yaml` that are referring to Confidential Containers, but they should be referring to Kata Containers instead. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-03-02 10:17:18 +01:00
Chao Wu	572c385774	Merge pull request #6269 from openanolis/chao/update_dragonball_version Dragonball: update dependencies	2023-03-02 17:15:39 +08:00
Fabiano Fidêncio	7286f8f706	Merge pull request #6391 from fidencio/topic/do-not-install-docker-as-part-of-the-actions workflows: Do not install docker	2023-03-02 10:12:15 +01:00
Fabiano Fidêncio	7201279647	Merge pull request #6344 from fidencio/topic/generate-a-kata-deploy-payload-on-each-PR-merged workflows: Publish kata-deploy payload after a merge	2023-03-02 09:02:34 +01:00
Chao Wu	dd2713521e	Dragonball: update dependencies Since rust-vmm and dragonball-sandbox have introduced several updates such as vPMU support for aarch64, we also need to update Dragonball dependencies to include those changes. Update: virtio-queue to v0.6.0 kvm-ioctls to v0.12.0 dbs-upcall to v0.2.0 dbs-virtio-devices to v0.2.0 kvm-bindings to v0.6.0 Also, several aarch64 features are updated because of dependency changes: 1. update vcpu hotplug API. 2. update vpmu related API. 3. adjust unit test cases for aarch64 Dragonball. fixes: #6268 Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2023-03-02 14:53:04 +08:00
Chao Wu	2934ab4a3c	Merge pull request #6380 from Christopher-C-Robinson/#6256-typo-fix src: Fixed typo mod.rs	2023-03-02 14:31:33 +08:00
Fabiano Fidêncio	bd1ed26c8d	workflows: Publish kata-deploy payload after a merge For the architectures we know that `make kata-tarball` works as expected, let's start publishing the kata-deploy payload after each merge. This will help to: * Easily test the content of current `main` or `stable-` branch * Easily bisect issues * Start providing some sort of CI/CD content pipeline for those who need that This is a forward-port work from the `CCv0` and groups together patches that I've worked on, with the work that Choi did in order to support different architectures. Fixes: #6343 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-03-02 02:19:10 +01:00
Domesticcadiz	fea7e8816f	runtime-rs: Fixed typo mod.rs Fixed the typo in comment in the delete method located in mod.rs file. Fixes: #6256. Signed-off-by: Domesticcadiz <christopher.cadiz.robinson@gmail.com>	2023-03-01 18:03:41 -06:00
Archana Shinde	65fa19fe92	Merge pull request #6305 from amshinde/update-action-kernel-check actions: Use `git-diff` to get changes in kernel dir	2023-03-01 13:46:50 -08:00
Eduardo Lima (Etrunko)	a9e2fc8678	runtime/Makefile: Fix install-containerd-shim-v2 dependency $ make install make: *** No rule to make target 'containerd-shim-kata-v2', needed by 'install-containerd-shim-v2'. Stop. Spotted when building kata-runtime with a different name for SHIMV2_OUTPUT. For instance, trying to keep different runtime binaries installed at the same time, one from master and another from lets say, the CCv0 branch, with the following small change applied. diff --git a/src/runtime/Makefile b/src/runtime/Makefile index 95efaff78..2bab9eb75 100644 --- a/src/runtime/Makefile +++ b/src/runtime/Makefile @@ -231,7 +231,7 @@ SED = sed CLI_DIR = cmd SHIMV2 = containerd-shim-kata-v2 -SHIMV2_OUTPUT = $(bCURDIR)/$(SHIMV2) +SHIMV2_OUTPUT = $(CURDIR)/$(SHIMV2)-ccv0 SHIMV2_DIR = $(CLI_DIR)/$(SHIMV2) MONITOR = kata-monitor Fixes: #6398 Signed-off-by: Eduardo Lima (Etrunko) <etrunko@redhat.com>	2023-03-01 15:57:30 -03:00
yanggang	b6880c60d3	logging: Correct the code notes Fix wrong notes for func GetSandboxesStoragePathRust() Fixes: #6394 Signed-off-by: yanggang <gang.yang@daocloud.io>	2023-03-01 19:20:25 +08:00
Yushuo	12cfad4858	runtime-rs: modify the transfer to oci::Hooks In this commit, we have done: * modify the transfer process from grpc::Hooks to oci::Hooks, so the code is cleaner * add more tests for create_runtime, create_container, start_container hooks Signed-off-by: Yushuo <y-shuo@linux.alibaba.com>	2023-03-01 10:35:10 +08:00
Fabiano Fidêncio	828d467222	workflows: Do not install docker The latest ubuntu runners already have docker installed and trying to install it manually will cause the following issue: ``` Run curl -fsSL https://test.docker.com/ -o test-docker.sh Warning: the "docker" command appears to already exist on this system. If you already have Docker installed, this script can cause trouble, which is why we're displaying this warning and provide the opportunity to cancel the installation. If you installed the current Docker package using this script and are using it again to update Docker, you can safely ignore this message. You may press Ctrl+C now to abort this script. + sleep 20 + sudo -E sh -c apt-get update -qq >/dev/null E: The repository 'https://packages.microsoft.com/ubuntu/22.04/prod jammy Release' is no longer signed. ``` Fixes: #6390 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-02-28 23:53:28 +01:00
Alec Pemberton	4b8a5a1a3d	utils: Remove kata-manager.sh cgroups v2 check Removed the part in the `kata-manager.sh` script that checks if the host system only runs cgroups v2. Fixes: #6259. Signed-off-by: Alec Pemberton <pembek1901@gmail.com>	2023-02-28 11:23:51 -06:00
Steve Horsman	785310fe18	Merge pull request #6368 from yoheiueda/dir-perm agent: don't set permission of existing directory in copy_file	2023-02-28 14:48:10 +00:00
Chelsea Mafrica	703589c279	Merge pull request #6369 from XDTG/6082/Fix-path-check-bypassed runtime: use filepath.Clean() to clean the mount path	2023-02-27 17:24:50 -08:00
Bo Chen	ba9227184e	Merge pull request #6376 from likebreath/0224/clh_v30.0 Upgrade to Cloud Hypervisor v30.0	2023-02-27 11:48:52 -08:00
Yushuo	2c4428ee02	runtime-rs: move pre-start hooks to sandbox_start In some cases, network endpoints will be configured through the Prestart Hook. So network endpoints may need to be added (hotplugged) after the VM is started and the Prestart Hook is executed. We move the execution of the pre-start hook functions to sandbox_start to allow hooks to run between vm_start and netns_scan easily, so that the lifecycle API can be cleaner. Signed-off-by: Yushuo <y-shuo@linux.alibaba.com>	2023-02-27 21:56:43 +08:00
Yushuo	e80c9f7b74	runtime-rs: add StartContainer hook StartContainer is executed in the guest container namespace in Kata. The hook path of this kind of hook is also in the guest container namespace. StartContainer is executed after the start operation is called, and it should be executed before the user-specified command is executed. Fixes: #5787 Signed-off-by: Yushuo <y-shuo@linux.alibaba.com>	2023-02-27 21:56:43 +08:00
Yushuo	977f281c5c	runtime-rs: add CreateContainer hook support CreateContainer hook is one kind of OCI hook. In kata, it will be executed after VM is started, before container is created, and after CreateRuntime is executed. The hook path of CreateContainer hook is in host runtime namespace, but it will be executed in host vmm namespace. Fixes: #5787 Signed-off-by: Yushuo <y-shuo@linux.alibaba.com>	2023-02-27 21:56:43 +08:00
Yushuo	875f2db528	runtime-rs: add oci hook support According to the runtime OCI Spec, there can be some hook operations in the lifecycle of the container. In these hook operations, the runtime can execute some commands. There are different points in time in the container lifecycle and different hook types can be executed. In this commit, we are now supporting 4 types of hooks(same in runtime-go): Prestart hook, CreateRuntime hook, Poststart hook and Poststop hook. Fixes: #5787 Signed-off-by: Yushuo <y-shuo@linux.alibaba.com>	2023-02-27 21:56:43 +08:00
Yushuo	ecac3a9e10	docs: add design doc for Hooks Fixes: #5787 Signed-off-by: Yushuo <y-shuo@linux.alibaba.com>	2023-02-27 21:56:43 +08:00
Bin Liu	e90989b16b	Merge pull request #6314 from openanolis/static_doc feat(runtime): make static resource management consistent with 2.0	2023-02-27 16:43:27 +08:00
Bo Chen	3ac6f29e95	runtime: clh: Re-generate the client code This patch re-generates the client code for Cloud Hypervisor v30.0. Note: The client code of cloud-hypervisor's OpenAPI is automatically generated by openapi-generator. Fixes: #6375 Signed-off-by: Bo Chen <chen.bo@intel.com>	2023-02-24 10:20:29 -08:00
Bo Chen	262daaa2ef	versions: Upgrade to Cloud Hypervisor v30.0 Details of this release can be found in our new roadmap project as iteration v30.0: https://github.com/orgs/cloud-hypervisor/projects/6. Fixes: #6375 Signed-off-by: Bo Chen <chen.bo@intel.com>	2023-02-24 10:19:46 -08:00
Jeremi Piotrowski	192df84588	agent: always use cgroupfs when running as init The logic to decide which cgroup driver is used is currently based on the cgroup path that the host provides. This requires host and guest to use the same cgroup driver. If the guest uses kata-agent as init, then systemd can't be used as the cgroup driver. If the host requests a systemd cgroup, this currently results in a rustjail panic: thread 'tokio-runtime-worker' panicked at 'called `Result::unwrap()` on an `Err` value: I/O error: No such file or directory (os error 2) Caused by: No such file or directory (os error 2)', rustjail/src/cgroups/systemd/manager.rs:44:51 stack backtrace: 0: 0x7ff0fe77a793 - std::backtrace_rs::backtrace::libunwind::trace::h8c197fa9a679d134 at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/std/src/../../backtrace/src/backtrace/libunwind.rs:93:5 1: 0x7ff0fe77a793 - std::backtrace_rs::backtrace::trace_unsynchronized::h9ee19d58b6d5934a at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/std/src/../../backtrace/src/backtrace/mod.rs:66:5 2: 0x7ff0fe77a793 - std::sys_common::backtrace::_print_fmt::h4badc450600fc417 at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/std/src/sys_common/backtrace.rs:65:5 3: 0x7ff0fe77a793 - <std::sys_common::backtrace::_print::DisplayBacktrace as core::fmt::Display>::fmt::had334ddb529a2169 at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/std/src/sys_common/backtrace.rs:44:22 4: 0x7ff0fdce815e - core::fmt::write::h1aa7694f03e44db2 at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/core/src/fmt/mod.rs:1209:17 5: 0x7ff0fe74e0c4 - std::io::Write::write_fmt::h61b2bdc565be41b5 at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/std/src/io/mod.rs:1682:15 6: 0x7ff0fe77cd3f - std::sys_common::backtrace::_print::h4ec69798b72ff254 at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/std/src/sys_common/backtrace.rs:47:5 7: 0x7ff0fe77cd3f - std::sys_common::backtrace::print::h0e6c02048dec3c77 
at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/std/src/sys_common/backtrace.rs:34:9 8: 0x7ff0fe77c93f - std::panicking::default_hook::{{closure}}::hcdb7e705dc37ea6e at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/std/src/panicking.rs:267:22 9: 0x7ff0fe77d9b8 - std::panicking::default_hook::he03a933a0f01790f at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/std/src/panicking.rs:286:9 10: 0x7ff0fe77d9b8 - std::panicking::rust_panic_with_hook::he26b680bfd953008 at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/std/src/panicking.rs:688:13 11: 0x7ff0fe77d482 - std::panicking::begin_panic_handler::{{closure}}::h559120d2dd1c6180 at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/std/src/panicking.rs:579:13 12: 0x7ff0fe77d3ec - std::sys_common::backtrace::__rust_end_short_backtrace::h36db621fc93b005a at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/std/src/sys_common/backtrace.rs:137:18 13: 0x7ff0fe77d3c1 - rust_begin_unwind at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/std/src/panicking.rs:575:5 14: 0x7ff0fda52ee2 - core::panicking::panic_fmt::he7679b415d25c5f4 at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/core/src/panicking.rs:65:14 15: 0x7ff0fda53182 - core::result::unwrap_failed::hb71caff146724b6b at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/core/src/result.rs:1791:5 16: 0x7ff0fe5bd738 - <rustjail::cgroups::systemd::manager::Manager as rustjail::cgroups::Manager>::apply::hd46958d9d807d2ca 17: 0x7ff0fe606d80 - <rustjail::container::LinuxContainer as rustjail::container::BaseContainer>::start::{{closure}}::h1de806d91fcb878f 18: 0x7ff0fe604a76 - <core::future::from_generator::GenFuture<T> as core::future::future::Future>::poll::h1749c148adcc235f 19: 0x7ff0fdc0c992 - kata_agent::rpc::AgentService::do_create_container::{{closure}}::{{closure}}::hc1b87a15dfdf2f64 20: 0x7ff0fdb80ae4 - <core::future::from_generator::GenFuture<T> as 
core::future::future::Future>::poll::h846a8c9e4fb67707 21: 0x7ff0fe3bb816 - <core::future::from_generator::GenFuture<T> as core::future::future::Future>::poll::h53de16ff66ed3972 22: 0x7ff0fdb519cb - <core::future::from_generator::GenFuture<T> as core::future::future::Future>::poll::h1cbece980286c0f4 23: 0x7ff0fdf4019c - <tokio::future::poll_fn::PollFn<F> as core::future::future::Future>::poll::hc8e72d155feb8d1f 24: 0x7ff0fdfa5fd8 - tokio::loom::std::unsafe_cell::UnsafeCell<T>::with_mut::h0a407ffe2559449a 25: 0x7ff0fdf033a1 - tokio::runtime::task::raw::poll::h1045d9f1db9742de 26: 0x7ff0fe7a8ce2 - tokio::runtime::scheduler::multi_thread::worker::Context::run_task::h4924ae3464af7fbd 27: 0x7ff0fe7afb85 - tokio::runtime::task::raw::poll::h5c843be39646b833 28: 0x7ff0fe7a05ee - std::sys_common::backtrace::__rust_begin_short_backtrace::ha7777c55b98a9bd1 29: 0x7ff0fe7a9bdb - core::ops::function::FnOnce::call_once{{vtable.shim}}::h27ec83c953360cdd 30: 0x7ff0fe7801d5 - <alloc::boxed::Box<F,A> as core::ops::function::FnOnce<Args>>::call_once::hed812350c5aef7a8 at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/alloc/src/boxed.rs:1987:9 31: 0x7ff0fe7801d5 - <alloc::boxed::Box<F,A> as core::ops::function::FnOnce<Args>>::call_once::hc7df8e435a658960 at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/alloc/src/boxed.rs:1987:9 32: 0x7ff0fe7801d5 - std::sys::unix::thread::Thread::new::thread_start::h575491a8a17dbb33 at /rustc/69f9c33d71c871fc16ac445211281c6e7a340943/library/std/src/sys/unix/thread.rs:108:17 Forward the value of "init_mode" to AgentService, so that we can force cgroupfs when systemd is unavailable. Fixes: #5779 Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-02-24 14:02:11 +01:00
Jeremi Piotrowski	b0691806f1	agent: determine value of use_systemd_cgroup before LinuxContainer::new() Right now LinuxContainer::new() gets passed a CreateOpts struct, but then modifies the use_systemd_cgroup field inside that struct. Pull the cgroups path parsing logic into do_create_container, so that CreateOpts can be immutable in LinuxContainer::new. This is just moving things around, there should be no functional changes. Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-02-24 13:46:37 +01:00
XDTG	dc86d6dac3	runtime: use filepath.Clean() to clean the mount path Fix the path check bypass issue introduced by #6082: use filepath.Clean() to clean the path before the check Fixes: #6082 Signed-off-by: XDTG <click1799@163.com>	2023-02-24 15:48:09 +08:00
Yohei Ueda	c4ef5fd325	agent: don't set permission of existing directory This patch fixes the issue that do_copy_file changes the directory permission of the parent directory of a target file, even when the parent directory already exists. Fixes #6367 Signed-off-by: Yohei Ueda <yohei@jp.ibm.com>	2023-02-24 16:43:59 +09:00
Feng Wang	cbe6ad9034	runtime: support non-root for clh This change enables running the cloud-hypervisor VMM as a non-root user when the rootless flag is set to true in the configuration Fixes: #2567 Signed-off-by: Feng Wang <fwang@confluent.io>	2023-02-22 13:57:09 -08:00
Fabiano Fidêncio	44a780f262	Merge pull request #6262 from jepio/jepio/initrd-dev-nodes osbuilder: Include minimal set of device nodes in ubuntu initrd	2023-02-22 20:34:13 +01:00
GabyCT	a0b1f81867	Merge pull request #5958 from Apokleos/kata-ctl-exec kata-ctl/exec: add new command exec to enter guest VM.	2023-02-22 12:07:44 -06:00
Fabiano Fidêncio	109071855d	Merge pull request #6124 from Alex-Carter01/snp-kernel-config kernel: Add CONFIG_SEV_GUEST to SEV kernel config	2023-02-22 18:42:35 +01:00
David Esparza	5e2fe5f932	Merge pull request #6332 from jodh-intel/runtime-rs-ch-config-convert runtime-rs: Improve Cloud Hypervisor config handling	2023-02-22 10:15:50 -06:00
GabyCT	5c6e56931f	Merge pull request #6312 from Amulyam24/virtiofsd-fix virtiofsd: update to a valid path on ppc64le	2023-02-22 08:57:51 -06:00
James O. D. Hunt	3483272bbd	runtime-rs: ch: Enable initrd usage Allow an initrd/initramfs image to be used with Cloud Hypervisor, which is handled differently to the default rootfs image type. Fixes: #6335. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-02-22 10:55:01 +00:00
James O. D. Hunt	fbee6c820e	runtime-rs: Improve Cloud Hypervisor config handling Replace `cloud_hypervisor_vm_create_cfg()` with a set of `TryFrom` trait implementations in the new CH specific `convert.rs` to allow the generic `Hypervisor` configuration to be converted into the CH specific `VmConfig` type. Note that device configuration is not currently handled in `convert.rs` (it's handled in `inner_device.rs`). This change removes the old hard-coded CH specific configuration. Fixes: #6203. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-02-22 10:48:05 +00:00
Chao Wu	578f2e7c2e	Merge pull request #6080 from openanolis/rem runtime-rs: cleanup kata host share path	2023-02-22 17:45:24 +08:00
GabyCT	7aff118c82	Merge pull request #6236 from jepio/jepio/osbuilder-fix-default-make-target osbuilder: fix default build target in makefile	2023-02-21 17:00:21 -06:00
Alex Carter	1bff1ca30a	kernel: Add CONFIG_SEV_GUEST to SEV kernel config Adding kernel config to sev case since it is needed for SNP and SNP will use the SEV kernel. Incrementing kernel config version to reflect changes Fixes: #6123 Signed-off-by: Alex Carter <Alex.Carter@ibm.com>	2023-02-21 16:48:45 +00:00
GabyCT	fc5c62a5a1	Merge pull request #6330 from c3d/issue/6329-contribution-link-in-devguide devguide: Add link to the contribution guidelines	2023-02-21 09:17:20 -06:00
Fabiano Fidêncio	ab5b45f615	Merge pull request #6340 from fidencio/topic/ensure-go-binaries-can-still-run-on-ubuntu-2004 kata-deploy: Ensure go binaries can run on Ubuntu 20.04	2023-02-21 13:52:18 +01:00
Zhongtao Hu	4f20cb7ced	Merge pull request #6325 from HerlinCoder/herlincoder/config-manager dragonball: config_manager: preserve device when update	2023-02-21 17:51:41 +08:00
Jeremi Piotrowski	ad8968c8d9	rustjail: print type of cgroup manager Since the cgroup manager is wrapped in a dyn now, the print in LinuxContainer::new has been useless and just says "CgroupManager". Extend the Debug trait for 'dyn Manager' to print the type of the cgroup manager so that it's easier to debug issues. Fixes: #5779 Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-02-21 10:07:03 +01:00
SinghWang	b4a1527aa6	kata-deploy: Fix static shim-v2 build on arm64 Following Jong Wu's suggestion, let's link /usr/bin/musl-gcc to /usr/bin/aarch64-linux-musl-gcc. Fixes: #6320 Signed-off-by: SinghWang <wangxin_0611@126.com> Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-02-21 10:00:28 +01:00
Fabiano Fidêncio	2c4f8077fd	Revert "shim-v2: Bump Ubuntu container image to 22.04" This reverts commit `9d78bf9086`. Golang binaries are built statically by default, unless linking against CGO, which we do. In this case we dynamically link against glibc, causing us troubles when running a binary built with Ubuntu 22.04 on Ubuntu 20.04 (which will still be supported for the next few years ...) Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-02-21 10:00:28 +01:00
Fabiano Fidêncio	73d0ca0bd5	Merge pull request #6334 from fidencio/topic/fix-push-to-registry-behaviour Revert "workflows: Push the builder image to quay.io"	2023-02-21 10:00:13 +01:00
Bin Liu	5c16e98d4f	Merge pull request #6322 from Tim-Zhang/remove-remain-unsafe-impl Remove all remaining unsafe impl	2023-02-21 14:08:05 +08:00
Fabiano Fidêncio	afaccf924d	Revert "workflows: Push the builder image to quay.io" This reverts commit `b835c40bbd`. Right now I'm reverting this one as this should only run after commits get pushed to our repo, not on every PR.	2023-02-20 18:37:28 +01:00
Fabiano Fidêncio	b1fd4b093b	Merge pull request #6319 from singhwang/main kata-deploy: Fix building the kata static firecracker arm64 package occurred an error	2023-02-20 18:04:31 +01:00
Christophe de Dinechin	4c39c4ef9f	devguide: Add link to the contribution guidelines New developers are often confused by some of our requirements, notably porting labels. While our CONTRIBUTING.md file points to the solution, the developer's guide does not. Add a link there. Fixes: #6329 Signed-off-by: Christophe de Dinechin <christophe@dinechin.org>	2023-02-20 15:27:19 +01:00
Fabiano Fidêncio	a3b615919e	Merge pull request #6323 from fidencio/topic/fix-make-shim-v2-tarball-on-aarch64 shim-v2: Bump Ubuntu container image to 22.04	2023-02-20 14:57:34 +01:00
Jeremi Piotrowski	76e926453a	osbuilder: Include minimal set of device nodes in ubuntu initrd When starting an initrd the kernel expects to find /dev/console in the initrd, so that it can connect it as stdin/stdout/stderr to the /init process. If the device node is missing the kernel will complain that it was unable to open an initial console. If kata-agent is the initrd init process, it will also result in log messages not being logged to console and thus not forwarded to host syslog. Add a set of standard device nodes for completeness, so that console logging works. To do that we install the makedev package which provides a MAKEDEV helper that knows the major/minor numbers. Unfortunately the debian package tries to create devnodes from postinst, which can be suppressed if systemd-detect-virt is present. That's why we create a small dummy script that matches what systemd-detect-virt would output (anything is enough to suppress mknod). Fixes: #6261 Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-02-20 11:15:56 +01:00
Fabiano Fidêncio	6a0ac2b3a5	Merge pull request #6310 from kata-containers/topic/cache-artefacts-container-builder packaging: Cache the container used to build the kata-deploy artefacts	2023-02-20 11:02:53 +01:00
James O. D. Hunt	0dea57c452	Merge pull request #6309 from gabevenberg/always-check-deps utils: always check some dependencies.	2023-02-20 08:31:56 +00:00
SinghWang	697ec8e578	kata-deploy: Fix kata static firecracker arm64 package build error When building the kata static arm64 package, the stages of firecracker report errors. Fixes: #6318 Signed-off-by: SinghWang <wangxin_0611@126.com>	2023-02-20 16:10:18 +08:00
Helin Guo	ced3c99895	dragonball: config_manager: preserve device when update DeviceConfigInfo contains config and device, so when we want to do update we could simply update config part of the info, and device would not be changed during update. Fixes: #6324 Signed-off-by: Helin Guo <helinguo@linux.alibaba.com>	2023-02-20 14:34:09 +08:00
Tim Zhang	da8a6417aa	runtime-rs: remove all remaining unsafe impl Fixes: #6307 Signed-off-by: Tim Zhang <tim@hyper.sh>	2023-02-20 14:29:59 +08:00
Tim Zhang	0301194851	dragonball: use crossbeam_channel in VmmService instead of mpsc::channel crossbeam_channel has more features and better performance than mpsc::channel, and Rust itself replaced its channel implementation with crossbeam_channel in version 1.67. Signed-off-by: Tim Zhang <tim@hyper.sh>	2023-02-20 14:29:57 +08:00
Fabiano Fidêncio	9d78bf9086	shim-v2: Bump Ubuntu container image to 22.04 Let's bump the base container image to use the 22.04 version of Ubuntu, as it does bring up-to-date package dependencies that we need to statically build the runtime-rs on aarch64. Fixes: #6320 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-02-20 07:14:09 +01:00
Fabiano Fidêncio	299fc35c37	Merge pull request #6304 from fidencio/topic/switch-the-default-x86_64-rootfs-image-to-ubuntu versions: Use ubuntu as the default distro for the rootfs-image	2023-02-17 19:29:10 +01:00
Gabe Venberg	3cfce5a709	utils: improved unsupported distro message. Previously, when installing on an unknown distro, the script would tell the user that their distro was unsupported. The error message now prompts the user to install the dependencies manually, then retry. Signed-off-by: Gabe Venberg <gabevenberg@gmail.com>	2023-02-17 09:06:26 -06:00
Bin Liu	f44dae75c9	Merge pull request #6267 from jongwooo/github-action/replace-deprecated-command-with-environment-file github-action: Replace deprecated command with environment file	2023-02-17 22:54:12 +08:00
Fabiano Fidêncio	6a29088b81	Merge pull request #6298 from amshinde/update-release-doc docs: Change the order of release step	2023-02-17 15:46:12 +01:00
Ji-Xinyou	919d19f415	feat(runtime): make static resource management consistent with 2.0 * add doc in the configuration * make entry consistent with 2.0 Fixes: #6313 Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2023-02-17 21:36:56 +08:00
Bin Liu	b7fe29f033	Merge pull request #6308 from Tim-Zhang/remove-unnecessary-send-and-sync runtime-rs: remove unnecessary Send/Sync trait implement	2023-02-17 19:53:54 +08:00
Fabiano Fidêncio	b835c40bbd	workflows: Push the builder image to quay.io Let's push the builder images to a registry, so we can take advantage of those on each step of our building process. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-02-17 12:06:48 +01:00
Fabiano Fidêncio	781ed2986a	packaging: Allow passing a container builder to the scripts This, combined with the effort of caching builder images and only performing the build itself inside the builder images, is the very first step for reproducible builds for the project. Reproducible builds are quite important when we talk about Confidential Containers, as users may want to verify the content used / provided by the CSPs, and this is the first step towards that direction. Fixes: #5517 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-02-17 12:06:48 +01:00
Fabiano Fidêncio	45668fae15	packaging: Use existing image to build td-shim Let's first try to pull a pre-existing image, instead of building our own, to be used as a builder image for the td-shim. This will save us some CI time. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-02-17 12:06:48 +01:00
Fabiano Fidêncio	e8c6bfbdeb	packaging: Use existing image to build td-shim Let's first try to pull a pre-existing image, instead of building our own, to be used as a builder image for the td-shim. This will save us some CI time. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-02-17 12:06:48 +01:00
Fabiano Fidêncio	3fa24f7acc	packaging: Add infra to push the OVMF builder image Let's add the needed infra for building and pushing the OVMF builder image to the Kata Containers' quay.io registry. Fixes: #5477 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-02-17 12:06:48 +01:00
Fabiano Fidêncio	f076fa4c77	packaging: Use existing image to build OVMF Let's first try to pull a pre-existing image, instead of building our own, to be used as a builder image for OVMF. This will save us some CI time. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-02-17 12:06:48 +01:00
Fabiano Fidêncio	c7f515172d	packaging: Add infra to push the QEMU builder image Let's add the needed infra for only building and pushing the QEMU builder image to the Kata Containers' quay.io registry. Fixes: #5481 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-02-17 12:06:48 +01:00
Fabiano Fidêncio	fb7b86b8e0	packaging: Use existing image to build QEMU Let's first try to pull a pre-existing image, instead of building our own, to be used as a builder image for QEMU. This will save us some CI time. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-02-17 12:06:48 +01:00
Fabiano Fidêncio	d0181bb262	packaging: Add infra to push the virtiofsd builder image Let's add the needed infra for only building and pushing the virtiofsd builder image to the Kata Containers' quay.io registry. Fixes: #5480 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-02-17 12:06:48 +01:00
Fabiano Fidêncio	7c93428a18	packaging: Use existing image to build virtiofsd Let's first try to pull a pre-existing image, instead of building our own, to be used as a builder image for the virtiofsd. This will save us some CI time. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-02-17 12:06:48 +01:00
Fabiano Fidêncio	8c227e2471	virtiofsd: Pass the expected toolchain to the build container Let's ensure we're building virtiofsd with a specific toolchain that's known to not cause any issues, instead of always using the latest one. On each bump of the virtiofsd, we'll make sure to adjust this according to what's been used by the virtiofsd community. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-02-17 12:06:48 +01:00
Fabiano Fidêncio	7ee00d8e57	packaging: Add infra to push the shim-v2 builder image Let's add the needed infra for only building and pushing the shim-v2 builder image to the Kata Containers' quay.io registry. Fixes: #5478 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-02-17 12:06:47 +01:00
Fabiano Fidêncio	24767d82aa	packaging: Use existing image to build the shim-v2 Let's try to pull a pre-existing image, instead of building our own, to be used as a builder for the shim-v2. This will save us some CI time. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-02-17 12:06:24 +01:00
Amulyam24	e84af6a620	virtiofsd: update to a valid path on ppc64le Currently the symbolic link for virtiofsd which is used as a valid path is not updated on every CI run. Fix it by using the actual path of installation. Fixes: #6311 Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2023-02-17 16:22:39 +05:30
Fabiano Fidêncio	6c3c771a52	packaging: Add infra to push the kernel builder image Let's add the needed infra for only building and pushing the kernel builder image to the Kata Containers' quay.io registry. Fixes: #5476 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-02-17 11:30:28 +01:00
Fabiano Fidêncio	b9b23112bf	packaging: Use existing image to build the kernel Let's first try to pull a pre-existing image, instead of building our own, to be used as a builder image for the kernel. This will save us some CI time. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-02-17 11:30:28 +01:00
Fabiano Fidêncio	869827d77f	packaging: Add push_to_registry() This function will push a specific tag to a registry, whenever the PUSH_TO_REGISTRY environment variable is set, otherwise it's a no-op. This will be used in the future to avoid replicating that logic in every builder used by the kata-deploy scripts. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-02-17 11:30:21 +01:00
Fabiano Fidêncio	e69a6f5749	packaging: Add get_last_modification() Let's add a function to get the hash of the last commit modifying a specific file. This will help to avoid writing `git rev-list ...` into every single build script used by the kata-deploy. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-02-17 10:39:33 +01:00
Fabiano Fidêncio	6c05e5c67a	packaging: Add and export BUILDER_REGISTRY BUILDER_REGISTRY, which points to quay.io/kata-containers/builder, will be used for storing the builder images used to build the artefacts via the kata-deploy scripts. The plan is to tag, whenever it's possible and makes sense, images like: * ${BUILDER_REGISTRY}:${component}-${unique_identifier} Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-02-17 10:39:33 +01:00
Fabiano Fidêncio	bd9af5569f	Merge pull request #6296 from fidencio/topic/dont-build-runtime-rs-for-ppc64le-2nd-try runtime-rs: Don't build on Power, don't break on Power.	2023-02-17 10:08:39 +01:00
Gabe Venberg	1047840cf8	utils: always check some dependencies. Every dependency in check_deps is used inside the script (apart from git, which may be a historical artifact), and therefore should be checked even when the -f option is passed to the script. Simply changed at what point check_deps is called in order to always run it. Fixes #6302. Signed-off-by: Gabe Venberg <gabevenberg@gmail.com>	2023-02-16 23:00:19 -06:00
Tim Zhang	95e3364493	runtime-rs: remove unnecessary Send/Sync trait implement Send and Sync are automatically derived traits, if a type is composed entirely of Send or Sync types, then it is Send or Sync. Almost all primitives are Send and Sync, so we don't need to implement them manually most of the time. Fixes: #6307 Signed-off-by: Tim Zhang <tim@hyper.sh>	2023-02-17 11:51:13 +08:00
Archana Shinde	a96ba99239	actions: Use `git-diff` to get changes in kernel dir Use `git-diff` instead of legacy `git-whatchanged` to get differences in the packaging/kernel directory. This also fixes a bug by grepping for the kernel directory in the output of the git command. Fixes: #6210 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-02-16 17:33:41 -08:00
Archana Shinde	619ef54452	docs: Change the order of release step When a new stable branch is created, it is necessary to change the references in the tests repo from main to the new stable branch. However this step needs to be performed after the repos have been tagged as the `tags_repos.sh` script is the one that creates the new branch. Clarify this in the documentation and move the step to change branch references in test repo after repos have been tagged. Fixes: #1824 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-02-16 12:12:21 -08:00
Fabiano Fidêncio	a161d11920	versions: Use ubuntu as the default distro for the rootfs-image Currently ubuntu is already the default distro for all the architectures but x86_64, which uses clearlinux. However, our CI does not test the clearlinux image we ship. Taking a look at our CI code [0], we've been using ubuntu as base for the tests for a few years already, if not forever. The minimum we can do is to switch to distributing ubuntu, as the tested rootfs-image, and then decide later on whether we should switch back to clearlinux (once we switch our CI to using that, and make sure all tests will be green), or if we move to slimmer distro, such as alpine. [0]: `0a39dd1a01/.ci/install_kata_image.sh (L44)` Fixes: #6303 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-02-16 20:30:40 +01:00
Fabiano Fidêncio	be40683bc5	runtime-rs: Add a generic powerpc64le-options.mk There's a check in the runtime-rs Makefile that basically checks whether the `arch/$arch-options.mk` exists or not and, if it doesn't, the build is just aborted. With this in mind, let's create a generic powerpc64le-options.mk file and not bail when building for this architecture. Fixes: #6142 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-02-16 16:29:24 +01:00
Fabiano Fidêncio	47c058599a	packaging/shim-v2: Install the target depending on the arch/libc In the `install_go_rust.sh` file we're adding a x86_64-unknown-linux-musl target unconditionally. That should be, instead, based on the ARCH of the host and the appropriate LIBC to be used with that host. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-02-16 16:29:24 +01:00
Fabiano Fidêncio	c1602c848a	Merge pull request #6300 from openanolis/footloose runtime-rs: handle sys_dir bind volume	2023-02-16 12:53:15 +01:00
alex.lyn	b582c0db86	kata-ctl/exec: add new command exec to enter guest VM. The patchset helps users easily enter the guest VM through the debug console socket. To enter the guest VM smoothly, users need to apply one of the following configuration options: (1) Set debug_console_enabled = true with default vport 1026. (2) Or add agent.debug_console agent.debug_console_vport=<PORT> into kernel_params, and the vport is the <PORT> you set. The detail of usage: $ kata-ctl exec -h kata-ctl-exec Enter into guest VM by debug console USAGE: kata-ctl exec [OPTIONS] <SANDBOX_ID> ARGS: <SANDBOX_ID> pod sandbox ID Fixes: #5340 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-02-16 17:05:53 +08:00
Yushuo	07802a19dc	runtime-rs: handle sys_dir bind volume For some cases, users will mount system directories as bind volume. We should not bind mount these kind of directories in the host as it does not make sense. Fixes: #6299 Signed-off-by: Yushuo <y-shuo@linux.alibaba.com>	2023-02-16 15:45:33 +08:00
Bin Liu	629a31ec6e	Merge pull request #6287 from lifupan/main sandbox: set the dns for the sandbox	2023-02-16 15:00:01 +08:00
Fabiano Fidêncio	f5b28736ce	Merge pull request #6294 from fidencio/topic/only-change-configs-if-the-config-files-exist packaging/shim-v2: Only change the config if the file exists	2023-02-16 07:13:28 +01:00
Fupan Li	04e930073c	sandbox: set the dns for the sandbox The rust agent already supports setting the guest dns server in the start sandbox request, so add the dns on the runtime side. Fixes: #6286 Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>	2023-02-16 11:25:02 +08:00
Fupan Li	32ebe1895b	agent: fix the issue of creating the dns file We should make sure the parent directory of the dns source file exists; otherwise, creating the file directly would fail. Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>	2023-02-16 11:24:54 +08:00
Peng Tao	139ad8e95f	Merge pull request #6201 from jodh-intel/runtime-rs-add-cloud-hypervisor runtime-rs: Add basic CH implementation	2023-02-16 11:23:04 +08:00
Archana Shinde	eba2bb275d	Merge pull request #6284 from amshinde/revert-kata-deploy-changes-after-3.1.0-rc0-release release: Revert kata-deploy changes after 3.1.0-rc0 release	2023-02-15 14:50:12 -08:00
jongwooo	44aaec9020	github-action: Replace deprecated command with environment file In workflow, `set-output` command is deprecated and will be disabled soon. This commit replaces the deprecated `set-output` command with putting a value in the environment file `$GITHUB_OUTPUT`. Fixes #6266 Signed-off-by: jongwooo <jongwooo.han@gmail.com>	2023-02-16 01:41:03 +09:00
Hyounggyu Choi	a68c5004f8	packaging/shim-v2: Only change the config if the file exists Let's not try to sed a file that doesn't exist, which may be the case depending on the architecture we're building the shim-v2 for. This is a partial-forward port of `f24c47ea47`. Fixes: #6293 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com> Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-02-15 17:00:53 +01:00
Archana Shinde	ee76b398b3	release: Revert kata-deploy changes after 3.1.0-rc0 release As 3.1.0-rc0 has been released, let's switch the kata-deploy / kata-cleanup tags back to "latest", and re-add the kata-deploy-stable and the kata-cleanup-stable files. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-02-14 15:47:51 -08:00
James O. D. Hunt	bbc733d6c8	docs: runtime-rs: Add CH status details Add a few details about the current state of the Cloud Hypervisor (CH) runtime-rs external hypervisor implementation with pointers to the appropriate issues. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-02-14 15:38:46 +00:00
James O. D. Hunt	37b594c0d2	runtime-rs: Add basic CH implementation Add a basic runtime-rs `Hypervisor` trait implementation for Cloud Hypervisor (CH). > Notes: > > - This only supports a default Kata configuration for CH currently. > > - Since this feature is still under development, `cargo` features have > been added to enable the feature optionally. The default is to not enable > currently since the code is not ready for general use. > > To enable the feature for testing and development, enable the > `cloud-hypervisor` feature in the `virt_container` crate and enable the > `cloud-hypervisor` feature for its `hypervisor` dependency. Fixes: #5242. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-02-14 15:38:39 +00:00
James O. D. Hunt	545151829d	kata-types: Add Cloud Hypervisor (CH) definitions Implement `ConfigPlugin` trait for Cloud Hypervisor (CH). Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-02-13 10:25:29 +00:00
Zhongtao Hu	2dd2421ad0	runtime-rs: cleanup kata host share path cleanup the /run/kata-containers/shared/sandboxes/pid path Fixes:#5975 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2023-02-13 13:07:07 +08:00
Jeremi Piotrowski	0a21ad78b1	osbuilder: fix default build target in makefile The .dracut_rootfs.done file is accidentally being picked up as the default target, regardless of BUILD_METHOD. Move the 'all' target definition up, so that it's the default (=first) target in the makefile. Additionally make the .dracut_rootfs.done target conditional on the right BUILD_METHOD being selected, as building it doesn't make sense with BUILD_METHOD=distro. Fixes: #6235 Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-02-07 18:36:03 +01:00
wllenyj	9a01d4e446	dragonball: add more unit test for virtio-blk device. Added more unit tests for virtio-blk device. Fixes: #4899 Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2023-02-07 17:16:11 +08:00
Archana Shinde	d3bb254188	utils: Add function to check vhost-vsock Add function to check if the host-system has the vhost-vsock kernel module. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-02-03 15:41:59 -08:00
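
The packaging commits in the log above (`e69a6f5749`, `869827d77f`, `6c05e5c67a`) describe two shell helpers and a registry variable without showing them. A minimal sketch of how such helpers might look follows. The function bodies, the `CONTAINER_TOOL` variable, and the non-empty check on `PUSH_TO_REGISTRY` are assumptions made for illustration, not the code from the repository.

```bash
#!/usr/bin/env bash
# Sketch only: these bodies are assumptions, not the kata-containers code.

# Registry named in the commit message for storing builder images.
BUILDER_REGISTRY="${BUILDER_REGISTRY:-quay.io/kata-containers/builder}"

# Print the hash of the most recent commit that modified the given path.
# Wraps the `git rev-list ...` call the commit message refers to.
get_last_modification() {
	local path="${1}"
	git rev-list -1 HEAD -- "${path}"
}

# Push an image tag only when PUSH_TO_REGISTRY is set to a non-empty value;
# otherwise this is a no-op. CONTAINER_TOOL is a hypothetical override so the
# sketch can be exercised without docker installed.
push_to_registry() {
	local tag="${1}"
	if [ -n "${PUSH_TO_REGISTRY:-}" ]; then
		"${CONTAINER_TOOL:-docker}" push "${tag}"
	fi
}
```

Following the tagging scheme quoted in the log, a caller could build a tag such as `"${BUILDER_REGISTRY}:kernel-$(get_last_modification "${some_path}")"` and pass it to `push_to_registry`, where `some_path` stands for whichever file or directory defines that builder image.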

4511 changed files with 788245 additions and 246285 deletions

									
										4

.github/workflows/PR-wip-checks.yaml
									
												
				@@ -9,6 +9,10 @@ on:

				      - labeled

				      - unlabeled

				concurrency:

				  group: ${{ github.workflow }}-${{ github.event.pull_request.number || github.ref }}

				  cancel-in-progress: true

				jobs:

				  pr_wip_check:

				    runs-on: ubuntu-latest

									
										100

.github/workflows/add-backport-label.yaml
									
											
				@@ -1,100 +0,0 @@

				name: Add backport label

				on:

				  pull_request:

				    types:

				      - opened

				      - synchronize

				      - reopened

				      - edited

				      - labeled

				      - unlabeled

				jobs:

				  check-issues:

				    if: ${{ github.event.label.name != 'auto-backport' }}

				    runs-on: ubuntu-latest

				    steps:

				      - name: Checkout code to allow hub to communicate with the project

				        if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') }}

				        uses: actions/checkout@v3

				      - name: Install hub extension script

				        run: |

				          pushd $(mktemp -d) &>/dev/null

				          git clone --single-branch --depth 1 "https://github.com/kata-containers/.github" && cd .github/scripts

				          sudo install hub-util.sh /usr/local/bin

				          popd &>/dev/null

				      - name: Determine whether to add label 

				        if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') }}

				        env:

				          GITHUB_TOKEN: ${{ secrets.GITHUB_TOKEN }}

				          CONTAINS_AUTO_BACKPORT: ${{ contains(github.event.pull_request.labels.*.name, 'auto-backport') }}

				        id: add_label

				        run: |

				          pr=${{ github.event.pull_request.number }}

				          linked_issue_urls=$(hub-util.sh \

				            list-issues-for-pr "$pr" |\

				            grep -v "^\#"  |\

				            cut -d';' -f3 || true)

				          [ -z "$linked_issue_urls" ] && {

				            echo "::error::No linked issues for PR $pr"

				            exit 1

				          }

				          has_bug=false

				          for issue_url in $(echo "$linked_issue_urls")

				          do

				            issue=$(echo "$issue_url"| awk -F\/ '{print $NF}' || true)

				            [ -z "$issue" ] && {

				              echo "::error::Cannot determine issue number from $issue_url for PR $pr"

				              exit 1

				            }

				            labels=$(hub-util.sh list-labels-for-issue "$issue")

				            label_names=$(echo $labels | jq -r '.[].name' || true)

				            if [[ "$label_names" =~ "bug" ]]; then

				              has_bug=true

				              break

				            fi

				          done

				          has_backport_needed_label=${{ contains(github.event.pull_request.labels.*.name, 'needs-backport') }}

				          has_no_backport_needed_label=${{ contains(github.event.pull_request.labels.*.name, 'no-backport-needed') }}

				          echo "::set-output name=add_backport_label::false"

				          if [ $has_backport_needed_label  = true ] || [ $has_bug  = true ]; then

				            if [[ $has_no_backport_needed_label = false ]]; then

				              echo "::set-output name=add_backport_label::true"

				            fi

				          fi

				          # Do not spam comment, only if auto-backport label is going to be newly added.

				          echo "::set-output name=auto_backport_added::$CONTAINS_AUTO_BACKPORT"

				      - name: Add comment

				        if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') && steps.add_label.outputs.add_backport_label == 'true' && steps.add_label.outputs.auto_backport_added == 'false' }}

				        uses: actions/github-script@v6

				        with:

				          script: |

				            github.rest.issues.createComment({

				              issue_number: context.issue.number,

				              owner: context.repo.owner,

				              repo: context.repo.repo,

				              body: 'This issue has been marked for auto-backporting. Add label(s) backport-to-BRANCHNAME to backport to them'

				            })

				      # Allow label to be removed by adding no-backport-needed label

				      - name: Remove auto-backport label

				        if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') && steps.add_label.outputs.add_backport_label == 'false' }}

				        uses: andymckay/labeler@e6c4322d0397f3240f0e7e30a33b5c5df2d39e90

				        with:

				          remove-labels: "auto-backport"

				          repo-token: ${{ secrets.GITHUB_TOKEN }}

				      - name: Add auto-backport label

				        if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') && steps.add_label.outputs.add_backport_label == 'true' }}

				        uses: andymckay/labeler@e6c4322d0397f3240f0e7e30a33b5c5df2d39e90

				        with:

				          add-labels: "auto-backport"

				          repo-token: ${{ secrets.GITHUB_TOKEN }}
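
The removed step above still reports its result with `::set-output`, the command that commit `44aaec9020` in the log replaces with the `$GITHUB_OUTPUT` environment file. As a rough sketch, the same decision could be written to that file as shown below. The function names are hypothetical, and the fallback to a temporary file is an assumption added only so the script also runs outside GitHub Actions.

```bash
#!/usr/bin/env bash
# Sketch: write step outputs to the file named by GITHUB_OUTPUT instead of
# echoing "::set-output name=...::...".

# Assumption: fall back to a temporary file when not running in a workflow.
GITHUB_OUTPUT="${GITHUB_OUTPUT:-$(mktemp)}"

# decide_backport_label <has_needs_backport> <has_bug> <has_no_backport_needed>
# Prints "true" when the PR needs a backport (or fixes a bug) and is not
# marked no-backport-needed; prints "false" otherwise.
decide_backport_label() {
	local needs_backport="${1}" has_bug="${2}" no_backport="${3}"
	if { [ "${needs_backport}" = true ] || [ "${has_bug}" = true ]; } \
		&& [ "${no_backport}" = false ]; then
		echo true
	else
		echo false
	fi
}

# write_output <name> <value>: append "name=value" to the outputs file.
write_output() {
	echo "${1}=${2}" >> "${GITHUB_OUTPUT}"
}

write_output add_backport_label "$(decide_backport_label false true false)"
```

Later steps would read the value the same way as before, through `steps.<id>.outputs.add_backport_label`.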

									
										6

.github/workflows/add-issues-to-project.yaml
									
												
				@@ -11,6 +11,10 @@ on:

				      - opened

				      - reopened

				concurrency:

				  group: ${{ github.workflow }}-${{ github.event.pull_request.number || github.ref }}

				  cancel-in-progress: true

				jobs:

				  add-new-issues-to-backlog:

				    runs-on: ubuntu-latest

				@@ -35,7 +39,7 @@ jobs:

				          popd &>/dev/null

				      - name: Checkout code to allow hub to communicate with the project

				-        uses: actions/checkout@v2

				+        uses: actions/checkout@v4

				      - name: Add issue to issue backlog

				        env:

									
										15

.github/workflows/add-pr-sizing-label.yaml
									
												
				@@ -12,12 +12,25 @@ on:

				      - reopened

				      - synchronize

				concurrency:

				  group: ${{ github.workflow }}-${{ github.event.pull_request.number || github.ref }}

				  cancel-in-progress: true

				jobs:

				  add-pr-size-label:

				    runs-on: ubuntu-latest

				    steps:

				      - name: Checkout code

				-        uses: actions/checkout@v1

				+        uses: actions/checkout@v4

				        with:

				          ref: ${{ github.event.pull_request.head.sha }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ github.event.pull_request.base.ref }}

				      - name: Install PR sizing label script

				        run: |

									
										29

.github/workflows/auto-backport.yaml
									
				@@ -1,29 +0,0 @@

				on:

				  pull_request_target:

				    types: ["labeled", "closed"]

				jobs:

				  backport:

				    name: Backport PR

				    runs-on: ubuntu-latest

				    if: |

				      github.event.pull_request.merged == true

				      && contains(github.event.pull_request.labels.*.name, 'auto-backport')

				      && (

				        (github.event.action == 'labeled' && github.event.label.name == 'auto-backport')

				        || (github.event.action == 'closed')

				      )

				    steps:

				      - name: Backport Action

				        uses: sqren/backport-github-action@v8.9.2

				        with:

				          github_token: ${{ secrets.GITHUB_TOKEN }}

				          auto_backport_label_prefix: backport-to-

				      - name: Info log

				        if: ${{ success() }}

				        run: cat /home/runner/.backport/backport.info.log

				      - name: Debug log

				        if: ${{ failure() }}

				        run: cat /home/runner/.backport/backport.debug.log

									
										336

.github/workflows/basic-ci-amd64.yaml
									
				@@ -0,0 +1,336 @@

				name: CI | Basic amd64 tests

				on:

				  workflow_call:

				    inputs:

				      tarball-suffix:

				        required: false

				        type: string

				      commit-hash:

				        required: false

				        type: string

				      target-branch:

				        required: false

				        type: string

				        default: ""

				jobs:

				  run-cri-containerd:

				    strategy:

				      # We can set this to true whenever we're 100% sure that

				      # all the tests are not flaky, otherwise we'll fail them

				      # all due to a single flaky instance.

				      fail-fast: false

				      matrix:

				        containerd_version: ['lts', 'active']

				        vmm: ['clh', 'dragonball', 'qemu', 'stratovirt', 'cloud-hypervisor', 'qemu-runtime-rs']

				    runs-on: garm-ubuntu-2204-smaller

				    env:

				      CONTAINERD_VERSION: ${{ matrix.containerd_version }}

				      GOPATH: ${{ github.workspace }}

				      KATA_HYPERVISOR: ${{ matrix.vmm }}

				    steps:

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: Install dependencies

				        run: bash tests/integration/cri-containerd/gha-run.sh install-dependencies

				      - name: get-kata-tarball

				        uses: actions/download-artifact@v4

				        with:

				          name: kata-static-tarball-amd64${{ inputs.tarball-suffix }}

				          path: kata-artifacts

				      - name: Install kata

				        run: bash tests/integration/cri-containerd/gha-run.sh install-kata kata-artifacts

				      - name: Run cri-containerd tests

				        timeout-minutes: 10

				        run: bash tests/integration/cri-containerd/gha-run.sh run

				  run-containerd-stability:

				    strategy:

				      fail-fast: false

				      matrix:

				        containerd_version: ['lts', 'active']

				        vmm: ['clh', 'cloud-hypervisor', 'dragonball', 'qemu', 'stratovirt']

				    runs-on: garm-ubuntu-2204-smaller

				    env:

				      CONTAINERD_VERSION: ${{ matrix.containerd_version }}

				      GOPATH: ${{ github.workspace }}

				      KATA_HYPERVISOR: ${{ matrix.vmm }}

				    steps:

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: Install dependencies

				        run: bash tests/stability/gha-run.sh install-dependencies

				      - name: get-kata-tarball

				        uses: actions/download-artifact@v4

				        with:

				          name: kata-static-tarball-amd64${{ inputs.tarball-suffix }}

				          path: kata-artifacts

				      - name: Install kata

				        run: bash tests/stability/gha-run.sh install-kata kata-artifacts

				      - name: Run containerd-stability tests

				        timeout-minutes: 15

				        run: bash tests/stability/gha-run.sh run

				  run-nydus:

				    strategy:

				      # We can set this to true whenever we're 100% sure that

				      # all the tests are not flaky, otherwise we'll fail them

				      # all due to a single flaky instance.

				      fail-fast: false

				      matrix:

				        containerd_version: ['lts', 'active']

				        vmm: ['clh', 'qemu', 'dragonball', 'stratovirt']

				    runs-on: garm-ubuntu-2204-smaller

				    env:

				      CONTAINERD_VERSION: ${{ matrix.containerd_version }}

				      GOPATH: ${{ github.workspace }}

				      KATA_HYPERVISOR: ${{ matrix.vmm }}

				    steps:

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: Install dependencies

				        run: bash tests/integration/nydus/gha-run.sh install-dependencies

				      - name: get-kata-tarball

				        uses: actions/download-artifact@v4

				        with:

				          name: kata-static-tarball-amd64${{ inputs.tarball-suffix }}

				          path: kata-artifacts

				      - name: Install kata

				        run: bash tests/integration/nydus/gha-run.sh install-kata kata-artifacts

				      - name: Run nydus tests

				        timeout-minutes: 10

				        run: bash tests/integration/nydus/gha-run.sh run

				  run-runk:

				    runs-on: garm-ubuntu-2204-smaller

				    env:

				      CONTAINERD_VERSION: lts

				    steps:

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: Install dependencies

				        run: bash tests/integration/runk/gha-run.sh install-dependencies

				      - name: get-kata-tarball

				        uses: actions/download-artifact@v4

				        with:

				          name: kata-static-tarball-amd64${{ inputs.tarball-suffix }}

				          path: kata-artifacts

				      - name: Install kata

				        run: bash tests/integration/runk/gha-run.sh install-kata kata-artifacts

				      - name: Run runk tests

				        timeout-minutes: 10

				        run: bash tests/integration/runk/gha-run.sh run

				  run-tracing:

				    strategy:

				      fail-fast: false

				      matrix:

				        vmm:

				          - clh # cloud-hypervisor

				          - qemu

				    runs-on: garm-ubuntu-2204-smaller

				    env:

				      KATA_HYPERVISOR: ${{ matrix.vmm }}

				    steps:

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: Install dependencies

				        run: bash tests/functional/tracing/gha-run.sh install-dependencies

				      - name: get-kata-tarball

				        uses: actions/download-artifact@v4

				        with:

				          name: kata-static-tarball-amd64${{ inputs.tarball-suffix }}

				          path: kata-artifacts

				      - name: Install kata

				        run: bash tests/functional/tracing/gha-run.sh install-kata kata-artifacts

				      - name: Run tracing tests

				        timeout-minutes: 15

				        run: bash tests/functional/tracing/gha-run.sh run

				  run-vfio:

				    strategy:

				      fail-fast: false

				      matrix:

				        vmm: ['clh', 'qemu']

				    runs-on: garm-ubuntu-2304

				    env:

				      GOPATH: ${{ github.workspace }}

				      KATA_HYPERVISOR: ${{ matrix.vmm }}

				    steps:

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: Install dependencies

				        run: bash tests/functional/vfio/gha-run.sh install-dependencies

				      - name: get-kata-tarball

				        uses: actions/download-artifact@v4

				        with:

				          name: kata-static-tarball-amd64${{ inputs.tarball-suffix }}

				          path: kata-artifacts

				      - name: Run vfio tests

				        timeout-minutes: 15

				        run: bash tests/functional/vfio/gha-run.sh run

				  run-docker-tests:

				    strategy:

				      # We can set this to true whenever we're 100% sure that

				      # all the tests are not flaky, otherwise we'll fail them

				      # all due to a single flaky instance.

				      fail-fast: false

				      matrix:

				        vmm:

				          - clh

				          - qemu

				    runs-on: garm-ubuntu-2304-smaller

				    env:

				      KATA_HYPERVISOR: ${{ matrix.vmm }}

				    steps:

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: Install dependencies

				        run: bash tests/integration/docker/gha-run.sh install-dependencies

				      - name: get-kata-tarball

				        uses: actions/download-artifact@v4

				        with:

				          name: kata-static-tarball-amd64${{ inputs.tarball-suffix }}

				          path: kata-artifacts

				      - name: Install kata

				        run: bash tests/integration/docker/gha-run.sh install-kata kata-artifacts

				      - name: Run docker smoke test

				        timeout-minutes: 5

				        run: bash tests/integration/docker/gha-run.sh run

				  run-nerdctl-tests:

				    strategy:

				      # We can set this to true whenever we're 100% sure that

				      # all the tests are not flaky, otherwise we'll fail them

				      # all due to a single flaky instance.

				      fail-fast: false

				      matrix:

				        vmm:

				          - clh

				          - dragonball

				          - qemu

				          - cloud-hypervisor

				    runs-on: garm-ubuntu-2304-smaller

				    env:

				      KATA_HYPERVISOR: ${{ matrix.vmm }}

				    steps:

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: Install dependencies

				        run: bash tests/integration/nerdctl/gha-run.sh install-dependencies

				      - name: get-kata-tarball

				        uses: actions/download-artifact@v4

				        with:

				          name: kata-static-tarball-amd64${{ inputs.tarball-suffix }}

				          path: kata-artifacts

				      - name: Install kata

				        run: bash tests/integration/nerdctl/gha-run.sh install-kata kata-artifacts

				      - name: Run nerdctl smoke test

				        timeout-minutes: 5

				        run: bash tests/integration/nerdctl/gha-run.sh run

				      - name: Collect artifacts ${{ matrix.vmm }}

				        run: bash tests/integration/nerdctl/gha-run.sh collect-artifacts

				      - name: Archive artifacts ${{ matrix.vmm }}

				        uses: actions/upload-artifact@v4

				        with:

				          name: nerdctl-tests-garm-${{ matrix.vmm }}

				          path: /tmp/artifacts

				          retention-days: 1
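
Every job in basic-ci-amd64.yaml drives its suite through a `gha-run.sh` script that takes an action name (`install-dependencies`, `install-kata`, `run`, and, for nerdctl, `collect-artifacts`). Those scripts are not part of this diff, so the following is only a minimal sketch of how such a dispatcher could look. The function name `gha_run_sketch`, the echoed messages, and the `qemu` fallback are hypothetical.

```shell
# Hypothetical dispatcher; this is NOT the project's gha-run.sh.
gha_run_sketch() {
    local action="${1:-}"
    case "${action}" in
        install-dependencies)
            echo "installing dependencies"
            ;;
        install-kata)
            # Second argument: directory holding the downloaded tarball.
            echo "installing kata from ${2:-kata-artifacts}"
            ;;
        run)
            # KATA_HYPERVISOR is set by the job's env block (matrix.vmm);
            # the qemu fallback here is an assumption for illustration.
            echo "running tests with ${KATA_HYPERVISOR:-qemu}"
            ;;
        *)
            echo "unknown action: ${action}" >&2
            return 1
            ;;
    esac
}
```

Called as `gha_run_sketch install-kata kata-artifacts`, this mirrors the shape of the "Install kata" steps above; an unknown action returns a non-zero status so the workflow step would fail.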

									
										113

.github/workflows/build-checks.yaml
									
				@@ -0,0 +1,113 @@

				on:

				  workflow_call:

				    inputs:

				      instance:

				        required: true

				        type: string

				name: Build checks

				jobs:

				  check:

				    runs-on: ${{ inputs.instance }}

				    strategy:

				      fail-fast: false

				      matrix:

				        component:

				          - agent

				          - dragonball

				          - runtime

				          - runtime-rs

				          - agent-ctl

				          - kata-ctl

				          - runk

				          - trace-forwarder

				          - genpolicy

				        command:

				          - "make vendor"

				          - "make check"

				          - "make test"

				          - "sudo -E PATH=\"$PATH\" make test"

				        include:

				          - component: agent

				            component-path: src/agent

				          - component: dragonball

				            component-path: src/dragonball

				          - component: runtime

				            component-path: src/runtime

				          - component: runtime-rs

				            component-path: src/runtime-rs

				          - component: agent-ctl

				            component-path: src/tools/agent-ctl

				          - component: kata-ctl

				            component-path: src/tools/kata-ctl

				          - component: runk

				            component-path: src/tools/runk

				          - component: trace-forwarder

				            component-path: src/tools/trace-forwarder

				          - install-libseccomp: no

				          - component: agent

				            install-libseccomp: yes

				          - component: runk

				            install-libseccomp: yes

				          - component: genpolicy

				            component-path: src/tools/genpolicy

				    steps:

				      - name: Adjust a permission for repo

				        run: |

				          sudo chown -R $USER:$USER $GITHUB_WORKSPACE $HOME

				          sudo rm -rf $GITHUB_WORKSPACE/* && echo "GITHUB_WORKSPACE removed" || { sleep 10 && sudo rm -rf $GITHUB_WORKSPACE/*; }

				          sudo rm -f /tmp/kata_hybrid*  # Sometimes leftovers from test_setup_hvsock_failed() remain

				        if: ${{ inputs.instance != 'ubuntu-20.04' }}

				      - name: Checkout the code

				        uses: actions/checkout@v4

				        with:

				          fetch-depth: 0

				      - name: Install yq

				        run: |

				          ./ci/install_yq.sh

				        env:

				          INSTALL_IN_GOPATH: false

				      - name: Install golang

				        if: ${{ matrix.component == 'runtime' }}

				        run: |

				          ./tests/install_go.sh -f -p

				          echo "/usr/local/go/bin" >> $GITHUB_PATH

				      - name: Install rust

				        if: ${{ matrix.component != 'runtime' }}

				        run: |

				          ./tests/install_rust.sh

				          echo "${HOME}/.cargo/bin" >> $GITHUB_PATH

				      - name: Install musl-tools

				        if: ${{ matrix.component != 'runtime' }}

				        run: sudo apt-get -y install musl-tools

				      - name: Install devicemapper

				        if: ${{ matrix.command == 'make check' && matrix.component == 'agent' }}

				        run: sudo apt-get -y install libdevmapper-dev

				      - name: Install libseccomp

				        if: ${{ matrix.command != 'make vendor'  &&  matrix.command != 'make check' &&  matrix.install-libseccomp == 'yes' }}

				        run: |

				          libseccomp_install_dir=$(mktemp -d -t libseccomp.XXXXXXXXXX)

				          gperf_install_dir=$(mktemp -d -t gperf.XXXXXXXXXX)

				          ./ci/install_libseccomp.sh "${libseccomp_install_dir}" "${gperf_install_dir}"

				          echo "Set environment variables for the libseccomp crate to link the libseccomp library statically"

				          echo "LIBSECCOMP_LINK_TYPE=static" >> $GITHUB_ENV

				          echo "LIBSECCOMP_LIB_PATH=${libseccomp_install_dir}/lib" >> $GITHUB_ENV

				      - name: Install protobuf-compiler

				        if: ${{ matrix.command != 'make vendor' && (matrix.component == 'agent' || matrix.component == 'runk' || matrix.component == 'genpolicy') }}

				        run: sudo apt-get -y install protobuf-compiler

				      - name: Install clang

				        if: ${{ matrix.command == 'make check' && matrix.component == 'agent' }}

				        run: sudo apt-get -y install clang

				      - name: Setup XDG_RUNTIME_DIR for the `runtime` tests

				        if: ${{ matrix.command != 'make vendor' && matrix.command != 'make check' && matrix.component == 'runtime' }}

				        run: |

				          XDG_RUNTIME_DIR=$(mktemp -d /tmp/kata-tests-$USER.XXX | tee >(xargs chmod 0700))

				          echo "XDG_RUNTIME_DIR=${XDG_RUNTIME_DIR}" >> $GITHUB_ENV

				      - name: Running `${{ matrix.command }}` for ${{ matrix.component }}

				        run: |

				          cd ${{ matrix.component-path }}

				          ${{ matrix.command }}

				        env:

				          RUST_BACKTRACE: "1"
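
The "Adjust a permission for repo" step in build-checks.yaml empties the workspace and retries once after 10 seconds if the first `rm` fails. A minimal sketch of the same retry idea as a reusable function follows; the name `clean_dir_with_retry` and its optional pause argument are hypothetical, and `sudo` is left out so the sketch can run unprivileged.

```shell
# Hypothetical helper, not part of the workflow: empty a directory and
# retry once after a pause if the first attempt fails.
clean_dir_with_retry() {
    local dir="$1"
    local pause="${2:-10}"   # seconds to wait before the retry
    # ${dir:?} aborts if dir is empty, so this can never expand to "/*".
    if rm -rf "${dir:?}"/*; then
        echo "${dir} cleaned"
    else
        sleep "${pause}"
        rm -rf "${dir:?}"/*
    fi
}
```

The `A && B || { C; }` form used in the step also runs the fallback when only the `echo` fails; the `if`/`else` form retries only when `rm` itself fails. As in the workflow, the `*` glob does not match dotfiles.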

									
										142

.github/workflows/build-kata-static-tarball-amd64.yaml
									
				@@ -0,0 +1,142 @@

				name: CI | Build kata-static tarball for amd64

				on:

				  workflow_call:

				    inputs:

				      stage:

				        required: false

				        type: string

				        default: test

				      tarball-suffix:

				        required: false

				        type: string

				      push-to-registry:

				        required: false

				        type: string

				        default: no

				      commit-hash:

				        required: false

				        type: string

				      target-branch:

				        required: false

				        type: string

				        default: ""

				jobs:

				  build-asset:

				    runs-on: ubuntu-latest

				    strategy:

				      matrix:

				        asset:

				          - agent

				          - agent-ctl

				          - cloud-hypervisor

				          - cloud-hypervisor-glibc

				          - coco-guest-components

				          - firecracker

				          - genpolicy

				          - kata-ctl

				          - kata-manager

				          - kernel

				          - kernel-confidential

				          - kernel-dragonball-experimental

				          - kernel-nvidia-gpu

				          - kernel-nvidia-gpu-confidential

				          - nydus

				          - ovmf

				          - ovmf-sev

				          - pause-image

				          - qemu

				          - qemu-snp-experimental

				          - stratovirt

				          - rootfs-image

				          - rootfs-image-confidential

				          - rootfs-initrd

				          - rootfs-initrd-confidential

				          - rootfs-initrd-mariner

				          - runk

				          - shim-v2

				          - trace-forwarder

				          - virtiofsd

				        stage:

				          - ${{ inputs.stage }}

				        exclude:

				          - asset: agent

				            stage: release

				          - asset: cloud-hypervisor-glibc

				            stage: release

				          - asset: pause-image

				            stage: release

				          - asset: coco-guest-components

				            stage: release

				    steps:

				      - name: Login to Kata Containers quay.io

				        if: ${{ inputs.push-to-registry == 'yes' }}

				        uses: docker/login-action@v3

				        with:

				          registry: quay.io

				          username: ${{ secrets.QUAY_DEPLOYER_USERNAME }}

				          password: ${{ secrets.QUAY_DEPLOYER_PASSWORD }}

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0 # This is needed in order to keep the commit ids history

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: Build ${{ matrix.asset }}

				        run: |

				          make "${KATA_ASSET}-tarball"

				          build_dir=$(readlink -f build)

				          # store-artifact does not work with symlinks

				          sudo cp -r "${build_dir}" "kata-build"

				        env:

				          KATA_ASSET: ${{ matrix.asset }}

				          TAR_OUTPUT: ${{ matrix.asset }}.tar.gz

				          PUSH_TO_REGISTRY: ${{ inputs.push-to-registry }}

				          ARTEFACT_REGISTRY: ghcr.io

				          ARTEFACT_REGISTRY_USERNAME: ${{ github.actor }}

				          ARTEFACT_REGISTRY_PASSWORD: ${{ secrets.GITHUB_TOKEN }}

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: store-artifact ${{ matrix.asset }}

				        uses: actions/upload-artifact@v4

				        with:

				          name: kata-artifacts-amd64-${{ matrix.asset }}${{ inputs.tarball-suffix }}

				          path: kata-build/kata-static-${{ matrix.asset }}.tar.xz

				          retention-days: 15

				          if-no-files-found: error

				  create-kata-tarball:

				    runs-on: ubuntu-latest

				    needs: build-asset

				    steps:

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: get-artifacts

				        uses: actions/download-artifact@v4

				        with:

				          pattern: kata-artifacts-amd64-*${{ inputs.tarball-suffix }}

				          path: kata-artifacts

				          merge-multiple: true

				      - name: merge-artifacts

				        run: |

				          ./tools/packaging/kata-deploy/local-build/kata-deploy-merge-builds.sh kata-artifacts versions.yaml

				      - name: store-artifacts

				        uses: actions/upload-artifact@v4

				        with:

				          name: kata-static-tarball-amd64${{ inputs.tarball-suffix }}

				          path: kata-static.tar.xz

				          retention-days: 15

				          if-no-files-found: error
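
The "Build" step above resolves the `build` symlink with `readlink -f` and copies the real directory to `kata-build`, because the upload step does not work with symlinks. A minimal sketch of that step as a standalone function, assuming GNU coreutils `readlink`; the function name `copy_resolved_build_dir` is hypothetical and `sudo` is omitted.

```shell
# Hypothetical helper: copy the directory a symlink points to into a
# real (non-symlink) destination directory.
copy_resolved_build_dir() {
    local link="$1"
    local dest="$2"
    local real
    # readlink -f follows every symlink and prints the canonical path.
    real="$(readlink -f "${link}")"
    # dest must not exist yet; otherwise cp would nest the copy inside it.
    cp -r "${real}" "${dest}"
}
```

With `build` pointing at the real output directory, `copy_resolved_build_dir build kata-build` leaves a plain directory whose files can be referenced by path, as the `store-artifact` step does with `kata-build/kata-static-<asset>.tar.xz`.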

									
										123

.github/workflows/build-kata-static-tarball-arm64.yaml
									
				@@ -0,0 +1,123 @@

				name: CI | Build kata-static tarball for arm64

				on:

				  workflow_call:

				    inputs:

				      stage:

				        required: false

				        type: string

				        default: test

				      tarball-suffix:

				        required: false

				        type: string

				      push-to-registry:

				        required: false

				        type: string

				        default: no

				      commit-hash:

				        required: false

				        type: string

				      target-branch:

				        required: false

				        type: string

				        default: ""

				jobs:

				  build-asset:

				    runs-on: arm64-builder

				    strategy:

				      matrix:

				        asset:

				          - agent

				          - cloud-hypervisor

				          - firecracker

				          - kernel

				          - kernel-dragonball-experimental

				          - nydus

				          - qemu

				          - stratovirt

				          - rootfs-image

				          - rootfs-initrd

				          - shim-v2

				          - virtiofsd

				        stage:

				          - ${{ inputs.stage }}

				    steps:

				      - name: Adjust a permission for repo

				        run: |

				          sudo chown -R $USER:$USER $GITHUB_WORKSPACE

				      - name: Login to Kata Containers quay.io

				        if: ${{ inputs.push-to-registry == 'yes' }}

				        uses: docker/login-action@v3

				        with:

				          registry: quay.io

				          username: ${{ secrets.QUAY_DEPLOYER_USERNAME }}

				          password: ${{ secrets.QUAY_DEPLOYER_PASSWORD }}

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0 # This is needed in order to keep the commit ids history

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: Build ${{ matrix.asset }}

				        run: |

				          make "${KATA_ASSET}-tarball"

				          build_dir=$(readlink -f build)

				          # store-artifact does not work with symlinks

				          sudo cp -r "${build_dir}" "kata-build"

				        env:

				          KATA_ASSET: ${{ matrix.asset }}

				          TAR_OUTPUT: ${{ matrix.asset }}.tar.gz

				          PUSH_TO_REGISTRY: ${{ inputs.push-to-registry }}

				          ARTEFACT_REGISTRY: ghcr.io

				          ARTEFACT_REGISTRY_USERNAME: ${{ github.actor }}

				          ARTEFACT_REGISTRY_PASSWORD: ${{ secrets.GITHUB_TOKEN }}

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: store-artifact ${{ matrix.asset }}

				        uses: actions/upload-artifact@v4

				        with:

				          name: kata-artifacts-arm64-${{ matrix.asset }}${{ inputs.tarball-suffix }}

				          path: kata-build/kata-static-${{ matrix.asset }}.tar.xz

				          retention-days: 15

				          if-no-files-found: error

				  create-kata-tarball:

				    runs-on: arm64-builder

				    needs: build-asset

				    steps:

				      - name: Adjust a permission for repo

				        run: |

				          sudo chown -R $USER:$USER $GITHUB_WORKSPACE

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: get-artifacts

				        uses: actions/download-artifact@v4

				        with:

				          pattern: kata-artifacts-arm64-*${{ inputs.tarball-suffix }}

				          path: kata-artifacts

				          merge-multiple: true

				      - name: merge-artifacts

				        run: |

				          ./tools/packaging/kata-deploy/local-build/kata-deploy-merge-builds.sh kata-artifacts versions.yaml

				      - name: store-artifacts

				        uses: actions/upload-artifact@v4

				        with:

				          name: kata-static-tarball-arm64${{ inputs.tarball-suffix }}

				          path: kata-static.tar.xz

				          retention-days: 15

				          if-no-files-found: error

									
										124

.github/workflows/build-kata-static-tarball-ppc64le.yaml
									
				@@ -0,0 +1,124 @@

				name: CI | Build kata-static tarball for ppc64le

				on:

				  workflow_call:

				    inputs:

				      stage:

				        required: false

				        type: string

				        default: test

				      tarball-suffix:

				        required: false

				        type: string

				      push-to-registry:

				        required: false

				        type: string

				        default: no

				      commit-hash:

				        required: false

				        type: string

				      target-branch:

				        required: false

				        type: string

				        default: ""

				jobs:

				  build-asset:

				    runs-on: ppc64le

				    strategy:

				      matrix:

				        asset:

				          - agent

				          - kernel

				          - qemu

				          - rootfs-initrd

				          - shim-v2

				          - virtiofsd

				        stage:

				          - ${{ inputs.stage }}

				    steps:

				      - name: Adjust a permission for repo

				        run: |

				          sudo chown -R $USER:$USER $GITHUB_WORKSPACE

				      - name: Prepare the self-hosted runner

				        run: |

				            ${HOME}/scripts/prepare_runner.sh

				            sudo rm -rf $GITHUB_WORKSPACE/*

				      - name: Login to Kata Containers quay.io

				        if: ${{ inputs.push-to-registry == 'yes' }}

				        uses: docker/login-action@v3

				        with:

				          registry: quay.io

				          username: ${{ secrets.QUAY_DEPLOYER_USERNAME }}

				          password: ${{ secrets.QUAY_DEPLOYER_PASSWORD }}

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0 # This is needed in order to keep the commit ids history

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: Build ${{ matrix.asset }}

				        run: |

				          make "${KATA_ASSET}-tarball"

				          build_dir=$(readlink -f build)

				          # store-artifact does not work with symlinks

				          sudo cp -r "${build_dir}" "kata-build"

				          sudo chown -R $(id -u):$(id -g) "kata-build"

				        env:

				          KATA_ASSET: ${{ matrix.asset }}

				          TAR_OUTPUT: ${{ matrix.asset }}.tar.gz

				          PUSH_TO_REGISTRY: ${{ inputs.push-to-registry }}

				          ARTEFACT_REGISTRY: ghcr.io

				          ARTEFACT_REGISTRY_USERNAME: ${{ github.actor }}

				          ARTEFACT_REGISTRY_PASSWORD: ${{ secrets.GITHUB_TOKEN }}

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: store-artifact ${{ matrix.asset }}

				        uses: actions/upload-artifact@v4

				        with:

				          name: kata-artifacts-ppc64le-${{ matrix.asset }}${{ inputs.tarball-suffix }}

				          path: kata-build/kata-static-${{ matrix.asset }}.tar.xz

				          retention-days: 1

				          if-no-files-found: error

				  create-kata-tarball:

				    runs-on: ppc64le

				    needs: build-asset

				    steps:

				      - name: Adjust a permission for repo

				        run: |

				          sudo chown -R $USER:$USER $GITHUB_WORKSPACE

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: get-artifacts

				        uses: actions/download-artifact@v4

				        with:

				          pattern: kata-artifacts-ppc64le-*${{ inputs.tarball-suffix }}

				          path: kata-artifacts

				          merge-multiple: true

				      - name: merge-artifacts

				        run: |

				          ./tools/packaging/kata-deploy/local-build/kata-deploy-merge-builds.sh kata-artifacts versions.yaml

				      - name: store-artifacts

				        uses: actions/upload-artifact@v4

				        with:

				          name: kata-static-tarball-ppc64le${{ inputs.tarball-suffix }}

				          path: kata-static.tar.xz

				          retention-days: 1

				          if-no-files-found: error
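
The `get-artifacts` step above selects artifacts with `kata-artifacts-ppc64le-*${{ inputs.tarball-suffix }}`, while each matrix build uploads under `kata-artifacts-ppc64le-${{ matrix.asset }}${{ inputs.tarball-suffix }}`. As a rough sketch, assuming the pattern behaves like a shell glob (an assumption; the action's own matcher may differ in detail), the pairing can be checked locally. The suffix `-1234-abcdef`, the three asset names, and the helper `matches_pattern` are hypothetical and used only for illustration.

```shell
#!/bin/sh
# Hypothetical helper: succeeds when the name in $1 matches the glob in $2.
matches_pattern() {
  # $2 is deliberately left unquoted so that `case` treats it as a glob.
  case "$1" in
    $2) return 0 ;;
    *)  return 1 ;;
  esac
}

suffix="-1234-abcdef"   # made-up value for inputs.tarball-suffix
pattern="kata-artifacts-ppc64le-*${suffix}"

# Example asset names only; the real list comes from the job's matrix.
for asset in agent kernel qemu; do
  name="kata-artifacts-ppc64le-${asset}${suffix}"
  if matches_pattern "$name" "$pattern"; then
    echo "match:    $name"
  else
    echo "no match: $name"
  fi
done
```

Under this glob assumption, a name with no asset segment between the architecture and the suffix would not be selected by the pattern.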

									
										172

.github/workflows/build-kata-static-tarball-s390x.yaml
									
										vendored
									
										Normal file
									
												
				@@ -0,0 +1,172 @@

				name: CI | Build kata-static tarball for s390x

				on:

				  workflow_call:

				    inputs:

				      stage:

				        required: false

				        type: string

				        default: test

				      tarball-suffix:

				        required: false

				        type: string

				      push-to-registry:

				        required: false

				        type: string

				        default: no

				      commit-hash:

				        required: false

				        type: string

				      target-branch:

				        required: false

				        type: string

				        default: ""

				jobs:

				  build-asset:

				    runs-on: s390x

				    strategy:

				      matrix:

				        asset:

				          - agent

				          - coco-guest-components

				          - kernel

				          - kernel-confidential

				          - pause-image

				          - qemu

				          - rootfs-image

				          - rootfs-image-confidential

				          - rootfs-initrd

				          - rootfs-initrd-confidential

				          - shim-v2

				          - virtiofsd

				        stage:

				          - ${{ inputs.stage }}

				        exclude:

				          - asset: pause-image

				            stage: release

				          - asset: coco-guest-components

				            stage: release

				    steps:

				      - name: Take a pre-action for self-hosted runner

				        run: ${HOME}/script/pre_action.sh ubuntu-2204

				      - name: Login to Kata Containers quay.io

				        if: ${{ inputs.push-to-registry == 'yes' }}

				        uses: docker/login-action@v3

				        with:

				          registry: quay.io

				          username: ${{ secrets.QUAY_DEPLOYER_USERNAME }}

				          password: ${{ secrets.QUAY_DEPLOYER_PASSWORD }}

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0 # This is needed in order to keep the commit ids history

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: Build ${{ matrix.asset }}

				        run: |

				          make "${KATA_ASSET}-tarball"

				          build_dir=$(readlink -f build)

				          # store-artifact does not work with symlink

				          sudo cp -r "${build_dir}" "kata-build"

				          sudo chown -R $(id -u):$(id -g) "kata-build"

				        env:

				          KATA_ASSET: ${{ matrix.asset }}

				          TAR_OUTPUT: ${{ matrix.asset }}.tar.gz

				          PUSH_TO_REGISTRY: ${{ inputs.push-to-registry }}

				          ARTEFACT_REGISTRY: ghcr.io

				          ARTEFACT_REGISTRY_USERNAME: ${{ github.actor }}

				          ARTEFACT_REGISTRY_PASSWORD: ${{ secrets.GITHUB_TOKEN }}

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: store-artifact ${{ matrix.asset }}

				        uses: actions/upload-artifact@v4

				        with:

				          name: kata-artifacts-s390x-${{ matrix.asset }}${{ inputs.tarball-suffix }}

				          path: kata-build/kata-static-${{ matrix.asset }}.tar.xz

				          retention-days: 15

				          if-no-files-found: error

				  build-asset-boot-image-se:

				    runs-on: s390x

				    needs: build-asset

				    steps:

				      - name: Take a pre-action for self-hosted runner

				        run: ${HOME}/script/pre_action.sh ubuntu-2204

				      - uses: actions/checkout@v4

				      - name: get-artifacts

				        uses: actions/download-artifact@v4

				        with:

				          pattern: kata-artifacts-s390x-*${{ inputs.tarball-suffix }}

				          path: kata-artifacts

				          merge-multiple: true

				      - name: Place a host key document

				        run: |

				          mkdir -p "host-key-document"

				          cp "${CI_HKD_PATH}" "host-key-document"

				        env:

				          CI_HKD_PATH: ${{ secrets.CI_HKD_PATH }}

				      - name: Build boot-image-se

				        run: |

				          base_dir=tools/packaging/kata-deploy/local-build/

				          cp -r kata-artifacts ${base_dir}/build

				          # Skip building dependent artifacts of boot-image-se-tarball

				          # because we already have them from the previous build

				          sed -i 's/\(^boot-image-se-tarball:\).*/\1/g' ${base_dir}/Makefile

				          make boot-image-se-tarball

				          build_dir=$(readlink -f build)

				          sudo cp -r "${build_dir}" "kata-build"

				          sudo chown -R $(id -u):$(id -g) "kata-build"

				        env:

				          HKD_PATH: "host-key-document"

				      - name: store-artifact boot-image-se

				        uses: actions/upload-artifact@v4

				        with:

				          name: kata-artifacts-s390x${{ inputs.tarball-suffix }}

				          path: kata-build/kata-static-boot-image-se.tar.xz

				          retention-days: 1

				          if-no-files-found: error

				  create-kata-tarball:

				    runs-on: s390x

				    needs: [build-asset, build-asset-boot-image-se]

				    steps:

				      - name: Take a pre-action for self-hosted runner

				        run: ${HOME}/script/pre_action.sh ubuntu-2204

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: get-artifacts

				        uses: actions/download-artifact@v4

				        with:

				          pattern: kata-artifacts-s390x-*${{ inputs.tarball-suffix }}

				          path: kata-artifacts

				          merge-multiple: true

				      - name: merge-artifacts

				        run: |

				          ./tools/packaging/kata-deploy/local-build/kata-deploy-merge-builds.sh kata-artifacts versions.yaml

				      - name: store-artifacts

				        uses: actions/upload-artifact@v4

				        with:

				          name: kata-static-tarball-s390x${{ inputs.tarball-suffix }}

				          path: kata-static.tar.xz

				          retention-days: 15

				          if-no-files-found: error
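
The `Build boot-image-se` step above rewrites the Makefile so that the `boot-image-se-tarball` target keeps its name but loses its prerequisites, which lets `make` reuse the downloaded artifacts instead of rebuilding them. A minimal sketch of that `sed` expression on a throwaway Makefile follows. The prerequisite names and the temporary files are made up for illustration, and the sketch writes to a second file rather than using `-i`, purely for portability.

```shell
#!/bin/sh
# Throwaway Makefile with a made-up prerequisite list.
tmp_makefile=$(mktemp)
cat > "$tmp_makefile" <<'EOF'
boot-image-se-tarball: kernel-tarball rootfs-initrd-tarball
other-tarball: kernel-tarball
EOF

# Same expression as in the workflow: keep the target name and the colon,
# drop everything after it on that line.
sed 's/\(^boot-image-se-tarball:\).*/\1/g' "$tmp_makefile" > "${tmp_makefile}.new"

stripped_rule=$(grep '^boot-image-se-tarball:' "${tmp_makefile}.new")
other_rule=$(grep '^other-tarball:' "${tmp_makefile}.new")
echo "$stripped_rule"
echo "$other_rule"

rm -f "$tmp_makefile" "${tmp_makefile}.new"
```

Only the line that starts with the target name is affected; other rules keep their prerequisites.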

									
										8

.github/workflows/cargo-deny-runner.yaml
									
										vendored
									
												
				@@ -6,7 +6,11 @@ on:

				      - edited

				      - reopened

				      - synchronize

				    paths-ignore: [ '**.md', '**.png', '**.jpg', '**.jpeg', '**.svg', '/docs/**' ]

				concurrency:

				  group: ${{ github.workflow }}-${{ github.event.pull_request.number || github.ref }}

				  cancel-in-progress: true

				jobs:

				  cargo-deny-runner:

				    runs-on: ubuntu-latest

				@@ -14,7 +18,7 @@ jobs:

				    steps:

				      - name: Checkout Code

				        if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') }}

				        uses: actions/checkout@v3

				        uses: actions/checkout@v4

				      - name: Generate Action

				        if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') }}

				        run: bash cargo-deny-generator.sh

									
										21

.github/workflows/ci-nightly-s390x.yaml
									
										vendored
									
										Normal file
									
												
				@@ -0,0 +1,21 @@

				on:

				  schedule:

				    - cron: '0 5 * * *'

				name: Nightly CI for s390x

				jobs:

				  check-internal-test-result:

				    runs-on: s390x

				    strategy:

				      fail-fast: false

				      matrix:

				        test_title:

				          - kata-vfio-ap-e2e-tests

				          - cc-se-e2e-tests

				    steps:

				    - name: Fetch a test result for ${{ matrix.test_title }}

				      run: |

				        file_name="${TEST_TITLE}-$(date +%Y-%m-%d).log"

				        /home/${USER}/script/handle_test_log.sh download $file_name

				      env:

				        TEST_TITLE: ${{ matrix.test_title }}
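
The step above builds the log file name from the matrix title and the current date, so each nightly run looks for that day's log. A small sketch of the naming alone is below. `log_name_for` is a hypothetical helper, and `handle_test_log.sh` (a script on the self-hosted runner whose behaviour is not shown here) is deliberately not invoked.

```shell
#!/bin/sh
# Hypothetical helper mirroring the workflow's file_name expression.
log_name_for() {
  # $1 = test title, e.g. one of the matrix.test_title values
  printf '%s-%s.log\n' "$1" "$(date +%Y-%m-%d)"
}

for title in kata-vfio-ap-e2e-tests cc-se-e2e-tests; do
  log_name_for "$title"
done
```

Note that `date` uses the runner's local time zone, so the date in the name is the runner's date at the moment the step runs.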

									
										19

.github/workflows/ci-nightly.yaml
									
										vendored
									
										Normal file
									
												
				@@ -0,0 +1,19 @@

				name: Kata Containers Nightly CI

				on:

				  schedule:

				    - cron: '0 0 * * *'

				  workflow_dispatch:

				concurrency:

				  group: ${{ github.workflow }}-${{ github.event.pull_request.number || github.ref }}

				  cancel-in-progress: true

				jobs:

				  kata-containers-ci-on-push:

				    uses: ./.github/workflows/ci.yaml

				    with:

				      commit-hash: ${{ github.sha }}

				      pr-number: "nightly"

				      tag: ${{ github.sha }}-nightly

				      target-branch: ${{ github.ref_name }}

				    secrets: inherit

									
										30

.github/workflows/ci-on-push.yaml
									
										vendored
									
										Normal file
									
												
				@@ -0,0 +1,30 @@

				name: Kata Containers CI

				on:

				  pull_request_target:

				    branches:

				      - 'main'

				      - 'stable-*'

				    types:

				      # Adding 'labeled' to the list of activity types that trigger this event

				      # (default: opened, synchronize, reopened) so that we can run this

				      # workflow when the 'ok-to-test' label is added.

				      # Reference: https://docs.github.com/en/actions/using-workflows/events-that-trigger-workflows#pull_request_target

				      - opened

				      - synchronize

				      - reopened

				      - labeled

				concurrency:

				  group: ${{ github.workflow }}-${{ github.event.pull_request.number || github.ref }}

				  cancel-in-progress: true

				jobs:

				  kata-containers-ci-on-push:

				    if: ${{ contains(github.event.pull_request.labels.*.name, 'ok-to-test') }}

				    uses: ./.github/workflows/ci.yaml

				    with:

				      commit-hash: ${{ github.event.pull_request.head.sha }}

				      pr-number: ${{ github.event.pull_request.number }}

				      tag: ${{ github.event.pull_request.number }}-${{ github.event.pull_request.head.sha }}

				      target-branch: ${{ github.event.pull_request.base.ref }}

				    secrets: inherit

									
										248

.github/workflows/ci.yaml
									
										vendored
									
										Normal file
									
												
				@@ -0,0 +1,248 @@

				name: Run the Kata Containers CI

				on:

				  workflow_call:

				    inputs:

				      commit-hash:

				        required: true

				        type: string

				      pr-number:

				        required: true

				        type: string

				      tag:

				        required: true

				        type: string

				      target-branch:

				        required: false

				        type: string

				        default: ""

				jobs:

				  build-kata-static-tarball-amd64:

				    uses: ./.github/workflows/build-kata-static-tarball-amd64.yaml

				    with:

				      tarball-suffix: -${{ inputs.tag }}

				      commit-hash: ${{ inputs.commit-hash }}

				      target-branch: ${{ inputs.target-branch }}

				  publish-kata-deploy-payload-amd64:

				    needs: build-kata-static-tarball-amd64

				    uses: ./.github/workflows/publish-kata-deploy-payload-amd64.yaml

				    with:

				      tarball-suffix: -${{ inputs.tag }}

				      registry: ghcr.io

				      repo: ${{ github.repository_owner }}/kata-deploy-ci

				      tag: ${{ inputs.tag }}-amd64

				      commit-hash: ${{ inputs.commit-hash }}

				      target-branch: ${{ inputs.target-branch }}

				    secrets: inherit

				  build-kata-static-tarball-s390x:

				    uses: ./.github/workflows/build-kata-static-tarball-s390x.yaml

				    with:

				      tarball-suffix: -${{ inputs.tag }}

				      commit-hash: ${{ inputs.commit-hash }}

				      target-branch: ${{ inputs.target-branch }}

				    secrets: inherit

				  build-kata-static-tarball-ppc64le:

				    uses: ./.github/workflows/build-kata-static-tarball-ppc64le.yaml

				    with:

				      tarball-suffix: -${{ inputs.tag }}

				      commit-hash: ${{ inputs.commit-hash }}

				      target-branch: ${{ inputs.target-branch }}

				  publish-kata-deploy-payload-s390x:

				    needs: build-kata-static-tarball-s390x

				    uses: ./.github/workflows/publish-kata-deploy-payload-s390x.yaml

				    with:

				      tarball-suffix: -${{ inputs.tag }}

				      registry: ghcr.io

				      repo: ${{ github.repository_owner }}/kata-deploy-ci

				      tag: ${{ inputs.tag }}-s390x

				      commit-hash: ${{ inputs.commit-hash }}

				      target-branch: ${{ inputs.target-branch }}

				    secrets: inherit

				  publish-kata-deploy-payload-ppc64le:

				    needs: build-kata-static-tarball-ppc64le

				    uses: ./.github/workflows/publish-kata-deploy-payload-ppc64le.yaml

				    with:

				      tarball-suffix: -${{ inputs.tag }}

				      registry: ghcr.io

				      repo: ${{ github.repository_owner }}/kata-deploy-ci

				      tag: ${{ inputs.tag }}-ppc64le

				      commit-hash: ${{ inputs.commit-hash }}

				      target-branch: ${{ inputs.target-branch }}

				    secrets: inherit

				  build-and-publish-tee-confidential-unencrypted-image:

				    runs-on: ubuntu-latest

				    steps:

				      - name: Checkout code

				        uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: Set up QEMU

				        uses: docker/setup-qemu-action@v3

				      - name: Set up Docker Buildx

				        uses: docker/setup-buildx-action@v3

				      - name: Login to Kata Containers ghcr.io

				        uses: docker/login-action@v3

				        with:

				          registry: ghcr.io

				          username: ${{ github.actor }}

				          password: ${{ secrets.GITHUB_TOKEN }}

				      - name: Docker build and push

				        uses: docker/build-push-action@v5

				        with:

				          tags: ghcr.io/kata-containers/test-images:unencrypted-${{ inputs.pr-number }}

				          push: true

				          context: tests/integration/kubernetes/runtimeclass_workloads/confidential/unencrypted/

				          platforms: linux/amd64, linux/s390x

				          file: tests/integration/kubernetes/runtimeclass_workloads/confidential/unencrypted/Dockerfile

				  run-kata-deploy-tests-on-aks:

				    needs: publish-kata-deploy-payload-amd64

				    uses: ./.github/workflows/run-kata-deploy-tests-on-aks.yaml

				    with:

				      registry: ghcr.io

				      repo: ${{ github.repository_owner }}/kata-deploy-ci

				      tag: ${{ inputs.tag }}-amd64

				      commit-hash: ${{ inputs.commit-hash }}

				      pr-number: ${{ inputs.pr-number }}

				      target-branch: ${{ inputs.target-branch }}

				    secrets: inherit

				  run-kata-deploy-tests-on-garm:

				    needs: publish-kata-deploy-payload-amd64

				    uses: ./.github/workflows/run-kata-deploy-tests-on-garm.yaml

				    with:

				      registry: ghcr.io

				      repo: ${{ github.repository_owner }}/kata-deploy-ci

				      tag: ${{ inputs.tag }}-amd64

				      commit-hash: ${{ inputs.commit-hash }}

				      pr-number: ${{ inputs.pr-number }}

				      target-branch: ${{ inputs.target-branch }}

				    secrets: inherit

				  run-kata-monitor-tests:

				    needs: build-kata-static-tarball-amd64

				    uses: ./.github/workflows/run-kata-monitor-tests.yaml

				    with:

				      tarball-suffix: -${{ inputs.tag }}

				      commit-hash: ${{ inputs.commit-hash }}

				      target-branch: ${{ inputs.target-branch }}

				  run-k8s-tests-on-aks:

				    needs: publish-kata-deploy-payload-amd64

				    uses: ./.github/workflows/run-k8s-tests-on-aks.yaml

				    with:

				      tarball-suffix: -${{ inputs.tag }}

				      registry: ghcr.io

				      repo: ${{ github.repository_owner }}/kata-deploy-ci

				      tag: ${{ inputs.tag }}-amd64

				      commit-hash: ${{ inputs.commit-hash }}

				      pr-number: ${{ inputs.pr-number }}

				      target-branch: ${{ inputs.target-branch }}

				    secrets: inherit

				  run-k8s-tests-on-garm:

				    needs: publish-kata-deploy-payload-amd64

				    uses: ./.github/workflows/run-k8s-tests-on-garm.yaml

				    with:

				      registry: ghcr.io

				      repo: ${{ github.repository_owner }}/kata-deploy-ci

				      tag: ${{ inputs.tag }}-amd64

				      commit-hash: ${{ inputs.commit-hash }}

				      pr-number: ${{ inputs.pr-number }}

				      target-branch: ${{ inputs.target-branch }}

				    secrets: inherit

				  run-k8s-tests-with-crio-on-garm:

				    needs: publish-kata-deploy-payload-amd64

				    uses: ./.github/workflows/run-k8s-tests-with-crio-on-garm.yaml

				    with:

				      registry: ghcr.io

				      repo: ${{ github.repository_owner }}/kata-deploy-ci

				      tag: ${{ inputs.tag }}-amd64

				      commit-hash: ${{ inputs.commit-hash }}

				      pr-number: ${{ inputs.pr-number }}

				      target-branch: ${{ inputs.target-branch }}

				    secrets: inherit

				  run-kata-coco-tests:

				    needs: [publish-kata-deploy-payload-amd64, build-and-publish-tee-confidential-unencrypted-image]

				    uses: ./.github/workflows/run-kata-coco-tests.yaml

				    with:

				      registry: ghcr.io

				      repo: ${{ github.repository_owner }}/kata-deploy-ci

				      tag: ${{ inputs.tag }}-amd64

				      commit-hash: ${{ inputs.commit-hash }}

				      pr-number: ${{ inputs.pr-number }}

				      target-branch: ${{ inputs.target-branch }}

				    secrets: inherit

				  run-k8s-tests-on-zvsi:

				    needs: [publish-kata-deploy-payload-s390x, build-and-publish-tee-confidential-unencrypted-image]

				    uses: ./.github/workflows/run-k8s-tests-on-zvsi.yaml

				    with:

				      registry: ghcr.io

				      repo: ${{ github.repository_owner }}/kata-deploy-ci

				      tag: ${{ inputs.tag }}-s390x

				      commit-hash: ${{ inputs.commit-hash }}

				      pr-number: ${{ inputs.pr-number }}

				      target-branch: ${{ inputs.target-branch }}

				  run-k8s-tests-on-ppc64le:

				    needs: publish-kata-deploy-payload-ppc64le

				    uses: ./.github/workflows/run-k8s-tests-on-ppc64le.yaml

				    with:

				      registry: ghcr.io

				      repo: ${{ github.repository_owner }}/kata-deploy-ci

				      tag: ${{ inputs.tag }}-ppc64le

				      commit-hash: ${{ inputs.commit-hash }}

				      pr-number: ${{ inputs.pr-number }}

				      target-branch: ${{ inputs.target-branch }}

				  run-metrics-tests:

				    needs: build-kata-static-tarball-amd64

				    uses: ./.github/workflows/run-metrics.yaml

				    with:

				      tarball-suffix: -${{ inputs.tag }}

				      commit-hash: ${{ inputs.commit-hash }}

				      target-branch: ${{ inputs.target-branch }}

				  run-basic-amd64-tests:

				    needs: build-kata-static-tarball-amd64

				    uses: ./.github/workflows/basic-ci-amd64.yaml

				    with:

				      tarball-suffix: -${{ inputs.tag }}

				      commit-hash: ${{ inputs.commit-hash }}

				      target-branch: ${{ inputs.target-branch }}

				  run-cri-containerd-tests-s390x:

				    needs: build-kata-static-tarball-s390x

				    uses: ./.github/workflows/run-cri-containerd-tests-s390x.yaml

				    with:

				      tarball-suffix: -${{ inputs.tag }}

				      commit-hash: ${{ inputs.commit-hash }}

				      target-branch: ${{ inputs.target-branch }}

				  run-cri-containerd-tests-ppc64le:

				    needs: build-kata-static-tarball-ppc64le

				    uses: ./.github/workflows/run-cri-containerd-tests-ppc64le.yaml

				    with:

				      tarball-suffix: -${{ inputs.tag }}

				      commit-hash: ${{ inputs.commit-hash }}

				      target-branch: ${{ inputs.target-branch }}

									
										28

.github/workflows/commit-message-check.yaml
									
										vendored
									
												
				@@ -6,6 +6,10 @@ on:

				      - reopened

				      - synchronize

				concurrency:

				  group: ${{ github.workflow }}-${{ github.event.pull_request.number || github.ref }}

				  cancel-in-progress: true

				env:

				  error_msg: |+

				    See the document below for help on formatting commits for the project.

				@@ -15,6 +19,8 @@ env:

				jobs:

				  commit-message-check:

				    runs-on: ubuntu-latest

				    env:

				      PR_AUTHOR: ${{ github.event.pull_request.user.login }}

				    name: Commit Message Check

				    steps:

				    - name: Get PR Commits

				@@ -43,7 +49,7 @@ jobs:

				        commits: ${{ steps.get-pr-commits.outputs.commits }}

				    - name: Check Subject Line Length

				      if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') && ( success() || failure() ) }}

				      if: ${{ (env.PR_AUTHOR != 'dependabot[bot]') && !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') && ( success() || failure() ) }}

				      uses: tim-actions/commit-message-checker-with-regex@v0.3.1

				      with:

				        commits: ${{ steps.get-pr-commits.outputs.commits }}

				@@ -52,7 +58,7 @@ jobs:

				        post_error: ${{ env.error_msg }}

				    - name: Check Body Line Length

				      if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') && ( success() || failure() ) }}

				      if: ${{ (env.PR_AUTHOR != 'dependabot[bot]') && !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') && ( success() || failure() ) }}

				      uses: tim-actions/commit-message-checker-with-regex@v0.3.1

				      with:

				        commits: ${{ steps.get-pr-commits.outputs.commits }}

				@@ -62,6 +68,9 @@ jobs:

				        #   to be specified at the start of the regex as the action is passed

				        #   the entire commit message.

				        #

				        # - This check will pass if the commit message only contains a subject

				        #   line, as other body message properties are enforced elsewhere.

				        #

				        # - Body lines *can* be longer than the maximum if they start

				        #   with a non-alphabetic character or if there is no whitespace in

				        #   the line.

				@@ -75,23 +84,12 @@ jobs:

				        #

				        # - A SoB comment can be any length (as it is unreasonable to penalise

				        #   people with long names/email addresses :)

				        pattern: '^.+(\n([a-zA-Z].{0,150}|[^a-zA-Z\n].*|[^\s\n]*|Signed-off-by:.*|))+$'

				        pattern: '(^[^\n]+$|^.+(\n([a-zA-Z].{0,150}|[^a-zA-Z\n].*|[^\s\n]*|Signed-off-by:.*|))+$)'

				        error: 'Body line too long (max 150)'

				        post_error: ${{ env.error_msg }}

				    - name: Check Fixes

				      if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') && ( success() || failure() ) }}

				      uses: tim-actions/commit-message-checker-with-regex@v0.3.1

				      with:

				        commits: ${{ steps.get-pr-commits.outputs.commits }}

				        pattern: '\s*Fixes\s*:?\s*(#\d+|github\.com\/kata-containers\/[a-z-.]*#\d+)|^\s*release\s*:'

				        flags: 'i'

				        error: 'No "Fixes" found'

				        post_error: ${{ env.error_msg }}

				        one_pass_all_pass: 'true'

				    - name: Check Subsystem

				      if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') && ( success() || failure() ) }}

				      if: ${{ (env.PR_AUTHOR != 'dependabot[bot]') && !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') && ( success() || failure() ) }}

				      uses: tim-actions/commit-message-checker-with-regex@v0.3.1

				      with:

				        commits: ${{ steps.get-pr-commits.outputs.commits }}
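
The comments in the `Check Body Line Length` hunk above describe when a body line is accepted. As a rough per-line restatement of those alternatives (not the action's implementation, which applies the regular expression to the whole commit message), the rule can be sketched in POSIX shell. `body_line_ok` is a hypothetical name. The sketch follows the regular expression literally, so a line that starts with a letter may be up to 151 characters long (one letter plus `.{0,150}`).

```shell
#!/bin/sh
LC_ALL=C; export LC_ALL   # keep the a-z / A-Z ranges ASCII-only
tab=$(printf '\t')

# Hypothetical helper: succeeds if a single body line would be accepted.
body_line_ok() {
  line=$1
  case "$line" in
    "")              return 0 ;;  # empty line
    Signed-off-by:*) return 0 ;;  # SoB lines may be any length
    [!a-zA-Z]*)      return 0 ;;  # starts with a non-alphabetic character
  esac
  case "$line" in
    *" "*|*"$tab"*) ;;            # contains whitespace: length limit applies
    *) return 0 ;;                # no whitespace at all (e.g. a long URL)
  esac
  [ "${#line}" -le 151 ]
}

long_tail=$(printf '%0200d' 0)    # 200 characters of padding
for candidate in "A short body line." "a $long_tail" "a$long_tail"; do
  if body_line_ok "$candidate"; then
    echo "ok:       ${#candidate} chars"
  else
    echo "too long: ${#candidate} chars"
  fi
done
```

A commit message that consists of a subject line only is handled by the separate first alternative of the new pattern and is outside this per-line sketch.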

									
										10

.github/workflows/darwin-tests.yaml
									
										vendored
									
												
				@@ -5,7 +5,11 @@ on:

				      - edited

				      - reopened

				      - synchronize

				    paths-ignore: [ '**.md', '**.png', '**.jpg', '**.jpeg', '**.svg', '/docs/**' ]

				concurrency:

				  group: ${{ github.workflow }}-${{ github.event.pull_request.number || github.ref }}

				  cancel-in-progress: true

				name: Darwin tests

				jobs:

				  test:

				@@ -14,8 +18,8 @@ jobs:

				    - name: Install Go

				      uses: actions/setup-go@v2

				      with:

				        go-version: 1.19.3

				        go-version: 1.22.2

				    - name: Checkout code

				      uses: actions/checkout@v2

				      uses: actions/checkout@v4

				    - name: Build utils

				      run: ./ci/darwin-test.sh

									
										4

.github/workflows/docs-url-alive-check.yaml
									
										vendored
									
												
				@@ -14,7 +14,7 @@ jobs:

				    - name: Install Go

				      uses: actions/setup-go@v2

				      with:

				        go-version: 1.19.3

				        go-version: 1.22.2

				      env:

				        GOPATH: ${{ runner.workspace }}/kata-containers

				    - name: Set env

				@@ -22,7 +22,7 @@ jobs:

				        echo "GOPATH=${{ github.workspace }}" >> $GITHUB_ENV

				        echo "${{ github.workspace }}/bin" >> $GITHUB_PATH

				    - name: Checkout code

				      uses: actions/checkout@v2

				      uses: actions/checkout@v4

				      with:

				        fetch-depth: 0

				        path: ./src/github.com/${{ github.repository }}

									
										86

.github/workflows/kata-deploy-push.yaml
									
										vendored
									
											
				@@ -1,86 +0,0 @@

				name: kata deploy build

				on: 

				  pull_request:

				    types:

				      - opened

				      - edited

				      - reopened

				      - synchronize

				    paths:

				      - tools/**

				      - versions.yaml

				jobs:

				  build-asset:

				    runs-on: ubuntu-latest

				    strategy:

				      matrix:

				        asset:

				          - kernel

				          - kernel-dragonball-experimental

				          - shim-v2

				          - qemu

				          - cloud-hypervisor

				          - firecracker

				          - rootfs-image

				          - rootfs-initrd

				          - virtiofsd

				          - nydus

				    steps:

				      - uses: actions/checkout@v2

				      - name: Install docker

				        if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') }}

				        run: |

				          curl -fsSL https://test.docker.com -o test-docker.sh

				          sh test-docker.sh

				      - name: Build ${{ matrix.asset }}

				        if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') }}

				        run: |

				          make "${KATA_ASSET}-tarball"

				          build_dir=$(readlink -f build)

				          # store-artifact does not work with symlink

				          sudo cp -r --preserve=all "${build_dir}" "kata-build"

				        env:

				          KATA_ASSET: ${{ matrix.asset }}

				      - name: store-artifact ${{ matrix.asset }}

				        if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') }}

				        uses: actions/upload-artifact@v2

				        with:

				          name: kata-artifacts

				          path: kata-build/kata-static-${{ matrix.asset }}.tar.xz

				          if-no-files-found: error

				  create-kata-tarball:

				    runs-on: ubuntu-latest

				    needs: build-asset

				    steps:

				      - uses: actions/checkout@v2

				      - name: get-artifacts

				        if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') }}

				        uses: actions/download-artifact@v2

				        with:

				          name: kata-artifacts

				          path: build

				      - name: merge-artifacts

				        if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') }}

				        run: |

				          make merge-builds

				      - name: store-artifacts

				        if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') }}

				        uses: actions/upload-artifact@v2

				        with:

				          name: kata-static-tarball

				          path: kata-static.tar.xz

				  make-kata-tarball:

				    runs-on: ubuntu-latest

				    steps:

				      - uses: actions/checkout@v2

				      - name: make kata-tarball

				        if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') }}

				        run: |

				          make kata-tarball

				          sudo make install-tarball

									
										169

.github/workflows/kata-deploy-test.yaml
									
										vendored
									
											
				@@ -1,169 +0,0 @@

				on:

				  workflow_dispatch: # this is used to trigger the workflow on non-main branches

				    inputs:

				      pr:

				        description: 'PR number from the selected branch to test'

				        type: string

				        required: true

				  issue_comment:

				    types: [created, edited]

				name: test-kata-deploy

				jobs:

				  check-comment-and-membership:

				    runs-on: ubuntu-latest

				    if: |

				      github.event.issue.pull_request

				      && github.event_name == 'issue_comment'

				      && github.event.action == 'created'

				      && startsWith(github.event.comment.body, '/test_kata_deploy')

				      || github.event_name == 'workflow_dispatch'

				    steps:

				      - name: Check membership on comment or dispatch

				        uses: kata-containers/is-organization-member@1.0.1

				        id: is_organization_member

				        with:

				          organization: kata-containers

				          username: ${{ github.event.comment.user.login || github.event.sender.login }}

				          token: ${{ secrets.GITHUB_TOKEN }}

				      - name: Fail if not member

				        run: |

				          result=${{ steps.is_organization_member.outputs.result }}

				          if [ $result == false ]; then

				              user=${{ github.event.comment.user.login || github.event.sender.login }}

				              echo Either ${user} is not part of the kata-containers organization

				              echo or ${user} has its Organization Visibility set to Private at

				              echo https://github.com/orgs/kata-containers/people?query=${user}

				              echo 

				              echo Ensure you change your Organization Visibility to Public and

				              echo trigger the test again.

				              exit 1

				          fi

				  build-asset:

				    runs-on: ubuntu-latest

				    needs: check-comment-and-membership

				    strategy:

				      matrix:

				        asset:

				          - cloud-hypervisor

				          - firecracker

				          - kernel

				          - kernel-dragonball-experimental

				          - nydus

				          - qemu

				          - rootfs-image

				          - rootfs-initrd

				          - shim-v2

				          - virtiofsd

				    steps:

				      - name: get-PR-ref

				        id: get-PR-ref

				        run: |

				            if [ ${{ github.event_name }} == 'issue_comment' ]; then

				                ref=$(cat $GITHUB_EVENT_PATH | jq -r '.issue.pull_request.url' | sed  's#^.*\/pulls#refs\/pull#' | sed 's#$#\/merge#')

				            else # workflow_dispatch

				                ref="refs/pull/${{ github.event.inputs.pr }}/merge"

				            fi

				            echo "reference for PR: " ${ref} "event:" ${{ github.event_name }}

				            echo "##[set-output name=pr-ref;]${ref}"

				      - uses: actions/checkout@v2

				        with:

				          ref: ${{ steps.get-PR-ref.outputs.pr-ref }}

				      - name: Install docker

				        run: |

				          curl -fsSL https://test.docker.com -o test-docker.sh

				          sh test-docker.sh

				      - name: Build ${{ matrix.asset }}

				        run: |

				          make "${KATA_ASSET}-tarball"

				          build_dir=$(readlink -f build)

				          # store-artifact does not work with symlink

				          sudo cp -r "${build_dir}" "kata-build"

				        env:

				          KATA_ASSET: ${{ matrix.asset }}

				          TAR_OUTPUT: ${{ matrix.asset }}.tar.gz

				      - name: store-artifact ${{ matrix.asset }}

				        uses: actions/upload-artifact@v2

				        with:

				          name: kata-artifacts

				          path: kata-build/kata-static-${{ matrix.asset }}.tar.xz

				          if-no-files-found: error

				  create-kata-tarball:

				    runs-on: ubuntu-latest

				    needs: build-asset

				    steps:

				      - name: get-PR-ref

				        id: get-PR-ref

				        run: |

				            if [ ${{ github.event_name }} == 'issue_comment' ]; then

				                ref=$(cat $GITHUB_EVENT_PATH | jq -r '.issue.pull_request.url' | sed  's#^.*\/pulls#refs\/pull#' | sed 's#$#\/merge#')

				            else # workflow_dispatch

				                ref="refs/pull/${{ github.event.inputs.pr }}/merge"

				            fi

				            echo "reference for PR: " ${ref} "event:" ${{ github.event_name }}

				            echo "##[set-output name=pr-ref;]${ref}"

				      - uses: actions/checkout@v2

				        with:

				          ref: ${{ steps.get-PR-ref.outputs.pr-ref }}

				      - name: get-artifacts

				        uses: actions/download-artifact@v2

				        with:

				          name: kata-artifacts

				          path: kata-artifacts

				      - name: merge-artifacts

				        run: |

				          ./tools/packaging/kata-deploy/local-build/kata-deploy-merge-builds.sh kata-artifacts

				      - name: store-artifacts

				        uses: actions/upload-artifact@v2

				        with:

				          name: kata-static-tarball

				          path: kata-static.tar.xz

				  kata-deploy:

				    needs: create-kata-tarball

				    runs-on: ubuntu-latest

				    steps:

				      - name: get-PR-ref

				        id: get-PR-ref

				        run: |

				            if [ ${{ github.event_name }} == 'issue_comment' ]; then

				                ref=$(cat $GITHUB_EVENT_PATH | jq -r '.issue.pull_request.url' | sed  's#^.*\/pulls#refs\/pull#' | sed 's#$#\/merge#')

				            else # workflow_dispatch

				                ref="refs/pull/${{ github.event.inputs.pr }}/merge"

				            fi

				            echo "reference for PR: " ${ref} "event:" ${{ github.event_name }}

				            echo "##[set-output name=pr-ref;]${ref}"

				      - uses: actions/checkout@v2

				        with:

				          ref: ${{ steps.get-PR-ref.outputs.pr-ref }}

				      - name: get-kata-tarball

				        uses: actions/download-artifact@v2

				        with:

				          name: kata-static-tarball

				      - name: build-and-push-kata-deploy-ci

				        id: build-and-push-kata-deploy-ci

				        run: |

				          PR_SHA=$(git log --format=format:%H -n1)

				          mv kata-static.tar.xz $GITHUB_WORKSPACE/tools/packaging/kata-deploy/kata-static.tar.xz

				          docker build --build-arg KATA_ARTIFACTS=kata-static.tar.xz -t quay.io/kata-containers/kata-deploy-ci:$PR_SHA $GITHUB_WORKSPACE/tools/packaging/kata-deploy

				          docker login -u ${{ secrets.QUAY_DEPLOYER_USERNAME }} -p ${{ secrets.QUAY_DEPLOYER_PASSWORD }} quay.io

				          docker push quay.io/kata-containers/kata-deploy-ci:$PR_SHA

				          mkdir -p packaging/kata-deploy

				          ln -s $GITHUB_WORKSPACE/tools/packaging/kata-deploy/action packaging/kata-deploy/action

				          echo "::set-output name=PKG_SHA::${PR_SHA}"

				      - name: test-kata-deploy-ci-in-aks

				        uses: ./packaging/kata-deploy/action

				        with:

				          packaging-sha: ${{steps.build-and-push-kata-deploy-ci.outputs.PKG_SHA}}

				        env:

				          PKG_SHA: ${{steps.build-and-push-kata-deploy-ci.outputs.PKG_SHA}}

				          AZ_APPID: ${{ secrets.AZ_APPID }}

				          AZ_PASSWORD: ${{ secrets.AZ_PASSWORD }}

				          AZ_SUBSCRIPTION_ID: ${{ secrets.AZ_SUBSCRIPTION_ID }}

				          AZ_TENANT_ID: ${{ secrets.AZ_TENANT_ID }}
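
Reviewer note: each `get-PR-ref` step above turns the pull request API URL from the event payload into a `refs/pull/<number>/merge` ref. A minimal sketch of that transformation follows. The function name `pr_url_to_merge_ref` and the sample URL are illustrative assumptions, not part of the workflow, and the sed expressions are written without the redundant backslashes but should be equivalent.

```shell
#!/bin/sh
# Hypothetical helper (not in the workflow): map a pull request API URL,
# as found in .issue.pull_request.url of an issue_comment event, to the
# merge ref that actions/checkout is asked to fetch.
pr_url_to_merge_ref() {
    # 1st sed: replace everything up to and including "/pulls" with "refs/pull"
    # 2nd sed: append "/merge" at the end of the line
    printf '%s\n' "$1" | sed 's#^.*/pulls#refs/pull#' | sed 's#$#/merge#'
}

# Sample URL, assumed for illustration only.
pr_url_to_merge_ref "https://api.github.com/repos/kata-containers/kata-containers/pulls/9632"
# prints refs/pull/9632/merge
```

Wrapping the pipeline in one function would also remove the three verbatim copies of this step, though that is a design suggestion rather than something this change does.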

									
										36

.github/workflows/kata-runtime-classes-sync.yaml
									
										Normal file
												
				@@ -0,0 +1,36 @@

				on:

				  pull_request:

				    types:

				      - opened

				      - edited

				      - reopened

				      - synchronize

				concurrency:

				  group: ${{ github.workflow }}-${{ github.event.pull_request.number || github.ref }}

				  cancel-in-progress: true

				jobs:

				  kata-deploy-runtime-classes-check:

				    runs-on: ubuntu-latest

				    steps:

				    - name: Checkout code

				      uses: actions/checkout@v4

				    - name: Ensure the split out runtime classes match the all-in-one file

				      run: |

				        pushd tools/packaging/kata-deploy/runtimeclasses/

				        echo "::group::Combine runtime classes"

				        for runtimeClass in `find . -type f \( -name "*.yaml" -and -not -name "kata-runtimeClasses.yaml" \) | sort`; do

				            echo "Adding ${runtimeClass} to the resultingRuntimeClasses.yaml"

				            cat ${runtimeClass} >> resultingRuntimeClasses.yaml;

				        done

				        echo "::endgroup::"

				        echo "::group::Displaying the content of resultingRuntimeClasses.yaml"

				        cat resultingRuntimeClasses.yaml

				        echo "::endgroup::"

				        echo ""

				        echo "::group::Displaying the content of kata-runtimeClasses.yaml"

				        cat kata-runtimeClasses.yaml

				        echo "::endgroup::"

				        echo ""

				        diff resultingRuntimeClasses.yaml kata-runtimeClasses.yaml
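
Reviewer note: the check above concatenates every split-out runtime class file in sorted order and diffs the result against the all-in-one file. The sketch below reproduces the same idea for local use, under the assumption that writing the combined file to a temporary location is acceptable; the workflow itself appends to `resultingRuntimeClasses.yaml` inside the directory. The function name `check_runtime_classes` is hypothetical.

```shell
#!/bin/sh
# Hypothetical local variant of the CI check: returns 0 when the sorted
# concatenation of the split-out *.yaml files equals kata-runtimeClasses.yaml.
check_runtime_classes() {
    dir="$1"
    combined="$(mktemp)"

    # Concatenate all YAML files except the all-in-one file, in sorted order.
    find "${dir}" -type f -name '*.yaml' ! -name 'kata-runtimeClasses.yaml' | sort |
    while IFS= read -r f; do
        cat "${f}"
    done > "${combined}"

    # diff exits non-zero (and prints the differences) on any mismatch.
    if diff "${combined}" "${dir}/kata-runtimeClasses.yaml"; then
        rc=0
    else
        rc=1
    fi
    rm -f "${combined}"
    return "${rc}"
}
```

It could be invoked as `check_runtime_classes tools/packaging/kata-deploy/runtimeclasses` from the repository root. Keeping the combined file outside the directory means a second run does not pick up leftovers from the first.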

									
										19

.github/workflows/move-issues-to-in-progress.yaml
									
												
				@@ -38,7 +38,17 @@ jobs:

				      - name: Checkout code to allow hub to communicate with the project

				        if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') }}

				        uses: actions/checkout@v2

				        uses: actions/checkout@v4

				        with:

				          ref: ${{ github.event.pull_request.head.sha }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') }}

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ github.event.pull_request.base.ref }}

				      - name: Move issue to "In progress"

				        if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') }}

				@@ -52,11 +62,10 @@ jobs:

				            grep -v "^\#"  |\

				            cut -d';' -f3 || true)

				          # PR doesn't have any linked issues

				          # (it should, but maybe a new user forgot to add a "Fixes: #XXX" commit).

				          # PR doesn't have any linked issues, handle it only if it exists

				          [ -z "$linked_issue_urls" ] && {

				            echo "::error::No linked issues for PR $pr"

				            exit 1

				            echo "::warning::No linked issues for PR $pr"

				            exit 0

				          }

				          project_name="Issue backlog"

									
										107

.github/workflows/payload-after-push.yaml
									
										Normal file
												
				@@ -0,0 +1,107 @@

				name: CI | Publish Kata Containers payload

				on:

				  push:

				    branches:

				      - main

				  workflow_dispatch:

				concurrency:

				  group: ${{ github.workflow }}-${{ github.event.pull_request.number || github.ref }}

				jobs:

				  build-assets-amd64:

				    uses: ./.github/workflows/build-kata-static-tarball-amd64.yaml

				    with:

				      commit-hash: ${{ github.sha }}

				      push-to-registry: yes

				      target-branch: ${{ github.ref_name }}

				    secrets: inherit

				  build-assets-arm64:

				    uses: ./.github/workflows/build-kata-static-tarball-arm64.yaml

				    with:

				      commit-hash: ${{ github.sha }}

				      push-to-registry: yes

				      target-branch: ${{ github.ref_name }}

				    secrets: inherit

				  build-assets-s390x:

				    uses: ./.github/workflows/build-kata-static-tarball-s390x.yaml

				    with:

				      commit-hash: ${{ github.sha }}

				      push-to-registry: yes

				      target-branch: ${{ github.ref_name }}

				    secrets: inherit

				  build-assets-ppc64le:

				    uses: ./.github/workflows/build-kata-static-tarball-ppc64le.yaml

				    with:

				      commit-hash: ${{ github.sha }}

				      push-to-registry: yes

				      target-branch: ${{ github.ref_name }}

				    secrets: inherit

				  publish-kata-deploy-payload-amd64:

				    needs: build-assets-amd64

				    uses: ./.github/workflows/publish-kata-deploy-payload-amd64.yaml

				    with:

				      commit-hash: ${{ github.sha }}

				      registry: quay.io

				      repo: kata-containers/kata-deploy-ci

				      tag: kata-containers-latest-amd64

				      target-branch: ${{ github.ref_name }}

				    secrets: inherit

				  publish-kata-deploy-payload-arm64:

				    needs: build-assets-arm64

				    uses: ./.github/workflows/publish-kata-deploy-payload-arm64.yaml

				    with:

				      commit-hash: ${{ github.sha }}

				      registry: quay.io

				      repo: kata-containers/kata-deploy-ci

				      tag: kata-containers-latest-arm64

				      target-branch: ${{ github.ref_name }}

				    secrets: inherit

				  publish-kata-deploy-payload-s390x:

				    needs: build-assets-s390x

				    uses: ./.github/workflows/publish-kata-deploy-payload-s390x.yaml

				    with:

				      commit-hash: ${{ github.sha }}

				      registry: quay.io

				      repo: kata-containers/kata-deploy-ci

				      tag: kata-containers-latest-s390x

				      target-branch: ${{ github.ref_name }}

				    secrets: inherit

				  publish-kata-deploy-payload-ppc64le:

				    needs: build-assets-ppc64le

				    uses: ./.github/workflows/publish-kata-deploy-payload-ppc64le.yaml

				    with:

				      commit-hash: ${{ github.sha }}

				      registry: quay.io

				      repo: kata-containers/kata-deploy-ci

				      tag: kata-containers-latest-ppc64le

				      target-branch: ${{ github.ref_name }}

				    secrets: inherit

				  publish-manifest:

				    runs-on: ubuntu-latest

				    needs: [publish-kata-deploy-payload-amd64, publish-kata-deploy-payload-arm64, publish-kata-deploy-payload-s390x, publish-kata-deploy-payload-ppc64le]

				    steps:

				      - name: Checkout repository

				        uses: actions/checkout@v4

				      - name: Login to Kata Containers quay.io

				        uses: docker/login-action@v3

				        with:

				          registry: quay.io

				          username: ${{ secrets.QUAY_DEPLOYER_USERNAME }}

				          password: ${{ secrets.QUAY_DEPLOYER_PASSWORD }}

				      - name: Push multi-arch manifest

				        run: |

				          ./tools/packaging/release/release.sh publish-multiarch-manifest

				        env:

				          KATA_DEPLOY_IMAGE_TAGS: "kata-containers-latest"

				          KATA_DEPLOY_REGISTRIES: "quay.io/kata-containers/kata-deploy-ci"

									
										66

.github/workflows/publish-kata-deploy-payload-amd64.yaml
									
										Normal file
												
				@@ -0,0 +1,66 @@

				name: CI | Publish kata-deploy payload for amd64

				on:

				  workflow_call:

				    inputs:

				      tarball-suffix:

				        required: false

				        type: string

				      registry:

				        required: true

				        type: string

				      repo:

				        required: true

				        type: string

				      tag:

				        required: true

				        type: string

				      commit-hash:

				        required: false

				        type: string

				      target-branch:

				        required: false

				        type: string

				        default: ""

				jobs:

				  kata-payload:

				    runs-on: ubuntu-latest

				    steps:

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: get-kata-tarball

				        uses: actions/download-artifact@v4

				        with:

				          name: kata-static-tarball-amd64${{ inputs.tarball-suffix }}

				      - name: Login to Kata Containers quay.io

				        if: ${{ inputs.registry == 'quay.io' }}

				        uses: docker/login-action@v3

				        with:

				          registry: quay.io

				          username: ${{ secrets.QUAY_DEPLOYER_USERNAME }}

				          password: ${{ secrets.QUAY_DEPLOYER_PASSWORD }}

				      - name: Login to Kata Containers ghcr.io

				        if: ${{ inputs.registry == 'ghcr.io' }}

				        uses: docker/login-action@v3

				        with:

				          registry: ghcr.io

				          username: ${{ github.actor }}

				          password: ${{ secrets.GITHUB_TOKEN }}

				      - name: build-and-push-kata-payload

				        id: build-and-push-kata-payload

				        run: |

				          ./tools/packaging/kata-deploy/local-build/kata-deploy-build-and-upload-payload.sh \

				          $(pwd)/kata-static.tar.xz \

				          ${{ inputs.registry }}/${{ inputs.repo }} ${{ inputs.tag }}

									
										71

.github/workflows/publish-kata-deploy-payload-arm64.yaml
									
										Normal file
												
				@@ -0,0 +1,71 @@

				name: CI | Publish kata-deploy payload for arm64

				on:

				  workflow_call:

				    inputs:

				      tarball-suffix:

				        required: false

				        type: string

				      registry:

				        required: true

				        type: string

				      repo:

				        required: true

				        type: string

				      tag:

				        required: true

				        type: string

				      commit-hash:

				        required: false

				        type: string

				      target-branch:

				        required: false

				        type: string

				        default: ""

				jobs:

				  kata-payload:

				    runs-on: arm64-builder

				    steps:

				      - name: Adjust a permission for repo

				        run: |

				          sudo chown -R $USER:$USER $GITHUB_WORKSPACE

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: get-kata-tarball

				        uses: actions/download-artifact@v4

				        with:

				          name: kata-static-tarball-arm64${{ inputs.tarball-suffix }}

				      - name: Login to Kata Containers quay.io

				        if: ${{ inputs.registry == 'quay.io' }}

				        uses: docker/login-action@v3

				        with:

				          registry: quay.io

				          username: ${{ secrets.QUAY_DEPLOYER_USERNAME }}

				          password: ${{ secrets.QUAY_DEPLOYER_PASSWORD }}

				      - name: Login to Kata Containers ghcr.io

				        if: ${{ inputs.registry == 'ghcr.io' }}

				        uses: docker/login-action@v3

				        with:

				          registry: ghcr.io

				          username: ${{ github.actor }}

				          password: ${{ secrets.GITHUB_TOKEN }}

				      - name: build-and-push-kata-payload

				        id: build-and-push-kata-payload

				        run: |

				          ./tools/packaging/kata-deploy/local-build/kata-deploy-build-and-upload-payload.sh \

				          $(pwd)/kata-static.tar.xz \

				          ${{ inputs.registry }}/${{ inputs.repo }} ${{ inputs.tag }}

									
										75

.github/workflows/publish-kata-deploy-payload-ppc64le.yaml
									
										Normal file
												
				@@ -0,0 +1,75 @@

				name: CI | Publish kata-deploy payload for ppc64le

				on:

				  workflow_call:

				    inputs:

				      tarball-suffix:

				        required: false

				        type: string

				      registry:

				        required: true

				        type: string

				      repo:

				        required: true

				        type: string

				      tag:

				        required: true

				        type: string

				      commit-hash:

				        required: false

				        type: string

				      target-branch:

				        required: false

				        type: string

				        default: ""

				jobs:

				  kata-payload:

				    runs-on: ppc64le

				    steps:

				      - name: Prepare the self-hosted runner

				        run: |

				          ${HOME}/scripts/prepare_runner.sh

				          sudo rm -rf $GITHUB_WORKSPACE/*

				      - name: Adjust a permission for repo

				        run: |

				          sudo chown -R $USER:$USER $GITHUB_WORKSPACE

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: get-kata-tarball

				        uses: actions/download-artifact@v4

				        with:

				          name: kata-static-tarball-ppc64le${{ inputs.tarball-suffix }}

				      - name: Login to Kata Containers quay.io

				        if: ${{ inputs.registry == 'quay.io' }}

				        uses: docker/login-action@v3

				        with:

				          registry: quay.io

				          username: ${{ secrets.QUAY_DEPLOYER_USERNAME }}

				          password: ${{ secrets.QUAY_DEPLOYER_PASSWORD }}

				      - name: Login to Kata Containers ghcr.io

				        if: ${{ inputs.registry == 'ghcr.io' }}

				        uses: docker/login-action@v3

				        with:

				          registry: ghcr.io

				          username: ${{ github.actor }}

				          password: ${{ secrets.GITHUB_TOKEN }}

				      - name: build-and-push-kata-payload

				        id: build-and-push-kata-payload

				        run: |

				          ./tools/packaging/kata-deploy/local-build/kata-deploy-build-and-upload-payload.sh \

				          $(pwd)/kata-static.tar.xz \

				          ${{ inputs.registry }}/${{ inputs.repo }} ${{ inputs.tag }}

									
										69

.github/workflows/publish-kata-deploy-payload-s390x.yaml
									
										Normal file
												
				@@ -0,0 +1,69 @@

				name: CI | Publish kata-deploy payload for s390x

				on:

				  workflow_call:

				    inputs:

				      tarball-suffix:

				        required: false

				        type: string

				      registry:

				        required: true

				        type: string

				      repo:

				        required: true

				        type: string

				      tag:

				        required: true

				        type: string

				      commit-hash:

				        required: false

				        type: string

				      target-branch:

				        required: false

				        type: string

				        default: ""

				jobs:

				  kata-payload:

				    runs-on: s390x

				    steps:

				      - name: Take a pre-action for self-hosted runner

				        run: ${HOME}/script/pre_action.sh ubuntu-2204

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: get-kata-tarball

				        uses: actions/download-artifact@v4

				        with:

				          name: kata-static-tarball-s390x${{ inputs.tarball-suffix }}

				      - name: Login to Kata Containers quay.io

				        if: ${{ inputs.registry == 'quay.io' }}

				        uses: docker/login-action@v3

				        with:

				          registry: quay.io

				          username: ${{ secrets.QUAY_DEPLOYER_USERNAME }}

				          password: ${{ secrets.QUAY_DEPLOYER_PASSWORD }}

				      - name: Login to Kata Containers ghcr.io

				        if: ${{ inputs.registry == 'ghcr.io' }}

				        uses: docker/login-action@v3

				        with:

				          registry: ghcr.io

				          username: ${{ github.actor }}

				          password: ${{ secrets.GITHUB_TOKEN }}

				      - name: build-and-push-kata-payload

				        id: build-and-push-kata-payload

				        run: |

				          ./tools/packaging/kata-deploy/local-build/kata-deploy-build-and-upload-payload.sh \

				          $(pwd)/kata-static.tar.xz \

				          ${{ inputs.registry }}/${{ inputs.repo }} ${{ inputs.tag }}

									
										57

.github/workflows/release-amd64.yaml
									
										Normal file
												
				@@ -0,0 +1,57 @@

				name: Publish Kata release artifacts for amd64

				on:

				  workflow_call:

				    inputs:

				      target-arch:

				        required: true

				        type: string

				jobs:

				  build-kata-static-tarball-amd64:

				    uses: ./.github/workflows/build-kata-static-tarball-amd64.yaml

				    with:

				      stage: release

				  kata-deploy:

				    needs: build-kata-static-tarball-amd64

				    runs-on: ubuntu-latest

				    steps:

				      - name: Login to Kata Containers docker.io

				        uses: docker/login-action@v3

				        with:

				          username: ${{ secrets.DOCKER_USERNAME }}

				          password: ${{ secrets.DOCKER_PASSWORD }}

				      - name: Login to Kata Containers quay.io

				        uses: docker/login-action@v3

				        with:

				          registry: quay.io

				          username: ${{ secrets.QUAY_DEPLOYER_USERNAME }}

				          password: ${{ secrets.QUAY_DEPLOYER_PASSWORD }}

				      - uses: actions/checkout@v4

				      - name: get-kata-tarball

				        uses: actions/download-artifact@v4

				        with:

				          name: kata-static-tarball-amd64

				      - name: build-and-push-kata-deploy-ci-amd64

				        id: build-and-push-kata-deploy-ci-amd64

				        run: |

				          # We need to do such trick here as the format of the $GITHUB_REF

				          # is "refs/tags/<tag>"

				          tag=$(echo $GITHUB_REF | cut -d/ -f3-)

				          if [ "${tag}" = "main" ]; then

				              tag=$(./tools/packaging/release/release.sh release-version)

				              tags=(${tag} "latest")

				          else

				              tags=(${tag})

				          fi

				          for tag in ${tags[@]}; do

				              ./tools/packaging/kata-deploy/local-build/kata-deploy-build-and-upload-payload.sh \

				                  $(pwd)/kata-static.tar.xz "docker.io/katadocker/kata-deploy" \

				                  "${tag}-${{ inputs.target-arch }}"

				              ./tools/packaging/kata-deploy/local-build/kata-deploy-build-and-upload-payload.sh \

				                  $(pwd)/kata-static.tar.xz "quay.io/kata-containers/kata-deploy" \

				                  "${tag}-${{ inputs.target-arch }}"

				          done
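
Reviewer note: the step above derives the tag by dropping the first two `/`-separated fields of `$GITHUB_REF`. A small sketch of that behaviour follows, assuming the usual `refs/tags/<tag>` and `refs/heads/<branch>` shapes; the function name `derive_tag` is hypothetical and only wraps the same `cut` call.

```shell
#!/bin/sh
# Hypothetical wrapper around the cut call used in the release step:
# keep everything from the third "/"-separated field onwards.
derive_tag() {
    printf '%s\n' "$1" | cut -d/ -f3-
}

derive_tag "refs/tags/3.5.0"        # prints 3.5.0
derive_tag "refs/heads/main"        # prints main
derive_tag "refs/heads/topic/foo"   # prints topic/foo
```

Because `-f3-` keeps all remaining fields, a ref name that itself contains slashes survives intact, which `-f3` alone would truncate.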

									
										57

.github/workflows/release-arm64.yaml
									
										Normal file
												
				@@ -0,0 +1,57 @@

				name: Publish Kata release artifacts for arm64

				on:

				  workflow_call:

				    inputs:

				      target-arch:

				        required: true

				        type: string

				jobs:

				  build-kata-static-tarball-arm64:

				    uses: ./.github/workflows/build-kata-static-tarball-arm64.yaml

				    with:

				      stage: release

				  kata-deploy:

				    needs: build-kata-static-tarball-arm64

				    runs-on: arm64-builder

				    steps:

				      - name: Login to Kata Containers docker.io

				        uses: docker/login-action@v3

				        with:

				          username: ${{ secrets.DOCKER_USERNAME }}

				          password: ${{ secrets.DOCKER_PASSWORD }}

				      - name: Login to Kata Containers quay.io

				        uses: docker/login-action@v3

				        with:

				          registry: quay.io

				          username: ${{ secrets.QUAY_DEPLOYER_USERNAME }}

				          password: ${{ secrets.QUAY_DEPLOYER_PASSWORD }}

				      - uses: actions/checkout@v4

				      - name: get-kata-tarball

				        uses: actions/download-artifact@v4

				        with:

				          name: kata-static-tarball-arm64

				      - name: build-and-push-kata-deploy-ci-arm64

				        id: build-and-push-kata-deploy-ci-arm64

				        run: |

				          # We need to do such trick here as the format of the $GITHUB_REF

				          # is "refs/tags/<tag>"

				          tag=$(echo $GITHUB_REF | cut -d/ -f3-)

				          if [ "${tag}" = "main" ]; then

				              tag=$(./tools/packaging/release/release.sh release-version)

				              tags=(${tag} "latest")

				          else

				              tags=(${tag})

				          fi

				          for tag in ${tags[@]}; do

				              ./tools/packaging/kata-deploy/local-build/kata-deploy-build-and-upload-payload.sh \

				                  $(pwd)/kata-static.tar.xz "docker.io/katadocker/kata-deploy" \

				                  "${tag}-${{ inputs.target-arch }}"

				              ./tools/packaging/kata-deploy/local-build/kata-deploy-build-and-upload-payload.sh \

				                  $(pwd)/kata-static.tar.xz "quay.io/kata-containers/kata-deploy" \

				                  "${tag}-${{ inputs.target-arch }}"

				          done

									
										62

.github/workflows/release-ppc64le.yaml
									
										Normal file
												
				@@ -0,0 +1,62 @@

				name: Publish Kata release artifacts for ppc64le

				on:

				  workflow_call:

				    inputs:

				      target-arch:

				        required: true

				        type: string

				jobs:

				  build-kata-static-tarball-ppc64le:

				    uses: ./.github/workflows/build-kata-static-tarball-ppc64le.yaml

				    with:

				      stage: release

				  kata-deploy:

				    needs: build-kata-static-tarball-ppc64le

				    runs-on: ppc64le

				    steps:

				      - name: Prepare the self-hosted runner

				        run: |

				          bash ${HOME}/scripts/prepare_runner.sh

				          sudo rm -rf $GITHUB_WORKSPACE/*

				      - name: Login to Kata Containers docker.io

				        uses: docker/login-action@v3

				        with:

				          username: ${{ secrets.DOCKER_USERNAME }}

				          password: ${{ secrets.DOCKER_PASSWORD }}

				      - name: Login to Kata Containers quay.io

				        uses: docker/login-action@v3

				        with:

				          registry: quay.io

				          username: ${{ secrets.QUAY_DEPLOYER_USERNAME }}

				          password: ${{ secrets.QUAY_DEPLOYER_PASSWORD }}

				      - uses: actions/checkout@v4

				      - name: get-kata-tarball

				        uses: actions/download-artifact@v4

				        with:

				          name: kata-static-tarball-ppc64le

				      - name: build-and-push-kata-deploy-ci-ppc64le

				        id: build-and-push-kata-deploy-ci-ppc64le

				        run: |

				          # We need to do such trick here as the format of the $GITHUB_REF

				          # is "refs/tags/<tag>"

				          tag=$(echo $GITHUB_REF | cut -d/ -f3-)

				          if [ "${tag}" = "main" ]; then

				              tag=$(./tools/packaging/release/release.sh release-version)

				              tags=(${tag} "latest")

				          else

				              tags=(${tag})

				          fi

				          for tag in ${tags[@]}; do

				              ./tools/packaging/kata-deploy/local-build/kata-deploy-build-and-upload-payload.sh \

				                  $(pwd)/kata-static.tar.xz "docker.io/katadocker/kata-deploy" \

				                  "${tag}-${{ inputs.target-arch }}"

				              ./tools/packaging/kata-deploy/local-build/kata-deploy-build-and-upload-payload.sh \

				                  $(pwd)/kata-static.tar.xz "quay.io/kata-containers/kata-deploy" \

				                  "${tag}-${{ inputs.target-arch }}"

				          done

									
										61

.github/workflows/release-s390x.yaml
									
										Normal file
												
				@@ -0,0 +1,61 @@

				name: Publish Kata release artifacts for s390x

				on:

				  workflow_call:

				    inputs:

				      target-arch:

				        required: true

				        type: string

				jobs:

				  build-kata-static-tarball-s390x:

				    uses: ./.github/workflows/build-kata-static-tarball-s390x.yaml

				    with:

				      stage: release

				    secrets: inherit

				  kata-deploy:

				    needs: build-kata-static-tarball-s390x

				    runs-on: s390x

				    steps:

				      - name: Take a pre-action for self-hosted runner

				        run: ${HOME}/script/pre_action.sh ubuntu-2204

				      - name: Login to Kata Containers docker.io

				        uses: docker/login-action@v3

				        with:

				          username: ${{ secrets.DOCKER_USERNAME }}

				          password: ${{ secrets.DOCKER_PASSWORD }}

				      - name: Login to Kata Containers quay.io

				        uses: docker/login-action@v3

				        with:

				          registry: quay.io

				          username: ${{ secrets.QUAY_DEPLOYER_USERNAME }}

				          password: ${{ secrets.QUAY_DEPLOYER_PASSWORD }}

				      - uses: actions/checkout@v4

				      - name: get-kata-tarball

				        uses: actions/download-artifact@v4

				        with:

				          name: kata-static-tarball-s390x

				      - name: build-and-push-kata-deploy-ci-s390x

				        id: build-and-push-kata-deploy-ci-s390x

				        run: |

				          # We need to do such trick here as the format of the $GITHUB_REF

				          # is "refs/tags/<tag>"

				          tag=$(echo $GITHUB_REF | cut -d/ -f3-)

				          if [ "${tag}" = "main" ]; then

				              tag=$(./tools/packaging/release/release.sh release-version)

				              tags=(${tag} "latest")

				          else

				              tags=(${tag})

				          fi

				          for tag in ${tags[@]}; do

				              ./tools/packaging/kata-deploy/local-build/kata-deploy-build-and-upload-payload.sh \

				                  $(pwd)/kata-static.tar.xz "docker.io/katadocker/kata-deploy" \

				                  "${tag}-${{ inputs.target-arch }}"

				              ./tools/packaging/kata-deploy/local-build/kata-deploy-build-and-upload-payload.sh \

				                  $(pwd)/kata-static.tar.xz "quay.io/kata-containers/kata-deploy" \

				                  "${tag}-${{ inputs.target-arch }}"

				          done
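
Note on the tag derivation in the step above: `cut -d/ -f3-` keeps everything from the third `/`-separated field onward, so a ref such as `refs/tags/<tag>` is reduced to `<tag>`, and a branch ref such as `refs/heads/main` is reduced to `main` (which is what the `if [ "${tag}" = "main" ]` branch tests for). The following is only a minimal sketch of that behaviour; the helper name `ref_to_tag` and the sample ref values are assumptions for illustration and are not part of the workflow.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not in the workflow): drop the first two
# "/"-separated fields ("refs" plus "tags" or "heads") from a Git ref.
ref_to_tag() {
    printf '%s\n' "$1" | cut -d/ -f3-
}

# Sample refs, chosen for illustration only.
ref_to_tag "refs/tags/3.5.0"       # a tag push
ref_to_tag "refs/heads/main"       # a branch ref, e.g. a manual run from main
ref_to_tag "refs/heads/topic/foo"  # "-f3-" keeps any later slashes intact
```

Because the field range is open-ended (`-f3-` rather than `-f3`), ref names that themselves contain slashes survive unchanged.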

									
										305

.github/workflows/release.yaml
									
												
				@@ -1,180 +1,189 @@

				name: Publish Kata release artifacts

				name: Release Kata Containers

				on:

				  push:

				    tags:

				      - '[0-9]+.[0-9]+.[0-9]+*'

				  workflow_dispatch

				jobs:

				  build-asset:

				  release:

				    runs-on: ubuntu-latest

				    strategy:

				      matrix:

				        asset:

				          - cloud-hypervisor

				          - firecracker

				          - kernel

				          - kernel-dragonball-experimental

				          - nydus

				          - qemu

				          - rootfs-image

				          - rootfs-initrd

				          - shim-v2

				          - virtiofsd

				    steps:

				      - uses: actions/checkout@v2

				      - name: Install docker

				        run: |

				          curl -fsSL https://test.docker.com -o test-docker.sh

				          sh test-docker.sh

				      - name: Checkout repository

				        uses: actions/checkout@v4

				        with:

				          fetch-depth: 0

				      - name: Build ${{ matrix.asset }}

				      - name: Create a new release

				        run: |

				          ./tools/packaging/kata-deploy/local-build/kata-deploy-copy-yq-installer.sh

				          ./tools/packaging/kata-deploy/local-build/kata-deploy-binaries-in-docker.sh --build="${KATA_ASSET}"

				          build_dir=$(readlink -f build)

				          # store-artifact does not work with symlink

				          sudo cp -r "${build_dir}" "kata-build"

				          ./tools/packaging/release/release.sh create-new-release

				        env:

				          KATA_ASSET: ${{ matrix.asset }}

				          TAR_OUTPUT: ${{ matrix.asset }}.tar.gz

				          GH_TOKEN: ${{ github.token }}

				      - name: store-artifact ${{ matrix.asset }}

				        uses: actions/upload-artifact@v2

				        with:

				          name: kata-artifacts

				          path: kata-build/kata-static-${{ matrix.asset }}.tar.xz

				          if-no-files-found: error

				  build-and-push-assets-amd64:

				    needs: release

				    uses: ./.github/workflows/release-amd64.yaml

				    with:

				      target-arch: amd64

				    secrets: inherit

				  create-kata-tarball:

				  build-and-push-assets-arm64:

				    needs: release

				    uses: ./.github/workflows/release-arm64.yaml

				    with:

				      target-arch: arm64

				    secrets: inherit

				  build-and-push-assets-s390x:

				    needs: release

				    uses: ./.github/workflows/release-s390x.yaml

				    with:

				      target-arch: s390x

				    secrets: inherit

				  build-and-push-assets-ppc64le:

				    needs: release

				    uses: ./.github/workflows/release-ppc64le.yaml

				    with:

				      target-arch: ppc64le

				    secrets: inherit

				  publish-multi-arch-images:

				    runs-on: ubuntu-latest

				    needs: build-asset

				    needs: [build-and-push-assets-amd64, build-and-push-assets-arm64, build-and-push-assets-s390x, build-and-push-assets-ppc64le]

				    steps:

				      - uses: actions/checkout@v2

				      - name: get-artifacts

				        uses: actions/download-artifact@v2

				        with:

				          name: kata-artifacts

				          path: kata-artifacts

				      - name: merge-artifacts

				        run: |

				          ./tools/packaging/kata-deploy/local-build/kata-deploy-merge-builds.sh kata-artifacts

				      - name: store-artifacts

				        uses: actions/upload-artifact@v2

				        with:

				          name: kata-static-tarball

				          path: kata-static.tar.xz

				      - name: Checkout repository

				        uses: actions/checkout@v4

				  kata-deploy:

				    needs: create-kata-tarball

				    runs-on: ubuntu-latest

				    steps:

				      - uses: actions/checkout@v2

				      - name: get-kata-tarball

				        uses: actions/download-artifact@v2

				      - name: Login to Kata Containers docker.io

				        uses: docker/login-action@v3

				        with:

				          name: kata-static-tarball

				      - name: build-and-push-kata-deploy-ci

				        id: build-and-push-kata-deploy-ci

				          username: ${{ secrets.DOCKER_USERNAME }}

				          password: ${{ secrets.DOCKER_PASSWORD }}

				      - name: Login to Kata Containers quay.io

				        uses: docker/login-action@v3

				        with:

				          registry: quay.io

				          username: ${{ secrets.QUAY_DEPLOYER_USERNAME }}

				          password: ${{ secrets.QUAY_DEPLOYER_PASSWORD }}

				      - name: Get the image tags

				        run: |

				          tag=$(echo $GITHUB_REF | cut -d/ -f3-)

				          pushd $GITHUB_WORKSPACE

				          git checkout $tag

				          pkg_sha=$(git rev-parse HEAD)

				          popd

				          mv kata-static.tar.xz $GITHUB_WORKSPACE/tools/packaging/kata-deploy/kata-static.tar.xz

				          docker build --build-arg KATA_ARTIFACTS=kata-static.tar.xz -t katadocker/kata-deploy-ci:$pkg_sha -t quay.io/kata-containers/kata-deploy-ci:$pkg_sha $GITHUB_WORKSPACE/tools/packaging/kata-deploy

				          docker login -u ${{ secrets.DOCKER_USERNAME }} -p ${{ secrets.DOCKER_PASSWORD }}

				          docker push katadocker/kata-deploy-ci:$pkg_sha

				          docker login -u ${{ secrets.QUAY_DEPLOYER_USERNAME }} -p ${{ secrets.QUAY_DEPLOYER_PASSWORD }} quay.io

				          docker push quay.io/kata-containers/kata-deploy-ci:$pkg_sha

				          mkdir -p packaging/kata-deploy

				          ln -s $GITHUB_WORKSPACE/tools/packaging/kata-deploy/action packaging/kata-deploy/action

				          echo "::set-output name=PKG_SHA::${pkg_sha}"

				      - name: test-kata-deploy-ci-in-aks

				        uses: ./packaging/kata-deploy/action

				        with:

				          packaging-sha: ${{steps.build-and-push-kata-deploy-ci.outputs.PKG_SHA}}

				          release_version=$(./tools/packaging/release/release.sh release-version)

				          echo "KATA_DEPLOY_IMAGE_TAGS=$release_version latest" >> "$GITHUB_ENV"

				      - name: Publish multi-arch manifest on docker.io and quay.io

				        run: |

				          ./tools/packaging/release/release.sh publish-multiarch-manifest

				        env:

				          PKG_SHA: ${{steps.build-and-push-kata-deploy-ci.outputs.PKG_SHA}}

				          AZ_APPID: ${{ secrets.AZ_APPID }}

				          AZ_PASSWORD: ${{ secrets.AZ_PASSWORD }}

				          AZ_SUBSCRIPTION_ID: ${{ secrets.AZ_SUBSCRIPTION_ID }}

				          AZ_TENANT_ID: ${{ secrets.AZ_TENANT_ID }}

				      - name: push-tarball

				        run: |

				          # tag the container image we created and push to DockerHub

				          tag=$(echo $GITHUB_REF | cut -d/ -f3-)

				          tags=($tag)

				          tags+=($([[ "$tag" =~ "alpha"|"rc" ]] && echo "latest" || echo "stable"))

				          for tag in ${tags[@]}; do \

				            docker tag katadocker/kata-deploy-ci:${{steps.build-and-push-kata-deploy-ci.outputs.PKG_SHA}} katadocker/kata-deploy:${tag} && \

				            docker tag quay.io/kata-containers/kata-deploy-ci:${{steps.build-and-push-kata-deploy-ci.outputs.PKG_SHA}} quay.io/kata-containers/kata-deploy:${tag} && \

				            docker push katadocker/kata-deploy:${tag} && \

				            docker push quay.io/kata-containers/kata-deploy:${tag}; \

				          done

				          KATA_DEPLOY_REGISTRIES: "quay.io/kata-containers/kata-deploy docker.io/katadocker/kata-deploy"

				  upload-static-tarball:

				    needs: kata-deploy

				  upload-multi-arch-static-tarball:

				    needs: [build-and-push-assets-amd64, build-and-push-assets-arm64, build-and-push-assets-s390x, build-and-push-assets-ppc64le]

				    runs-on: ubuntu-latest

				    steps:

				      - uses: actions/checkout@v2

				      - name: download-artifacts

				        uses: actions/download-artifact@v2

				      - name: Checkout repository

				        uses: actions/checkout@v4

				      - name: Set KATA_STATIC_TARBALL env var

				        run: |

				          tarball=$(pwd)/kata-static.tar.xz

				          echo "KATA_STATIC_TARBALL=${tarball}" >> "$GITHUB_ENV"

				      - name: Download amd64 artifacts

				        uses: actions/download-artifact@v4

				        with:

				          name: kata-static-tarball

				      - name: install hub

				          name: kata-static-tarball-amd64

				      - name: Upload amd64 static tarball to GitHub

				        run: |

				          HUB_VER=$(curl -s "https://api.github.com/repos/github/hub/releases/latest" | jq -r .tag_name | sed 's/^v//')

				          wget -q -O- https://github.com/github/hub/releases/download/v$HUB_VER/hub-linux-amd64-$HUB_VER.tgz | \

				          tar xz --strip-components=2 --wildcards '*/bin/hub' && sudo mv hub /usr/local/bin/hub

				      - name: push static tarball to github

				          ./tools/packaging/release/release.sh upload-kata-static-tarball

				        env:

				          GH_TOKEN: ${{ github.token }}

				          ARCHITECTURE: amd64

				      - name: Download arm64 artifacts

				        uses: actions/download-artifact@v4

				        with:

				          name: kata-static-tarball-arm64

				      - name: Upload arm64 static tarball to GitHub

				        run: |

				          tag=$(echo $GITHUB_REF | cut -d/ -f3-)

				          tarball="kata-static-$tag-x86_64.tar.xz"

				          mv kata-static.tar.xz "$GITHUB_WORKSPACE/${tarball}"

				          pushd $GITHUB_WORKSPACE

				          echo "uploading asset '${tarball}' for tag: ${tag}"

				          GITHUB_TOKEN=${{ secrets.GIT_UPLOAD_TOKEN }} hub release edit -m "" -a "${tarball}" "${tag}"

				          popd

				          ./tools/packaging/release/release.sh upload-kata-static-tarball

				        env:

				          GH_TOKEN: ${{ github.token }}

				          ARCHITECTURE: arm64

				      - name: Download s390x artifacts

				        uses: actions/download-artifact@v4

				        with:

				          name: kata-static-tarball-s390x

				      - name: Upload s390x static tarball to GitHub

				        run: |

				          ./tools/packaging/release/release.sh upload-kata-static-tarball

				        env:

				          GH_TOKEN: ${{ github.token }}

				          ARCHITECTURE: s390x

				      - name: Download ppc64le artifacts

				        uses: actions/download-artifact@v4

				        with:

				          name: kata-static-tarball-ppc64le

				      - name: Upload ppc64le static tarball to GitHub

				        run: |

				          ./tools/packaging/release/release.sh upload-kata-static-tarball

				        env:

				          GH_TOKEN: ${{ github.token }}

				          ARCHITECTURE: ppc64le

				  upload-versions-yaml:

				    needs: release

				    runs-on: ubuntu-latest

				    steps:

				      - name: Checkout repository

				        uses: actions/checkout@v4

				      - name: Upload versions.yaml to GitHub

				        run: |

				          ./tools/packaging/release/release.sh upload-versions-yaml-file

				        env:

				          GH_TOKEN: ${{ github.token }}

				  upload-cargo-vendored-tarball:

				    needs: upload-static-tarball

				    needs: release

				    runs-on: ubuntu-latest

				    steps:

				      - uses: actions/checkout@v2

				      - name: generate-and-upload-tarball

				      - name: Checkout repository

				        uses: actions/checkout@v4

				      - name: Generate and upload vendored code tarball

				        run: |

				          tag=$(echo $GITHUB_REF | cut -d/ -f3-)

				          tarball="kata-containers-$tag-vendor.tar.gz"

				          pushd $GITHUB_WORKSPACE

				          bash -c "tools/packaging/release/generate_vendor.sh ${tarball}"

				          GITHUB_TOKEN=${{ secrets.GIT_UPLOAD_TOKEN }} hub release edit -m "" -a "${tarball}" "${tag}" 

				          popd

				          ./tools/packaging/release/release.sh upload-vendored-code-tarball

				        env:

				          GH_TOKEN: ${{ github.token }}

				  upload-libseccomp-tarball:

				    needs: upload-cargo-vendored-tarball

				    needs: release

				    runs-on: ubuntu-latest

				    steps:

				      - uses: actions/checkout@v2

				      - name: download-and-upload-tarball

				        env:

				          GITHUB_TOKEN: ${{ secrets.GIT_UPLOAD_TOKEN }}

				          GOPATH: ${HOME}/go

				      - name: Checkout repository

				        uses: actions/checkout@v4

				      - name: Download libseccomp tarball and upload it to GitHub

				        run: |

				          pushd $GITHUB_WORKSPACE

				          ./ci/install_yq.sh

				          tag=$(echo $GITHUB_REF | cut -d/ -f3-)

				          versions_yaml="versions.yaml"

				          version=$(${GOPATH}/bin/yq read ${versions_yaml} "externals.libseccomp.version")

				          repo_url=$(${GOPATH}/bin/yq read ${versions_yaml} "externals.libseccomp.url")

				          download_url="${repo_url}/releases/download/v${version}"

				          tarball="libseccomp-${version}.tar.gz"

				          asc="${tarball}.asc"

				          curl -sSLO "${download_url}/${tarball}"

				          curl -sSLO "${download_url}/${asc}"

				          # "-m" option should be empty to re-use the existing release title

				          # without opening a text editor.

				          # For the details, check https://hub.github.com/hub-release.1.html.

				          hub release edit -m "" -a "${tarball}" "${tag}"

				          hub release edit -m "" -a "${asc}" "${tag}"

				          popd

				          ./tools/packaging/release/release.sh upload-libseccomp-tarball

				        env:

				          GH_TOKEN: ${{ github.token }}

				  publish-release:

				    needs: [ build-and-push-assets-amd64, build-and-push-assets-arm64, build-and-push-assets-s390x, build-and-push-assets-ppc64le, publish-multi-arch-images, upload-multi-arch-static-tarball, upload-versions-yaml, upload-cargo-vendored-tarball, upload-libseccomp-tarball ]

				    runs-on: ubuntu-latest

				    steps:

				      - name: Checkout repository

				        uses: actions/checkout@v4

				      - name: Publish a release

				        run: |

				          ./tools/packaging/release/release.sh publish-release

				        env:

				          GH_TOKEN: ${{ github.token }}

									
										54

.github/workflows/require-pr-porting-labels.yaml
									
											
				@@ -1,54 +0,0 @@

				# Copyright (c) 2020 Intel Corporation

				#

				# SPDX-License-Identifier: Apache-2.0

				#

				name: Ensure PR has required porting labels

				on:

				  pull_request_target:

				    types:

				      - opened

				      - reopened

				      - labeled

				      - unlabeled

				    branches:

				      - main

				jobs:

				  check-pr-porting-labels:

				    runs-on: ubuntu-latest

				    steps:

				      - name: Install hub

				        if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') }}

				        run: |

				          HUB_ARCH="amd64"

				          HUB_VER=$(curl -sL "https://api.github.com/repos/github/hub/releases/latest" |\

				            jq -r .tag_name | sed 's/^v//')

				          curl -sL \

				            "https://github.com/github/hub/releases/download/v${HUB_VER}/hub-linux-${HUB_ARCH}-${HUB_VER}.tgz" |\

				          tar xz --strip-components=2 --wildcards '*/bin/hub' && \

				          sudo install hub /usr/local/bin

				      - name: Checkout code to allow hub to communicate with the project

				        if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') }}

				        uses: actions/checkout@v2

				      - name: Install porting checker script

				        run: |

				          # Clone into a temporary directory to avoid overwriting

				          # any existing github directory.

				          pushd $(mktemp -d) &>/dev/null

				          git clone --single-branch --depth 1 "https://github.com/kata-containers/.github" && cd .github/scripts

				          sudo install pr-porting-checks.sh /usr/local/bin

				          popd &>/dev/null

				      - name: Stop PR being merged unless it has a correct set of porting labels

				        if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') }}

				        env:

				          GITHUB_TOKEN: ${{ secrets.KATA_GITHUB_ACTIONS_TOKEN }}

				        run: |

				          pr=${{ github.event.number }}

				          repo=${{ github.repository }}

				          pr-porting-checks.sh "$pr" "$repo"

									
										67

.github/workflows/run-cri-containerd-tests-ppc64le.yaml
									
												
				@@ -0,0 +1,67 @@

				name: CI | Run cri-containerd tests on ppc64le

				on:

				  workflow_call:

				    inputs:

				      tarball-suffix:

				        required: false

				        type: string

				      commit-hash:

				        required: false

				        type: string

				      target-branch:

				        required: false

				        type: string

				        default: ""

				jobs:

				  run-cri-containerd:

				    strategy:

				      # We can set this to true whenever we're 100% sure that

				      # all the tests are not flaky, otherwise we'll fail

				      # all the tests due to a single flaky instance

				      fail-fast: false

				      matrix:

				        containerd_version: ['active']

				        vmm: ['qemu']

				    runs-on: ppc64le

				    env:

				      CONTAINERD_VERSION: ${{ matrix.containerd_version }}

				      GOPATH: ${{ github.workspace }}

				      KATA_HYPERVISOR: ${{ matrix.vmm }}

				    steps:

				      - name: Adjust a permission for repo

				        run: sudo chown -R $USER:$USER $GITHUB_WORKSPACE

				      - name: Prepare the self-hosted runner

				        run: |

				          bash ${HOME}/scripts/prepare_runner.sh cri-containerd

				          sudo rm -rf $GITHUB_WORKSPACE/*

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: Install dependencies

				        run: bash tests/integration/cri-containerd/gha-run.sh install-dependencies

				      - name: get-kata-tarball

				        uses: actions/download-artifact@v4

				        with:

				          name: kata-static-tarball-ppc64le${{ inputs.tarball-suffix }}

				          path: kata-artifacts

				      - name: Install kata

				        run: bash tests/integration/cri-containerd/gha-run.sh install-kata kata-artifacts

				      - name: Run cri-containerd tests

				        run: bash tests/integration/cri-containerd/gha-run.sh run

				      - name: Cleanup actions for the self hosted runner

				        run: ${HOME}/scripts/cleanup_runner.sh

									
										63

.github/workflows/run-cri-containerd-tests-s390x.yaml
									
												
				@@ -0,0 +1,63 @@

				name: CI | Run cri-containerd tests

				on:

				  workflow_call:

				    inputs:

				      tarball-suffix:

				        required: false

				        type: string

				      commit-hash:

				        required: false

				        type: string

				      target-branch:

				        required: false

				        type: string

				        default: ""

				jobs:

				  run-cri-containerd:

				    strategy:

				      # We can set this to true whenever we're 100% sure that

				      # all the tests are not flaky, otherwise we'll fail

				      # all the tests due to a single flaky instance

				      fail-fast: false

				      matrix:

				        containerd_version: ['active']

				        vmm: ['qemu', 'qemu-runtime-rs']

				    runs-on: s390x-large

				    env:

				      CONTAINERD_VERSION: ${{ matrix.containerd_version }}

				      GOPATH: ${{ github.workspace }}

				      KATA_HYPERVISOR: ${{ matrix.vmm }}

				    steps:

				      - name: Take a pre-action for self-hosted runner

				        run: ${HOME}/script/pre_action.sh ubuntu-2204

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: Install dependencies

				        run: bash tests/integration/cri-containerd/gha-run.sh install-dependencies

				      - name: get-kata-tarball

				        uses: actions/download-artifact@v4

				        with:

				          name: kata-static-tarball-s390x${{ inputs.tarball-suffix }}

				          path: kata-artifacts

				      - name: Install kata

				        run: bash tests/integration/cri-containerd/gha-run.sh install-kata kata-artifacts

				      - name: Run cri-containerd tests

				        run: bash tests/integration/cri-containerd/gha-run.sh run

				      - name: Take a post-action for self-hosted runner

				        if: always()

				        run: ${HOME}/script/post_action.sh ubuntu-2204

									
										123

.github/workflows/run-k8s-tests-on-aks.yaml
									
												
				@@ -0,0 +1,123 @@

				name: CI | Run kubernetes tests on AKS

				on:

				  workflow_call:

				    inputs:

				      tarball-suffix:

				        required: false

				        type: string

				      registry:

				        required: true

				        type: string

				      repo:

				        required: true

				        type: string

				      tag:

				        required: true

				        type: string

				      pr-number:

				        required: true

				        type: string

				      commit-hash:

				        required: false

				        type: string

				      target-branch:

				        required: false

				        type: string

				        default: ""

				jobs:

				  run-k8s-tests:

				    strategy:

				      fail-fast: false

				      matrix:

				        host_os:

				          - ubuntu

				        vmm:

				          - clh

				          - dragonball

				          - qemu

				          - stratovirt

				          - cloud-hypervisor

				        instance-type:

				          - small

				          - normal

				        include:

				          - host_os: cbl-mariner

				            vmm: clh

				            instance-type: small

				            genpolicy-pull-method: oci-distribution

				          - host_os: cbl-mariner

				            vmm: clh

				            instance-type: small

				            genpolicy-pull-method: containerd

				          - host_os: cbl-mariner

				            vmm: clh

				            instance-type: normal

				    runs-on: ubuntu-latest

				    env:

				      DOCKER_REGISTRY: ${{ inputs.registry }}

				      DOCKER_REPO: ${{ inputs.repo }}

				      DOCKER_TAG: ${{ inputs.tag }}

				      GH_PR_NUMBER: ${{ inputs.pr-number }}

				      KATA_HOST_OS: ${{ matrix.host_os }}

				      KATA_HYPERVISOR: ${{ matrix.vmm }}

				      KUBERNETES: "vanilla"

				      USING_NFD: "false"

				      K8S_TEST_HOST_TYPE: ${{ matrix.instance-type }}

				      GENPOLICY_PULL_METHOD: ${{ matrix.genpolicy-pull-method }}

				    steps:

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: get-kata-tarball

				        uses: actions/download-artifact@v4

				        with:

				          name: kata-static-tarball-amd64${{ inputs.tarball-suffix }}

				          path: kata-artifacts

				      - name: Install kata

				        run: bash tests/integration/kubernetes/gha-run.sh install-kata-tools kata-artifacts

				      - name: Download Azure CLI

				        run: bash tests/integration/kubernetes/gha-run.sh install-azure-cli

				      - name: Log into the Azure account

				        run: bash tests/integration/kubernetes/gha-run.sh login-azure

				        env:

				          AZ_APPID: ${{ secrets.AZ_APPID }}

				          AZ_PASSWORD: ${{ secrets.AZ_PASSWORD }}

				          AZ_TENANT_ID: ${{ secrets.AZ_TENANT_ID }}

				          AZ_SUBSCRIPTION_ID: ${{ secrets.AZ_SUBSCRIPTION_ID }}

				      - name: Create AKS cluster

				        timeout-minutes: 10

				        run: bash tests/integration/kubernetes/gha-run.sh create-cluster

				      - name: Install `bats`

				        run: bash tests/integration/kubernetes/gha-run.sh install-bats

				      - name: Install `kubectl`

				        run: bash tests/integration/kubernetes/gha-run.sh install-kubectl

				      - name: Download credentials for the Kubernetes CLI to use them

				        run: bash tests/integration/kubernetes/gha-run.sh get-cluster-credentials

				      - name: Deploy Kata

				        timeout-minutes: 10

				        run: bash tests/integration/kubernetes/gha-run.sh deploy-kata-aks

				      - name: Run tests

				        timeout-minutes: 60

				        run: bash tests/integration/kubernetes/gha-run.sh run-tests

				      - name: Delete AKS cluster

				        if: always()

				        run: bash tests/integration/kubernetes/gha-run.sh delete-cluster

									
										100

.github/workflows/run-k8s-tests-on-garm.yaml
									
												
				@@ -0,0 +1,100 @@

				name: CI | Run kubernetes tests on GARM

				on:

				  workflow_call:

				    inputs:

				      registry:

				        required: true

				        type: string

				      repo:

				        required: true

				        type: string

				      tag:

				        required: true

				        type: string

				      pr-number:

				        required: true

				        type: string

				      commit-hash:

				        required: false

				        type: string

				      target-branch:

				        required: false

				        type: string

				        default: ""

				jobs:

				  run-k8s-tests:

				    strategy:

				      fail-fast: false

				      matrix:

				        vmm:

				          - clh #cloud-hypervisor

				          - dragonball

				          - fc #firecracker

				          - qemu

				          - cloud-hypervisor

				        snapshotter:

				          - devmapper

				        k8s:

				          - k3s

				        instance:

				          - garm-ubuntu-2004

				          - garm-ubuntu-2004-smaller

				        include:

				          - instance: garm-ubuntu-2004

				            instance-type: normal

				          - instance: garm-ubuntu-2004-smaller

				            instance-type: small

				    runs-on: ${{ matrix.instance }}

				    env:

				      DOCKER_REGISTRY: ${{ inputs.registry }}

				      DOCKER_REPO: ${{ inputs.repo }}

				      DOCKER_TAG: ${{ inputs.tag }}

				      PR_NUMBER: ${{ inputs.pr-number }}

				      KATA_HYPERVISOR: ${{ matrix.vmm }}

				      KUBERNETES: ${{ matrix.k8s }}

				      SNAPSHOTTER: ${{ matrix.snapshotter }}

				      USING_NFD: "false"

				      K8S_TEST_HOST_TYPE: ${{ matrix.instance-type }}

				    steps:

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: Deploy ${{ matrix.k8s }}

				        run: bash tests/integration/kubernetes/gha-run.sh deploy-k8s

				      - name: Configure the ${{ matrix.snapshotter }} snapshotter

				        run: bash tests/integration/kubernetes/gha-run.sh configure-snapshotter

				      - name: Deploy Kata

				        timeout-minutes: 10

				        run: bash tests/integration/kubernetes/gha-run.sh deploy-kata-garm

				      - name: Install `bats`

				        run: bash tests/integration/kubernetes/gha-run.sh install-bats

				      - name: Run tests

				        timeout-minutes: 30

				        run: bash tests/integration/kubernetes/gha-run.sh run-tests

				      - name: Collect artifacts ${{ matrix.vmm }}

				        run: bash tests/integration/kubernetes/gha-run.sh collect-artifacts

				      - name: Archive artifacts ${{ matrix.vmm }}

				        uses: actions/upload-artifact@v4

				        with:

				          name: k8s-tests-garm-${{ matrix.vmm }}-${{ matrix.snapshotter }}-${{ matrix.k8s }}-${{ matrix.instance }}-${{ inputs.tag }}

				          path: /tmp/artifacts

				          retention-days: 1

				      - name: Delete kata-deploy

				        if: always()

				        run: bash tests/integration/kubernetes/gha-run.sh cleanup-garm

									
										82

.github/workflows/run-k8s-tests-on-ppc64le.yaml
									
												
				@@ -0,0 +1,82 @@

				name: CI | Run kubernetes tests on Power(ppc64le)

				on:

				  workflow_call:

				    inputs:

				      registry:

				        required: true

				        type: string

				      repo:

				        required: true

				        type: string

				      tag:

				        required: true

				        type: string

				      pr-number:

				        required: true

				        type: string

				      commit-hash:

				        required: false

				        type: string

				      target-branch:

				        required: false

				        type: string

				        default: ""

				jobs:

				  run-k8s-tests:

				    strategy:

				      fail-fast: false

				      matrix:

				        vmm:

				          - qemu

				        k8s:

				          - kubeadm

				    runs-on: k8s-ppc64le

				    env:

				      DOCKER_REGISTRY: ${{ inputs.registry }}

				      DOCKER_REPO: ${{ inputs.repo }}

				      DOCKER_TAG: ${{ inputs.tag }}

				      PR_NUMBER: ${{ inputs.pr-number }}

				      GOPATH: ${{ github.workspace }}

				      KATA_HYPERVISOR: ${{ matrix.vmm }}

				      KUBERNETES: ${{ matrix.k8s }}

				      USING_NFD: "false"

				      TARGET_ARCH: "ppc64le"

				    steps:

				      - name: Prepare the self-hosted runner

				        run: | 

				          bash ${HOME}/scripts/prepare_runner.sh kubernetes

				          sudo rm -rf $GITHUB_WORKSPACE/*

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: Install golang

				        run: |

				          ./tests/install_go.sh -f -p

				          echo "/usr/local/go/bin" >> $GITHUB_PATH

				      - name: Prepare the runner for k8s cluster creation

				        run: bash ${HOME}/scripts/k8s_cluster_cleanup.sh

				      - name: Create k8s cluster using kubeadm

				        run: bash ${HOME}/scripts/k8s_cluster_create.sh

				      - name: Deploy Kata

				        timeout-minutes: 10

				        run: bash tests/integration/kubernetes/gha-run.sh deploy-kata-kubeadm

				      - name: Run tests

				        timeout-minutes: 30

				        run: bash tests/integration/kubernetes/gha-run.sh run-tests

				      - name: Delete cluster and post cleanup actions

				        run: bash ${HOME}/scripts/k8s_cluster_cleanup.sh

									
										93

.github/workflows/run-k8s-tests-on-zvsi.yaml
									
										vendored
									
										Normal file
									
				@@ -0,0 +1,93 @@

				name: CI | Run kubernetes tests on IBM Cloud Z virtual server instance (zVSI)

				on:

				  workflow_call:

				    inputs:

				      registry:

				        required: true

				        type: string

				      repo:

				        required: true

				        type: string

				      tag:

				        required: true

				        type: string

				      pr-number:

				        required: true

				        type: string

				      commit-hash:

				        required: false

				        type: string

				      target-branch:

				        required: false

				        type: string

				        default: ""

				jobs:

				  run-k8s-tests:

				    strategy:

				      fail-fast: false

				      matrix:

				        vmm:

				          - qemu

				        snapshotter:

				          - devmapper

				          - nydus

				        k8s:

				          - k3s

				        include:

				          - snapshotter: devmapper

				            pull-type: default

				            using-nfd: true

				            deploy-cmd: configure-snapshotter

				          - snapshotter: nydus

				            pull-type: guest-pull

				            using-nfd: false

				            deploy-cmd: deploy-snapshotter

				    runs-on: s390x-large

				    env:

				      DOCKER_REGISTRY: ${{ inputs.registry }}

				      DOCKER_REPO: ${{ inputs.repo }}

				      DOCKER_TAG: ${{ inputs.tag }}

				      PR_NUMBER: ${{ inputs.pr-number }}

				      GH_PR_NUMBER: ${{ inputs.pr-number }}

				      KATA_HOST_OS: "ubuntu"

				      KATA_HYPERVISOR: ${{ matrix.vmm }}

				      KUBERNETES: "k3s"

				      PULL_TYPE: ${{ matrix.pull-type }}

				      SNAPSHOTTER: ${{ matrix.snapshotter }}

				      USING_NFD: ${{ matrix.using-nfd }}

				      TARGET_ARCH: "s390x"

				    steps:

				      - name: Take a pre-action for self-hosted runner

				        run: ${HOME}/script/pre_action.sh ubuntu-2204

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: Deploy ${{ matrix.k8s }}

				        run: bash tests/integration/kubernetes/gha-run.sh deploy-k8s

				      - name: Configure the ${{ matrix.snapshotter }} snapshotter

				        run: bash tests/integration/kubernetes/gha-run.sh ${{ matrix.deploy-cmd }}

				      - name: Deploy Kata

				        timeout-minutes: 10

				        run: bash tests/integration/kubernetes/gha-run.sh deploy-kata-zvsi

				      - name: Run tests

				        timeout-minutes: 60

				        run: bash tests/integration/kubernetes/gha-run.sh run-tests

				      - name: Take a post-action

				        if: always()

				        run: |

				          bash tests/integration/kubernetes/gha-run.sh cleanup-zvsi || true

				          ${HOME}/script/post_action.sh ubuntu-2204

									
										86

.github/workflows/run-k8s-tests-with-crio-on-garm.yaml
									
										vendored
									
										Normal file
									
				@@ -0,0 +1,86 @@

				name: CI | Run kubernetes tests, using CRI-O, on GARM

				on:

				  workflow_call:

				    inputs:

				      registry:

				        required: true

				        type: string

				      repo:

				        required: true

				        type: string

				      tag:

				        required: true

				        type: string

				      pr-number:

				        required: true

				        type: string

				      commit-hash:

				        required: false

				        type: string

				      target-branch:

				        required: false

				        type: string

				        default: ""

				jobs:

				  run-k8s-tests:

				    strategy:

				      fail-fast: false

				      matrix:

				        vmm:

				          - qemu

				        k8s:

				          - k0s

				        instance:

				          - garm-ubuntu-2204

				          - garm-ubuntu-2204-smaller

				        include:

				          - instance: garm-ubuntu-2204

				            instance-type: normal

				          - instance: garm-ubuntu-2204-smaller

				            instance-type: small

				          - k8s: k0s

				            k8s-extra-params: '--cri-socket remote:unix:///var/run/crio/crio.sock --kubelet-extra-args --cgroup-driver="systemd"'

				    runs-on: ${{ matrix.instance }}

				    env:

				      DOCKER_REGISTRY: ${{ inputs.registry }}

				      DOCKER_REPO: ${{ inputs.repo }}

				      DOCKER_TAG: ${{ inputs.tag }}

				      PR_NUMBER: ${{ inputs.pr-number }}

				      KATA_HYPERVISOR: ${{ matrix.vmm }}

				      KUBERNETES: ${{ matrix.k8s }}

				      KUBERNETES_EXTRA_PARAMS: ${{ matrix.k8s-extra-params }}

				      USING_NFD: "false"

				      K8S_TEST_HOST_TYPE: ${{ matrix.instance-type }}

				    steps:

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: Configure CRI-O

				        run: bash tests/integration/kubernetes/gha-run.sh setup-crio

				      - name: Deploy ${{ matrix.k8s }}

				        run: bash tests/integration/kubernetes/gha-run.sh deploy-k8s

				      - name: Deploy Kata

				        timeout-minutes: 10

				        run: bash tests/integration/kubernetes/gha-run.sh deploy-kata-garm

				      - name: Install `bats`

				        run: bash tests/integration/kubernetes/gha-run.sh install-bats

				      - name: Run tests

				        timeout-minutes: 30

				        run: bash tests/integration/kubernetes/gha-run.sh run-tests

				      - name: Delete kata-deploy

				        if: always()

				        run: bash tests/integration/kubernetes/gha-run.sh cleanup-garm

									
										275

.github/workflows/run-kata-coco-tests.yaml
									
										vendored
									
										Normal file
									
				@@ -0,0 +1,275 @@

				name: CI | Run kata coco tests

				on:

				  workflow_call:

				    inputs:

				      registry:

				        required: true

				        type: string

				      repo:

				        required: true

				        type: string

				      tag:

				        required: true

				        type: string

				      pr-number:

				        required: true

				        type: string

				      commit-hash:

				        required: false

				        type: string

				      target-branch:

				        required: false

				        type: string

				        default: ""

				jobs:

				  run-k8s-tests-on-tdx:

				    strategy:

				      fail-fast: false

				      matrix:

				        vmm:

				          - qemu-tdx

				        snapshotter:

				          - nydus

				        pull-type:

				          - guest-pull

				    runs-on: tdx

				    env:

				      DOCKER_REGISTRY: ${{ inputs.registry }}

				      DOCKER_REPO: ${{ inputs.repo }}

				      DOCKER_TAG: ${{ inputs.tag }}

				      PR_NUMBER: ${{ inputs.pr-number }}

				      KATA_HYPERVISOR: ${{ matrix.vmm }}

				      KUBERNETES: "k3s"

				      USING_NFD: "true"

				      K8S_TEST_HOST_TYPE: "baremetal"

				      SNAPSHOTTER: ${{ matrix.snapshotter }}

				      PULL_TYPE: ${{ matrix.pull-type }}

				    steps:

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: Deploy Snapshotter

				        timeout-minutes: 5

				        run: bash tests/integration/kubernetes/gha-run.sh deploy-snapshotter

				      - name: Deploy Kata

				        timeout-minutes: 10

				        run: bash tests/integration/kubernetes/gha-run.sh deploy-kata-tdx

				      - name: Run tests

				        timeout-minutes: 30

				        run: bash tests/integration/kubernetes/gha-run.sh run-tests

				      - name: Delete kata-deploy

				        if: always()

				        run: bash tests/integration/kubernetes/gha-run.sh cleanup-tdx

				      - name: Delete Snapshotter

				        if: always()

				        run: bash tests/integration/kubernetes/gha-run.sh cleanup-snapshotter

				  run-k8s-tests-on-sev:

				    strategy:

				      fail-fast: false

				      matrix:

				        vmm:

				          - qemu-sev

				        snapshotter:

				          - nydus

				        pull-type:

				          - guest-pull

				    runs-on: sev

				    env:

				      DOCKER_REGISTRY: ${{ inputs.registry }}

				      DOCKER_REPO: ${{ inputs.repo }}

				      DOCKER_TAG: ${{ inputs.tag }}

				      PR_NUMBER: ${{ inputs.pr-number }}

				      KATA_HYPERVISOR: ${{ matrix.vmm }}

				      KUBECONFIG: /home/kata/.kube/config

				      KUBERNETES: "vanilla"

				      USING_NFD: "false"

				      K8S_TEST_HOST_TYPE: "baremetal"

				      SNAPSHOTTER: ${{ matrix.snapshotter }}

				      PULL_TYPE: ${{ matrix.pull-type }}

				    steps:

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: Deploy Snapshotter

				        timeout-minutes: 5

				        run: bash tests/integration/kubernetes/gha-run.sh deploy-snapshotter

				      - name: Deploy Kata

				        timeout-minutes: 10

				        run: bash tests/integration/kubernetes/gha-run.sh deploy-kata-sev

				      - name: Run tests

				        timeout-minutes: 30

				        run: bash tests/integration/kubernetes/gha-run.sh run-tests

				      - name: Delete kata-deploy

				        if: always()

				        run: bash tests/integration/kubernetes/gha-run.sh cleanup-sev

				      - name: Delete Snapshotter

				        if: always()

				        run: bash tests/integration/kubernetes/gha-run.sh cleanup-snapshotter

				  run-k8s-tests-sev-snp:

				    strategy:

				      fail-fast: false

				      matrix:

				        vmm:

				          - qemu-snp

				        snapshotter:

				          - nydus

				        pull-type:

				          - guest-pull

				    runs-on: sev-snp

				    env:

				      DOCKER_REGISTRY: ${{ inputs.registry }}

				      DOCKER_REPO: ${{ inputs.repo }}

				      DOCKER_TAG: ${{ inputs.tag }}

				      PR_NUMBER: ${{ inputs.pr-number }}

				      KATA_HYPERVISOR: ${{ matrix.vmm }}

				      KUBECONFIG: /home/kata/.kube/config

				      KUBERNETES: "vanilla"

				      USING_NFD: "false"

				      K8S_TEST_HOST_TYPE: "baremetal"

				      SNAPSHOTTER: ${{ matrix.snapshotter }}

				      PULL_TYPE: ${{ matrix.pull-type }}

				    steps:

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: Deploy Snapshotter

				        timeout-minutes: 5

				        run: bash tests/integration/kubernetes/gha-run.sh deploy-snapshotter

				      - name: Deploy Kata

				        timeout-minutes: 10

				        run: bash tests/integration/kubernetes/gha-run.sh deploy-kata-snp

				      - name: Run tests

				        timeout-minutes: 30

				        run: bash tests/integration/kubernetes/gha-run.sh run-tests

				      - name: Delete kata-deploy

				        if: always()

				        run: bash tests/integration/kubernetes/gha-run.sh cleanup-snp

				      - name: Delete Snapshotter

				        if: always()

				        run: bash tests/integration/kubernetes/gha-run.sh cleanup-snapshotter

				  # Generate jobs for testing CoCo on non-TEE environments

				  run-k8s-tests-coco-nontee:

				    strategy:

				      fail-fast: false

				      matrix:

				        vmm:

				          - qemu-coco-dev

				        snapshotter:

				          - nydus

				        pull-type:

				          - guest-pull

				    runs-on: ubuntu-latest

				    env:

				      DOCKER_REGISTRY: ${{ inputs.registry }}

				      DOCKER_REPO: ${{ inputs.repo }}

				      DOCKER_TAG: ${{ inputs.tag }}

				      GH_PR_NUMBER: ${{ inputs.pr-number }}

				      KATA_HOST_OS: ${{ matrix.host_os }}

				      KATA_HYPERVISOR: ${{ matrix.vmm }}

				      # Some tests rely on that variable to run (or not)

				      KBS: "true"

				      # Set the KBS ingress handler (empty string disables handling)

				      KBS_INGRESS: "aks"

				      KUBERNETES: "vanilla"

				      PULL_TYPE: ${{ matrix.pull-type }}

				      SNAPSHOTTER: ${{ matrix.snapshotter }}

				      USING_NFD: "false"

				    steps:

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: Download Azure CLI

				        run: bash tests/integration/kubernetes/gha-run.sh install-azure-cli

				      - name: Log into the Azure account

				        run: bash tests/integration/kubernetes/gha-run.sh login-azure

				        env:

				          AZ_APPID: ${{ secrets.AZ_APPID }}

				          AZ_PASSWORD: ${{ secrets.AZ_PASSWORD }}

				          AZ_TENANT_ID: ${{ secrets.AZ_TENANT_ID }}

				          AZ_SUBSCRIPTION_ID: ${{ secrets.AZ_SUBSCRIPTION_ID }}

				      - name: Create AKS cluster

				        timeout-minutes: 10

				        run: bash tests/integration/kubernetes/gha-run.sh create-cluster

				      - name: Install `bats`

				        run: bash tests/integration/kubernetes/gha-run.sh install-bats

				      - name: Install `kubectl`

				        run: bash tests/integration/kubernetes/gha-run.sh install-kubectl

				      - name: Download credentials for the Kubernetes CLI to use them

				        run: bash tests/integration/kubernetes/gha-run.sh get-cluster-credentials

				      - name: Deploy Snapshotter

				        timeout-minutes: 5

				        run: bash tests/integration/kubernetes/gha-run.sh deploy-snapshotter

				      - name: Deploy Kata

				        timeout-minutes: 10

				        run: bash tests/integration/kubernetes/gha-run.sh deploy-kata-aks

				      - name: Deploy CoCo KBS

				        timeout-minutes: 10

				        run: bash tests/integration/kubernetes/gha-run.sh deploy-coco-kbs

				      - name: Install `kbs-client`

				        timeout-minutes: 10

				        run: bash tests/integration/kubernetes/gha-run.sh install-kbs-client

				      - name: Run tests

				        timeout-minutes: 60

				        run: bash tests/integration/kubernetes/gha-run.sh run-tests

				      - name: Delete AKS cluster

				        if: always()

				        run: bash tests/integration/kubernetes/gha-run.sh delete-cluster

									
										90

.github/workflows/run-kata-deploy-tests-on-aks.yaml
									
										vendored
									
										Normal file
									
				@@ -0,0 +1,90 @@

				name: CI | Run kata-deploy tests on AKS

				on:

				  workflow_call:

				    inputs:

				      registry:

				        required: true

				        type: string

				      repo:

				        required: true

				        type: string

				      tag:

				        required: true

				        type: string

				      pr-number:

				        required: true

				        type: string

				      commit-hash:

				        required: false

				        type: string

				      target-branch:

				        required: false

				        type: string

				        default: ""

				jobs:

				  run-kata-deploy-tests:

				    strategy:

				      fail-fast: false

				      matrix:

				        host_os:

				          - ubuntu

				        vmm:

				          - clh

				          - dragonball

				          - qemu

				        include:

				          - host_os: cbl-mariner

				            vmm: clh

				    runs-on: ubuntu-latest

				    env:

				      DOCKER_REGISTRY: ${{ inputs.registry }}

				      DOCKER_REPO: ${{ inputs.repo }}

				      DOCKER_TAG: ${{ inputs.tag }}

				      GH_PR_NUMBER: ${{ inputs.pr-number }}

				      KATA_HOST_OS: ${{ matrix.host_os }}

				      KATA_HYPERVISOR: ${{ matrix.vmm }}

				      KUBERNETES: "vanilla"

				      USING_NFD: "false"

				    steps:

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: Download Azure CLI

				        run: bash tests/functional/kata-deploy/gha-run.sh install-azure-cli

				      - name: Log into the Azure account

				        run: bash tests/functional/kata-deploy/gha-run.sh login-azure

				        env:

				          AZ_APPID: ${{ secrets.AZ_APPID }}

				          AZ_PASSWORD: ${{ secrets.AZ_PASSWORD }}

				          AZ_TENANT_ID: ${{ secrets.AZ_TENANT_ID }}

				          AZ_SUBSCRIPTION_ID: ${{ secrets.AZ_SUBSCRIPTION_ID }}

				      - name: Create AKS cluster

				        timeout-minutes: 10

				        run: bash tests/functional/kata-deploy/gha-run.sh create-cluster

				      - name: Install `bats`

				        run: bash tests/functional/kata-deploy/gha-run.sh install-bats

				      - name: Install `kubectl`

				        run: bash tests/functional/kata-deploy/gha-run.sh install-kubectl

				      - name: Download credentials for the Kubernetes CLI to use them

				        run: bash tests/functional/kata-deploy/gha-run.sh get-cluster-credentials

				      - name: Run tests

				        run: bash tests/functional/kata-deploy/gha-run.sh run-tests

				      - name: Delete AKS cluster

				        if: always()

				        run: bash tests/functional/kata-deploy/gha-run.sh delete-cluster

									
										65

.github/workflows/run-kata-deploy-tests-on-garm.yaml
									
										vendored
									
										Normal file
									
				@@ -0,0 +1,65 @@

				name: CI | Run kata-deploy tests on GARM

				on:

				  workflow_call:

				    inputs:

				      registry:

				        required: true

				        type: string

				      repo:

				        required: true

				        type: string

				      tag:

				        required: true

				        type: string

				      pr-number:

				        required: true

				        type: string

				      commit-hash:

				        required: false

				        type: string

				      target-branch:

				        required: false

				        type: string

				        default: ""

				jobs:

				  run-kata-deploy-tests:

				    strategy:

				      fail-fast: false

				      matrix:

				        vmm:

				          - clh

				          - qemu

				        k8s:

				          - k0s

				          - k3s

				          - rke2

				    runs-on: garm-ubuntu-2004-smaller

				    env:

				      DOCKER_REGISTRY: ${{ inputs.registry }}

				      DOCKER_REPO: ${{ inputs.repo }}

				      DOCKER_TAG: ${{ inputs.tag }}

				      PR_NUMBER: ${{ inputs.pr-number }}

				      KATA_HYPERVISOR: ${{ matrix.vmm }}

				      KUBERNETES: ${{ matrix.k8s }}

				      USING_NFD: "false"

				    steps:

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: Deploy ${{ matrix.k8s }}

				        run: bash tests/functional/kata-deploy/gha-run.sh deploy-k8s

				      - name: Install `bats`

				        run: bash tests/functional/kata-deploy/gha-run.sh install-bats

				      - name: Run tests

				        run: bash tests/functional/kata-deploy/gha-run.sh run-tests

									
										59

.github/workflows/run-kata-monitor-tests.yaml
									
										vendored
									
										Normal file
									
				@@ -0,0 +1,59 @@

				name: CI | Run kata-monitor tests

				on:

				  workflow_call:

				    inputs:

				      tarball-suffix:

				        required: false

				        type: string

				      commit-hash:

				        required: false

				        type: string

				      target-branch:

				        required: false

				        type: string

				        default: ""

				jobs:

				  run-monitor:

				    strategy:

				      fail-fast: false

				      matrix:

				        vmm:

				          - qemu

				        container_engine:

				          - crio

				          - containerd

				        include:

				          - container_engine: containerd

				            containerd_version: lts

				    runs-on: garm-ubuntu-2204-smaller

				    env:

				      CONTAINER_ENGINE: ${{ matrix.container_engine }}

				      CONTAINERD_VERSION: ${{ matrix.containerd_version }}

				      KATA_HYPERVISOR: ${{ matrix.vmm }}

				    steps:

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: Install dependencies

				        run: bash tests/functional/kata-monitor/gha-run.sh install-dependencies

				      - name: get-kata-tarball

				        uses: actions/download-artifact@v4

				        with:

				          name: kata-static-tarball-amd64${{ inputs.tarball-suffix }}

				          path: kata-artifacts

				      - name: Install kata

				        run: bash tests/functional/kata-monitor/gha-run.sh install-kata kata-artifacts

				      - name: Run kata-monitor tests

				        run: bash tests/functional/kata-monitor/gha-run.sh run

									
										94

.github/workflows/run-metrics.yaml
									
										vendored
									
										Normal file
									
				@@ -0,0 +1,94 @@

				name: CI | Run test metrics

				on:

				  workflow_call:

				    inputs:

				      tarball-suffix:

				        required: false

				        type: string

				      commit-hash:

				        required: false

				        type: string

				      target-branch:

				        required: false

				        type: string

				        default: ""

				jobs:

				  setup-kata:

				    name: Kata Setup

				    runs-on: metrics

				    env:

				      GOPATH: ${{ github.workspace }}

				    steps:

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: get-kata-tarball

				        uses: actions/download-artifact@v4

				        with:

				          name: kata-static-tarball-amd64${{ inputs.tarball-suffix }}

				          path: kata-artifacts

				      - name: Install kata

				        run: bash tests/metrics/gha-run.sh install-kata kata-artifacts

				  run-metrics:

				    needs: setup-kata

				    strategy:

				      # We can set this to true whenever we're 100% sure that

				      # all the tests are not flaky, otherwise we'll fail

				      # all the tests due to a single flaky instance.

				      fail-fast: false

				      matrix:

				        vmm: ['clh', 'qemu', 'stratovirt']

				      max-parallel: 1

				    runs-on: metrics

				    env:

				      GOPATH: ${{ github.workspace }}

				      KATA_HYPERVISOR: ${{ matrix.vmm }}

				    steps:

				      - name: enabling the hypervisor

				        run: bash tests/metrics/gha-run.sh enabling-hypervisor

				      - name: run launch times test

				        run: bash tests/metrics/gha-run.sh run-test-launchtimes

				      - name: run memory foot print test

				        run: bash tests/metrics/gha-run.sh run-test-memory-usage

				      - name: run memory usage inside container test

				        run: bash tests/metrics/gha-run.sh run-test-memory-usage-inside-container

				      - name: run blogbench test

				        run: bash tests/metrics/gha-run.sh run-test-blogbench

				      - name: run tensorflow test

				        run: bash tests/metrics/gha-run.sh run-test-tensorflow

				      - name: run fio test

				        run: bash tests/metrics/gha-run.sh run-test-fio

				      - name: run iperf test

				        run: bash tests/metrics/gha-run.sh run-test-iperf

				      - name: run latency test

				        run: bash tests/metrics/gha-run.sh run-test-latency

				      - name: make metrics tarball ${{ matrix.vmm }}

				        run: bash tests/metrics/gha-run.sh make-tarball-results

				      - name: archive metrics results ${{ matrix.vmm }}

				        uses: actions/upload-artifact@v4

				        with:

				          name: metrics-artifacts-${{ matrix.vmm }}

				          path: results-${{ matrix.vmm }}.tar.gz

				          retention-days: 1

				          if-no-files-found: error

									
										46

.github/workflows/run-runk-tests.yaml
									
										vendored
									
										Normal file
									
				@@ -0,0 +1,46 @@

				name: CI | Run runk tests

				on:

				  workflow_call:

				    inputs:

				      tarball-suffix:

				        required: false

				        type: string

				      commit-hash:

				        required: false

				        type: string

				      target-branch:

				        required: false

				        type: string

				        default: ""

				jobs:

				  run-runk:

				    runs-on: garm-ubuntu-2204-smaller

				    env:

				      CONTAINERD_VERSION: lts

				    steps:

				      - uses: actions/checkout@v4

				        with:

				          ref: ${{ inputs.commit-hash }}

				          fetch-depth: 0

				      - name: Rebase atop of the latest target branch

				        run: |

				          ./tests/git-helper.sh "rebase-atop-of-the-latest-target-branch"

				        env:

				          TARGET_BRANCH: ${{ inputs.target-branch }}

				      - name: Install dependencies

				        run: bash tests/integration/runk/gha-run.sh install-dependencies

				      - name: get-kata-tarball

				        uses: actions/download-artifact@v4

				        with:

				          name: kata-static-tarball-amd64${{ inputs.tarball-suffix }}

				          path: kata-artifacts

				      - name: Install kata

				        run: bash tests/integration/runk/gha-run.sh install-kata kata-artifacts

				      - name: Run runk tests

				        run: bash tests/integration/runk/gha-run.sh run

									
										52

.github/workflows/snap-release.yaml
									
										vendored
									
				@@ -1,52 +0,0 @@

				name: Release Kata in snapcraft store

				on:

				  push:

				    tags:

				      - '[0-9]+.[0-9]+.[0-9]+*'

				env:

				  SNAPCRAFT_STORE_CREDENTIALS: ${{ secrets.snapcraft_token }}

				jobs:

				  release-snap:

				    runs-on: ubuntu-20.04

				    steps:

				      - name: Check out Git repository

				        uses: actions/checkout@v2

				        with:

				          fetch-depth: 0

				      - name: Install Snapcraft

				        run: |

				          # Required to avoid snapcraft install failure

				          sudo chown root:root /

				          # "--classic" is needed for the GitHub action runner

				          # environment.

				          sudo snap install snapcraft --classic

				          # Allow other parts to access snap binaries

				          echo /snap/bin >> "$GITHUB_PATH"

				      - name: Build snap

				        run: |

				          # Removing man-db, workflow kept failing, fixes: #4480

				          sudo apt -y remove --purge man-db

				          sudo apt-get install -y git git-extras

				          kata_url="https://github.com/kata-containers/kata-containers"

				          latest_version=$(git ls-remote --tags ${kata_url}  | egrep -o "refs.*" | egrep -v "\-alpha|\-rc|{}" | egrep -o "[[:digit:]]+\.[[:digit:]]+\.[[:digit:]]+" | sort -V -r | head -1)

				          current_version="$(echo ${GITHUB_REF} | cut -d/ -f3)"

				          # Check semantic versioning format (x.y.z) and if the current tag is the latest tag

				          if echo "${current_version}" | grep -q "^[[:digit:]]\+\.[[:digit:]]\+\.[[:digit:]]\+$" && echo -e "$latest_version\n$current_version" | sort -C -V; then

				            # Current version is the latest version, build it

				            snapcraft snap --debug --destructive-mode

				          fi

				      - name: Upload snap

				        run: |

				          snap_version="$(echo ${GITHUB_REF} | cut -d/ -f3)"

				          snap_file="kata-containers_${snap_version}_amd64.snap"

				          # Upload the snap if it exists

				          if [ -f ${snap_file} ]; then

				            snapcraft upload --release=stable ${snap_file}

				          fi

									
										37

.github/workflows/snap.yaml
									
										vendored
									
				@@ -1,37 +0,0 @@

				name: snap CI

				on:

				  pull_request:

				    types:

				      - opened

				      - synchronize

				      - reopened

				      - edited

				    paths-ignore: [ '**.md', '**.png', '**.jpg', '**.jpeg', '**.svg', '/docs/**' ]

				jobs:

				  test:

				    runs-on: ubuntu-20.04

				    steps:

				      - name: Check out

				        if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') }}

				        uses: actions/checkout@v2

				        with:

				          fetch-depth: 0

				      - name: Install Snapcraft

				        if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') }}

				        run: |

				          # Required to avoid snapcraft install failure

				          sudo chown root:root /

				          # "--classic" is needed for the GitHub action runner

				          # environment.

				          sudo snap install snapcraft --classic

				          # Allow other parts to access snap binaries

				          echo /snap/bin >> "$GITHUB_PATH"

				      - name: Build snap

				        if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') }}

				        run: |

				          snapcraft snap --debug --destructive-mode

									
										17

.github/workflows/stale.yaml
									
										vendored
									
										Normal file
									
												
				@@ -0,0 +1,17 @@

				name: 'Automatically close stale PRs'

				on:

				  schedule:

				    - cron: '0 0 * * *'

				  workflow_dispatch:

				jobs:

				  stale:

				    runs-on: ubuntu-latest

				    steps:

				      - uses: actions/stale@v9

				        with:

				          stale-pr-message: 'This PR has been opened without with no activity for 180 days. Comment on the issue otherwise it will be closed in 7 days'

				          days-before-pr-stale: 180

				          days-before-pr-close: 7

				          days-before-issue-stale: -1

				          days-before-issue-close: -1

									
										33

.github/workflows/static-checks-dragonball.yaml
									
										vendored
									
											
				@@ -1,33 +0,0 @@

				on:

				  pull_request:

				    types:

				      - opened

				      - edited

				      - reopened

				      - synchronize

				    paths-ignore: [ '**.md', '**.png', '**.jpg', '**.jpeg', '**.svg', '/docs/**' ]

				name: Static checks dragonball

				jobs:

				  test-dragonball:

				    runs-on: self-hosted

				    env:

				      RUST_BACKTRACE: "1"

				    steps:

				      - uses: actions/checkout@v3

				      - name: Set env

				        if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') }}

				        run: |

				          echo "GOPATH=${{ github.workspace }}" >> $GITHUB_ENV

				      - name: Install Rust

				        if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') }}

				        run: |

				          ./ci/install_rust.sh

				          PATH=$PATH:"$HOME/.cargo/bin"

				      - name: Run Unit Test

				        if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') }}

				        run: |

				          cd src/dragonball

				          cargo version

				          rustc --version

				          sudo -E env PATH=$PATH LIBC=gnu SUPPORT_VIRTUALIZATION=true make test

									
										26

.github/workflows/static-checks-self-hosted.yaml
									
										vendored
									
										Normal file
									
												
				@@ -0,0 +1,26 @@

				on:

				  pull_request:

				    types:

				      - opened

				      - synchronize

				      - reopened

				      - labeled # a workflow runs only when the 'ok-to-test' label is added

				concurrency:

				  group: ${{ github.workflow }}-${{ github.event.pull_request.number || github.ref }}

				  cancel-in-progress: true

				name: Static checks self-hosted

				jobs:

				  build-checks:

				    if: ${{ contains(github.event.pull_request.labels.*.name, 'ok-to-test') }}

				    strategy:

				      fail-fast: false

				      matrix:

				        instance:

				          - "arm-no-k8s"

				          - "s390x"

				          - "ppc64le"

				    uses: ./.github/workflows/build-checks.yaml

				    with:

				      instance: ${{ matrix.instance }}

									
										173

.github/workflows/static-checks.yaml
									
										vendored
									
												
				@@ -6,93 +6,106 @@ on:

				      - reopened

				      - synchronize

				concurrency:

				  group: ${{ github.workflow }}-${{ github.event.pull_request.number || github.ref }}

				  cancel-in-progress: true

				name: Static checks

				jobs:

				  check-kernel-config-version:

				    runs-on: ubuntu-latest

				    steps:

				      - name: Checkout the code

				        uses: actions/checkout@v4

				        with:

				          fetch-depth: 0

				      - name: Ensure the kernel config version has been updated

				        run: |

				          kernel_dir="tools/packaging/kernel/"

				          kernel_version_file="${kernel_dir}kata_config_version"

				          modified_files=$(git diff --name-only origin/$GITHUB_BASE_REF..HEAD)

				          if git diff --name-only origin/$GITHUB_BASE_REF..HEAD "${kernel_dir}" | grep "${kernel_dir}"; then

				            echo "Kernel directory has changed, checking if $kernel_version_file has been updated"

				            if echo "$modified_files" | grep -v "README.md" | grep "${kernel_dir}" >>"/dev/null"; then

				              echo "$modified_files" | grep "$kernel_version_file" >>/dev/null || ( echo "Please bump version in $kernel_version_file" && exit 1)

				            else

				              echo "Readme file changed, no need for kernel config version update."

				            fi

				            echo "Check passed"

				          fi

				  build-checks:

				    uses: ./.github/workflows/build-checks.yaml

				    with:

				      instance: ubuntu-20.04

				  build-checks-depending-on-kvm:

				    runs-on: garm-ubuntu-2004-smaller

				    strategy:

				      fail-fast: false

				      matrix:

				        component:

				          - runtime-rs

				        include:

				          - component: runtime-rs

				            command: "sudo -E env PATH=$PATH LIBC=gnu SUPPORT_VIRTUALIZATION=true make test"

				          - component: runtime-rs

				            component-path: src/dragonball

				    steps:

				      - name: Checkout the code

				        uses: actions/checkout@v4

				        with:

				          fetch-depth: 0

				      - name: Install system deps

				        run: |

				          sudo apt-get install -y build-essential musl-tools

				      - name: Install yq

				        run: |

				          sudo -E ./ci/install_yq.sh

				        env:

				          INSTALL_IN_GOPATH: false

				      - name: Install rust

				        run: |

				          export PATH="$PATH:/usr/local/bin"

				          ./tests/install_rust.sh

				      - name: Running `${{ matrix.command }}` for ${{ matrix.component }}

				        run: |

				          export PATH="$PATH:${HOME}/.cargo/bin"

				          cd ${{ matrix.component-path }}

				          ${{ matrix.command }}

				        env:

				          RUST_BACKTRACE: "1"

				  static-checks:

				    runs-on: ubuntu-20.04

				    strategy:

				      fail-fast: false

				      matrix:

				        cmd:

				          - "make vendor"

				          - "make static-checks"

				          - "make check"

				          - "make test"

				          - "sudo -E PATH=\"$PATH\" make test"

				    env:

				      TRAVIS: "true"

				      TRAVIS_BRANCH: ${{ github.base_ref }}

				      TRAVIS_PULL_REQUEST_BRANCH: ${{ github.head_ref }}

				      TRAVIS_PULL_REQUEST_SHA : ${{ github.event.pull_request.head.sha }}

				      RUST_BACKTRACE: "1"

				      target_branch: ${{ github.base_ref }}

				      GOPATH: ${{ github.workspace }}

				    steps:

				    - name: Checkout code

				      uses: actions/checkout@v3

				      with:

				        fetch-depth: 0

				        path: ./src/github.com/${{ github.repository }}

				    - name: Install Go

				      uses: actions/setup-go@v3

				      with:

				        go-version: 1.19.3

				      env:

				        GOPATH: ${{ runner.workspace }}/kata-containers

				    - name: Check kernel config version

				      run: |

				        cd "${{ github.workspace }}/src/github.com/${{ github.repository }}"

				        kernel_dir="tools/packaging/kernel/"

				        kernel_version_file="${kernel_dir}kata_config_version"

				        modified_files=$(git diff --name-only origin/main..HEAD)

				        result=$(git whatchanged origin/main..HEAD "${kernel_dir}" >>"/dev/null")

				        if git whatchanged origin/main..HEAD "${kernel_dir}" >>"/dev/null"; then

				          echo "Kernel directory has changed, checking if $kernel_version_file has been updated"

				          if echo "$modified_files" | grep -v "README.md" | grep "${kernel_dir}" >>"/dev/null"; then

				            echo "$modified_files" | grep "$kernel_version_file" >>/dev/null || ( echo "Please bump version in $kernel_version_file" && exit 1)

				          else

				            echo "Readme file changed, no need for kernel config version update."

				          fi

				          echo "Check passed"

				        fi

				    - name: Setup GOPATH

				      if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') }}

				      run: |

				        echo "TRAVIS_BRANCH: ${TRAVIS_BRANCH}"

				        echo "TRAVIS_PULL_REQUEST_BRANCH: ${TRAVIS_PULL_REQUEST_BRANCH}"

				        echo "TRAVIS_PULL_REQUEST_SHA: ${TRAVIS_PULL_REQUEST_SHA}"

				        echo "TRAVIS: ${TRAVIS}"

				    - name: Set env

				      if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') }}

				      run: |

				        echo "GOPATH=${{ github.workspace }}" >> $GITHUB_ENV

				        echo "${{ github.workspace }}/bin" >> $GITHUB_PATH

				    - name: Setup travis references

				      if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') }}

				      run: |

				        echo "TRAVIS_BRANCH=${TRAVIS_BRANCH:-$(echo $GITHUB_REF | awk 'BEGIN { FS = \"/\" } ; { print $3 }')}"

				        target_branch=${TRAVIS_BRANCH}

				    - name: Setup

				      if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') }}

				      run: |

				        cd ${GOPATH}/src/github.com/${{ github.repository }} && ./ci/setup.sh

				      env:

				        GOPATH: ${{ runner.workspace }}/kata-containers

				    - name: Installing rust

				      if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') }}

				      run: |

				        cd ${GOPATH}/src/github.com/${{ github.repository }} && ./ci/install_rust.sh

				        PATH=$PATH:"$HOME/.cargo/bin"

				        rustup target add x86_64-unknown-linux-musl

				        rustup component add rustfmt clippy

				    - name: Setup seccomp

				      if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') }}

				      run: |

				        libseccomp_install_dir=$(mktemp -d -t libseccomp.XXXXXXXXXX)

				        gperf_install_dir=$(mktemp -d -t gperf.XXXXXXXXXX)

				        cd ${GOPATH}/src/github.com/${{ github.repository }} && ./ci/install_libseccomp.sh "${libseccomp_install_dir}" "${gperf_install_dir}"

				        echo "Set environment variables for the libseccomp crate to link the libseccomp library statically"

				        echo "LIBSECCOMP_LINK_TYPE=static" >> $GITHUB_ENV

				        echo "LIBSECCOMP_LIB_PATH=${libseccomp_install_dir}/lib" >> $GITHUB_ENV

				    - name: Run check

				      if: ${{ !contains(github.event.pull_request.labels.*.name, 'force-skip-ci') }}

				      run: |

				        cd ${GOPATH}/src/github.com/${{ github.repository }} && ${{ matrix.cmd }}

				      - name: Checkout code

				        uses: actions/checkout@v4

				        with:

				          fetch-depth: 0

				          path: ./src/github.com/${{ github.repository }}

				      - name: Install yq

				        run: |

				          cd ${GOPATH}/src/github.com/${{ github.repository }}

				          ./ci/install_yq.sh

				        env:

				          INSTALL_IN_GOPATH: false

				      - name: Install golang

				        run: |

				          cd ${GOPATH}/src/github.com/${{ github.repository }}

				          ./tests/install_go.sh -f -p

				          echo "/usr/local/go/bin" >> $GITHUB_PATH

				      - name: Install system dependencies

				        run: |

				          sudo apt-get -y install moreutils hunspell hunspell-en-gb hunspell-en-us pandoc

				      - name: Run check

				        run: |

				          export PATH=${PATH}:${GOPATH}/bin

				          cd ${GOPATH}/src/github.com/${{ github.repository }} && ${{ matrix.cmd }}

3

.gitignore vendored


@@ -6,6 +6,8 @@
 **/.vscode
 **/.idea
 **/.fleet
 **/*.swp
 **/*.swo
 pkg/logging/Cargo.lock
 src/agent/src/version.rs
 src/agent/kata-agent.service
@@ -13,3 +15,4 @@ src/agent/protocols/src/*.rs
 !src/agent/protocols/src/lib.rs
 build
 src/tools/log-parser/kata-log-parser
 tools/packaging/static-build/agent/install_libseccomp.sh

83

CODEOWNERS


@@ -1,4 +1,4 @@
 # Copyright (c) 2019 Intel Corporation
 # Copyright (c) 2019-2023 Intel Corporation
 #
 # SPDX-License-Identifier: Apache-2.0
 #
@@ -9,4 +9,83 @@
 # Order in this file is important. Only the last match will be
 # used. See https://help.github.com/articles/about-code-owners/
 *.md    @kata-containers/documentation
 /CODEOWNERS			@kata-containers/codeowners
 VERSION				@kata-containers/release
 # The versions database needs careful handling
 versions.yaml			@kata-containers/release @kata-containers/ci @kata-containers/tests
 Makefile*			@kata-containers/build
 *.mak				@kata-containers/build
 *.mk				@kata-containers/build
 # Documentation related files could also appear anywhere
 # else in the repo.
 *.md				@kata-containers/documentation
 *.drawio			@kata-containers/documentation
 *.jpg				@kata-containers/documentation
 *.png				@kata-containers/documentation
 *.svg				@kata-containers/documentation
 *.bash				@kata-containers/shell
 *.sh				@kata-containers/shell
 **/completions/			@kata-containers/shell
 Dockerfile*			@kata-containers/docker
 /ci/				@kata-containers/ci
 *.bats				@kata-containers/tests
 /tests/				@kata-containers/tests
 *.rs				@kata-containers/rust
 *.go				@kata-containers/golang
 /utils/				@kata-containers/utils
 # FIXME: Maybe a new "protocol" team would be better?
 #
 # All protocol changes must be reviewed.
 # Note, we include all subdirs, including the vendor dir, as at present there are no .proto files
 # in the vendor dir. Later we may have to extend this matching rule if that changes.
 /src/libs/protocols/*.proto	@kata-containers/architecture-committee @kata-containers/builder @kata-containers/packaging
 # GitHub Actions
 /.github/workflows/		@kata-containers/action-admins @kata-containers/ci
 /ci/				@kata-containers/ci @kata-containers/tests
 /docs/				@kata-containers/documentation
 /src/agent/			@kata-containers/agent
 /src/runtime*/			@kata-containers/runtime
 /src/runtime/			@kata-containers/golang
 src/runtime-rs/			@kata-containers/rust
 src/libs/			@kata-containers/rust
 src/dragonball/			@kata-containers/dragonball
 /tools/osbuilder/		@kata-containers/builder
 /tools/packaging/		@kata-containers/packaging
 /tools/packaging/kernel/	@kata-containers/kernel
 /tools/packaging/kata-deploy/	@kata-containers/kata-deploy
 /tools/packaging/qemu/		@kata-containers/qemu
 /tools/packaging/release/	@kata-containers/release
 **/vendor/			@kata-containers/vendoring
 # Handle arch specific files last so they match more specifically than
 # the kernel packaging files.
 **/*aarch64*			@kata-containers/arch-aarch64
 **/*arm64*			@kata-containers/arch-aarch64
 **/*amd64*			@kata-containers/arch-amd64
 **/*x86-64*			@kata-containers/arch-amd64
 **/*x86_64*			@kata-containers/arch-amd64
 **/*ppc64*			@kata-containers/arch-ppc64le
 **/*s390x*			@kata-containers/arch-s390x

									
										11

Makefile
									
												
				@@ -1,4 +1,4 @@

				# Copyright (c) 2020 Intel Corporation

				# Copyright (c) 2020-2023 Intel Corporation

				#

				# SPDX-License-Identifier: Apache-2.0

				#

				@@ -23,6 +23,10 @@ TOOLS += trace-forwarder

				STANDARD_TARGETS = build check clean install static-checks-build test vendor

				# Variables for the build-and-publish-kata-debug target

				KATA_DEBUG_REGISTRY ?= ""

				KATA_DEBUG_TAG ?= ""

				default: all

				include utils.mk

				@@ -38,11 +42,14 @@ generate-protocols:

				# Some static checks rely on generated source files of components.

				static-checks: static-checks-build

					bash ci/static-checks.sh

					bash tests/static-checks.sh github.com/kata-containers/kata-containers

				docs-url-alive-check:

					bash ci/docs-url-alive-check.sh

				build-and-publish-kata-debug:

					bash tools/packaging/kata-debug/kata-debug-build-and-upload-payload.sh ${KATA_DEBUG_REGISTRY} ${KATA_DEBUG_TAG} 

				.PHONY: \

					all \

					kata-tarball \

									
										20

README.md
									
												
				@@ -1,4 +1,6 @@

				<img src="https://www.openstack.org/assets/kata/kata-vertical-on-white.png" width="150">

				<img src="https://object-storage-ca-ymq-1.vexxhost.net/swift/v1/6e4619c416ff4bd19e1c087f27a43eea/www-images-prod/openstack-logo/kata/SVG/kata-1.svg" width="900">

				[![CI | Publish Kata Containers payload](https://github.com/kata-containers/kata-containers/actions/workflows/payload-after-push.yaml/badge.svg)](https://github.com/kata-containers/kata-containers/actions/workflows/payload-after-push.yaml) [![Kata Containers Nightly CI](https://github.com/kata-containers/kata-containers/actions/workflows/ci-nightly.yaml/badge.svg)](https://github.com/kata-containers/kata-containers/actions/workflows/ci-nightly.yaml)

				# Kata Containers

				@@ -121,7 +123,7 @@ The table below lists the core parts of the project:

				| [agent](src/agent) | core | Management process running inside the virtual machine / POD that sets up the container environment. |

				| [`dragonball`](src/dragonball) | core | An optional built-in VMM brings out-of-the-box Kata Containers experience with optimizations on container workloads |

				| [documentation](docs) | documentation | Documentation common to all components (such as design and install documentation). |

				| [tests](https://github.com/kata-containers/tests) | tests | Excludes unit tests which live with the main code. |

				| [tests](tests) | tests | Excludes unit tests which live with the main code. |

				### Additional components

				@@ -132,19 +134,27 @@ The table below lists the remaining parts of the project:

				| [packaging](tools/packaging) | infrastructure | Scripts and metadata for producing packaged binaries<br/>(components, hypervisors, kernel and rootfs). |

				| [kernel](https://www.kernel.org) | kernel | Linux kernel used by the hypervisor to boot the guest image. Patches are stored [here](tools/packaging/kernel). |

				| [osbuilder](tools/osbuilder) | infrastructure | Tool to create "mini O/S" rootfs and initrd images and kernel for the hypervisor. |

				| [kata-debug](tools/packaging/kata-debug/README.md) | infrastructure | Utility tool to gather Kata Containers debug information from Kubernetes clusters. |

				| [`agent-ctl`](src/tools/agent-ctl) | utility | Tool that provides low-level access for testing the agent. |

				| [`kata-ctl`](src/tools/kata-ctl) | utility | Tool that provides advanced commands and debug facilities. |

				| [`trace-forwarder`](src/tools/trace-forwarder) | utility | Agent tracing helper. |

				| [`runk`](src/tools/runk) | utility | Standard OCI container runtime based on the agent. |

				| [`ci`](https://github.com/kata-containers/ci) | CI | Continuous Integration configuration files and scripts. |

				| [`ci`](.github/workflows) | CI | Continuous Integration configuration files and scripts. |

				| [`katacontainers.io`](https://github.com/kata-containers/www.katacontainers.io) | Source for the [`katacontainers.io`](https://www.katacontainers.io) site. |

				| [`Webhook`](tools/testing/kata-webhook/README.md) | utility | Example of a simple admission controller webhook to annotate pods with the Kata runtime class |

				### Packaging and releases

				Kata Containers is now

				[available natively for most distributions](docs/install/README.md#packaged-installation-methods).

				However, packaging scripts and metadata are still used to generate [snap](snap/local) and GitHub releases. See

				the [components](#components) section for further details.

				## General tests

				See the [tests documentation](tests/README.md).

				## Metrics tests

				See the [metrics documentation](tests/metrics/README.md).

				## Glossary of Terms

2

VERSION


@@ -1 +1 @@
 .1.0-rc0
 .5.0

									
										343

ci/README.md
									
										Normal file
									
												
				@@ -0,0 +1,343 @@

				# Kata Containers CI

				> [!WARNING]

				> While this project's CI has several areas for improvement, it is constantly

				> evolving. This document attempts to describe its current state, but due to

				> ongoing changes, you may notice some outdated information here. Feel free to

				> modify/improve this document as you use the CI and notice anything odd. The

				> community appreciates it!

				## Introduction

				The Kata Containers CI relies on [GitHub Actions][gh-actions], where the actions

				themselves can be found in the `.github/workflows` directory, and they may call

				helper scripts, which are located under the `tests` directory, to actually

				perform the tasks required for each test case.

				## The different workflows

				There are a few different sets of workflows that are running as part of our CI,

				and here we're going to cover the ones that are less likely to get rotten.  With

				this said, it's fair to advise that if the reader finds something that got

				rotten, opening an issue to the project pointing to the problem is a nice way to

				help, and providing a fix for the issue is a very encouraging way to help.

				### Jobs that run automatically when a PR is raised

				These are a bunch of tests that will automatically run as soon as a PR is

				opened. They mostly run on "cost free" runners, and they do some

				pre-checks to evaluate whether your PR is ready to start getting reviewed.

				Mind, though, that the community expects the contributors to, at least, build

				their code before submitting a PR, which the community sees as a very fair

				request.

				Without getting into the weeds with details on this, those jobs are the ones

				responsible for ensuring that:

				- The commit message is in the expected format

				- There's no missing Developer's Certificate of Origin

				- Static checks are passing

				### Jobs that require a maintainer's approval to run

				These are the required tests, and our so-called "CI".  These require a

				maintainer's approval to run as parts of those jobs will be running on "paid

				runners", which are currently using Azure infrastructure.

				Once a maintainer of the project gives "the green light" (currently by adding an

				`ok-to-test` label to the PR, soon to be changed to commenting "/test" as part

				of a PR review), the following tests will be executed:

				- Build all the components (runs on cost free runners, or bare-metal depending on the architecture)

				- Create a tarball with all the components (runs on cost free runners, or bare-metal depending on the architecture)

				- Create a kata-deploy payload with the tarball generated in the previous step (runs on cost free runners, or bare-metal depending on the architecture)

				- Run the following tests:

				  - Tests depending on the generated tarball

				    - Metrics (runs on bare-metal)

				    - `docker` (runs on Azure small instances)

				    - `nerdctl` (runs on Azure small instances)

				    - `kata-monitor` (runs on Azure small instances)

				    - `cri-containerd` (runs on Azure small instances)

				    - `nydus` (runs on Azure small instances)

				    - `vfio` (runs on Azure normal instances)

				  - Tests depending on the generated kata-deploy payload

				    - kata-deploy (runs on Azure small instances)

				      - Tests are performed using different "Kubernetes flavors", such as k0s, k3s, rke2, and Azure Kubernetes Service (AKS).

				    - Kubernetes (runs in Azure small and medium instances depending on what's required by each test, and on TEE bare-metal machines)

				      - Tests are performed with different runtime engines, such as CRI-O and containerd.

				      - Tests are performed with different snapshotters for containerd, namely OverlayFS and devmapper.

				      - Tests are performed with all the supported hypervisors, which are Cloud Hypervisor, Dragonball, Firecracker, and QEMU.

				For all the tests relying on Azure instances, real money is being spent, so the

				community asks for the maintainers to be mindful about those, and avoid abusing

				them to merely debug issues.

				## The different runners

				In the previous section we mentioned using different runners. In this section we'll go through each type of runner used.

				- Cost free runners: Those are the runners provided by GitHub itself, and

				  those are fairly small machines with no virtualization capabilities enabled.

				- Azure small instances: Those are runners which have virtualization

				  capabilities enabled, 2 CPUs, and 8GB of RAM.  These runners have a "-smaller"

				  suffix to their name. 

				- Azure normal instances: Those are runners which have virtualization

				  capabilities enabled, 4 CPUs, and 16GB of RAM.  These runners are usually

				  `garm` ones with no "-smaller" suffix.

				- Bare-metal runners: Those are runners provided by community contributors,

				  and they may vary in architecture, size and virtualization capabilities.

				  Builder runners don't actually require any virtualization capabilities, while

				  runners which will be actually performing the tests must have virtualization

				  capabilities and a reasonable amount of CPU and RAM available (at least

				  matching the Azure normal instances).
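
The sizing rules above can be sketched as a small shell helper. This is only an illustration: `runner_specs` is a hypothetical name that does not exist in the repository, and the label patterns are assumed from the "-smaller" naming convention described in the list.

```bash
# Hypothetical helper: map a runner label to "<cpus> <memory in MiB>".
# Sizes follow the list above; the patterns are an assumption based on
# the "-smaller" suffix convention.
runner_specs() {
    local label="$1"
    case "${label}" in
        garm-*-smaller) echo "2 8192" ;;   # Azure small instance: 2 CPUs, 8GB of RAM
        garm-*)         echo "4 16384" ;;  # Azure normal instance: 4 CPUs, 16GB of RAM
        *)              return 1 ;;        # GitHub-hosted or bare-metal: no fixed size
    esac
}
```

For example, `runner_specs garm-ubuntu-2304-smaller` prints `2 8192`, while a label such as `ubuntu-latest` makes the function return a non-zero status because its size is not defined by this document.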

				## Adding new tests

				Before you decide to add a new test, we strongly recommend that you go

				through the [GitHub Actions Documentation][gh-actions],

				which will give you a very sensible background on how to read and understand

				the current tests we have, and help you become familiar with how to write a new test.

				On the Kata Containers land, there are basically two sets of tests: "standalone"

				and "part of something bigger".

				The "standalone" tests, for example the commit message check, won't be covered

				here as they're better covered by the GitHub Actions documentation linked above.

				The "part of something bigger" is the more complicated one and not so

				straightforward to add, so we'll be focusing our efforts on describing the

				addition of those.

				> [!NOTE]

				> TODO: Currently, this document refers to "tests" when it actually means the

				> jobs (or workflows) of GitHub. In an ideal world, except in some specific cases,

				> new tests should be added without the need to add new workflows. In the

				> not-too-distant future (hopefully), we will improve the workflows to support

				> this.

				### Adding a new test that's "part of something bigger"

				The first important thing here is to align expectations, and we must say that

				the community strongly prefers receiving tests that already come with:

				- Instructions how to run them

				- A proven run where it's passing

				There are several ways to achieve those two requirements, and an example of that

				can be seen in PR #8115.

				With the expectations aligned, adding a test consists of:

				- Adding a new yaml file for your test, and ensuring it's called from the

				  "bigger" yaml. See the [Kata Monitor test example][monitor-ex01].

				- Adding the helper scripts needed for your test to run. Again, use the [Kata Monitor script as an example][monitor-ex02].

				Following those examples, the community advice during the review, and even

				asking the community directly on Slack are the best ways to get your test

				accepted.

				## Running tests

				### Running the tests as part of the CI

				If you're a maintainer of the project, you'll be able to kick in the tests by

				yourself.  With the current approach, you just need to add the `ok-to-test`

				label and the tests will automatically start.  We're moving, though, to use a

				`/test` command as part of a GitHub review comment, which will simplify this

				process.

				If you're not a maintainer, please, send a message on Slack or wait till one of

				the maintainers reviews your PR.  Maintainers will then kick in the tests on

				your behalf.

				In case a test fails and there's the suspicion it happens due to flakiness in

				the test itself, please create an issue for us, and then re-run (or ask

				maintainers to re-run) the tests following these steps:

				- Locate which test is failing

				- Click on "details"

				- In the top right corner, click on "Re-run jobs"

				- And then on "Re-run failed jobs"

				- And finally click on the green "Re-run jobs" button

				> [!NOTE]

				> TODO: We need figures here

				### Running the tests locally

				In this section, aligning expectations is also something very important, as one

				will not be able to run the tests exactly in the same way the tests are running

				in the CI, as one most likely won't have access to an Azure subscription.

				However, we're trying our best here to provide you with instructions on how to

				run the tests in an environment that's "close enough" and will help you to debug

				issues you find with the current tests, or even provide a proof-of-concept to

				the new test you're trying to add.

				The basic steps, which we will cover in detail below, are:

				 1. Create a VM matching the configuration of the target runner

				 2. Generate the artifacts you'll need for the test, or download them from a

				    current failed run

				 3. Follow the steps provided in the action itself to run the tests.

				Although the general overview looks easy, we know that some tricks need to be

				shared, and we'll go through the general process of debugging one non-Kubernetes

				and one Kubernetes specific test for educational purposes.

				One important thing to note is that "Create a VM" can be done in innumerable

				different ways, using the tools of your choice.  For the sake of simplicity on

				this guide, we'll be using `kcli`, which we strongly recommend in case you're a

				non-experienced user, and happen to be developing on a Linux box.

				For both non-Kubernetes and Kubernetes cases, we'll be using PR #8070 as an

				example, which at the time this document is being written serves the purpose

				very well, as you can see that we have `nerdctl` and Kubernetes tests failing.

				## Debugging tests

				### Debugging a non-Kubernetes test

				As shown above, the `nerdctl` test is failing.

				As a developer you can go ahead to the details of the job, and expand the job

				that's failing in order to gather more information.

				But when that doesn't help, we need to set up our own environment to debug

				what's going on.

				Taking a look at the `nerdctl` test, which is located here, you can easily see

				that it runs-on a `garm-ubuntu-2304-smaller` virtual machine.

				The important parts to understand are `ubuntu-2304`, which is the OS where the

				test is running on; and "smaller", which means we're running it on a machine

				with 2 CPUs and 8GB of RAM.

				With this information, we can go ahead and create a similar VM locally using `kcli`.

				```bash

				$ sudo kcli create vm -i ubuntu2304 -P disks=[60] -P numcpus=2 -P memory=8192 -P cpumodel=host-passthrough debug-nerdctl-pr8070

				```
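
The same derivation can be sketched for other runner labels. The helper below only prints a command and does not run `kcli`. `kcli_cmd_for_runner` is a hypothetical name, the 4 CPU / 16GB default comes from the "Azure normal instances" description, and deriving the image name by dropping the hyphens from the label is an assumption that happens to hold for `ubuntu2304`.

```bash
# Hypothetical helper: print (do not run) a kcli command for a VM that
# resembles a given garm runner. Only the flags from the command above are used.
kcli_cmd_for_runner() {
    local label="$1" vm_name="$2"
    local cpus=4 memory=16384            # Azure normal instance
    case "${label}" in
        *-smaller) cpus=2; memory=8192 ;;  # Azure small instance
    esac
    # Assumption: "garm-ubuntu-2304-smaller" maps to the kcli image "ubuntu2304".
    local image="${label#garm-}"
    image="${image%-smaller}"
    image="$(printf '%s' "${image}" | tr -d '-')"
    echo "sudo kcli create vm -i ${image} -P disks=[60] -P numcpus=${cpus} -P memory=${memory} -P cpumodel=host-passthrough ${vm_name}"
}
```

Calling `kcli_cmd_for_runner garm-ubuntu-2304-smaller debug-nerdctl-pr8070` prints the same command as shown above, which you can review before running it yourself.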

				In order to run the tests, you'll need the "kata-tarball" artifacts, which you

				can build yourself using "make kata-tarball" (see below), or simply get them

				from the PR where the tests failed.  To download them, click on the "Summary"

				button that's on the top left corner, and then scroll down till you see the

				artifacts, as shown below.

				Unfortunately GitHub doesn't give us a link that we can download those from

				inside the VM, but we can download them on our local box, and then `scp` the

				tarball to the newly created VM that will be used for debugging purposes.

				> [!NOTE]

				> Those artifacts are only available (for 15 days) when all jobs are finished.

				Once you have the `kata-static.tar.xz` in your VM, you can log in to the VM with

				`kcli ssh debug-nerdctl-pr8070` and then clone your development branch:

				```bash

				$ git clone --branch feat_add-fc-runtime-rs https://github.com/nubificus/kata-containers

				```

				Add upstream as a remote, set up your git identity, and rebase your branch atop upstream's main branch:

				```bash

				$ git remote add upstream https://github.com/kata-containers/kata-containers

				$ git remote update

				$ git config --global user.email "you@example.com"

				$ git config --global user.name "Your Name"

				$ git rebase upstream/main 

				```

				Now copy the `kata-static.tar.xz` into your `kata-containers/kata-artifacts` directory

				```bash

				$ mkdir kata-artifacts

				$ cp ../kata-static.tar.xz kata-artifacts/

				```

				> [!NOTE]

				> If you downloaded the .zip from GitHub, you need to uncompress it first to get `kata-static.tar.xz`.
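
The staging step above can be sketched as a small helper that accepts either form of the artifact. This is only an illustration: the function name `stage_kata_tarball` is hypothetical, and it assumes the GitHub `.zip` contains `kata-static.tar.xz` at its top level, which the note implies but does not state.

```bash
#!/usr/bin/env bash
# Hypothetical helper (not part of the repository): place kata-static.tar.xz
# into the artifacts directory, whether it arrives as a .zip or as the tarball.
stage_kata_tarball() {
    local src="$1"
    local dest="${2:-kata-artifacts}"

    mkdir -p "$dest"
    case "$src" in
        # Assumption: the zip holds kata-static.tar.xz at its top level.
        *.zip) unzip -o "$src" -d "$dest" ;;
        *.tar.xz) cp "$src" "$dest/" ;;
        *) echo "unsupported artifact: $src" >&2; return 1 ;;
    esac

    # Succeed only if the tarball the tests expect is now in place.
    [ -f "$dest/kata-static.tar.xz" ]
}

# Example call, run from the kata-containers checkout:
#   stage_kata_tarball ../kata-static.tar.xz kata-artifacts
```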

				And finally, run the tests following what's in the YAML file for the test you're

				debugging.

				In our case, that's `run-nerdctl-tests-on-garm.yaml`.

				When looking at the file you'll notice that some environment variables are set,

				such as `KATA_HYPERVISOR`. For this particular example,

				the important steps to follow are:

				- Install the dependencies

				- Install kata

				- Run the tests

				Let's now run the steps mentioned above, exporting the expected environment variables:

				```bash

				$ export KATA_HYPERVISOR=dragonball

				$ bash ./tests/integration/nerdctl/gha-run.sh install-dependencies

				$ bash ./tests/integration/nerdctl/gha-run.sh install-kata

				$ bash tests/integration/nerdctl/gha-run.sh run

				```
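
If you expect to repeat these steps while iterating, a small wrapper that stops at the first failing step may help. The sketch below is hypothetical (the function `run_nerdctl_steps` does not exist in the repository). It assumes the three `gha-run.sh` subcommands shown above, and defaults `KATA_HYPERVISOR` to `dragonball` only when it is not already set.

```bash
#!/usr/bin/env bash
# Hypothetical wrapper: run the three gha-run.sh steps in order and stop at
# the first one that fails, reporting which step it was.
run_nerdctl_steps() {
    # The runner path is a parameter so a different test's gha-run.sh can be used.
    local runner="${1:-tests/integration/nerdctl/gha-run.sh}"
    local step

    for step in install-dependencies install-kata run; do
        echo "==> ${step}"
        KATA_HYPERVISOR="${KATA_HYPERVISOR:-dragonball}" bash "${runner}" "${step}" || {
            echo "step '${step}' failed" >&2
            return 1
        }
    done
}

# Example call, run from the kata-containers checkout:
#   run_nerdctl_steps
```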

				And with this you should've been able to reproduce exactly the same issue found

				in the CI, and from now on you can build your own code, use your own binaries,

				and have fun debugging and hacking! 

				### Debugging a Kubernetes test

				Steps for debugging the Kubernetes tests are very similar to the ones for

				debugging non-Kubernetes tests, with the caveat that what you'll need, this

				time, is not the `kata-static.tar.xz` tarball, but rather a payload to be used

				with kata-deploy.

				In order to generate your own kata-deploy image you can generate your own

				`kata-static.tar.xz` and then take advantage of the following script.  Be aware

				that the image generated and uploaded must be accessible by the VM where you'll

				be performing your tests.

				In case you want to take advantage of the payload that was already generated

				when you faced the CI failure, which is considerably easier, take a look at the

				failed job, then click on "Deploy Kata" and expand the "Final kata-deploy.yaml

				that is used in the test" section.  From there you can see exactly what you'll

				have to use when deploying kata-deploy in your local cluster.

				> [!NOTE]

				> TODO: WAINER TO FINISH THIS PART BASED ON HIS PR TO RUN A LOCAL CI

				## Adding new runners

				Any admin of the project is able to add or remove GitHub runners, and those are

				the folks you should rely on.

				If you need a new runner added, please tag @ac in the Kata Containers Slack,

				and someone from that group will be able to help you.

				If you're part of that group and you're looking for information on how to help

				someone, this is simple, and must be done in private. Basically what you have to

				do is:

				- Go to the kata-containers/kata-containers repo

				- Click on the Settings button, located in the top right corner

				- On the left panel, under "Code and automation", click on "Actions"

				- Click on "Runners"

				If you want to add a new self-hosted runner:

				- In the top right corner there's a green button called "New self-hosted runner"

				If you want to remove a current self-hosted runner:

				- For each runner there's a "..." menu, where you can just click and the

				  "Remove runner" option will show up

				## Known limitations

				As the GitHub actions are structured right now, we cannot:

				- Test the addition of a GitHub action that's not triggered by a `pull_request`

				  event as part of the PR.

				[gh-actions]: https://docs.github.com/en/actions

				[monitor-ex01]: https://github.com/kata-containers/kata-containers/commit/a3fb067f1bccde0cbd3fd4d5de12dfb3d8c28b60

				[monitor-ex02]: https://github.com/kata-containers/kata-containers/commit/489caf1ad0fae27cfd00ba3c9ed40e3d512fa492

									
										182

ci/gh-util.sh
									
										Executable file
									
				@@ -0,0 +1,182 @@

				#!/bin/bash

				# Copyright (c) 2020 Intel Corporation

				# Copyright (c) 2024 IBM Corporation

				#

				# SPDX-License-Identifier: Apache-2.0

				set -o errexit

				set -o errtrace

				set -o nounset

				set -o pipefail

				[ -n "${DEBUG:-}" ] && set -o xtrace

				script_name=${0##*/}

				#---------------------------------------------------------------------

				die()

				{

				    echo >&2 "$*"

				    exit 1

				}

				usage()

				{

				    cat <<EOF

				Usage: $script_name [OPTIONS] [command] [arguments]

				Description: Utility to expand the abilities of the GitHub CLI tool, gh.

				Command descriptions:

				  list-issues-for-pr     List issues linked to a PR.

				  list-labels-for-issue  List labels, in json format for an issue

				Commands and arguments:

				  list-issues-for-pr <pr>

				  list-labels-for-issue <issue>

				Options:

				 -h                 Show this help statement.

				 -r <owner/repo>    Optional <org/repo> specification. Default: 'kata-containers/kata-containers'

				Examples:

				- List issues for a Pull Request 123 in kata-containers/kata-containers repo

				  $ $script_name list-issues-for-pr 123

				EOF

				}

				list_issues_for_pr()

				{

				    local pr="${1:-}"

				    local repo="${2:-kata-containers/kata-containers}"

				    [ -z "$pr" ] && die "need PR"

				    local commits=$(gh pr view ${pr} --repo ${repo} --json commits --jq .commits[].messageBody)

				    [ -z "$commits" ] && die "cannot determine commits for PR $pr"

				    # Extract the issue number(s) from the commits.

				    #

				    # This needs to be careful to take account of lines like this:

				    #

				    # fixes 99

				    # fixes: 77

				    # fixes #123.

				    # Fixes: #1, #234, #5678.

				    #

				    # Note the exclusion of lines starting with whitespace which is

				    # specifically to ignore vendored git log comments, which are whitespace

				    # indented and in the format:

				    #

				    #     "<git-commit> <git-commit-msg>"

				    #

				    local issues=$(echo "$commits" |\

				        grep -Ev "^( |	)" |\

				        grep -Ei "fixes:* *(#*[0-9][0-9]*)" |\

				        tr ' ' '\n' |\

				        grep "[0-9][0-9]*" |\

				        sed 's/[.,\#]//g' |\

				        sort -nu || true)

				    [ -z "$issues" ] && die "cannot determine issues for PR $pr"

				    echo "# Issues linked to PR"

				    echo "#"

				    echo "# Fields: issue_number"

				    local issue

				    echo "$issues"|while read issue

				    do

				        printf "%s\n" "$issue"

				    done

				}

				list_labels_for_issue()

				{

				    local issue="${1:-}"

				    local repo="${2:-kata-containers/kata-containers}"

				    [ -z "$issue" ] && die "need issue number"

				    local labels=$(gh issue view ${issue} --repo ${repo} --json labels)

				    [ -z "$labels" ] && die "cannot determine labels for issue $issue"

				    printf '%s\n' "$labels"

				}

				setup()

				{

				    for cmd in gh jq

				    do

				        command -v "$cmd" &>/dev/null || die "need command: $cmd"

				    done

				}

				handle_args()

				{

				    setup

				    local show_all="false"

				    local opt

				    while getopts "ahr:" opt "$@"

				    do

				        case "$opt" in

				            a) show_all="true" ;;

				            h) usage && exit 0 ;;

				            r) repo="${OPTARG}" ;;

				        esac

				    done

				    shift $(($OPTIND - 1))

				    local repo="${repo:-kata-containers/kata-containers}"

				    local cmd="${1:-}"

				    case "$cmd" in

				        list-issues-for-pr) ;;

				        list-labels-for-issue) ;;

				        "") usage && exit 0 ;;

				        *) die "invalid command: '$cmd'" ;;

				    esac

				    # Consume the command name

				    shift

				    local issue=""

				    local pr=""

				    case "$cmd" in

				        list-issues-for-pr)

				            pr="${1:-}"

				            list_issues_for_pr "$pr" "${repo}"

				            ;;

				        list-labels-for-issue)

				            issue="${1:-}"

				            list_labels_for_issue "$issue" "${repo}"

				            ;;

				        *) die "impossible situation: cmd: '$cmd'" ;;

				    esac

				    exit 0

				}

				main()

				{

				    handle_args "$@"

				}

				main "$@"
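
To see what the issue-extraction pipeline in `list_issues_for_pr()` does without calling `gh`, it can be run against a literal commit body. The sketch below is a standalone restatement, not the script itself: the function name `extract_issues` and the sample text are made up for illustration, and `grep -E` is used in place of `egrep`.

```bash
#!/usr/bin/env bash
# Hypothetical standalone version of the pipeline in list_issues_for_pr().
extract_issues() {
    local commits="$1"

    printf '%s\n' "$commits" |
        grep -Ev $'^( |\t)' |                          # drop indented (vendored log) lines
        grep -Ei "fixes:* *(#*[0-9][0-9]*)" |          # keep only "fixes ..." lines
        tr ' ' '\n' |                                  # one token per line
        grep "[0-9][0-9]*" |                           # keep tokens containing digits
        sed 's/[.,#]//g' |                             # strip punctuation around numbers
        sort -nu || true                               # unique, numeric order
}

# Made-up commit bodies; the indented line imitates a vendored git log entry.
sample=$'osbuilder: do a thing\n\nFixes: #1, #234, #5678.\n    abc1234 fixes #42 (vendored log line)\nfixes 99'

extract_issues "$sample"
```

With this sample the indented `#42` line is skipped, and the remaining numbers are printed one per line in numeric order.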

									
										19

ci/install_libseccomp.sh
									
				@@ -7,12 +7,10 @@

				set -o errexit

				cidir=$(dirname "$0")

				source "${cidir}/lib.sh"

				script_dir="$(cd "$(dirname "${BASH_SOURCE[0]}")" && pwd)"

				script_name="$(basename "${BASH_SOURCE[0]}")"

				clone_tests_repo

				source "${tests_repo_dir}/.ci/lib.sh"

				source "${script_dir}/../tests/common.bash"

				# The following variables if set on the environment will change the behavior

				# of gperf and libseccomp configure scripts, that may lead this script to

				@@ -25,11 +23,11 @@ workdir="$(mktemp -d --tmpdir build-libseccomp.XXXXX)"

				# Variables for libseccomp

				libseccomp_version="${LIBSECCOMP_VERSION:-""}"

				if [ -z "${libseccomp_version}" ]; then

				    libseccomp_version=$(get_version "externals.libseccomp.version")

				    libseccomp_version=$(get_from_kata_deps "externals.libseccomp.version")

				fi

				libseccomp_url="${LIBSECCOMP_URL:-""}"

				if [ -z "${libseccomp_url}" ]; then

				    libseccomp_url=$(get_version "externals.libseccomp.url")

				    libseccomp_url=$(get_from_kata_deps "externals.libseccomp.url")

				fi

				libseccomp_tarball="libseccomp-${libseccomp_version}.tar.gz"

				libseccomp_tarball_url="${libseccomp_url}/releases/download/v${libseccomp_version}/${libseccomp_tarball}"

				@@ -38,11 +36,11 @@ cflags="-O2"

				# Variables for gperf

				gperf_version="${GPERF_VERSION:-""}"

				if [ -z "${gperf_version}" ]; then

				    gperf_version=$(get_version "externals.gperf.version")

				    gperf_version=$(get_from_kata_deps "externals.gperf.version")

				fi

				gperf_url="${GPERF_URL:-""}"

				if [ -z "${gperf_url}" ]; then

				    gperf_url=$(get_version "externals.gperf.url")

				    gperf_url=$(get_from_kata_deps "externals.gperf.url")

				fi

				gperf_tarball="gperf-${gperf_version}.tar.gz"

				gperf_tarball_url="${gperf_url}/${gperf_tarball}"

				@@ -87,7 +85,8 @@ build_and_install_libseccomp() {

				    curl -sLO "${libseccomp_tarball_url}"

				    tar -xf "${libseccomp_tarball}"

				    pushd "libseccomp-${libseccomp_version}"

				    ./configure --prefix="${libseccomp_install_dir}" CFLAGS="${cflags}" --enable-static --host="${arch}"

				    [ "${arch}" == $(uname -m) ] && cc_name="" || cc_name="${arch}-linux-gnu-gcc"

				    CC=${cc_name} ./configure --prefix="${libseccomp_install_dir}" CFLAGS="${cflags}" --enable-static --host="${arch}"

				    make

				    make install

				    popd

									
										14

ci/install_yq.sh
									
				@@ -17,6 +17,7 @@ die() {

				function install_yq() {

					local yq_pkg="github.com/mikefarah/yq"

					local yq_version=3.4.1

					local precmd=""

					INSTALL_IN_GOPATH=${INSTALL_IN_GOPATH:-true}

					if [ "${INSTALL_IN_GOPATH}"  == "true" ];then

				@@ -25,6 +26,15 @@ function install_yq() {

						local yq_path="${GOPATH}/bin/yq"

					else

						yq_path="/usr/local/bin/yq"

						# Check if we need sudo to install yq

						if [ ! -w "/usr/local/bin" ]; then

							# Check if we have sudo privileges

							if ! sudo -n true 2>/dev/null; then

								die "Please provide sudo privileges to install yq"

							else

								precmd="sudo"

							fi

						fi

					fi

					[ -x  "${yq_path}" ] && [ "`${yq_path} --version`"X == "yq version ${yq_version}"X ] && return

				@@ -75,9 +85,9 @@ function install_yq() {

					## NOTE: ${var,,} => gives lowercase value of var

					local yq_url="https://${yq_pkg}/releases/download/${yq_version}/yq_${goos}_${goarch}"

					curl -o "${yq_path}" -LSsf "${yq_url}"

					${precmd} curl -o "${yq_path}" -LSsf "${yq_url}"

					[ $? -ne 0 ] && die "Download ${yq_url} failed"

					chmod +x "${yq_path}"

					${precmd} chmod +x "${yq_path}"

					if ! command -v "${yq_path}" >/dev/null; then

						die "Cannot get ${yq_path} executable"

									
										33

ci/lib.sh
									
				@@ -5,6 +5,9 @@

				set -o nounset

				GOPATH=${GOPATH:-${HOME}/go}

				export kata_repo="github.com/kata-containers/kata-containers"

				export kata_repo_dir="$GOPATH/src/$kata_repo"

				export tests_repo="${tests_repo:-github.com/kata-containers/tests}"

				export tests_repo_dir="$GOPATH/src/$tests_repo"

				export branch="${target_branch:-main}"

				@@ -39,28 +42,46 @@ clone_tests_repo()

				run_static_checks()

				{

					clone_tests_repo

					# Make sure we have the targeting branch

					git remote set-branches --add origin "${branch}"

					git fetch -a

					bash "$tests_repo_dir/.ci/static-checks.sh" "$@"

					bash "$kata_repo_dir/tests/static-checks.sh" "$@"

				}

				run_docs_url_alive_check()

				{

					clone_tests_repo

					# Make sure we have the targeting branch

					git remote set-branches --add origin "${branch}"

					git fetch -a

					bash "$tests_repo_dir/.ci/static-checks.sh" --docs --all "github.com/kata-containers/kata-containers"

					bash "$kata_repo_dir/tests/static-checks.sh" --docs --all "$kata_repo"

				}

				run_get_pr_changed_file_details()

				{

					clone_tests_repo

					# Make sure we have the targeting branch

					git remote set-branches --add origin "${branch}"

					git fetch -a

					source "$tests_repo_dir/.ci/lib.sh"

					source "$kata_repo_dir/tests/common.bash"

					get_pr_changed_file_details

				}

				# Check if the 1st argument version is greater than or equal to the 2nd one

				# Version format: [0-9]+ separated by period (e.g. 2.4.6, 1.11.3, etc.)

				#

				# Parameters:

				#	$1	- a version to be tested

				#	$2	- a target version

				#

				# Return:

				# 	0 if $1 is greater than or equal to $2

				#	1 otherwise

				version_greater_than_equal() {

					local current_version=$1

					local target_version=$2

					smaller_version=$(echo -e "$current_version\n$target_version" | sort -V | head -1)

					if [ "${smaller_version}" = "${target_version}" ]; then

						return 0

					else

						return 1

					fi

				}
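
As a quick illustration of how `version_greater_than_equal` behaves, the sketch below restates the function so the block is self-contained and calls it with sample versions chosen only for the example. It relies on `sort -V` (version sort), which is a GNU coreutils extension and may not exist on every platform.

```bash
#!/usr/bin/env bash
# Restated from the hunk above so this example runs on its own.
version_greater_than_equal() {
    local current_version=$1
    local target_version=$2
    local smaller_version

    # sort -V orders version strings numerically per component,
    # so 1.9.0 sorts before 1.11.3.
    smaller_version=$(printf '%s\n%s\n' "$current_version" "$target_version" | sort -V | head -1)
    [ "${smaller_version}" = "${target_version}" ]
}

# Sample versions, made up for illustration.
if version_greater_than_equal "1.11.3" "1.9.0"; then
    echo "1.11.3 is at least 1.9.0"
fi
if ! version_greater_than_equal "2.4.0" "2.4.6"; then
    echo "2.4.0 is older than 2.4.6"
fi
```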

									
										55

ci/openshift-ci/cleanup.sh
									
										Executable file
									
				@@ -0,0 +1,55 @@

				#!/bin/bash

				#

				# Copyright (c) 2024 Red Hat, Inc.

				#

				# SPDX-License-Identifier: Apache-2.0

				#

				# This script tries to remove most of the resources added by the `test.sh` script

				# from the cluster.

				scripts_dir=$(dirname $0)

				deployments_dir=${scripts_dir}/cluster/deployments

				configs_dir=${scripts_dir}/configs

				source ${scripts_dir}/lib.sh

				# Set to 'yes' if you want to configure SELinux to permissive on the cluster

				# workers.

				#

				SELINUX_PERMISSIVE=${SELINUX_PERMISSIVE:-no}

				# Enable workaround for OCP 4.13 https://github.com/kata-containers/kata-containers/pull/9206

				#

				WORKAROUND_9206_CRIO=${WORKAROUND_9206_CRIO:-no}

				# Ignore errors as we want best-effort-approach here

				trap - ERR

				# Delete potential smoke-test resources

				oc delete -f "${scripts_dir}/smoke/service.yaml"

				oc delete -f "${scripts_dir}/smoke/service_kubernetes.yaml"

				oc delete -f "${scripts_dir}/smoke/http-server.yaml"

				# Delete test.sh resources

				oc delete -f "${deployments_dir}/relabel_selinux.yaml"

				if [[ "$WORKAROUND_9206_CRIO" == "yes" ]]; then

					oc delete -f "${deployments_dir}/workaround-9206-crio-ds.yaml"

					oc delete -f "${deployments_dir}/workaround-9206-crio.yaml"

				fi

				[ ${SELINUX_PERMISSIVE} == "yes" ] && oc delete -f "${deployments_dir}/machineconfig_selinux.yaml.in"

				# Delete kata-containers

				pushd "$katacontainers_repo_dir/tools/packaging/kata-deploy"

				oc delete -f kata-deploy/base/kata-deploy.yaml

				oc -n kube-system wait --timeout=10m --for=delete -l name=kata-deploy pod

				oc apply -f kata-cleanup/base/kata-cleanup.yaml

				echo "Wait for all related pods to be gone"

				( repeats=1; for i in $(seq 1 600); do

				  oc get pods -l name="kubelet-kata-cleanup" --no-headers=true -n kube-system 2>&1 | grep "No resources found" -q && ((repeats++)) || repeats=1

				  [ "$repeats" -gt 5 ] && echo kata-cleanup finished && break

				  sleep 1

				done) || { echo "There are still some kata-cleanup related pods after 600 iterations"; oc get all -n kube-system; exit -1; }

				oc delete -f kata-cleanup/base/kata-cleanup.yaml

				oc delete -f kata-rbac/base/kata-rbac.yaml

				oc delete -f runtimeclasses/kata-runtimeClasses.yaml
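
The wait loop in this script only declares the cleanup finished after several consecutive "No resources found" results. A generic form of that pattern, with an explicit non-zero return when the attempts run out, could look like the sketch below. The function name `wait_consecutive` and its parameters are hypothetical and do not exist in the repository.

```bash
#!/usr/bin/env bash
# Hypothetical helper: succeed once a command has passed <needed> times in a
# row, giving up after <max_tries> attempts.
# Usage: wait_consecutive <needed> <max_tries> <interval_seconds> <command...>
wait_consecutive() {
    local needed="$1" max_tries="$2" interval="$3"
    shift 3
    local streak=0 i

    for ((i = 0; i < max_tries; i++)); do
        if "$@" &>/dev/null; then
            streak=$((streak + 1))
            [ "$streak" -ge "$needed" ] && return 0
        else
            streak=0    # any failure resets the run of successes
        fi
        sleep "$interval"
    done
    return 1
}

# Example call (assumption: a check command that exits 0 when no pods remain):
#   wait_consecutive 5 600 1 my_check_no_cleanup_pods
```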

6

ci/openshift-ci/cluster/configs/selinux.conf Normal file

@@ -0,0 +1,6 @@
 # Copyright (c) 2020 Red Hat, Inc.
 #
 # SPDX-License-Identifier: Apache-2.0
 #
 SELINUX=permissive
 SELINUXTYPE=targeted

									
										35

ci/openshift-ci/cluster/deploy_webhook.sh
									
										Executable file
									
				@@ -0,0 +1,35 @@

				#!/bin/bash

				#

				# Copyright (c) 2021 Red Hat, Inc.

				#

				# SPDX-License-Identifier: Apache-2.0

				#

				# This script builds the kata-webhook and deploys it in the test cluster.

				#

				# You should export the KATA_RUNTIME variable with the runtimeclass name

				# configured in your cluster in case it is not the default "kata-ci".

				#

				set -e

				set -o nounset

				set -o pipefail

				script_dir="$(dirname $0)"

				webhook_dir="${script_dir}/../../../tools/testing/kata-webhook"

				source "${script_dir}/../lib.sh"

				KATA_RUNTIME=${KATA_RUNTIME:-kata-ci}

				info "Creates the kata-webhook ConfigMap"

				RUNTIME_CLASS="${KATA_RUNTIME}" \

					envsubst < "${script_dir}/deployments/configmap_kata-webhook.yaml.in" \

					| oc apply -f -

				pushd "${webhook_dir}" >/dev/null

				# Build and deploy the webhook

				#

				info "Builds the kata-webhook"

				./create-certs.sh

				info "Deploys the kata-webhook"

				oc apply -f deploy/

				# Check the webhook was deployed and is working.

				RUNTIME_CLASS="${KATA_RUNTIME}" ./webhook-check.sh

				popd >/dev/null

									
										13

ci/openshift-ci/cluster/deployments/configmap_installer_kernel.yaml
									
										Normal file
									
				@@ -0,0 +1,13 @@

				# Copyright (c) 2021 Red Hat, Inc.

				#

				# SPDX-License-Identifier: Apache-2.0

				#

				# Instruct the daemonset installer to configure Kata Containers to use the

				# host kernel.

				#

				apiVersion: v1

				kind: ConfigMap

				metadata:

				  name: ci.kata.installer.kernel

				data:

				  host_kernel: "yes"

									
										14

ci/openshift-ci/cluster/deployments/configmap_installer_qemu.yaml
									
										Normal file
									
				@@ -0,0 +1,14 @@

				# Copyright (c) 2021 Red Hat, Inc.

				#

				# SPDX-License-Identifier: Apache-2.0

				#

				# Instruct the daemonset installer to configure Kata Containers to use the

				# system QEMU.

				#

				apiVersion: v1

				kind: ConfigMap

				metadata:

				  name: ci.kata.installer.qemu

				data:

				  qemu_path: /usr/libexec/qemu-kvm

				  host_kernel: "yes"

									
										12

ci/openshift-ci/cluster/deployments/configmap_kata-webhook.yaml.in
									
										Normal file
									
				@@ -0,0 +1,12 @@

				# Copyright (c) 2021 Red Hat, Inc.

				#

				# SPDX-License-Identifier: Apache-2.0

				#

				# Apply customizations to the kata-webhook.

				#

				apiVersion: v1

				kind: ConfigMap

				metadata:

				  name: kata-webhook

				data:

				  runtime_class: ${RUNTIME_CLASS}

									
										9

ci/openshift-ci/cluster/deployments/machineconfig_sandboxedcontainers_extension.yaml
									
										Normal file
									
				@@ -0,0 +1,9 @@

				apiVersion: machineconfiguration.openshift.io/v1

				kind: MachineConfig

				metadata:

				  labels:

				    machineconfiguration.openshift.io/role: worker

				  name: 50-enable-sandboxed-containers-extension

				spec:

				  extensions:

				  - sandboxed-containers

									
										23

ci/openshift-ci/cluster/deployments/machineconfig_selinux.yaml.in
									
										Normal file
									
				@@ -0,0 +1,23 @@

				# Copyright (c) 2020 Red Hat, Inc.

				#

				# SPDX-License-Identifier: Apache-2.0

				#

				# Configure SELinux on worker nodes.

				---

				apiVersion: machineconfiguration.openshift.io/v1

				kind: MachineConfig

				metadata:

				  labels:

				    machineconfiguration.openshift.io/role: worker

				  name: 51-kata-selinux

				spec:

				  config:

				    ignition:

				      version: 2.2.0

				    storage:

				      files:

				      - contents:

				              source: data:text/plain;charset=utf-8;base64,${SELINUX_CONF_BASE64}

				        filesystem: root

				        mode: 0644

				        path: /etc/selinux/config
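
The `${SELINUX_CONF_BASE64}` placeholder in this template is expected to hold the base64 form of `selinux.conf` on a single line. The sketch below shows one way to produce and check such a value. It inlines the two settings from `selinux.conf` (without its comment header) and uses a hypothetical helper name, `encode_one_line`, so it illustrates the encoding step only, not the installer's actual code path.

```bash
#!/usr/bin/env bash
# The two settings from ci/openshift-ci/cluster/configs/selinux.conf
# (comment header omitted for brevity).
selinux_conf=$'SELINUX=permissive\nSELINUXTYPE=targeted\n'

# Hypothetical helper: base64-encode stdin and strip any line wrapping so the
# value fits in a single-line data: URL.
encode_one_line() {
    base64 | tr -d '[:space:]'
}

SELINUX_CONF_BASE64=$(printf '%s' "$selinux_conf" | encode_one_line)

# This is the line the rendered template would carry.
echo "source: data:text/plain;charset=utf-8;base64,${SELINUX_CONF_BASE64}"

# Round trip: decoding gives back the original settings.
printf '%s' "$SELINUX_CONF_BASE64" | base64 -d
```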

									
										40

ci/openshift-ci/cluster/deployments/relabel_selinux.yaml
									
										Normal file
									
				@@ -0,0 +1,40 @@

				apiVersion: apps/v1

				kind: DaemonSet

				metadata:

				  name: relabel-selinux-daemonset

				  namespace: kube-system

				spec:

				  selector:

				    matchLabels:

				      app: restorecon

				  template:

				    metadata:

				      labels:

				        app: restorecon

				    spec:

				      serviceAccountName: kata-deploy-sa

				      hostPID: true

				      containers:

				        - name: relabel-selinux-container

				          image: alpine

				          securityContext:

				            privileged: true

				          command: ["/bin/sh", "-c", "

				            set -e;

				            echo Starting the relabel;

				            nsenter --target 1 --mount bash -xc '

				                command -v semanage &>/dev/null || { echo Does not look like a SELINUX cluster, skipping; exit 0; };

				                for ENTRY in \

				                    \"/(.*/)?opt/kata/bin(/.*)?\" \

				                    \"/(.*/)?opt/kata/runtime-rs/bin(/.*)?\" \

				                    \"/(.*/)?opt/kata/share/kata-.*(/.*)?(/.*)?\" \

				                    \"/(.*/)?opt/kata/share/ovmf(/.*)?\" \

				                    \"/(.*/)?opt/kata/share/tdvf(/.*)?\" \

				                    \"/(.*/)?opt/kata/libexec(/.*)?\";

				                do

				                    semanage fcontext -a -t qemu_exec_t \"$ENTRY\" || semanage fcontext -m -t qemu_exec_t \"$ENTRY\" || { echo \"Error in semanage command\"; exit 1; }

				                done;

				                restorecon -v -R /opt/kata || { echo \"Error in restorecon command\"; exit 1; }

				            ';

				            echo NSENTER_FINISHED_WITH: $?;

				            sleep infinity"]

									
										28

ci/openshift-ci/cluster/deployments/workaround-9206-crio-ds.yaml
									
										Normal file
									
				@@ -0,0 +1,28 @@

				---

				apiVersion: apps/v1

				kind: DaemonSet

				metadata:

				  name: workaround-9206-crio-ds

				spec:

				  selector:

				    matchLabels:

				      app: workaround-9206-crio-ds

				  template:

				    metadata:

				      labels:

				        app: workaround-9206-crio-ds

				    spec:

				      containers:

				      - name: workaround-9206-crio-ds

				        image: alpine

				        volumeMounts:

				        - name: host-dir

				          mountPath: /tmp/config

				        securityContext:

				          runAsUser: 0

				          privileged: true

				        command: ["/bin/sh", "-c", "while [ ! -f '/tmp/config/10-workaround-9206-crio' ]; do sleep 1; done; echo 'Config file present'; sleep infinity"]

				      volumes:

				      - name: host-dir

				        hostPath:

				          path: /etc/crio/crio.conf.d/

									
										18

ci/openshift-ci/cluster/deployments/workaround-9206-crio.yaml
									
										Normal file
									
				@@ -0,0 +1,18 @@

				---

				apiVersion: machineconfiguration.openshift.io/v1

				kind: MachineConfig

				metadata:

				  labels:

				    machineconfiguration.openshift.io/role: worker

				  name: 10-workaround-9206-crio

				spec:

				  config:

				    ignition:

				      version: 2.2.0

				    storage:

				      files:

				      - contents:

				              source: data:text/plain;charset=utf-8;base64,W2NyaW9dCnN0b3JhZ2Vfb3B0aW9uID0gWwoJIm92ZXJsYXkuc2tpcF9tb3VudF9ob21lPXRydWUiLApdCg==

				        filesystem: root

				        mode: 0644

				        path: /etc/crio/crio.conf.d/10-workaround-9206-crio
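
The base64 payload in this MachineConfig is opaque at a glance. Decoding it locally shows the CRI-O drop-in that ends up on the workers: a `[crio]` section whose `storage_option` list contains `overlay.skip_mount_home=true`. The sketch below copies the payload from the file above; the function name `decode_payload` is made up for the example.

```bash
#!/usr/bin/env bash
# Payload copied verbatim from workaround-9206-crio.yaml.
payload='W2NyaW9dCnN0b3JhZ2Vfb3B0aW9uID0gWwoJIm92ZXJsYXkuc2tpcF9tb3VudF9ob21lPXRydWUiLApdCg=='

# Hypothetical helper: print the decoded drop-in configuration.
decode_payload() {
    printf '%s' "$payload" | base64 -d
}

decode_payload
```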

									
										245

ci/openshift-ci/cluster/install_kata.sh
									
										Executable file
									
				@@ -0,0 +1,245 @@

				#!/bin/bash

				#

				# Copyright (c) 2020 Red Hat, Inc.

				#

				# SPDX-License-Identifier: Apache-2.0

				#

				# This script installs the built kata-containers in the test cluster,

				# and configures a runtime.

				scripts_dir=$(dirname $0)

				deployments_dir=${scripts_dir}/deployments

				configs_dir=${scripts_dir}/configs

				source ${scripts_dir}/../lib.sh

				# Set to 'yes' if you want to configure SELinux to permissive on the cluster

				# workers.

				#

				SELINUX_PERMISSIVE=${SELINUX_PERMISSIVE:-no}

				# Set to 'yes' if you want to configure Kata Containers to use the system's

				# QEMU (from the RHCOS extension).

				#

				KATA_WITH_SYSTEM_QEMU=${KATA_WITH_SYSTEM_QEMU:-no}

				# Set to 'yes' if you want to configure Kata Containers to use the host kernel.

				#

				KATA_WITH_HOST_KERNEL=${KATA_WITH_HOST_KERNEL:-no}

				# kata-deploy image to be used to deploy the kata (by default use CI image

				# that is built for each pull request)

				#

				KATA_DEPLOY_IMAGE=${KATA_DEPLOY_IMAGE:-quay.io/kata-containers/kata-deploy-ci:kata-containers-latest}

				# Enable workaround for OCP 4.13 https://github.com/kata-containers/kata-containers/pull/9206

				#

				WORKAROUND_9206_CRIO=${WORKAROUND_9206_CRIO:-no}

				# Leverage kata-deploy to install Kata Containers in the cluster.

				#

				apply_kata_deploy() {

					local deploy_file="tools/packaging/kata-deploy/kata-deploy/base/kata-deploy.yaml"

					pushd "$katacontainers_repo_dir"

					sed -ri "s#(\s+image:) .*#\1 ${KATA_DEPLOY_IMAGE}#" "$deploy_file"

					info "Applying kata-deploy"

					oc apply -f tools/packaging/kata-deploy/kata-rbac/base/kata-rbac.yaml

					oc label --overwrite ns kube-system pod-security.kubernetes.io/enforce=privileged pod-security.kubernetes.io/warn=baseline pod-security.kubernetes.io/audit=baseline

					oc apply -f "$deploy_file"

					oc -n kube-system wait --timeout=10m --for=condition=Ready -l name=kata-deploy pod

					info "Adding the kata runtime classes"

					oc apply -f tools/packaging/kata-deploy/runtimeclasses/kata-runtimeClasses.yaml

					popd

				}

				# Wait for all worker nodes to reboot.

				#

				# Params:

				#   $1 - timeout in seconds (default to 900).

				#

				wait_for_reboot() {

					local delta="${1:-900}"

					local sleep_time=60

					declare -A BOOTIDS

					local workers=($(oc get nodes | \

						awk '{if ($3 == "worker") { print $1 } }'))

					# Get the boot ID so we can detect when it changes.

					for node in ${workers[@]}; do

						BOOTIDS[$node]=$(oc get -o jsonpath='{.status.nodeInfo.bootID}'\

							node/$node)

						echo "Wait $node reboot"

					done

					echo "Set timeout to $delta seconds"

					timer_start=$(date +%s)

					while [ ${#workers[@]} -gt 0 ]; do

						sleep $sleep_time

						now=$(date +%s)

						if [ $(($timer_start + $delta)) -lt $now ]; then

							echo "Timeout: not all workers rebooted"

							return 1

						fi

						echo "Checking after $(($now - $timer_start)) seconds"

						for i in ${!workers[@]}; do

							current_id=$(oc get \

								-o jsonpath='{.status.nodeInfo.bootID}' \

								node/${workers[i]})

							if [ "$current_id" != ${BOOTIDS[${workers[i]}]} ]; then

								echo "${workers[i]} rebooted"

								unset workers[i]

							fi

						done

					done

				}

				wait_mcp_update() {

					local delta="${1:-3600}"

					local sleep_time=30

					# The machineconfigpool is fine when all the workers updated and are ready,

					# and none are degraded.

					local ready_count=0

					local degraded_count=0

					local machine_count=$(oc get mcp worker -o jsonpath='{.status.machineCount}')

					if [[ -z "$machine_count" || "$machine_count" -lt 1 ]]; then

						warn "Unable to obtain the machine count"

						return 1

					fi

					echo "Set timeout to $delta seconds"

					local deadline=$(($(date +%s) + $delta))

					# The ready count might not have changed yet, so wait a little.

					while [[ "$ready_count" != "$machine_count" && \

						"$degraded_count" == 0 ]]; do

						# Let's check it hit the timeout (or not).

						local now=$(date +%s)

						if [ $deadline -lt $now ]; then

							echo "Timeout: not all workers updated" >&2

							return 1

						fi

						sleep $sleep_time

						ready_count=$(oc get mcp worker \

							-o jsonpath='{.status.readyMachineCount}')

						degraded_count=$(oc get mcp worker \

							-o jsonpath='{.status.degradedMachineCount}')

						echo "check machineconfigpool - ready_count: $ready_count degraded_count: $degraded_count"

					done

					[ $degraded_count -eq 0 ]

				}

				# Enable the RHCOS extension for the Sandboxed Containers.

				#

				enable_sandboxedcontainers_extension() {

					info "Enabling the RHCOS extension for Sandboxed Containers"

					local deployment_file="${deployments_dir}/machineconfig_sandboxedcontainers_extension.yaml"

					oc apply -f ${deployment_file}

					oc get -f ${deployment_file} || \

						die "Sandboxed Containers extension machineconfig not found"

					wait_mcp_update || die "Failed to update the machineconfigpool"

				}

				# Print useful information for debugging.

				#

				# Params:

				#   $1 - the pod name

				debug_pod() {

					local pod="$1"

					info "Debug pod: ${pod}"

					oc describe pods "$pod"

					oc logs "$pod"

				}

				# Wait for all pods of the app label to contain expected message

				#

				# Params:

				#   $1 - app label

				#   $2 - expected pods count (>=1)

				#   $3 - message to be present in the logs

				#   $4 - timeout (60)

				#   $5 - namespace (the current one)

				wait_for_app_pods_message() {

					local app="$1"

					local pod_count="$2"

					local message="$3"

					local timeout="$4"

					local namespace="$5"

					[ -z "$pod_count" ] && pod_count=1

					[ -z "$timeout" ] && timeout=60

					[ -n "$namespace" ] && namespace=" -n $namespace "

					local pod

					local pods

					local i

					SECONDS=0

					while :; do

						pods=($(oc get pods -l app="$app" --no-headers=true $namespace | awk '{print $1}'))

						[ "${#pods[@]}" -ge "$pod_count" ] && break

						if [ "$SECONDS" -gt "$timeout" ]; then

							echo "Unable to find ${pod_count} pods for '-l app=\"$app\"' in ${SECONDS}s (${pods[@]})"

							return -1

						fi

					done

					for pod in "${pods[@]}"; do

						while :; do

							local log=$(oc logs $namespace "$pod")

							echo "$log" | grep "$message" -q && echo "Found $(echo "$log" | grep "$message") in $pod's log ($SECONDS)" && break;

							if [ "$SECONDS" -gt "$timeout" ]; then

								echo -n "Message '$message' not present in '${pod}' pod of the '-l app=\"$app\"' "

								echo "pods after ${SECONDS}s (${pods[@]})"

								echo "Pod $pod's output so far:"

								echo "$log"

								return -1

							fi

							sleep 1;

						done

					done

				}

				oc config set-context --current --namespace=default

				worker_nodes=$(oc get nodes |  awk '{if ($3 == "worker") { print $1 } }')

				num_nodes=$(echo $worker_nodes | wc -w)

				[ $num_nodes -ne 0 ] || \

					die "No worker nodes detected. Something is wrong with the cluster"

				if [ "${KATA_WITH_SYSTEM_QEMU}" == "yes" ]; then

					# QEMU is deployed on the workers via RHCOS extension.

					enable_sandboxedcontainers_extension

					oc apply -f ${deployments_dir}/configmap_installer_qemu.yaml

				fi

				if [ "${KATA_WITH_HOST_KERNEL}" == "yes" ]; then

					oc apply -f ${deployments_dir}/configmap_installer_kernel.yaml

				fi

				apply_kata_deploy

				# Set SELinux to permissive mode

				if [ "${SELINUX_PERMISSIVE}" == "yes" ]; then

					info "Configuring SELinux"

					if [ -z "$SELINUX_CONF_BASE64" ]; then

						export SELINUX_CONF_BASE64=$(echo \

							$(cat $configs_dir/selinux.conf|base64) | \

							sed -e 's/\s//g')

					fi

					envsubst < ${deployments_dir}/machineconfig_selinux.yaml.in | \

						oc apply -f -

					oc get machineconfig/51-kata-selinux || \

						die "SELinux machineconfig not found"

					# The new SELinux configuration will trigger another reboot.

					wait_for_reboot

				fi

				if [[ "$WORKAROUND_9206_CRIO" == "yes" ]]; then

					info "Applying workaround to enable skip_mount_home in crio on OCP 4.13"

					oc apply -f "${deployments_dir}/workaround-9206-crio.yaml"

					oc apply -f "${deployments_dir}/workaround-9206-crio-ds.yaml"

					wait_for_app_pods_message workaround-9206-crio-ds "$num_nodes" "Config file present" 1200 || echo "Failed to apply the workaround, proceeding anyway..."

				fi

				# FIXME: Remove when https://github.com/kata-containers/kata-containers/pull/8417 is resolved

				# Selinux context is currently not handled by kata-deploy

				oc apply -f ${deployments_dir}/relabel_selinux.yaml

				wait_for_app_pods_message restorecon "$num_nodes" "NSENTER_FINISHED_WITH:" 120 "kube-system" || echo "Failed to treat selinux, proceeding anyway..."

									
										20

ci/openshift-ci/lib.sh
									
										Normal file
									
												
				@@ -0,0 +1,20 @@

				#!/usr/bin/env bash

				#

				# Copyright (c) 2023 Red Hat

				#

				# SPDX-License-Identifier: Apache-2.0

				#

				# Ensure GOPATH set

				if command -v go > /dev/null; then

				    export GOPATH=${GOPATH:-$(go env GOPATH)}

				else

				    # if go isn't installed, set default location for GOPATH

				    export GOPATH="${GOPATH:-$HOME/go}"

				fi

				lib_dir=$(dirname "${BASH_SOURCE[0]}")

				source "$lib_dir/../../tests/common.bash"

				export katacontainers_repo=${katacontainers_repo:="github.com/kata-containers/kata-containers"}

				export katacontainers_repo_dir="${GOPATH}/src/${katacontainers_repo}"

									
										92

ci/openshift-ci/run_smoke_test.sh
									
										Executable file
									
												
				@@ -0,0 +1,92 @@

				#!/bin/bash

				#

				# Copyright (c) 2020 Red Hat, Inc.

				#

				# SPDX-License-Identifier: Apache-2.0

				#

				# Run a smoke test.

				#

				script_dir=$(dirname $0)

				source ${script_dir}/lib.sh

				pod='http-server'

				# Create a pod.

				#

				info "Creating the ${pod} pod"

				oc apply -f ${script_dir}/smoke/${pod}.yaml || \

					die "failed to create ${pod} pod"

				# Check it eventually goes to 'running'

				#

				wait_time=600

				sleep_time=5

				cmd="oc get pod/${pod} -o jsonpath='{.status.containerStatuses[0].state}' | \

					grep running > /dev/null"

				info "Wait until the pod gets running"

				waitForProcess $wait_time $sleep_time "$cmd" || timed_out=$?

				if [ -n "$timed_out" ]; then

					oc describe pod/${pod}

					oc delete pod/${pod}

					die "${pod} not running"

				fi

				info "${pod} is running"

				# Add a file with the hello message

				#

				hello_file=/tmp/hello

				hello_msg='Hello World'

				oc exec ${pod} -- sh -c "echo $hello_msg > $hello_file"

				info "Creating the service and route"

				if oc apply -f ${script_dir}/smoke/service.yaml; then

				    # Likely on OCP, use service

				    is_ocp=1

				    host=$(oc get route/http-server-route -o jsonpath={.spec.host})

				    port=80

				else

				    # Likely on plain kubernetes, test using another container

				    is_ocp=0

				    info "Failed to create service, likely not on OCP, trying via NodePort"

				    oc apply -f "${script_dir}/smoke/service_kubernetes.yaml"

				    # For some reason kcli's cluster lists external IP as internal IP, try both

				    host=$(oc get nodes -o jsonpath='{.items[0].status.addresses[?(@.type=="ExternalIP")].address}')

				    [ -z "$host" ] && host=$(oc get nodes -o jsonpath='{.items[0].status.addresses[?(@.type=="InternalIP")].address}')

				    port=$(oc get service/http-server-service -o jsonpath='{.spec.ports[0].nodePort}')

				fi

				info "Wait for the HTTP server to respond"

				tempfile=$(mktemp)

				check_cmd="curl -vvv '${host}:${port}${hello_file}' 2>&1 | tee -a '$tempfile' | grep -q '$hello_msg'"

				if waitForProcess 60 1 "${check_cmd}"; then

				    test_status=0

				    info "HTTP server is working"

				else

				    test_status=1

				    echo "::error:: HTTP server not working"

				    echo "::group::Output of the \"curl -vvv '${host}:${port}${hello_file}'\""

				    cat "${tempfile}"

				    echo "::endgroup::"

				    echo "::group::Describe kube-system namespace"

				    oc describe -n kube-system all

				    echo "::endgroup::"

				    echo "::group::Describe current namespace"

				    oc describe all

				    echo "::endgroup::"

				    info "HTTP server is unreachable"

				fi

				rm -f "$tempfile"

				# Delete the resources.

				#

				info "Deleting the service/route"

				if [ "$is_ocp" -eq 0 ]; then

				    oc delete -f ${script_dir}/smoke/service_kubernetes.yaml

				else

				    oc delete -f ${script_dir}/smoke/service.yaml

				fi

				info "Deleting the ${pod} pod"

				oc delete pod/${pod} || test_status=$?

				exit $test_status

									
										30

ci/openshift-ci/smoke/http-server.yaml
									
										Normal file
									
												
				@@ -0,0 +1,30 @@

				# Copyright (c) 2020 Red Hat, Inc.

				#

				# SPDX-License-Identifier: Apache-2.0

				#

				# Define the pod for a http server app.

				---

				apiVersion: v1

				kind: Pod

				metadata:

				  name: http-server

				  labels:

				    app: http-server-app

				spec:

				  containers:

				    - name: http-server

				      image: registry.fedoraproject.org/fedora

				      ports:

				        - containerPort: 8080

				      command: ["python3"]

				      args: [ "-m", "http.server", "8080"]

				      securityContext:

				        allowPrivilegeEscalation: false

				        capabilities:

				          drop:

				            - ALL

				        runAsNonRoot: true

				        runAsUser: 1000

				        seccompProfile:

				          type: RuntimeDefault

				  runtimeClassName: kata-qemu

									
										28

ci/openshift-ci/smoke/service.yaml
									
										Normal file
									
												
				@@ -0,0 +1,28 @@

				# Copyright (c) 2020 Red Hat, Inc.

				#

				# SPDX-License-Identifier: Apache-2.0

				#

				# Create the service on port 80 for the http-server app.

				---

				apiVersion: v1

				kind: Service

				metadata:

				  name: http-server-service

				spec:

				  selector:

				    app: http-server-app

				  ports:

				    - protocol: TCP

				      port: 80

				      targetPort: 8080

				# Create the route to the app's service '/'.

				---

				apiVersion: route.openshift.io/v1

				kind: Route

				metadata:

				  name: http-server-route

				spec:

				  path: "/"

				  to:

				    kind: Service

				    name: http-server-service

									
										18

ci/openshift-ci/smoke/service_kubernetes.yaml
									
										Normal file
									
												
				@@ -0,0 +1,18 @@

				# Copyright (c) 2020 Red Hat, Inc.

				#

				# SPDX-License-Identifier: Apache-2.0

				#

				# Create the service on port 80 for the http-server app.

				---

				apiVersion: v1

				kind: Service

				metadata:

				  name: http-server-service

				spec:

				  selector:

				    app: http-server-app

				  ports:

				    - protocol: TCP

				      port: 80

				      targetPort: 8080

				  type: NodePort

									
										29

ci/openshift-ci/test.sh
									
										Executable file
									
												
				@@ -0,0 +1,29 @@

				#!/bin/bash

				#

				# Copyright (c) 2020 Red Hat, Inc.

				#

				# SPDX-License-Identifier: Apache-2.0

				#

				script_dir=$(dirname $0)

				source ${script_dir}/lib.sh

				suite=$1

				if [ -z "$1" ]; then

					suite='smoke'

				fi

				# Make oc and kubectl visible

				export PATH=/tmp/shared:$PATH

				oc version || die "Test cluster is unreachable"

				info "Install and configure kata into the test cluster"

				export SELINUX_PERMISSIVE="no"

				${script_dir}/cluster/install_kata.sh || die "Failed to install kata-containers"

				info "Run test suite: $suite"

				test_status='PASS'

				${script_dir}/run_${suite}_test.sh || test_status='FAIL'

				info "Test suite: $suite: $test_status"

				[ "$test_status" == "PASS" ]

									
										185

docs/Debug-shim-guide.md
									
										Normal file
									
												
				@@ -0,0 +1,185 @@

				# Using a debugger with the runtime

				Setting up a debugger for the runtime is pretty complex: the shim is a server

				process that is run by the runtime manager (containerd/CRI-O), and controlled by

				sending gRPC requests to it.

				Starting the shim with a debugger then just gives you a process that waits for

				commands on its socket, and if the runtime manager doesn't start it, it won't

				send requests to it.

				A first method is to attach a debugger to the process that was started by the

				runtime manager.

				If the issue you're trying to debug is not located at container creation, this

				is probably the easiest method.

				The other method involves a script that is placed in between the runtime manager

				and the actual shim binary. This makes it possible to start the shim with a debugger and

				wait for a client debugger connection before execution, so you can debug the

				kata runtime from the very beginning.

				## Prerequisite

				At the time of writing, a debugger was used only with the Go shim, but a similar

				process should be doable with runtime-rs. This documentation will be enhanced

				with Rust-specific instructions later on.

				In order to debug the Go runtime, you need to use the [Delve debugger](https://github.com/go-delve/delve).

				You will also need to build the shim binary with debug flags to make sure symbols

				are available to the debugger.

				Typically, the flags should be: `-gcflags=all=-N -l`
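
Because `-N -l` contains a space, the whole flag has to reach `go build` as a single argument. The following is a minimal sketch, assuming the shim is built by hand with `go build` rather than through the project's Makefile; the output name and package path are placeholders and are not taken from this guide.

```bash
#!/usr/bin/env bash
# Hypothetical output name and package path; adjust to your checkout.
out="containerd-shim-kata-v2.debug"
pkg="./cmd/containerd-shim-kata-v2"

# Keep "all=-N -l" inside one array element so the space does not split it.
# -N disables optimizations and -l disables inlining.
build_args=(build "-gcflags=all=-N -l" -o "${out}" "${pkg}")

# Print the arguments one per line to check the quoting.
printf '%s\n' "${build_args[@]}"

# To run the build for real, from the runtime source directory:
#   go "${build_args[@]}"
```

Storing the arguments in an array avoids the nested quoting that a plain command line would need.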

				## Attach to the running process

				To attach the debugger to the running process, all you need is to let the container

				start as usual, then use the following command with `dlv`:

				`$ dlv attach [pid of your kata shim]`

				If you need to use your debugger remotely, you can use the following on your target

				machine:

				`$ dlv attach [pid of your kata shim] --headless --listen=[IP:port]`

				then from your client computer:

				`$ dlv connect [IP:port]`

				## Make CRI-O/containerd start the shim with the debugger

				You can use [this script](../tools/containerd-shim-katadbg-v2) to run the

				shim binary through a debugger, and make the debugger wait for a client

				connection before running the shim.

				This allows starting your container, connecting your debugger, and controlling the

				shim execution from the beginning.

				### Adapt the script to your setup

				You need to edit the script itself to give it the actual binary

				to execute.

				Locate the following line in the script, and set the path accordingly.

				```bash

				SHIM_BINARY=

				```

				You may also need to edit the `PATH` variable set within the script,

				to make sure that the `dlv` binary is accessible.

				### Configure your runtime manager to use the script

				Using either containerd or CRI-O, you will need to have a runtime class that

				uses the script in place of the actual runtime binary.

				To do that, we will create a separate runtime class dedicated to debugging.

				- **For containerd**:

				Make sure that the `containerd-shim-katadbg-v2` script is available to containerd

				(putting it in the same folder as your regular kata shim typically).

				Then edit the containerd configuration, and add the following runtime configuration:

				```toml

				[plugins]

				  [plugins."io.containerd.grpc.v1.cri"]

				    [plugins."io.containerd.grpc.v1.cri".containerd]

				      [plugins."io.containerd.grpc.v1.cri".containerd.runtimes]

				        [plugins."io.containerd.grpc.v1.cri".containerd.runtimes.katadbg]

				          runtime_type = "io.containerd.katadbg.v2"

				```

				- **For CRI-O**:

				Copy your existing kata runtime configuration from `/etc/crio/crio.conf.d/`, and

				make a new one with the name `katadbg`, and the runtime_path set to the location

				of the script.

				E.g:

				```toml

				[crio.runtime.runtimes.katadbg]

				  runtime_path = "/usr/local/bin/containerd-shim-katadbg-v2"

				  runtime_root = "/run/vc"

				  runtime_type = "vm"

				  privileged_without_host_devices = true

				  runtime_config_path = "/usr/share/defaults/kata-containers/configuration.toml"

				 ```

				NOTE: for CRI-O, the name of the runtime class doesn't need to match the name of the

				script. But for consistency, we're using `katadbg` here too.
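
When the containers are started through Kubernetes rather than `crictl`, pods select the handler through a `RuntimeClass` object. The following is a minimal sketch, assuming the handler is named `katadbg` as configured above; the helper name `katadbg_runtimeclass` is hypothetical.

```bash
#!/usr/bin/env bash
# Print a RuntimeClass manifest whose handler matches the runtime name
# configured in containerd or CRI-O. The helper name is illustrative only.
katadbg_runtimeclass() {
	local handler="${1:-katadbg}"
	cat <<EOF
apiVersion: node.k8s.io/v1
kind: RuntimeClass
metadata:
  name: ${handler}
handler: ${handler}
EOF
}

katadbg_runtimeclass
```

The output can be piped to `kubectl apply -f -`; a pod then opts in with `runtimeClassName: katadbg`.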

				### Start your container and connect to the debugger

				Once the above configuration is in place, you can start your container, using

				your `katadbg` runtime class.

				E.g: `$ crictl runp --runtime=katadbg sandbox.json`

				The command will hang, and you can see that a `dlv` process is started:

				```

				$ ps aux | grep dlv

				root        9137  1.4  6.8 6231104 273980 pts/10 Sl   15:04   0:02 dlv exec /go/src/github.com/kata-containers/kata-containers/src/runtime/__debug_bin --headless --listen=:12345 --accept-multiclient -r stdout:/tmp/shim_output_oMC6Jo -r stderr:/tmp/shim_output_oMC6Jo -- -namespace default -address  -publish-binary /usr/local/bin/crio -id 0bc23d2208d4ff8c407a80cd5635610e772cae36c73d512824490ef671be9293 -debug start

				```
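
Rather than checking `ps` by hand, a script can wait until the debugger port accepts connections. The following is a minimal sketch, assuming the debugger listens on port 12345 as in the output above and that bash's `/dev/tcp` redirection is available; `wait_for_port` is a hypothetical helper.

```bash
#!/usr/bin/env bash
# Wait until host:port accepts TCP connections, or give up after a timeout.
# Usage: wait_for_port HOST PORT [TIMEOUT_SECONDS]
wait_for_port() {
	local host="$1" port="$2" timeout="${3:-30}"
	local deadline=$((SECONDS + timeout))
	while :; do
		# The subshell opens the connection and closes it on exit.
		if (exec 3<>"/dev/tcp/${host}/${port}") 2>/dev/null; then
			return 0
		fi
		[ "$SECONDS" -ge "$deadline" ] && return 1
		sleep 1
	done
}
```

A possible use is `wait_for_port localhost 12345 60 && dlv connect localhost:12345`.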

				Then you can use the `dlv` debugger to connect to it:

				```

				$ dlv connect localhost:12345

				Type 'help' for list of commands.

				(dlv)

				```

				Before doing anything else, you need to enable `follow-exec` mode in delve.

				This is because the first thing that the shim will do is to daemonize itself,

				i.e: start itself as a subprocess, and exit. So you really want the debugger

				to attach to the child process.

				```

				(dlv) target follow-exec -on .*/__debug_bin

				```

				Note that we are providing a regular expression to filter the name of the binary.

				This is to make sure that the debugger attaches to the runtime shim, and not

				to other subprocesses (hypervisor typically).
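
To see which processes the filter would select, the pattern can be tried against a few binary paths. This is only an illustration using bash's extended regular expressions, which should agree with Delve's for a pattern this simple; the paths are made-up examples.

```bash
#!/usr/bin/env bash
pattern='.*/__debug_bin'

# Return success when the given path matches the follow-exec pattern.
matches_filter() {
	[[ "$1" =~ $pattern ]]
}

for path in /go/src/runtime/__debug_bin /usr/bin/qemu-system-x86_64; do
	if matches_filter "$path"; then
		echo "follow:  $path"
	else
		echo "ignore:  $path"
	fi
done
```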

				To ease this process, we recommend the use of an init file containing the above

				command.

				```

				$ cat dlv.ini

				target follow-exec -on .*/__debug_bin

				$ dlv connect localhost:12345 --init=dlv.ini

				Type 'help' for list of commands.

				(dlv)

				```

				Once this is done, you can set breakpoints, and use the `continue` keyword to

				start the execution of the shim.

				You can also use a different client, like VSCode, to connect to it.

				A typical `launch.json` configuration for VSCode would look like:

				```json

				[...]

				{

				    "name": "Connect to the debugger",

				    "type": "go",

				    "request": "attach",

				    "mode": "remote",

				    "port": 12345,

				    "host": "127.0.0.1",

				}

				[...]

				```

				NOTE: VSCode's go extension doesn't seem to support the `follow-exec` mode from

				Delve. So if you want to use VSCode, you'll still need to use a command-line

				`dlv` client to set the `follow-exec` flag.

				## Caveats

				Debugging takes time, and there are a lot of timeouts going on in a Kubernetes

				environment. It is quite possible that while you're debugging, some processes

				will time out and cancel the container execution, possibly breaking your debugging

				session.

				You can mitigate that by increasing the timeouts in the different components

				involved in your environment.

									
										53

docs/Developer-Guide.md
									
												
				@@ -2,6 +2,8 @@

				This document is written **specifically for developers**: it is not intended for end users.

				If you want to contribute changes that you have made, please read the [community guidelines](https://github.com/kata-containers/community/blob/main/CONTRIBUTING.md) for information about our processes.

				# Assumptions

				- You are working on a non-critical test or development system.

				@@ -13,11 +15,22 @@ The recommended way to create a development environment is to first

				to create a working system.

				The installation guide instructions will install all required Kata Containers

				components, plus *Docker*, the hypervisor, and the Kata Containers image and

				guest kernel.

				components, plus a container manager, the hypervisor, and the Kata

				Containers image and guest kernel.

				Alternatively, you can perform a

				[manual installation](install/container-manager/containerd/containerd-install.md),

				or continue with [the instructions below](#requirements-to-build-individual-components)

				to build the Kata Containers components from source.

				# Requirements to build individual components

				> **Note:**

				>

				> If you decide to build from sources, you should be aware of the

				> implications of using an unpackaged system which will not be automatically

				> updated as new [releases](https://github.com/kata-containers/kata-containers/releases) are made available.

				You need to install the following to build Kata Containers components:

				- [golang](https://golang.org/dl)

				@@ -255,8 +268,10 @@ to install `libseccomp` for the agent.

				```bash

				$ mkdir -p ${seccomp_install_path} ${gperf_install_path}

				$ kata-containers/ci/install_libseccomp.sh ${seccomp_install_path} ${gperf_install_path}

				$ pushd kata-containers/ci

				$ script -fec 'sudo -E ./install_libseccomp.sh "${seccomp_install_path}" "${gperf_install_path}"'

				$ export LIBSECCOMP_LIB_PATH="${seccomp_install_path}/lib"

				$ popd

				```

				On `ppc64le` and `s390x`, `glibc` is used. You will need to install the `libseccomp` library

				@@ -435,7 +450,7 @@ You can build and install the guest kernel image as shown [here](../tools/packag

				# Install a hypervisor

				When setting up Kata using a [packaged installation method](install/README.md#installing-on-a-linux-system), the

				`QEMU` VMM is installed automatically. Cloud-Hypervisor and Firecracker VMMs are available from the [release tarballs](https://github.com/kata-containers/kata-containers/releases), as well as through [`kata-deploy`](../tools/packaging/kata-deploy/README.md).

				`QEMU` VMM is installed automatically. Cloud-Hypervisor, Firecracker and StratoVirt VMMs are available from the [release tarballs](https://github.com/kata-containers/kata-containers/releases), as well as through [`kata-deploy`](../tools/packaging/kata-deploy/README.md).

				You may choose to manually build your VMM/hypervisor.

				## Build a custom QEMU

				@@ -585,10 +600,15 @@ $ sudo kata-monitor

				#### Connect to debug console

				Command `kata-runtime exec` is used to connect to the debug console.

				You need to start a container for example:

				```bash

				$ sudo ctr run  --runtime io.containerd.kata.v2 -d docker.io/library/ubuntu:latest testdebug

				```

				Then, you can use the command `kata-runtime exec <sandbox id>` to connect to the debug console.

				```

				$ kata-runtime exec 1a9ab65be63b8b03dfd0c75036d27f0ed09eab38abb45337fea83acd3cd7bacd

				$ kata-runtime exec testdebug

				bash-4.2# id

				uid=0(root) gid=0(root) groups=0(root)

				bash-4.2# pwd

				@@ -654,7 +674,7 @@ section when using rootfs, or when using initrd, complete the steps in the [Buil

				Install the image:

				>**Note**: When using an initrd image, replace the below rootfs image name `kata-containers.img` 

				>**Note**: When using an initrd image, replace the below rootfs image name `kata-containers.img`

				>with the initrd image name `kata-containers-initrd.img`.

				```bash

				@@ -688,25 +708,25 @@ $ sudo crictl run -r kata container.yaml pod.yaml

				The steps required to enable the debug console for QEMU differ slightly from

				those for firecracker / cloud-hypervisor.

				##### Enabling debug console for QEMU

				Add `agent.debug_console` to the guest kernel command line to allow the agent process to start a debug console. 

				Add `agent.debug_console` to the guest kernel command line to allow the agent process to start a debug console.

				```bash

				$ sudo sed -i -e 's/^kernel_params = "\(.*\)"/kernel_params = "\1 agent.debug_console"/g' "${kata_configuration_file}"

				```

				Here `kata_configuration_file` could point to `/etc/kata-containers/configuration.toml` 

				Here `kata_configuration_file` could point to `/etc/kata-containers/configuration.toml`

				or `/usr/share/defaults/kata-containers/configuration.toml`

				or `/opt/kata/share/defaults/kata-containers/configuration-{hypervisor}.toml`, if

				you installed Kata Containers using `kata-deploy`.

				##### Enabling debug console for cloud-hypervisor / firecracker

				Slightly different configuration is required in case of firecracker and cloud hypervisor. 

				Firecracker and cloud-hypervisor don't have a UNIX socket connected to `/dev/console`. 

				Hence, the kernel command line option `agent.debug_console` will not work for them. 

				Slightly different configuration is required in case of firecracker and cloud hypervisor.

				Firecracker and cloud-hypervisor don't have a UNIX socket connected to `/dev/console`.

				Hence, the kernel command line option `agent.debug_console` will not work for them.

				These hypervisors support `hybrid vsocks`,  which can be used for communication

				between the host and the guest. The kernel command line option `agent.debug_console_vport`

				 was added to allow developers to specify on which `vsock` port the debugging console should be connected.

				@@ -719,7 +739,7 @@ sudo sed -i -e 's/^kernel_params = "\(.*\)"/kernel_params = "\1 agent.debug_cons

				```

				> **Note** Ports 1024 and 1025 are reserved for communication with the agent

				> and gathering of agent logs respectively. 

				> and gathering of agent logs respectively.

				##### Connecting to the debug console

				@@ -751,6 +771,11 @@ $ sudo su -c 'cd /var/run/vc/vm/${sandbox_id} && socat "stdin,raw,echo=0,escape=

				To disconnect from the virtual machine, type `CONTROL+q` (hold down the

				`CONTROL` key and press `q`).

				## Use a debugger with the runtime

				For developers interested in using a debugger with the runtime, please

				look at [this document](Debug-shim-guide.md).

				## Obtain details of the image

				If the image is created using

									
										4

docs/Documentation-Requirements.md
									
												
				@@ -105,7 +105,7 @@ This section lists requirements for displaying commands and command output.

				The requirements must be adhered to since documentation containing code blocks

				is validated by the CI system, which executes the command blocks with the help

				of the

				[doc-to-script](https://github.com/kata-containers/tests/tree/main/.ci/kata-doc-to-script.sh)

				[doc-to-script](https://github.com/kata-containers/kata-containers/blob/main/tests/kata-doc-to-script.sh)

				utility.

				- If a document includes commands the user should run, they **MUST** be shown

				@@ -189,7 +189,7 @@ and compare them with standard tools (e.g. `diff(1)`).

				Since this project uses a number of terms not found in conventional

				dictionaries, we have a

				[spell checking tool](https://github.com/kata-containers/tests/tree/main/cmd/check-spelling)

				[spell checking tool](https://github.com/kata-containers/kata-containers/tree/main/tests/cmd/check-spelling)

				that checks both dictionary words and the additional terms we use.

				Run the spell checking tool on your document before raising a PR to ensure it

									
										2

docs/Licensing-strategy.md
									
												
				@@ -18,4 +18,4 @@ licensing and allows automated tooling to check the license of individual

				files.

				This SPDX licence identifier requirement is enforced by the

				[CI (Continuous Integration) system](https://github.com/kata-containers/tests/blob/main/.ci/static-checks.sh).

				[CI (Continuous Integration) system](https://github.com/kata-containers/kata-containers/blob/main/tests/static-checks.sh).

									
										3

docs/Limitations.md
									
												
				@@ -147,7 +147,8 @@ these commands is potentially challenging.

				See issue https://github.com/clearcontainers/runtime/issues/341 and [the constraints challenge](#the-constraints-challenge) for more information.

				For CPUs resource management see

				[CPU constraints](design/vcpu-handling.md).

				[CPU constraints (in runtime-go)](design/vcpu-handling-runtime-go.md).

				[CPU constraints (in runtime-rs)](design/vcpu-handling-runtime-rs.md).

				# Architectural limitations

									
										2

docs/README.md
									
												
				@@ -40,6 +40,7 @@ Documents that help to understand and contribute to Kata Containers.

				### Design and Implementations

				* [Kata Containers Architecture](design/architecture): Architectural overview of Kata Containers

				* [Kata Containers CI](../ci/README.md): Kata Containers CI document

				* [Kata Containers E2E Flow](design/end-to-end-flow.md): The entire end-to-end flow of Kata Containers

				* [Kata Containers design](./design/README.md): More Kata Containers design documents

				* [Kata Containers threat model](./threat-model/threat-model.md): Kata Containers threat model

				@@ -69,7 +70,6 @@ Documents that help to understand and contribute to Kata Containers.

				### The Release Process

				* [Release strategy](Stable-Branch-Strategy.md)

				* [Release Process](Release-Process.md)

				## Presentations

									
										119

docs/Release-Process.md
									
												
				@@ -1,89 +1,76 @@

				# How to do a Kata Containers Release

				  This document lists the tasks required to create a Kata Release.

				This document lists the tasks required to create a Kata Release.

				## Requirements

				- [hub](https://github.com/github/hub)

				  * Using an [application token](https://github.com/settings/tokens) is required for hub (set to a GITHUB_TOKEN environment variable).

				- GitHub permissions to run workflows.

				- GitHub permissions to push tags and create releases in Kata repositories.

				## Versioning

				- GPG configured to sign git tags. https://docs.github.com/en/authentication/managing-commit-signature-verification/generating-a-new-gpg-key

				The Kata Containers project uses [semantic versioning](http://semver.org/) for all releases.

				Semantic versions are comprised of three fields in the form:

				- You should configure your GitHub to use your ssh keys (to push to branches). See https://help.github.com/articles/adding-a-new-ssh-key-to-your-github-account/.

				    * As an alternative, configure hub to push and fork with HTTPS, `git config --global hub.protocol https` (Not tested yet) *

				```

				MAJOR.MINOR.PATCH

				```

				When `MINOR` increases, the new release adds **new features** but *without changing the existing behavior*.

				When `MAJOR` increases, the new release adds **new features, bug fixes, or

				both** that **change the behavior from the previous release** (incompatible with previous releases).

				A major release will also likely require a change of the container manager version used,

				for example containerd or CRI-O. Please refer to the release notes for further details.

				**Important** : the Kata Containers project doesn't have stable branches (see

				[this issue](https://github.com/kata-containers/kata-containers/issues/9064) for details).

				Bug fixes are released as part of `MINOR` or `MAJOR` releases only. `PATCH` is always `0`.
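
Since `PATCH` stays at `0`, the next version follows mechanically from the current one. The following is a sketch of that arithmetic; `next_version` is a hypothetical helper and is not part of the release tooling.

```bash
#!/usr/bin/env bash
# Compute the next release version for a "minor" or "major" bump.
# Usage: next_version CURRENT_VERSION minor|major
next_version() {
	local current="$1" kind="$2"
	local major minor patch
	IFS=. read -r major minor patch <<< "$current"
	case "$kind" in
		minor) echo "${major}.$((minor + 1)).0" ;;
		major) echo "$((major + 1)).0.0" ;;
		*) echo "unknown bump kind: $kind" >&2; return 1 ;;
	esac
}

next_version 3.4.0 minor   # prints 3.5.0
next_version 3.4.0 major   # prints 4.0.0
```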

				## Release Process

				### Bump the `VERSION` file

				### Bump all Kata repositories

				When the `kata-containers/kata-containers` repository is ready for a new release,

				first create a PR to set the release in the `VERSION` file and have it merged.

				  Bump the repositories using a script in the Kata packaging repo, where:

				  - `BRANCH=<the-branch-you-want-to-bump>`

				  - `NEW_VERSION=<the-new-kata-version>`

				  ```

				  $ cd ${GOPATH}/src/github.com/kata-containers/kata-containers/tools/packaging/release

				  $ export NEW_VERSION=<the-new-kata-version>

				  $ export BRANCH=<the-branch-you-want-to-bump>

				  $ ./update-repository-version.sh -p "$NEW_VERSION" "$BRANCH"

				  ```

				### Check GitHub Actions

				### Point tests repository to stable branch

				We make use of [GitHub actions](https://github.com/features/actions) in the

				[release](https://github.com/kata-containers/kata-containers/actions/workflows/release.yaml)

				file from the `kata-containers/kata-containers` repository to build and upload

				release artifacts.

				  If you create a new stable branch, i.e. if your release changes a major or minor version number (not a patch release), then

				  you should modify the `tests` repository to point to that newly created stable branch and not the `main` branch.

				  The objective is that changes in the CI on the main branch will not impact the stable branch.

				The action is manually triggered and is responsible for generating a new

				release (including a new tag), pushing those to the

				`kata-containers/kata-containers` repository. The new release is initially

				created as a draft. It is promoted to an official release when the whole

				workflow has completed successfully.

				  In the test directory, change references the main branch in:

				  * `README.md`

				  * `versions.yaml`

				  * `cmd/github-labels/labels.yaml.in`

				  * `cmd/pmemctl/pmemctl.sh`

				  * `.ci/lib.sh`

				  * `.ci/static-checks.sh`

				Check the [actions status

				page](https://github.com/kata-containers/kata-containers/actions) to verify all

				steps in the actions workflow have completed successfully. On success, a static

				tarball containing Kata release artifacts will be uploaded to the [Release

				page](https://github.com/kata-containers/kata-containers/releases).

				  See the commits in [the corresponding PR for stable-2.1](https://github.com/kata-containers/tests/pull/3504) for an example of the changes.

				If the workflow fails because of some external environmental causes, e.g. network

				timeout, simply re-run the failed jobs until they eventually succeed.

				If for some reason you need to cancel the workflow or re-run it entirely, go first

				to the [Release page](https://github.com/kata-containers/kata-containers/releases) and

				delete the draft release from the previous run.

				### Merge all bump version Pull requests

				### Improve the release notes

				  - The above step will create a GitHub pull request in the Kata projects. Trigger the CI using `/test` command on each bump Pull request.

				  - Trigger the `test-kata-deploy` workflow which is under the `Actions` tab on the repository GitHub page (make sure to select the correct branch and validate it passes).

				  - Check any failures and fix if needed.

				  - Work with the Kata approvers to verify that the CI works and the pull requests are merged.

				Release notes are auto-generated by the GitHub CLI tool used as part of our

				release workflow.  However, some manual tweaking may still be necessary in

				order to highlight the most important features and bug fixes in a specific

				release.

				### Tag all Kata repositories

				  Once all the pull requests to bump versions in all Kata repositories are merged,

				  tag all the repositories as shown below.

				  ```

				  $ cd ${GOPATH}/src/github.com/kata-containers/kata-containers/tools/packaging/release

				  $ git checkout  <kata-branch-to-release>

				  $ git pull

				  $ ./tag_repos.sh -p -b "$BRANCH" tag

				  ```

				### Check GitHub Actions

				  We make use of [GitHub actions](https://github.com/features/actions) in this [file](../.github/workflows/release.yaml) in the `kata-containers/kata-containers` repository to build and upload release artifacts. This action is auto triggered with the above step when a new tag is pushed to the `kata-containers/kata-containers` repository.

				  Check the [actions status page](https://github.com/kata-containers/kata-containers/actions) to verify all steps in the actions workflow have completed successfully. On success, a static tarball containing Kata release artifacts will be uploaded to the [Release page](https://github.com/kata-containers/kata-containers/releases).

				### Create release notes

				  We have a script in place in the packaging repository to create release notes that include a short-log of the commits across Kata components.

				  Run the script as shown below:

				  ```

				  $ cd ${GOPATH}/src/github.com/kata-containers/kata-containers/tools/packaging/release

				  # Note: OLD_VERSION is where the script should start to get changes.

				  $ ./release-notes.sh ${OLD_VERSION} ${NEW_VERSION} > notes.md

				  # Edit the `notes.md` file to review and make any changes to the release notes.

				  # Add the release notes in the project's GitHub.

				  $ hub release edit -F notes.md "${NEW_VERSION}"

				  ```

				With this in mind, please, poke @channel on #kata-dev and people who worked on

				the release will be able to contribute to that.

				### Announce the release

				  Publish in [Slack and Kata mailing list](https://github.com/kata-containers/community#join-us) that the new release is ready.

				Publish in [Slack and Kata mailing

				list](https://github.com/kata-containers/community#join-us) that the new release is

				ready.

									
										151

docs/Stable-Branch-Strategy.md
									
											
				@@ -1,151 +0,0 @@

				Branch and release maintenance for the Kata Containers project.

				## Introduction 

				This document provides details about Kata Containers releases.

				## Versioning

				The Kata Containers project uses [semantic versioning](http://semver.org/) for all releases. 

				Semantic versions are comprised of three fields in the form:

				```

				MAJOR.MINOR.PATCH

				```

				For example: `1.0.0`, `1.0.0-rc.5`, and `99.123.77+foo.bar.baz.5`.

				Semantic versioning is used since the version number is able to convey clear 

				information about how a new version relates to the previous version. 

				For example, semantic versioning can also provide assurances to allow users to know 

				when they must upgrade compared with when they might want to upgrade:

				- When `PATCH` increases, the new release contains important **security fixes**

				  and an upgrade is recommended.

				  The patch field can contain extra details after the number. 

				Dashes denote pre-release versions. `1.0.0-rc.5` in the example denotes the fifth release

				 candidate for release `1.0.0`. Plus signs denote other details. In our example, `+foo.bar.baz.5` 

				provides additional information regarding release `99.123.77` in the previous example.

				- When `MINOR` increases, the new release adds **new features** but *without

				  changing the existing behavior*.

				- When `MAJOR` increases, the new release adds **new features, bug fixes, or

				  both** and **changes the behavior from the previous release** (incompatible with previous releases).

				  A major release will also likely require a change of the container manager version used, 

				for example Containerd or CRI-O. Please refer to the release notes for further details.
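
The version layout described above can be illustrated with a short Python sketch. The regular expression is a simplified reading of the `MAJOR.MINOR.PATCH[-prerelease][+build]` form and is not the full semver grammar; the helper names `parse_version` and `bump_kind` are hypothetical and do not exist in the Kata tooling.

```python
import re

# Simplified pattern (an assumption, not the complete semver grammar):
# MAJOR.MINOR.PATCH, optionally followed by "-prerelease" and "+build".
SEMVER_RE = re.compile(
    r"^(?P<major>\d+)\.(?P<minor>\d+)\.(?P<patch>\d+)"
    r"(?:-(?P<prerelease>[0-9A-Za-z.-]+))?"
    r"(?:\+(?P<build>[0-9A-Za-z.-]+))?$"
)


def parse_version(text):
    """Split a version string into its numeric fields and optional extras."""
    match = SEMVER_RE.match(text)
    if match is None:
        raise ValueError(f"not a semantic version: {text!r}")
    parts = match.groupdict()
    return {
        "major": int(parts["major"]),
        "minor": int(parts["minor"]),
        "patch": int(parts["patch"]),
        "prerelease": parts["prerelease"],  # e.g. "rc.5", or None
        "build": parts["build"],            # e.g. "foo.bar.baz.5", or None
    }


def bump_kind(old, new):
    """Return the most significant field that differs between two versions."""
    a, b = parse_version(old), parse_version(new)
    for field in ("major", "minor", "patch"):
        if a[field] != b[field]:
            return field
    return "none"


print(parse_version("1.0.0-rc.5"))
print(bump_kind("2.2.0", "2.2.1"))
```

Note that `bump_kind` only reports which field differs; it does not check that the new version is actually greater than the old one.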

				## Release Strategy

				Any new features added since the last release will be available in the next minor

				release. These will include bug fixes as well. To facilitate a stable user environment, 

				Kata provides stable branch-based releases and a main branch release.

				## Stable branch patch criteria

				No new features should be introduced to stable branches.  This is intended to limit risk to users,

				providing only bug and security fixes.

				## Branch Management

				Kata Containers will maintain **one** stable release branch, in addition to the main branch, for

				each active major release.

				Once a new MAJOR or MINOR release is created from main, a new stable branch is created for

				the prior MAJOR or MINOR release and the previous stable branch is no longer maintained. End of

				maintenance for a branch is announced on the Kata Containers mailing list.  Users can determine

				the version currently installed by running `kata-runtime kata-env`. It is recommended to use the

				latest stable branch available.

				A couple of examples follow to help clarify this process.

				### New bug fix introduced

				A bug fix is submitted against the runtime which does not introduce new inter-component dependencies.

				This fix is applied to both the main and stable branches, and there is no need to create a new

				stable branch.

				| Branch | Original version | New version |

				|--|--|--|

				| `main` | `2.3.0-rc0` | `2.3.0-rc1` |

				| `stable-2.2` | `2.2.0` | `2.2.1` |

				| `stable-2.1` | (unmaintained) | (unmaintained) |

				### New release made feature or change adding new inter-component dependency

				A new feature is introduced, which adds a new inter-component dependency. In this case a new stable

				branch is created (stable-2.3) starting from main and the previous stable branch (stable-2.2)

				is dropped from maintenance.

				| Branch | Original version | New version |

				|--|--|--|

				| `main` | `2.3.0-rc1` | `2.3.0` |

				| `stable-2.3` | N/A| `2.3.0` |

				| `stable-2.2` | `2.2.1` | (unmaintained) |

				| `stable-2.1` | (unmaintained) | (unmaintained) |

				Note, the stable-2.2 branch will still exist with tag 2.2.1, but under current plans it is

				not maintained further. The next tag applied to main will be 2.4.0-alpha0. We would then

				create a couple of alpha releases gathering features targeted for that particular release (in

				this case 2.4.0), followed by a release candidate. The release candidate marks a feature freeze.

				A new stable branch is created for the release candidate. Only bug fixes and any security issues

				are added to the branch going forward until release 2.4.0 is made.
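
As a rough illustration of the branch naming used in the tables above, the following Python sketch derives a `stable-MAJOR.MINOR` branch name from a version and reports whether a release changes the major or minor number. The naming rule is inferred from the examples (`stable-2.2`, `stable-2.3`), and the function names are hypothetical.

```python
def core_fields(version):
    """Return (major, minor, patch), ignoring pre-release and build suffixes."""
    core = version.split("+", 1)[0].split("-", 1)[0]
    major, minor, patch = (int(part) for part in core.split("."))
    return major, minor, patch


def stable_branch_for(version):
    """Name of the stable branch a version belongs to (inferred convention)."""
    major, minor, _ = core_fields(version)
    return f"stable-{major}.{minor}"


def needs_new_stable_branch(previous, new):
    """True when the major or minor field changes, i.e. not a patch release."""
    return core_fields(previous)[:2] != core_fields(new)[:2]


print(stable_branch_for("2.2.1"))
print(needs_new_stable_branch("2.2.1", "2.3.0"))
```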

				## Backporting Process 

				Development that occurs against the main branch and applicable code commits should also be submitted

				against the stable branches. Some guidelines for this process follow:

				  1. Only bug and security fixes which do not introduce inter-component dependencies are

				 candidates for stable branches. These PRs should be marked with "bug" in GitHub.

				  2. Once a PR is created against main which meets requirement of (1), a comparable one

				 should also be submitted against the stable branches. It is the responsibility of the submitter

				 to apply their pull request against stable, and it is the responsibility of the

				 reviewers to help identify stable-candidate pull requests.

				## Continuous Integration Testing

				The test repository is forked to create stable branches from main. Full CI

				runs on each stable and main PR using its respective tests repository branch.

				### An alternative method for CI testing:

				Ideally, the continuous integration infrastructure will run the same test suite on both main

				and the stable branches.  When tests are modified or new feature tests are introduced, explicit

				logic should exist within the testing CI to make sure only applicable tests are executed against

				stable and main. While this is not in place currently, it should be considered in the long term.

				## Release Management

				### Patch releases

				Releases are made every four weeks, which include a GitHub release as

				well as binary packages. These patch releases are made for both stable branches, and a "release candidate"

				for the next `MAJOR` or `MINOR` is created from main. If there are no changes across all the repositories, no

				release is created and an announcement is made on the developer mailing list to highlight this.

				If a release is being made, each repository is tagged for this release, regardless

				of whether changes are introduced. The release schedule can be seen on the

				[release rotation wiki page](https://github.com/kata-containers/community/wiki/Release-Team-Rota).

				If there is urgent need for a fix, a patch release will be made outside of the planned schedule.

				The process followed for making a release can be found at [Release Process](Release-Process.md).

				## Minor releases

				###  Frequency

				Minor releases are less frequent in order to provide a more stable baseline for users. They are currently

				running on a sixteen-week cadence. The release schedule can be seen on the

				[release rotation wiki page](https://github.com/kata-containers/community/wiki/Release-Team-Rota).

				### Compatibility

				Kata guarantees compatibility between components that are within one minor release of each other. 

				This is critical for dependencies which cross between host (shimv2 runtime) and

				the guest (hypervisor, rootfs and agent).  For example, consider a cluster with a long-running

				deployment, workload-never-dies, all on Kata version 2.1.3 components. If the operator updates

				the Kata components to the next new minor release (i.e. 2.2.0), we need to guarantee that the 2.2.0

				shimv2 runtime still communicates with 2.1.3 agent within workload-never-dies.
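
The "within one minor release" rule can be sketched as a small check. This is only an illustration: the function name is hypothetical, and treating versions with different major numbers as incompatible is an assumption that the text above does not state.

```python
def within_one_minor(runtime_version, agent_version):
    """Check whether two component versions are at most one minor release apart.

    Assumption: versions with different MAJOR numbers are never considered
    compatible; the document only discusses minor releases.
    """
    r_major, r_minor = (int(x) for x in runtime_version.split(".")[:2])
    a_major, a_minor = (int(x) for x in agent_version.split(".")[:2])
    return r_major == a_major and abs(r_minor - a_minor) <= 1


# The scenario from the text: a 2.2.0 shimv2 runtime talking to a 2.1.3 agent.
print(within_one_minor("2.2.0", "2.1.3"))
```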

				Handling live-update is out of the scope of this document. See this [`kata-runtime` issue](https://github.com/kata-containers/runtime/issues/492) for details.

									
										3

docs/Upgrading.md
									
												
				@@ -14,9 +14,6 @@ period of time, once a stable release for Kata Containers 2.x is published,

				Kata Containers 1.x stable users should consider switching to the Kata 2.x

				release.

				See the [stable branch strategy documentation](Stable-Branch-Strategy.md) for

				further details.

				# Determine current version

				To display the current Kata Containers version, run one of the following:

									
										9

docs/code-pr-advice.md
									
												
				@@ -171,10 +171,9 @@ allows you to think about what types of value to test.

				### Other categories of test

				Raise a GitHub issue in the

				[`tests`](https://github.com/kata-containers/tests) repository that

				Raise a GitHub issue in the Kata Containers repository that

				explains what sort of test is required along with as much detail as

				possible. Ensure the original issue is referenced on the `tests` issue.

				possible. Ensure the original issue is referenced in the issue.

				### Unsafe code

				@@ -229,13 +228,13 @@ maintenance issue.

				### Markdown syntax

				Run the

				[markdown checker](https://github.com/kata-containers/tests/tree/main/cmd/check-markdown)

				[markdown checker](https://github.com/kata-containers/kata-containers/tree/main/tests/cmd/check-markdown)

				on your documentation changes.

				### Spell check

				Run the

				[spell checker](https://github.com/kata-containers/tests/tree/main/cmd/check-spelling)

				[spell checker](https://github.com/kata-containers/kata-containers/tree/main/tests/cmd/check-spelling)

				on your documentation changes.

				## Finally

									
										6

docs/design/README.md
									
												
				@@ -6,15 +6,19 @@ Kata Containers design documents:

				- [API Design of Kata Containers](kata-api-design.md)

				- [Design requirements for Kata Containers](kata-design-requirements.md)

				- [VSocks](VSocks.md)

				- [VCPU handling](vcpu-handling.md)

				- [VCPU handling(in runtime-go)](vcpu-handling-runtime-go.md)

				- [VCPU handling(in runtime-rs)](vcpu-handling-runtime-rs.md)

				- [VCPU threads pinning](vcpu-threads-pinning.md)

				- [Host cgroups](host-cgroups.md)

				- [Agent systemd cgroup](agent-systemd-cgroup.md)

				- [`Inotify` support](inotify.md)

				- [`Hooks` support](hooks-handling.md)

				- [Metrics(Kata 2.0)](kata-2-0-metrics.md)

				- [Metrics in Rust Runtime(runtime-rs)](kata-metrics-in-runtime-rs.md)

				- [Design for Kata Containers `Lazyload` ability with `nydus`](kata-nydus-design.md)

				- [Design for direct-assigned volume](direct-blk-device-assignment.md)

				- [Design for core-scheduling](core-scheduling.md)

				- [Virtualization Reference Architecture](kata-vra.md)

				---

				- [Design proposals](proposals)

2

docs/design/VSocks.md


@@ -78,4 +78,4 @@ with the containers is if the VM itself or the `containerd-shim-kata-v2` dies, i
 the containers are removed automatically.
 [1]: https://wiki.qemu.org/Features/VirtioVsock
 [2]: ./vcpu-handling.md#virtual-cpus-and-kubernetes-pods
 [2]: ./vcpu-handling-runtime-go.md#virtual-cpus-and-kubernetes-pods

BIN
docs/design/arch-images/guest-image-management-architecture.png Normal file

Binary file not shown. Size: 61 KiB

BIN
docs/design/arch-images/guest-image-management-details.png Normal file

Binary file not shown. Size: 122 KiB

									
										12

docs/design/architecture/README.md
									
												
				@@ -57,7 +57,7 @@ section explains what this means.

				> [the architectural history document](history.md).

				The

				[containerd runtime shimv2 architecture](https://github.com/containerd/containerd/tree/main/runtime/v2)

				[containerd runtime shimv2 architecture](https://github.com/containerd/containerd/tree/main/core/runtime/v2)

				or _shim API_ architecture resolves the issues with the old

				architecture by defining a set of shimv2 APIs that a compatible

				runtime implementation must supply. Rather than calling the runtime

				@@ -349,6 +349,16 @@ The `exec` command allows an administrator or developer to enter the

				See [the developer guide](../../Developer-Guide.md#connect-to-debug-console) for further details.

				### policy command

				The `policy set` command allows an administrator or developer to set the policy

				in the [VM root environment](#environments). This makes it possible to enable or disable

				kata-agent APIs through the policy.

				The command is: `kata-runtime policy set policy.rego --sandbox-id XXXXXXXX`

				Please refer to [`genpolicy tool`](../../../src/tools/genpolicy/README.md) to see how to generate `policy.rego` mentioned above.

				More about the policy itself can be found in [Policy Details](../../../src/tools/genpolicy/genpolicy-auto-generated-policy-details.md).

				### Configuration

				See the [configuration file details](../../../src/runtime/README.md#configuration).

									
										6

docs/design/architecture/kubernetes.md
									
												
				@@ -3,16 +3,16 @@

				[Kubernetes](https://github.com/kubernetes/kubernetes/), or K8s, is a popular open source

				container orchestration engine. In Kubernetes, a set of containers sharing resources

				such as networking, storage, mount, PID, etc. is called a

				[pod](https://kubernetes.io/docs/user-guide/pods/).

				[pod](https://kubernetes.io/docs/concepts/workloads/pods/).

				A node can have multiple pods, but at a minimum, a node within a Kubernetes cluster

				only needs to run a container runtime and a container agent (called a

				[Kubelet](https://kubernetes.io/docs/admin/kubelet/)).

				[Kubelet](https://kubernetes.io/docs/concepts/overview/components/#kubelet)).

				Kata Containers represents a Kubelet pod as a VM.

				A Kubernetes cluster runs a control plane where a scheduler (typically

				running on a dedicated master node) calls into a compute Kubelet. This

				running on a dedicated control-plane node) calls into a compute Kubelet. This

				Kubelet instance is responsible for managing the lifecycle of pods

				within the nodes and eventually relies on a container runtime to

				handle execution. The Kubelet architecture decouples lifecycle

									
										2

docs/design/architecture/networking.md
									
												
				@@ -36,7 +36,7 @@ compatibility, and performance on par with MACVTAP.

				Kata Containers has deprecated support for bridge due to lacking performance relative to TC-filter and MACVTAP.

				Kata Containers supports both

				[CNM](https://github.com/docker/libnetwork/blob/master/docs/design.md#the-container-network-model)

				[CNM](https://github.com/moby/libnetwork/blob/master/docs/design.md#the-container-network-model)

				and [CNI](https://github.com/containernetworking/cni) for networking management.

				## Network Hotplug

									
										63

docs/design/hooks-handling.md
									
										Normal file
									
												
				@@ -0,0 +1,63 @@

				# Kata Containers support for `Hooks`

				## Introduction

				During a container's lifecycle, different hooks can be executed to perform custom actions. Kata Containers supports two types of hooks: `OCI Hooks` and `Kata Hooks`.

				### OCI Hooks

				The OCI Spec stipulates six hooks that can be executed at different points in time and in different namespaces: `Prestart Hooks`, `CreateRuntime Hooks`, `CreateContainer Hooks`, `StartContainer Hooks`, `Poststart Hooks` and `Poststop Hooks`. Kata Containers supports these hook types as compatibly as possible.

				The path and arguments of these hooks will be passed to Kata for execution via `bundle/config.json`. For example:

				```

				...

				"hooks": {

				  "prestart": [

				    {

				      "path": "/usr/bin/prestart-hook",

				      "args": ["prestart-hook", "arg1", "arg2"],

				      "env":  [ "key1=value1"]

				    }

				  ],

				  "createRuntime": [

				    {

				      "path": "/usr/bin/createRuntime-hook",

				      "args": ["createRuntime-hook", "arg1", "arg2"],

				      "env":  [ "key1=value1"]

				    }

				  ]

				}

				...

				```
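
To make the `config.json` layout above concrete, here is a minimal Python sketch that reads the `hooks` section and lists the configured hooks stage by stage. The stage keys are the ones used by the OCI runtime spec; the `list_hooks` helper is hypothetical and is not how Kata Containers itself processes hooks.

```python
import json

# A complete JSON document built from the fragment shown above.
CONFIG_JSON = """
{
  "hooks": {
    "prestart": [
      {
        "path": "/usr/bin/prestart-hook",
        "args": ["prestart-hook", "arg1", "arg2"],
        "env": ["key1=value1"]
      }
    ],
    "createRuntime": [
      {
        "path": "/usr/bin/createRuntime-hook",
        "args": ["createRuntime-hook", "arg1", "arg2"],
        "env": ["key1=value1"]
      }
    ]
  }
}
"""

# Hook stage keys as they appear in an OCI config.json.
OCI_HOOK_STAGES = (
    "prestart",
    "createRuntime",
    "createContainer",
    "startContainer",
    "poststart",
    "poststop",
)


def list_hooks(config):
    """Return (stage, path, args) tuples for every configured hook."""
    hooks = config.get("hooks", {})
    return [
        (stage, hook["path"], hook.get("args", []))
        for stage in OCI_HOOK_STAGES
        for hook in hooks.get(stage, [])
    ]


for stage, path, args in list_hooks(json.loads(CONFIG_JSON)):
    print(stage, path, args)
```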

				### Kata Hooks

				Kata Containers supports three additional kinds of hooks that are executed in the guest VM: `Guest Prestart Hook`, `Guest Poststart Hook`, and `Guest Poststop Hook`.

				The executable files for Kata Hooks must be packaged in the *guest rootfs*. The file path to those guest hooks should be specified in the configuration file, and guest hooks must be stored in a subdirectory of `guest_hook_path` according to their hook type. For example:

				+ In configuration file:

				```

				guest_hook_path="/usr/share/hooks"

				```

				+ In guest rootfs, prestart-hook is stored in `/usr/share/hooks/prestart/prestart-hook`.
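
The directory convention above can be sketched in Python as follows. Only the `prestart` subdirectory appears in the example; the `poststart` and `poststop` subdirectory names are an assumption by analogy, and `guest_hook_file` is a hypothetical helper.

```python
from pathlib import PurePosixPath

# "prestart" comes from the example above; the other two names are assumed.
GUEST_HOOK_TYPES = ("prestart", "poststart", "poststop")


def guest_hook_file(guest_hook_path, hook_type, hook_name):
    """Build the expected location of a guest hook inside the guest rootfs."""
    if hook_type not in GUEST_HOOK_TYPES:
        raise ValueError(f"unknown guest hook type: {hook_type!r}")
    return str(PurePosixPath(guest_hook_path) / hook_type / hook_name)


print(guest_hook_file("/usr/share/hooks", "prestart", "prestart-hook"))
```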

				## Execution

				The table below summarizes when and where these hooks are executed in Kata Containers:

				| Hook Name | Hook Type | Hook Path | Exec Place | Exec Time |

				|---|---|---|---|---|

				| `Prestart(deprecated)` | OCI hook | host runtime namespace | host runtime namespace | After VM is started, before container is created. |

				| `CreateRuntime` | OCI hook | host runtime namespace | host runtime namespace | After VM is started, before container is created, after `Prestart` hooks. |

				| `CreateContainer` | OCI hook | host runtime namespace | host vmm namespace* | After VM is started, before container is created, after `CreateRuntime` hooks. |

				| `StartContainer` | OCI hook | guest container namespace | guest container namespace | After container is created, before container is started. |

				| `Poststart` | OCI hook | host runtime namespace | host runtime namespace | After container is started, before start operation returns. |

				| `Poststop` | OCI hook | host runtime namespace | host runtime namespace | After container is deleted, before delete operation returns. |

				| `Guest Prestart` | Kata hook | guest agent namespace | guest agent namespace | During start operation, before container command is executed. |

				| `Guest Poststart` | Kata hook | guest agent namespace | guest agent namespace | During start operation, after container command is executed, before start operation returns. |

				| `Guest Poststop` | Kata hook | guest agent namespace | guest agent namespace | During delete operation, after container is deleted, before delete operation returns. |

				+ `Hook Path` specifies where the hook's path is resolved.

				+ `Exec Place` specifies in which namespace those hooks can be executed.

				  + For `CreateContainer` hooks, OCI requires running them inside the container namespace while resolving the hook executable path in the host runtime namespace, which is a non-starter for VM-based containers. Kata Containers therefore runs them in the *host vmm namespace*.

				+ `Exec Time` specifies at which time point those hooks can be executed.

									
										119

docs/design/kata-guest-image-management-design.md
									
										Normal file
									
												
				@@ -0,0 +1,119 @@

				# Kata Containers Guest Image Management Design

				To safeguard the integrity of container images and prevent tampering from the host side, we propose guest image management. This method, employed for Confidential Containers, ensures container images remain unaltered and secure.

				## Introduction to remote snapshot

				Containerd 1.7 introduced the `remote snapshotter` feature, which is the foundation for pulling images in the guest for Confidential Containers.

				While it's beyond the scope of this document to fully explain how the container rootfs is created to the point it can be executed, a fundamental grasp of the snapshot concept is essential. Put simply, containerd fetches the image layers from an OCI registry into its local content storage. However, the layers cannot be mounted as is (e.g. a layer can be tar+gzip compressed), and they should be immutable so that the content can be shared among containers. Thus containerd leverages snapshots of those layers to build the container's rootfs.

				The role of `remote snapshotter` is to reuse snapshots that are stored in a remotely shared place, thus enabling containerd to prepare the container’s rootfs in a manner similar to that of a local `snapshotter`. The key behavior that makes this the building block of Kata's guest image management for Confidential Containers is that containerd will not pull the image layers from the registry; instead, it assumes that `remote snapshotter` and/or an external entity will perform that operation on its behalf.

				Perhaps the simplest example of `remote snapshotter` in Confidential Containers is the pulling of images in the guest VM. Once the VM is ensured to be part of the Trusted Computing Base (TCB), and through a chain of delegations involving containerd, `remote snapshotter` and kata-runtime, it is possible for the kata-agent to pull the image directly.

				## `Remote snapshotter` implementations

				A `remote snapshotter` is a containerd plug-in that should implement the [Snapshot Interface](https://pkg.go.dev/github.com/containerd/containerd/v2/snapshots?utm_source=godoc#Snapshotter).

				The following `remote snapshotter` is leveraged by Kata Containers:

				- `nydus snapshotter`

				### `Nydus snapshotter`

				This `snapshotter` is implemented as an external containerd proxy plug-in for [`Nydus`](https://nydus.dev/).

				Currently it supports several runtime backends, notably `FUSE`, `virtiofs`, and `EROFS`; the first of these, `FUSE`, is the one exercised in the Kata Containers CI.

				## `Nydus snapshotter` and containerd-shim-v2 integration

				```go

				// KataVirtualVolume encapsulates information for extra mount options and direct volumes.

				type KataVirtualVolume struct {

				   VolumeType   string                `json:"volume_type"`

				   Source       string                `json:"source,omitempty"`

				   FSType       string                `json:"fs_type,omitempty"`

				   Options      []string              `json:"options,omitempty"`

				   DirectVolume *DirectAssignedVolume `json:"direct_volume,omitempty"`

				   ImagePull    *ImagePullVolume      `json:"image_pull,omitempty"`   //<-Used for pulling images in the guest

				   NydusImage   *NydusImageVolume     `json:"nydus_image,omitempty"`

				   DmVerity     *DmVerityInfo         `json:"dm_verity,omitempty"`

				}

				```

				## Guest image management implementations

				### Guest pull with `nydus snapshotter`

				Pull the container image directly from the guest VM using `nydus snapshotter` backend.

				#### General Characteristics

				- Container image pulled in the guest

				- Pause image should be built in the guest's rootfs

				- Confidentiality for image manifest and config: No

				- Confidentiality for blob data: Yes

				- Use `nydus snapshotter` as `remote snapshotter` configured with the FUSE runtime backend

				#### Architecture

				The following diagram provides an overview of the architecture, with its key components, for pulling images in the guest.

				![guest-image-management-architecture](arch-images/guest-image-management-architecture.png)

				#### Sequence diagrams

				The sequence diagram below offers a detailed overview of the messages/calls exchanged to pull an unencrypted, unsigned image from an unauthenticated container registry. This involves the kata-runtime, kata-agent, and the guest-components’ image-rs to use the guest pull mechanism.

				![guest-image-management-details](arch-images/guest-image-management-details.png)

				First and foremost, the guest pull code path is only activated when `nydus snapshotter` requires the handling of a volume whose type is `image_guest_pull`, as can be seen in the message below:

				```json

				{
				  "volume_type": "image_guest_pull",
				  "source": "quay.io/kata-containers/confidential-containers:unsigned",
				  "fs_type": "overlayfs",
				  "options": [
				    "containerd.io/snapshot/cri.layer-digest=sha256:24fb2886d6f6c5d16481dd7608b47e78a8e92a13d6e64d87d57cb16d5f766d63",
				    "containerd.io/snapshot/nydus-proxy-mode=true"
				  ],
				  "image_pull": {
				    "metadata": {
				      "containerd.io/snapshot/cri.layer-digest": "sha256:24fb2886d6f6c5d16481dd7608b47e78a8e92a13d6e64d87d57cb16d5f766d63",
				      "containerd.io/snapshot/nydus-proxy-mode": "true"
				    }
				  }
				}

				```

				In other words, the `VolumeType` field of `KataVirtualVolume` is set to `image_guest_pull`.
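
The check described above can be sketched in Python using the field names from the volume message. This is an illustration only: the actual check lives in the Kata runtime, and the helper names `is_guest_pull` and `layer_digest` are hypothetical.

```python
import json

# A well-formed version of the volume message shown above.
VOLUME_JSON = """
{
  "volume_type": "image_guest_pull",
  "source": "quay.io/kata-containers/confidential-containers:unsigned",
  "fs_type": "overlayfs",
  "image_pull": {
    "metadata": {
      "containerd.io/snapshot/cri.layer-digest": "sha256:24fb2886d6f6c5d16481dd7608b47e78a8e92a13d6e64d87d57cb16d5f766d63",
      "containerd.io/snapshot/nydus-proxy-mode": "true"
    }
  }
}
"""


def is_guest_pull(volume):
    """True when the volume asks for the image to be pulled inside the guest."""
    return volume.get("volume_type") == "image_guest_pull"


def layer_digest(volume):
    """Return the layer digest from the image_pull metadata, if present."""
    metadata = (volume.get("image_pull") or {}).get("metadata", {})
    return metadata.get("containerd.io/snapshot/cri.layer-digest")


volume = json.loads(VOLUME_JSON)
if is_guest_pull(volume):
    print("guest pull requested for", volume["source"])
```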

				Next, `handleImageGuestPullBlockVolume()` is called to build the Storage object that will be attached to the message later sent to kata-agent via the `CreateContainerRequest()` RPC. `handleImageGuestPullBlockVolume()` is also where the handling of the pause image begins if the request is for a sandbox container type (see more about the pause image below).

				Below is an example of storage information packaged in the message sent to the kata-agent:

				```json

				"driver": "image_guest_pull", 

				    "driver_options": [

				        "image_guest_pull"{

				            "metadata":{

				                "containerd.io/snapshot/cri.layer-digest": "sha256:24fb2886d6f6c5d16481dd7608b47e78a8e92a13d6e64d87d57cb16d5f766d63",

				                "containerd.io/snapshot/nydus-proxy-mode": "true",

				                "io.katacontainers.pkg.oci.bundle_path": "/run/containerd/io.containerd.runtime.v2.task/k8s.io/cb0b47276ea66ee9f44cc53afa94d7980b57a52c3f306f68cb034e58d9fbd3c6",

				                "io.katacontainers.pkg.oci.container_type": "pod_container",

				                "io.kubernetes.cri.container-name": "coco-container",

				                "io.kubernetes.cri.container-type": "container",

				                "io.kubernetes.cri.image-name": "quay.io/kata-containers/confidential-containers:unsigned",

				                "io.kubernetes.cri.sandbox-id":"7a0d058477e280604ae02de6a016959e8a05fcd3165c47af41eabcf205b55517",

				                "io.kubernetes.cri.sandbox-name": "coco-pod",

				                "io.kubernetes.cri.sandbox-namespace": "default",

				                "io.kubernetes.cri.sandbox-uid": "de7c6a0c-79c0-44dc-a099-69bb39f180af"

				            }

				        }

				    ], 

				    "source": "quay.io/kata-containers/confidential-containers:unsigned", 

				    "fstype": "overlay", 

				    "options": [], 

				    "mount_point": "/run/kata-containers/cb0b47276ea66ee9f44cc53afa94d7980b57a52c3f306f68cb034e58d9fbd3c6/rootfs",

				```

				Next, the kata-agent's RPC module will handle the create container request which, among other things, involves adding storages to the sandbox. The storage module contains implementations of the `StorageHandler` interface for various storage types; `ImagePullHandler` is in charge of handling the storage object for the container image (the storage manager instantiates the handler based on the value of the "driver").

				`ImagePullHandler` delegates the image pulling operation to `ImageService.pull_image()`, which creates the image's bundle directory on the guest filesystem and, in turn, calls image-rs to actually fetch and uncompress the image's bundle.

				> **Notes:**

				> In this flow, `ImageService.pull_image()` parses the image metadata, looking for either the `io.kubernetes.cri.container-type: sandbox` or `io.kubernetes.cri-o.ContainerType: sandbox` (CRI-O case) annotation. If one is found, it never calls `image-rs.pull_image()`, because the pause image is expected to already be inside the guest's filesystem; `ImageService.unpack_pause_image()` is called instead.
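
The decision described in the note can be sketched as follows. This is a Python illustration of the branching only, assuming the two annotation keys quoted above; the real logic is part of the kata-agent, and `is_sandbox` and `choose_action` are hypothetical names.

```python
# Annotation keys and the value that mark a sandbox (pause) container,
# as quoted in the note above.
SANDBOX_ANNOTATIONS = {
    "io.kubernetes.cri.container-type": "sandbox",
    "io.kubernetes.cri-o.ContainerType": "sandbox",
}


def is_sandbox(metadata):
    """True when the image metadata carries a sandbox annotation."""
    return any(metadata.get(key) == value for key, value in SANDBOX_ANNOTATIONS.items())


def choose_action(metadata):
    """Name the step taken for this container: unpack the pause image or pull."""
    return "unpack_pause_image" if is_sandbox(metadata) else "pull_image"


print(choose_action({"io.kubernetes.cri.container-type": "container"}))
print(choose_action({"io.kubernetes.cri.container-type": "sandbox"}))
```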

				References:

				[1] [[RFC] Image management proposal for hosting sharing and peer pods](https://github.com/confidential-containers/confidential-containers/issues/137)

				[2] https://github.com/containerd/containerd/blob/main/docs/content-flow.md

Compare commits

3538 Commits 3.1.2 ... 3.5.0

4 .github/workflows/PR-wip-checks.yaml vendored

100 .github/workflows/add-backport-label.yaml vendored

6 .github/workflows/add-issues-to-project.yaml vendored

15 .github/workflows/add-pr-sizing-label.yaml vendored

29 .github/workflows/auto-backport.yaml vendored

336 .github/workflows/basic-ci-amd64.yaml vendored Normal file

113 .github/workflows/build-checks.yaml vendored Normal file

142 .github/workflows/build-kata-static-tarball-amd64.yaml vendored Normal file

123 .github/workflows/build-kata-static-tarball-arm64.yaml vendored Normal file

124 .github/workflows/build-kata-static-tarball-ppc64le.yaml vendored Normal file

172 .github/workflows/build-kata-static-tarball-s390x.yaml vendored Normal file

8 .github/workflows/cargo-deny-runner.yaml vendored

21 .github/workflows/ci-nightly-s390x.yaml vendored Normal file

19 .github/workflows/ci-nightly.yaml vendored Normal file

30 .github/workflows/ci-on-push.yaml vendored Normal file

248 .github/workflows/ci.yaml vendored Normal file

28 .github/workflows/commit-message-check.yaml vendored

10 .github/workflows/darwin-tests.yaml vendored

4 .github/workflows/docs-url-alive-check.yaml vendored

86 .github/workflows/kata-deploy-push.yaml vendored

169 .github/workflows/kata-deploy-test.yaml vendored

36 .github/workflows/kata-runtime-classes-sync.yaml vendored Normal file

19 .github/workflows/move-issues-to-in-progress.yaml vendored

107 .github/workflows/payload-after-push.yaml vendored Normal file

66 .github/workflows/publish-kata-deploy-payload-amd64.yaml vendored Normal file

71 .github/workflows/publish-kata-deploy-payload-arm64.yaml vendored Normal file

75 .github/workflows/publish-kata-deploy-payload-ppc64le.yaml vendored Normal file

69 .github/workflows/publish-kata-deploy-payload-s390x.yaml vendored Normal file

57 .github/workflows/release-amd64.yaml vendored Normal file

57 .github/workflows/release-arm64.yaml vendored Normal file

62 .github/workflows/release-ppc64le.yaml vendored Normal file

61 .github/workflows/release-s390x.yaml vendored Normal file

305 .github/workflows/release.yaml vendored

54 .github/workflows/require-pr-porting-labels.yaml vendored

67 .github/workflows/run-cri-containerd-tests-ppc64le.yaml vendored Normal file

63 .github/workflows/run-cri-containerd-tests-s390x.yaml vendored Normal file

123 .github/workflows/run-k8s-tests-on-aks.yaml vendored Normal file

100 .github/workflows/run-k8s-tests-on-garm.yaml vendored Normal file

82 .github/workflows/run-k8s-tests-on-ppc64le.yaml vendored Normal file

93 .github/workflows/run-k8s-tests-on-zvsi.yaml vendored Normal file

86 .github/workflows/run-k8s-tests-with-crio-on-garm.yaml vendored Normal file

275 .github/workflows/run-kata-coco-tests.yaml vendored Normal file

90 .github/workflows/run-kata-deploy-tests-on-aks.yaml vendored Normal file

65 .github/workflows/run-kata-deploy-tests-on-garm.yaml vendored Normal file

59 .github/workflows/run-kata-monitor-tests.yaml vendored Normal file

94 .github/workflows/run-metrics.yaml vendored Normal file

46 .github/workflows/run-runk-tests.yaml vendored Normal file

52 .github/workflows/snap-release.yaml vendored

37 .github/workflows/snap.yaml vendored

17 .github/workflows/stale.yaml vendored Normal file

33 .github/workflows/static-checks-dragonball.yaml vendored

26 .github/workflows/static-checks-self-hosted.yaml vendored Normal file

173 .github/workflows/static-checks.yaml vendored

3 .gitignore vendored

83 CODEOWNERS

11 Makefile

20 README.md

2 VERSION

343 ci/README.md Normal file

182 ci/gh-util.sh Executable file

19 ci/install_libseccomp.sh

14 ci/install_yq.sh

33 ci/lib.sh

55 ci/openshift-ci/cleanup.sh Executable file

6 ci/openshift-ci/cluster/configs/selinux.conf Normal file

35 ci/openshift-ci/cluster/deploy_webhook.sh Executable file

13 ci/openshift-ci/cluster/deployments/configmap_installer_kernel.yaml Normal file

14 ci/openshift-ci/cluster/deployments/configmap_installer_qemu.yaml Normal file

12 ci/openshift-ci/cluster/deployments/configmap_kata-webhook.yaml.in Normal file

9 ci/openshift-ci/cluster/deployments/machineconfig_sandboxedcontainers_extension.yaml Normal file

23 ci/openshift-ci/cluster/deployments/machineconfig_selinux.yaml.in Normal file

40 ci/openshift-ci/cluster/deployments/relabel_selinux.yaml Normal file

28 ci/openshift-ci/cluster/deployments/workaround-9206-crio-ds.yaml Normal file

18 ci/openshift-ci/cluster/deployments/workaround-9206-crio.yaml Normal file

245 ci/openshift-ci/cluster/install_kata.sh Executable file

20 ci/openshift-ci/lib.sh Normal file

92 ci/openshift-ci/run_smoke_test.sh Executable file

30 ci/openshift-ci/smoke/http-server.yaml Normal file

28 ci/openshift-ci/smoke/service.yaml Normal file