kata-containers

mirror of https://github.com/kata-containers/kata-containers.git synced 2026-01-25 22:54:29 +00:00

Author	SHA1	Message	Date
Fabiano Fidêncio	a04cdbc40f	tests: Enforce qemu-coco-dev for experimental_force_guest_pull The fact that we were not explicitly setting the VMM was leading to us testing with the default runtime class (qemu). :-/ Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-11-12 16:07:05 +01:00
Fabiano Fidêncio	6d3c20bc45	riscv: Introduce its own nightly tests By doing this, the ones interested on RISC-V support can still have a ood visibility of its state, without the extra noise in our CI. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-11-12 09:46:17 +01:00
Fabiano Fidêncio	d82eb8d0f1	ci: Drop docker tests We have had those tests broken for months. It's time to get rid of those. NOTE that we could easily revert this commit and re-add those tests as soon as we find someone to maintain and be responsible for such integration. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-11-11 17:02:02 +01:00
Fabiano Fidêncio	464764c7e0	tests: nvidia: kbs: Ensure KBS_INGRESS=nodeport I've missed doing this doing the KBS deployment set up. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-11-10 13:01:30 +01:00
Fabiano Fidêncio	37d4eb0b77	ci: nvidia: Ensure K8S_TEST_HOST_TYPE=baremetal So the proper cleanups are performed in case something goes awry in a previous run. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-11-09 10:51:33 +01:00
Fabiano Fidêncio	03e06fdf4d	tests: nvidia: Deploy Trustee Let's ensure Trustee is deployed as some of the tests rely images that live behind authentication. /o\ The approach taken here to deploy Trustee is exactly the same one taken on the other CoCo tests, apart from an env var passed to ensure we're using the NVIDIA remote verifier (which will be in handy very very soon). Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-11-07 12:32:11 +01:00
Hyounggyu Choi	ff429072b6	Merge pull request #11924 from BbolroC/fix-static-checks-actionspz ci: Fix failing static checks to enable IBM actionspz - Z specific	2025-11-06 09:04:04 +01:00
Manuel Huber	d8953f67c5	ci: Onboard another NVIDIA machine Let's add a new NVIDIA machine, which later on will be used for CC related tests. For now the current tests are skipped in the CC capable machine. Signed-off-by: Manuel Huber <manuelh@nvidia.com> Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-11-05 23:23:08 +01:00
Fabiano Fidêncio	ace9cf942d	tests: guest-pull: Fix names When added, I've mistakenly used the wrong test-type name, which is now fixed and should be enough to trigger the tests correctly. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-11-05 18:21:48 +01:00
Hyounggyu Choi	4ee2037974	GHA: Run runtime tests on self-hosted runners for P/Z On IBM actionspz P/Z runners, the following error was observed during runtime tests: ``` host system doesn't support vsock: stat /dev/vhost-vsock: no such file or directory ``` Since loading the vsock module on the fly is not permitted, this commit moves the runtime tests back to self-hosted runners for P/Z. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2025-11-05 16:35:04 +00:00
Fabiano Fidêncio	1dfbb14093	tests: Stop testing on stratovirt Stratovirt has been failing for a considerable amount of time, with no sign of someone watching it and being actively working on a fix. With this we also stop building and shipping stratovirt as part of our release as we cannot test it. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-11-04 10:22:46 +01:00
Fabiano Fidêncio	4293cdf846	tests: Add stability tests for experimental-force-guest-pull A few weeks ago we've tested nydus-snapshotter with this approach, and we DID find issues with it. Now, let's also test this with `experimental_force_guest_pull`. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-11-04 09:02:19 +01:00
Fabiano Fidêncio	157b2c32ce	scripts: release: Run helm dependencies update Otherwise we'll face issues like: ``` Error: found in Chart.yaml, but missing in charts/ directory: node-feature-discovery ``` Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-11-01 17:54:58 +01:00
Fabiano Fidêncio	1bc873397b	tests: Use NFD as part of the tests As we have the ability to deploy NFD as a sub-chart of our chart, let's make sure we test it during our CI. We had to increase the timeout values, where we had timeouts set, to deploy / undeploy kata, as now NFD is also deployed / undeployed. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-10-31 16:30:13 +01:00
Fabiano Fidêncio	e30e2b5f45	tests: k8s: Remove tests running on GitHub provided runner We have 2 tests running on GitHub provided runners: * devmapper * CRI-O - devmapper situation For devmapper, we're currently testing devmapper with s390x as part of one of its jobs. More than that, this test has been failing here due to a lack of space in the machine for quite some time, and no-action was taken to bring it back either via GARM or some other way. With that said, let's rely on the s390x CI to test devmapper and avoid one extra failure on our CI by removing this one. - cri-o situation CRI-O is being tested with a fixed version of kubernetes that's already reached its EOL, and a CRI-O version that matches that k8s version. There has been attempts to raise issues, and also to provide a PR that does at least part of the work ... leaving the debugging part for the maintainers of the CI. However, there was no action on those from the maintainers. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-10-30 11:46:59 +01:00
Fabiano Fidêncio	59883a2d99	actions: Remove unused USING_NFD There's no reason to keep the env var / input as it's never been used and now kata-deploy detects automatically whether NFD is deployed or not. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-10-28 21:24:27 +01:00
Amulyam24	c603094584	revert: Enable new ibm runners for ppc64le Temporarily disables the new runners for building artifacts jobs. Will be re-enabled once they are stable. Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2025-10-28 17:09:26 +05:30
Hyounggyu Choi	7d2fe5e187	revert: Enable new ibm runners for s390x This partially reverts `8dcd91c` for the s390x because the CI jobs are currently blocking the release. The new runners will be re-introduced once they are stable and no longer impact critical paths. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2025-10-28 11:11:51 +01:00
Amulyam24	9876cbffd6	github: migrate k8s job to a different runner on ppc64le Migrate the k8s job to a different runner and use a long running cluster instead of creating the cluster on every run. Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2025-10-24 18:20:11 +05:30
Steve Horsman	5713072385	Merge pull request #11974 from fidencio/topic/payload-after-build-upload-latest-charts actions: Push a `0.0.0-dev` chart package to the registries	2025-10-24 13:13:02 +01:00
Fabiano Fidêncio	ebc1d64096	actions: Push a `0.0.0-dev` chart package to the registries This will help immensely projects consuming the kata-deploy helm chart to use configuration options added during the development cycle that are waiting for a release to be out ... allowing very early tests of the stack. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-10-24 11:44:27 +02:00
Dan Mihai	b8c1215d99	gha: no policy for cbl-mariner during ci Temporarily disable the auto-generated Agent Policy on Mariner hosts, to workaround the new test failures on these hosts. When re-enabling auto-generated policy in the future, that would be better achieved with a tests/integration/kubernetes/gha-run.sh change. Those changes are easier to test compared with GHA YAML changes. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2025-10-24 04:00:36 +00:00
Zvonko Kaiser	0b11190fcf	gpu: Add Arm64 kernel signing Adopt working amd64 workflow to arm64 Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-10-22 21:05:32 +02:00
Aurélien Bombo	b7f542443e	ci: Always refresh OIDC token before cluster deletion This forces OIDC token refresh even if the tests step failed, so that we also have proper credentials to delete the cluster in that case. I first noticed the original issue here: https://github.com/kata-containers/kata-containers/actions/runs/18659064688/job/53215379040?pr=11950 Fixes: #11953 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2025-10-21 09:35:52 -05:00
Seunguk Shin	40dac78412	kata-deploy: support build confidential kernel and shim-v2 for CCA After supporting the Arm CCA, it will rely on the kernel kvm.h headers to build the runtime. The kernel-headers currently quite new with the traditional one, so that we rely on build the kernel header first and then inject it to the shim-v2 build container. Signed-off-by: Kevin Zhao <kevin.zhao@linaro.org> Co-authored-by: Seunguk Shin <seunguk.shin@arm.com>	2025-10-16 17:23:58 +08:00
Fabiano Fidêncio	aa7e46b5ed	tests: Check the multi-snapshotter situation on containerd One problem that we've been having for a reasonable amount of time, is containerd not behaving very well when we have multiple snapshotters. Although I'm adding this test with my "CoCo" hat in mind, the issue can happen easily with any other case that requires a different snapshotter (such as, for instance, firecracker + devmapper). With this in mind, let's do some stability tests, checking every hour a simple case of running a few pre-defined containers with runc, and then running the same containers with kata. This should be enough to put us in the situation where containerd gets confused about which snapshotter owns the image layers, and break on us (or not break and show us that this has been solved ...). Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-10-15 13:35:43 +02:00
stevenhorsman	8ce714cf97	ci: Add protobuf-compiler dependencies We are seeing more protoc related failures on the new runners, so try adding the protobuf-compiler dependency to these steps to see if it helps. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-10-14 10:58:58 +01:00
Fabiano Fidêncio	e782d1ad50	ci: k8s: Test experimental_force_guest_pull Now that we have added the ability to deploy kata-containers with experimental_force_guest_pull configured, let's make sure we test it to avoid any kind of regressions. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-10-10 20:08:10 +02:00
Fabiano Fidêncio	496e255ea2	build: Fix KBUILD_SIGN_PIN usage What was done in the past, trying to set the env var on the same step it'd be used, simply does not work. Instead, we need to properly set it through the `env` set up, as done now. We're also bumping the kata_config_version to ensure we retrigger the kernel builds. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-10-10 15:25:10 +02:00
stevenhorsman	8dcd91cf5f	ci: Enable new ibm runners We have some scalable s390x and ppc runners, so start to use them for build and test, to improve the throughput of our CI Signed-off-by: stevenhorsman <steven@uk.ibm.com> Co-authored-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2025-10-10 09:42:06 +01:00
Fabiano Fidêncio	06a3bbdd44	ci: k8s: coco: Add "Report tests" step For some reason we didn't have the "Report tests" step as part of the TEE jobs. This step immensely helps to check which tests are failing and why, so let's add it while touching the workflow. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-10-10 09:51:59 +02:00
Fabiano Fidêncio	a1f90fe350	tests: k8s: Unify k8s TEE tests There's no reason to have the code duplication between the SNP / TDX tests for CoCo, as those are basically using the same configuration nowadays. Note that for the TEEs case, as the nydus-snapshotter is deployed by the admin, once, instead of deploying it on every run ... I'm actually removing the nydus-snapshotter steps so we make it clear that those steps are not performed by the CI. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-10-10 09:51:59 +02:00
Aurélien Bombo	07645cf58b	ci: actionlint: Address issues and set as required Address issues just introduced and set actionlint as a required by removing the path filter. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2025-10-08 16:55:27 -05:00
Aurélien Bombo	5a4ddb8c71	ci: zizmor: Fix all `template-injection` alerts Fix all instances of template injection by using environment variables as recommended by Zizmor, instead of directly injecting values into the commands. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2025-10-08 16:55:26 -05:00
Aurélien Bombo	7b203d1b43	ci: zizmor: Ignore `dangerous-triggers` audit for known safe usage The two ignored cases are strictly necessary for the CI to work today, and we have various security mitigations in place. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2025-10-08 16:55:08 -05:00
Aurélien Bombo	7afdfc7388	ci: zizmor: Disable `undocumented-permissions` audit There are 62 such warnings and addressing them would take quite a bit of time so just disable them for now. help[undocumented-permissions]: permissions without explanatory comments --> ./.github/workflows/release.yaml:71:7 \| 71 \| packages: write \| ^^^^^^^^^^^^^^^ needs an explanatory comment 72 \| id-token: write \| ^^^^^^^^^^^^^^^ needs an explanatory comment 73 \| attestations: write \| ^^^^^^^^^^^^^^^^^^^ needs an explanatory comment \| = note: audit confidence → High Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2025-10-08 16:55:08 -05:00
Aurélien Bombo	ec81ea95df	gha: Add `workflow_dispatch` trigger to `docs-url-alive-check` We can't test this PR because the workflow needs this trigger, so adding this will allow testing future PRs. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2025-10-08 14:39:34 -05:00
Aurélien Bombo	4d760e64ae	gha: Fix docs-url-alive-check workflow The Go installation step was broken because the checkout action was checking out the code in a subdirectory: https://github.com/kata-containers/kata-containers/actions/runs/18265538456/job/51999316919 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2025-10-08 14:39:34 -05:00
Fabiano Fidêncio	3418cedacc	ci: Add tests for erofs-snapshotter (for coco-qemu-dev) erofs-snapshotter can be used to leverage sharing the image from the host to the guest without the need of a shared filesystem (such as virtio-fs or virtio-9p). This case is ideal for Confidential Computing enabled on Kata Containers, and we can immensely benefit from this snapshotter, thus let's test it as soon as possible so we can find issues, report bugs, and ask for enhancement requests. There are at least a few things that we know for sure to be problematic now: * Policy has to be adjusted to the erofs-snapshotter * There is no support for signed nor encrypted images * Tests that use the KBS are disabled for now Even with the limitations, I do believe we should be testing the snapshoitter, so we can team up and get those limitations addressed. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-10-08 10:34:09 +02:00
Fabiano Fidêncio	f994bacf6c	tests: coco: Use the new way to set up nydus snapshotter Let's rely on kata-deploy setting up the nydus snapshotter for us, instead of doing this with external code. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-10-07 10:32:46 +02:00
Fabiano Fidêncio	4359c7b15d	tests: Ensure the nydus-snapshotter versions are aligned In the previous commit we added the assumption that the nydus-snapshotter version should be the same in two different places. Now, with this test, we ensure those will always be in sync. Signed-off-by: Fabiano Fidêncio <ffidencio@nvidia.com>	2025-10-07 10:32:46 +02:00
Markus Rudy	369124b180	ci: build genpolicy on darwin genpolicy is a developer tool that should be usable on MacOS. Adding it to the darwin CI job ensures that it can still be built after changes. On an Apple M2, the output of `uname -m` is `arm64`, which is why a new case is needed in the arch_to_* functions. We're not going to cross-compile binaries on darwin, so don't install any additional Rust targets. Fixes: #11635 Signed-off-by: Markus Rudy <mr@edgeless.systems>	2025-09-29 09:48:32 +02:00
Aurélien Bombo	433e59de1f	gha: zizmor: fix "workflow or action definition without a name" error This fixes that error everywhere by adding a `name:` field to all jobs that were missing it. We keep the same name as the job ID to ensure no disturbance to the required job names. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2025-09-25 23:34:40 -05:00
Aurélien Bombo	2e033d0079	gha: Run Zizmor without Advanced Security This does not change the security of the analysis, this is just to work around zizmorcore/zizmor-action#43. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2025-09-25 10:50:41 -05:00
Fabiano Fidêncio	8abfef358a	tests: Only run docker tests with one VMM Docker tests have been broken for a while and should be removed if we cannot maintain those. For now, though, let's limit it to run only with one hypervisor and avoid wasting resources for no reason. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2025-09-15 23:03:04 +02:00
Fabiano Fidêncio	dce6f13da8	tests: Only run devmapper tests with QEMU devmapper tests have been failing for a while. It's been breaking on the kata-deploy deployment, which is most likely related to Disk Pressure. Removing files was not enough to get the tests to run, so we'll just run those with QEMU as a way to test fixes. Once we get the test working, we can re-enable the other VMMs, but for now let's just not waste resources for no reason. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2025-09-15 23:02:33 +02:00
Fabiano Fidêncio	ad7e60030a	tests: k8s: kata-deploy: Remove unnecessary dirs to free up space This is following Steve's suggestion, based on what's been done on cloud-api-adaptor. The reason we're doing it here is because we've seen pods being evicted due to disk pressure. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2025-09-15 15:27:54 +02:00
stevenhorsman	9c0fcd30c5	ci: Add slab to dependabot groups Add slab, so that in future the different component bumps are all done together Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-09-15 09:48:03 +02:00
Aurélien Bombo	11655ef029	ci: Run Zizmor on pushes to any branch This runs Zizmor on pushes to any branch, not just main. This is useful for: 1. Testing changes in feature branches with the manually-triggered CI. 2. Forked repos that may use a different name than "main" for their default branch. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2025-09-11 09:33:25 -05:00
Hyounggyu Choi	1737777d28	Merge pull request #11743 from BbolroC/enable-ci-qemu-se-runtime-rs runtime-rs: Enable s390x nightly test for IBM SEL	2025-09-10 15:00:16 +02:00

1 2 3 4 5 ...

971 Commits