kata-containers

mirror of https://github.com/kata-containers/kata-containers.git synced 2025-08-17 15:38:00 +00:00

Author	SHA1	Message	Date
Paul Meyer	71796f7b12	ci/static-checks: install opa Make open-policy-agent available for static checks as prerequisite for rego checks. Signed-off-by: Paul Meyer <katexochen0@gmail.com>	2025-06-12 10:46:43 +02:00
Aurélien Bombo	66ae9473cb	Merge pull request #11397 from kata-containers/sprt/validate-ok-to-test ci: gha: Remove ok-to-test label on every push	2025-06-10 16:42:54 -05:00
stevenhorsman	99e70100c7	workflows: Set persist-credentials: false on checkout By default the checkout action leaves the credentials in the checked-out repo's `.git/config`, which means they could get exposed. Use persist-credentials: false to prevent this happening. Note: static-checks.yaml does use git diff after the checkout, but the git docs state that git diff is just local, so doesn't need authentication. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-06-10 10:33:41 +01:00
Aurélien Bombo	2ee3470627	ci: gha: Remove ok-to-test label on every push This removes the ok-to-test label on every push, except if the PR author has write access to the repo (ie. permission to modify labels). This protects against attackers who would initially open a genuine PR, then push malicious code after the initial review. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2025-06-09 12:37:06 -05:00
Aurélien Bombo	9dd3807467	ci: Use OIDC to log into Azure This completely eliminates the Azure secret from the repo, following the below guidance: https://docs.github.com/en/actions/security-for-github-actions/security-hardening-your-deployments/configuring-openid-connect-in-azure The federated identity is scoped to the `ci` environment, meaning: * I had to specify this environment in some YAMLs. I don't believe there's any downside to this. * As previously, the CI works seamlessly both from PRs and in the manual workflow. I also deleted the tools/packaging/kata-deploy/action folder as it doesn't seem to be used anymore, and it contains a reference to the secret. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2025-06-06 15:26:10 -05:00
Steve Horsman	31a8944da1	Merge pull request #11334 from kata-containers/remove-inherit-secrets workflows: Replace secrets: inherit	2025-06-06 16:41:13 +01:00
stevenhorsman	66ef1c1198	workflows: Replace secrets: inherit Having secrets unconditionally being inherited is bad practice, so update the workflows to only pass through the minimal secrets that are needed Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-06-06 09:56:46 +01:00
stevenhorsman	89d038d2b4	workflows: Switch QUAY_DEPLOYER_USERNAME to var QUAY_DEPLOYER_USERNAME isn't sensitive, so update the secret for a var to simplify the workflows Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-06-06 09:49:14 +01:00
stevenhorsman	2eda21180a	workflows: Switch AUTHENTICATED_IMAGE_USER to var AUTHENTICATED_IMAGE_USER isn't sensitive, so update the secret for a var to simplify the workflows Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-06-06 09:49:14 +01:00
Markus Rudy	9ffed463a1	ci: fix artifact name of RISC-V tarball The artifact name accidentally referred to ARM64, which caused a clash in CI runs. Signed-off-by: Markus Rudy <mr@edgeless.systems>	2025-06-06 08:29:48 +02:00
stevenhorsman	6c6e16eef3	workflows: Remove docker hub registry publishing As docker hub has rate limiting issues, mirror quay.io to ghcr.io instead Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-06-05 11:46:51 +01:00
stevenhorsman	586d9adfe5	workflow: add packages: write to csi-driver publish This one was missed in the earlier PR Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-05-29 15:57:07 +01:00
stevenhorsman	c34416f53a	workflows: Add explicit permissions where needed We have a number of jobs that either need, or nest workflows that need, gh permissions, such as for pushing to ghcr, or doing attest build provenance. This means they need write permissions on things like `packages`, `id-token` and `attestations`, so we need to set these permissions at the job-level (along with `contents: read`), so they are not restricted by our safe defaults. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-05-28 19:34:28 +01:00
stevenhorsman	088e97075c	workflow: Add top-level permissions Set: ``` permissions: contents: read ``` as the default top-level permissions explicitly to conform to recommended security practices e.g. https://github.com/ossf/scorecard/blob/main/docs/checks.md#token-permissions	2025-05-28 19:34:28 +01:00
Steve Horsman	7a9d919e3e	Merge pull request #11322 from kata-containers/workflow-permissions workflows: Add explicit permissions for attestation	2025-05-28 17:28:22 +01:00
stevenhorsman	4d4fb86d34	workflow: Update gatekeeper permissions I shortsightedly forgot that gatekeeper would need to read more than just the commit content in its python scripts, so add read permissions to actions issues which it uses in its processing Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-05-28 15:58:27 +01:00
Steve Horsman	fed63e0801	Merge pull request #11319 from stevenhorsman/remove-old-workflows workflows: Delete workflows	2025-05-28 15:38:19 +01:00
stevenhorsman	3ff602c1e8	workflows: Add explicit permissions for attestation We have a number of jobs that nest the build-static-tarball workflows later on. Due to these doing attest build provenance, and pushing to ghcr.io, they need write permissions on `packages`, `id-token` and `attestations`, so we need to set these permissions on the top-level jobs (along with `contents: read`), so they are not blocked. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-05-28 12:56:52 +01:00
stevenhorsman	2f0dc2ae24	workflows: gatekeeper: Update permissions Restrict the permissions of gatekeeper flow to read contents only for better security Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-05-28 09:57:19 +01:00
stevenhorsman	f900b0b776	workflows: Delete workflows Some legacy workflows require write access to github which is a security weakness and don't provide much value, so lets remove them. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-05-28 09:45:42 +01:00
Wainer dos Santos Moschetta	80a816db9d	workflows/run-k8s-tests-coco-nontee: add step to report tests Run `gha-run.sh report-tests` to generate the report of the tests. Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2025-05-20 14:43:38 -03:00
Seunguk Shin	5cabce1a25	packaging: Build edk2 for arm64 The edk2 is required for memory hot plug on qemu for arm64. This adds the edk2 to static tarball for arm64. Signed-off-by: Seunguk Shin <seunguk.shin@arm.com> Reviewed-by: Nick Connolly <nick.connolly@arm.com>	2025-05-15 10:12:24 +01:00
Fabiano Fidêncio	71e8c1b4f0	helm: release: Publish our helm charts to the OCI registries Let's take advantage that helm take and OCI registry as the charts, and upload our charts to the OCI registries we've been using so far. Signed-off-by: Fabiano Fidêncio <fidencio@northflank.com>	2025-05-14 20:20:35 +02:00
Ruoqing He	384d335419	ci: Enable build-check for agent on riscv64 Enable build-check for `agent` component for riscv64 platforms. Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>	2025-05-06 01:48:37 +00:00
stevenhorsman	f8fcd032ef	workflow: Set RUST_LIB_BACKTRACE=0 As discussed in #9538, with anyhow >=1.0.77 we have test failures due to backtrace behaviour changing, so set RUST_LIB_BACKTRACE=0, so that we only have backtrace on panics Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-04-30 19:38:13 +01:00
Aurélien Bombo	19371e2d3b	Merge pull request #11164 from wainersm/fix_kbs_on_aks tests/k8s: fix kbs installation on Azure AKS	2025-04-29 18:25:14 +01:00
Steve Horsman	3c8cc0cdbf	Merge pull request #11212 from BbolroC/add-cc-vfio-ap-test-s390x GHA: Add VFIO-AP to s390x nightly tests for CoCo	2025-04-29 16:15:00 +01:00
Hyounggyu Choi	63b9ae3ed0	GHA: Add VFIO-AP to s390x nightly tests for CoCo As #11076 introduces VFIO-AP bind/associate functions for IBM Secure Execution (SEL), a new internal nightly test has been established. This PR adds a new entry `cc-vfio-ap-e2e-tests` to the existing matrix to share the test result. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2025-04-29 16:06:12 +02:00
Wainer dos Santos Moschetta	460c3394dd	gha: run CoCo non-TEE tests on "all" host type By running on "all" host type there are two consequences: 1) run the "normal" tests too (until now, it's only "small" tests), so increasing the coverage 2) create AKS cluster with larger VMs. This is a new requirement due to the current ingress controller for the KBS service eating too many vCPUs and leaving only a few for the tests (resulting in failures) Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2025-04-28 12:08:31 -03:00
Steve Horsman	9248634baa	Merge pull request #11098 from stevenhorsman/golang-1.23.7 versions: Bump golang version	2025-04-28 13:46:11 +01:00
Steve Horsman	83d31b142b	Merge pull request #11044 from Jakob-Naucke/basic-s390x-ci ci: Extend basic s390x tests	2025-04-28 09:14:00 +01:00
stevenhorsman	c37840ce80	versions: Bump golang version Bump golang version to the latest minor 1.23.x release now that 1.24 has been released and 1.22.x is no longer stable and receiving security fixes Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-04-23 12:37:48 +01:00
stevenhorsman	ccfdf59607	workflows: Add apt update before install Add apt/apt-get updates before we do apt/apt-get installs to try and help with issues where we fail to fetch packages Co-authored-by: Fabiano Fidêncio <fidencio@northflank.com> Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-04-23 09:06:08 +01:00
Jakob Naucke	1c3b1f5adb	ci: Extend basic s390x tests Currently, s390x only tests cri-containerd. Partially converge to the feature set of basic-ci-amd64: - containerd-sandboxapi - containerd-stability - docker with the appropriate hypervisors. Do not run tests currently skipped on amd64, as well as - agent-ctl, which we don't package for s390x - nerdctl, does not package the `full` image for s390x - nydus, does not package for s390x Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com> Co-authored-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2025-04-22 21:34:02 +02:00
stevenhorsman	e6cca9da6d	ci: Remove metric jobs The metrics runner is broken, so skip the metrics jobs to stop the CI being stuck waiting. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-04-08 17:55:07 +01:00
Ruoqing He	96e43fbee5	ci: Enable `build-kata-static-tarball-riscv64.yaml` Previously we introduced `build-kata-static-tarball-riscv64.yaml`, enable that workflow in `ci.yaml`. Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>	2025-04-01 16:35:14 +08:00
Ruoqing He	46caa986bb	ci: Skip tests depend on virtualization on riscv64 `VMContainerCapable` requires a present `kvm` device, which is not yet available in our RISC-V runners. Skipped related tests if it is running on `riscv-builder`. Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>	2025-03-27 10:47:49 +08:00
Ruoqing He	7f0b1946c5	ci: Enable build-check for runtime on riscv64 `runtime` support for riscv64 is now ready, let's enable building and testing on that component. Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>	2025-03-27 10:38:30 +08:00
Ruoqing He	5e81f67ceb	ci: Generalize GITHUB_RUNNER_CI_ARM64 `GITHUB_RUNNER_CI_ARM64` is turned on for self hosted runners without virtualization to skip those tests that depend on virtualization. This may happen to other archs/runners as well, let's generalize it to `GITHUB_RUNNER_CI_NON_VIRT` so we can reuse it on other archs. Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>	2025-03-21 09:49:44 +08:00
Aurélien Bombo	a678046d13	gha: Pin third-party actions to commit hashes A popular third-party action has recently been compromised [1][2] and the attacker managed to point multiple git version tags to a malicious commit containing code to exfiltrate secrets. This PR follows GitHub's recommendation [3] to pin third-party actions to a full-length commit hash, to mitigate such attacks. Hopefully actionlint starts warning about this soon [4]. [1] https://www.cve.org/CVERecord?id=CVE-2025-30066 [2] https://www.stepsecurity.io/blog/harden-runner-detection-tj-actions-changed-files-action-is-compromised [3] https://docs.github.com/en/actions/security-for-github-actions/security-guides/security-hardening-for-github-actions#using-third-party-actions [4] https://github.com/rhysd/actionlint/pull/436 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2025-03-19 13:52:49 -05:00
Ruoqing He	cb7508ffdc	ci: Enable runtime-rs component build-check on riscv64 `runtime-rs` is now buildable and testable on riscv64 platforms, enable `build-check` on `runtime-rs`. Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>	2025-03-17 17:38:59 +08:00
Amulyam24	becb760e32	gha: use runner hooks instead of pre/post scripts for ppc64le runners This PR makes changes to remove steps to run scripts for preparing and cleaning the runner and instead use runner hooks env variables to manage them. Fixes: #9934 Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2025-03-14 17:12:54 +05:30
Ruoqing He	a7e953c7a7	ci: Enable static-tarball build for riscv64 Enable `kernel` and `virtiofsd` static-tarball build for riscv64. Since `virtiofsd` was previously supported and `kernel` is supported now. Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>	2025-03-13 13:43:29 +08:00
RuoqingHe	386fed342c	Merge pull request #10990 from kata-containers/shell-check-vendor-skip workflows: shellcheck: Expand vendor ignore	2025-03-12 21:34:26 +08:00
Steve Horsman	420b282279	Merge pull request #10948 from RuoqingHe/better-matrix ci: Refactor matrix for `build-checks`	2025-03-11 14:13:10 +00:00
stevenhorsman	ee0f0b7bfe	workflows: shellcheck: Expand vendor ignore - In the previous PR I only skipped the runtime/vendor directory, but errors are showing up in other vendor packages, so try a wildcard skip - Also update the job step so we can distinguish between the required and non-required versions Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-03-06 14:35:12 +00:00
Xuewei Niu	644af52968	Merge pull request #10876 from lifupan/fupan_containerd ci: cri-containerd: upgrade the LTS / Active versions for containerd	2025-03-06 17:08:40 +08:00
Fabiano Fidêncio	fd832d0feb	tests: kata-deploy: Run installation with only one VMM It doesn't make much sense to test different VMMs as that wouldn't trigger a different code path. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2025-03-05 19:44:27 +01:00
Fabiano Fidêncio	14bf653c35	tests: kata-deploy: Re-add tests, now using github runners As GitHub runners now support nested virt, we don't depend on garm for those anymore. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2025-03-05 19:44:27 +01:00
Fupan Li	7024d3c600	CI: cri-containerd: upgrade the LTS / Active versions for containerd As we're testing against the LTS and the Active versions of containers, let's upgrade the lts version from 1.6 to 1.7 and active version from 1.7 to 2.0 to cover the sandboxapi tests. Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>	2025-03-05 23:09:24 +08:00
Ruoqing He	186c88b1d5	ci: Move musl-tools installation into Setup rust `musl-tools` is only needed when a component needs `rust`, and the `instance` running is of `x86_64` or `aarch64`. Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>	2025-03-05 09:43:19 +08:00
Zvonko Kaiser	4bb0eb4590	Merge pull request #10954 from kata-containers/topic/metrics-kata-deploy Rework and fix metrics issues	2025-03-04 20:22:53 -05:00
stevenhorsman	fb1d4b571f	workflows: Add required shellcheck workflow Start with a required smaller set of shellchecks to try and prevent regressions whilst we fix the current problems Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-03-04 09:35:46 +00:00
stevenhorsman	b3972df3ca	workflows: Shellcheck - ignore vendor Ignore the vendor directories in our shellcheck workflow as we can't fix them. If there is a way to set this in shellcheckrc that would be better, but it doesn't seem to be implemented yet. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-03-04 09:35:46 +00:00
stevenhorsman	3fab7944a3	workflows: Improve metrics jobs - As the metrics tests are largely independent then allow subsequent tests to run even if previous ones failed. The results might not be perfect if clean-up is required, but we can work on that later. - Move the test results check out of the latency test that seems arbitrary and into its own job step - Add timeouts to steps that might fail/hang if there are containerd/K8s issues Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-03-01 17:50:05 +00:00
stevenhorsman	6f918d71f5	workflows: Update metrics jobs Currently the run-metrics job runs a manual install and does this in a separate job before the metrics tests run. This doesn't make sense as if we have multiple CI runs in parallel (like we often do), there is a high chance that the setup for another PR runs between the metrics setup and the runs, meaning it's not testing the correct version of code. We want to remove this from happening, so install (and delete to cleanup) kata as part of the metrics test jobs. Also switch to kata-deploy rather than manual install for simplicity and in order to test what we recommend to users. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-03-01 17:50:05 +00:00
Stephane Talbot	f80e7370d5	test: Verify deployment of kata-deploy on microk8s Enable functional test to verify deployment of kata-deploy on a Microk8s cluster Signed-off-by: Stephane Talbot <Stephane.Talbot@univ-savoie.fr>	2025-02-28 10:10:29 +01:00
Ruoqing He	09030ee96e	ci: Refactor build-checks workflow Refactor matrix setup and the corresponding dependency installation logic in `build-checks.yaml` and `build-checks-preview-riscv64.yaml` to provide better readability and maintainability. Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>	2025-02-28 09:47:25 +08:00
Ruoqing He	eb94700590	ci: Drop install-libseccomp matrix variant `install-libseccomp` is applied only for `agent` component, and we are already combining matrix with `if`s in steps, drop `install-libseccomp` in matrix to reduce complexity. Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>	2025-02-28 09:44:53 +08:00
Fabiano Fidêncio	96ed706d20	Merge pull request #10950 from fidencio/topic/skip-arm-check-tests-that-depend-on-virt ci: arm64: Skip tests that depend on virt on non-virt capable runners	2025-02-27 18:26:32 +01:00
Fabiano Fidêncio	e18e1ec3a8	ci: arm64: Skip tests that depend on virt on non-virt capable runners The GitHub hosted runners for ARM64 do not provide virtualisation support, thus we're just skipping the tests as those would check whether or not the system is "VMContainerCapable". Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2025-02-27 14:43:21 +01:00
Steve Horsman	f3c22411fc	Merge pull request #10930 from stevenhorsman/codeql-config workflows: Add codeql config	2025-02-27 12:43:41 +00:00
Ruoqing He	ec020399b9	ci: Enable partial components build-check on riscv Since we have RISC-V builders available now, let's start with `agent-ctl`, `trace-forwarder` and `genpolicy` components to run build-checks on these `riscv-builder`s, and gradually add the remaining components when they are ready, to catch up with other architectures eventually. This workflow could be manually triggered, `riscv-builder` will be the default instance when that is the case. Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>	2025-02-26 15:38:39 +08:00
stevenhorsman	c97e9e1592	workflows: Add codeql config I noticed that CodeQl using the default config hasn't scanned since May 2024, so figured it would be worth trying an explicit configuration to see if that gets better results. It's mostly the template, but updated to be more relevant: - Only scan PRs and pushes to the `main` branch - Set a pinned runner version rather than latest (with mac support) - Edit the list of languages to be scanned to be more relevant for kata-containers Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-02-25 15:05:43 +00:00
stevenhorsman	5000fca664	workflows: Add build-checks to manual CI Currently the ci-on-push workflow that runs on PRs runs two jobs: gatekeeper-skipper.yaml and ci.yaml. In order to test things like for the error ``` too many workflows are referenced, total: 21, limit: 20 ``` on topic branches, we need ci-devel.yaml to have an extra workflow to match ci-on-push, so add the build-checks as this is helpful to run on topic branches anyway. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-02-25 11:38:49 +00:00
stevenhorsman	23434791f2	workflows: Refactor publish workflows Replace the four different publish workflows with a single one that take input parameters of the arch and runner, so reduce the amount of duplicated code and try and avoid the ``` too many workflows are referenced, total: 21, limit: 20 ``` error	2025-02-25 10:49:09 +00:00
Fabiano Fidêncio	7bd444fa52	ci: Run k8s tests on arm64 Let's take advantage of the current arm64 runners, and make sure we have those tests running there as well. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org> Signed-off-by: Kevin Zhao <kevin.zhao@linaro.org>	2025-02-24 18:43:20 +01:00
Aurélien Bombo	adca339c3c	ci: Fix GH throttling in run-nerdctl-tests Specify a GH API token to avoid the below throttling error: https://github.com/kata-containers/kata-containers/actions/runs/13450787436/job/37585810679?pr=10911#step:4:96 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2025-02-21 17:52:17 -06:00
Hyounggyu Choi	d973d41efb	GHA: Turn off MEASURED_ROOTFS in build-kata-static-tarball-s390x This is the first attempt to remove the following code: ``` if [ "${ARCH}" == "s390x" ]; then export MEASURED_ROOTFS=no fi ``` from install_shimv2() in kata-deploy-binaries.sh. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2025-02-19 18:19:19 +01:00
Zvonko Kaiser	ca4d227562	gpu: Add qemu-tdx-experimental build We need to introduce again the qemu-tdx build for the GPU Depends-on: #10867 Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-02-19 14:48:56 +00:00
Zvonko Kaiser	1d9915147d	release: Remove artifacts for release We need to make sure the release does not have any residual binaries left for the release payload Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-02-17 20:16:48 +00:00
Adithya Krishnan Kannan	6cc5b79507	CI: Deprecate SEV Phase 1 of Issue #10840 AMD has deprecated SEV support on Kata Containers, and going forward, SNP will be the only AMD feature supported. As a first step in this deprecation process, we are removing the SEV CI workflow from the test suite to unblock the CI. Will be adding future commits to remove redundant SEV code paths. Signed-Off-By: Adithya Krishnan Kannan <AdithyaKrishnan.Kannan@amd.com>	2025-02-13 12:20:21 -06:00
Zvonko Kaiser	fbc8454d3d	Merge pull request #10866 from zvonkok/enable-cc-gpu-build gpu: enable confidential initrd build	2025-02-12 09:26:08 -05:00
Zvonko Kaiser	5431841a80	Merge pull request #10814 from kata-containers/shellcheck-gha gha: Add shellcheck	2025-02-11 18:30:41 -05:00
Zvonko Kaiser	b231a795d7	gha: Add shellcheck We need to start to fix our scripts. Lets run shellcheck and see what needs to be reworked. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-02-11 16:00:34 +00:00
Zvonko Kaiser	befb2a7c33	gpu: Confidential Initrd Start building the confidential initrd Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-02-11 15:41:36 +00:00
Fupan Li	a3fd3d90bc	ci: Add the sandbox api testcases A test case is added based on the integrated cri-containerd case. The difference between cri containerd integrated testcase and sandbox api testcase is the "sandboxer" setting in the sandbox runtime handler. If the "sandboxer" is set to "" or "podsandbox", then containerd will use the legacy shimv2 api, and if the "sandboxer" is set to "shim", then it will use the sandbox api to launch the pod. In addition, add a containerd v2.0.0 version. Because containerd officially supports the sandbox api from version 2.0.0. Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>	2025-02-11 15:21:53 +01:00
Fabiano Fidêncio	c9f5966f56	Merge pull request #10860 from kata-containers/topic/debug-ci workflows: build: Do not store unnecessary content on the tarball	2025-02-10 20:01:37 +01:00
Fabiano Fidêncio	ec290853e9	workflows: build: Do not store unnecessary content on the tarball Otherwise we may end up simply unpacking kata-containers specific binaries into the same location that system ones are needed, leading to a broken system (most likely what happened with the metrics CI, and also what's happening with the GHA runners). Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2025-02-10 18:57:29 +01:00
Fabiano Fidêncio	23cb5bb6c2	ci: Only use the Ubuntu TDX machine in the CI We've been hitting issues with the CentOS 9 Stream machine, which Intel doesn't have cycles to debug. After raising this up in the Confidential Containers community meeting we got the green light from Red Hat (Ariel Adam) to just disable the CI based on CentOS 9 Stream for now. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2025-02-10 12:50:16 +01:00
Zvonko Kaiser	45bd451fa0	ci: add arm64 attestation Do the very same thing that we do on amd64 and add attestation Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-02-05 16:30:20 +00:00
Zvonko Kaiser	9a7dff9c40	gpu: Add arm64 targets We want to make sure we deliver arm64 GPU targets as well Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-02-05 16:30:20 +00:00
Zvonko Kaiser	968318180d	ci: Add extratarballs steps We introduced extratarballs with a make target. The CI currently only uploads tarballs that are listed in the matrix. The NV kernel builds a headers package which needs to be uploaded as well. The get-artifacts has a glob to download all artifacts hence we should be good. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-02-05 16:30:20 +00:00
Zvonko Kaiser	b04bdf54a5	gpu: Add rootfs target amd64/arm64 Adding the initrd build first to get the rootfs on amd64. With that we can start to add tests. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2025-02-05 16:30:20 +00:00
Steve Horsman	9060904c4f	Merge pull request #10826 from kata-containers/topic/crio-test-timeouts workflows: Add delete kata-deploy timeouts for crio tests	2025-02-04 13:09:49 +00:00
stevenhorsman	d9eb1b0e06	versions: Bump golang version Bump golang versions so we are more up-to-date and have the extra security fixes Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-02-03 15:28:53 +00:00
stevenhorsman	5203158195	workflows: Add delete kata-deploy timeouts for crio tests I've also seen cases (the qemu, crio, k0s tests) where Delete kata-deploy is still running for this test after 2 hours, and had to be manually cancelled, so let's try adding a 5m timeout to the kata-deploy delete to stop CI jobs hanging. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-02-03 11:45:43 +00:00
stevenhorsman	d625f20d18	workflows: Move arm static checks runner Now we have the build-assets running on the gh-hosted runners, try the same approach for the static-checks Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-01-23 14:23:09 +00:00
stevenhorsman	ab27e11d31	workflows: Switch to github-hosted arm runner Now that github have hosted arm runners https://github.blog/changelog/2025-01-16-linux-arm64-hosted-runners-now-available-for-free-in-public-repositories-public-preview/ we should try and switch our arm64 builder jobs to run on these. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-01-22 16:27:17 +00:00
Ruoqing He	373a388844	ci: Retry on failure of Create AKS cluster The `Create AKS cluster` step in `run-k8s-tests-on-aks.yaml` is likely to fail since we are trying to issue `PUT` to `aks` in a relatively high frequency, while the `aks` end has its limit on `bucket-size` and `refill-rate`, documented here [1]. Use `nick-fields/retry@v3` to retry in 10 seconds after a request fails, based on observations that AKS requested 7 or 8 second delays before retry as part of their 429 response [1] https://learn.microsoft.com/en-us/azure/aks/quotas-skus-regions#throttling-limits-on-aks-resource-provider-apis Fixes: #10772 Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn> Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-01-22 13:24:51 +00:00
Aurélien Bombo	0d70dc31c1	ci: Unify on $GH_PR_NUMBER environment variable While working on #10559, I realized that some parts of the codebase use $GH_PR_NUMBER, while other parts use $PR_NUMBER. Notably, in that PR, since I used $GH_PR_NUMBER for CoCo non-TEE tests without realizing that TEE tests use $PR_NUMBER, the tests on that PR fail on TEEs: https://github.com/kata-containers/kata-containers/actions/runs/12818127344/job/35744760351?pr=10559#step:10:45 ... 44 error: error parsing STDIN: error converting YAML to JSON: yaml: line 90: mapping values are not allowed in this context ... 135 image: ghcr.io/kata-containers/csi-kata-directvolume: ... So let's unify on $GH_PR_NUMBER so that this issue doesn't repro in the future: I replaced all instances of PR_NUMBER with GH_PR_NUMBER. Note that since some test scripts also refer to that variable, the CI for this PR will fail (would have also happened with the converse substitution), hence I'm not adding the ok-to-test label and we should force-merge this after review. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2025-01-17 10:53:08 -06:00
stevenhorsman	9b6fce9e96	workflows: Add more ppc64le timeouts Unsurprisingly now we've got past the containerd test hangs on the ppc64le, we are hitting others in the "Prepare the self-hosted runner" stage, so add timeouts to all of them to avoid CI blockages. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-12-20 17:31:24 +00:00
stevenhorsman	d9d8d53bea	workflows: Add timeout to some ppc64le steps In some runs e.g. https://github.com/kata-containers/kata-containers/actions/runs/12426384186/job/34697095588 and https://github.com/kata-containers/kata-containers/actions/runs/12422958889/job/34697016842 we've seen the Prepare the self-hosted runner and Install dependencies steps get stuck for 5hours+. If they are working then it should take a few minutes, so let's add timeouts and not hold up the whole CI if they are stuck Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-12-20 16:37:36 +00:00
stevenhorsman	cf8b82794a	workflows: Only remove artifacts in release builds Due to the agent-api tests requiring the agent to be deployed in the CI by the tarball, so in the short-term lets only do this on the release stage, so that both kata-manager works with the release and the agent-api tests work with the other CI builds. In the longer term we need to re-evaluate what is in our tarballs (issue #10619), but want to unblock the tests in the short-term. Fixes: #10630 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-12-12 17:38:27 +00:00
stevenhorsman	e1f6aca9de	workflows: Remove potential timing issues with artifacts With the code I originally did I think there is potentially a case where we can get a failure due to timing of steps. Before this change the `build-asset-shim-v2` job could start the `get-artifacts` step and concurrently `remove-rootfs-binary-artifacts` could run and delete the artifact during the download and result in the error. In this commit, I try to resolve this by making sure that the shim build waits for the artifact deletes to complete before starting. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-12-12 16:52:54 +00:00
stevenhorsman	b4b3471bcb	workflows: linting: Fix shellcheck SC1001 > This \/ will be a regular '/' in this context Remove ignored escape Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-12-06 13:50:12 +00:00
stevenhorsman	491210ed22	workflows: linting: Fix shellcheck SC2006 > Use $(...) notation instead of legacy backticks `...` Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-12-06 13:50:12 +00:00
stevenhorsman	5d7c5bdfa4	workflows: linting: Fix shellcheck SC2015 > A && B \|\| C is not if-then-else. C may run when A is true Refactor the echo so that we can't get into a situation where the retry of workspace delete happens if the original one was successful Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-12-06 13:50:12 +00:00
stevenhorsman	c2ba15c111	workflows: linting: Fix shellcheck SC2206 > Quote to prevent word splitting/globbing Double quote variables expanded in an array Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-12-06 13:50:12 +00:00
stevenhorsman	007514154c	workflows: linting: Fix shellcheck SC2068 > Double quote array expansions to avoid re-splitting elements Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-12-06 13:50:12 +00:00

900 Commits