kata-containers

mirror of https://github.com/kata-containers/kata-containers.git synced 2025-04-28 03:42:09 +00:00

Author	SHA1	Message	Date
Aurélien Bombo	a678046d13	gha: Pin third-party actions to commit hashes A popular third-party action has recently been compromised [1][2] and the attacker managed to point multiple git version tags to a malicious commit containing code to exfiltrate secrets. This PR follows GitHub's recommendation [3] to pin third-party actions to a full-length commit hash, to mitigate such attacks. Hopefully actionlint starts warning about this soon [4]. [1] https://www.cve.org/CVERecord?id=CVE-2025-30066 [2] https://www.stepsecurity.io/blog/harden-runner-detection-tj-actions-changed-files-action-is-compromised [3] https://docs.github.com/en/actions/security-for-github-actions/security-guides/security-hardening-for-github-actions#using-third-party-actions [4] https://github.com/rhysd/actionlint/pull/436 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2025-03-19 13:52:49 -05:00
Ruoqing He	373a388844	ci: Retry on failure of Create AKS cluster The `Create AKS cluster` step in `run-k8s-tests-on-aks.yaml` is likely to fail since we are issuing `PUT` requests to `aks` at a relatively high frequency, while the `aks` end has its limits on `bucket-size` and `refill-rate`, documented here [1]. Use `nick-fields/retry@v3` to retry 10 seconds after a request fails, based on observations that AKS requested 7- or 8-second delays before retrying as part of its 429 response. [1] https://learn.microsoft.com/en-us/azure/aks/quotas-skus-regions#throttling-limits-on-aks-resource-provider-apis Fixes: #10772 Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn> Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2025-01-22 13:24:51 +00:00
Dan Mihai	1a4928e710	gha: enable AUTO_GENERATE_POLICY where needed The behavior of Kata CI doesn't change. For local testing using kubernetes/gha-run.sh: 1. Before these changes: - AUTO_GENERATE_POLICY=yes was always used by the users of SEV, SNP, TDX, or KATA_HOST_OS=cbl-mariner. 2. After these changes: - Users of SEV, SNP, TDX, or KATA_HOST_OS=cbl-mariner must specify AUTO_GENERATE_POLICY=yes if they want to auto-generate policy. - These users have the option to test just using hard-coded policies (e.g., using the default policy built into the Guest rootfs) by using AUTO_GENERATE_POLICY=no. AUTO_GENERATE_POLICY=no is the default value of this env variable. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-10-02 23:20:33 +00:00
Aurélien Bombo	de98e467b4	ci: Use `ubuntu-22.04` instead of `ubuntu-latest` 22.04 is the default today: `23da668261/README.md` Being more specific will avoid unexpected errors when Github updates the default. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2024-08-27 16:44:39 +00:00
Wainer dos Santos Moschetta	73ab5942fb	tests/k8s: run for qemu-runtime-rs on AKS The following tests are disabled because they fail (alike with dragonball): - k8s-cpu-ns.bats - k8s-number-cpus.bats - k8s-sandbox-vcpus-allocation.bats Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-06-13 16:20:59 -03:00
Wainer dos Santos Moschetta	1e35291fd5	gha: move attestation tests to run-k8s-tests-coco-nontee The new run-k8s-tests-coco-nontee job should be the home of attestation tests. Changed run-k8s-tests-coco-nontee to get KBS installed and by the time the KBS variable is exported in the environment then the attestation tests will kick in (likewise they will skip in run-k8s-tests-on-aks). Fixes #9455 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-04-19 14:51:30 -03:00
Greg Kurz	424a5e243f	gha: Bump to `actions/[down\|up]load-artifact@v4` (all the rest) `Node.js 19` is deprecated. Bump to a new version based on `Node.js 20`. This fixes all remaining sites. Fixes #9245 Signed-off-by: Greg Kurz <groug@kaod.org>	2024-04-05 18:36:51 +02:00
Saul Paredes	8a92e81f98	gha: add GENPOLICY_PULL_METHOD Add GENPOLICY_PULL_METHOD that will be used to test pulling container images in genpolicy using the oci-distribution crate and/or the containerd interface. GENPOLICY_PULL_METHOD will start being used in a future PR. Fixes: #9384 Signed-off-by: Saul Paredes <saulparedes@microsoft.com>	2024-04-02 19:03:28 -07:00
Aurélien Bombo	71a1be9c57	ci: aks: also run tests in normal instance for Mariner Currently we're only running the small instance tests. This adds the normal instance tests as well. Fixes: #9298 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2024-03-18 23:33:17 +00:00
Wainer dos Santos Moschetta	4410df7233	gha: increase timeout of KBS steps The step to deploy KBS on run-k8s-tests-on-aks workflow should be increased so that there is enough time for checking the service is healthy and exposed. Likewise the step that builds the kbs-client which requires enough time to build the executable. Fixes #9058 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-02-29 22:05:58 -03:00
Wainer dos Santos Moschetta	b44e0c4e7c	gha: k8s: prepare AKS workflow to install the CoCo KBS Changed the "run k8s tests on AKS" workflows to get the CoCo KBS installed so that we can run attestation tests. The plan is to run attestation tests only on a subset of non-TEE jobs initially, so this commit restricts the KBS installation to the kata-qemu configuration. At this point, only stub commands are added to tests/integration/kubernetes/gha-run.sh; they should be implemented in a future commit. Fixes #9058 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-02-27 13:51:15 -03:00
Gabriela Cervantes	6771ca463b	gha: k8s: Add cloud-hypervisor (runtime-rs) support This PR adds the Cloud Hypervisor driver, integrated with the runtime-rs, as part of the kubernetes tests different with devmapper. Fixes #8995 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-02-01 21:22:56 +00:00
Fabiano Fidêncio	448c0aaecb	gha: azure: Set the correct subscription to the account Due to the changes done in the CI, we need to set the correct subscription to be used with the account from now on, otherwise we'd end up using CoCo subscription. Fixes: #8946 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-01-29 15:00:38 +01:00
Dan Mihai	ea9c659d36	gha: get ready to install genpolicy The changes to install and test genpolicy must come later, after CI picks up these gha changes. Fixes: #8856 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-01-19 23:37:49 +00:00
Fabiano Fidêncio	61aa84b158	Revert "tests: k8s: Allow passing rust-runtime env var to kata-deploy" This reverts commit `44899d4cdf`, as we've decided to keep both golang and rust runtime installable and usable at the same time. The decision of having both runtimes installable and usable will help users to test and easily catch any possible differences between those runtimes, helping us to get on par with both implementations. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-11-28 18:02:07 +01:00
Fabiano Fidêncio	44899d4cdf	tests: k8s: Allow passing rust-runtime env var to kata-deploy This will be used for selecting the correct runtimes and runtimeclasses to be deployed with kata-deploy. Fixes: #8475 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-11-20 09:13:05 +01:00
Liu Wenyuan	c77e990c3e	tests: Enable tests for StratoVirt hypervisor This commit enables StratoVirt hypervisor to be tested in kata GHA, including k8s, metrics, cri-containerd, nydus and so on. It also adds some unit tests for StratoVirt to make sure it works. Fixes: #7794 Signed-off-by: Liu Wenyuan <liuwenyuan9@huawei.com>	2023-11-16 20:47:26 +08:00
Fabiano Fidêncio	c5cfad7023	actions: Move all the checkout actions to v4 It's been released for a while now, and we need to keep consistency between what we used. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-10-23 14:01:53 +02:00
Fabiano Fidêncio	c69a1e33bd	ci: Use variable size of VMs depending on the tests running Let me start with a fair warning that this commit is hard to split into different parts that could be easily tested (or not tested, just ignored) without breaking pieces. Now, about the commit itself, as we're on the run to reduce costs related to our sponsorship on Azure, we can split the k8s tests we run in 2 simple groups: * Tests that can be run in the smaller Azure instance (D2s_v5) * Tests that require the normal Azure instance (D4s_v5) With this in mind, we're now passing to the tests which type of host we're using, which allows us to select to run either one of the two types of tests, or even both in case of running the tests on a baremetal system. Fixes: #7972 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-16 09:13:54 +02:00
GabyCT	b384757ac7	Merge pull request #7874 from fidencio/topic/manually-rebase-branches-atop-of-the-target-one gha: Manually rebase PR atop of the target branch before testing	2023-09-11 10:35:01 -06:00
Fabiano Fidêncio	bd24afcf73	gha: Manually rebase PR atop of the target branch before testing We're changing what's been done as part of `ac939c458c`, as we've noticed issues using `github.event.pull_request.merge_commit_sha`. Basically, whenever a force-push would happen, the reference of merge_commit_sha wouldn't be updated, leading us to test PRs with the old code. :-/ In order to get the rebase properly working, we need to ensure we pull the hash of the commit as part of checkout action, and ensure fetch-depth is set to 0. Fixes: #7414 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-08 18:56:31 +02:00
Fabiano Fidêncio	fa62a4c01b	ci: k8s: Export KUBERNETES env var So we have a better control on which flavour of kubernetes kata-deploy is expected to be targeting. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-09-08 10:09:04 +02:00
Fabiano Fidêncio	91e1e612c3	k8s: Rely on the USING_NFD environment variable passed by the jobs Let's make sure we can rely on the tests passing down whether they want to be tested using Node Feature Discovery or not. Right now, only the TDX job has this option set to "true", all the other jobs have this option set to "false". We can and have to merge this one before merging the NFD related patches as: 1) It causes no harm in exporting this environment variable, but not having it used 2) It will allow us to test the NFD after this one is merged, as changes in the yaml file, in the case of the pull_request_target event, are not taken into consideration before they're merged Fixes: #7495 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-31 13:30:18 +02:00
Fabiano Fidêncio	f28af98ac6	Merge pull request #7453 from sprt/fix-ci-node-debugger tests: Fix `k8s-job` test	2023-07-26 22:27:21 +02:00
Aurélien Bombo	c5a87eed29	tests: gha: Add timeout to cluster creation This has been intermittently taking a while lately so let's add a timeout. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-07-26 10:19:07 -07:00
Aurélien Bombo	bdde6aa948	tests: k8s: Split deployment and testing commands This splits deploying Kata and running the tests into separate commands to make it possible to rerun tests locally without having to redeploy Kata each time. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-07-25 15:44:46 -07:00
Fabiano Fidêncio	2ee2cd307b	ci: k8s: Move gha-run.sh to the kubernetes dir The file belongs there, as it's only used for k8s related tests. Fixes: #7373 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-18 15:45:06 +02:00
Fabiano Fidêncio	cc3993d860	gha: Pass event specific info from the caller workflow Let's ensure we're not relying, on any of the called workflows, on event specific information. Right now, the two pieces of information we've been relying on are: * PR number, coming from github.event.pull_request.number * Commit hash, coming from github.event.pull_request.head.sha As we want to, in the future, add nightly jobs, which will be triggered by a different event (thus, having different fields populated), we should ensure that those are not used unless it's in the "top action" that's triggered by the event. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-06 11:23:17 +02:00
Aurélien Bombo	f487199edf	gha: aks: Fix argument in call to gha-run.sh Fixes: #7047 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-06-06 11:51:18 -07:00
Aurélien Bombo	aab6030962	gha: aks: Extract `run` commands to a script Github Actions reads and runs workflow files from the main branch, rather than from the PR branch. This means that PRs that modify workflow files aren't being tested with the updated workflows coming from the PR, but rather with the old workflows from the main branch. AFAIK, this behavior isn't avoidable for workflow files (but is for other scripts). This makes it very hard to reliably test workflow changes before they're actually merged into main and leads to issues that we have to hotfix (see #6983, #6995). This PR aims to mitigate that by extracting the commands used in workflows to a separate script file. The way our CI is set up, those script files are read from the PR branch and thus changes would be reflected in the CI checks. Fixes: #6971 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-06-02 10:22:35 -07:00
Jeremi Piotrowski	1c6d22c803	gha: aks: Use short SHA in cluster name Full SHA is 40 characters, while AKS cluster name has a limit of 63. Trim the SHA to 12 characters, which is widely considered to be unique enough and is short enough to be used in the cluster name Fixes: #7010 Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>	2023-06-01 14:03:53 +02:00
Fabiano Fidêncio	aebd3b47d9	gha: aks: Ensure host_os is used everywhere needed We added that to create the cluster name, but I forgot to add that to the part we get the k8s config file, or to the part where we delete the AKS cluster. Fixes: #6999 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-31 20:50:55 +02:00
Fabiano Fidêncio	0c8282c224	gha: aks: Add the host_os as part of the aks cluster's name We need to do so, otherwise we'll create two clusters for testing Cloud Hypervisor with exactly the same name, one using Ubuntu, and one using Mariner. Fixes: #6999 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-31 05:20:04 +02:00
Aurélien Bombo	03027a7399	gha: Fix Mariner cluster creation While the Mariner Kata host is in preview, we need the `aks-preview` extension to enable the `--workload-runtime KataMshvVmIsolation` flag. Fixes: #6994 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-05-30 13:26:49 -07:00
Aurélien Bombo	af16d3fca4	gha: Unbreak CI and fix cluster creation step This fixes the regression introduced by #6686 by properly injecting the `--os-sku mariner --workload-runtime KataMshvVmIsolation` flags. Error reference: https://github.com/kata-containers/kata-containers/actions/runs/5111460297/jobs/9188819103 Fixes: #6982 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-05-29 13:32:47 -07:00
Aurélien Bombo	4af4ced1aa	gha: Create Mariner host as part of k8s tests The current testing setup only supports running Kata on top of an Ubuntu host. This adds Mariner to the matrix of testable hosts for k8s tests, with Cloud Hypervisor as a VMM. As preparation for the upcoming PR that will change only the actual test code (rather than workflow YAMLs), this also introduces a new file `setup.sh` that will be used to set host-specific parameters at test run-time. Fixes: #6961 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-05-25 14:29:46 -07:00
Fabiano Fidêncio	557b840814	gha: aks: Wait longer to start running the tests We're still facing issues related to the time taken to deploy the kata-deploy daemonset and starting to run the tests. Ideally, we should solve this with a readiness probe, and that's the approach we want to take in the future. However, for now, let's just make sure those tests are not in the way of the community. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-25 10:13:19 +02:00
Fabiano Fidêncio	c04c872c42	gha: aks: Increase the timeout time We've seen tests being aborted close to the end of the run due to the timeout. Let's increase it, to avoid hitting such cases again. Fixes: #6964 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-25 10:13:08 +02:00
Fabiano Fidêncio	ad324adf1d	gha: aks: Wait a little bit more before run the tests `fa832f4709` increased the timeout, which helped a lot, mainly in the TEE machines. However, we're still seeing some failures here and there with the AKS tests. Let's bump it yet again and, hopefully, those errors to start the tests will go away. Fixes: #6905 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-19 16:40:35 +02:00
Fabiano Fidêncio	fa832f4709	gha: k8s: Make the tests more reliable Whether we like it or not, every now and then we'll have to deal with flaky tests, and our tests using GHA are not exempt from that fact. With this simple commit, we're trying to improve the reliability of the tests in a few different fronts: * Giving enough time for the script used by kata-deploy to be executed * We've hit issues as the kata-deploy pod is considered "Ready" at the moment it starts running, not when it finishes the needed setup. We should also be looking on how to solve this on the kata-deploy side but, for now, let's ensure our tests do not break with the current kata-deploy behavior. * Merging the "Deploy kata-deploy" and "Run tests" steps * We've hit issues re-running tests and seeing even more failures than the ones we're trying to debug, as a step will simply be taken as succeeded as part of the re-run, in case it was successfully executed as part of the first run. This causes issues with the kata-deploy deployment, as the tests would start running before even having the node set up for running Kata Containers. Fixes: #6865 #6649 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-17 13:38:08 +02:00
Fabiano Fidêncio	49ce685ebf	gha: k8s-on-aks: Always delete the AKS cluster Regardless of the tests succeeding or failing, the AKS cluster must be deleted. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 13:40:40 +02:00
Fabiano Fidêncio	79f3047f06	gha: k8s-on-aks: {create,delete} AKS must be a coded-in step I should have seen this coming, but currently the "create" and "delete" AKS workflows cannot be imported and used as a job's step, resulting in an error trying to find the corresponding action.yaml file for those. Fixes: #6630 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-06 22:56:08 +02:00
Fabiano Fidêncio	c7ee45f7e5	Revert "gha: ci-on-push: Adapt chained jobs to workflow_run" This reverts commit `7855b43062`. Unfortunately we have to revert the PRs related to the switch done to using `workflow_run` instead of `pull_request_target`. The reason for that being that we can only mark jobs as required if they are targeting PRs. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-06 19:09:54 +02:00
Fabiano Fidêncio	5d4d720647	Revert "gha: k8s-on-aks: Fix cluster name" This reverts commit `85cc5bb534`. Unfortunately we have to revert the PRs related to the switch done to using `workflow_run` instead of `pull_request_target`. The reason for that being that we can only mark jobs as required if they are targeting PRs. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-06 19:07:04 +02:00
Fabiano Fidêncio	13d857a56d	gha: k8s-on-aks: Set {create,delete}_aks as steps We've been currently using {create,delete}_aks as jobs. However, it means that if the tests fail we'll end up deleting the AKS cluster (as expected), but not having a way to recreate the cluster without re-running all jobs, which is a waste of resources. Fixes: #6628 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-06 16:54:15 +02:00
Fabiano Fidêncio	85cc5bb534	gha: k8s-on-aks: Fix cluster name This was missed from the last series, as GHA will use the "target branch" yaml file to start the workflow. Basically we changed the name of the cluster created to stop relying on the PR number, as that's not easily accessible on `workflow_run`. Fixes: #6611 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-06 08:50:07 +02:00
Fabiano Fidêncio	68cb5689f5	Merge pull request #6584 from fidencio/topic/gha-k8s-also-test-dragonball gha: Also run k8s tests on AKS with dragonball	2023-04-05 22:50:14 +02:00
Fabiano Fidêncio	7855b43062	gha: ci-on-push: Adapt chained jobs to workflow_run As we're using the `workflow_run` event, the checkout action would pull the current target branch instead of the PR one. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-05 12:54:44 +02:00
Fabiano Fidêncio	8086c75f61	gha: Also run k8s tests on AKS with dragonball As already done for Cloud Hypervisor and QEMU, let's make sure we can run the AKS tests using dragonball. Fixes: #6583 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-04 10:58:47 +02:00
Fabiano Fidêncio	d17dfe4cdd	gha: Use ghcr.io for the k8s CI Let's switch to using the `ghcr.io` registry for the k8s CI, as this will save us some troubles on running the CI with PRs coming from forked repos. Fixes: #6587 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-03 15:52:33 +02:00

51 Commits