`TestDeviceCgroup` is added to cri-containerd's integration tests. The test
launches two containers, each with a block device, and checks the
validity of the device cgroup.
Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>
Add a test to verify that kata supports ipvlan networks.
This test can be a bit tricky, as it requires knowledge about the host
interfaces to be used as a master for the ipvlan network.
However, with GitHub Actions, we can assume an interface called eth0 to
be present on the host and functioning.
Fixes: #8366
Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>
Encode policy file during test - easier to understand than hard-coding
the encoded file contents.
Fixes: #8214
Signed-off-by: Dan Mihai <dmihai@microsoft.com>
The KUBERNETES variable is mostly used by kata-deploy to decide whether
to apply k3s-specific deployments or not. It is used to select the type
of kubernetes to be installed (k3s, k0s, rancher, etc.) and it is always
set on CI. When running the script locally, we want to set a default
value to avoid `KUBERNETES: unbound variable` errors.
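A minimal sketch of the defaulting (the "k3s" fallback value here is an
assumption, not necessarily what the script picks):
```sh
# Hypothetical fallback; CI always exports KUBERNETES explicitly.
KUBERNETES="${KUBERNETES:-k3s}"
```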
Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>
This test can give a false positive on a multi-node cluster. Changed it
to use the new get_one_kata_node() and the modified exec_host() to run
the setup commands on a given node (that has kata installed) and ensure
the test pod is scheduled on that same node.
Fixes: #7619
Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>
This test can give a false positive on a multi-node cluster. Changed it
to use the new get_one_kata_node() and the modified exec_host() to run
the setup commands on a given node (that has kata installed) and ensure
the test pod is scheduled on that same node.
Fixes: #7619
Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>
The exec_host() simply fails on a multi-node cluster because
`kubectl get node -o name` will return a list of names. Moreover, it
will return control node names, which usually don't have kata installed.
Fixes: #7619
Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>
The introduced get_one_kata_node() returns the first node that
has the kata-runtime=true label, i.e., supposedly a node with
kata installed.
This is useful for tests that should run on a specific worker
node in a multi-node cluster.
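A minimal sketch of what such a helper could look like (the exact
kubectl invocation is an assumption based on the label mentioned above):
```sh
# Return the name of the first node labelled as having kata installed.
get_one_kata_node() {
    kubectl get node -l kata-runtime=true -o name | head -n 1 | sed 's|^node/||'
}
```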
Fixes: #7619
Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>
Let KATA_HYPERVISOR be qemu by default in gha-run.sh, as this variable
is required to tweak some configurations of kata-deploy.
Fixes: #7620
Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>
The deploy-kata() of gha-run.sh will wait for 10 minutes for the
kata-deploy installation to finish. This allows users of the script to
override that value by exporting the KATA_DEPLOY_WAIT_TIMEOUT
environment variable.
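A rough sketch of the idea (the pod label and namespace are
assumptions, not necessarily what deploy-kata() uses):
```sh
KATA_DEPLOY_WAIT_TIMEOUT="${KATA_DEPLOY_WAIT_TIMEOUT:-10m}"
# Wait for the kata-deploy pods to become Ready, honouring the override.
kubectl -n kube-system wait --timeout="${KATA_DEPLOY_WAIT_TIMEOUT}" \
    --for=condition=Ready pod -l name=kata-deploy
```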
Fixes: #7620
Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>
Fixed a couple of warnings shellcheck emitted and disabled others:
* SC2154 (var is referenced but not assigned)
* SC2086 (Double quote to prevent globbing and word splitting)
Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>
The only difference from the other platforms is that it needs to
export KUBECONFIG.
Fixes: #7620
Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>
The kcli deployment behaves like the kata deploy for the other
bare-metal platforms (e.g. sev, tdx, etc.) except that KUBECONFIG
should be exported.
Fixes: #7620
Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>
The cleanup-kcli() behaves like the other bare-metal clean ups (e.g.
sev, tdx, etc.) except that KUBECONFIG should be exported.
Fixes: #7620
Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>
On CI workflows the variables DOCKER_REGISTRY, DOCKER_REPO and
DOCKER_TAG are exported to match the built image. However, when running
the script outside of the CI context, a developer might just use the
latest image, which in this case will be
`quay.io/kata-containers/kata-deploy-ci:kata-containers-latest`.
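For illustration only, the defaults could be wired up along these lines
(the variable composition and the KATA_DEPLOY_IMAGE name are
assumptions):
```sh
DOCKER_REGISTRY="${DOCKER_REGISTRY:-quay.io}"
DOCKER_REPO="${DOCKER_REPO:-kata-containers/kata-deploy-ci}"
DOCKER_TAG="${DOCKER_TAG:-kata-containers-latest}"
# Resolves to quay.io/kata-containers/kata-deploy-ci:kata-containers-latest
# when nothing is exported by the CI.
KATA_DEPLOY_IMAGE="${DOCKER_REGISTRY}/${DOCKER_REPO}:${DOCKER_TAG}"
```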
Fixes: #7620
Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>
Adapted the gha-run.sh script to create a Kubernetes cluster locally
using the kcli tool.
Use `./gha-run.sh create-cluster-kcli` to create it, and
`./gha-run.sh delete-cluster-kcli` to delete it.
Fixes: #7620
Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>
I'm basically moving the runk tests from the tests repo to this one, and
I'm adding the "Signed-off-by:" of every single contributor to the tests.
Fixes: #8116
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
Signed-off-by: Chen Yiyang <cyyzero@qq.com>
Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>
The k8s.gcr.io registry has been deprecated for a while now and has been
redirected to registry.k8s.io. However, on some bare-metal machines in
our testing pools that redirection is not working, so let's just replace
the registries.
Fixes: #8098
Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>
(cherry picked from commit b2c3bca558c38deff2117d5909d9071c23c05590)
We've faced this as part of the CI, only happening with the CRI-O tests:
```
not ok 1 Test readonly volume for pods
# (from function `exec_host' in file tests_common.sh, line 51,
# in test file k8s-file-volume.bats, line 25)
# `exec_host "echo "$file_body" > $tmp_file"' failed with status 127
# [bats-exec-test:38] INFO: k8s configured to use runtimeclass
# bash: line 1: $'\r': command not found
#
# Error from server (NotFound): pods "test-file-volume" not found
```
I must say I didn't dig into figuring out why this is happening, but we
should be safe enough to just trim the trailing '\r', as long as all the
tests keep passing on containerd.
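A minimal sketch of the trimming (the exec_host internals here are
assumed, not copied from tests_common.sh):
```sh
# Strip any carriage returns that sneak into the command string on CRI-O runs.
cmd="${cmd//$'\r'/}"
```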
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
This is based on the official CRI-O documentation[0] and right now
we're making this specific to Ubuntu, as that's what we have as runners.
We may want to expand this in the future, but we're good for now.
[0]:
https://github.com/cri-o/cri-o/blob/main/install.md#apt-based-operating-systems
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
Otherwise only the first test will be executed
Signed-off-by: Aurélien Bombo <abombo@microsoft.com>
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
We're hitting a specific issue after updating, which will require some
work on dragonball before it can be re-added here.
The issue:
```
...
3: failed to do rafs mount\\n
4: fail to attach rafs \\\"/var/lib/containerd-nydus/snapshots/2/fs/image/image.boot\\\"\\n
5: add share fs mount\\n
6: Mount rafs at
/rafs/197ef3db03c86b91bf3045ff59183ce8b5750941ad1d3484f4a8301a70f5109f/rootfs_lower
error: Failed to Mount backend
...
Caused by:
vmm action error: FsDevice(AttachBackendFailed(\\\"attach/detach a
backend filesystem failed:: missing field `version` at line 1 column
489\\\"))\"): unknown"
```
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
This will ensure we're testing with the correct runtime, instead of
using the `default` one.
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
To support the v0.12.0 nydus-snapshotter, we need to update the config
files and the commandline to start nydus-snapshotter.
Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>
And with this we finally enable the nydus tests to run as part of our
GHA CI.
Fixes: #6543
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
We've been simply doing nothing whenever `install-kata` was called, and
that was the intent when we added the placeholder calls.
Now, let's install kata, as expected. :-)
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
As we've added install_nydus() and install_nydus_snapshotter(), which
conform to the pattern we're following on GHA, let's rely on them
rather than relying on the bits coming from nydus_test.sh.
Later on we'll have install_nydus() and install_nydus_snapshotter() as
part of the dependencies install in our `gha-run.sh`.
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
Similarly to what's been done for the cri-containerd tests, as part of
84dd02e0f9, we need to add the timeout
here for the crictl calls.
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
Otherwise we may face errors like:
```
getting sandbox status of pod "d3af2db414ce8": metadata.Name,
metadata.Namespace or metadata.Uid is not in metadata
"&PodSandboxMetadata{Name:nydus-sandbox,Uid:,Namespace:default,Attempt:1,}"
getting sandbox status of pod "-A": rpc error: code = NotFound desc = an
error occurred when try to find sandbox: not found
```
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
Otherwise we cannot properly start the nydus snapshotter, nor properly
kill it after it's been started.
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
The "source ..." we've been doing was not changed since those tests were
part of the Jenkins tests, and we need to adapt them, either setting the
correct path or entirely removing the ones that are not relevant to us
anymore.
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
Let me start with a fair warning that this commit is hard to split into
different parts that could be easily tested (or not tested, just
ignored) without breaking pieces.
Now, about the commit itself: as we're trying to reduce the costs
related to our sponsorship on Azure, we can split the k8s tests we run
into 2 simple groups:
* Tests that can be run in the smaller Azure instance (D2s_v5)
* Tests that require the normal Azure instance (D4s_v5)
With this in mind, we're now passing to the tests which type of host
we're using, which allows us to select either one of the two groups of
tests, or even both in the case of running the tests on a bare-metal
system.
Fixes: #7972
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
The list of tests which require a bigger VM instance is:
* k8s-number-cpus.bats -- failing on all CIs
* k8s-parallel.bats -- only failing on the cbl-mariner CI
* k8s-scale-nginx.bats -- only failing on the cbl-mariner CI
We'll keep those disabled while we re-work the logic to **only run
those** in a bigger (and more expensive) VM instance.
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
Without setting the cpu limit / request to 1, we can make this test run
in a smaller VM instance without any issue.
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
Otherwise we'll fail to configure kata-containers in the `install-kata`
step.
This is mostly needed because the nerdctl-full tarball doesn't provide a
containerd configuration, just the binary, as containerd does not
actually require a configuration file to run with the default config.
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
TIL that the Azure VMs we use are created without an explicit outbound
connectivity defined.
This leads us to issues using `ping ...` as part of our tests, and when
consulting Jeremi Piotrowski about the issue he pointed me to two
interesting links:
* https://learn.microsoft.com/en-us/azure/virtual-network/ip-services/default-outbound-access
* https://learn.microsoft.com/en-us/archive/blogs/mast/use-port-pings-instead-of-icmp-to-test-azure-vm-connectivity
For your own sanity, do not read the comments; after all, this is the
internet. :-)
Anyway, the suggestion is to use nping instead, which is provided by
the nmap package, so we can explicitly switch to using TCP port 80
for the ping. With this in mind, I'm switching the image we use for the
test to one that provides nping as a possible entry point, and
from now on (this part of) the tests should work.
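A hedged example of the kind of check this enables (the target host is
just a placeholder):
```sh
# Probe TCP port 80 instead of relying on ICMP, which Azure may not route.
nping --tcp -p 80 -c 1 www.example.com
```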
Fixes: #7910
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
TIL that the Azure VMs we use are created without an explicit outbound
connectivity defined.
This leads us to issues using `ping ...` as part of our tests, and when
consulting Jeremi Piotrowski about the issue he pointed me to two
interesting links:
* https://learn.microsoft.com/en-us/azure/virtual-network/ip-services/default-outbound-access
* https://learn.microsoft.com/en-us/archive/blogs/mast/use-port-pings-instead-of-icmp-to-test-azure-vm-connectivity
For your own sanity, do not read the comments; after all, this is the
internet. :-)
Anyway, the suggestion is to use nping instead, which is provided by
the nmap package, so we can explicitly switch to using TCP port 80
for the ping. With this in mind, I'm switching the image we use for the
test to one that provides nping as a possible entry point, and
from now on (this part of) the tests should work.
Fixes: #7910
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
Fix kernel and initrd annotations in the k8s tests on Mariner. These
annotations must be applied to the spec.template for Deployment, Job
and ReplicationController resources.
Fixes: #7764
Signed-off-by: Dan Mihai <dmihai@microsoft.com>
Let's add a very basic sanity test to check that we can spawn a
container using nerdctl + Kata Containers.
This will ensure that, at least, we don't regress to the point where
this feature doesn't work at all.
In the future, we should also test all the VMMs with devmapper, but
that's for a follow-up PR after this test is working as expected.
Fixes: #7911
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
Let's add a very basic sanity test to check that we can spawn a
container using docker + Kata Containers.
This will ensure that, at least, we don't regress to the point where
this feature doesn't work at all.
For now we're running this test against Cloud Hypervisor and QEMU only,
due to an already reported issue with dragonball:
https://github.com/kata-containers/kata-containers/issues/7912
In the future, we should also test all the VMMs with devmapper, but
that's for a follow-up PR after this test is working as expected.
Fixes: #7910
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
There's absolutely no need to have the skip check as part of the test
itself when it's already done as part of the setup function.
We're only touching the files here that were touched in the previous
commit.
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
Let's keep both checks for now, but in the future we'll be able to
remove the check for "firecracker", as the hypervisor name used as part
of the GitHub Actions has to match what's used as part of the
kata-deploy stuff, which is `fc` (as in `kata-fc` for the runtime class)
instead of `firecracker`.
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
We've been using the `kata-deploy-tdx` target as that also uses k3s as
base, but it's better to just have a specific garm target.
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
So we have better control over which flavour of kubernetes kata-deploy
is expected to be targeting.
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
As we were using `tee` without the `-a` (or `--append`) option, the
containerd config would be overwritten, leading to a NotReady state of
the Node.
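The difference boils down to appending instead of truncating; a minimal
illustration (the appended snippet is just a placeholder):
```sh
# Without -a, tee truncates the file and the existing containerd config is lost.
echo '# extra kata-containers runtime config' | sudo tee -a /etc/containerd/config.toml
```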
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
Let's download the vanilla kubectl binary into `/usr/bin/`, as we need
to avoid hitting issues like:
```sh
error: open /etc/rancher/k3s/k3s.yaml.lock: permission denied
```
The issue basically happens because k3s links `/usr/local/bin/kubectl`
to `/usr/local/bin/k3s`, and that does extra stuff that vanilla
`kubectl` doesn't do.
Also, in order to properly use the k3s.yaml config with the vanilla
kubectl, we're copying it to ~/.kube/config.
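A rough sketch of both steps, assuming the standard upstream kubectl
download URL:
```sh
# Download the vanilla kubectl binary into /usr/bin/ (amd64 assumed here).
version="$(curl -Ls https://dl.k8s.io/release/stable.txt)"
sudo curl -fL -o /usr/bin/kubectl \
    "https://dl.k8s.io/release/${version}/bin/linux/amd64/kubectl"
sudo chmod +x /usr/bin/kubectl

# Make the k3s-generated kubeconfig usable by the vanilla kubectl.
mkdir -p "${HOME}/.kube"
cp /etc/rancher/k3s/k3s.yaml "${HOME}/.kube/config"
```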
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
Otherwise the /etc/rancher/k3s/k3s.yaml is not readable by users other
than root.
As --write-config-mode is being passed, and that's an option that has to
be passed to the `server`, -s is also added to the command line.
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
`wait` waits for a job to complete, not for a number of seconds. Not
sure how I got that wrong in the first place, but it is what it is.
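For the record, the distinction looks like this (`some_command` being
any background job):
```sh
sleep 10        # pauses for 10 seconds
some_command &  # start a background job
wait $!         # waits for that job to complete, not for a number of seconds
```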
Fixes: #6542
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
This function right now is completely based on what's part of the tests
repo[0], and that's the reason I'm keeping the `Signed-off-by` of all
the contributors to that file.
This is not perfect, though, as it changes the default snapshotter to
devmapper, instead of only doing so for the Kata Containers specific
runtime handlers. OTOH, this is exactly what we've always been doing as
part of the tests.
We'll improve it, soon enough, when we get to also add a way for
kata-deploy to set up different snapshotters for different handlers.
But, for now, this is as good (or as bad) as it's always been.
It's important to note that the devmapper setup doesn't take into
consideration a BM machine, and it is not suitable for that. We're
really only targeting GHA runners, which will be thrown away after the
run is over.
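As a hedged illustration (not the literal function), pointing
containerd's CRI plugin at a devmapper snapshotter roughly means a
config fragment like this, with the pool name and size as placeholders:
```sh
sudo tee -a /etc/containerd/config.toml <<'EOF'
[plugins."io.containerd.snapshotter.v1.devmapper"]
  pool_name = "contd-thin-pool"
  base_image_size = "4096MB"

[plugins."io.containerd.grpc.v1.cri".containerd]
  snapshotter = "devmapper"
EOF
sudo systemctl restart containerd
```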
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
Signed-off-by: Shiming Zhang <wzshiming@foxmail.com>
Signed-off-by: Marcel Apfelbaum <marcel@redhat.com>
Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>
One can use different kubernetes flavours for getting a kubernetes
cluster up and running.
As part of our CI, though, I really would like to avoid contributors
spending time maintaining and updating kubernetes dependencies, as was
done with the tests repo, which has proven to be a really good way of
letting things rot.
With this in mind, I'm taking the bullet and using "k3s" as the way to
deploy kubernetes for the devmapper related tests, and that's the reason
I'm adding a function to do so; this will be used later on as part
of this series.
It's important to note that the k3s setup doesn't take into
consideration a BM machine, and it is not suitable for that. We're
really only targeting GHA runners, which will be thrown away after the
run is over.
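For reference, the upstream quick-start way to bring up a single-node
k3s cluster (roughly what a throwaway GHA runner needs) is:
```sh
# Installs and starts a single-node k3s server via the official installer.
curl -sfL https://get.k3s.io | sh -
```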
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
Use AGENT_POLICY=yes when building the Guest images, and add a
permissive test policy to the k8s tests for:
- CBL-Mariner
- SEV
- SNP
- TDX
Also, add an example of policy rejecting ExecProcessRequest.
Fixes: #7667
Signed-off-by: Dan Mihai <dmihai@microsoft.com>
Let's expand the confidential test to also support TDX.
The main difference on the test, though, is that we're not grepping for
a string in the `dmesg` output, but rather relying on `cpuid` to detect
a TDX guest.
Fixes: #7184
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
Add a test case for the launch of an unencrypted confidential
container, verifying that we are running inside a TEE.
Right now the test only works with SEV, but it'll be expanded in the
coming commits, as part of this very same series.
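A hedged sketch of the kind of in-guest check this implies for SEV
(`$pod_name` is a placeholder, and the exact kernel message varies
across kernel versions):
```sh
# On an SEV guest the kernel reports active memory encryption in its boot log.
kubectl exec "$pod_name" -- dmesg | grep -i "memory encryption"
```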
Fixes: #7184
Signed-Off-By: Unmesh Deodhar <udeodhar@amd.com>
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
k8s-pid-ns.bats was already using the test name from
k8s-kill-all-process-in-container.bats - probably a copy/paste bug.
Fixes: #7753
Signed-off-by: Dan Mihai <dmihai@microsoft.com>
By doing this we can make sure there won't be any clash on the cluster
name created for either the k8s or the kata-deploy tests.
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
In a follow-up series, we'll add a whole suite for the kata-deploy
tests. With this in mind, let's already get rid of this one and avoid
more kata-deploy tests landing here.
Fixes: #7642
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
The default `kata` runtime class would get created with the `kata`
handler instead of `kata-$KATA_HYPERVISOR`. This made Kata use the wrong
hypervisor and broke CI.
Fixes: #7663
Signed-off-by: Aurélien Bombo <abombo@microsoft.com>
As the cri-containerd tests have been fully migrated to GHA, let's make
sure we get them running.
Fixes: #6543
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
As part of the runners, we're hitting a timeout that I cannot reproduce,
at all, when allocating the same instance and running the tests
manually.
The default timeout to connect to the server is 2s when using `crictl`.
Let's increase this to 20s.
It's fairly important to mention that in the first tests I used a
timeout of 10s, and that helped but we still hit issues every now and
then.
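crictl exposes this via its --timeout flag, so the bump is along these
lines (20s as stated above):
```sh
# The default connection timeout is 2s; raise it for the flaky runners.
sudo crictl --timeout 20s info
```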
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
Let's split a good portion of `tests/integration/kubernetes/gha-run.sh`
out, and put it in a place where it can be used by the soon-to-come
kata-deploy specific tests.
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
With these 2 simple checks we can ensure that we do not regress on the
behaviour of allowing the runtime classes / default runtime class to be
created by the kata-deploy payload.
Fixes: #7491
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
Let's add here the image we'll be using for unencrypted confidential
tests. Later on, we'll make sure to build and use this image as part of
our CI.
The image can easily be built as a multi-arch image, and has `cpuid`
installed in the case of an `x86_64` build, so it can be used to detect
whether we're running on a TEE guest without having to rely on
`dmesg | grep ...`.
Fixes: #7595
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
We don't have to do any sed to replace the runtimeclass being used,
the moment we start taking advantage of the `DEFAULT_SHIM` environment
variable merged in the previous commits.
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
Instead of using the package manager to install bats, build it from
source. This gives us an updated version of bats which supports
functions such as setup_file and teardown_file.
We can use these functions in our current tests.
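Building bats from source is straightforward with the upstream
bats-core installer script; a sketch:
```sh
# Install bats-core from source to get setup_file/teardown_file support.
git clone https://github.com/bats-core/bats-core.git
cd bats-core
sudo ./install.sh /usr/local
```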
Fixes: #7597
Signed-off-by: Unmesh Deodhar <udeodhar@amd.com>
Otherwise the VM deletion may not complete, leaving us with several
machines behind.
Fixes: #7509
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
The amount of info we've added seemed unnecessary, and ends up making
our lives even harder when trying to find errors.
Let's just rely on the kata-debug container to collect the needed info
for us.
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
It's been proven to not be useful, and ends up making things more
confusing due to the amount of logs printed.
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
Let's make sure we can debug kata-deploy in case something goes wrong
during its execution.
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
This can be easily done as there was no official release with the
previous values.
The reason we're doing so is because when using `yq` to replace the
value, even when forcing `--tag '!!str' "yes"`, the content is placed
without quotes, causing errors in our CI.
While here, we're also removing the fallback value for DEBUG, as it is
**always** set in the kata-deploy.yaml file.
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
This imports the k8s-file-volume test from the tests repo and modifies
it slightly to set up the host volume on the AKS host.
Signed-off-by: Aurélien Bombo <abombo@microsoft.com>
This imports the k8s-volume test from the tests repo and modifies it
slightly to set up the host volume on the AKS host.
Fixes: #6566
Signed-off-by: Aurélien Bombo <abombo@microsoft.com>
This deletes node debugger pods after execution since their presence may
affect tests that assume only test workload pods are present.
For example, in `k8s-job` we wait for *any* pod to be in the `Succeeded`
state before proceeding, which causes failures.
Fixes: #7452
Signed-off-by: Aurélien Bombo <abombo@microsoft.com>
This simply allows setting a custom resource group when debugging
locally, so as to prevent name collisions and not pollute the namespace.
Signed-off-by: Aurélien Bombo <abombo@microsoft.com>
Makes it so that `setup.sh` doesn't make changes in
`runtimeclass_workloads/` directly. Instead we treat that as a template
directory and we use the new directory `runtimeclass_workloads_work/` as
a work dir.
This has two advantages:
* Allows rerunning tests without the assumption that `setup.sh` must be
idempotent. E.g. the `set_runtime_class()` step would break.
* Doesn't pollute your git environment with a bunch of changes when
developing.
Signed-off-by: Aurélien Bombo <abombo@microsoft.com>
This splits deploying Kata and running the tests into separate commands
to make it possible to rerun tests locally without having to redeploy
Kata each time.
Signed-off-by: Aurélien Bombo <abombo@microsoft.com>
We should source from `nydus_dir`, instead of `cri_containerd_dir`, and
that was a leftover from fb4f7a002c.
Fixes: #6543
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
This newly added GHA does nothing, is not even triggered, and it's just
a placeholder that we'll grow in the next commits / PRs, so we can
actually start running the nydus tests as part of our CI.
Fixes: #6543
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
The test is currently failing with GHA, and I don't think it makes sense
to block all the other tests to get merged while it's happening.
For now, let's disable it and re-enable it as soon as we have it
passing.
Reference: https://github.com/kata-containers/kata-containers/issues/7410
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
Otherwise crictl will fail to remove them with:
```
getting sandbox status of pod "$pod": metadata.Name, metadata.Namespace
or metadata.Uid is not in metadata "..."
```
A huge shout out to Steven Horsman for helping to debug this one.
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
For this set of tests, we'll always be using podman in order to avoid
having containerd pulled in by docker.
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
We don't need the env var, we just need to restrict the test according
to the KATA_HYPERVISOR used, as right now it's very specific to QEMU.
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
We only have shim-v2 as the runtime type, so we always need to run tests
using it. :-)
We had to adjust the script in order to properly run the tests with the
current logic.
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
Let's move the `integration/containerd/cri/integration-tests.sh` file
from the tests repo to this one.
The file has been moved as-is; it's not used yet, and in the following
commits we'll clean it up before actually using it.
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
Let's install all the dependencies needed for running the
`cri-containerd` tests.
The list of dependencies we have are:
* From the system
- build-essential
- jq
- podman-docker
* From our own repo
- yq
- go
* From GitHub projects
- containerd
- cri-tools
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
This is already declared as part of the `common.bash` file, so let's
just make sure we use it from there.
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
This will help us on two fronts:
* catching possible issues related to kata-deploy cleanup
* doing more (like, in the future, collecting logs) after the tests run
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
This will help us to gather more information about Kata Containers in
case of failure.
Fixes: #7343
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
Let's make sure we install the needed dependencies for running the
`cri-containerd` tests.
Right now this commit is basically adding a placeholder, and later on,
when we'll actually be able to test the job, we'll add the logic of
installing the needed dependencies.
The obvious dependencies we've spotted so far are:
* From the OS
* jq
* curl (already present)
* From our repo
* yq (using the install_yq script)
* From GitHub
* cri-containerd
* cri-tools
* cni plugins
We may need a few more packages, but we will only figure this out as
part of the actual work.
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
This PR builds the foundation for us to start migrating the
cri-containerd tests from Jenkins to GitHub Actions.
Right now the test does nothing and should always finish successfully.
The coming PRs will actually introduce logic to the `gha-run.sh` script
where we'll be able to run the tests and make sure those pass before
having them actually merged.
Fixes: #6543
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
This is a very simple addition that should be expanded by
https://github.com/kata-containers/kata-containers/pull/7185, and it's
aimed at gathering more info that will help us to debug CI failures.
Fixes: #7296
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
Currently a mixture of cbl-mariner and mariner is used when creating the
mariner initrd. The kata-static tarball has mariner in the name, but the
jenkins url uses cbl-mariner. This breaks cache usage.
Use mariner as the target name throughout the build, so that caching works.
Fixes: #7292
Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>
We've noticed this caused regressions with the k8s-oom tests, and then
decided to take a step back and do this in the same way it was done
before 67972ec48a.
Moreover, this step back is also more reasonable in terms of the
controlling logic.
And by doing this we can re-enable the k8s-oom.bats tests, which is done
as part of this PR.
Fixes: #7271
Depends-on: github.com/kata-containers/tests#5705
Signed-off-by: Yushuo <y-shuo@linux.alibaba.com>
Let's skip the k8s-oom, as the test is currently failing.
We've an issue opened for that, and we'll be working on re-enabling it
as soon as possible.
Reference:
https://github.com/kata-containers/kata-containers/issues/7271
Fixes: #7253
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
Let's skip the k8s-number-cpus, as the test is currently failing.
We've an issue opened for that, and we'll be working on re-enabling it
as soon as possible.
Reference:
https://github.com/kata-containers/kata-containers/issues/7270
Fixes: #7253
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
Let's make sure we run our tests in a specific namespace, as in case of
any kind of issue, we will just get rid of the namespace itself, which
will take care of cleaning up any leftover from failing tests.
One important thing to mention is why we can get rid of the `namespace:
${namespace}` on the tests that are already using it, and let's do it in
parts:
* namespace: default
We can easily get rid of this as that's the default namespace where
pods are created, so it was a no-op so far.
* namespace: test-quota-ns
My understanding is that we'd need this in order to get a clean
namespace where we'd be setting a quota for. Doing this in the
namespace that's only used for tests should **not** cause any
side-effect on the tests, as we're running those in serial and there's
no other pods running on the `kata-containers-k8s-tests` namespace
Last but not least, we're not dynamically creating namespaces, as the
tests are never run in parallel: not in the case of having 2 tests
being run at the same time, nor in the case of having 2 jobs being
scheduled to the same machine.
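In practice the cleanup property boils down to something like this
(sketch only):
```sh
# All test pods live in one dedicated namespace...
kubectl create namespace kata-containers-k8s-tests
# ...so wiping it removes any leftovers from failed tests in one go.
kubectl delete namespace kata-containers-k8s-tests
```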
Fixes: #6864
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
Non-AKS k8s tests (SEV/SNP/TDX) don't currently set KATA_HOST_OS, so
provide a default empty value for the variable so that those tests can
run.
Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>
Use "AzureLinux" as the OSSKU value, to get rid of this warning when
creating the AKS cluster:
WARNING: The osSKU "AzureLinux" should be used going forward instead of
"CBLMariner" or "Mariner". The osSKUs "CBLMariner" and "Mariner" will
eventually be deprecated.
Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>
We only need to install yq in run_tests() so that the yq install is
picked up by kubernetes/setup.sh as well. We also need to either use
(sudo && INSTALL_IN_GOPATH=false) or (INSTALL_IN_GOPATH=true).
Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com>
Mariner ships a bleeding-edge kernel that might be ahead of upstream, so
we use that to guarantee compatibility with the host.
Signed-off-by: Aurélien Bombo <abombo@microsoft.com>
* Adds a new `rootfs-initrd-mariner` build target.
* Sets the custom initrd path via annotation in `setup.sh` at test
time.
* Adapts versions.yaml to specify a `cbl-mariner` initrd variant.
* Introduces env variable `HOST_OS` at deploy time to enable using a
custom initrd.
* Refactors the image builder so that its caller specifies the desired
guest OS.
Signed-off-by: Aurélien Bombo <abombo@microsoft.com>
Github Actions reads and runs workflow files from the main branch,
rather than from the PR branch. This means that PRs that modify workflow
files aren't being tested with the updated workflows coming from the PR,
but rather with the old workflows from the main branch. AFAIK, this
behavior isn't avoidable for workflow files (but is for other scripts).
This makes it very hard to reliably test workflow changes before they're
actually merged into main and leads to issues that we have to hotfix
(see #6983, #6995).
This PR aims to mitigate that by extracting the commands used in
workflows to a separate script file. The way our CI is set up, those
script files are read from the PR branch and thus changes would be
reflected in the CI checks.
Fixes: #6971
Signed-off-by: Aurélien Bombo <abombo@microsoft.com>
The current testing setup only supports running Kata on top of an Ubuntu
host. This adds Mariner to the matrix of testable hosts for k8s
tests, with Cloud Hypervisor as a VMM.
As preparation for the upcoming PR that will change only the actual test
code (rather than workflow YAMLs), this also introduces a new file
`setup.sh` that will be used to set host-specific parameters at test
run-time.
Fixes: #6961
Signed-off-by: Aurélien Bombo <abombo@microsoft.com>
Currently Kata does not support memory / CPU hotplug for SEV or
SEV-SNP so we need to skip tests that rely on it.
Signed-off-by: Tobin Feldman-Fitzthum <tobin@ibm.com>
Now that SEV artifacts are built by GHA, remove
conditional that skips tests when using qemu-sev.
Signed-off-by: Tobin Feldman-Fitzthum <tobin@ibm.com>
Now that we have SNP artifacts in place and they are built via gha,
remove the condition that skips the tests for SNP.
Fixes: #6809
Signed-off-by: Tobin Feldman-Fitzthum <tobin@ibm.com>
With the changes proposed as part of this PR, a qemu-snp cluster
will be created but no tests will be performed.
GitHub Actions will only run the tests using the workflows that are
part of the **target** branch, instead of using the ones coming
from the PR. No way to work around this for now.
After this commit is merged, the tests (not the yaml files for the
actions) will be altered in order for the checkout action to help in
this case.
Fixes: #6722
Signed-off-by: Ryan Savino <ryan.savino@amd.com>
With the changes proposed as part of this PR, a qemu-sev cluster will
be created but no tests will be performed.
GitHub Actions will only run the tests using the workflows that are
part of the **target** branch, instead of using the ones coming
from the PR. No way to work around this for now.
After this commit is merged, the tests (not the yaml files for the
actions) will be altered in order for the checkout action to help in this
case.
Fixes: #6711
Signed-off-by: Ryan Savino <ryan.savino@amd.com>
Now that the infra for running dragonball tests has been enabled, let's
actually make sure to have them running on each PR.
The tests skipped are:
* `k8s-cpu-ns.bats`, as CPU resize doesn't seem to be yet properly
supported on runtime-rs
* https://github.com/kata-containers/kata-containers/issues/6621
Fixes: #6605
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
With the changes proposed as part of this PR, an AKS cluster will be
created but no tests will be performed.
The reason we have to do this is because GitHub Actions will only run
the tests using the workflows that are part of the **target** branch,
instead of using the ones coming from the PR, and we haven't yet found
a way to work around this.
Once this commit is in, we'll actually change the tests themselves (not
the yaml files for the actions), as those will be the ones we want as
the checkout action helps us in this case.
Fixes: #6583
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
The first part of simplifying things to have all our tests using GitHub
actions is moving the k8s tests to this repo, as those will be the first
vict^W targets to be migrated to GitHub actions.
Those tests have been slightly adapted, mainly related to what they load
/ import, so they are more self-contained and do not require us to bring
a lot of scripts from the tests repo here.
A few scripts were also dropped along the way, as we no longer plan to
deploy kubernetes as part of every single run, but rather assume there
will always be k8s running whenever we get to run those tests.
It's important to mention that a few tests were not added here:
* k8s-block-volume:
* k8s-file-volume:
* k8s-volume:
* k8s-ro-volume:
These tests depend on some sort of volume being created on the
kubernetes node where the test will run, and this won't fly as the
tests will run from a GitHub runner, targeting a different machine
where kubernetes will be running.
* https://github.com/kata-containers/kata-containers/issues/6566
* k8s-hugepages: This test depends a whole lot on the host where it
lands, and right now we cannot assume anything about that anymore, as
the tests will run from a GitHub runner, targeting a different
machine where kubernetes will be running.
* https://github.com/kata-containers/kata-containers/issues/6567
* k8s-expose-ip: This is simply hanging when running on AKS and has to
be debugged in order to figure out the root cause of that, and then
adapted to also work on AKS.
* https://github.com/kata-containers/kata-containers/issues/6578
Till those issues are solved, we'll keep running a jenkins job with
those tests to avoid any possible regression.
Last but not least, I've decided to **not** keep the history when
bringing those tests here, otherwise we'd end up polluting the history
of this repo quite a lot, without any clear benefit in doing so.
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>