kata-containers

mirror of https://github.com/kata-containers/kata-containers.git synced 2025-08-25 11:13:15 +00:00

Author	SHA1	Message	Date
Jeremi Piotrowski	01265fb217	tests/vfio: Gather debug info and disable tdp_mmu tdp_mmu had some issues up until around Linux v6.3 that make it work particularly bad when running nested on Hyper-V. Reload the module at the start of the test and disable the tdp_mmu param. Gather debug info at the end of the test to make it easier to figure out what went wrong. This uses github actions group syntax so that each section can be collapsed. Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com> (cherry picked from commit `5f6475a28a`)	2023-09-21 13:55:25 +02:00
Jeremi Piotrowski	44f37f689a	tests/vfio: Capture journal from vm For debugging (though this doesn't get exposed yet). Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com> (cherry picked from commit `8fffdc81c5`)	2023-09-21 13:55:19 +02:00
Jeremi Piotrowski	a69d0d1772	tests/vfio: Change to get the test working in GHA - reduce memory and cpu usage to fit in a D4s_v5 - source correct lib - mount workspace from 9p - disable cpu mitigations for speed - drop unused commands and variables - install containerd - install kata from built artifacts Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com> (cherry picked from commit `df815087e7`)	2023-09-21 13:55:14 +02:00
Jeremi Piotrowski	e90027f38c	tests/vfio: Move dependency installation to gha-run.sh To match the flow of other github actions workflows. Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com> (cherry picked from commit `a92ddeea15`)	2023-09-21 13:55:08 +02:00
Jeremi Piotrowski	62804d637c	gha: vfio: Import jobs scripts from tests repo This imports the vfio test scripts github.com/kata-containers/tests. The test case doesn't work yet but doing the changes in a separate commit will make it easier to track the changes. The only change in this commit is renaming vfio_jenkins_job_build.sh -> vfio_fedora_vm_wrapper.sh Fixes: #6555 Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com> (cherry picked from commit `5a551a85b1`)	2023-09-21 13:50:38 +02:00
Gabriela Cervantes	97283b18b4	metrics: Increase jitter value for qemu This PR increases the jitter value for qemu for kata metrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com> (cherry picked from commit `49e2fa189c`)	2023-09-21 13:50:31 +02:00
Gabriela Cervantes	3c5bd8c44d	metrics: Increase value limit for jitter in clh This PR increases the value limit for jitter in clh. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com> (cherry picked from commit `49234433a7`)	2023-09-21 13:50:25 +02:00
Fabiano Fidêncio	6abf513f06	ci: docker: nerdtl: Use io.containerd.kata-${KATA_HYPERVISOR}.io This will ensure that we're calling the correct binary for the hypervisor. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `813bfdec01`)	2023-09-21 13:50:15 +02:00
Fabiano Fidêncio	9a664ea8bb	ci: nerdctl: Create the containerd config Otherwise we'll fail to configure kata-containers in the `install-kata` step. This is mostly needed because the nerdctl-full tarball doesn't provide a contaienrd configuration, just the binary, as contaienrd does not actually require a configuration file to run with the default config. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `46bc0b1c01`)	2023-09-21 13:50:10 +02:00
Fabiano Fidêncio	5734c4cbca	ci: nerdctl: Switch to tcp port 80 ping TIL that the Azure VMs we use are created without an explicit outbund connectivity defined. This leads us to issues using `ping ...` as part of our tests, and when consulting Jeremi Piotrowski about the issue he pointed me out to two interesting links: * https://learn.microsoft.com/en-us/azure/virtual-network/ip-services/default-outbound-access * https://learn.microsoft.com/en-us/archive/blogs/mast/use-port-pings-instead-of-icmp-to-test-azure-vm-connectivity For your own sanity, do not read the comments, after all this is internet. :-) Anyways, the suggestion is to use nping instead, which is provided by the nmap package, so we can explicitly switch to using the tcp port 80 for the ping. With this in mind, I'm switching the image we use for the test and using one that provided nping as a possible entry point, and from now on (this part of) the tests should work. Fixes: #7910 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `13968aa7f6`)	2023-09-21 13:50:00 +02:00
Fabiano Fidêncio	55c8a47a40	ci: docker: Switch to tcp port 80 ping TIL that the Azure VMs we use are created without an explicit outbund connectivity defined. This leads us to issues using `ping ...` as part of our tests, and when consulting Jeremi Piotrowski about the issue he pointed me out to two interesting links: * https://learn.microsoft.com/en-us/azure/virtual-network/ip-services/default-outbound-access * https://learn.microsoft.com/en-us/archive/blogs/mast/use-port-pings-instead-of-icmp-to-test-azure-vm-connectivity For your own sanity, do not read the comments, after all this is internet. :-) Anyways, the suggestion is to use nping instead, which is provided by the nmap package, so we can explicitly switch to using the tcp port 80 for the ping. With this in mind, I'm switching the image we use for the test and using one that provided nping as a possible entry point, and from now on (this part of) the tests should work. Fixes: #7910 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `e0c811678b`)	2023-09-21 13:49:54 +02:00
Gabriela Cervantes	31c3d9bd80	metrics: Add iperf bandwidth value for qemu This PR adds the iperf bandwidth value for qemu for kata metrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com> (cherry picked from commit `0aa073967d`)	2023-09-21 13:49:45 +02:00
Gabriela Cervantes	40ae855f0e	metrics: Add iperf bandwidth value for kata metrics This PR adds the iperf bandwidth value for kata metrics. Fixes #7924 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com> (cherry picked from commit `615c1cbf19`)	2023-09-21 13:49:35 +02:00
Gabriela Cervantes	deadacd58f	metrics: Ensure docker is running in init_env This PR ensures that docker is running as part of the init_env function in kata metrics to avoid failures like docker is not running and making the kata metrics CI to fail. Fixes #7898 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com> (cherry picked from commit `d53eb73eec`)	2023-09-21 13:49:28 +02:00
Gabriela Cervantes	31c33f9c1c	metrics: Add Cassandra Metrics documentation This PR adds the Cassandra Metrics documentation for kata metrics. Fixes #7922 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com> (cherry picked from commit `ad08321b83`)	2023-09-21 13:49:21 +02:00
David Esparza	0968bf1eb9	metrics: this PR skips the FIO test temprarily to fix issues FIO test is showing ongoing issues when running in k8s. Working on running FIO on the ctr client which has been shown to be stable. Fixes: #7920 Signed-off-by: David Esparza <david.esparza.borquez@intel.com> (cherry picked from commit `a58ea66592`)	2023-09-21 13:48:24 +02:00
Fabiano Fidêncio	e5e3951398	ci: docker: Also run the smoke test with runc This will help us to make sure that the failure is actually related to Kata Containers. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `f536ef5ce1`)	2023-09-21 13:46:53 +02:00
Fabiano Fidêncio	c7147dabce	ci: docker: Run the tests after the kata-static is created There's no reason to wait till the payload is created to run the tests, as we rely on the tarball, not on the kata-deploy payload. That was a mistake on my side, and that's already fixed for the nerdctl tests. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `c83f167c59`)	2023-09-21 13:46:46 +02:00
Fabiano Fidêncio	33430ad60c	ci: Add a very basic nerdctl sanity test Let's add a very basic sanity test to check that we can spawn a containers using nerdctl + Kata Containers. This will ensure that, at least, we don't regress to the point where this feature doesn't work at all. In the future, we should also test all the VMMs with devmapper, but that's for a follow-up PR after this test is working as expected. Fixes: #7911 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `12d833d07d`)	2023-09-21 13:46:37 +02:00
Fabiano Fidêncio	69dd11f459	ci: Add a very basic docker sanity test Let's add a very basic sanity test to check that we can spawn a containers using docker + Kata Containers. This will ensure that, at least, we don't regress to the point where this feature doesn't work at all. For now we're running this test against Cloud Hypervisor and QEMU only, due to an already reported issue with dragonball: https://github.com/kata-containers/kata-containers/issues/7912 In the future, we should also test all the VMMs with devmapper, but that's for a follow-up PR after this test is working as expected. Fixes: #7910 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `348b8644d6`)	2023-09-21 13:46:28 +02:00
Fabiano Fidêncio	fcfa6c6e1a	ci: use github.ref_name instead of $GITHUB_REF_NAME As, regardless of what's mentioned in the documentation, it seems that $GITHUB_REF_NAME is passed down as a literal string. Fixes: #7414 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `f811b064ca`)	2023-09-21 13:46:00 +02:00
Fabiano Fidêncio	19d9fd9eb1	ci: Add more target-branch related fixes The ones for the payload-after-push.yamland ci-nightly.yaml are not that much important right now, but they're needed for when we start running those on stable branches as well. The other ones were missed during `bd24afcf73`. Fixes: #7414 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `6d795c089e`)	2023-09-21 13:45:52 +02:00
Fabiano Fidêncio	fe4247a90c	ci: Fix target-branch usage We missed those one as part of `bd24afcf73`. Fixes: #7414 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `8509c31870`)	2023-09-21 13:45:43 +02:00
Gabriela Cervantes	9f510d059b	metrics: Remove warning from metrics documentation Now that the metrics migration from the tests to kata containers has been completed, this PR removes the warning from the main metrics documentation. Fixes #7894 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com> (cherry picked from commit `060499dcae`)	2023-09-21 13:45:31 +02:00
Fabiano Fidêncio	400418bce0	kata-deploy: Remove curl after it's used There's no need to keep curl there after the kubectl binary has already been downloaded. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `8b4a0b368f`)	2023-09-21 13:45:07 +02:00
Fabiano Fidêncio	1df997c38c	kata-deploy: Fix aarch64 image build Similarly to what's been done for x86_64 -> amd64, we need to do a aarch64 -> arm64 change in order to be able to download the kubectl binary. Fixes: #7861 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `139c7f03ab`)	2023-09-21 13:44:54 +02:00
Fabiano Fidêncio	61b1a99fca	gha: Manually rebase PR atop of the target branch before testing We're changing what's been done as part of `ac939c458c`, as we've notcied issues using `github.event.pull_request.merge_commit_sha`. Basically, whenever a force-push would happen, the reference of merge_commit_sha wouldn't be updated, leading us to test PRs with the old code. :-/ In order to get the rebase properly working, we need to ensure we pull the hash of the commit as part of checkout action, and ensure fetch-depth is set to 0. Fixes: #7414 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `bd24afcf73`)	2023-09-21 13:43:49 +02:00
Fabiano Fidêncio	db563709e3	kata-deploy: Switch to an alpine image This will make our image smaller, and still ensure it's multi-arch support. Fixes: #7861 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `670a8e9c73`)	2023-09-21 13:43:35 +02:00
Fabiano Fidêncio	bb5dbfbbce	k8s: ci: Skip "Pod quota" test with firecracker The test is failing, and an issue has been opened to track it. For now, let's skip it. Issue: https://github.com/kata-containers/kata-containers/issues/7873 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `9d74b7ccc9`) Conflicts: tests/integration/kubernetes/k8s-pod-quota.bats	2023-09-21 13:43:16 +02:00
Fabiano Fidêncio	263ed4afd1	ci: k8s: Remove useless skip statement from tests There's absolutely no need to have the skip check as part of the test itself when it's already done as part of the setup function. We're only touching the files here that were touched in the previous commit. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `f6cd3930c5`)	2023-09-21 13:42:24 +02:00
Fabiano Fidêncio	7e135294a7	ci: k8s: Also check for "fc" (for firecracker) Let's keep both checks for now, but in the future we'll be able to remove the check for "firecracker", as the hypervisor name used as part of the GitHub Actions has to match what's used as part of the kata-deploy stuff, which is `fc` (as in `kata-fc for the runtime class) instead of `firecracker`. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `3cc20b47a6`)	2023-09-21 13:42:17 +02:00
Fabiano Fidêncio	8892d9a7b2	ci: k8s: Add clean-up-garm argument for gha-run.sh The tests are failing to finish as the argument is invalid. Fixes: #6542 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `b5bad3cb0f`)	2023-09-21 13:42:06 +02:00
Fabiano Fidêncio	c723a7d9c8	ci: k8s: devmapper tests should be using ubuntu 20.04 That's what we've been using as part of Jenkins, so let's ensure things will work as they did before, and only after that consider upgrading the base OS used for the tests. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `aaec5a09f3`)	2023-09-21 13:41:57 +02:00
Fabiano Fidêncio	aee6f36c86	ci: k8s: Add a kata-deploy-garm target We've been using the `kata-deploy-tdx` target as that also uses k3s as base, but it's better to just have a specific garm target. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `27fa7d828d`)	2023-09-21 13:41:52 +02:00
Fabiano Fidêncio	5bb77b628d	ci: k8s: Export KUBERNETES env var So we have a better control on which flavour of kubernetes kata-deploy is expected to be targetting. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `fa62a4c01b`)	2023-09-21 13:41:45 +02:00
Fabiano Fidêncio	7ce5c8b3fa	ci: k8s: Install bats on GARM runners GARM runners do not come with the whole set of tools we need, or are used to when it comes to the GHA runners, so we need to manually install bats on those. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `8c9380a798`)	2023-09-21 13:41:39 +02:00
Fabiano Fidêncio	9fb291d88a	ci: k8s: Wait some time after restarting k3s Let's put a 1 minute sleep, just to make sure everything is back up again. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `3de23034f8`)	2023-09-21 13:41:31 +02:00
David Esparza	053308eefc	metrics: fix FIO test initialization This PR changes the order in which the FIO test first cleans the environment and then checks if the environment is indeed clean. Fixes: #7869 Signed-off-by: David Esparza <david.esparza.borquez@intel.com> (cherry picked from commit `adfea55b8f`)	2023-09-21 13:41:12 +02:00
Fabiano Fidêncio	89345b6731	ci: k8s: Append, instead of overwrite, the devmapper config As we were using `tee` without the `-a` (or `--apend`) aptton, the containerd config would be overwritten, leading to a NotReady state of the Node. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `2df183fd99`)	2023-09-21 13:41:01 +02:00
Fabiano Fidêncio	bb675f8101	ci: k8s: Decrease k3s sleep from 4 to 2 minutes It should be plenty, and worked well in local tests. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `369a8af8f7`)	2023-09-21 13:40:56 +02:00
Fabiano Fidêncio	695c7162ef	ci: k8s: Use vanilla kubectl with k3s Let's download the vanilla kubectl binary into `/usr/bin/`, as we need to avoid hitting issues like: ```sh error: open /etc/rancher/k3s/k3s.yaml.lock: permission denied ``` The issue basically happens because k3s links `/usr/local/bin/kubectl` to `/usr/local/bin/k3s`, and that does extra stuff that vanilla `kubectl` doesn't do. Also, in order to properly use the k3s.yaml config with the vanilla kubectl, we're copying it to ~/.kube/config. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `ada65b988a`)	2023-09-21 13:40:51 +02:00
Fabiano Fidêncio	7f865be398	ci: k8s: Ensure k3s is deploy with --write-kubeconfig-mode=644 Otherwise the /etc/rancher/k3s/k3s.yaml is not readable by other users than root. As --write-config-mode is being passed, and that's an option that has to be passed to the `server`, -s is also added to the command line. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `ad45ab5d33`)	2023-09-21 13:40:45 +02:00
Fabiano Fidêncio	7a96d0a589	ci: k8s: Use the proper command for sleep `wait` waits for a job to complete, not a number of seconds. Not sure how I got that wrong in the first place, but it's what it's. Fixes: #6542 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `028a97e0d5`)	2023-09-21 13:40:38 +02:00
Gabriela Cervantes	92fdaf9719	metrics: Use TensorFlow optimized image This PR replaces the ubuntu image for one which has TensorFlow optimized for kata metrics. Fixes #7866 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com> (cherry picked from commit `3a427795ea`)	2023-09-21 13:39:40 +02:00
Fabiano Fidêncio	1b7ffeac53	ci: k8s: Fix typo in run-k8s-tests-on-garm.yaml integrations -> integration integrtion -> integration Fixes: #6542 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `8d99972a8a`)	2023-09-21 13:39:32 +02:00
Fabiano Fidêncio	79de72592f	ci: k8s: Add k8s devmapper tests (part 0) Let's enable the devmapper kubernetes tests to match exactly what's been tested as part of the Jenkins CI. Fixes: #6542 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `0e8bd50cbb`)	2023-09-21 13:39:12 +02:00
Fabiano Fidêncio	a41a56e326	ci: k8s: Add a function to configure devmapper for containerd This function right now is completely based on what's part of the tests repo[0], and that's the reason I'm keeping the `Signed-off-by` of all the contributors to that file. This is not perfect, though, as it changes the default snapshotter to devmapper, instead of only doing so for the Kata Containers specific runtime handlers. OTOH, this is exactly what we've always been doing as part of the tests. We'll improve it, soon enough, when we get to also add a way for kata-deploy to set up different snapshotters for different handlers. But, for now, this is as good (or as bad) as it's always been. It's important to note that the devmapper setup doesn't take into consideration a BM machine, and this is not suitable for that. We're really only targetting GHA runners which will be thrown away after the run is over. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> Signed-off-by: Shiming Zhang <wzshiming@foxmail.com> Signed-off-by: Marcel Apfelbaum <marcel@redhat.com> Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com> (cherry picked from commit `b28b54df04`)	2023-09-21 13:39:05 +02:00
Fabiano Fidêncio	315288a000	ci: k8s: Add a function to deploy k3s One can use different kubernetes flavours for getting a kubernetes cluster up and running. As part of our CI, though, I really would like to avoid contributors spending time maintaining and updating kubernetes dependencies, as done with the tests repo, and which has been proven to be really good on getting things rotten. With this in mind, I'm taking the bullet and using "k3s" as the way to deploy kubernetes for the devmapper related tests, and that's the reason I'm adding a function to do so, and this will be used later on as part of this series. It's important to note that the k3s setup doesn't take into consideration a BM machine, and this is not suitable for that. We're really only targetting GHA runners which will be thrown away after the run is over. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com> (cherry picked from commit `54f7117212`)	2023-09-21 13:38:56 +02:00
Hyounggyu Choi	899c823c0b	packaging: do not install docker-compose-plugin for s390x\|ppc64le This PR is to skip installing docker-compose-plugin while buiding a `build-kata-deploy` image for s390x\|ppc64le. It is a temporary solution to fix current CI failures for s390x regarding `hash sum mismatch`. Fixes: #7848 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com> (cherry picked from commit `2efda20c77`)	2023-09-21 13:38:42 +02:00
Gabriela Cervantes	374e77d330	metrics: Add write 95 percentile for FIO for qemu This PR adds the write 95 percentile for FIO for qemu for checkmetrics for kata metrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com> (cherry picked from commit `438fbf9669`)	2023-09-21 13:38:33 +02:00

1 2 3 4 5 ...

11383 Commits