kata-containers

mirror of https://github.com/kata-containers/kata-containers.git synced 2025-07-12 22:58:58 +00:00

Author	SHA1	Message	Date
stevenhorsman	b87b4b6756	metrics: Increase ranges range for qemu failing tests We've also seen the qemu metrics tests are failing due to the results being slightly outside the max range for network-iperf3 parallel and minimum for network-iperf3 jitter tests on PRs that have no code changes, so we've increase the bounds to not see false negatives. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-11-29 10:52:16 +00:00
stevenhorsman	4011071526	metrics: Increase minval range for failing tests We've seen a couple of instances recently where the metrics tests are failing due to the results being below the minimum value by ~2%. For tests like latency I'm not sure why values being too low would be an issue, but I've updated the minpercent range of the failing tests to try and get them passing. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-11-29 10:50:02 +00:00
Aurélien Bombo	16a91fccbe	Merge pull request #10561 from sprt/csi-driver-ci coco: ci: Lay groundwork for compiling and publishing CSI driver image [1/x]	2024-11-27 10:26:45 -06:00
Aurélien Bombo	5e4990bcf5	coco: ci: Add no-op steps to deploy CSI driver This adds no-op steps that'll be used to deploy and clean up the CSI driver used for testing. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2024-11-21 16:08:06 -06:00
Adithya Krishnan Kannan	2242aee099	ci: Skip the failing tests in SNP Per [Issue#10549](https://github.com/kata-containers/kata-containers/issues/10549), the following tests are failing on SNP. 1. k8s-guest-pull-image-encrypted.bats 2. k8s-guest-pull-image-authenticated.bats 3. k8s-guest-pull-image-signature.bats 4. k8s-confidential-attestation.bats Per @fidencio 's comment on [PR#10558](https://github.com/kata-containers/kata-containers/pull/10558), I am skipping the same. Signed-Off-By: Adithya Krishnan Kannan <AdithyaKrishnan.Kannan@amd.com>	2024-11-19 10:41:43 -06:00
Fabiano Fidêncio	9b1a5f2ac2	tests: Add a way to run only tests which rely on attestation We're doing this as, at Intel, we have two different kind of machines we can plug into our CI. Without going much into details, only one of those two kinds of machines will work for the attestation tests we perform with ITA, thus in order to speed up the CI and improve test coverage (OS wise), we're going to run different tests in different machines. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2024-11-14 15:51:57 +01:00
GabyCT	06fe459e52	Merge pull request #10508 from GabyCT/topic/installartsta gha: Get artifacts when installing kata tools in stability workflow	2024-11-11 15:59:06 -06:00
Fabiano Fidêncio	2281342fb8	Merge pull request #10513 from fidencio/topic/ci-adjust-proxy-nightmare-for-tdx ci: tdx: kbs: Ensure https_proxy is taken in consideration	2024-11-10 00:17:10 +01:00
Saul Paredes	461efc0dd5	tests: remove manifest v1 test This test was meant to show support for pulling images with v1 manifest schema versions. The nginxhttps image has been modified in https://hub.docker.com/r/ymqytw/nginxhttps/tags such that we are no longer able to pull it: $ docker pull ymqytw/nginxhttps:1.5 Error response from daemon: missing signature key We may remove this test since schema version 1 manifests are deprecated per https://docs.docker.com/engine/deprecated/#pushing-and-pulling-with-image-manifest-v2-schema-1 : "These legacy formats should no longer be used, and users are recommended to update images to use current formats, or to upgrade to more current images". This schema version was used by old docker versions. Further OCI spec https://github.com/opencontainers/image-spec/blob/main/manifest.md#image-manifest-property-descriptions only supports schema version 2. Signed-off-by: Saul Paredes <saulparedes@microsoft.com>	2024-11-08 13:38:51 -08:00
Fabiano Fidêncio	baf88bb72d	ci: tdx: kbs: Ensure https_proxy is taken in consideration Trustee's deployment must set the correct https_proxy as env var on the container that will talk to the ITA / ITTS server, otherwise the kbs service won't be able to start, causing then issues in our CI. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org> Signed-off-by: Krzysztof Sandowicz <krzysztof.sandowicz@intel.com>	2024-11-08 16:06:16 +01:00
Steve Horsman	1f728eb906	Merge pull request #10498 from stevenhorsman/update-create-container-timeout-log tests: k8s: Update image pull timeout error	2024-11-08 10:47:39 +00:00
Gabriela Cervantes	4274198664	gha: Get artifacts when installing kata tools in stability workflow This PR adds the get artifacts which are needed when installing kata tools in stability workflow to avoid failures saying that artifacts are missing. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-11-07 16:20:41 +00:00
GabyCT	47cea6f3c6	Merge pull request #10493 from GabyCT/topic/katatoolsta gha: Add install kata tools as part of the stability workflow	2024-11-06 14:16:48 -06:00
Gabriela Cervantes	13e27331ef	gha: Add install kata tools as part of the stability workflow This PR adds the install kata tools step as part of the k8s stability workflow. To avoid the failures saying that certain kata components are not installed it. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-11-06 20:07:06 +00:00
Fabiano Fidêncio	71c4c2a514	Merge pull request #10486 from kata-containers/topic/enable-AUTO_GENERATE_POLICY-for-qemu-coco-dev workflows: Use AUTO_GENERATE_POLICY for qemu-coco-dev	2024-11-06 21:04:45 +01:00
stevenhorsman	85554257f8	tests: k8s: Update image pull timeout error Currently the error we are checking for is `CreateContainerRequest timed out`, but this message doesn't always seem to be printed to our pod log. Try using a more general message that should be present more reliably. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-11-06 17:00:26 +00:00
Julien Ropé	da5e0c3f53	ci: skip nginx connectivity test with crio We have an error with service name resolution with this test when using crio. This error could not be reproduced outside of the CI for now. Skipping it to keep the CI job running until we find a solution. See: #10414 Signed-off-by: Julien Ropé <jrope@redhat.com>	2024-11-06 12:07:02 +01:00
Julien Ropé	6d0cb1e9a8	ci: export CONTAINER_RUNTIME to the test scripts This variable will allow tests to adapt their behaviour to the runtime (containerd/crio). Signed-off-by: Julien Ropé <jrope@redhat.com>	2024-11-06 11:29:11 +01:00
Fabiano Fidêncio	72979d7f30	workflows: Use AUTO_GENERATE_POLICY for qemu-coco-dev By the moment we're testing it also with qemu-coco-dev, it becomes easier for a developer without access to TEE to also test it locally. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2024-11-06 10:47:08 +01:00
Dan Mihai	03bf4433d7	Merge pull request #10459 from stevenhorsman/update-bats tests: k8s: Update bats	2024-11-04 12:26:58 -08:00
Aurélien Bombo	f639d3e87c	Merge pull request #10395 from Sumynwa/sumsharma/create_container agent-ctl: Add support to test kata-agent's container creation APIs.	2024-11-04 14:09:12 -06:00
Gabriela Cervantes	fd4d0dd1ce	gha: Fix source for gha stability run script This PR fixes the source to avoid duplication specially in the common.sh script and avoid failures saying that certain script is not in the directory. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-11-04 16:16:13 +00:00
stevenhorsman	175ebfec7c	Revert "k8s:kbs: Add trap statement to clean up tmp files" This reverts commit `973b8a1d8f`. As @danmihai1 points out https://github.com/bats-core/bats-core/issues/364 states that using traps in bats is error prone, so this could be the cause of the confidential test instability we've been seeing, like it was in the static checks, so let's try and revert this. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-11-04 09:59:37 +00:00
stevenhorsman	75cb1f46b8	tests/k8s: Add skip is setup_common fails At @danmihai1's suggestion add a die message in case the call to setup_common fails, so we can see if in the test output. Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-11-04 09:59:33 +00:00
stevenhorsman	3f5bf9828b	tests: k8s: Update bats We've seen some issues with tests not being run in some of the Coco CI jobs (Issue #10451) and in the envrionments that are more stable we noticed that they had a newer version of bats installed. Try updating the version to 1.10+ and print out the version for debug purposes Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-11-04 09:59:33 +00:00
Sumedh Alok Sharma	4b7aba5c57	agent-ctl: Add support to test kata-agent's container creation APIs. This commit introduces changes to enable testing kata-agent's container APIs of CreateContainer/StartContainer/RemoveContainer. The changeset include: - using confidential-containers image-rs crate to pull/unpack/mount a container image. Currently supports only un-authenicated registry pull - re-factor api handlers to reduce cmdline complexity and handle request generation logic in tool - introduce an OCI config template for container creation - add test case Fixes #9707 Signed-off-by: Sumedh Alok Sharma <sumsharma@microsoft.com>	2024-11-01 22:18:54 +05:30
Gabriela Cervantes	c4089df9d2	gha: Add missing steps in Kata stability workflow This PR adds missing steps in the gha run script for the kata stability workflow. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-10-30 19:13:15 +00:00
Hyounggyu Choi	dca69296ae	Merge pull request #10476 from BbolroC/switch-to-kubeadm-s390x gha: Switch KUBERNETES from k3s to kubeadm on s390x	2024-10-30 09:52:06 +01:00
GabyCT	8539cd361a	Merge pull request #10462 from GabyCT/topic/increstress tests: Increase time to run stressng k8s tests	2024-10-29 11:08:47 -06:00
Hyounggyu Choi	238f67005f	tests: Add `kubeadm` option for KUBERNETES in gha-run.sh When creating a k8s cluster via kubeadm, the devmapper setup for containerd requires a different configuration. This commit introduces a new `kubeadm` option for the KUBERNETES variable and adjusts the path to the containerd config file for devmapper setup. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-10-29 14:19:42 +01:00
stevenhorsman	b1cffb4b09	Revert "tests: Add trap statement in kata doc script" This reverts commit `093a6fd542`. as it is breaking the static checks Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-10-29 09:57:18 +00:00
Fabiano Fidêncio	b70d7c1aac	tests: Enable measured rootfs tests for qemu-coco-dev Then it's on pair with what's being tested with TEEs using a rootfs image. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2024-10-28 12:43:54 +01:00
Fabiano Fidêncio	7d202fc173	tests: Re-enable measured_rootfs test for TDX As we're now building everything needed to test TDX with measured rootfs support, let's bring this test back in (for TDX only, at least for now). Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2024-10-28 12:43:53 +01:00
Fabiano Fidêncio	eb07a809ce	tests: Add a helper script to use prebuild components This is a helper script that does basically what's already being done by the s390x CI, which is: * Move a folder with the components that we were stored / downloaded during the GHA execution to the expected `build` location * Get rid of the dependencies for a specific asset, as the dependencies are already pulled in from previous GHA steps For now this script is only being added but not yet executed anywhere, and that will come as the next step in this series. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2024-10-28 12:43:52 +01:00
Gabriela Cervantes	a3ef8c0a16	tests: Increase time to run stressng k8s tests This PR increase the time to run the stressng k8s tests for the CoCo stability CI. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-10-24 16:34:17 +00:00
Gabriela Cervantes	093a6fd542	tests: Add trap statement in kata doc script This PR adds the trap statement into the kata doc script to clean up properly the temporary files. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-10-23 15:56:58 +00:00
alex.lyn	b25538f670	ci: Introduce CI to validate pod hostname Fixes #10422 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2024-10-21 16:32:56 +01:00
GabyCT	b00203ba9b	Merge pull request #10428 from GabyCT/topic/archk8sc gha: Use a arch_to_golang variable to have uniformity	2024-10-17 11:00:59 -06:00
Gabriela Cervantes	f0e0c74fd4	gha: Use a arch_to_golang variable to have uniformity This PR replaces the arch uname -m to use the arch_to_golang variable in the script to have a better uniformity across the script. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-10-15 20:03:09 +00:00
Dan Mihai	ece0f9690e	tests: k8s-inotify: longer pod termination timeout inotify-configmap-pod.yaml is using: "inotifywait --timeout 120", so wait for up to 180 seconds for the pod termination to be reported. Hopefully, some of the sporadic errors from #10413 will be avoided this way: not ok 1 configmap update works, and preserves symlinks waitForProcess "${wait_time}" "$sleep_time" "${command}" failed Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-10-15 16:01:25 +00:00
Dan Mihai	ccfb7faa1b	tests: k8s-inotify.bats: don't leak configmap Delete the configmap if the test failed, not just on the successful path. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-10-15 16:01:25 +00:00
Leonard Cohnen	c06bf2e3bb	genpolicy: read binaryData value as String While Kubernetes defines `binaryData` as `[]byte`, when defined in a YAML file the raw bytes are base64 encoded. Therefore, we need to read the YAML value as `String` and not as `Vec<u8>`. Fixes: #10410 Signed-off-by: Leonard Cohnen <lc@edgeless.systems>	2024-10-14 20:03:11 +02:00
Fabiano Fidêncio	cf5d3ed0d4	kbs: ita: Ensure the proper image / image_tag is used for ITA When dealing with a specific release, it was easier to just do some adjustments on the image that has to be used for ITA without actually adding a new entry in the versions.yaml. However, it's been proven to be more complicated than that when it comes to dealing with staged images, and we better explicitly add (and update) those versions altogether to avoid CI issues. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2024-10-10 10:01:33 +02:00
Fabiano Fidêncio	3f7ce1d620	Merge pull request #10401 from stevenhorsman/kbs-deploy-overlays-update Kbs deploy overlays update	2024-10-10 09:50:19 +02:00
Fabiano Fidêncio	091ad2a1b2	ci: mariner: Ensure kernel_params can be set The reason we're doing this is because mariner image uses, by default, cgroups default-hierarchy as `unified` (aka, cgroupsv2). In order to keep the same initrd behaviour for mariner, let's enforce that `SYSTEMD_CGROUP_ENABLE_LEGACY_FORCE=1 systemd.legacy_systemd_cgroup_controller=yes systemd.unified_cgroup_hierarchy=0` is passed to the kernel cmdline, at least for now. Other tests that are setting `kernel_params` are not running on mariner, then we're safe taking this path as it's done as part of this PR. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2024-10-09 18:23:35 +02:00
Fabiano Fidêncio	3bbf3c81c2	ci: mariner: Use the image instead of the initrd As an image has been added for mariner as part of the commit `63c1f81c2`, let's start using it in the CI, instead of using the initrd. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2024-10-09 18:23:32 +02:00
stevenhorsman	8763880e93	tests/k8s: kbs: Update overlays logic In https://github.com/confidential-containers/trustee/pull/521 the overlays logic was modified to add non-SE s390x support and simplify non-ibm-se platforms. We need to update the logic in `kbs_k8s_deploy` to match and can remove the dummying of `IBM_SE_CREDS_DIR` for non-SE now Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-10-09 09:39:41 +01:00
ChengyuZhu6	a94024aedc	tests: add test for sealed file secrets add a test for sealed file secrets. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-10-08 16:01:48 +08:00
Dan Mihai	6d5fc898b8	tests: k8s: AUTO_GENERATE_POLICY=yes for local testing The behavior of Kata CI doesn't change. For local testing using kubernetes/gha-run.sh and AUTO_GENERATE_POLICY=yes: 1. Before these changes users were forced to use: - SEV, SNP, or TDX guests, or - KATA_HOST_OS=cbl-mariner 2. After these changes users can also use other platforms that are configured with "shared_fs = virtio-fs" - e.g., - KATA_HOST_OS=ubuntu + KATA_HYPERVISOR=qemu Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-10-04 18:26:00 +00:00
Dan Mihai	5aaef8e6eb	Merge pull request #10376 from microsoft/danmihai1/auto-generate-just-for-ci gha: enable AUTO_GENERATE_POLICY where needed	2024-10-04 10:52:31 -07:00
Dan Mihai	1a4928e710	gha: enable AUTO_GENERATE_POLICY where needed The behavior of Kata CI doesn't change. For local testing using kubernetes/gha-run.sh: 1. Before these changes: - AUTO_GENERATE_POLICY=yes was always used by the users of SEV, SNP, TDX, or KATA_HOST_OS=cbl-mariner. 2. After these changes: - Users of SEV, SNP, TDX, or KATA_HOST_OS=cbl-mariner must specify AUTO_GENERATE_POLICY=yes if they want to auto-generate policy. - These users have the option to test just using hard-coded policies (e.g., using the default policy built into the Guest rootfs) by using AUTO_GENERATE_POLICY=no. AUTO_GENERATE_POLICY=no is the default value of this env variable. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-10-02 23:20:33 +00:00
Gabriela Cervantes	973b8a1d8f	k8s:kbs: Add trap statement to clean up tmp files This PR adds the trap statement in the confidential kbs script to clean up temporary files and ensure we are leaving them. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-10-02 19:59:08 +00:00
Steve Horsman	8412c09143	Merge pull request #10371 from fidencio/topic/k8s-tdx-re-enable-empty-dir-tests k8s: tests: Re-enable empty-dirs tests for TDX / coco-qemu-dev	2024-10-02 18:41:19 +01:00
Dan Mihai	9a8341f431	Merge pull request #10370 from microsoft/danmihai1/k8s-policy-rc tests: k8s-policy-rc: remove default UID from YAML	2024-10-02 09:32:17 -07:00
GabyCT	a1d380305c	Merge pull request #10369 from GabyCT/topic/egrepfastf metrics: Update fast footprint script to use grep	2024-10-02 10:10:12 -06:00
Fabiano Fidêncio	b3ed7830e4	k8s: tests: Re-enable empty-dirs tests for TDX / coco-qemu-dev The tests is disabled for qemu-coco-dev / qemu-tdx, but it doesn't seen to actually be failing on those. Plus, it's passing on SEV / SNP, which means that we most likely missed re-enabling this one in the past. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2024-10-01 20:51:01 +02:00
Hyounggyu Choi	4ccf1f29f9	tests: Skip k8s-block-volume.bats for qemu-runtime-rs Currently, `qemu-runtime-rs` does not support `virtio-scsi`, which causes the `k8s-block-volume.bats` test to fail. We should skip this test until `virtio-scsi` is supported by the runtime. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-10-01 09:09:47 +02:00
Dan Mihai	3b24219310	tests: k8s-policy-rc: remove default UID from YAML The nginx container seems to error out when using UID=123. Depending on the timing between container initialization and "kubectl wait", the test might have gotten lucky and found the pod briefly in Ready state before nginx errored out. But on some of the nodes, the pod never got reported as Ready. Also, don't block in "kubectl wait --for=condition=Ready" when wrapping that command in a waitForProcess call, because waitForProcess is designed for short-lived commands. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-10-01 00:10:30 +00:00
Saul Paredes	94bc54f4d2	Merge pull request #10340 from microsoft/saulparedes/validate_create_sandbox_storages genpolicy: validate create sandbox storages	2024-09-30 14:24:56 -07:00
Dan Mihai	7fe44d3a3d	genpolicy: validate create sandbox storages Reject any unexpected values from the CreateSandboxRequest storages field. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-09-30 11:31:12 -07:00
Gabriela Cervantes	52ef092489	metrics: Update fast footprint script to use grep This PR updates the fast footprint script to remove the use of egrep as this command has been deprecated and change it to use grep command. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-09-30 17:43:08 +00:00
Aurélien Bombo	c037ac0e82	tests: Add k8s-block-volume test This imports the k8s-block-volume test from the tests repo and modifies it slightly to set up the host volume on the AKS host. This is a follow-up to #7132. Fixes: #7164 Signed-off-by: Salvador Fuentes <salvador.fuentes@intel.com> Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2024-09-30 10:58:30 -05:00
Fabiano Fidêncio	66bcfe7369	k8s: kbs: Properly delete ita kustomization The ita kustomization for Trustee, as well as previously used one (DCAP), doesn't have a $(uname -m) directory after the deployment directory name. Let's follow the same logic used for the deploy-kbs script and clean those up accordingly. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2024-09-27 21:47:29 +02:00
Gabriela Cervantes	bafa527be0	ci: tdx: Test attestation with ITTS Intel Tiber Trust Services (formerly known as Intel Trust Authority) is Intel's own attestation service, and we want to take advantage of the TDX CI in order to ensure ITTS works as expected. In order to do so, let's replace the former method used (DCAP) to use ITTS instead. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com> Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2024-09-27 21:47:25 +02:00
Hyounggyu Choi	01d460ac63	tests: Add teardown_common() to tests_common.sh There are many similar or duplicated code patterns in `teardown()`. This commit consolidates them into a new function, `teardown_common()`, which is now called within `teardown()`. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-09-26 13:56:36 +02:00
Hyounggyu Choi	e8d1feb25f	tests: Validate node name for exec_host() The current `exec_host()` accepts a given node name and creates a node debugger pod, even if the name is invalid. This could result in the creation of an unnecessary pending pod (since we are using nodeAffinity; if the given name does not match any actual node names, the pod won’t be scheduled), which wastes resources. This commit introduces validation for the node name to prevent this situation. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-09-26 13:20:50 +02:00
Hyounggyu Choi	57e8cbff6f	tests: Delete custom node debugger pod on EXIT It was observed that the custom node debugger pod is not cleaned up when a test times out. This commit ensures the pod is cleaned up by triggering the cleanup on EXIT, preventing any debugger pods from being left behind. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-09-25 20:36:05 +02:00
Hyounggyu Choi	c70588fafe	tests: Use custom-node-debugger pod With #10232 merged, we now have a persistent node debugger pod throughout the test. As a result, there’s no need to spawn another debugger pod using `kubectl debug`, which could lead to false negatives due to premature pod termination, as reported in #10081. This commit removes the `print_node_journal()` call that uses `kubectl debug` and instead uses `exec_host()` to capture the host journal. The `exec_host()` function is relocated to `tests/integration/kubernetes/lib.sh` to prevent cyclical dependencies between `tests_common.sh` and `lib.sh`. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-09-24 17:25:24 +02:00
Hyounggyu Choi	2c2941122c	tests: Fail fast in assert_pod_fail() `assert_pod_fail()` currently calls `k8s_create_pod()` to ensure that a pod does not become ready within the default 120s. However, this delays the test's completion even if an error message is detected earlier in the journal. This commit removes the use of `k8s_create_pod()` and modifies `assert_pod_fail()` to fail as soon as the pod enters a failed state. All failing pods end up in one of the following states: - CrashLoopBackOff - ImagePullBackOff The function now polls the pod's state every 5 seconds to check for these conditions. If the pod enters a failed state, the function immediately returns 0. If the pod does not reach a failed state within 120 seconds, it returns 1. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-09-24 16:09:20 +02:00
Hyounggyu Choi	2d6ac3d85d	tests: Re-enable guest-pull-image tests for qemu-coco-dev Now that the issue with handling loop devices has been resolved, this commit re-enables the guest-pull-image tests for `qemu-coco-dev`. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-09-20 14:37:43 +02:00
Hyounggyu Choi	c6b86e88e4	tests: Increase timeouts for qemu-coco-dev in trusted image storage tests Timeouts occur (e.g. `create_container_timeout` and `wait_time`) when using qemu-coco-dev. This commit increases these timeouts for the trusted image storage test cases Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-09-20 14:37:43 +02:00
Hyounggyu Choi	9cff9271bc	tests: Run all commands in _loop_device() using exec_host() If the host running the tests is different from the host where the cluster is running, the _loop_device() functions do not work as expected because the device is created on the test host, while the cluster expects the device to be local. This commit ensures that all commands for the relevant functions are executed via exec_host() so that a device should be handled on a cluster node. Additionally, it modifies exec_host() to return the exit code of the last executed command because the existing logic with `kubectl debug` sometimes includes unexpected characters that are difficult to handle. `kubectl exec` appears to properly return the exit code for a given command to it. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-09-20 14:37:43 +02:00
Hyounggyu Choi	374b8d2534	tests: Create and delete node debugger pod only once Creating and deleting a node debugger pod for every `exec_host()` call is inefficient. This commit changes the test suite to create and delete the pod only once, globally. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-09-20 14:37:43 +02:00
Hyounggyu Choi	aedf14b244	tests: Mimic node debugger with full privileges This commit addresses an issue with handling loop devices via a node debugger due to restricted privileges. It runs a pod with full privileges, allowing it to mount the host root to `/host`, similar to the node debugger. This change enables us to run tests for trusted image storage using the `qemu-coco-dev` runtime class. Fixes: #10133 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-09-20 14:37:43 +02:00
Fabiano Fidêncio	593cbb8710	Merge pull request #10306 from microsoft/danmihai1/more-security-contexts genpolicy: get UID from PodSecurityContext	2024-09-18 21:33:39 +02:00
Sumedh Alok Sharma	18c887f055	agent-ctl: Add SetPolicy support This patch adds support to call kata agents SetPolicy API. Also adds tests for SetPolicy API using agent-ctl. Fixes #9711 Signed-off-by: Sumedh Alok Sharma <sumsharma@microsoft.com>	2024-09-18 10:53:49 +05:30
Fabiano Fidêncio	da2377346d	Merge pull request #10323 from stevenhorsman/update-kubectl-release-url kata-deploy: Switch Kubernetes URL	2024-09-17 20:47:17 +02:00
stevenhorsman	c0d35a66aa	ci: kata-deploy: Update kubectil install URL The `deploy_k0s` and `deploy_k3s` kubectl installs aren't failing yet, but let get ahead of this and bump them as well Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-09-17 15:35:42 +01:00
Sumedh Alok Sharma	cefba08903	agent: add support to provide default agent policy via env agent built with policy feature initializes the policy engine using a policy document from a default path, which is installed & linked during UVM rootfs build. This commit adds support to provide a default agent policy as environment variable. This targets development/testing scenarios where kata-agent is wanted to be started as a local process. Fixes #10301 Signed-off-by: Sumedh Alok Sharma <sumsharma@microsoft.com>	2024-09-16 18:05:21 +05:30
stevenhorsman	aa9f21bd19	test: Add support for s390x in cosign testing We've added s390x test container image, so add support to use them based on the arch the test is running on Fixes: #10302 Signed-off-by: stevenhorsman <steven@uk.ibm.com> fixuop	2024-09-16 09:20:57 +01:00
stevenhorsman	3087ce17a6	tests: combined pod yaml creation for CoCo tests This commit brings some public parts of image pulling test series like encrypted image pulling, pulling images from authenticated registry and image verification. This would help to reduce the cost of maintainance. Co-authored-by: stevenhorsman <steven@uk.ibm.com> Signed-off-by: Xynnn007 <xynnn@linux.alibaba.com> Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-09-16 09:20:57 +01:00
Xynnn007	c80c8d84c3	test: add cosign signature verificaton tests Close #8120 Case 1 Create a pod from an unsigned image, on an insecureAcceptAnything registry works. Image: quay.io/prometheus/busybox:latest Policy rule: ``` "default": [ { "type": "insecureAcceptAnything" } ] ``` Case 2 Create a pod from an unsigned image, on a 'restricted registry' is rejected. Image: ghcr.io/confidential-containers/test-container-image-rs:unsigned Policy rule: ``` "quay.io/confidential-containers/test-container-image-rs": [ { "type": "sigstoreSigned", "keyPath": "kbs:///default/cosign-public-key/test" } ] ``` Case 3 Create a pod from a signed image, on a 'restricted registry' is successful. Image: ghcr.io/confidential-containers/test-container-image-rs:cosign-signed Policy rule: ``` "ghcr.io/confidential-containers/test-container-image-rs": [ { "type": "sigstoreSigned", "keyPath": "kbs:///default/cosign-public-key/test" } ] ``` Case 4 Create a pod from a signed image, on a 'restricted registry', but with the wrong key is rejected Image: ghcr.io/confidential-containers/test-container-image-rs:cosign-signed-key2 Policy: ``` "ghcr.io/confidential-containers/test-container-image-rs": [ { "type": "sigstoreSigned", "keyPath": "kbs:///default/cosign-public-key/test" } ] ``` Case 5 Create a pod from an unsigned image, on a 'restricted registry' works if enable_signature_verfication is false Image: ghcr.io/kata-containers/confidential-containers:unsigned image security enable: false Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-09-16 09:20:57 +01:00
Dan Mihai	5777869cf4	tests: k8s-policy-rc: add unexpected UID test Change pod runAsUser value of a Replication Controller after generating the RC's policy, and verify that the RC pods get rejected due to this change. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-09-13 22:05:31 +00:00
Dan Mihai	6773f14667	tests: k8s-policy-job: add unexpected UID test Change pod runAsUser value of a Job after generating the Job's policy, and verify that the Job gets rejected due to this change. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-09-13 22:05:31 +00:00
Dan Mihai	124f01beb3	tests: k8s-policy-deployment: add bad UID test Change pod runAsUser value of a Deployment after generating the Deployment's policy, and verify that the Deployment fails due to this change. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-09-13 22:05:31 +00:00
Dan Mihai	5badc30a69	Merge pull request #10316 from microsoft/danmihai1/k8s-inotify tests: k8s-inotify: pod termination polling	2024-09-13 15:02:38 -07:00
GabyCT	6f363bba18	Merge pull request #10304 from GabyCT/topic/fixcricont tests: Fix indentation in the cri containerd tests	2024-09-13 14:49:12 -06:00
Dan Mihai	d3127af9c5	tests: k8s-inotify: pod termination polling Poll/wait for pod termination instead of sleeping 2 minutes. This change typically saves ~90 seconds in my test cluster. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-09-13 17:12:55 +00:00
Hyounggyu Choi	4c933a5611	tests: Introduce retry mechanism for helm install Kata-deploy often fails due to a transiently unreachable k8s cluster for the qemu-coco-dev test on s390x. (e.g. https://github.com/kata-containers/kata-containers/actions/runs/10831142906/job/30058527098?pr=10009) This commit introduces a retry mechanism to mitigate these failures by retrying the command two more times with a 10-second interval as a workaround. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-09-13 14:03:44 +02:00
Dan Mihai	0c5ac042e7	tests: k8s-policy-pod: add workaround for #10297 If the CI platform being tested doesn't support yet the prometheus container image: - Use busybox instead of prometheus. - Skip the test cases that depend on the prometheus image. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-09-12 18:26:38 +00:00
Gabriela Cervantes	0346b32a90	tests: Fix indentation in the cri containerd tests This PR fixes the indentation in the cri containerd tests as we have in several places a misalignment in the script. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-09-12 16:18:34 +00:00
Dan Mihai	94d95fc055	tests: k8s-policy-pod: test container UID changes Add test cases for changing container UID after generating the policy. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-09-11 22:38:20 +00:00
Dan Mihai	db1ca4b665	tests: k8s-policy-pod: remove UID workaround Remove the workaround for #9928, now that genpolicy is able to convert user names from container images into the corresponding UIDs from these images. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-09-11 22:38:20 +00:00
Dan Mihai	eb7f747df1	genpolicy: enable create container UID verification Disabling the UID Policy rule was a workaround for #9928. Re-enable that rule here and add a new test/CI temporary workaround for this issue. This new test workaround will be removed after fixing #9928. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-09-11 22:38:20 +00:00
Dan Mihai	71ede4ea3f	tests: k8s-policy-pod: use prometheus container Change quay.io/prometheus/busybox to quay.io/prometheus/prometheus in this test. The prometheus image will be helpful for testing the future fix for #9928 because it specifies user = "nobody". Also, change: sh -c "ls -l /" to: echo -n "readinessProbe with space characters" as the test readinessProbe command line. Both include a command line argument containing space characters, but "sh -c" behaves differently when using the prometheus container image (causes the readinessProbe to time out, etc.). Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-09-11 22:38:20 +00:00
GabyCT	614328f342	Merge pull request #10295 from GabyCT/topic/removeimgvar metrics: Remove unused remove img var in common script	2024-09-11 15:02:39 -07:00
GabyCT	095c5ed961	Merge pull request #10289 from GabyCT/topic/enablestresst tests: Enable stressng k8s stability test for Kata CoCo CI	2024-09-11 10:47:33 -07:00
Fabiano Fidêncio	97ecdabde9	Merge pull request #10294 from fidencio/topic/bring-ita-support Bump guest-components / trustee to a version that supports ITA	2024-09-11 19:45:48 +02:00
Gabriela Cervantes	fdaf12d16c	metrics: Remove unused remove img var in common script This PR removes the remove_img variable in the metrics common script as it is not being used. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-09-11 17:45:18 +00:00
Gabriela Cervantes	04d1122a46	tests: Decrease iterations in soak test This PR decreases the number of iterations in the kubernetes soak test as this is already taking more than 2 hours for the kata coco ci stability. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-09-11 17:39:06 +00:00
Gabriela Cervantes	c48c6f974e	tests: Enable stressng k8s stability test for Kata CoCo CI This PR enables the stressng k8s stability test for Kata CoCo CI. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-09-11 17:38:13 +00:00
Fabiano Fidêncio	1178fe20e9	tests: Adapt error parser for failed image decryption With an older version of image-rs, we were getting the following error: ``` Message: failed to create containerd task: failed to create shim task: failed to handle layer: failed to get decrypt key no suitable key found for decrypting layer key: ``` However, with the version of image-rs we are bumping to, the error comes as: ``` Message: failed to create containerd task: failed to create shim task: failed to handle layer: failed to get decrypt key Caused by: no suitable key found for decrypting layer key: keyprovider: failed to unwrap key by ttrpc ``` Due to this change, I'm splitting the check in two different ones. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2024-09-11 17:07:56 +02:00
Dan Mihai	66dda37877	Merge pull request #10271 from Sumynwa/sumsharma/agent_ctl_issue_9689_local agent-ctl: Refactor CopyFile Handler	2024-09-11 07:35:09 -07:00
Fabiano Fidêncio	3946aa7283	ci: tdx: Adapt how we get the host IP In the process of switching the TDX CI machine we've noticed that `hostname -i` in one of the machines returns an one and only IP address, while in another machine it returns a full list of IPs. As we're only interested in the first one, let's adapt the code to always return the first one. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2024-09-11 09:31:43 +02:00
Sumedh Alok Sharma	b4bbbf65c6	ci: Do not start CDH/attestation procs with kata-agent as local process. Since CDH/attestation related processes and its dependencies are not fully available, the setup fails to start kata-agent as local process. This fix removes these procs to prevent kata-agent from trying to start them. Signed-off-by: Sumedh Alok Sharma <sumsharma@microsoft.com>	2024-09-11 11:53:59 +05:30
Sumedh Alok Sharma	8045a7a2ba	ci: Install policy document on host to run kata-agent as local process. The test setup starts kata-agent as a local process without the UVM. The agent policy initialization fails due to missing policy document at `/etc/kata-opa/default-policy.rego`. The fix - installs a relaxed `allow-all.rego` policy document - cleans up the install during exit Signed-off-by: Sumedh Alok Sharma <sumsharma@microsoft.com>	2024-09-11 11:25:08 +05:30
Sumedh Alok Sharma	822f898433	ci: Install bats as dependencies Install bats as part of dependencies for running the tests. Signed-off-by: Sumedh Alok Sharma <sumsharma@microsoft.com>	2024-09-11 10:57:15 +05:30
Sumedh Alok Sharma	2c774fb207	ci: Add tests for CopyFile api. This commit introduces test cases for testing CopyFile API using kata-agent-ctl with improved command semantics and handling. - copy a file to /run/kata-containers - copy symlink to /run/kata-containers - copy directory to /run/kata-containers - copy file to /tmp - copy large file to /run/kata-containers Signed-off-by: Sumedh Alok Sharma <sumsharma@microsoft.com>	2024-09-11 10:54:01 +05:30
Gabriela Cervantes	5a52fe1a75	tests: Increase timeout to wait for soak stability test deployment This PR increases the timeout to wait that the deployment for the soak stability test is ready in order to avoid random failures saying that the deployment is not ready yet. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-09-09 16:13:40 +00:00
GabyCT	37ddb837c4	Merge pull request #10267 from GabyCT/topic/updatemlcomments metrics: Update openVINO and oneDNN tests references	2024-09-06 09:42:21 -06:00
Dan Mihai	1885478e2e	Merge pull request #10270 from Sumynwa/sumsharma/enable_agent_tests_in_ci ci: Enable kata agent API tests	2024-09-05 14:24:49 -07:00
Sumedh Alok Sharma	e1ac2f4416	ci: Enable kata agent api tests This commit enables running tests for kata agent apis. The 'api-tests' directory will contain bats test files for individual APIs. Fixes #10269 Signed-off-by: Sumedh Alok Sharma <sumsharma@microsoft.com>	2024-09-06 00:02:55 +05:30
GabyCT	4b257bcbb6	Merge pull request #10255 from Sumynwa/sumsharma/metrics_ci_kill_kata_components ci: send SIGKILL to kill kata components	2024-09-05 12:04:57 -06:00
Aurélien Bombo	cc9aeee81a	Merge pull request #10263 from Sumynwa/sumsharma/add_ci_workflow ci: Add workflow to run kata-agent api tests using kata-agent-ctl	2024-09-05 09:32:34 -07:00
Dan Mihai	7ab95b56f1	Merge pull request #10251 from microsoft/saulparedes/support_readonly_hostpath genpolicy: support readonly hostpath	2024-09-05 09:27:15 -07:00
GabyCT	deb6d12ff6	Merge pull request #10237 from GabyCT/topic/k8soakcoco tests: Enable k8s soak stability test for Kata CoCo CI	2024-09-05 09:56:48 -06:00
Gabriela Cervantes	fcc35dd3a7	metrics: Update openVINO and oneDNN tests references This PR updates the machine learning tests references or urls for the openVINO and oneDNN scripts as currently they are refering to a different performance benchmark. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-09-05 15:39:21 +00:00
Fabiano Fidêncio	70491ff29f	Merge pull request #10244 from BbolroC/turn-on-kbs-qemu-coco-dev-s390x gha: Turn on KBS for qemu-coco-dev on s390x	2024-09-05 13:02:42 +02:00
Sumedh Alok Sharma	ad66f4dfc9	ci: Add workflow to run kata-agent api tests using kata-agent-ctl enable CI to add test cases for testing kata-agent APIs. This commit introduces: - a workflow to run tests - setup scripts to prepare the test environment Fixes #10262 Signed-off-by: Sumedh Alok Sharma <sumsharma@microsoft.com>	2024-09-05 14:38:29 +05:30
Saul Paredes	24c2d13fd3	genpolicy: support readonly emptyDir mount Set emptyDir access based on volume mount readOnly value Signed-off-by: Saul Paredes <saulparedes@microsoft.com>	2024-09-04 15:05:44 -07:00
Saul Paredes	36a4104753	genpolicy: support readonly hostpath Set hostpath access based on volume mount readOnly value Signed-off-by: Saul Paredes <saulparedes@microsoft.com>	2024-09-04 14:55:22 -07:00
Sumedh Alok Sharma	4025468e27	ci: send SIGKILL to kill kata components metrics tests sometimes fail with kata components still running. sending SIGKILL and waiting for the processes to reap. Fixes #8651 Signed-off-by: Sumedh Alok Sharma <sumsharma@microsoft.com>	2024-09-04 18:58:17 +05:30
Fabiano Fidêncio	13517cf9c1	Merge pull request #10192 from fidencio/topic/helm-add-post-delete-job helm: Several fixes, including some reasonable re-work on kata-deploy.sh script	2024-09-04 09:34:57 +02:00
Fabiano Fidêncio	a773797594	ci: Pass --debug to helm Just to make ourlives a little bit easier. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2024-09-03 23:08:22 +02:00
Wainer dos Santos Moschetta	3b23d62635	tests/k8s: fix wait for pods on deploy-kata action On commit `51690bc157` we switched the installation from kubectl to helm and used its `--wait` expecting the execution would continue when all kata-deploy Pods were Ready. It turns out that there is a limitation on helm install that won't wait properly when the daemonset is made of a single replica and maxUnavailable=1. In order to fix that issue, let's revert the changes partially to keep using kubectl and waitForProcess to the exection while Pods aren't Running. Fixes #10168 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-09-03 23:08:22 +02:00
Fabiano Fidêncio	40f8aae6db	Reapply "ci: make cleanup_kata_deploy really simple" This reverts commit `21f9f01e1d`, as the pacthes for helm are coming as part of this series. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2024-09-03 23:08:22 +02:00
Fabiano Fidêncio	cfe6e4ae71	Reapply "ci: Use helm to deploy kata-deploy" (partially) This reverts commit `36f4038a89`, as the pacthes for helm are coming as part of this series. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2024-09-03 23:08:22 +02:00
Fabiano Fidêncio	424347bf0e	Reapply "kata-deploy: Add Helm Chart" (partially) This reverts commit `b18c3dfce3`, as the pacthes for helm are coming as part of this series. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2024-09-03 23:08:22 +02:00
Gabriela Cervantes	5b0ab7f17c	metrics: Remove metrics report for Kata Containers This PR removes the metrics report which is not longer being used in Kata Containers. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-09-03 16:11:07 +00:00
Hyounggyu Choi	b0a912b8b4	tests: Enable KBS deployment for qemu-coco-dev on s390x To deploy KBS on s390x, the environment variable `IBM_SE_CREDS_DIR` must be exported, and the corresponding directory must be created. This commit enables KBS deployment for `qemu-coco-dev`, in addition to the existing `qemu-se` support on the platform. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-09-03 15:51:18 +02:00
Fabiano Fidêncio	e8657c502d	Revert "CI: Add tests for stdio" This reverts commit `704da86e9b`, as the tests never became stable to run. This was discussed and agreed with the maintainer. Conflicts: .github/workflows/basic-ci-amd64.yaml tests/integration/stdio/gha-run.sh Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2024-09-03 11:52:30 +02:00
Gabriela Cervantes	825cb2d22e	tests: Enable k8s soak stability test for Kata CoCo CI This PR enables the k8s soak stability test to run on the weekly Kata CoCo stability CI. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-09-02 16:30:44 +00:00
Gabriela Cervantes	aa8635727d	metrics: Remove unused variable in oneDNN benchmark This PR removes an unused variable in oneDNN metrics benchmark. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-08-29 15:52:47 +00:00
GabyCT	dd9f41547c	Merge pull request #10160 from microsoft/saulparedes/support_priority_class genpolicy: add priorityClassName as a field in PodSpec interface	2024-08-28 14:36:20 -06:00
Archana Choudhary	ae2cdedba8	genpolicy: add priorityClassName as a field in PodSpec interface This allows generation of policy for pods specifying priority classes. Signed-off-by: Archana Choudhary <archana1@microsoft.com>	2024-08-27 19:54:02 -07:00
Gabriela Cervantes	3affde5b28	docs: Add oneDNN benchmark information to metrics README This PR adds the oneDNN benchmark information to the machine learning metrics README. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-08-27 16:32:50 +00:00
Aurélien Bombo	a3dba3e82b	ci: reinstate Mariner host GH-9592 addressed a bug in a previous version of the AKS Mariner host kernel that blocked the CH v39 upgrade. This bug has now been fixed so we undo that PR. Note we also specify a different OCI version for Mariner as it differs from Ubuntu's. Fixes: #9594 Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2024-08-26 21:07:25 +00:00
GabyCT	6b0272d6bf	Merge pull request #10193 from GabyCT/topic/k8ssoak stability: Add kubernetes parallel test	2024-08-23 15:51:01 -06:00
GabyCT	83177efb9b	Merge pull request #10201 from GabyCT/topic/readmeopenvino metrics: Add OpenVINO general information into README	2024-08-23 14:11:26 -06:00
Hyounggyu Choi	4cd83d2b98	Merge pull request #10202 from BbolroC/fix-k8s-tests-s390x tests: Fix k8s test issues on s390x	2024-08-23 09:51:11 +02:00
Archana Shinde	b0be03a93f	Revert "tests: add image check before running coco tests" This reverts commit `41b7577f08`. We were seeing a lot of issues in the TDX CI of the nature: "Error: failed to create containerd container: create instance 470: object with key "470" already exists: unknown" With the TDX CI, we moved to having the nydus snapsotter pre-installed. Essentially the `deploy-snapshotter` step was performed once before any actual CI runs. We were seeing failures related to the error message above. On reverting this change, we are no longer seeing errors related to "key exists" with the TDX CI passing now. The change reverted here is related to downloading incomplete images, but this seems to be messing up TDX CI. It is possible to pass --snapshotter to `ctr image check` but that does not seem to have any effect on the data set returned. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org> Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2024-08-22 18:05:42 -07:00
Gabriela Cervantes	2fa8e85439	metrics: Add OpenVINO general information into README This PR adds the OpenVINO benchmark general information into the machine learning README metrics information. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-08-22 16:08:06 +00:00
Hyounggyu Choi	274de8c6af	tests: Introduce wait_time to k8s_create_pod() In certain environments (e.g., those with lower performance), `k8s_create_pod()` may require additional wait time, especially when dealing with large images. Since `k8s_wait_pod_be_ready()` — which is called by `k8s_create_pod()` — already accepts `wait_time` as a second argument, it makes sense to introduce `wait_time` to `k8s_create_pod()` and propagate it to the callee. This commit adds `wait_time` to `k8s_create_pod()` as the 2nd (optional) argument. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-08-22 17:46:53 +02:00
Hyounggyu Choi	5d7397cc69	tests: Load confidential_kbs.sh in k8s-guest-pull-iamge.bats Some of the tests call set_metadata_annotation() for updating the kernel parameters. For `kata-qemu-se`, repack_secure_image() is called which is defined in `lib_se.sh` and sourced by `confidential_kbs.sh`. This commit ensures that the function call chain for the relevant `KATA_HYPERVISOR` is properly handled. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-08-22 17:33:38 +02:00
GabyCT	3fd108b09a	Merge pull request #10198 from GabyCT/topic/remvaropenvino metrics: Remove unused variable in openvino script	2024-08-21 15:48:56 -06:00
Dan Mihai	8ccc8a8d0b	Merge pull request #9911 from microsoft/saulparedes/mounts genpolicy: deny UpdateEphemeralMountsRequest	2024-08-21 10:12:28 -07:00
Gabriela Cervantes	59e31baaee	metrics: Remove unused variable in openvino script This PR removes an unused variable in the openvino script for kata metrics. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-08-21 16:05:55 +00:00
Gabriela Cervantes	27d5539954	stability: Add pod deployment yaml for soak test This PR adds the pod deployment yaml for soak test which is part of the stability k8s tests. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-08-21 14:23:22 +00:00
Dan Mihai	6654491cc3	genpolicy: deny UpdateEphemeralMountsRequest * genpolicy: deny UpdateEphemeralMountsRequest Deny UpdateEphemeralMountsRequest by default, because paths to critical Guest components can be redirected using such request. Signed-off-by: Dan Mihai <Daniel.Mihai@microsoft.com>	2024-08-20 18:28:17 -07:00
Gabriela Cervantes	c04a805215	stability: Add kubernetes parallel test This PR adds a kubernetes parallel test that will launch multiple replicas from a kubernetes deployment and we will iterate this multiple times to verify that we are able to do this using CoCo Kata. This test will be part of the CoCo Kata stability CI. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-08-20 23:24:22 +00:00
Fabiano Fidêncio	b18c3dfce3	Revert "kata-deploy: Add Helm Chart" (partially) This partially reverts commit `94b3348d3c`, as there's more work needed in order to have this one done in a robust way, and we are taking the safer path of reverting for now, and adding it back as soon as the release is cut out. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2024-08-21 00:09:11 +02:00
Fabiano Fidêncio	36f4038a89	Revert "ci: Use helm to deploy kata-deploy" (partially) This partially reverts commit `51690bc157`, as there's more work needed in order to have this one done in a robust way, and we are taking the safer path of reverting for now, and adding it back as soon as the release is cut out. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2024-08-21 00:09:11 +02:00
Fabiano Fidêncio	21f9f01e1d	Revert "ci: make cleanup_kata_deploy really simple" This reverts commit `1221ab73f9`, as there's more work needed in order to have this one done in a robust way, and we are taking the safer path of reverting for now, and adding it back as soon as the release is cut out. Signed-off-by: Fabiano Fidêncio <fabiano@fidencio.org>	2024-08-21 00:09:11 +02:00
GabyCT	e0bff7ed14	Merge pull request #10177 from GabyCT/topic/cocoghas gha: Add k8s stability Kata CoCo GHA workflow	2024-08-20 15:12:29 -06:00
Fabiano Fidêncio	aeb6f54979	Merge pull request #10180 from fidencio/topic/ci-ensure-the-key-was-created-on-kbs ci: Ensure the KBS resources are created	2024-08-20 09:07:56 +02:00
Fabiano Fidêncio	40d385d401	Merge pull request #10188 from wainersm/kbs_key tests/k8s: check and save kbs.key	2024-08-19 23:29:10 +02:00
Fabiano Fidêncio	c0d7222194	ci: Ensure the KBS resources are created Otherwise we may have tests failing due to the resource not being created yet. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-08-19 23:27:06 +02:00
Wainer dos Santos Moschetta	e014eee4e8	tests/k8s: check and save kbs.key The deploy-kbs.sh script generates the kbs.key that's used to install KBS. This same file is used lately by kbs-client to authenticate. This ensures that the file was created, otherwise fail. Another problem solved here is that on bare-metal machines the key doesn't survive a reboot as it is created in a temporary directory (/tmp/trustee). So let's save the file to a non-temporary location. Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-08-19 16:03:03 -03:00
Fabiano Fidêncio	dd2d9e5524	ci: stdio: Fix typo on getting the containerd version I assume the PR that introduced this was based on an older version of yq, and as the test couldn't run before it got merged we never noticed the error. However, this test has been failing for a reasonable amount of time, which makes me think that we either need a maintainer for it, or just remove it completely, but that's a discussion for another day. For now, let's make it, at least, run. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-08-17 14:06:24 +02:00
Fabiano Fidêncio	0831081399	ci: k8s: Replace nginx alpine images The previous ones are gone, so let's switch to our own multi-arch image for the tests. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-08-17 12:19:33 +02:00
Dan Mihai	79c1d0a806	Merge pull request #10136 from microsoft/danmihai1/docker-image-volume2 genpolicy: add bind mounts for image volumes	2024-08-16 13:07:01 -07:00
Fabiano Fidêncio	28aa4314ba	Merge pull request #10175 from ChengyuZhu6/error_message runtime: Add specific error message for gRPC request timeouts	2024-08-16 22:06:49 +02:00
Gabriela Cervantes	6ea34f13e1	gha: Add k8s stability Kata CoCo GHA workflow This PR adds the k8s stability Kata CoCo GHA workflow to run weekly the k8s stability tests. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-08-16 16:14:15 +00:00
Dan Mihai	c22ac4f72c	genpolicy: add bind mounts for image volumes Add bind mounts for volumes defined by docker container images, unless those mounts have been defined in the input K8s YAML file too. For example, quay.io/opstree/redis defines two mounts: /data /node-conf Before these changes, if these mounts were not defined in the YAML file too, the auto-generated policy did not allow this container image to start. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-08-16 15:11:05 +00:00
Fabiano Fidêncio	6c58ae5b95	Merge pull request #10171 from fidencio/topic/ci-treat-nydus-snapshotter-as-a-dep ci: nydus: Treat the snapshotter as a dependency	2024-08-16 16:39:48 +02:00
ChengyuZhu6	1eda6b7237	tests: update error message with guest pulling image timeout update error message with guest pulling image timeout. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-08-16 20:26:33 +08:00
Chengyu Zhu	ba3c484d12	Merge pull request #9999 from ChengyuZhu6/trusted-storage Trusted image storage	2024-08-16 15:39:50 +08:00
Aurélien Bombo	e1775e4719	Merge pull request #10164 from BbolroC/make-exec_host-stable tests: Ensure exec_host() consistently captures command output	2024-08-15 21:43:32 -07:00
Fabiano Fidêncio	3733266a60	ci: nydus: Treat the snapshotter as a dependency Instead of deploying and removing the snapshotter on every single run, let's make sure the snapshotter is always deploy on the TDX case. We're doing this as an experiment, in order to see if we'll be able to reduce the failures we've been facing with the nydus snapshotter. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-08-15 22:44:30 +02:00
Hyounggyu Choi	ba3e5f6b4a	Revert "tests: Disable k8s file volume test" This reverts commit `e580e29246`. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-08-15 21:10:39 +02:00
Hyounggyu Choi	758e650a28	tests: Ensure exec_host() consistently captures command output The `exec_host()` function often fails to capture the output of a given command because the node debugger pod is prematurely terminated. To address this issue, the function has been refactored to ensure consistent output capture by adjusting the `kubectl debug` process as follows: - Keep the node debugger pod running - Wait until the pod is fully ready - Execute the command using `kubectl exec` - Capture the output and terminate the pod This commit refactors `exec_host()` to implement the above steps, improving its reliability. Fixes: #10081 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-08-15 21:10:39 +02:00
Dan Mihai	905c76bd47	Merge pull request #10153 from microsoft/saulparedes/support_cron_job genpolicy: Add support for cron jobs	2024-08-15 11:11:00 -07:00
ChengyuZhu6	6ecb2b8870	tests: skip test trusted storage in qemu-coco-dev I can't set up loop device with `exec_host`, which the command is necessary for qemu-coco-dev. See issue #10133. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-08-15 20:32:44 +08:00
ChengyuZhu6	51b9d20d55	tests: update error message in pulling image encrypted tests Update error message in pulling image encrypted to "failed to get decrypt key no suitable key found for decrypting layer key". Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-08-15 20:32:44 +08:00
ChengyuZhu6	c5a973e68c	tests:k8s: add tests for guest pull with configured timeout add tests for guest pull with configured timeout: 1) failed case: Test we cannot pull a large image that pull time exceeds a short creatcontainer timeout(10s) inside the guest 2) successful case: Test we can pull a large image inside the guest with increasing createcontainer timeout(120s) Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-08-15 13:55:22 +08:00
ChengyuZhu6	6c506cde86	tests:k8s: add tests for pull images in the guest using trusted storage add tests for pull images in the guest using trusted storage: 1) failed case: Test we cannot pull an image that exceeds the memory limit inside the guest 2) successful case: Test we can pull an image inside the guest using trusted ephemeral storage. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-08-15 13:55:22 +08:00
GabyCT	ecfbc9515a	Merge pull request #10158 from GabyCT/topic/k8sstabil tests: Add kubernetes stability test	2024-08-14 14:44:49 -06:00
Gabriela Cervantes	d48ad94825	tests: Add kubernetes stability test This PR adds a k8s stability test that will be part of the CoCo Kata stability tests that will run weekly. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-08-14 15:30:49 +00:00
Fupan Li	506977b102	Merge pull request #10156 from GabyCT/topic/disablevolume tests: Disable k8s file volume test	2024-08-14 12:00:47 +08:00
Gabriela Cervantes	e580e29246	tests: Disable k8s file volume test This PR disables the k8s file volume test as we are having random failures in multiple GHA CIs mainly because the exec_host function sometimes does it not work properly. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-08-13 20:50:18 +00:00
Saul Paredes	af598a232b	tests: add test for cron job support Add simple test for cron job support Signed-off-by: Saul Paredes <saulparedes@microsoft.com>	2024-08-13 10:47:42 -07:00
Gabriela Cervantes	bdca5ca145	tests: Add kubernetes stress-ng tests This PR adds kubernetes stress-ng tests as part of the stability testing for kata. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-08-13 16:23:52 +00:00
Steve Horsman	91084058ae	Merge pull request #10007 from wainersm/run_k8s_on_free_runners ci: Transition GARM tests to free runners, pt. II	2024-08-12 18:12:18 +01:00
ChengyuZhu6	c3a0ab4b93	tests:k8s: Re-enable and refactor the tests with guest pull Currently, setting `io.containerd.cri.runtime-handler` annotation in the yaml is not necessary for pulling images in the guest. All TEE hypervisors are already running tests with guest-pulling enabled. Therefore, we can remove some duplicate tests and re-enable the guest-pull test for running different runtime pods at the same time. While considering to support different containerd version, I recommend to keep setting "io.containerd.cri.runtime-handler". Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-08-12 16:36:54 +08:00
Gabriela Cervantes	5e5fc145cd	tests: Update ubuntu image for stress Dockerfile This PR updates the ubuntu image for stress Dockerfile. The main purpose is to have a more updated image compared with the one that is in libpod which has not been updated in a while. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-08-09 15:29:10 +00:00
GabyCT	584d7a265e	Merge pull request #10127 from GabyCT/topic/execimage tests:k8s: Update image in kubectl debug for the exec host function	2024-08-07 17:00:52 -06:00
Wainer dos Santos Moschetta	dfb92e403e	tests/k8s: add "deploy-kata"/"cleanup" actions to gh-run.sh These new "kata-deploy" and "cleanup" actions are equivalent to "kata-deploy-garm" "cleanup-garm", respectively, and should be used on the workflows being migrated from GARM to Github's managed runners. Eventually "kata-deploy-garm" and "cleanup-garm" won't be used anymore then we will be able to remove them. See: #9940 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2024-08-07 15:20:23 -03:00
Gabriela Cervantes	d0ca43162d	tests:k8s: Update image in kubectl debug for the exec host function This PR updates the image that we are using in the kubectl debug command as part of the exec host function, as the current alpine image does not allow to create a temporary file for example and creates random kubernetes failures. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-08-06 21:13:46 +00:00
Zvonko Kaiser	1221ab73f9	ci: make cleanup_kata_deploy really simple Remove the unneeded logic for cleanup the values are encapsulated in the deployed helm release Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2024-08-06 11:57:04 +02:00
Zvonko Kaiser	51690bc157	ci: Use helm to deploy kata-deploy Rather then modifying the kata-depoy scripts let's use Helm and create a values.yaml that can be used to render the final templates Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2024-08-06 11:57:04 +02:00
Zvonko Kaiser	94b3348d3c	kata-deploy: Add Helm Chart For easier handling of kata-deploy we can leverage a Helm chart to get rid of all the base and overlays for the various components Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2024-08-06 11:57:04 +02:00
Fabiano Fidêncio	89f1581e54	ci: Enable encrypted image tests for TEEs After experimenting a little bit with those tests, they seem to be passing on all the available TEE machines. With this in mind, let's just enable them for those machines. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-08-03 09:27:32 +02:00
GabyCT	aadde2c25b	Merge pull request #10120 from kata-containers/fix_metrics_json_results_file Fix metrics json results file	2024-08-02 11:29:02 -06:00
Dan Mihai	2628b34435	Merge pull request #10098 from microsoft/danmihai1/allow-failing agent: fix the AllowRequestsFailingPolicy functionality	2024-08-02 08:42:47 -07:00
GabyCT	8da5f7a72f	Merge pull request #10102 from ChengyuZhu6/fix-debug tests: Fix error with `kubectl debug`	2024-08-02 09:25:13 -06:00
Fabiano Fidêncio	551e0a6287	Merge pull request #10116 from GabyCT/topic/kbsdependencies tests: kbs: Add missing dependencies to install kbs cli	2024-08-02 14:22:28 +02:00
Fabiano Fidêncio	4183680bc3	Merge pull request #10107 from fidencio/topic/rotate-journal-logs-every-run tests: k8s: Rotate & cleanup journal for every run	2024-08-02 07:27:10 +02:00
David Esparza	dcd0c0b269	metrics: Remove duplicated headers from results file. This PR removes duplicated entries (vcpus count, and available memory), from onednn and openvino results files. Fixes: #10119 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2024-08-01 18:11:06 -06:00
ChengyuZhu6	2eac8fa452	tests: Fix error with `kubectl debug` The issue is similar to #10011. The root cause is that tty and stderr are set to true at same time in containerd: #10031. Fixes: #10081 Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-08-02 07:32:30 +08:00
David Esparza	1e640ec3a6	metrics: fix pargins json results file. This PR encloses the search string for 'default_vcpus =' and 'default_memory =' with double quotes in order to parse the precise values, which are included in the kata configuration file. Fixes: #10118 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2024-08-01 17:05:03 -06:00
Dan Mihai	c2a55552b2	agent: fix the AllowRequestsFailingPolicy functionality 1. Use the new value of AllowRequestsFailingPolicy after setting up a new Policy. Before this change, the only way to enable AllowRequestsFailingPolicy was to change the default Policy file, built into the Guest rootfs image. 2. Ignore errors returned by regorus while evaluating Policy rules, if AllowRequestsFailingPolicy was enabled. For example, trying to evaluate the UpdateInterfaceRequest rules using a policy that didn't define any UpdateInterfaceRequest rules results in a "not found" error from regorus. Allow AllowRequestsFailingPolicy := true to bypass that error. 3. Add simple CI test for AllowRequestsFailingPolicy. These changes are restoring functionality that was broken recently by commmit `df23eb09a6`. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-08-01 22:37:18 +00:00
GabyCT	20a88b6470	Merge pull request #10099 from GabyCT/topic/fixmemo metrics: Update memory tests to use grep -F	2024-08-01 13:48:36 -06:00
Fabiano Fidêncio	aef7da7bc9	tests: k8s: Rotate & cleanup journal for every run This will help to avoid huge logs, and allow us to debug issues in a better way. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2024-08-01 21:36:57 +02:00
Gabriela Cervantes	7454908690	metrics: Update memory tests to use grep -F This PR updates the memory tests like fast footprint to use grep -F instead of fgrep as this command has been deprecated. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-08-01 17:20:57 +00:00
Gabriela Cervantes	d72cb8ccfc	tests: kbs: Add missing dependencies to install kbs cli This PR adds missing packages depenencies to install kbs cli in a fresh new baremetal environment. This will avoid to have a failure when trying to run install-kbs-client. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-08-01 17:09:50 +00:00
ChengyuZhu6	41b7577f08	tests: add image check before running coco tests Currently, there are some issues with pulling images in CI, such as : https://github.com/kata-containers/kata-containers/actions/runs/10109747602/job/27959198585 This issue is caused by switching between different snapshotters for the same image in some scenarios. To resolve it, we can check existing images to ensure all content is available locally before running tests. Fixes: #10029 Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-07-31 14:47:33 +08:00
Adithya Krishnan Kannan	fdf7036d5e	ci: Fix rate limit error by migrating busybox_image Changing the busybox_image from docker to quay to fix rate limit errors. Signed-Off-By: Adithya Krishnan Kannan <AdithyaKrishnan.Kannan@amd.com>	2024-07-30 22:32:22 -05:00
Dan Mihai	3e348e9768	tests: k8s: rename hard-coded policy test script Rename k8s-exec-rejected.bats to k8s-policy-hard-coded.bats, getting ready to test additional hard-coded policies using the same script. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-07-26 20:14:05 +00:00
Dan Mihai	7b691455c2	tests: k8s: hard-coded policy for any platform Users of AUTO_GENERATE_POLICY=yes: - Already tested auto-generated policy on any platform. - Will be able to test hard-coded policy too on any platform, after this change. CI continues to test hard-coded policies just on the platforms listed here, but testing those policies locally (outside of CI) on other platforms can be useful too. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-07-26 19:30:03 +00:00
Dan Mihai	83056457d6	tests: k8s-policy-pod: avoid word splitting Avoid potential word splitting when using array of command args array. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-07-26 18:55:52 +00:00
Dan Mihai	5546ce4031	Merge pull request #10069 from microsoft/danmihai1/exec-args genpolicy: validate each exec command line arg	2024-07-26 11:39:44 -07:00
Archana Shinde	d7637f93f9	Merge pull request #9899 from amshinde/multiple-networks-fix Fix issue while adding multiple networks with nerdctl	2024-07-25 11:56:27 -07:00
Dan Mihai	a37f10fc87	genpolicy: validate each exec command line arg Generate policy that validates each exec command line argument, instead of joining those args and validating the resulting string. Joining the args ignored the fact that some of the args might include space characters. The older format from genpolicy-settings.json was similar to: "ExecProcessRequest": { "commands": [ "sh -c cat /proc/self/status" ], "regex": [] }, That format will not be supported anymore. genpolicy will detect if its users are trying to use the older "commands" field and will exit with a relevant error message in that case. The new settings format is: "ExecProcessRequest": { "allowed_commands": [ [ "sh", "-c", "cat /proc/self/status" ] ], "regex": [] }, Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-07-25 16:57:17 +00:00
Dan Mihai	0f11384ede	tests: k8s-policy-pod: exec_command clean-up Use "${exec_command[@]}" for calling both: - add_exec_to_policy_settings - kubectl exec Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-07-25 16:55:03 +00:00
Dan Mihai	95b78ecaa9	tests: k8s-exec: reuse sh_command variable Reuse sh_command variable instead of repeading "sh". Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-07-25 16:50:34 +00:00
Dan Mihai	c3adeda3cc	Merge pull request #10051 from microsoft/danmihai1/exec-variable-reuse tests: k8s: reuse policy exec variable	2024-07-24 14:58:40 -07:00
Aurélien Bombo	f08b594733	Merge pull request #9576 from microsoft/saulparedes/support_env_from genpolicy: Add support for envFrom	2024-07-24 13:39:54 -07:00
Archana Shinde	64d6293bb0	tests:Add nerdctl test for testing with multiple netwokrs Add integration test that creates two bridge networks with nerdctl and verifies that Kata container is brought up while passing the networks created. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2024-07-24 10:45:56 -07:00
Dan Mihai	fecb70b85e	tests: k8s: reuse policy exec variable Share a single test script variable for both: - Allowing a command to be executed using Policy settings. - Executing that command using "kubectl exec". Fixes: #10014 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-07-24 17:42:04 +00:00
Gabriela Cervantes	3d17a7038a	metrics: Update launch times to use grep -F This PR updates the metrics launch times to use grep -F instead of fgrep as this command has been deprecated. Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2024-07-23 17:13:52 +00:00
Dan Mihai	f26d595e5d	Merge pull request #9910 from microsoft/saulparedes/set_policy_rego_via_env tools: Allow setting policy rego file via	2024-07-22 11:00:30 -07:00
Hyounggyu Choi	c774cd6bb0	Merge pull request #10031 from ChengyuZhu6/fix-log-contain-tdx tests: Fix missing log on TDX	2024-07-20 07:26:08 +02:00
ChengyuZhu6	6ea6e85f77	tests: Re-enable authenticated image tests on tdx Try to re-enable authenticated image tests on tdx. Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com>	2024-07-20 12:10:02 +08:00
ChengyuZhu6	3476fb481e	tests: Fix missing log on TDX Currently, we have found that `assert_logs_contain` does not work on TDX. We manually located the specific log, but it fails to get the log using `kubectl debug`. The error found in CI is: ``` warning: couldn't attach to pod/node-debugger-984fee00bd70.jf.intel.com-pdgsj, falling back to streaming logs: error stream protocol error: unknown error ``` Upon debugging the TDX CI machine, we found an error in containerd: ``` Attach container from runtime service failed" err="rpc error: code = InvalidArgument desc = tty and stderr cannot both be true" containerID="abc8c7a546c5fede4aae53a6ff2f4382ff35da331bfc5fd3843b0c8b231728bf" ``` We believe this is the root cause of the test failures in TDX CI. Therefore, we need to ensure that tty and stderr are not set to true at same time. Fixes: #10011 Signed-off-by: ChengyuZhu6 <chengyu.zhu@intel.com> Signed-off-by: Wang, Arron <arron.wang@intel.com>	2024-07-20 12:10:01 +08:00
Dan Mihai	3127dbb3df	Merge pull request #10035 from microsoft/danmihai1/k8s-credentials-secrets tests: k8s-credentials-secrets: policy for second pod	2024-07-19 12:44:21 -07:00
Saul Paredes	2681fc7eb0	genpolicy: Add support for envFrom This change adds support for the `envFrom` field in the `Pod` resource Signed-off-by: Saul Paredes <saulparedes@microsoft.com>	2024-07-19 09:53:58 -07:00
GabyCT	be2d4719c2	Merge pull request #10040 from kata-containers/fix_blogbench_midvalues metrics: update avg reference values for blogbench.	2024-07-19 09:51:29 -06:00
Dan Mihai	44e443678d	Merge pull request #9835 from microsoft/saulparedes/test_policy_on_sev gha: enable autogenerated policy testing on SEV and SEV-SNP	2024-07-19 07:46:01 -07:00
ms-mahuber	ddff762782	tools: Allow setting policy rego file via environment variable * Set policy file via env var * Add restrictive policy file to kata-opa folder * Change restrictive policy file name * Change relative default path location * Add license headers Signed-off-by: Saul Paredes <saulparedes@microsoft.com>	2024-07-18 15:05:45 -07:00
David Esparza	60f52a4b93	metrics: update avg reference values for blogbench. This PR updates the Blogbench reference values for read and write operations used in the CI check metrics job. This is due to the update to version 1.2 of blobench. Fixes: #10039 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2024-07-18 15:47:14 -06:00
Greg Kurz	fc4357f642	Merge pull request #10034 from BbolroC/hide-repack_secure_image-from-test tests: Call repack_secure_image() in set_metadata_annotation()	2024-07-18 23:03:41 +02:00
Aurélien Bombo	ab6f37aa52	Merge pull request #10022 from microsoft/danmihai1/probes-and-lifecycle genpolicy: container.exec_commands args validation	2024-07-18 12:21:31 -07:00
Steve Horsman	256ab50f1a	Merge pull request #9959 from sprt/fix-ci-cleanup ci: cleanup: Ignore nonexisting resources	2024-07-18 19:23:48 +01:00
David Esparza	1fdc5c1183	Merge pull request #10028 from amshinde/upgrade-blogbench-1.2 metric: Upgrade blogbench to 1.2	2024-07-18 11:30:17 -06:00
Hyounggyu Choi	a7e4d3b738	tests: Call repack_secure_image() in set_metadata_annotation() It is not good practice to call repack_secure_image() from a bats file because the test code might not consider cases where `qemu-se` is used as `KATA_HYPERVISOR`. This commit moves the function call to set_metadata_annotation() if a key includes `kernel_params` and `KATA_HYPERVISOR` is set to `qemu-se`, allowing developers to focus on the test scenario itself. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-07-18 18:09:45 +02:00
Dan Mihai	035a42baa4	tests: k8s-credentials-secrets: policy for second pod Add policy to pod-secret-env.yaml from k8s-credentials-secrets.bats. Policy was already auto-generated for the other pod used by the same test (pod-secret.yaml). pod-secret-env.yaml was inconsistent, because it was taking advantage of the "allow all" policy built into the Guest image. Sooner or later, CI Guests for CoCo will not get the "allow all" policy built in anymore and pod-secret-env.yaml would have stopped working then. Note that pod-secret-env.yaml continues to use an "allow all" policy after these changes. #10033 must be solved before a more restrictive policy will be generated for pod-secret-env.yaml. Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-07-18 15:03:57 +00:00
Hyounggyu Choi	6e7ee4bdab	tests: Rebuild secure image for guest-pull-image-authenticated on SE Since #9904 was merged, newly introduced tests for `k8s-guest-pull-image-authenticated.bats` have been failing on IBM SE (s390x). The agent fails to start because a kernel parameter cannot pass to the guest VM via annotation. To fix this, the boot image must be rebuilt with updated parameters. This commit adds the rebuilding step in create_pod_yaml_with_private_image() for `qemu-se`. Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2024-07-18 14:56:12 +02:00
Saul Paredes	57d2ded3e2	gha: enable autogenerated policy testing on SEV-SNP Enable autogenerated policy testing on SEV-SNP Signed-off-by: Saul Paredes <saulparedes@microsoft.com>	2024-07-17 13:32:06 -07:00
Archana Shinde	30e5e88ff1	metric: Upgrade blogbench to 1.2 Move to blogbench 1.2 version from 1.1. This version includes an important fix for the read_score test which was reported to be broken in the previous version. It essentially fixes this issue here: https://github.com/jedisct1/Blogbench/issues/4 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2024-07-17 11:32:09 -07:00
Saul Paredes	b3cc8b200f	gha: enable autogenerated policy testing on SEV Enable autogenerated policy testing on SEV Signed-off-by: Saul Paredes <saulparedes@microsoft.com>	2024-07-17 09:55:13 -07:00
Dan Mihai	f31c1b121e	Merge pull request #9812 from microsoft/saulparedes/test_policy_on_tdx gha: enable policy testing on TDX	2024-07-17 08:47:44 -07:00
Dan Mihai	449103c7bf	Merge pull request #10020 from microsoft/danmihai1/pod-security-context tests: fix ps command in k8s-security-context	2024-07-17 08:12:57 -07:00
Dan Mihai	0e86a96157	tests: fix ps command in k8s-security-context 1. Use a container image that supports "ps --user 1000 -f". 2. Execute that command using: sh -c "ps --user 1000 -f" instead of passing additional arguments to sh: sh -c ps --user 1000 -f Fixes: #10019 Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-07-17 01:33:31 +00:00
Dan Mihai	9f4d1ffd43	genpolicy: container.exec_commands args validation Keep track of individual exec args instead of joining them in the policy text. Verifying each arg results in a more precise policy, because some of the args might include space characters. This improved validation applies to commands specified in K8s YAML files using: - livenessProbe - readinessProbe - startupProbe - lifecycle.postStart - lifecycle.preStop Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-07-17 01:19:23 +00:00
Dan Mihai	b23ea508d5	tests: k8s: container.exec_commands policy tests Add tests for genpolicy's handling of container.exec_commands. These are commands allowed by the policy and originating from these input K8s YAML fields: - livenessProbe - readinessProbe - startupProbe - lifecycle.postStart - lifecycle.preStop Signed-off-by: Dan Mihai <dmihai@microsoft.com>	2024-07-17 01:19:00 +00:00
stevenhorsman	567b4d5788	test/k8s: Fix up node logging typo We had a typo in the attestation tests that we've copied around a lot and Wainer spotted it in the authenticated registry tests, so let's fix it up now Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-07-16 21:39:31 -03:00
stevenhorsman	0015c8ef51	tests: Add guest-pull auth registry tests Add three new test cases for guest pull from an authenticated registry for the following scenarios: _Scenario: Creating a container from an authenticated image, with correct credentials via KBC works_ Given An authenticated container registry quay.io/kata-containers/confidential-containers-auth And a version of kata deployed with a guest image that has an agent with `guest_pull` feature enabled and nydus-snapshotter installed and configured for [guest-pulling](https://github.com/containerd/nydus-snapshotter/blob/main/misc/snapshotter/config-coco-guest-pulling.toml) And a KBS set up to have the correct auth.json for registry quay.io/kata-containers/confidential-containers-auth embedded in the `"Credential"` section of `its resources file` When I create a pod from the container image quay.io/kata-containers/confidential-containers-auth:test Then The pull image works and the pod can start _Scenario: Creating a container from an authenticated image, with incorrect credentials via KBC fails_ Given An authenticated container registry quay.io/kata-containers/confidential-containers-auth And a version of kata deployed with a guest image that has an agent with `guest_pull` feature enabled and nydus-snapshotter installed and configured for [guest-pulling](https://github.com/containerd/nydus-snapshotter/blob/main/misc/snapshotter/config-coco-guest-pulling.toml) And An installed kata CC with the sample_kbs set up to have the auth.json for registry quay.io/kata-containers/confidential-containers-auth embedded in the `"Credential"` resource, but with a dummy user name and password When I create a pod from the container image quay.io/kata-containers/confidential-containers-auth:test Then The pull image fails with a message that reflects that the authorisation failed _Scenario: Creating a container from an authenticated image, with no credentials fails_ Given An authenticated container registry quay.io/kata-containers/confidential-containers-auth And a version of kata deployed with a guest image that has an agent with `guest_pull` feature enabled and nydus-snapshotter installed and configured for [guest-pulling](https://github.com/containerd/nydus-snapshotter/blob/main/misc/snapshotter/config-coco-guest-pulling.toml) And An installed kata CC with no credentials section When I create a pod from the container image quay.io/kata-containers/confidential-containers-auth:test Then The pull image fails with a message that reflects that the authorisation failed Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2024-07-16 21:39:31 -03:00
Saul Paredes	af49252c69	gha: enable policy testing on TDX Enable policy testing on TDX Signed-off-by: Saul Paredes <saulparedes@microsoft.com>	2024-07-15 14:09:49 -07:00
Dan Mihai	bcaf7fc3b4	Merge pull request #10008 from microsoft/danmihai1/runAsUser genpolicy: add support for runAsUser fields	2024-07-15 12:08:50 -07:00
Dan Mihai	648265d80e	Merge pull request #9998 from microsoft/danmihai1/GENPOLICY_PULL_METHOD tests: k8s: GENPOLICY_PULL_METHOD clean-up	2024-07-15 09:32:29 -07:00

... 3 4 5 6 7 ...

1559 Commits