Gabriela Cervantes
fce2487971
metrics: Add metrics report R files
...
This PR adds the metrics report R files.
Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com >
2023-08-29 16:45:22 +00:00
Gabriela Cervantes
08812074d1
metrics: Add report dockerfile
...
This PR adds the report dockerfile for kata metrics.
Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com >
2023-08-29 16:28:32 +00:00
Gabriela Cervantes
69781fc027
metrics: Add metrics report script
...
This PR adds metrics report script for kata metrics.
Fixes #7782
Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com >
2023-08-29 16:25:14 +00:00
Chao Wu
e4fb20c74a
Merge pull request #7585 from lifupan/main
...
dragonball: vsock add fifo/pipe stream support for passed fd hybridSt…
2023-08-29 23:39:21 +08:00
Fabiano Fidêncio
50e51bcafe
Merge pull request #7185 from UnmeshDeodhar/add-cc-sev-test
...
tests: Add confidential test
2023-08-29 15:32:25 +02:00
Fabiano Fidêncio
e286e842c1
tests: Expand confidential test to support TDX
...
Let's expand the confidential test to also support TDX.
The main difference on the test, though, is that we're not grepping for
a string in the `dmesg` output, but rather relying on `cpuid` to detect
a TDX guest.
Fixes : #7184
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com >
2023-08-29 14:10:47 +02:00
Unmesh Deodhar
e31f099be1
tests: Expand confidential test to support SNP
...
Let's expand the confidential test to also support SNP.
Fixes : #7184
Signed-off-by: Unmesh Deodhar <udeodhar@amd.com >
2023-08-29 14:10:47 +02:00
Unmesh Deodhar
c3b9d4945e
tests: Add confidential test for SEV
...
Add a test case for the launch of unencrypted confidential
container, verifying that we are running inside a TEE.
Right now the test only works with SEV, but it'll be expanded in the
coming commits, as part of this very same series.
Fixes : #7184
Signed-Off-By: Unmesh Deodhar <udeodhar@amd.com >
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com >
2023-08-29 14:10:34 +02:00
Fabiano Fidêncio
708b0a3052
Merge pull request #7768 from fidencio/topic/update-tdx-to-the-6.2-kernel-based-stack
...
tdx: Update the components needed for using the 6.2 kernel stack
2023-08-28 19:27:15 +02:00
Fabiano Fidêncio
3818bf3311
local-build: Remove $HOME/.docker/buildx/activity/default
...
The file can be removed between builds without causing any issue, and
leaving it around has been causing us some headache due to:
```
ERROR: open /home/runner/.docker/buildx/activity/default: permission denied
```
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com >
2023-08-28 13:41:36 +02:00
Fabiano Fidêncio
d1b54ede29
qemu: tdx: Workaround SMP issue with TDX 1.5
...
`...,sockets=1,cores=numvcpus,threads=1,...` must be used.
Fixes : #7770
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com >
2023-08-28 13:41:36 +02:00
Archana Shinde
1e34220c41
qemu: tdx: Adapt to the TDX 1.5 stack
...
QEMU for TDX 1.5 makes use of private memory map/unmap.
Make changes to govmm to support this. Support for private backing fd
for memory is added as knob to the qemu config.
Userspace's map/unmap operations are done by fallocate() ioctl on the
backing store fd.
Reference:
https://lore.kernel.org/linux-mm/20220519153713.819591-1-chao.p.peng@linux.intel.com/
Fixes : #7770
Signed-off-by: Archana Shinde <archana.m.shinde@intel.com >
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com >
2023-08-28 13:41:36 +02:00
Fabiano Fidêncio
8115a0522d
versions: tdx: Update Kernel to 6.2 + TDX
...
This is the version that's been used and tested inside Intel, and it
matches with https://github.com/intel/tdx-tools/releases/tag/2023ww15 .
Fixes : #7770
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com >
2023-08-28 13:11:34 +02:00
Fabiano Fidêncio
ec18180f34
versions: tdx: Update TDVF to the "edk2-stable202302"
...
This is the version that's been used and tested inside Intel, and it
matches with https://github.com/intel/tdx-tools/releases/tag/2023ww15 .
Fixes : #7770
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com >
2023-08-28 13:11:34 +02:00
Fabiano Fidêncio
9803b24286
versions: tdx: Update QEMU to v7.2 + TDX v1.10
...
This is the version that's been used and tested inside Intel, and it
matches with https://github.com/intel/tdx-tools/releases/tag/2023ww15 .
Fixes : #7770
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com >
2023-08-28 13:11:27 +02:00
Fabiano Fidêncio
02a08c956b
Merge pull request #7754 from microsoft/danmihai1/pod-quota-deployment
...
tests: delete k8s deployment at the test's end
2023-08-27 17:52:00 +02:00
Fabiano Fidêncio
98037ced52
Merge pull request #7755 from microsoft/danmihai1/unique-test-name
...
tests: use unique test name
2023-08-27 17:27:40 +02:00
Zhongtao Hu
f0440a9cfe
Merge pull request #7742 from frezcirno/fix-log-forwarder-loop
...
runtime-rs: check peer close in log_forwarder
2023-08-26 10:44:09 +08:00
Fabiano Fidêncio
16a610d788
Merge pull request #7758 from fidencio/topic/gha-avoid-fail-fast-till-everything-is-ultra-stable
...
gha: Avoid "fail-fast" in tests that are known to be flaky
2023-08-25 16:49:26 +02:00
Jiang Liu
91db888d83
Merge pull request #7602 from jiangliu/agent-storage
...
Refine storage device management for kata-agent
2023-08-25 22:20:18 +08:00
Zixuan Tan
dffc16e5b3
runtime-rs: check peer close in log_forwarder
...
The log_forwarder task does not check if the peer has closed, causing a
meaningless loop during the period of “kata vm exit”, when the peer
closed, and “ShutdownContainer RPC received” that aborts the log forwarder.
This patch fixes the problem.
Fixes : #7741
Signed-off-by: Zixuan Tan <tanzixuan.me@gmail.com >
2023-08-25 19:00:07 +08:00
Jiang Liu
aaa5ab1264
agent: simplify storage device by removing StorageDeviceObject
...
Simplify storage device implementation by removing StorageDeviceObject.
Signed-off-by: Jiang Liu <gerry@linux.alibaba.com >
2023-08-25 17:23:16 +08:00
Fabiano Fidêncio
fb49d5d7ce
gha: Avoid "fail-fast" in tests that are known to be flaky
...
Otherwise we'll have to re-run all the tests due to a flaky behaviour in
one of the parts.
Fixes : #7757
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com >
2023-08-25 10:00:17 +02:00
Dan Mihai
183f51d6f6
tests: use unique test name
...
k8s-pid-ns.bats was already using the test name from
k8s-kill-all-process-in-container.bats - probably a copy/paste bug.
Fixes : #7753
Signed-off-by: Dan Mihai <dmihai@microsoft.com >
2023-08-25 03:41:06 +00:00
Dan Mihai
6a974679f2
tests: delete k8s deployment at the test's end
...
At the end of k8s-kill-all-process-in-container.bats, delete the
deployment it created.
Fixes : #7752
Signed-off-by: Dan Mihai <dmihai@microsoft.com >
2023-08-25 03:34:37 +00:00
David Esparza
686eb3878b
Merge pull request #7751 from GabyCT/topic/unusednhwc
...
metrics: Remove unused variable in tensorflow nhwc script
2023-08-24 18:34:06 -06:00
Fabiano Fidêncio
f1d8e1f513
Merge pull request #7747 from fidencio/topic/kata-deploy-dont-try-to-remove-opt-kata
...
kata-deploy: Don't try to remove /opt/kata
2023-08-24 18:56:52 +02:00
Gabriela Cervantes
32a778b6da
metrics: Remove unused variable in tensorflow nhwc script
...
This PR removes unused variable in tensorflow nhwc script.
Fixes #7750
Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com >
2023-08-24 15:54:27 +00:00
David Esparza
875a85ee14
Merge pull request #7736 from GabyCT/topic/tensorflowfp32
...
metrics: Add TensorFlow ResNet50 FP32 benchmark
2023-08-24 08:56:24 -06:00
Fabiano Fidêncio
d8f3ce6497
kata-deploy: Don't try to remove /opt/kata
...
The directory is a host path mount and cannot be removed from within the
container. What we actually want to remove is whatever is inside that
directory.
This may raise errors like:
```
rm: cannot remove '/opt/kata/': Device or resource busy
```
Fixes : #7746
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com >
2023-08-24 13:57:36 +02:00
Jeremi Piotrowski
71c90b994a
Merge pull request #7745 from jepio/vfio-part-0
...
gha: vfio: Run on Ubuntu 23.04 runner
2023-08-24 12:15:19 +02:00
Greg Kurz
9991772b26
Merge pull request #7718 from littlejawa/fix_filemode_when_zero
...
kata-agent: use default filemode for block device when it is set to 0
2023-08-24 11:40:28 +02:00
Jeremi Piotrowski
936e8091a7
gha: vfio: Run on Ubuntu 23.04 runner
...
The vfio test requires nested-nested virtualization:
L0 Azure host
-> L1 Ubuntu VM
-> L2 Fedora VM
-> L3 Kata
This hits a kernel bug on v5.15 but works quite nicely on the v6.2 kernel
included in Ubuntu 23.04. We can switch back to Ubuntu 22.04 when they roll out
v6.2.
Fixes : #6555
Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com >
2023-08-24 10:10:02 +02:00
Jiang Liu
0e7248264d
agent: move storage device related code into dedicated files
...
Move storage device related code into dedicated files.
Signed-off-by: Jiang Liu <gerry@linux.alibaba.com >
2023-08-24 13:48:51 +08:00
Jiang Liu
8f49ee33b2
agent: refine storage related code a bit
...
Refine storage related code by:
- remove the STORAGE_HANDLER_LIST
- define type alias
- move code near to its caller
Signed-off-by: Jiang Liu <gerry@linux.alibaba.com >
2023-08-24 13:09:10 +08:00
Jiang Liu
60ca12ccb0
agent: switch to new storage subsystem
...
Switch to new storage subsystem to create a StorageDevice for each
storage object.
Fixes : #7614
Signed-off-by: Jiang Liu <gerry@linux.alibaba.com >
2023-08-24 13:09:09 +08:00
Jiang Liu
fcbda0b419
kata-types: introduce StorageDevice and StorageHandlerManager
...
Introduce StorageDevice and StorageHandlerManager, which will be used
to refine storage device management for kata-agent.
Signed-off-by: Jiang Liu <gerry@linux.alibaba.com >
2023-08-24 13:08:55 +08:00
Jiang Liu
b03b1f6134
agent: simplify the way to manage storage object
...
Simplify the way to manage storage objects, and introduce
StorageStateCommon structures for coming extensions.
Signed-off-by: Jiang Liu <gerry@linux.alibaba.com >
2023-08-24 12:58:24 +08:00
Jiang Liu
8392c71bf2
sys-util: support more mount flags in parse_mount_options()
...
Support more mount flags in parse_mount_options().
Signed-off-by: Jiang Liu <gerry@linux.alibaba.com >
2023-08-24 12:17:39 +08:00
Jiang Liu
c00d8f3d48
agent: use create_mount_destination() from kata-sys-util
...
Use create_mount_destination() from kata-sys-util crate to reduce
redundant code.
Signed-off-by: Jiang Liu <gerry@linux.alibaba.com >
2023-08-24 12:17:38 +08:00
Jiang Liu
5e867f0538
types: add more mount related constants
...
Add more mount related constants.
Signed-off-by: Jiang Liu <gerry@linux.alibaba.com >
2023-08-24 12:17:36 +08:00
Jiang Liu
880e6c9a76
agent: use function from kata-sys-utils to reduce code
...
Use function get_linux_mount_info() from kata-sys-util crate to share
common code.
Signed-off-by: Jiang Liu <gerry@linux.alibaba.com >
2023-08-24 12:17:34 +08:00
QuanweiZhou
a6921dd837
Merge pull request #7698 from jiangliu/virtual-volume
...
kata-types: introduce KataVirtualVolume to support nydus, direct volume and image pull
2023-08-24 11:50:39 +08:00
Fabiano Fidêncio
7705c5962e
Merge pull request #7728 from ManaSugi/fix/typo-test-toml
...
libs,tests: fix typo disable_guest_seccomp in configuration-anno-1.toml
2023-08-23 23:55:41 +02:00
GabyCT
c1712e1930
Merge pull request #7737 from jepio/fix-local-build
...
local-build: Remove GID before creating group
2023-08-23 12:26:39 -06:00
Jeremi Piotrowski
3b881fbc0e
local-build: Remove GID before creating group
...
docker install now creates a group with gid 999 which happens to match what we
need to get docker-in-docker to work. Remove the group first as we don't need
it.
Fixes : #7726
Signed-off-by: Jeremi Piotrowski <jpiotrowski@microsoft.com >
2023-08-23 18:58:38 +02:00
David Esparza
ebce5d25a9
Merge pull request #7734 from fidencio/topic/kata-deploy-fix-removal
...
kata-deploy: Avoid failing on content removal
2023-08-23 10:29:57 -06:00
Gabriela Cervantes
959ca49447
metrics: Add TensorFlow ResNet50 fp32 Dockerfile
...
This PR adds the TensorFlow ResNet50 fp32 Dockerfile for kata metrics.
Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com >
2023-08-23 16:24:58 +00:00
Gabriela Cervantes
4b7d72c4a8
metrics: Add TensorFlow ResNet50 FP32 benchmark
...
This PR adds TensorFlow ResNet50 FP32 benchmark for kata metrics.
Fixes #7735
Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com >
2023-08-23 16:21:09 +00:00
Fabiano Fidêncio
e7e4cc2182
Merge pull request #7716 from bergwolf/github/image-initrd-assets
...
runtime: fix image and initrd assets handling
2023-08-23 18:02:15 +02:00