kata-containers

mirror of https://github.com/kata-containers/kata-containers.git synced 2025-07-12 14:48:13 +00:00

Author	SHA1	Message	Date
Fabiano Fidêncio	bb78d35db8	kata-sys-util: Fix "match-like-matches-macro" warning As we bumped the rust toolchain to 1.66.0, some new warnings have been raised due to "match-like-matches-macro". Let's fix them all here. For more info about the warnings, please, take a look at: https://rust-lang.github.io/rust-clippy/master/index.html#match_like_matches_macro Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-01-02 14:28:07 +01:00
Fabiano Fidêncio	668e652401	kata-sys-util: Fix unnecessary_cast warnings As we bumped the rust toolchain to 1.66.0, some new warnings have been raised due to unnecessary_cast. Let's fix them all here. For more info about the warnings, please, take a look at: https://rust-lang.github.io/rust-clippy/master/index.html#unnecessary_cast Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-01-02 14:28:07 +01:00
Fabiano Fidêncio	c1a8d89a72	kata-sys-util: Fix needless_borrow warnings As we bumped the rust toolchain to 1.66.0, some new warnings have been raised due to needless_borrow. Let's fix them all here. For more info about the warnings, please, take a look at: https://rust-lang.github.io/rust-clippy/master/index.html#needless_borrow Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-01-02 14:28:07 +01:00
Fabiano Fidêncio	c9c38e6d01	logging: Allow clippy::type-complexity warning As the rust toolchain version bump to its 1.66.0 release raised a warning about the type complexity used for the closure, and that's something we don't want to change, let's ignore such warning in this very specific case. See: https://rust-lang.github.io/rust-clippy/master/index.html#type_complexity Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-01-02 14:28:07 +01:00
Fabiano Fidêncio	ffd6fbb6b6	logging: Fix needless_borrow warnings As we bumped the rust toolchain to 1.66.0, some new warnings have been raised due to needless_borrow. Let's fix them all here. For more info about the warnings, please, take a look at: https://rust-lang.github.io/rust-clippy/master/index.html#needless_borrow Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-01-02 14:18:14 +01:00
Fabiano Fidêncio	60df30015b	protocols: Fix unnecessary_cast warnings As we bumped the rust toolchain to 1.66.0, some new warnings have been raised due to unnecessary_cast. Let's fix them all here. For more info about the warnings, please, take a look at: https://rust-lang.github.io/rust-clippy/master/index.html#unnecessary_cast Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-01-02 14:18:14 +01:00
Danny Canter	56e7b5d0fd	runtime/Makefile: Get some bits happy on darwin Substitution in the yq install script doesn't like zsh, and additionally the version of yq we're using doesn't have a darwin/arm64 build so grab the amd64 version and let rosetta work its magic. Additionally swap to abspath from readlink -m for the printing of what binaries to install, as the -m flag doesn't exist on the BSD variant, and this should be the same behavior. Fixes: #5970 Signed-off-by: Danny Canter <danny@dcantah.dev>	2023-01-02 04:19:58 -08:00
Fabiano Fidêncio	0bbeb34b4c	protocols: Fix needless_borrow warnings As we bumped the rust toolchain to 1.66.0, some new warnings have been raised due to needless_borrow. Let's fix them all here. For more info about the warnings, please, take a look at: https://rust-lang.github.io/rust-clippy/master/index.html#needless_borrow Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-01-02 12:41:29 +01:00
Danny Canter	86ee24b33c	Runtime: Clarify mutability of global var Was about to change `urandomdev` to a constant when I realized it's intentionally mutable so it can be mocked in tests. There's other comments to the same effect so clarify here as well. Fixes: #5965 Signed-off-by: Danny Canter <danny@dcantah.dev>	2023-01-02 01:13:34 -08:00
Zhongtao Hu	dae6670628	kata-runtime: add rust runtime path for kata-runtime exec add rust runtime path for kata-runtime exec Fixes:#5963 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-12-30 13:34:34 +08:00
Chao Wu	a2e3715e01	upcall: remove upcall client when stopping vm In order to avoid resource leak, we need to remove upcall client in vm and vcpu manager when stopping vm. Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-12-28 20:23:39 +08:00
wllenyj	31591d7915	dragonball: fix unit test failure case about Kvm. Due to the wrong use of as_raw_fd, Kvm was dropped twice. Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-12-26 11:32:31 +08:00
wllenyj	2b02e0a9bf	dragonball: add more unit test for vcpu manager Added more unit tests for Vcpu Manager. Fixes: #4899 Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-12-26 11:31:42 +08:00
Yushuo	85f9094f17	agent: refactor guest hooks We have to execute some hooks both in host and guest. And in /libs/kata-sys-util/src/hooks.rs, the common operations are implemented. In this commit, we are going to refactor the code of guest hooks using code in /libs/kata-sys-util/src/hooks.rs. At the same time, we move function valid_env to kata-sys-util to make it usable by both agent and runtime. Fixes: #5857 Signed-off-by: Yushuo <y-shuo@linux.alibaba.com>	2022-12-26 10:15:19 +08:00
Chao Wu	1511587a9a	Merge pull request #5601 from openanolis/hugepage runtime-rs: enable hugepage	2022-12-25 22:35:06 +08:00
Zhongtao Hu	3605062258	runtime-rs: add dbs-upcall feature add dbs-upcall feature to dragonball Fixes:#5949 Depends-on: github.com/kata-containers/tests#5355 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-12-25 19:02:42 +08:00
Bin Liu	03a0c9d78e	kata-ctl: skip test if access GitHub.com fail This commit will call `error_for_status` after `send`, this call will generate errors if status code between 400-499 and 500-599. And sometime access github.com will fail, in this case we can skip the test to prevent the CI failing. Fixes: #5948 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-12-23 15:12:12 +08:00
Bin Liu	1dcbda3f0f	kata-ctl: update Cargo.lock kata-ctl depends on runtime-rs, and this commit: `fbf294da3f` added a new dependency named shim-interface, this Cargo.lock should be updated too. Signed-off-by: Bin Liu <bin@hyper.sh>	2022-12-23 15:06:50 +08:00
Fupan Li	dc9c8d3357	Merge pull request #5901 from justxuewei/fix/mpleak runtime-rs: Clean up mount points shared to guest	2022-12-21 09:59:25 +08:00
Jianyong Wu	3480780bd8	kata-ctl: add check framework support for non-x86 x86 changes the check framework. Enable them for non-x86 accordingly. Fixes: #5923 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2022-12-20 11:41:00 +08:00
Jianyong Wu	1bd533f10b	kata-ctl: let check framework arch-agnostic The current check framework is specific to x86. Refactor the code to make it arch-agnostic. Fixes: #5923 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2022-12-20 11:41:00 +08:00
Bin Liu	0cf443a612	Merge pull request #5915 from openanolis/legacy_device dragonball: refactor legacy device initialization	2022-12-19 13:31:45 +08:00
Xuewei Niu	fd77eebd4d	runtime-rs: fix the issues mentioned in the code review In order to avoid cloning, changed the signature of `ShareFsMount::share_rootfs`, `ShareFsMount::share_volume`, and `ShareFsMount::umount_rootfs` to receive a reference to a config. Fixes: #5898 Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2022-12-19 11:46:50 +08:00
Xuewei Niu	0e69207909	runtime-rs: Clean up mount points shared to guest Fixed issues where shared volumes couldn't umount correctly. The rootfs of each container is cleaned up after the container is killed, except for `NydusRootfs`. `ShareFsRootfs::cleanup()` calls `VirtiofsShareMount::umount_rootfs()` to umount mount points shared to the guest, and umounts the bundle rootfs. Fixes: #5898 Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2022-12-19 11:46:14 +08:00
Bin Liu	e4645642d0	Merge pull request #5877 from openanolis/fix_start_bundle runtime-rs: enable start container from bundle	2022-12-17 08:10:08 +08:00
Yushuo	d14c3af35c	dragonball: refactor legacy device initialization If the serial path is given, legacy_manager should create socket console based on that path. Or the console should be created based on stdio. Fixes: #5914 Signed-off-by: Yushuo <y-shuo@linux.alibaba.com>	2022-12-15 20:55:01 +08:00
Zhongtao Hu	ca39a07a14	runtime-rs: enable start container from bundle enable start container from bundle in this way $ ls ./bundle config.json rootfs $ sudo ctr run -d --runtime io.containerd.kata.v2 --config bundle/config.json test_kata Fixes:#5872 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-12-15 17:28:13 +08:00
Peng Tao	ebb73df6bc	Merge pull request #5899 from Bevisy/fix-outdated-comments shim: return hypervisor's pid not shim's pid	2022-12-15 14:55:54 +08:00
Chao Wu	fad229b853	Merge pull request #5875 from Ji-Xinyou/xyji/refactor-shim-mgmt refactor(shim-mgmt): move client side to libs	2022-12-15 10:59:45 +08:00
Alex	b5cfd09583	kata-ctl: Fixed format for check release options Fixed formatting for check release options Fixes: #5345 Signed-off-by: Alex <alee23@bu.edu> Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2022-12-14 09:42:57 -06:00
James O. D. Hunt	2e15af777c	Merge pull request #5786 from alexlee-23/main kata-ctl: check: only-list-releases and include-all-releases options	2022-12-14 11:25:36 +00:00
Ji-Xinyou	fbf294da3f	refactor(shim-mgmt): move client side to libs The client side is moved to libs. This is to solve the problem that including clients will bring about messy dependencies. Fixes: #5874 Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2022-12-14 17:42:25 +08:00
Peng Tao	856d4b7361	Merge pull request #5798 from pmores/qemu-support basic framework for QEMU support in runtime-rs	2022-12-14 15:05:33 +08:00
Binbin Zhang	99485d871c	shim: return hypervisor's pid not shim's pid update outdated code comments Fixes: #3234 Signed-off-by: Binbin Zhang <binbin36520@gmail.com>	2022-12-14 11:16:11 +08:00
Chao Wu	bb4be2a666	Merge pull request #5690 from yipengyin/fix-virtiofsd runtime-rs: fix standalone share fs	2022-12-14 00:16:10 +08:00
Pavel Mores	1f28ff6838	runtime-rs: add binary to exercise shim proper w/o containerd dependencies After building the binary as usual with `cargo build` run it as follows. It needs a configuration.toml in which only qemu keys `path`, `kernel` and `initrd` will initially need to be set. Point them to respective files e.g. from a kata distribution tarball. It also needs to be launched from an exported container bundle directory. One can be created by running mkdir rootfs podman export $(podman create busybox) | tar -C ./rootfs -xvf - runc spec -b . in a suitable directory. Then launch the program like this: KATA_CONF_FILE=/path/to/configuration-qemu.toml /path/to/shim-ctl Fixes: #5817 Signed-off-by: Pavel Mores <pmores@redhat.com>	2022-12-13 14:55:21 +01:00
Pavel Mores	eb8c9d38ff	runtime-rs: add launch of a simple qemu process to start_vm() The point here is just to get a simplest Kata VM running. Signed-off-by: Pavel Mores <pmores@redhat.com>	2022-12-13 14:54:26 +01:00
Pavel Mores	2f6d0d408b	runtime-rs: support qemu in VirtContainer Added registration of qemu config plugin and support for creating Qemu Hypervisor instance. Signed-off-by: Pavel Mores <pmores@redhat.com>	2022-12-13 14:54:26 +01:00
Pavel Mores	1413dfe91c	runtime-rs: add basic empty boilerplate for qemu driver This does almost literally nothing so far apart from getting and setting HypervisorConfig. It's mostly copied from/inspired by dragonball. Signed-off-by: Pavel Mores <pmores@redhat.com>	2022-12-13 14:53:45 +01:00
Bin Liu	3952fedcd0	Merge pull request #5882 from bergwolf/github/oci-namespaces runtime-rs: fix sandbox_pidns calculation and oci spec amending	2022-12-13 18:32:02 +08:00
Fabiano Fidêncio	f1381eb361	Merge pull request #4813 from ManaSugi/fix/add-selinux-agent runtime,agent: Add SELinux support for containers inside the guest	2022-12-13 11:24:53 +01:00
Yuan-Zhuo	bf8848f926	agent: Eliminate unnecessary metrics DEFAULT_REGISTRY pre-registers many metrics that we don't need or have duplicated. This PR uses a custom register for metrics without interference and ensures that the registration process is executed only once when the program is running. Fixes: #5255 Signed-off-by: Yuan-Zhuo <yuanzhuo0118@outlook.com>	2022-12-13 16:18:33 +08:00
Fupan Li	015674df16	Merge pull request #5873 from justxuewei/fix/umount2 kata-sys-util: fix issues where umount2 couldn't get the correct path	2022-12-13 15:52:32 +08:00
Bin Liu	03b6124fc6	Merge pull request #5848 from Yuan-Zhuo/drop-cgmr-option agent: Drop the Option for LinuxContainer.cgroup_manager	2022-12-13 12:09:39 +08:00
Alex	8dbfc3dc82	kata-ctl: Fixed format for check release options Fixed formatting for check release options Fixes: #5345 Signed-off-by: Alex <alee23@bu.edu>	2022-12-13 03:10:19 +00:00
Alex	f3091a9da4	kata-ctl: Add kata-ctl check release options This pull request adds kata-ctl check only-list-releases and include-all-releases Fixes: #5345 Signed-off-by: Alex <alee23@bu.edu>	2022-12-13 03:04:30 +00:00
Peng Tao	79cf38e6ea	runtime-rs: clear OCI spec namespace path None of the host namespace paths make sense in the guest. Let's clear them all before sending the spec to the agent. Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2022-12-12 11:07:14 +00:00
Peng Tao	62f4603e81	runtime-rs: reset rdma cgroup We don't support rdma cgroups yet. Let's make sure it is reset to empty. Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2022-12-12 09:57:24 +00:00
Peng Tao	5b6596f54e	runtime-rs: CreateContainerRequest has Default We can just use it to initialize the default fields. Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2022-12-12 09:57:24 +00:00
Peng Tao	e9e82ce28b	runtime-rs: fix is_pid_namespace_enabled check We should test is_pid_namespace_enabled before amending the container spec, where the pid namespace path is cleared and resulting sandbox_pidns to always being false. Fixes: #5881 Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2022-12-12 09:54:48 +00:00
Zhongtao Hu	afaf17f423	runtime-rs: enable container hugepage enable the functionality of using hugepages in container Fixes: #5560 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-12-12 17:49:31 +08:00
Xuewei Niu	8079a9732d	kata-sys-util: fix issues where umount2 couldn't get the correct path Strings in Rust don't have \0 at the end, but C does, which leads to `umount2` in the libc can't get the correct path. Besides, calling `nix::mount::umount2` to avoid using an unsafe block is a robust solution. Fixes: #5871 Signed-off-by: Xuewei Niu <niuxuewei.nxw@antgroup.com>	2022-12-12 11:50:32 +08:00
Yipeng Yin	4661ea8d3b	runtime-rs: fix standalone share fs Standalone share fs should add virtiofs device in setup_device_before_start_vm and return the storages to mount the directory in guest. And it uses hypervisor's jailer root directly instead of jail config. Besides, we tweaked the parameter, so it adapts to rust version virtiofsd now. And its cache policy which forbids caching is "never" now, instead of "none". Hence, we change the default cache mode. Fixes: #5655 Signed-off-by: Yipeng Yin <yinyipeng@bytedance.com>	2022-12-12 10:58:09 +08:00
Zhongtao Hu	fc4a67eec3	runtime-rs: enable vm hugepage support vm hugepage,set the hugetlbfs mount point as vm memory path Fixes:#5560 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-12-09 00:01:16 +08:00
Greg Kurz	5ef7ed72ae	Merge pull request #5610 from UiPath/fix-process-wait runtime: prevent waiting 50 ms minimum for a process exit	2022-12-08 11:02:39 +01:00
Peng Tao	0a1d1ec2fa	Merge pull request #5830 from openanolis/fix-high-cpu runtime-rs: fix high cpu	2022-12-08 12:16:06 +08:00
Steve Horsman	39394fa2a8	Merge pull request #5844 from jtumber-ibm/patch-1 agent: remove `sysinfo` dependency	2022-12-07 16:35:05 +00:00
Fupan Li	cce316b5e9	Merge pull request #5607 from justxuewei/feat/sandbox-level-volume runtime-rs: bind mount volumes in sandbox level	2022-12-07 19:23:38 +08:00
Yuan-Zhuo	7fdbbcda82	agent: Drop the Option for LinuxContainer.cgroup_manager Cgroup manager for a container will always be created. Thus, dropping the option for LinuxContainer.cgroup_manager is feasible and could simplify the code. Fixes: #5778 Signed-off-by: Yuan-Zhuo <yuanzhuo0118@outlook.com>	2022-12-07 13:40:38 +08:00
Alexandru Matei	d04d45ea05	runtime: use pidfd to wait for processes on Linux Use pidfd_open and poll on newer versions of Linux to wait for the process to exit. For older versions use existing wait logic Fixes: #5617 Signed-off-by: Alexandru Matei <alexandru.matei@uipath.com>	2022-12-06 16:31:05 +02:00
Alexandru Matei	e9ba0c11d0	runtime: use exponential backoff for process wait Initial wait period between checks is 1ms, and the next ones are min(wait_period*5, 50ms) Signed-off-by: Alexandru Matei <alexandru.matei@uipath.com>	2022-12-06 16:30:58 +02:00
James Tumber	748f22e7d0	agent: remove sysinfo dependency Removes the redundant dependency `sysinfo`. Fixes: #5843 Signed-off-by: James Tumber <james.tumber@ibm.com>	2022-12-06 10:18:53 +00:00
Quanwei Zhou	0019d653d6	runtime-rs: fix high cpu Fixed the issue when using nonblocking, the `tokio::io::copy()` needing to handle EAGAIN, resulting in high CPU usage. Fixes: #5740 Signed-off-by: Quanwei Zhou <quanweiZhou@linux.alibaba.com>	2022-12-06 14:25:33 +08:00
Chao Wu	326d589ff5	Merge pull request #5822 from liubin/fix/5820-var-name-and-typo runtime-rs: fix some variable names and typos	2022-12-06 14:24:11 +08:00
Zhongtao Hu	c12bb5008d	Merge pull request #5769 from jongwu/check_host_arm kata-ctl: add host check for aarch64	2022-12-06 14:05:52 +08:00
Chao Wu	538bddf4ee	Merge pull request #5811 from tzY15368/fix-katactl-conflict-dependency kata-ctl: fix dependency version conflict	2022-12-06 10:44:48 +08:00
Alexandru Matei	71491a69c3	runtime: move process wait logic to another function extract process wait logic to another function Signed-off-by: Alexandru Matei <alexandru.matei@uipath.com>	2022-12-05 13:32:04 +02:00
Alexandru Matei	92ebe61fea	runtime: reap force killed processes reap child processes after sending SIGKILL Fixes #5739 Signed-off-by: Alexandru Matei <alexandru.matei@uipath.com>	2022-12-05 13:31:58 +02:00
Xuewei Niu	fdf0a7bb14	runtime-rs: fix the issues mentioned in the code review Removed the `Debug` trait for the `ShareFs` and etc. Renamed `ShareFsMount::upgrade()` and `ShareFsMount::downgrade()` to `upgrade_to_rw()` and `downgrade_to_ro()`. Protected `mounted_info_set` with a mutex to avoid race conditions. Fixes: #5588 Signed-off-by: Xuewei Niu <justxuewei@apache.org>	2022-12-05 11:18:26 +08:00
Xuewei Niu	1d823c4f65	runtime-rs: umount and permission controls in sandbox level This commit implemented umount controls and permission controls. When a volume is no longer referenced, it will be umounted immediately. When a volume is mounted with readonly permission and a newly arriving container needs readwrite permission, the volume should be upgraded to readwrite permission. Conversely, if a volume has readwrite permission and no container needs readwrite, then the volume should be downgraded. Fixes: #5588 Signed-off-by: Xuewei Niu <justxuewei@apache.org>	2022-12-05 10:58:13 +08:00
Xuewei Niu	527b871414	runtime-rs: bind mount volumes in sandbox level Implemented bind mount related management on the sandbox side, involving bind mounting a volume if it's not mounted before, and upgrading permission to readwrite if a new container needs it. Fixes: #5588 Signed-off-by: Xuewei Niu <justxuewei@apache.org>	2022-12-05 10:58:13 +08:00
Bin Liu	9ccf2ebe8a	agent: add signal value to log For signal_process call, log the signal value in logs. Signed-off-by: Bin Liu <bin@hyper.sh>	2022-12-02 14:53:58 +08:00
Bin Liu	fb2c142f18	runtime-rs: fix some variable names and typos Fix some not perfect variable names, and some typos in logs. Fixes: #5820 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-12-02 14:52:34 +08:00
Bin Liu	514b7778a2	Merge pull request #5807 from liubin/fix/5806-add-shim-lanuage runtime: Add identification in version for runtime-rs	2022-12-02 11:36:55 +08:00
Tingzhou Yuan	737420469a	kata-ctl: fix dependency version conflict Also added crate `runtime-rs/crates/runtimes` as dependency as it's immediately depended upon by the `direct-volume` feature, see issue 5341 and PR 5467. Fixes #5810 Signed-off-by: Tingzhou Yuan <tzyuan15@bu.edu>	2022-12-01 17:53:21 +00:00
Bin Liu	d4321ab489	runtime: Add identification in version for runtime-rs Now we are supporting two runtime/shim, the go version, and the rust version, for debug purposes, we can add an identification in the version info to tell us which runtime/shim is used. Fixes: #5806 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-12-01 15:14:08 +08:00
Bin Liu	7fabfb2cf0	Merge pull request #5756 from chentt10/remove-version-number-from-commit-message runtime-rs: remove the version number from the commit display message	2022-12-01 13:11:47 +08:00
Fabiano Fidêncio	212325a9db	Merge pull request #5649 from ManaSugi/runk/refactor-start-using-agent-code runk: Re-implement start operation using the agent codes	2022-11-29 20:45:16 +01:00
Manabu Sugimoto	c617bbe70d	runtime: Pass SELinux policy for containers to the agent Pass SELinux policy for containers to the agent if `disable_guest_selinux` is set to `false` in the runtime configuration. The `container_t` type is applied to the container process inside the guest by default. Users can also set a custom SELinux policy to the container process using `guest_selinux_label` in the runtime configuration. This will be an alternative configuration of Kubernetes' security context for SELinux because users cannot specify the policy in Kata through Kubernetes's security context. To apply SELinux policy to the container, the guest rootfs must be CentOS that is created and built with `SELINUX=yes`. Fixes: #4812 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2022-11-29 19:07:56 +09:00
Manabu Sugimoto	9354769286	agent: Add SELinux support for containers The kata-agent supports SELinux for containers inside the guest to comply with the OCI runtime specification. Fixes: #4812 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2022-11-29 19:07:56 +09:00
Bin Liu	588f81a23c	Merge pull request #5612 from openanolis/fix-iptables fix(agent): fix iptables binary path in guest	2022-11-29 16:57:06 +08:00
Bin Liu	1da2d0603c	Merge pull request #5761 from gaohuatao-1/ght_overhead runtime-rs: moving only vCPU threads into sandbox controller	2022-11-29 13:53:01 +08:00
GabyCT	013752667b	Merge pull request #5776 from liubin/tmp/debug-static-check ci: let static checks don't depend on build	2022-11-28 07:51:42 -06:00
Bin Liu	6af037d379	Merge pull request #5154 from Yuan-Zhuo/main agent: support systemd cgroup for kata agent.	2022-11-28 18:40:10 +08:00
Manabu Sugimoto	e12db92e4d	runk: Re-implement start operation using the agent codes This commit re-implements `start` operation by leveraging the agent codes. Currently, `runk` has own `start` mechanism even if the agent already has the feature to handle starting a container. This worsen the maintainability and `runk` cannot keep up with the changes on the agent side easily. Hence, `runk` replaces own implementations with agent's ones. Fixes: #5648 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2022-11-28 19:11:21 +09:00
Bin Liu	e723bad0af	ci: let static checks don't depend on build Build is a time consumable operation, skip build while let ci run faster. Fixes: #5777 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-11-28 15:26:04 +08:00
Bin Liu	a55eb78c32	Merge pull request #5752 from liubin/fix/5750-go-fix-1.19 runtime: go fix code for 1.19	2022-11-26 02:09:02 +08:00
Bin Liu	57c80ad65c	Merge pull request #5758 from chentt10/update-runtime-rs-build-and-install doc: update runtime-rs "Build and Install"	2022-11-26 02:08:48 +08:00
Jianyong Wu	a5e4cad4b6	kata-ctl: add host check for aarch64 For now, we can check if host support running kata by check if "/dev/kvm" exist on aarch64. Fixes: #5768 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2022-11-25 18:55:32 +08:00
gaohuatao	2edbe389d8	runtime-rs: moving only vCPU threads into sandbox controller when overhead controller exists, just constrain vCPU threads in sandbox controller Fixes:#5760 Signed-off-by: gaohuatao <gaohuatao@bytedance.com>	2022-11-25 17:53:21 +08:00
Peng Tao	e32c023d96	Merge pull request #5714 from UiPath/fix-mkdir runtime: don't fail mkdir if the folder is already created by another process	2022-11-25 17:52:56 +08:00
Chen Taotao	2426ea9bdc	doc: update runtime-rs "Build and Install" When using source code to compile runtime-rs,make the documentation point out the detailed environment build and compilation methods to avoid errors caused by related dependent packages. Fixes:#5757 Signed-off-by: Chen Taotao <chentt10@chinatelecom.cn>	2022-11-25 13:13:00 +08:00
Chen Taotao	67fe703ff5	runtime-rs: remove the version number from the commit display message The displayed commit message and version message are partially duplicated. Remove the version number from the commit display message. Fixes:#5735 Signed-off-by: Chen Taotao <chentt10@chinatelecom.cn>	2022-11-25 13:00:01 +08:00
Ji-Xinyou	1d93a93468	fix(agent): fix iptables binary path in guest Some rootfs put iptables-save and iptables-restore under /usr/sbin instead of /sbin. This pr checks both and returns the one exist. Fixes: #5608 Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2022-11-25 11:57:34 +08:00
Bin Liu	1dfd845f51	runtime: go fix code for 1.19 We have starting to use golang 1.19, some features are not supported later, so run `go fix` to fix them. Fixes: #5750 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-11-25 11:29:18 +08:00
Zhongtao Hu	f02bb1a9cb	Merge pull request #5729 from openanolis/netnsref runtime-rs: block on the current thread when setup the network to avoid be take over by other task	2022-11-25 08:09:10 +08:00
Alexandru Matei	4b45e13869	runtime: don't fail mkdir if the folder is already created Use MkdirAll instead of Mkdir so it doesn't generate an error when the folder is created by another process Fixes #5713 Signed-off-by: Alexandru Matei <alexandru.matei@uipath.com>	2022-11-24 11:20:56 +02:00
Chao Wu	9bde32daa1	Merge pull request #5707 from openanolis/ref Refactor(runtime-rs): add conditional compile for virt-sandbox persist	2022-11-24 15:24:06 +08:00
Zhongtao Hu	b987bbc576	runtime-rs: block on the current thread when setup the network As the number of I/O intensive tasks increases, two issues could be caused: 1. When the future is blocked, the current thread (which is in the network namespace) might be taken over by other tasks. After the future is finished, the thread that takes over the current task might not be in the pod network namespace. 2. When the network setup finishes, the current thread will be set back to the host namespace. But the task which was taken over would still stay in the pod network namespace. To avoid that, we need to block the future on the current thread. Fixes:#5728 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-11-24 13:48:05 +08:00
Bin Liu	06a604b753	Merge pull request #5720 from YchauWang/wyc-docs-test-22 runtime: add log record to the qemu config method `appendDevices` for…	2022-11-24 13:15:06 +08:00
Peng Tao	b4d0a39f6d	Merge pull request #5723 from fidencio/topic/runtime-bump-containerd-to-v1.6.8 runtime: Use containerd v1.6.8	2022-11-24 11:28:58 +08:00
Fabiano Fidêncio	5cbf879659	Merge pull request #5693 from jongwu/test_ip_table agent: check if command exist before do ip_tables test	2022-11-23 08:15:08 +01:00
wangyongchao.bj	30a7ebf430	runtime: Log invalid devices in QEMU config When the user tried to add new devices to the VM, there is no error info for the invalid device. This PR adds a log record to the `appendDevices` for the invalid device of the qemu config. Fixes: #5719 Signed-off-by: wangyongchao.bj <wangyongchao.bj@inspur.com>	2022-11-23 09:09:45 +08:00
Fabiano Fidêncio	df3d9878d5	Merge pull request #5695 from darfux/virtiofs-queue-size runtime: Support virtiofs queue size for qemu and make it configurable	2022-11-22 20:04:30 +01:00
Fabiano Fidêncio	2539f31862	runtime: Use containerd v1.6.8 Let's follow the binary bump used in the CI and also bump the vendored version of containerd to v1.6.8. Fixes: #5722 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-11-22 18:28:30 +01:00
Chao Wu	8b04ba95cb	Merge pull request #5691 from yipengyin/support-vhost-vsock runtime-rs: support vhost-vsock	2022-11-22 14:59:55 +08:00
Yipeng Yin	d808adef95	runtime-rs: support vhost-vsock Rename old VsockConfig to HybridVsockConfig. And add VsockConfig to support vhost-vsock. We follow kata's old way to try random vhost fd for 50 times to generate unique fd. Fixes: #5654 Signed-off-by: Yipeng Yin <yinyipeng@bytedance.com>	2022-11-22 10:03:52 +08:00
Zhongtao Hu	6b2ef66f0f	runtime-rs: add conditional compile for virt-sandbox persist code refactoring, add conditional compile for virt-sandbox persist Fixes: #5706 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-11-21 19:51:43 +08:00
Jianyong Wu	b53171b605	agent: check command before do test_ip_tables test_ip_tables test depends on iptables tools. But we can't ensure these tools are exist. it's better to skip the test if there is no such tools. Fixes: #5697 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2022-11-21 14:56:51 +08:00
Bin Liu	7c8d474959	Merge pull request #5689 from kata-containers/kata-ctl-util utils: Add utility function to fetch the kernel version.	2022-11-21 14:44:05 +08:00
Peng Tao	a636d426d9	versions: update nydusd version To the latest stable v2.1.1. Depends-on: github.com/kata-containers/tests#5246 Fixes: #5635 Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2022-11-19 16:33:29 +00:00
liyuxuan.darfux	3bb145c63a	runtime: Support virtiofs queue size for qemu and make it configurable The default vhost-user-fs queue-size of qemu is 128 now. Set it to 1024 by default which is same as clh. Also make this value configurable. Fixes: #5694 Signed-off-by: liyuxuan.darfux <liyuxuan.darfux@bytedance.com>	2022-11-19 15:38:11 +08:00
Archana Shinde	e80a9f09fa	utils: Add utility function to fetch the kernel version. Add functionality to get kernel version and related unit tests. This is intended to be used in the kata-env command going forward. Fixes: #5688 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2022-11-18 15:39:57 -08:00
Bin Liu	7506237420	Merge pull request #5144 from openanolis/nydus-dev runtime-rs: support nydus v5 and v6 rootfs	2022-11-18 14:05:04 +08:00
Bo Chen	36545aa81a	runtime: clh: Re-generate the client code This patch re-generates the client code for Cloud Hypervisor v28.0. Note: The client code of cloud-hypervisor's OpenAPI is automatically generated by openapi-generator. Fixes: #5683 Signed-off-by: Bo Chen <chen.bo@intel.com>	2022-11-17 09:45:27 -08:00
Fabiano Fidêncio	2f5f575a43	log-parser: Simplify check ``` 14:13:15 parse.go:306:5: S1009: should omit nil check; len() for github.com/kata-containers/kata-containers/src/tools/log-parser.kvPairs is defined as zero (gosimple) 14:13:15 if pairs == nil || len(pairs) == 0 { 14:13:15 ^ ``` Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-11-17 14:17:29 +01:00
Fabiano Fidêncio	d94718fb30	runtime: Fix gofmt issues It seems that bumping the version of golang and golangci-lint new format changes are required. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-11-17 14:16:12 +01:00
Fabiano Fidêncio	16b8375095	golang: Stop using io/ioutils The package has been deprecated as part of 1.16 and the same functionality is now provided by either the io or the os package. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-11-17 13:43:25 +01:00
Peng Tao	eab8d6be13	build: update golang version to 1.19.2 So that we get the latest language fixes. There is little use to maintain compiler backward compatibility. Let's just set the default golang version to the latest 1.19.2. Fixes: #5494 Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2022-11-16 19:02:39 +01:00
Chao Wu	e80dbc15d8	runtime-rs: workaround Dragonball compilation problem The upstream rust-vmm is currently changing its dependency style towards caret requirements (more information: rust-vmm/vm-memory#199), which frequently breaks Dragonball compilation. rust-vmm is expected to finish the changes this week. In order not to break Kata CI due to Dragonball's compilation errors, we add a Cargo.lock file to /src/dragonball for now and will remove it later when rust-vmm is stable. fixes: #5657 Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-11-16 12:44:41 +01:00
Ji-Xinyou	c3f1922df6	fix(fmt): fix cargo fmt to pass static check Fix cargo fmt Fixes: #5639 Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2022-11-16 12:44:38 +01:00
Greg Kurz	1bbcb413c9	Merge pull request #5597 from UiPath/fix-clh-wait clh: avoid race condition when stopping clh	2022-11-16 07:39:27 +01:00
Zhongtao Hu	7d91150185	Merge pull request #5536 from chentt10/fix-name-shim-source-ambiguous runtime-rs : fix the shim source in the documentation test is ambiguous	2022-11-11 14:07:05 +08:00
Zhongtao Hu	c46814b26a	runtime-rs:support nydus v5 and v6 add nydus v5 and v6 support for container rootfs Fixes:#5142 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-11-11 10:15:35 +08:00
Alexandru Matei	a04afab74d	qemu: early exit from Check if the process was stopped Fixes: #5625 Signed-off-by: Alexandru Matei <alexandru.matei@uipath.com>	2022-11-10 22:43:32 +02:00
Alexandru Matei	7e481f2179	qemu: set stopped only if StopVM is successful Fixes: #5624 Signed-off-by: Alexandru Matei <alexandru.matei@uipath.com>	2022-11-10 22:43:32 +02:00
Alexandru Matei	0e3ac66e76	clh: return faster with dead clh process from isClhRunning Through proactively checking if Cloud Hypervisor process is dead, this patch provides a faster path for isClhRunning Fixes: #5623 Signed-off-by: Alexandru Matei <alexandru.matei@uipath.com>	2022-11-10 22:43:32 +02:00
Alexandru Matei	9ef68e0c7a	clh: fast exit from isClhRunning if the process was stopped Use atomic operations instead of acquiring a mutex in isClhRunning. This stops isClhRunning from generating a deadlock by trying to reacquire an already-acquired lock when called via StopVM->terminate. Signed-off-by: Alexandru Matei <alexandru.matei@uipath.com>	2022-11-10 22:43:32 +02:00
Alexandru Matei	2631b08ff1	clh: don't try to stop clh multiple times Avoid executing StopVM concurrently when virtiofs dies as a result of clh being stopped in StopVM. Fixes: #5622 Signed-off-by: Alexandru Matei <alexandru.matei@uipath.com>	2022-11-10 22:43:32 +02:00
Chao Wu	f45fe4f90d	versions: update vmm-sys-util and related crates to v0.11.0 Since the upstream vmm-sys-util was upgraded to 0.11.0, some crates automatically upgrade to v0.11.0 and some stay at v0.10.0 (depending on how they write the version dependency in `Cargo.toml`), which causes a compile error in runtime-rs. In order to fix this problem, we need to upgrade all vmm-sys-util dependencies in runtime-rs to v0.11.0. fixes: #5636 Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-11-10 19:13:23 +08:00
quanweiZhou	bbc93260c9	Merge pull request #5615 from openanolis/chao/delete_cargo_patch runtime-rs: delete all cargo patches	2022-11-10 10:18:19 +08:00
Zhongtao Hu	071ac4693a	Merge pull request #5613 from openanolis/iptables feat(shim-mgmt): iptables handler	2022-11-09 17:21:45 +08:00
Ji-Xinyou	f8f97c1e22	feat(shim-mgmt): iptables handler Support the handlers in runtime, which are used by kata-ctl iptables series of commands in runtime. Fixes: #5370 Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2022-11-09 10:39:50 +08:00
Chao Wu	29c75cf12b	runtime-rs: delete all cargo patches The cargo patches in the cargo.toml seem to make the whole runtime-rs build take longer and also make it harder to build runtime-rs in an environment without network access. We should delete all patches from the cargo.toml file and publish all the crates that were once patched. fixes: #5614 #5527 #5526 #5449 Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-11-09 10:02:58 +08:00
Chao Wu	f5f25d9379	Merge pull request #5431 from wllenyj/dragonball-ut-3 Built-in Sandbox: add more unit tests for dragonball. Part 3	2022-11-08 15:48:16 +08:00
Zhongtao Hu	351bdbfacd	Merge pull request #5567 from openanolis/chao/fix_mem_file_path_error Dragonball: enable mem_file_path config into hugetlbfs process	2022-11-08 09:00:13 +08:00
wllenyj	57336835da	dragonball: add more unit test for device manager Added more unit tests for device manager. Fixes: #4899 Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-11-08 00:45:17 +08:00
wllenyj	2333700237	dragonball: add test utils. Added some tools for dragonball unit testing. Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-11-08 00:45:17 +08:00
Bin Liu	bfe9157abc	Merge pull request #5570 from openanolis/capability runtime-rs:add hypervisor interface capabilities	2022-11-07 23:04:55 +08:00
Chao Wu	2adb1c1823	Dragonball: enable mem_file_path config into hugetlbfs process In the current Dragonball code, mem_file_path config is not used when hugetlbfs is enabled. In this commit we add mem_file_path into hugetlbfs enable process. fixes: #5566 Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-11-07 16:07:57 +08:00
Fabiano Fidêncio	7250be3601	Merge pull request #5584 from fengyehong/clh-thread cloud-hypervisor: Fix GetThreadIDs function	2022-11-07 08:22:40 +01:00
Bin Liu	824ea83c3c	Merge pull request #5573 from pmores/fill-in-virtiofsd-standalone-impl runtime-rs: blanks filled & fixes made to virtiofsd launch	2022-11-07 14:19:45 +08:00
Bin Liu	83d052f82b	Merge pull request #4476 from LitFlwr0/vcpu-pinning-frq vCPUs pinning support for Kata Containers	2022-11-07 10:37:22 +08:00
Guanglu Guo	daeee26a1e	cloud-hypervisor: Fix GetThreadIDs function Get vcpu thread-ids by reading cloud-hypervisor process tasks information. Fixes: #5568 Signed-off-by: Guanglu Guo <guoguanglu@qiyi.com>	2022-11-05 17:23:19 +08:00
Bin Liu	427b01e298	Merge pull request #5548 from justxuewei/fix/share-fs-permission runtime-rs: fix shared volume permission issue	2022-11-04 21:21:50 +08:00
LitFlwr0	2508d39b7c	runtime: added vcpus pinning logics Core VCPU threads pinning logics for issue 4476. Also provided docs. Fixes:#4476 Signed-off-by: LitFlwr0 <861690705@qq.com>	2022-11-04 17:52:42 +08:00
Zhongtao Hu	fef8e92af1	runtime-rs:add hypervisor interface capabilities 1. be able to check whether the hypervisor supports block devices, block device hotplug, multi-queue, and file sharing 2. be able to set the hypervisor capabilities for block devices, block device hotplug, multi-queue, and file sharing Fixes: #5569 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-11-04 09:24:36 +08:00
Bin Liu	b0c7bcce7c	Merge pull request #5556 from ManaSugi/runk/fix-kill-behavior runk: Ignore an error when calling kill cmd with --all option	2022-11-04 08:42:27 +08:00
Pavel Mores	27b1913584	runtime-rs: blanks filled & fixes made to virtiofsd launch The 'config' argument to ShareVirtioFsStandalone::new() is now actually used, taking care of an explicit TODO. If a shared path doesn't exist in ShareVirtioFsStandalone::virtiofsd_args() it is now created instead of returning an error, thus following ShareVirtioFsInline's suit. The '-o vhost_user_socket=...' command line argument doesn't seem to be supported by newer versions of virtiofsd so we replace it with '--socket-path' which should be functionally equivalent according to docs. Fixes #5572 Signed-off-by: Pavel Mores <pmores@redhat.com>	2022-11-03 08:38:59 +01:00
Manabu Sugimoto	df092185ee	runk: Upgrade libseccomp crate to v0.3.0 in Cargo.lock The libseccomp crate was upgraded to v0.3.0 by `4696ead`, but `Cargo.lock` of runk wasn't updated by mistake. So, this commit updates `Cargo.lock` of runk to the latest dependencies. Fixes: #5487 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2022-11-01 20:26:33 +09:00
Manabu Sugimoto	16dca4ecd4	runk: Ignore an error when calling kill cmd with --all option Ignore the error that is triggered when the kill command is called with the `--all` option on a stopped container. High-level container runtimes such as containerd call the kill command with the `--all` option in order to terminate all processes inside the container even if the container is already stopped. Hence, a low-level runtime should allow `kill --all` regardless of the container state, like runc. This commit reverts to the previous behavior. Fixes: #5555 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2022-11-01 20:24:29 +09:00
Xuewei Niu	b74c18024a	runtime-rs: fix shared volume permission issue Fix the issue where share volumes always have readwrite permission even if readonly permission is enough. Fixes: #5549 Signed-off-by: Xuewei Niu <justxuewei@apache.org>	2022-11-01 18:42:19 +08:00
Chen TaoTao	936fe35acb	runtime-rs : fix shim source is ambiguous In the documentation test, the name shim has multiple potential sources of import, now give it a clear source. Fixes: #5535 Signed-off-by: Chen TaoTao <chentt10@chinatelecom.cn>	2022-10-31 19:54:22 -07:00
snir911	288e337a6f	Merge pull request #5434 from Rouzip/remove-doNetNS add EnterNetNS in virtcontainers	2022-10-30 11:19:07 +02:00
David Esparza	37f0cd1c8f	Merge pull request #5436 from amshinde/kata-ctl-drop-privs Kata ctl drop privs	2022-10-26 11:37:27 -05:00
Archana Shinde	c0f5bc81b7	cargo: Add Cargo.lock to version control Add Cargo.lock to capture state of build. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2022-10-25 20:34:40 -07:00
Archana Shinde	474927ec90	gitignore: Add gitignore file Ignore autogenerated version.rs Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2022-10-25 20:34:40 -07:00
Archana Shinde	699f821e12	utils: Add function to drop privileges This function is meant to be used before operations such as accessing the network to make sure those operations are not performed as a privileged user. Fixes: #5331 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2022-10-25 20:34:40 -07:00
Peng Tao	b015f34aff	runtime-rs: generate config files with the default target Right now it is not generated with a simple `make`. Fixes: #5509 Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2022-10-26 10:25:29 +08:00
Yuan-Zhuo	d7bb4b5512	agent: support systemd cgroup for kata agent 1. Implemented a rust module for operating cgroups through systemd with the help of zbus (src/agent/rustjail/src/cgroups/systemd). 2. Add support for optional cgroup configuration through fs and systemd at agent (src/agent/rustjail/src/container.rs). 3. Described the usage and supported properties of the agent systemd cgroup (docs/design/agent-systemd-cgroup.md). Fixes: #4336 Signed-off-by: Yuan-Zhuo <yuanzhuo0118@outlook.com>	2022-10-25 13:57:09 +08:00
Bo Chen	a151d8ee50	Merge pull request #5493 from fidencio/topic/update-clh versions: Update Cloud Hypervisor to b4e39427080	2022-10-24 07:54:02 -07:00
Bin Liu	4696eadfeb	Merge pull request #5488 from ManaSugi/fix/update-libseccomp-crate rustjail: Upgrade libseccomp crate to v0.3.0	2022-10-24 17:03:30 +08:00
Bin Liu	badb2600b3	Merge pull request #5474 from openanolis/makefile makefile: remove sudo when create symbolic link	2022-10-24 17:03:20 +08:00
Bin Liu	ab5f97759d	Merge pull request #5497 from Rouzip/remove-redundant agent: remove redundant checks	2022-10-24 16:41:49 +08:00
Fabiano Fidêncio	190e623c40	Merge pull request #5317 from Champ-Goblem/fix-containerd-stats shim: Ensure pagesize is set when reporting hugetlb stats	2022-10-24 10:24:49 +02:00
Fabiano Fidêncio	7248cf51c5	Merge pull request #5447 from hbrueckner/fix-5438 kata-ctl: Re-enable network tests on s390x (fixes 5438)	2022-10-24 10:23:35 +02:00
James O. D. Hunt	65ef2a0a0b	Merge pull request #5089 from liubin/fix/4895-ignore-exit-error agent: use NLM_F_REPLACE replace NLM_F_EXCL in rtnetlink	2022-10-24 08:46:54 +01:00
snir911	ee189d2ebe	Merge pull request #5455 from kata-containers/main-validate-hp-size agent: validate hugepage size is supported	2022-10-23 08:15:05 +03:00
Rouzip	44d8de8923	agent: remove redundant checks Remove redundant checks for executable files. Fixes: #3730 Signed-off-by: Rouzip <1226015390@qq.com>	2022-10-22 23:31:18 +08:00
Fabiano Fidêncio	9d286af7b4	versions: Update Cloud Hypervisor to b4e39427080 An API change, done a long time ago, has been exposed on Cloud Hypervisor and we should update it on the Kata Containers side to ensure it doesn't affect Cloud Hypervisor CI and because the change is needed for an upcoming work to get QAT working with Cloud Hypervisor. Fixes: #5492 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-10-21 20:52:54 +02:00
Bin Liu	081ee48713	agent: use NLM_F_REPLACE replace NLM_F_EXCL in rtnetlink Sometimes we hit an EEXIST error when adding an ARP neighbour. Using NLM_F_REPLACE instead of NLM_F_EXCL avoids the failure if the entry already exists. See https://man7.org/linux/man-pages/man7/netlink.7.html Fixes: #4895 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-10-21 21:19:14 +08:00
Hendrik Brueckner	e95089b716	kata-ctl: add basic cpu check for s390x Add a basic s390x cpu check for the "sie" feature to be present. Also re-enable cpu check testing. Fixes: #5438 Signed-off-by: Hendrik Brueckner <brueckner@linux.ibm.com>	2022-10-21 12:04:28 +00:00
Hendrik Brueckner	871d2cf2c0	kata-ctl: Limit running tests to x86 and use native-tls on s390x For s390x, use native-tls for reqwest because the rustls-tls/ring dependency is not available for s390x. Also exclude s390x, powerpc64le, and aarch64 from running the cpu check due to the lack of the arch-specific implementation. In this case, rust complains about unused functions in src/check.rs (both normal and test context). Fixes: #5438 Co-authored-by: James O. D. Hunt <james.o.hunt@intel.com> Signed-off-by: Hendrik Brueckner <brueckner@linux.ibm.com>	2022-10-21 11:54:26 +00:00
Manabu Sugimoto	cbd84c3f5a	rustjail: Upgrade libseccomp crate to v0.3.0 The libseccomp crate v0.3.0 has been released, so use it in the agent. Fixes: #5487 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2022-10-21 15:40:05 +09:00
Bin Liu	1bf64c9a11	Merge pull request #5453 from openanolis/chao/fix_comment_typo Makefile: fix a typo in runtime-rs makefile	2022-10-21 14:36:39 +08:00
Zhongtao Hu	748be0fe3d	makefile: remove sudo when create symbolic link When using mock to package an rpm, we cannot have sudo permission. Fixes: #5473 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-10-20 22:13:21 +08:00
Bin Liu	cd27ad144e	Merge pull request #5219 from openanolis/krt-modify Modify agent-url return value in runtime-rs	2022-10-20 11:17:29 +08:00
Bin Liu	faf363db75	Merge pull request #5414 from openanolis/chao/regulate_runtime_rs_makefile_comments runtime-rs: regulate the comment in runtime-rs makefile	2022-10-19 15:36:00 +08:00
Snir Sheriber	72738dc11f	agent: validate hugepage size is supported before setting a limit, otherwise paths may not be found. guest supporting different hugepage size is more likely with peer-pods where podvm may use different flavor. Fixes: #5191 Signed-off-by: Snir Sheriber <ssheribe@redhat.com>	2022-10-19 09:55:33 +03:00
Chao Wu	f74e328fff	Makefile: fix a typo in runtime-rs makefile There is a typo in the runtime-rs makefile: _dragonball should be _DB. fixes: #5452 Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-10-19 14:12:48 +08:00
Chao Wu	f205472b01	Makefile: regulate the comment style for the runtime-rs comments In runtime-rs makefile, we use ``` ``` to let make help print out help information for variables and targets, but later commits forgot this rule. So we need to follow the previous rule and change the current comments. fixes: #5413 Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-10-19 12:12:50 +08:00
Hendrik Brueckner	9f2c7e47c9	Revert "kata-ctl: Disable network check on s390x" This reverts commit `00981b3c0a`. Signed-off-by: Hendrik Brueckner <brueckner@linux.ibm.com>	2022-10-18 11:12:18 +00:00
James O. D. Hunt	00981b3c0a	kata-ctl: Disable network check on s390x s390x apparently does not support rust-tls, which is required by the network check (due to the `reqwest` crate dependency). Disable the network check on s390x until we can find a solution to the problem. Note: This fix is assumed to be a temporary one until we find a solution. Hence, I have not moved the network check code (which should be entirely generic) into an architecture specific module. Fixes: #5435. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2022-10-17 10:24:06 +01:00
Rouzip	39363ffbfb	runtime: remove same function Add EnterNetNS in virtcontainers to remove the duplicated function. Fixes #5394 Signed-off-by: Rouzip <1226015390@qq.com>	2022-10-17 10:59:13 +08:00
James O. D. Hunt	c322d1d12a	kata-ctl: arch: Improve check call Rework the architecture-specific `check()` call by moving all the conditional logic out of the function. Fixes: #5402. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2022-10-15 11:41:53 +01:00
Zhongtao Hu	5d17cbeef7	Merge pull request #5383 from openanolis/chao/update_comments_in_event_manager Dragonball: remove redundant comments in event manager	2022-10-14 15:50:37 +08:00
Bin Liu	b23a24ab2f	Merge pull request #5417 from liubin/fix/typo-get_contaier_type runtime-rs: fix typo get_contaier_type to get_container_type	2022-10-13 22:35:23 +08:00
Bin Liu	c7b38532f0	Merge pull request #5412 from tzY15368/improve-cmd-descriptions kata-ctl: improve command descriptions for consistency	2022-10-13 19:17:42 +08:00
Bin Liu	4d9dd8790d	runtime-rs: fix typo get_contaier_type to get_container_type Change get_contaier_type to get_container_type Fixes: #5415 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-10-13 17:12:43 +08:00
Bin Liu	2de29b6f69	Merge pull request #5088 from liubin/fix/5087-force-shutdown-shim runtime-rs: force shutdown shim process if it can't exit	2022-10-13 16:55:05 +08:00
Tingzhou Yuan	70676d4a99	kata-ctl: improve command descriptions for consistency This change improves the command descriptions for kata-ctl and can avoid certain confusions in command functionality. Fixes #5411 Signed-off-by: Tingzhou Yuan <tzyuan15@bu.edu>	2022-10-13 04:10:23 +00:00
Bin Liu	3b70c72436	Merge pull request #5395 from wllenyj/dragonball-s390 ci: skip s390x for dragonball.	2022-10-13 09:03:08 +08:00
Bin Liu	157d3cdcb1	Merge pull request #5397 from openanolis/chao/delete_redundant_dragonball_comment Dragonball: delete redundant comments in blk_dev_mgr	2022-10-13 09:01:59 +08:00
James O. D. Hunt	d3ee8d9f1b	Merge pull request #5388 from jodh-intel/kata-ctl kata-ctl: Move development to main branch	2022-10-12 14:29:35 +01:00
James O. D. Hunt	00a42f69c0	kata-ctl: cargo: 2021 -> 2018 Revert to the 2018 edition of rust for consistency with other rust components. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2022-10-12 11:46:51 +01:00
James O. D. Hunt	fb63274747	kata-ctl: rustfmt + clippy fixes Make this file conform to the standard rust layout conventions and simplify the code as recommended by `clippy`. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2022-10-12 11:46:48 +01:00
wllenyj	1f1901e059	dragonball: fix clippy warning for aarch64 Added aarch64 check. Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-10-12 18:29:00 +08:00
wllenyj	a343c570e4	dragonball: enhance dragonball ci Unified use of Makefile instead of calling `cargo test` directly. Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-10-12 17:53:01 +08:00
wllenyj	6a64fb0eb3	ci: skip s390x for dragonball. Currently, Dragonball only supports x86_64 and aarch64 platforms. Fixes: #4381 Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-10-12 15:27:45 +08:00
Bin Liu	7aacba0abc	Merge pull request #5282 from liubin/fix/4730-rs-emptydir runtime-rs: support ephemeral storage for emptydir	2022-10-12 09:53:59 +08:00
Chao Wu	a743e37daf	Dragonball: delete redundant comments in blk_dev_mgr Delete the redundant derive part for BlockDeviceMgr. fixes: #5396 Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-10-11 19:41:47 +08:00
James O. D. Hunt	f7010b8061	kata-ctl: docs: Write basic documentation Provide a basic document explaining a little about the `kata-ctl` command. Fixes: #5351. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2022-10-11 10:04:48 +01:00
Bin Liu	ffdd7e1ad8	Merge pull request #4961 from wllenyj/dragonball-ut-2 Built-in Sandbox: add more unit tests for dragonball	2022-10-11 14:12:25 +08:00
Bin Liu	39702c19d5	Merge pull request #5276 from bergwolf/github/readme readme: remove libraries mentioning	2022-10-11 13:19:18 +08:00
wllenyj	26c043dee7	ci: Add dragonball test Enhanced Static-Check of CI to support nested virtualization. Fixes: #5378 Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-10-11 00:36:20 +08:00
James O. D. Hunt	15c343cbf2	kata-ctl: Don't rely on system ssl libs Build using the rust TLS implementation rather than the system ones. This resolves the `reqwest` crate build failure: it doesn't appear to build against the native libssl libraries due to Kata defaulting to using the musl libc. Fixes: #5387. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2022-10-10 13:42:51 +01:00
James O. D. Hunt	c23584994a	kata-ctl: clippy: Resolve warnings and reformat Resolved a couple of clippy warnings and applied standard `rustfmt`. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2022-10-10 13:42:51 +01:00
David Esparza	133690434c	kata-ctl: implement CLI argument --check-version-only This kata-ctl argument returns the latest stable Kata release by hitting github.com. Adds check-version unit tests. Fixes: #11 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2022-10-10 13:42:51 +01:00
David Esparza	eb5423cb7f	kata-ctl: switch to use clap derive for CLI handling Switch from the functional version of `clap` to the declarative methodology. Signed-off-by: David Esparza <david.esparza.borquez@intel.com> Commit-edited-by: James O. D. Hunt <james.o.hunt@intel.com>	2022-10-10 13:42:51 +01:00
Chelsea Mafrica	018aa899cb	kata-ctl: Add cpu check Add architecture-specific code for x86_64 and generic calls handling checks for CPU flags and attributes. Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2022-10-10 13:42:50 +01:00
James O. D. Hunt	7c9f9a5a1d	kata-ctl: Make arch test run at compile time Changed the `panic!()` call to a `compile_error!()` one to ensure it fires at compile time rather than runtime. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2022-10-10 13:42:50 +01:00
James O. D. Hunt	b63ba66dc3	kata-ctl: Formatting tweaks Automatic format updates. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2022-10-10 13:42:50 +01:00
James O. D. Hunt	cca7e32b54	kata-ctl: Lint fixes to allow the branch to be built Remove return value for branches that call `unimplemented!()`. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2022-10-10 13:42:50 +01:00
Chelsea Mafrica	8e7bb8521c	kata-ctl: add code for framework for arch Add framework for different architectures for check. In the existing kata-runtime check, the network checks do not appear to be architecture-specific while the kernel module, cpu, and kvm checks do have separate implementations for different architectures. Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2022-10-10 13:42:50 +01:00
David Esparza	303fc8b118	kata-ctl: Add unit tests cases Add more unit tests cases to --version argument. Signed-off-by: David Esparza <david.esparza.borquez@intel.com> Commit-edited-by: James O. D. Hunt <james.o.hunt@intel.com>	2022-10-10 13:42:43 +01:00
David Esparza	d0b33e9a32	versions: Add kata-ctl version entry As we're switching to using the rust version of the kata-ctl, let's provide it with its own entry in the kata-ctl command line. Signed-off-by: David Esparza <david.esparza.borquez@intel.com> Commit-edited-by: James O. D. Hunt <james.o.hunt@intel.com>	2022-10-10 13:42:35 +01:00
Chelsea Mafrica	002b18054d	kata-ctl: Add initial rust code for kata-ctl Use agent-ctl tool rust code as an example for a skeleton for the new kata-ctl tool. Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2022-10-10 10:10:37 +01:00
wllenyj	b62b18bf1c	dragonball: fix clippy warning Fixed: - unnecessary_lazy_evaluations - derive_partial_eq_without_eq - redundant_closure - single_match - question_mark - unused-must-use - redundant_clone - needless_return Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-10-10 16:41:40 +08:00
wllenyj	2ddc948d30	Makefile: add dragonball components. Enable ci to run dragonball unit tests. Fixes: #4899 Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-10-10 16:41:40 +08:00
wllenyj	3fe81fe4ab	dragonball-ut: use skip_if_not_root to skip root case Use skip_if_not_root to skip when unit test requires privileges. Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-10-10 16:41:40 +08:00
wllenyj	72259f101a	dragonball: add more unit test for vmm actions Added more unit tests for vmm actions. Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-10-10 16:41:39 +08:00
Chao Wu	9717dc3f75	Dragonball: remove redundant comments in event manager handle_events for EventManager doesn't take max_events as arguments, so we need to update the comments for it. p.s. max_events is defined when initializing the EventManager. fixes: #5382 Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-10-09 14:38:12 +08:00
Fupan Li	2c88e1cd80	Merge pull request #5302 from liubin/fix/5285-SetFsSharingSupport-comment runtime: fix incorrect comment for SetFsSharingSupport function	2022-10-09 09:40:31 +08:00
Bin Liu	b556c9b986	Merge pull request #5235 from YchauWang/wyc-qmp-log virtcontainers: add warn log record for qmp hotplug cpu error	2022-10-09 08:29:09 +08:00
Bin Liu	53f209af44	libs/kata-types: adjust default_vcpus correctly With the settings default_maxvcpus = 0 and default_vcpus = 1, default_vcpus will be set to 0, which causes startup to fail. The default_maxvcpus is not set correctly when it is set to 0, and the default_vcpus is set to 0. The correct action is setting default_maxvcpus to the max number of CPUs or MAX_DRAGONBALL_VCPUS, and the default_vcpus should be set to the desired value if the value is between 0 and default_maxvcpus. Fixes: #5110 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-10-08 16:52:05 +08:00
Bin Liu	dd34540b8a	Merge pull request #5305 from liubin/fix/5301-delete-duplicated-PASSTHROUGH_FS_DIR runtime-rs: delete duplicated PASSTHROUGH_FS_DIR const	2022-10-08 16:39:03 +08:00
Ji-Xinyou	9c1ac3d457	runtime-rs: return port on agent-url req Add the server vport (1024) when requesting agent-url Fixes: #5213 Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2022-10-08 16:14:21 +08:00
Fabiano Fidêncio	ce73bc6dac	Merge pull request #5015 from vijaydhanraj/enable_acrn_kata2.x Enable ACRN hypervisor support for Kata 2.x release	2022-10-08 09:27:59 +02:00
Bin Liu	4616363eec	Merge pull request #5365 from fengwang666/mount-bug-fix agent: reduce reference count for failed mount	2022-10-08 14:27:38 +08:00
Fupan Li	1b7272c7ca	Merge pull request #5367 from fengwang666/signal-bug-fix agent: don't exit early if signal fails due to ESRCH	2022-10-08 14:21:50 +08:00
Feng Wang	ef5a2dc3bf	agent: don't exit early if signal fails due to ESRCH ESRCH usually means the process has exited. In this case, the execution should continue to kill remaining container processes. Fixes: #5366 Signed-off-by: Feng Wang <feng.wang@databricks.com> [Fix up cargo updates] Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2022-10-08 12:15:12 +08:00
Bin Liu	5ace4e2354	Merge pull request #5304 from liubin/fix/5299-delete-duplicated-get_bundle_path kata-sys-util: delete duplicated get_bundle_path	2022-10-08 10:57:52 +08:00
Vijay Dhanraj	435c8f181a	acrn: Enable ACRN hypervisor support for Kata 2.x release Currently ACRN hypervisor support in Kata2.x releases is broken. This commit re-enables ACRN hypervisor support and also refactors the code so as to remove dependency on Sandbox. Fixes #3027 Signed-off-by: Vijay Dhanraj <vijay.dhanraj@intel.com>	2022-10-07 07:40:32 -07:00
Feng Wang	c31cf7269e	agent: reduce reference count for failed mount The kata agent adds a reference for each storage object before mount and skip mount again if the storage object is known. We need to remove the object reference if mount fails. Fixes: #5364 Signed-off-by: Feng Wang <feng.wang@databricks.com>	2022-10-06 21:37:59 -07:00
Archana Shinde	6e2d39c588	Merge pull request #5311 from likebreath/0930/clh_v27.0 Upgrade to Cloud Hypervisor v27.0	2022-10-04 10:56:00 -07:00
Fabiano Fidêncio	d5572d5fd5	Merge pull request #5106 from norbjd/fix/microvm-machine-options microvm: Remove kernel_irqchip=on option	2022-10-04 12:19:37 +02:00
Champ-Goblem	89e62d4edf	shim: Ensure pagesize is set when reporting hugetlb stats The containerd stats method and metrics API are broken with Kata 2.5.x: the stats fail to load and the metrics API responds with status code 500. This seems to be down to the conversion from the stats reported by the agent RPC `StatsContainer`, where the field `Pagesize` is not completed by the `setHugetlbStats` method. In the case where multiple sized tables stats are reported, this causes containerd to register two metrics with the same label set, rather than each being partitioned by the `page` label. Fixes: #5316 Signed-off-by: Champ-Goblem <cameron@northflank.com>	2022-10-04 09:16:30 +01:00
Bo Chen	067e2b1e33	runtime: clh: Use the new API to boot with TDX firmware (td-shim) The new way to boot from TDX firmware (e.g. td-shim) is using the combination of '--platform tdx=on' with '--firmware tdshim'. Fixes: #5309 Signed-off-by: Bo Chen <chen.bo@intel.com>	2022-10-03 10:30:54 -07:00
Bo Chen	5d63fcf344	runtime: clh: Re-generate the client code This patch re-generates the client code for Cloud Hypervisor v27.0. Note: The client code of cloud-hypervisor's (CLH) OpenAPI is automatically generated by openapi-generator [1-2]. [1] https://github.com/OpenAPITools/openapi-generator [2] https://github.com/kata-containers/kata-containers/blob/main/src/runtime/virtcontainers/pkg/cloud-hypervisor/README.md Fixes: #5309 Signed-off-by: Bo Chen <chen.bo@intel.com>	2022-10-03 10:30:42 -07:00
Fabiano Fidêncio	0143036b84	Merge pull request #5303 from liubin/fix/5296-typo-unknow kata-sys-util: fix typo `unknow`	2022-10-03 15:29:45 +02:00
norbjd	17de94e118	microvm: Remove kernel_irqchip=on option `kernel_irqchip` option doesn't seem to bring any benefits and, on the contrary, its usage causes issues when using the microvm machine type. With this in mind, let's remove it. Fixes: #1984, #4386 Signed-off-by: norbjd <norbjd@users.noreply.github.com> Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-10-03 11:48:05 +02:00
Bin Liu	3aeaa6459d	runtime-rs: delete duplicated PASSTHROUGH_FS_DIR const The const PASSTHROUGH_FS_DIR is defined twice; delete one. Fixes: #5301 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-09-30 15:53:08 +08:00
Bin Liu	43ae972335	kata-sys-util: delete duplicated get_bundle_path get_bundle_path is already defined in spec.rs, so delete it from fs.rs. Fixes: #5299 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-09-30 15:50:58 +08:00
Bin Liu	ac04831223	kata-sys-util: fix typo `unknow` Change `unknow` to `unknown`. Fixes: #5296 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-09-30 15:47:34 +08:00
Bin Liu	68e8a86aec	runtime: fix incorrect comment for SetFsSharingSupport function The comment for SetFsSharingSupport is not suitable, correct the function name. Fixes: #5285 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-09-30 15:44:44 +08:00
Bin Liu	805e80b2a2	Merge pull request #5278 from openanolis/chao/update_linux_loader_ut dragonball: update ut for kernel config	2022-09-30 11:12:29 +08:00
Bin Liu	8d4ced3c86	runtime-rs: support ephemeral storage for emptydir Add support for ephemeral storage and k8s emptydir. Depends-on:github.com/kata-containers/tests#5161 Fixes: #4730 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-09-30 09:10:20 +08:00
Jianyong Wu	6d585d5919	dragonball: fix no "as_str" error on Arm The Cmdline struct was updated in the latest linux-loader lib and its as_str method was changed to as_cstring, so we need to fix the places where the old as_str method is still used. Fixes: #5287 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2022-09-29 21:06:31 +08:00
Bin Liu	949ffcc457	Merge pull request #5281 from liubin/fix/5280-update-cargo-lock runtime-rs: update Cargo.lock	2022-09-29 17:16:21 +08:00
Bin Liu	1352e31180	Merge pull request #5200 from openanolis/agent_rwlock refactor(runtime-rs): Use RwLock in runtime-agent	2022-09-29 13:15:41 +08:00
Bin Liu	457b0beaf0	runtime-rs: update Cargo.lock src/dragonball/Cargo.toml was updated but the Cargo.lock was not committed into the repo. Fixes: #5280 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-09-29 13:15:01 +08:00
Bin Liu	abbdf89a06	Merge pull request #5271 from liubin/fix/4729-add-close-io-for-kubectl-cp runtime-rs: fix shim close_io call to support kubectl cp	2022-09-29 13:10:49 +08:00
Peng Tao	046ddc6463	readme: remove libraries mentioning There are two duplicated mentioning of the rust libraries in README.md. Let's just remove them all as the section is intended to list out core Kata components rather than general libraries. Fixes: #5275 Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2022-09-29 12:10:50 +08:00
Chao Wu	f89ada2de1	dragonball: update ut for kernel config Since linux loader is updated in the Dragonball and the api for Cmdline has been changed ( as_str() changed to as_cstring() ), we need to update unit test in Dragonball. fixes: #5277 Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-09-29 11:35:45 +08:00
Bin Liu	0e899669ee	runtime-rs: fix shim close_io call to support kubectl cp Add close_io to shim and call agent's close_stdin in close_io. Depends-on:github.com/kata-containers/tests#5155 Fixes: #4729 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-09-29 09:35:17 +08:00
Zhongtao Hu	96cf21fad0	runtime-rs: add comments for runtime-rs shared directory add comments for runtime-rs shared directory Fixes:#5197 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-09-28 15:46:34 +08:00
Zhongtao Hu	2f1a4b02ee	Merge pull request #5254 from openanolis/chao/update_linux_loader Dragonball: update linux_loader to 0.6.0	2022-09-28 15:04:09 +08:00
Bin Liu	0f6884b8c3	Merge pull request #5252 from zhaoxuat/main modify virtio_net_dev_mgr.rs wrong code comments	2022-09-28 11:34:20 +08:00
Bin Liu	d0be4a285e	Merge pull request #5260 from GabyCT/topic/fixrunkdoc docs: Update urls in runk documentation	2022-09-28 11:30:39 +08:00
Zhongtao Hu	ff053b0808	Merge pull request #5220 from liubin/fix/5184-rs-inotify runtime-rs: support watchable mount	2022-09-28 11:19:53 +08:00
Zhongtao Hu	319caa8e74	Merge pull request #5097 from openanolis/dbg-console runtime-rs: debug console support in runtime	2022-09-28 10:30:22 +08:00
Peng Tao	33b0720119	Merge pull request #5193 from openanolis/origin/kata-deploy kata-deploy: ship the rustified runtime binary	2022-09-28 10:19:16 +08:00
Gabriela Cervantes	9bd941098e	docs: Update urls in runk documentation This PR updates the urls that we have in the runk documentation. Fixes #5259 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2022-09-27 15:45:43 +00:00
Chao Wu	90ecc015e0	Dragonball: update linux_loader to 0.6.0 Since linux-loader 0.4.0 and 0.5.0 are yanked due to a null terminator bug, we need to update linux-loader to 0.6.0. And the as_str() function should also be changed. fixes: #5253 Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-09-27 23:01:44 +08:00
Bin Liu	c64e56327f	Merge pull request #5190 from liubin/fix/5189-unbind-as-a-const runtime-rs: define VFIO unbind path as a const	2022-09-27 21:04:18 +08:00
Bin Liu	4a763925e5	runtime-rs: support watchable mount Use watchable mount to support inotify for virtio-fs. Fixes: #5184 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-09-27 19:08:25 +08:00
zhaoxu	abc26b00bb	dragonball: modify wrong code comments modify virtio_net_dev_mgr.rs wrong code comments Fixes: #5252 Signed-off-by: zhaoxu <zhaoxu@megvii.com>	2022-09-27 18:32:13 +08:00
Bin Liu	c95cf6dce7	Merge pull request #5250 from liubin/fix/5249-set-timeout-to-zero-for-stream-rpc runtime-rs: set agent timeout to 0 for stream RPCs	2022-09-27 17:39:35 +08:00
Peng Tao	8a2df6b31c	Merge pull request #4931 from jpecholt/snp-support Added SNP-Support for Kata-Containers	2022-09-27 14:17:54 +08:00
Bin Liu	20bcaf0e36	runtime-rs: set agent timeout to 0 for stream RPCs For stream RPCs: - write_stdin - read_stdout - read_stderr there should be no timeout (by setting it to 0). Fixes: #5249 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-09-27 11:47:37 +08:00
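The rule in 20bcaf0e36 (no timeout for the three stream RPCs, expressed as a timeout of 0) can be sketched as below. This is a minimal illustration, not the runtime-rs code: `timeout_secs_for` and `STREAM_RPCS` are hypothetical names, and the assumption that timeouts are plain seconds is made only for the example.

```rust
/// RPC names taken from the commit message; these calls stream data for
/// as long as the process lives, so a deadline would cut them off.
const STREAM_RPCS: [&str; 3] = ["write_stdin", "read_stdout", "read_stderr"];

/// Hypothetical helper: pick the timeout (in seconds) for an agent RPC.
/// A value of 0 is used here to mean "no timeout", as in the commit message.
fn timeout_secs_for(rpc: &str, default_secs: u64) -> u64 {
    if STREAM_RPCS.contains(&rpc) {
        0
    } else {
        default_secs
    }
}
```

Any other RPC name falls through to the caller-supplied default.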
Bin Liu	407e46b1b7	Merge pull request #5218 from bergwolf/github/deps runtime/runtime-rs: update dependency	2022-09-27 11:02:46 +08:00
Bin Liu	a2f207b923	Merge pull request #5163 from liubin/fix/5162-add-test-for-StaticResource runtime-rs: add test for StaticResource	2022-09-26 17:44:20 +08:00
Zhongtao Hu	9d67f5a7e2	Merge pull request #5230 from openanolis/nohc runtime-rs: remove hardcoded string	2022-09-26 16:01:41 +08:00
quanweiZhou	ad87c7ac56	Merge pull request #5206 from openanolis/hypervisor/readme docs: add README for runtime-rs hypervisor crate	2022-09-26 16:01:12 +08:00
Bin Liu	5a98fb8d2b	Merge pull request #5186 from liubin/fix/5185 runtime-rs: use Path.is_file to check regular files	2022-09-26 12:33:47 +08:00
Zhongtao Hu	4a36bb9e21	Merge pull request #4924 from openanolis/runtime-rs-netUT runtime-rs: add unit tests for network resource	2022-09-23 17:45:24 +08:00
Zhongtao Hu	274de024c5	docs: add README for runtime-rs hypervisor crate add README for runtime-rs hypervisor crate Fixes:#4634 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-09-23 15:20:02 +08:00
Chao Wu	9cf5de0b4e	Merge pull request #5171 from liubin/fix/5170-use-macro runtime-rs/resource: use macro to reduce duplicated code	2022-09-23 10:59:53 +08:00
wangyongchao.bj	04bbce8dc3	virtcontainers: add warn log record for qmp hotplug cpu error The error from a failed QMP CPU hotplug command was hidden, which made it hard for users to trace CPU hotplug failures. This PR improves the CPU hotplug error log by adding the real QEMU command error to the `failed to hot add vCPUs` message, so the error message shows why the QMP command failed. Fixes: #5234 Signed-off-by: wangyongchao.bj <wangyongchao.bj@inspur.com>	2022-09-23 08:22:30 +08:00
Chelsea Mafrica	de869f2565	Merge pull request #5188 from liubin/fix/5187-incorrect-comments-in-kata-types-hypervisor runtime-rs: fix incorrect comments	2022-09-22 14:09:20 -07:00
Zhongtao Hu	d663f110d7	kata-deploy: get the config path from cri options get the config path for runtime-rs from cri options Fixes: #5000 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-09-22 17:39:25 +08:00
Ji-Xinyou	46965739a4	runtime-rs: remove hardcoded string Use KATA_PATH instead of "run/kata" Fixes: #5229 Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2022-09-22 16:06:51 +08:00
Zhongtao Hu	a394761a5c	kata-deploy: add installation for runtime-rs setup the compile environment and installation path for the Rust runtime Fixes:#5000 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-09-22 15:59:44 +08:00
Peng Tao	a2c13bad45	Merge pull request #5156 from fengwang666/uid-reuse-bug Non-root hypervisor uid reuse bug	2022-09-22 15:35:39 +08:00
Peng Tao	af174c2b6d	Merge pull request #5195 from wllenyj/update-dbs Build-in Sandbox: update dragonball-sandbox dependencies	2022-09-22 15:07:11 +08:00
Ji-Xinyou	50299a3292	refactor(runtime-rs): Use RwLock in runtime agent Use RwLock for Agent in runtime, for better concurrency. Fixes: #5199 Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2022-09-21 17:43:40 +08:00
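The concurrency benefit named in 50299a3292 comes from a read-write lock letting many readers proceed at once while writers stay exclusive. The sketch below uses `std::sync::RwLock` to stay self-contained; the actual runtime-rs code may well use an async lock instead, and `AgentState`, `read_concurrently` and `set_timeout` are hypothetical names for illustration.

```rust
use std::sync::{Arc, RwLock};
use std::thread;

/// Hypothetical stand-in for the shared agent state.
struct AgentState {
    timeout_ms: u32,
}

/// Spawn `readers` threads that all take the read lock; with a RwLock
/// they do not block each other the way they would with a Mutex.
fn read_concurrently(state: Arc<RwLock<AgentState>>, readers: usize) -> Vec<u32> {
    let handles: Vec<_> = (0..readers)
        .map(|_| {
            let shared = Arc::clone(&state);
            thread::spawn(move || {
                let guard = shared.read().unwrap();
                guard.timeout_ms
            })
        })
        .collect();
    handles.into_iter().map(|h| h.join().unwrap()).collect()
}

/// Writers still need exclusive access.
fn set_timeout(state: &RwLock<AgentState>, ms: u32) {
    state.write().unwrap().timeout_ms = ms;
}
```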
Peng Tao	9628c7df0c	runtime: update runc dependency To bring fix to CVE-2022-29162. Fixes: #5217 Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2022-09-21 17:21:37 +08:00
Peng Tao	7fbc883879	runtime-rs: drop dependency on rustc-serialize We are not using it and it hasn't got any updates for more than five years, leaving open CVEs unresolved. Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2022-09-21 17:19:58 +08:00
Ji-Xinyou	e23bfd615e	runtime-rs: make function name more understandable Change kparams to kernel_params for understandability. Fixes: #5068 Signed-Off-By: Ji-Xinyou <jerryji0414@outlook.com>	2022-09-21 11:48:11 +08:00
Ji-Xinyou	426a436780	runtime-rs: add unit test and eliminate raw string Add two unit tests for coverage and eliminate raw strings to constant. Fixes: #5068 Signed-Off-By: Ji-Xinyou <jerryji0414@outlook.com>	2022-09-21 11:47:07 +08:00
Ji-Xinyou	87959cb72d	runtime-rs: debug console support in runtime Read debug console configuration in kernel params. Fixes: #5068 Signed-Off-By: Ji-Xinyou <jerryji0414@outlook.com>	2022-09-21 11:46:55 +08:00
Bin Liu	a2e7434a0f	Merge pull request #5082 from QiliangFan/main dragonball: Fix problem that stdio console cannot connect to stdout	2022-09-21 11:12:19 +08:00
wllenyj	0399da677d	runtime-rs: update dependencies Updated Cargo.lock. Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-09-20 15:00:14 +08:00
wllenyj	f6f19917a8	dragonball: update dragonball-sandbox dependencies Updated vmm-sys-util to 0.10.0 Updated virtio-queue to 0.4.0 Updated vm-memory to 0.9.0 Updated linux-loader to 0.5.0 Fixes: #5194 Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-09-20 14:48:09 +08:00
Zhongtao Hu	e05e42fd3c	Merge pull request #5113 from liubin/fix/5112-call-TomlConfig-validate-func runtime-rs: call TomlConfig's validate function after load	2022-09-20 14:38:42 +08:00
Zhongtao Hu	fc65e96ad5	Merge pull request #5133 from openanolis/shimmgmt feat(Shimmgmt): Shim management server and client	2022-09-20 14:37:19 +08:00
Bin Liu	2caee1f38d	runtime-rs: define VFIO unbind path as a const In src/runtime-rs/crates/hypervisor/src/device/vfio.rs, the path of new_id is defined as a const, but unbind is used as a local variable, they should be unified to const. Fixes: #5189 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-09-19 16:08:35 +08:00
Bin Liu	3f65ff2d07	runtime-rs: fix incorrect comments Some comments for types are incorrect in file src/libs/kata-types/src/config/hypervisor/mod.rs Fixes: #5187 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-09-19 16:03:06 +08:00
Bin Liu	9670a3caac	runtime-rs: use Path.is_file to check regular files Use Path.is_file to replace using `stat` to check the file type. Fixes: #5185 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-09-19 15:57:07 +08:00
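The change in 9670a3caac can be illustrated with the standard library alone. The first function below is a guess at what a `stat`-style check looks like, not the code that was removed; the second is the `Path::is_file` form the commit switches to. Both function names are hypothetical.

```rust
use std::fs;
use std::path::Path;

/// Stat-style check: fetch metadata, then inspect the file type.
fn is_regular_file_via_metadata(path: &Path) -> bool {
    fs::metadata(path)
        .map(|m| m.file_type().is_file())
        .unwrap_or(false)
}

/// Same answer in one call: `Path::is_file` follows symlinks and
/// returns false when the path is missing or cannot be inspected.
fn is_regular_file(path: &Path) -> bool {
    path.is_file()
}
```

Because `is_file` folds errors into `false`, callers that need to tell "missing" from "permission denied" would still want `fs::metadata`.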
Joana Pecholt	ded60173d4	runtime: Enable choice between AMD SEV and SNP This is based on a patch from @niteeshkd that adds a config parameter to choose between AMD SEV and SEV-SNP VMs as the confidential guest type in case both types are supported. SEV is the default. Signed-off-by: Joana Pecholt <joana.pecholt@aisec.fraunhofer.de>	2022-09-16 17:51:41 +02:00
Joana Pecholt	22bda0838c	runtime: Support for AMD SEV-SNP VMs This commit adds AMD SEV-SNP as a confidential guest option to the runtime. Information on required components such as OVMF, QEMU and a kernel supporting SEV-SNP are defined in the versions file and corresponding configs are added. Note: The CPU model 'host' provided by the current SNP-QEMU does not support all SNP capabilities yet, which is why this option is changed to EPYC-v4. Note: The guest's physical address space reduction specified with ReducedPhysBits is 1. Details can be found in Section 15.34.6 here https://www.amd.com/system/files/TechDocs/24593.pdf Fixes #4437 Signed-off-by: Joana Pecholt <joana.pecholt@aisec.fraunhofer.de>	2022-09-16 17:51:41 +02:00
Joana Pecholt	105eda5b9a	runtime: Initrd path option added to config Adds initrd configuration option to the configuration.toml that is generated for the setup using QEMU. Signed-off-by: Joana Pecholt <joana.pecholt@aisec.fraunhofer.de>	2022-09-16 17:51:41 +02:00
Bin Liu	a8a8a28a34	runtime-rs/resource: use macro to reduce duplicated code Some device types have the same definition, so they can be implemented by a macro to reduce code. This commit also deletes the `peer_name` field of the structs, which has never been used. Fixes: #5170 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-09-15 15:45:26 +08:00
Bin Liu	156e1c3247	runtime-rs: delete some allow(dead_code) attributes Some #![allow(dead_code)]s and code are not needed indeed. Fixes: #5164 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-09-14 20:50:30 +08:00
qiliangfan	7622452f4b	Dragonball: Fix the problem about stdio console Let stdout stream connect to the com1_device, Fixes: #5083 Signed-off-by: qiliangfan <fanqiliang@mail.nankai.edu.cn>	2022-09-14 15:53:57 +08:00
Bin Liu	208233288a	runtime-rs: add test for StaticResource Add test case for StaticResource, the old test is not covering the StaticResource struct. Fixes: #5162 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-09-14 11:45:07 +08:00
Feng Wang	f914319874	runtime: store the user name in hypervisor config The user name will be used to delete the user instead of relying on uid lookup because uid can be reused. Fixes: #5155 Signed-off-by: Feng Wang <feng.wang@databricks.com>	2022-09-13 10:32:55 -07:00
Feng Wang	5cafe21770	runtime: make StopVM thread-safe StopVM can be invoked by multiple threads and needs to be thread-safe Fixes: #5155 Signed-off-by: Feng Wang <feng.wang@databricks.com>	2022-09-12 21:56:15 -07:00
Feng Wang	c3015927a3	runtime: add more debug logs for non-root user operation Previously the logging was insufficient and made debugging difficult Fixes: #5155 Signed-off-by: Feng Wang <feng.wang@databricks.com>	2022-09-12 21:38:57 -07:00
Bin Liu	a58feba9bb	Merge pull request #5105 from liubin/fix/5104-ignore-virtiofs-daemon-for-inline-mode kata-types: don't check virtio_fs_daemon for inline-virtio-fs	2022-09-13 10:33:56 +08:00
Bin Liu	42d4da9b6c	Merge pull request #5101 from liubin/fix/5100-cpu-period-quota-data-type kata-types: change return type of getting CPU period/quota function	2022-09-13 10:33:29 +08:00
Tim Zhang	8ec4edcf4f	Merge pull request #5146 from liubin/fix/5145-check-host-dev runtime-rs: fix host device check pattern	2022-09-13 10:33:05 +08:00
Bin Liu	62cf6e6fc3	runtime-rs: remove meaningless comment The comment for `generate_mount_path` function is a copy miss and should be deleted. Fixes: #5150 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-09-09 16:07:35 +08:00
Bin Liu	55f4f3a95b	Merge pull request #4897 from ManaSugi/runk/enable-seccomp runk: Enable seccomp support by default	2022-09-09 14:11:35 +08:00
Manabu Sugimoto	bcf6bf843c	runk: Enable seccomp support by default Enable seccomp support in `runk` by default. As a result, `runk` is built with `gnu libc` by default, because building `runk` statically linked against `libseccomp` and `musl` requires additional configuration. Also, general container runtimes are built with `gnu libc` as dynamically linked binaries by default. The user can disable seccomp with `make SECCOMP=no`. Fixes: #4896 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2022-09-09 10:55:16 +09:00
GabyCT	be462baa7e	Merge pull request #5103 from liubin/fix/5102-add-inline-virtiofs-config config: add "inline-virtio-fs" as a "shared_fs" type	2022-09-08 10:33:20 -05:00
GabyCT	bcbce8317d	Merge pull request #5061 from liubin/fix/5022-runtime-rs-readme runtime-rs: add README.md	2022-09-08 10:32:08 -05:00
bin liu	2b1d058572	runtime-rs: fix host device check pattern Host devices should start with `/dev/` but not `/dev`. Fixes: #5145 Signed-off-by: bin liu <liubin0329@gmail.com>	2022-09-08 22:44:46 +08:00
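Why the trailing slash in 2b1d058572 matters can be shown with a plain prefix check. This is only a sketch of the pattern; `looks_like_host_device` is a hypothetical name and the real check in runtime-rs may do more than a prefix test.

```rust
/// Prefix check with the trailing slash, as described in the commit.
/// Matching on "/dev" alone would also accept unrelated paths such as
/// "/devices/foo" or "/development".
fn looks_like_host_device(path: &str) -> bool {
    path.starts_with("/dev/")
}
```

With the stricter prefix, the bare directory `/dev` is not treated as a device either.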
Bin Liu	85b49cee02	runtime-rs: add README.md Add README.md for runtime-rs. Fixes: #5022 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-09-08 16:03:45 +08:00
Bin Liu	7cfc357c6e	Merge pull request #5034 from ManaSugi/runk/refactor-container-builder runk: Refactor container builder	2022-09-08 11:30:07 +08:00
Ji-Xinyou	5add50aea2	runtime-rs: timeout for shim management client Let client side support timeout if the timeout value is set. If timeout not set, execute directly. Fixes: #5114 Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2022-09-08 11:11:33 +08:00
Bin Liu	36d805fab9	config: add "inline-virtio-fs" as a "shared_fs" type "inline-virtio-fs" is newly supported by kata 3.0 as a "shared_fs" type, it should be described in configuration file. "inline-virtio-fs" is the same as "virtio-fs", but it is running in the same process of shim, does not need an external virtiofsd process. Fixes: #5102 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-09-08 11:05:01 +08:00
Bin Liu	5df6ff991d	Merge pull request #5116 from liubin/fix/5115-replace-tab-by-space libs/kata-types: replace tabs by spaces in comments	2022-09-07 15:53:34 +08:00
Ji-Xinyou	9f13496e13	runtime-rs: shim management client Add client side function(public), to establish http connections (PUT, POST, GET) to the long standing shim mgmt server. Fixes: #5114 Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2022-09-07 15:39:14 +08:00
Bin Liu	aaf6d69089	runtime-rs: call TomlConfig's validate function after load Call TomlConfig's validate function after it is loaded and adjusted by annotations. Fixes: #5112 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-09-07 11:34:08 +08:00
Bin Liu	fe55f6afd7	Merge pull request #5124 from amshinde/revert-arp-neighbour-api Revert arp neighbour api	2022-09-07 11:14:53 +08:00
Ji-Xinyou	e891295e10	runtime-rs: shim management - agent-url Add agent-url to its handler. The general framework of registering URL handlers is done. Fixes: #5114 Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2022-09-07 11:13:21 +08:00
Chelsea Mafrica	051dabb0fe	Merge pull request #5099 from liubin/fix/5098-add-default-config-for-runtime-rs runtime-rs: add default agent/runtime/hypervisor for configuration	2022-09-06 17:49:42 -07:00
Archana Shinde	d23779ec9b	Revert "agent: fix unittests for arp neighbors" This reverts commit `81fe51ab0b`.	2022-09-06 15:41:42 -07:00
Archana Shinde	d340564d61	Revert "agent: use rtnetlink's neighbours API to add neighbors" This reverts commit `845c1c03cf`. Fixes: #5126	2022-09-06 15:41:42 -07:00
Bin Liu	50f9126153	libs/kata-types: replace tabs by spaces in comments Replace tabs by spaces in the comments of file libs/kata-types/src/annotations/mod.rs. Fixes: #5115 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-09-06 17:32:57 +08:00
Ji-Xinyou	59aeb776b0	runtime-rs: shim management Add shim management http server and boot it as a light-weight thread when the sandbox is created. Fixes: #5114 Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2022-09-06 16:44:16 +08:00
Bin Liu	96c8be715b	libs/kata-types: change return type of getting CPU period/quota period should have a type of u64, and quota should be i64, the function of getting CPU period and quota from annotations should use the same data type as function return type. Fixes: #5100 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-09-06 11:35:52 +08:00
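The type split in 96c8be715b mirrors the OCI runtime spec, where the CPU period is unsigned and the quota is signed, so that a quota of -1 can mean "no limit". A minimal sketch of annotation parsing with those return types follows; the annotation keys and function names are made up for the example and are not the kata-types ones.

```rust
use std::collections::HashMap;

// Hypothetical annotation keys, used only in this sketch.
const PERIOD_KEY: &str = "example.io/cpu-period";
const QUOTA_KEY: &str = "example.io/cpu-quota";

/// Period is a duration in microseconds and can never be negative.
fn cpu_period(annotations: &HashMap<String, String>) -> Option<u64> {
    annotations.get(PERIOD_KEY)?.parse().ok()
}

/// Quota must be signed: -1 conventionally means "unlimited".
fn cpu_quota(annotations: &HashMap<String, String>) -> Option<i64> {
    annotations.get(QUOTA_KEY)?.parse().ok()
}
```

Parsing straight into the final type means a negative period is rejected at the boundary instead of being cast later.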
Bin Liu	fc9c6f87a3	kata-types: don't check virtio_fs_daemon for inline-virtio-fs If the shared_fs is set to "inline-virtio-fs", the "virtio_fs_daemon" should be ignored. Fixes: #5104 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-09-05 17:44:28 +08:00
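The validation rule from fc9c6f87a3 can be sketched as a match on the `shared_fs` value. The function name, the error type and the handling of other `shared_fs` values are assumptions made for illustration; only the two string values and the "ignore `virtio_fs_daemon`" rule come from the commit messages.

```rust
/// Hypothetical validator: only the external-daemon mode needs a
/// `virtio_fs_daemon` path.
fn validate_shared_fs(shared_fs: &str, virtio_fs_daemon: &str) -> Result<(), String> {
    match shared_fs {
        // Runs inside the shim process, so the daemon path is ignored.
        "inline-virtio-fs" => Ok(()),
        // Needs an external virtiofsd, so the path must be set.
        "virtio-fs" if virtio_fs_daemon.is_empty() => {
            Err("virtio_fs_daemon must be set for virtio-fs".to_string())
        }
        // Other shared_fs types are out of scope for this sketch.
        _ => Ok(()),
    }
}
```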
James O. D. Hunt	662ce3d6f2	Merge pull request #5086 from Yuan-Zhuo/main docs: fix unix socket address in agent-ctl doc	2022-09-05 09:24:28 +01:00
Bin Liu	e879270a0c	runtime-rs: add default agent/runtime/hypervisor for configuration Kata 3.0 introduced 3 new configuration options under the runtime section: name="virt_container" hypervisor_name="dragonball" agent_name="kata" Blank values cause startup to fail. Adding default values makes it easier for users to migrate to Kata 3.0. Fixes: #5098 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-09-05 15:55:28 +08:00
Bin Liu	e5437a7084	Merge pull request #5063 from liubin/fix/5062-split-amend-spec runtime-rs: split amend_spec function	2022-09-05 15:00:31 +08:00
Manabu Sugimoto	968c2f6e8e	runk: Refactor container builder Refactor the container builder code (`InitContainer` and `ActivatedContainer`) to make it easier to understand and to maintain. The details: 1. Separate the existing `builder.rs` into an `init_builder.rs` and `activated_builder.rs` to make them easy to read and maintain. 2. Move the `create_linux_container` function from the `builder.rs` to `container.rs` because it is shared by both files. 3. Move some validation functions such as `validate_spec` from `builder.rs` to `utils.rs` because they will also be used by other components as utilities in the future. Fixes: #5033 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2022-09-05 14:36:30 +09:00
Bin Liu	ba013c5d0f	Merge pull request #4744 from openanolis/runtime-rs-static_resource_mgmt runtime-rs: support functionality of static resource management	2022-09-05 11:17:09 +08:00
Wainer Moschetta	e81a73b622	Merge pull request #4719 from bookinabox/cargo-deny github-actions: Add cargo-deny	2022-09-02 17:24:50 -03:00
Bin Liu	86ad832e37	runtime-rs: force shutdown shim process in it can't exit In some cases the cleanup call from the shim to the service manager fails and the shim process keeps running, which leaks the process. This commit forces the shim process to shut down on any error in the service crate. Fixes: #5087 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-09-02 19:43:50 +08:00
Yuan-Zhuo	5f4f5f2400	docs: fix unix socket address in agent-ctl doc Following the instructions in guidance doc will result in the ECONNREFUSED, thus we need to keep the unix socket address in the two commands consistent. Fixes: #5085 Signed-off-by: Yuan-Zhuo <yuanzhuo0118@outlook.com>	2022-09-02 17:37:44 +08:00
Peng Tao	b5786361e9	Merge pull request #4862 from egernst/memory-hotplug-limitation Address Memory hotplug limitation	2022-09-02 16:11:46 +08:00
Bin Liu	41ec71169f	runtime-rs: split amend_spec function amend_spec does two things: - modify the spec - check if the pid namespace is enabled This makes it confusing, so split it into two functions. Fixes: #5062 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-09-01 14:44:54 +08:00
Ji-Xinyou	a828292b47	runtime-rs: add unit tests for network resource Add UTs for network resource Fixes: #4923 Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2022-09-01 10:13:09 +08:00
Eric Ernst	9997ab064a	sandbox_test: Add test to verify memory hotplug behavior Augment the mock hypervisor so that we can validate that ACPI memory hotplug is carried out as expected. We'll augment the number of memory slots in the hypervisor config each time the memory of the hypervisor is changed. In this way we can ensure that large memory hotplugs are broken up into appropriately sized pieces in the unit test. Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2022-08-31 10:32:30 -07:00
Eric Ernst	f390c122f0	sandbox: don't hotplug too much memory at once If we're using ACPI hotplug for memory, there's a limitation on the amount of memory which can be hotplugged at a single time. During hotplug, we'll allocate memory for the memmap for each page, resulting in a 64 byte per 4KiB page allocation. As an example, hotplugging 12GiB of memory requires ~192 MiB of free memory, which is about the limit we should expect for an idle 256 MiB guest (conservative heuristic of 75% of provided memory). From experimentation, at pod creation time we can reliably add 48 times what is provided to the guest. (a factor of 48 results in using 75% of provided memory for hotplug). Using prior example of a guest with 256Mi RAM, 256 Mi * 48 = 12 Gi; 12GiB is upper end of what we should expect can be hotplugged successfully into the guest. Note: It isn't expected that we'll need to hotplug large amounts of RAM after workloads have already started -- container additions are expected to occur first in pod lifecycle. Based on this, we expect that provided memory should be freely available for hotplug. If virtio-mem is being utilized, there isn't such a limitation - we can hotplug the max allowed memory at a single time. Fixes: #4847 Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2022-08-31 10:32:30 -07:00
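The arithmetic in f390c122f0 can be checked with a short sketch. The constants (64 bytes of memmap per 4 KiB page, a factor of 48) come from the commit message; the function names and the chunking loop are assumptions for illustration and are not the Go code in the runtime. The loop assumes each hotplugged chunk counts toward the guest's total memory for the next step, in line with the `GetTotalMemoryMB` commit (e0142db24f).

```rust
const MIB: u64 = 1024 * 1024;
const PAGE_SIZE: u64 = 4 * 1024;
/// memmap bytes the guest allocates per hotplugged page (from the commit).
const MEMMAP_BYTES_PER_PAGE: u64 = 64;
/// Heuristic multiplier from the commit message.
const HOTPLUG_FACTOR: u64 = 48;

/// Guest memory (MiB) consumed by the memmap when adding `hotplug_mib`.
fn memmap_overhead_mib(hotplug_mib: u64) -> u64 {
    let pages = hotplug_mib * MIB / PAGE_SIZE;
    pages * MEMMAP_BYTES_PER_PAGE / MIB
}

/// Largest single ACPI hotplug for a guest that currently has `current_mib`.
fn max_hotplug_step_mib(current_mib: u64) -> u64 {
    current_mib * HOTPLUG_FACTOR
}

/// Break a large request into steps that each respect the limit.
fn plan_hotplug(mut current_mib: u64, mut requested_mib: u64) -> Vec<u64> {
    let mut steps = Vec::new();
    while requested_mib > 0 {
        let step = requested_mib.min(max_hotplug_step_mib(current_mib));
        if step == 0 {
            break; // a guest with no memory cannot hotplug anything
        }
        steps.push(step);
        current_mib += step;
        requested_mib -= step;
    }
    steps
}
```

For the 256 MiB guest in the commit message, the first step is capped at 12 GiB, whose memmap costs 192 MiB, i.e. 75% of the guest's memory.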
Peng Tao	f1276180b1	Merge pull request #4996 from liubin/fix/4995-delete-socket-option-for-shim runtime-rs: delete socket from shim command-line options	2022-08-31 14:16:56 +08:00
Bin Liu	515bdcb138	Merge pull request #4900 from wllenyj/dragonball-ut Built-in Sandbox: add more unit tests for dragonball.	2022-08-31 14:00:07 +08:00
Eric Ernst	e0142db24f	hypervisor: Add GetTotalMemoryMB to interface It'll be useful to get the total memory provided to the guest (hotplugged + coldplugged). We'll use this information when calculating how much memory we can add at a time when utilizing ACPI hotplug. Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2022-08-30 16:37:47 -07:00
Derek Lee	52bbc3a4b0	cargo.lock: update crates to comply with checks Updates versions of crossbeam-channel because 0.52.0 is a yanked package (creators mark version as not for release except as a dependency for another package) Updates chrono to use >0.42.0 to avoid: https://rustsec.org/advisories/RUSTSEC-2020-0159 Updates lz4-sys. Signed-off-by: Derek Lee <derlee@redhat.com>	2022-08-30 10:08:41 -07:00
Derek Lee	aa581f4b28	cargo.toml: Add oci to src/libs workplace Adds oci under the src/libs workspace. oci shares a Cargo.lock file with the rest of src/libs but was not listed as a member of the workspace. There is no clear reason why it is not included in the workspace, so add it so that cargo-deny stops complaining. Signed-off-by: Derek Lee <derlee@redhat.com>	2022-08-30 09:30:03 -07:00
Derek Lee	7914da72c9	cargo.tomls: Added Apache 2.0 to cargo.tomls One of the checks done by cargo-deny is ensuring all crates have a valid license. As the rust programs import each other, cargo.toml files without licenses trigger the check. While I could disable this check this would be bad practice. This adds an Apache-2.0 license in the Cargo.toml files. Some of these files already had a header comment saying it is an Apache license. As the entire project itself is under an Apache-2.0 license, I assumed all individual components would also be covered under that license. Signed-off-by: Derek Lee <derlee@redhat.com>	2022-08-30 09:30:03 -07:00
Bin Liu	11383c2c0e	Merge pull request #4797 from openanolis/runtime-rs-coresched runtime-rs: add support for core scheduling	2022-08-29 14:28:30 +08:00
Archana Shinde	c174eb809e	Merge pull request #4983 from ManaSugi/runk/add-init-msg runk: Add cli message for init command	2022-08-27 00:15:25 +05:30
Fupan Li	63959b0be6	Merge pull request #5011 from liubin/fix/4962-add-logs agent: add some logs for mount operation	2022-08-26 17:12:15 +08:00
Bin Liu	c08a8631e0	agent: add some logs for mount operation Log info is lacking in some places; adding more details about the storage and logging on errors will help understand what happened. Fixes: #4962 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-08-26 14:09:56 +08:00
Archana Shinde	7d52934ec1	Merge pull request #4798 from amshinde/use-iouring-qemu Use iouring for qemu block devices	2022-08-26 04:00:24 +05:30
Wainer Moschetta	cbe5e324ae	Merge pull request #4815 from bookinabox/improve-agent-errors logging: Replace nix::Error::EINVAL with more descriptive msgs	2022-08-25 14:27:56 -03:00
Bin Liu	cce99c5c73	runtime-rs: delete socket from shim command-line options The socket is not used to specify the socket address, but an ENV variable is used for runtime-rs. Fixes: #4995 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-08-25 15:32:17 +08:00
Bin Liu	a7e64b1ca9	Merge pull request #4892 from openanolis/shuoyu/runtime-rs runtime-rs: support loading kernel modules in guest vm	2022-08-25 15:01:23 +08:00
Fabiano Fidêncio	ddc94e00b0	Merge pull request #4982 from fidencio/topic/improve-cloud-hypervisor-plus-tdx-support TDX: Get TDX working again with Cloud Hypervisor + a minor change on QEMU's code	2022-08-25 08:53:10 +02:00
Bin Liu	875d946fb4	Merge pull request #4976 from ManaSugi/runk/refactor-delete-func runk: Move delete logic to libcontainer	2022-08-25 14:30:30 +08:00
Yushuo	6cf16c4f76	agent-ctl: fix clippy error Fixes: #4988 Signed-off-by: Yushuo <y-shuo@linux.alibaba.com>	2022-08-25 11:00:49 +08:00
Yushuo	4b57c04c33	runtime-rs: support loading kernel modules in guest vm Users can specify the kernel modules to be loaded through the agent configuration in the kata configuration file or in the pod annotation file. Information about those modules will be sent to the kata agent when the sandbox is created. Fixes: #4894 Signed-off-by: Yushuo <y-shuo@linux.alibaba.com>	2022-08-25 10:38:04 +08:00
Peng Tao	aa6bcacb7d	Merge pull request #4973 from bergwolf/github/go-depbot runtime: cri-o annotations have been moved to podman	2022-08-25 10:12:06 +08:00
Fabiano Fidêncio	dc90eae17b	qemu: Drop unnecessary `tdx_guest` kernel parameter With the current TDX kernel used with Kata Containers, `tdx_guest` is not needed, as TDX_GUEST is now a kernel configuration. With this in mind, let's just drop the kernel parameter. Fixes: #4981 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-08-24 20:02:43 +02:00
Fabiano Fidêncio	d4b67613f0	clh: Use HVC console with TDX As right now the TDX guest kernel doesn't support "serial" console, let's switch to using HVC in this case. Fixes: #4980 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-08-24 20:02:40 +02:00
Fabiano Fidêncio	c0cb3cd4d8	clh: Avoid crashing when memory hotplug is not allowed The runtime will crash when trying to resize memory when memory hotplug is not allowed. This happens because we cannot simply set the hotplug amount to zero, leading us to not set memory hotplug at all and then later to try to access the value of a nil pointer. Fixes: #4979 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-08-24 20:02:22 +02:00
Fabiano Fidêncio	9f0a57c0eb	clh: Increase API and SandboxStop timeouts for TDX While doing tests using `ctr`, I've noticed that I've been hitting those timeouts more frequently than expected. Till we find the root cause of the issue (which is not in the Kata Containers), let's increase the timeouts when dealing with a Confidential Guest. Fixes: #4978 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-08-24 20:02:12 +02:00
Manabu Sugimoto	b535bac9c3	runk: Add cli message for init command Add cli message for init command to tell the user not to run this command directly. Fixes: #4367 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2022-08-25 00:32:35 +09:00
Fabiano Fidêncio	c142fa2541	clh: Lift the sharedFS restriction used with TDX When booting the TDX kernel with `tdx_disable_filter`, as it's been done for QEMU, VirtioFS can work without any issues. Whether this will be part of the upstream kernel or not is a different story, but it easily could make it there as Cloud Hypervisor relies on the VIRTIO_F_IOMMU_PLATFORM feature, which forces the guest to use the DMA API, making these devices compatible with TDX. See Sebastien Boeuf's explanation of this in the 3c973fa7ce208e7113f69424b7574b83f584885d commit: """ By using DMA API, the guest triggers the TDX codepath to share some of the guest memory, in particular the virtqueues and associated buffers so that the VMM and vhost-user backends/processes can access this memory. """ Fixes: #4977 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-08-24 17:14:05 +02:00
Manabu Sugimoto	bdf8a57bdb	runk: Move delete logic to libcontainer Move delete logic to `libcontainer` crate to make the code clean like other commands. Fixes: #4975 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2022-08-24 19:12:36 +09:00
Peng Tao	a06d819b24	runtime: cri-o annotations have been moved to podman Let's switch to depending on podman, which also simplifies the indirect dependency on kubernetes components. And it helps to avoid cri-o security issues like CVE-2022-1708 as well. Fixes: #4972 Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2022-08-24 18:11:37 +08:00
Peng Tao	ffd1c1ff4f	agent-ctl/trace-forwarder: udpate thread_local dependency To bring in fix to CWE-362. Fixes: #4968 Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2022-08-24 17:10:49 +08:00
Peng Tao	69080d76da	agent/runk: update regex dependency To bring in fix to CVE-2022-24713. Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2022-08-24 17:02:15 +08:00
Peng Tao	e0ec09039d	runtime-rs: update async-std dependency So that we bump several indirect dependencies like crossbeam-channel, crossbeam-utils to bring in fixes to known security issues like CVE-2020-15254. Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2022-08-24 16:56:29 +08:00
Bin Liu	2b5dc2ad39	Merge pull request #4705 from bergwolf/github/agent-ut-improve UT: test_load_kernel_module needs root	2022-08-24 16:22:55 +08:00
Bin Liu	6551d4f25a	Merge pull request #4051 from bergwolf/github/vmx-vm-factory enable vmx for vm factory	2022-08-24 16:22:37 +08:00
Bin Liu	ad91801240	Merge pull request #4870 from cyyzero/runk-cgroup runk: add pause/resume commands	2022-08-24 14:44:43 +08:00
Derek Lee	763ceeb7ba	logging: Replace nix::Error::EINVAL with more descriptive msgs Replaces instances of anyhow!(nix::Error::EINVAL) with other messages to make it easier to debug. Fixes #954 Signed-off-by: Derek Lee <derlee@redhat.com>	2022-08-23 13:44:46 -07:00
Chen Yiyang	a6fbaac1bd	runk: add pause/resume commands To make cgroup v1 and v2 both work well, I now use `cgroups::cgroup` in `Container` to manage the cgroup. `CgroupManager` in rustjail has some drawbacks. First, the methods in the Manager trait are not visible, so we would need to modify rustjail and make them public. Second, CgroupManager.cgroup is private too, and it can't be serialized, so we can't load/save it in the status file. One solution is to add a getter/setter in rustjail, then create the `cgroup` and set it when loading the status. In order to keep the modifications to rustjail to a minimum, I use `cgroups::cgroup` directly. It now works on cgroup v1 or v2, since cgroups-rs handles this. Fixes: #4364 #4821 Signed-off-by: Chen Yiyang <cyyzero@qq.com>	2022-08-22 23:11:50 +08:00
Fabiano Fidêncio	d797036b77	Merge pull request #4861 from ryansavino/upgrade-kernel-support-5.19 kernel: upgrade guest kernel support to 5.19	2022-08-22 14:57:00 +02:00
Bin Liu	8c8e97a495	Merge pull request #4772 from pmores/drop-in-cfg-files-support-rs Drop-in cfg files support in runtime-rs	2022-08-22 13:41:56 +08:00
Bin Liu	eb91ee45be	Merge pull request #4754 from liubin/fix/4749-rollback-when-creating-container-failed agent: do some rollback work in case do_create_container fails	2022-08-22 10:44:11 +08:00
Ryan Savino	8e201501ef	kernel: fix for set_kmem_limit error Fixes: #4390 Fix in cargo cgroups-rs crate - Updated crate version to 0.2.10 Signed-Off-By: Ryan Savino <ryan.savino@amd.com>	2022-08-19 13:08:14 -05:00
Fabiano Fidêncio	9806ce8615	Merge pull request #4937 from chenhengqi/fix-error-msg network: Fix error message for setting hardware address on TAP interface	2022-08-19 17:54:58 +02:00
Pavel Mores	57bd3f42d3	runtime-rs: plug drop-in decoding into config-loading code To plug drop-in support into existing config-loading code in a robust way, more specifically to create a single point where this needs to be handled, load_from_file() and load_raw_from_file() were refactored. Seeing as the original implementations of both functions were identical apart from adjust_config() calls in load_from_file(), load_from_file() was reimplemented in terms of load_raw_from_file(). Fixes #4771 Signed-off-by: Pavel Mores <pmores@redhat.com>	2022-08-19 11:01:29 +02:00
Pavel Mores	87b97b6994	runtime-rs: add filesystem-related part of drop-in handling The central function being added here is load() which takes a path to a base config file and uses it to load the base config file itself, find the corresponding drop-in directory (get_dropin_dir_path()), iterate through its contents (update_from_dropins()) and load each drop-in in turn and merge its contents with the base file (update_from_dropin()). Also added is a test of load() which mirrors the corresponding test in the golang runtime (TestLoadDropInConfiguration() in config_test.go). Signed-off-by: Pavel Mores <pmores@redhat.com>	2022-08-19 11:01:29 +02:00
Pavel Mores	cf785a1a23	runtime-rs: add core toml::Value tree merging This is the core functionality of merging config file fragments into the base config file. Our TOML parser crate doesn't seem to allow working at the level of TomlConfig instances the way BurntSushi, used in the Golang runtime, does, so we implement the required functionality at the level of toml::Value trees. Tests to verify basic requirements are included. Values set by a base config file and not touched by a subsequent drop-in should be preserved. Drop-in config file fragments should be able to change values set by the base config file and add settings not present in the base. Conversion of a merged tree into a mock TomlConfig-style structure is tested as well. Signed-off-by: Pavel Mores <pmores@redhat.com>	2022-08-19 11:01:29 +02:00
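The merge requirements listed in the commit above (untouched base values are preserved, drop-ins can override values and add new ones) can be illustrated with a minimal sketch. The `Value` enum and `merge` function below are hypothetical stand-ins written for this illustration; the actual commit operates on `toml::Value` trees and its code is not shown here.

```rust
use std::collections::BTreeMap;

// Hypothetical, simplified stand-in for a TOML value tree.
// The real implementation works on `toml::Value`.
#[derive(Debug, Clone, PartialEq)]
enum Value {
    Integer(i64),
    String(String),
    Table(BTreeMap<String, Value>),
}

// Merge `dropin` into `base`.
// Two tables are merged key by key (recursively); in every other case the
// drop-in value replaces whatever the base held.
fn merge(base: &mut Value, dropin: Value) {
    match (base, dropin) {
        (Value::Table(base_tbl), Value::Table(dropin_tbl)) => {
            for (key, dropin_val) in dropin_tbl {
                match base_tbl.get_mut(&key) {
                    // Key exists in the base: merge the two subtrees.
                    Some(base_val) => merge(base_val, dropin_val),
                    // Key is new: the drop-in adds a setting.
                    None => {
                        base_tbl.insert(key, dropin_val);
                    }
                }
            }
        }
        // Scalars, or a type mismatch: the drop-in wins.
        (base_slot, dropin_val) => *base_slot = dropin_val,
    }
}
```

Under these assumptions, keys that appear only in the base survive the merge, keys that appear in both take the drop-in's value, and keys that appear only in the drop-in are added.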
Fabiano Fidêncio	828383bc39	Merge pull request #4933 from likebreath/0816/prepare_clh_v26.0 Upgrade to Cloud Hypervisor v26.0	2022-08-18 18:36:53 +02:00
James O. D. Hunt	6d6edb0bb3	Merge pull request #4903 from cmaf/tracing-defer-rootSpan-end runtime: tracing: End root span at end of trace	2022-08-18 08:51:41 +01:00
Peng Tao	f508c2909a	runtime: constify splitIrqChipMachineOptions A simple cleanup. Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2022-08-18 10:09:20 +08:00
Peng Tao	2b0587db95	runtime: VMX is migratable in vm factory case We are not spinning up any L2 guests in vm factory, so the L1 guest migration is expected to work even with VMX. See https://www.linux-kvm.org/page/Nested_Guests Fixes: #4050 Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2022-08-18 10:08:43 +08:00
Peng Tao	fa09f0ec84	runtime: remove qemuPaths It is broken that it doesn't list QemuVirt machine type. In fact we don't need it at all. Just drop it. Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2022-08-18 10:06:10 +08:00
Peng Tao	326f1cc773	agent: enrich some error code path So that it is easier to find out why some function fails. Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2022-08-18 10:02:12 +08:00
Peng Tao	4f53e010b4	agent: skip test_load_kernel_module if non-root We need root privilege to load a real kernel module. Fixes: #4704 Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2022-08-18 10:02:12 +08:00
Bo Chen	3a597c2742	runtime: clh: Use the new 'payload' interface The new 'payload' interface now contains the 'kernel' and 'initramfs' config. Fixes: #4952 Signed-off-by: Bo Chen <chen.bo@intel.com>	2022-08-17 12:23:43 -07:00
Bo Chen	16baecc5b1	runtime: clh: Re-generate the client code This patch re-generates the client code for Cloud Hypervisor v26.0. Note: The client code of cloud-hypervisor's (CLH) OpenAPI is automatically generated by openapi-generator [1-2]. [1] https://github.com/OpenAPITools/openapi-generator [2] https://github.com/kata-containers/kata-containers/blob/main/src/runtime/virtcontainers/pkg/cloud-hypervisor/README.md Fixes: #4952 Signed-off-by: Bo Chen <chen.bo@intel.com>	2022-08-17 12:23:12 -07:00
wllenyj	c75970b816	dragonball: add more unit test for config manager Added more unit tests for config manager. Fixes: #4899 Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-08-17 23:46:26 +08:00
Hengqi Chen	8ff5c10ac4	network: Fix error message for setting hardware address on TAP interface Error out with the correct interface name and hardware address instead. Fixes: #4944 Signed-off-by: Hengqi Chen <chenhengqi@outlook.com>	2022-08-17 16:42:07 +08:00
Peng Tao	338c282950	dep: update nix dependency To fix CVE-2021-45707 that affects nix < 0.20.2. Fixes: #4929 Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2022-08-17 16:06:26 +08:00
James O. D. Hunt	82ad43f9bf	Merge pull request #4928 from liubin/fix/4925-share-test-utils-for-rust libs/test-utils: share test code by creating a new crate	2022-08-17 08:31:11 +01:00
Bin Liu	8cd1e50eb6	Merge pull request #4921 from liubin/fix/2920-delete-vergen runtime-rs: delete vergen dependency	2022-08-17 10:09:12 +08:00
Bin Liu	34746496b7	libs/test-utils: share test code by creating a new crate As more and more Rust code is introduced, the test utils originally in the agent should be made easy to share. Moving them into a new crate will make them easy to share between different crates. Fixes: #4925 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-08-17 00:12:44 +08:00
Bin Liu	eab7c8f28f	runtime-rs: delete vergen dependency vergen is a build dependency, but it is not being used. We are processing the version/commit hash with the make command, not with vergen. Fixes: #4920 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-08-16 15:31:24 +08:00
Bin Liu	828574d27c	Merge pull request #4893 from openanolis/runtime-rs-main Runtime-rs: support persist file	2022-08-16 14:42:22 +08:00
Bin Liu	830fb266e6	Merge pull request #4854 from openanolis/runtime-rs-delete runtime-rs: delete route model	2022-08-15 20:48:58 +08:00
Ji-Xinyou	ff7c78e0e8	runtime-rs: static resource mgmt default to false Static resource management should default to false. If it defaults to true, later sandbox update operations, e.g. resize, will not work. Fixes: #4742 Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2022-08-15 14:42:38 +08:00
Ji-Xinyou	00f3a6de12	runtime-rs: make static resource mgmt idiomatic Make the get value process (cpu and mem) more idiomatic. Fixes: #4742 Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2022-08-15 11:18:35 +08:00
Zhongtao Hu	4d7f3edbaf	runtime-rs: support the functionality of cleanup Cleanup sandbox resource Fixes: #4891 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-08-13 15:56:38 +08:00
Zhongtao Hu	5aa83754e5	runtime-rs: support save to persist file and restore Support the functionality of save and restore for sandbox state Fixes:#4891 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-08-13 15:44:13 +08:00
Chelsea Mafrica	fcc1e0c617	runtime: tracing: End root span at end of trace The root span should exist for the duration of the trace. Defer ending the span until the end of the trace instead of the end of the function. Add the span to the service struct to do so. Fixes #4902 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2022-08-12 13:15:39 -07:00
Zhongtao Hu	c280d6965b	runtime-rs: delete route model Delete the route model, as it is used for a specific internal scenario and is not a general requirement. Fixes:#4838 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-08-11 15:56:43 +08:00
Bin Liu	ca9d16e5ea	runtime-rs: update Cargo.lock Update Cargo.lock Fixes: #4875 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-08-11 10:34:36 +08:00
Ji-Xinyou	4a54876dde	runtime-rs: support static resource management functionality Supports functionalities of static resource management, enabled by default. Fixes: #4742 Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2022-08-11 09:46:44 +08:00
Bin Liu	cb7f9524be	Merge pull request #4804 from openanolis/anolis/merge_runtime_rs_to_main runtime-rs:merge runtime rs to main	2022-08-11 08:40:41 +08:00
Tim Zhang	4813a3cef9	Merge pull request #4711 from liubin/fix/4710-wait-nydusd-api-server-ready nydus: wait nydusd API server ready before mounting share fs	2022-08-10 17:20:17 +08:00
Fabiano Fidêncio	065305f4a1	agent-ctl: Add an empty [workspace] "An empty [workspace] can be used with a package to conveniently create a workspace with the package and all of its path dependencies", according to https://doc.rust-lang.org/cargo/reference/workspaces.html This also matches the suggestion provided by Cargo itself, due to the errors faced with the Cloud Hypervisor CI: ``` 10:46:23 this may be fixable by adding `go/src/github.com/kata-containers/kata-containers/src/tools/agent-ctl` to the `workspace.members` array of the manifest located at: /tmp/jenkins/workspace/kata-containers-2-clh-PR/Cargo.toml 10:46:23 Alternatively, to keep it out of the workspace, add the package to the `workspace.exclude` array, or add an empty `[workspace]` table to the package's manifest. ``` Fixes: #4843 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-08-08 11:24:39 +02:00
liubin	2ae807fd29	nydus: wait nydusd API server ready before mounting share fs If the API server is not ready, the mount call will fail, so before mounting share fs, we should wait until nydusd is started and the API server is ready. Fixes: #4710 Signed-off-by: liubin <liubin0329@gmail.com> Signed-off-by: Bin Liu <bin@hyper.sh>	2022-08-08 16:18:38 +08:00
Tim Zhang	8d4d98587f	Merge pull request #4746 from liubin/fix/4745-add-log-field runtime: explicitly mark the source of the log is from qemu.log	2022-08-08 15:21:01 +08:00
Bin Liu	9516286f6d	Merge pull request #4829 from LetFu/fix/addUnlock runtime: add unlock before return in sendReq	2022-08-08 14:42:44 +08:00
Archana Shinde	c1e3b8f40f	govmm: Refactor qmp functions for adding block device Instead of passing a bunch of arguments to qmp functions for adding block devices, use govmm BlockDevice structure to reduce these. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2022-08-05 13:16:34 -07:00
Archana Shinde	598884f374	govmm: Refactor code to get rid of redundant code Get rid of redundant return values from function. args and blockdevArgs used to return different values to maintain compatibility between qemu versions. These are exactly the same now. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2022-08-05 13:16:34 -07:00
Archana Shinde	00860a7e43	qmp: Pass aio backend while adding block device Allow govmm to pass aio backend while adding block device. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2022-08-05 13:16:34 -07:00
Archana Shinde	e1b49d7586	config: Add block aio as a supported annotation Allow Block AIO to be passed as a per pod annotation. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2022-08-05 13:16:34 -07:00
Archana Shinde	ed0f1d0b32	config: Add "block_device_aio" as a config option for qemu This configuration will allow users to choose between different I/O backends for qemu, with the default being io_uring. This will allow users to fall back to a different I/O mechanism while running on kernels older than 5.1. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2022-08-05 13:16:34 -07:00
Fabiano Fidêncio	e2968b177d	Merge pull request #4763 from cyyzero/runk-ps runk: add ps sub-command	2022-08-05 16:28:38 +02:00
chmod100	d8ad16a34e	runtime: add unlock before return in sendReq An unlock is required before returning, so add the missing unlock. Fixes: #4827 Signed-off-by: chmod100 <letfu@outlook.com>	2022-08-05 13:30:12 +00:00
Peng Tao	b828190158	Merge pull request #4823 from openanolis/runtime-rs-merge-main-runtime-rs Depends-on:github.com/kata-containers/tests#4986 Runtime-rs:merge main runtime rs	2022-08-05 14:42:22 +08:00
Zhongtao Hu	8bbffc42cf	runtime-rs:update rtnetlink version update rtnetlink version for runtime-rs Fixes:#4824 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-08-05 11:18:09 +08:00
Zhongtao Hu	e403838131	runtime-rs: Merge remote-tracking branch 'origin/main' into runtime-rs To keep runtime-rs up to date, we will merge main into runtime-rs every week. Fixes:kata-containers#4822 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-08-05 10:49:33 +08:00
GabyCT	2764bd7522	Merge pull request #4770 from justxuewei/refactor/agent/netlink-neighbor agent: Use rtnetlink's neighbours API to add neighbors	2022-08-04 12:09:30 -05:00
Zhongtao Hu	389ae97020	runtime-rs:skip the test when the arch is s390x github.com/kata-containers/tests#4986. To avoid returning an error when running the CI, we just skip the test if the arch is s390x Fixes: #4816 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-08-04 21:13:50 +08:00
Zhongtao Hu	945e02227c	runtime-rs:skip the build process when the arch is s390x github.com/kata-containers/tests#4986. To avoid returning an error when running the CI, we just skip the build process if the arch is s390x Fixes: #4816 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-08-04 21:13:40 +08:00
Archana Shinde	b6cd2348f5	govmm: Add io_uring as AIO type io_uring was introduced as a new kernel IO interface in kernel 5.1. It is designed for higher performance than the older Linux AIO API. This feature was added in qemu 5.0. Fixes #4645 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2022-08-03 10:43:12 -07:00
Archana Shinde	81cdaf0771	govmm: Correct documentation for Linux aio. The comments for "native" aio are incorrect. Correct these. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2022-08-03 10:41:50 -07:00
Ji-Xinyou	a355812e05	runtime-rs: fixed bug on core-sched error handling Kernel code returns -errno, this should check negative values. Fixes: #4429 Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2022-08-03 15:26:48 +08:00
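The point of the core-sched fix above, that a failure is signalled by a negative errno rather than by a fixed sentinel, can be sketched as follows. `check_ret` is a hypothetical helper written for this illustration and is not taken from the runtime-rs code.

```rust
use std::io;

// Hypothetical helper: interpret a raw return value that encodes failure
// as `-errno`. Any negative value is an error; zero or positive is success.
fn check_ret(ret: i32) -> io::Result<i32> {
    if ret < 0 {
        // Flip the sign to recover the positive errno value.
        Err(io::Error::from_raw_os_error(ret.saturating_neg()))
    } else {
        Ok(ret)
    }
}
```

For example, a return value of `-22` would surface as an `io::Error` whose raw OS error is `22` (`EINVAL` on Linux), while `0` passes through as success.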
Bin Liu	8b0e1859cb	Merge pull request #4784 from openanolis/fix-protocol-ci-err libs: fix CI error for protocols	2022-08-03 11:03:02 +08:00
Chen Yiyang	230a229052	runk: add ps sub-command The ps command supports two formats, `json` and `table`. The `json` format just outputs the pids in the container. The `table` format uses the `ps` utility on the host to search for and output all processes in the container. Add a struct `container` to represent a spawned container. Move the `kill` implementation from kill.rs to a method of `container`. Fixes: #4361 Signed-off-by: Chen Yiyang <cyyzero@qq.com>	2022-08-02 20:45:50 +08:00
Ji-Xinyou	591dfa4fe6	runtime-rs: add support for core scheduling Linux 5.14 supports core scheduling to have better security control for SMT siblings. This PR supports that. Fixes: #4429 Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2022-08-02 17:54:04 +08:00
Zhongtao Hu	7247575fa2	runtime-rs:fix cargo clippy fix cargo clippy Fixes: #4791 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-08-02 13:17:37 +08:00
Zhongtao Hu	9803393f2f	runtime-rs: Merge branch 'main' into runtime-rs-merge-main-1 To keep runtime-rs up to date, we will merge main into runtime-rs every week. Fixes: #4790 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-08-02 10:53:01 +08:00
Quanwei Zhou	86ac653ba7	libs: fix CI error for protocols Fix CI error for protocols. Fixes: #4781 Signed-off-by: Quanwei Zhou <quanweiZhou@linux.alibaba.com>	2022-08-01 16:26:52 +08:00
Xuewei Niu	81fe51ab0b	agent: fix unittests for arp neighbors Set an ARP address explicitly before netlink::test_add_one_arp_neighbor() running. Signed-off-by: Xuewei Niu <justxuewei@apache.org>	2022-08-01 16:19:25 +08:00
Xuewei Niu	845c1c03cf	agent: use rtnetlink's neighbours API to add neighbors Bump rtnetlink version from 0.8.0 to 0.11.0. Use rtnetlink's API to add neighbors and fix issues to adapt to the new version of rtnetlink. Fixes: #4607 Signed-off-by: Xuewei Niu <justxuewei@apache.org>	2022-08-01 13:44:07 +08:00
Zhongtao Hu	adfad44efe	Merge remote-tracking branch 'origin/main' into runtime-rs-merge-tmp To keep runtime-rs up to date, we will merge main into runtime-rs every week. Fixes:#4776 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-08-01 11:12:48 +08:00
Ryan Savino	9b1940e93e	versions: update rust version Fixes #4764 versions: update rust version to fix ccv0 attestation-agent build error static-checks: kata tools, libs, and agent fixes Signed-Off-By: Ryan Savino <ryan.savino@amd.com>	2022-07-29 18:41:43 -05:00
Peng Tao	0aefab4d80	Merge pull request #4739 from liubin/fix/4738-trace-rpc-calls agent: log RPC calls for debugging	2022-07-29 14:18:23 +08:00
Peng Tao	5457deb034	Merge pull request #4741 from openanolis/fix-stop-failed-in-azure runtime-rs: fix stop failed in azure	2022-07-29 11:41:16 +08:00
Quanwei Zhou	fa0b11fc52	runtime-rs: fix stdin hang in azure Fix stdin hang in azure. Fixes: #4740 Signed-off-by: Quanwei Zhou <quanweiZhou@linux.alibaba.com>	2022-07-28 16:16:37 +08:00
yaoyinnan	5c3155f7e2	runtime: Support for host cgroup v2 Support cgroup v2 on the host. Update vendor containerd/cgroups to add cgroup v2. Fixes: #3073 Signed-off-by: yaoyinnan <yaoyinnan@foxmail.com>	2022-07-28 10:30:45 +08:00
Bin Liu	50b0b7cc15	Merge pull request #4681 from Tim-0731-Hzt/runtime-rs-sharepid runtime-rs: fix set share sandbox pid namespace	2022-07-27 21:43:58 +08:00
Bin Liu	557229c39d	Merge pull request #4724 from yahaa/fix-docs Docs: fix tables format error	2022-07-27 21:13:29 +08:00
Bin Liu	09672eb2da	agent: do some rollback work in case do_create_container fails In some cases do_create_container may return an error, mostly due to the `container.start(process)` call. This commit does some rollback work if this function fails. Fixes: #4749 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-07-27 10:23:46 +08:00
Archana Shinde	1b01ea53d9	Merge pull request #4735 from nubificus/feature-fc-v1.1 versions: Update Firecracker version to v1.1.0	2022-07-27 04:50:32 +05:30
Peng Tao	27c82018d1	Merge pull request #4753 from Tim-Zhang/agent-fix-stream-fd-double-close agent: Fix stream fd's double close	2022-07-27 00:54:07 +08:00
Bin Liu	6fddf031df	Merge pull request #4664 from lifupan/main container: kill all of the processes in a container when it terminated	2022-07-26 23:12:11 +08:00
Tim Zhang	f5aa6ae467	agent: Fix stream fd's double close problem The fd would be closed when Pipestream is dropped, and we should not close it again. Fixes: #4752 Signed-off-by: Tim Zhang <tim@hyper.sh>	2022-07-26 20:05:06 +08:00
yahaa	6e149b43f7	Docs: fix tables format error Fixes: #4725 Signed-off-by: yahaa <1477765176@qq.com>	2022-07-26 19:05:09 +08:00
Bin Liu	85f4e7caf6	runtime: explicitly mark the source of the log is from qemu.log In qemu.StopVM(), if debug is enabled, the shim will dump logs from qemu.log, but users don't know which logs are from qemu.log and shim itself. Adding some additional messages will help users to distinguish these logs. Fixes: #4745 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-07-26 16:08:59 +08:00
Peng Tao	129335714b	Merge pull request #4727 from openanolis/anolis-fix-network fix network failed for kata ci	2022-07-26 15:10:55 +08:00
Peng Tao	71384b60f3	Merge pull request #4713 from openanolis/adjust_default_vcpu runtime-rs: handle default_vcpus greater than default_maxvcpu	2022-07-26 15:02:34 +08:00
gntouts	56d49b5073	versions: Update Firecracker version to v1.1.0 This patch upgrades Firecracker version from v0.23.4 to v1.1.0 * Generate swagger models for v1.1.0 (from firecracker.yaml) * Replace ht_enabled param to smt (API change) * Remove NUMA-related jailer param --node 0 Fixes: #4673 Depends-on: github.com/kata-containers/tests#4968 Signed-off-by: George Ntoutsos <gntouts@nubificus.co.uk> Signed-off-by: Anastassios Nanos <ananos@nubificus.co.uk>	2022-07-26 07:01:26 +00:00
Zhongtao Hu	b3147411e3	runtime-rs:add unit test for set share pid ns Fixes:#4680 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-07-26 14:42:00 +08:00
Zhongtao Hu	1ef3f8eac6	runtime-rs: set share sandbox pid namespace Set the share sandbox pid namespace from spec Fixes:#4680 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-07-26 14:41:59 +08:00
Quanwei Zhou	57c556a801	runtime-rs: fix stop failed in azure Fix the stop failed in azure. Fixes: #4740 Signed-off-by: Quanwei Zhou <quanweiZhou@linux.alibaba.com>	2022-07-26 12:16:32 +08:00
liubin	0e24f47a43	agent: log RPC calls for debugging We can log all RPC calls to the agent for debugging purposes to check which RPC is called, which can help us to understand the container lifespan. Fixes: #4738 Signed-off-by: liubin <liubin0329@gmail.com>	2022-07-26 10:32:44 +08:00
Tim Zhang	e764a726ab	Merge pull request #4715 from Tim-Zhang/fix-ut-test_do_write_stream agent: fix fd-double-close problem in ut test_do_write_stream	2022-07-25 17:34:26 +08:00
Peng Tao	3f4dd92c2d	Merge pull request #4702 from openanolis/runtime-rs-endpoint-dev runtime-rs: add functionalities support for macvlan and vlan endpoints	2022-07-25 17:04:45 +08:00
Tim Zhang	427b29454a	Merge pull request #4709 from liubin/fix/4708-unwrap-error rustjail: check result to let it return early	2022-07-25 15:05:20 +08:00
Quanwei Zhou	c825065b27	runtime-rs: fix tc filter setup failed Fix a bug when using the tc filter: the protocol needs to be in network byte order. Fixes: #4726 Signed-off-by: Quanwei Zhou <quanweiZhou@linux.alibaba.com>	2022-07-25 11:16:33 +08:00
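As a small illustration of the byte-order requirement in the tc filter fix above, a 16-bit protocol number can be converted to network (big-endian) order before being handed to the kernel. The `htons` helper below is hypothetical and mirrors the C function of the same name. Using `ETH_P_ALL` as the protocol is an assumption made for this example, since the commit message does not say which protocol value is involved.

```rust
// ETH_P_ALL as defined in <linux/if_ether.h>. Picking this particular
// protocol is an assumption made for the example.
const ETH_P_ALL: u16 = 0x0003;

// Hypothetical helper: convert a 16-bit value from host byte order to
// network (big-endian) byte order, like C's htons().
fn htons(host: u16) -> u16 {
    host.to_be()
}
```

Whatever the host endianness, the in-memory bytes of `htons(ETH_P_ALL)` are `00 03`, which is the layout expected for a value in network byte order.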
Quanwei Zhou	e0194dcb5e	runtime-rs: update route destination with prefix Update route destination with prefix. Fixes: #4726 Signed-off-by: Quanwei Zhou <quanweiZhou@linux.alibaba.com>	2022-07-25 11:16:22 +08:00
Wainer Moschetta	0b4a91ec1a	Merge pull request #4644 from bookinabox/optimize-get-paths cgroups: remove unnecessary get_paths()	2022-07-22 17:01:01 -03:00
Ji-Xinyou	896478c92b	runtime-rs: add functionalities support for macvlan and vlan endpoints Add macvlan and vlan support to runtime-rs code and corresponding unit tests. Fixes: #4701 Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2022-07-22 10:09:11 +08:00
Tim Zhang	912641509e	agent: fix fd-double-close problem in ut test_do_write_stream The fd will be closed when struct Process is dropped, so don't close it again manually. Fixes: #4598 Signed-off-by: Tim Zhang <tim@hyper.sh>	2022-07-21 19:37:15 +08:00
Zhongtao Hu	43045be8d1	runtime-rs: handle default_vcpus greater than default_maxvcpu When default_vcpus is greater than default_maxvcpus, the default vcpu number should be set equal to default_maxvcpus. Fixes: #4712 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-07-21 16:37:56 +08:00
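The rule stated in the commit above amounts to clamping one value by the other. A minimal sketch follows, using a hypothetical function name and `u32` as an assumed field type; the real configuration code is not shown here.

```rust
// Hypothetical helper: cap the default vCPU count at the configured maximum.
// The u32 type is an assumption made for the example.
fn adjust_default_vcpus(default_vcpus: u32, default_maxvcpus: u32) -> u32 {
    default_vcpus.min(default_maxvcpus)
}
```

With this rule, a configuration asking for 8 default vCPUs with a maximum of 4 ends up with 4, while a request within the limit is left unchanged.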
liubin	0d7cb7eb16	agent: delete agent-type property in announce Since there is only one type of agent now, the agent-type is not needed anymore. Signed-off-by: liubin <liubin0329@gmail.com>	2022-07-21 14:53:01 +08:00
liubin	eec9ac81ef	rustjail: check result to let it return early. check the result to let it return early if there are some errors Fixes: #4708 Signed-off-by: liubin <liubin0329@gmail.com>	2022-07-21 14:51:30 +08:00
Quanwei Zhou	54f53d57ef	runtime-rs: support disable_guest_seccomp support disable_guest_seccomp Fixes: #4691 Signed-off-by: Quanwei Zhou <quanweiZhou@linux.alibaba.com>	2022-07-21 07:46:28 +08:00
Bin Liu	540303880e	Merge pull request #4688 from quanweiZhou/fix_sandbox_cgroup_false runtime-rs: fix sandbox_cgroup_only=false panic	2022-07-19 20:38:57 +08:00
Peng Tao	7c146a5d95	Merge pull request #4684 from quanweiZhou/fix-ctr-exit-error runtime-rs: fix ctr exit failed	2022-07-19 16:02:20 +08:00
Peng Tao	4c3bd6b1d1	Merge pull request #4656 from openanolis/runtime-rs-ipvlan runtime-rs: support functionalities of ipvlan endpoint	2022-07-19 11:15:31 +08:00
Bin Liu	960f2a7f70	Merge pull request #4678 from Tim-0731-Hzt/runtime-rs-makefile-2 runtime-rs: remove the value of hypervisor path in DB config	2022-07-19 09:34:45 +08:00
Quanwei Zhou	e9988f0c68	runtime-rs: fix sandbox_cgroup_only=false panic When running with the configuration `sandbox_cgroup_only=false`, we call `gen_overhead_path()` to get the overhead path. `cgroups-rs` appends the path to the subsystem prefix with `PathBuf::push()`. When the pushed path has the prefix "/", it is treated as a root path and replaces the existing one, such as ``` let mut path = PathBuf::from("/tmp"); path.push("/etc"); assert_eq!(path, PathBuf::from("/etc")); ``` So we should not set the overhead path with the prefix "/". Fixes: #4687 Signed-off-by: Quanwei Zhou <quanweiZhou@linux.alibaba.com>	2022-07-19 08:30:34 +08:00
Quanwei Zhou	cebbebbe8a	runtime-rs: fix ctr exit failed During use, there will be cases where a container that is already in the stopped state gets another stop. In this case, the second stop needs to be ignored. Fixes: #4683 Signed-off-by: Quanwei Zhou <quanweiZhou@linux.alibaba.com>	2022-07-19 07:43:22 +08:00
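One way to avoid the `PathBuf::push()` pitfall described in the commit above is to make sure the sub-path is relative before joining it. The helper below is a hypothetical sketch of that idea, with made-up example paths in the usage note; the actual fix in runtime-rs changes how the overhead path is generated and may look different.

```rust
use std::path::{Path, PathBuf};

// Hypothetical helper: join `sub` under `root` even if `sub` was written
// with a leading "/". Stripping the leading slashes keeps `join` from
// treating `sub` as an absolute path that replaces `root`.
fn join_under(root: &Path, sub: &str) -> PathBuf {
    root.join(sub.trim_start_matches('/'))
}
```

For instance, joining the made-up sub-path `/kata_overhead/abc` under `/sys/fs/cgroup/cpu` yields `/sys/fs/cgroup/cpu/kata_overhead/abc`, whereas a plain `Path::join` with the absolute sub-path would return the sub-path alone.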
Bin Liu	758cc47b32	Merge pull request #4671 from liubin/4670-upgrade-nix kata-sys-util: upgrade nix version	2022-07-18 23:31:07 +08:00
Ji-Xinyou	62182db645	runtime-rs: add unit test for ipvlan endpoint Add unit test to check the integrity of IPVlanEndpoint::new(...) Fixes: #4655 Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2022-07-18 15:56:06 +08:00
xuejun-xj	99654ce694	runtime-rs: update dbs-xxx dependencies Update dbs-xxx commit ID for aarch64 in runtime-rs/Cargo.toml file to add dependencies for aarch64. Fixes: #4676 Signed-off-by: xuejun-xj <jiyunxue@alibaba.linux.com>	2022-07-18 13:46:46 +08:00
xuejun-xj	f4c3adf596	runtime-rs: Add compile option file Add file aarch64-options.mk for compiling on aarch64 architectures. Fixes: #4676 Signed-off-by: xuejun-xj <jiyunxue@alibaba.linux.com>	2022-07-18 13:46:46 +08:00
xuejun-xj	545ae3f0ee	runtime-rs: fix warning Module anyhow::anyhow is only used on x86_64 architecture in crates/hypervisor/src/device/vfio.rs file. Fixes: #4676 Signed-off-by: xuejun-xj <jiyunxue@alibaba.linux.com>	2022-07-18 13:46:39 +08:00
Zhongtao Hu	19eca71cd9	runtime-rs: remove the value of hypervisor path in DB config As a built-in VMM, Dragonball does not need the path, jailer path, or ctlpath, so we don't generate those values in the Makefile. Fixes: #4677 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-07-18 13:37:51 +08:00
Ji-Xinyou	d8920b00cd	runtime-rs: support functionalities of ipvlan endpoint Add support for ipvlan endpoint Fixes: #4655 Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2022-07-18 11:34:03 +08:00
xuejun-xj	2b01e9ba40	dragonball: fix warning Add map_err for vcpu_manager.set_reset_event_fd() function. Fixes: #4676 Signed-off-by: xuejun-xj <jiyunxue@alibaba.linux.com>	2022-07-18 09:52:13 +08:00
liubin	996a6b80bc	kata-sys-util: upgrade nix version New nix is supporting UMOUNT_NOFOLLOW, upgrade nix version to use this flag instead of the self-defined flag. Fixes: #4670 Signed-off-by: liubin <liubin0329@gmail.com>	2022-07-15 17:38:15 +08:00
Fupan Li	d93e4b939d	container: kill all of the processes in this container When a container terminates, we should make sure there are no processes left after destroying the container. Before this commit, kata-agent depended on the kernel's pidns to destroy all of the processes in a container after process 1 of the container exited. This holds for containers using a separate pidns, but in the case of a pidns shared within the sandbox, the container's exit would not terminate the pidns, and some daemon processes could be left behind in the container, which was not expected. Fixes: #4663 Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>	2022-07-14 16:39:49 +08:00
Bin Liu	575b5eb5f5	Merge pull request #4506 from cyyzero/runk-exec runk: Support `exec` sub-command	2022-07-14 14:22:24 +08:00
Quanwei Zhou	3c989521b1	dragonball: update for review update for review Fixes: #3785 Signed-off-by: Quanwei Zhou <quanweiZhou@linux.alibaba.com>	2022-07-14 10:43:59 +08:00
wllenyj	274598ae56	kata-runtime: add dragonball config check support. add dragonball config check support. Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-07-14 10:43:50 +08:00
Chao Wu	1befbe6738	runtime-rs: Cargo lock for fix version problem Cargo lock for fix version problem Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-07-14 08:49:39 +08:00
Quanwei Zhou	3d6156f6ec	runtime-rs: support dragonball and runtime-binary Fixes: #3785 Signed-off-by: Quanwei Zhou <quanweiZhou@linux.alibaba.com> Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-07-14 08:49:30 +08:00
Zhongtao Hu	3f6123b4dd	libs: update configuration and annotations 1. support annotation for runtime.name, hypervisor_name, agent_name. 2. fix parse memory from annotation Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-07-14 08:49:17 +08:00
Derek Lee	9ae2a45b38	cgroups: remove unnecessary get_paths() Change get_mounts to get paths from a borrowed argument rather than calling get_paths a second time. Fixes #3768 Signed-off-by: Derek Lee <derlee@redhat.com>	2022-07-13 09:17:14 -07:00
Fabiano Fidêncio	be31207f6e	clh: Don't crash if no network device is set by the upper layer `ctr` doesn't set a network device when creating the sandbox, which leads to Cloud Hypervisor's driver crashing, see the log below: ``` panic: runtime error: invalid memory address or nil pointer dereference [signal SIGSEGV: segmentation violation code=0x1 addr=0x8 pc=0x55641c23b248] goroutine 32 [running]: github.com/kata-containers/kata-containers/src/runtime/virtcontainers.glob..func1(0xc000397900) /home/ubuntu/go/src/github.com/kata-containers/kata-containers/src/runtime/virtcontainers/clh.go:163 +0x128 github.com/kata-containers/kata-containers/src/runtime/virtcontainers.(cloudHypervisor).vmAddNetPut(...) /home/ubuntu/go/src/github.com/kata-containers/kata-containers/src/runtime/virtcontainers/clh.go:1348 github.com/kata-containers/kata-containers/src/runtime/virtcontainers.(cloudHypervisor).bootVM(0xc000397900, {0x55641c76dfc0, 0xc000454ae0}) /home/ubuntu/go/src/github.com/kata-containers/kata-containers/src/runtime/virtcontainers/clh.go:1378 +0x5a2 github.com/kata-containers/kata-containers/src/runtime/virtcontainers.(cloudHypervisor).StartVM(0xc000397900, {0x55641c76dff8, 0xc00044c240}, 0x55641b8016fd) /home/ubuntu/go/src/github.com/kata-containers/kata-containers/src/runtime/virtcontainers/clh.go:659 +0x7ee github.com/kata-containers/kata-containers/src/runtime/virtcontainers.(Sandbox).startVM.func2() /home/ubuntu/go/src/github.com/kata-containers/kata-containers/src/runtime/virtcontainers/sandbox.go:1219 +0x190 github.com/kata-containers/kata-containers/src/runtime/virtcontainers.(LinuxNetwork).Run.func1({0xc0004a8910, 0x3b}) /home/ubuntu/go/src/github.com/kata-containers/kata-containers/src/runtime/virtcontainers/network_linux.go:319 +0x1b github.com/kata-containers/kata-containers/src/runtime/virtcontainers.doNetNS({0xc000048440, 0xc00044c240}, 0xc0005d5b38) 
/home/ubuntu/go/src/github.com/kata-containers/kata-containers/src/runtime/virtcontainers/network_linux.go:1045 +0x163 github.com/kata-containers/kata-containers/src/runtime/virtcontainers.(LinuxNetwork).Run(0xc000150c80, {0x55641c76dff8, 0xc00044c240}, 0xc00014e4e0) /home/ubuntu/go/src/github.com/kata-containers/kata-containers/src/runtime/virtcontainers/network_linux.go:318 +0x105 github.com/kata-containers/kata-containers/src/runtime/virtcontainers.(Sandbox).startVM(0xc000107d40, {0x55641c76dff8, 0xc0005529f0}) /home/ubuntu/go/src/github.com/kata-containers/kata-containers/src/runtime/virtcontainers/sandbox.go:1205 +0x65f github.com/kata-containers/kata-containers/src/runtime/virtcontainers.createSandboxFromConfig({_, _}, {{0x0, 0x0, 0x0}, {0xc000385a00, 0x1, 0x1}, {0x55641d033260, 0x0, ...}, ...}, ...) /home/ubuntu/go/src/github.com/kata-containers/kata-containers/src/runtime/virtcontainers/api.go:91 +0x346 github.com/kata-containers/kata-containers/src/runtime/virtcontainers.CreateSandbox({_, _}, {{0x0, 0x0, 0x0}, {0xc000385a00, 0x1, 0x1}, {0x55641d033260, 0x0, ...}, ...}, ...) /home/ubuntu/go/src/github.com/kata-containers/kata-containers/src/runtime/virtcontainers/api.go:51 +0x150 github.com/kata-containers/kata-containers/src/runtime/virtcontainers.(VCImpl).CreateSandbox(_, {_, _}, {{0x0, 0x0, 0x0}, {0xc000385a00, 0x1, 0x1}, {0x55641d033260, ...}, ...}) /home/ubuntu/go/src/github.com/kata-containers/kata-containers/src/runtime/virtcontainers/implementation.go:35 +0x74 github.com/kata-containers/kata-containers/src/runtime/pkg/katautils.CreateSandbox({_, _}, {_, _}, {{0xc0004806c0, 0x9}, 0xc000140110, 0xc00000f7a0, {0x0, 0x0}, ...}, ...) 
/home/ubuntu/go/src/github.com/kata-containers/kata-containers/src/runtime/pkg/katautils/create.go:175 +0x8b6 github.com/kata-containers/kata-containers/src/runtime/pkg/containerd-shim-v2.create({0x55641c76dff8, 0xc0004129f0}, 0xc00034a000, 0xc00036a000) /home/ubuntu/go/src/github.com/kata-containers/kata-containers/src/runtime/pkg/containerd-shim-v2/create.go:147 +0xdea github.com/kata-containers/kata-containers/src/runtime/pkg/containerd-shim-v2.(service).Create.func2() /home/ubuntu/go/src/github.com/kata-containers/kata-containers/src/runtime/pkg/containerd-shim-v2/service.go:401 +0x32 created by github.com/kata-containers/kata-containers/src/runtime/pkg/containerd-shim-v2.(service).Create /home/ubuntu/go/src/github.com/kata-containers/kata-containers/src/runtime/pkg/containerd-shim-v2/service.go:400 +0x534 ``` This bug has been introduced as part of the https://github.com/kata-containers/kata-containers/pull/4312 PR, which changed how we add the network device. In order to avoid the crash, let's simply check whether we have a device to be added before iterating the list of network devices. Fixes: #4618 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-07-13 10:40:21 +02:00
Fabiano Fidêncio	dc3b6f6592	versions: Update Cloud Hypervisor to v25.0 Cloud Hypervisor v25.0 has been released on July 7th, 2022, and brings the following changes: ch-remote Improvements The ch-remote command has gained support for creating the VM from a JSON config and support for booting and deleting the VM from the VMM. VM "Coredump" Support Under the guest_debug feature flag it is now possible to extract the memory of the guest for use in debugging with e.g. the crash utility. (https://github.com/cloud-hypervisor/cloud-hypervisor/issues/4012) Notable Bug Fixes * Always restore console mode on exit (https://github.com/cloud-hypervisor/cloud-hypervisor/issues/4249, https://github.com/cloud-hypervisor/cloud-hypervisor/issues/4248) * Restore vCPUs in numerical order which fixes aarch64 snapshot/restore (https://github.com/cloud-hypervisor/cloud-hypervisor/issues/4244) * Don't try and configure IFF_RUNNING on TAP devices (https://github.com/cloud-hypervisor/cloud-hypervisor/issues/4279) * Propagate configured queue size through to vhost-user backend (https://github.com/cloud-hypervisor/cloud-hypervisor/issues/4286) * Always Program vCPU CPUID before running the vCPU to fix running on Linux 5.16 (https://github.com/cloud-hypervisor/cloud-hypervisor/issues/4156) * Enable ACPI MADT "Online Capable" flag for hotpluggable vCPUs to fix newer Linux guest Removals The following functionality has been removed: * The mergeable option from the virtio-pmem support has been removed (https://github.com/cloud-hypervisor/cloud-hypervisor/issues/3968) * The dax option from the virtio-fs support has been removed (https://github.com/cloud-hypervisor/cloud-hypervisor/issues/3889) Fixes: #4641 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-07-12 14:47:58 +00:00
Bin Liu	f3335c99ce	Merge pull request #4614 from Tim-0731-Hzt/runtime-rs-merge-main Runtime-rs merge main	2022-07-12 19:25:11 +08:00
xuejun-xj	d2584991eb	dragonball: fix dependency unused warning Fix the warning "unused import: `dbs_arch::gic::Error as GICError`" and "unused import: `dbs_arch::gic::GICDevice`" in file src/vm/mod.rs when compiling. Fixes: #4544 Signed-off-by: xuejun-xj <jiyunxue@alibaba.linux.com> Signed-off-by: jingshan <jingshan@linux.alibaba.com>	2022-07-11 17:55:04 +08:00
xuejun-xj	458f6f42f6	dragonball: use const string for legacy device type As string "com1", "com2" and "rtc" are used in two files (device_manager/mod.rs and device_manager/legacy.rs), we use public const variables COM1, COM2 and RTC to replace them respectively. Fixes: #4544 Signed-off-by: xuejun-xj <jiyunxue@alibaba.linux.com> Signed-off-by: jingshan <jingshan@linux.alibaba.com>	2022-07-11 17:46:10 +08:00
Zhongtao Hu	0826a2157d	Merge remote-tracking branch 'origin/main' into runtime-rs-1 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-07-11 09:47:23 +08:00
xuejun-xj	f6f96b8fee	dragonball: add legacy device support for aarch64 Implement RTC device for aarch64. Fixes: #4544 Signed-off-by: xuejun-xj <jiyunxue@alibaba.linux.com> Signed-off-by: jingshan <jingshan@linux.alibaba.com>	2022-07-10 17:35:30 +08:00
xuejun-xj	7a4183980e	dragonball: add device info support for aarch64 Implement generate_virtio_device_info() and get_virtio_mmio_device_info() functions to support the mmio_device_info member, which is used by FDT. Fixes: #4544 Signed-off-by: xuejun-xj <jiyunxue@linux.alibaba.com> Signed-off-by: jingshan <jingshan@linux.alibaba.com>	2022-07-10 17:09:59 +08:00
Chao Wu	9cee52153b	fmt: do cargo fmt and add a dependency for blk_dev fmt: do cargo fmt and add a dependency for blk_dev Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-07-07 10:32:35 +08:00
Chao Wu	47a4142e0d	fs: change vhostuser and virtio into const change fs mode vhostuser and virtio into const. Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-07-07 10:32:35 +08:00
Chao Wu	e14e98bbeb	cpu_topo: add handle_cpu_topology function Add the handle_cpu_topology function to make the set_vm_configuration function easier to understand. Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-07-07 10:32:35 +08:00
Chao Wu	5d3b53ee7b	downtime: add downtime support add downtime support in `resume_all_vcpus_with_downtime` Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-07-07 10:32:35 +08:00
Chao Wu	6a1fe85f10	vfio: add vfio as TODO We add vfio as TODO in this commit and create a github issue for this. Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-07-07 10:32:35 +08:00
Chao Wu	5ea35ddcdc	refractor: remove redundant by_id remove redundant by_id in get_vm_by_id_mut and get_vm_by_id. They are optimized to get_vm_mut and get_vm. Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-07-07 10:32:35 +08:00
Chao Wu	b646d7cb37	config: remove ht_enabled Since cpu topology could tell whether hyper thread is enabled or not, we removed ht_enabled config from VmConfigInfo Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-07-07 10:32:35 +08:00
Chao Wu	cb54ac6c6e	memory: remove reserve_memory_bytes This is currently an unsupported feature and we will remove it from the current code. Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-07-07 10:32:35 +08:00
Chao Wu	bde6609b93	hotplug: add room for other hotplug solution Add room in the code for other hotplug solution without upcall Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-07-07 10:32:35 +08:00
wllenyj	d88b1bf01c	dragonball: update vsock dependency 1. fix vsock device init failed 2. fix VsockDeviceConfigInfo not found Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-07-07 10:32:35 +08:00
Chao Wu	dd003ebe0e	Dragonball: change error name and fix compile error Change error name from `StartMicrovm` to `StartMicroVm`, `StartMicrovmError` to `StartMicroVmError`. Besides, we fix a compile error in config_manager. Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-07-07 10:32:35 +08:00
Chao Wu	38957fe00b	UT: fix compile error in unit tests fix compile error in unit tests for DummyConfigInfo. Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-07-07 10:32:35 +08:00
wllenyj	11b3f95140	dragonball: add virtio-fs device support Virtio-fs devices are supported. Fixes: #4257 Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-07-07 10:32:35 +08:00
wllenyj	948381bdbe	dragonball: add virtio-net device support Virtio-net devices are supported. Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-07-07 10:32:35 +08:00
wllenyj	3d20387a25	dragonball: add virtio-blk device support Virtio-blk devices are supported. Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-07-07 10:32:35 +08:00
Chao Wu	87d38ae49f	Doc: add document for Dragonball API add detailed explanation for Dragonball API Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-07-07 10:32:26 +08:00
Chen Yiyang	f59939a31f	runk: Support `exec` sub-command `exec` will execute a command inside a container which exists and is not frozen or stopped. Inside means that the new process shares namespaces and cgroup with the container init process. The command can be specified by the `--process` parameter to read from a file, or from other parameters such as arg, env, etc. In order to be compatible with the `create`/`run` commands, I refactor libcontainer. `Container` in builder.rs is divided into `InitContainer` and `ActivatedContainer`. `InitContainer` is used for the `create`/`run` commands. It will load the spec from the given bundle path. `ActivatedContainer` is used by the `exec` command, and will read the container's status file, which stores the spec and `CreateOpt` for creating the rustjail::LinuxContainer. It adapts the spec by replacing the process with the given options and updating the namespaces with some paths to join the container. I also rename `ContainerContext` to `ContainerLauncher`, which is only used to spawn processes now. It uses the `LinuxContainer` in rustjail as the runner. For `create`/`run`, the `launch` method will create a new container and run the first process. For `exec`, the `launch` method will spawn a process which joins a container. Fixes #4363 Signed-off-by: Chen Yiyang <cyyzero@qq.com>	2022-07-06 21:11:30 +08:00
Manabu Sugimoto	4d89476c91	runtime: Fix DisableSelinux config Enable Kata runtime to handle `disable_selinux` flag properly in order to be able to change the status by the runtime configuration whether the runtime applies the SELinux label to VMM process. Fixes: #4599 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2022-07-06 15:50:28 +09:00
wllenyj	090de2dae2	dragonball: fix the clippy errors. fix clippy errors and do fmt in this PR. Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-07-06 11:29:49 +08:00
wllenyj	a1593322bd	dragonball: add vsock api to api server Enables vsock to use the api for device configuration. Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-07-06 11:29:49 +08:00
wllenyj	89b9ba8603	dragonball: add set_vm_configuration api Set virtual machine configurations. Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-07-06 11:29:49 +08:00
wllenyj	95fa0c70c3	dragonball: add start microvm support We add microvm start related support in this pull request. Signed-off-by: Liu Jiang <gerry@linux.alibaba.com> Signed-off-by: wllenyj <wllenyj@linux.alibaba.com> Signed-off-by: jingshan <jingshan@linux.alibaba.com> Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-07-06 11:29:49 +08:00
wllenyj	5c1ccc376b	dragonball: add Vmm struct The Vmm struct is global coordinator to manage API servers, virtual machines etc. Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-07-06 11:29:49 +08:00
Jiang Liu	4d234f5742	dragonball: refactor code layout Refactored some code layout. Signed-off-by: Jiang Liu <gerry@linux.alibaba.com>	2022-07-06 11:29:49 +08:00
wllenyj	cfd5dae47c	dragonball: add vm struct The vm struct manages resources and controls the state of a virtual machine instance. Signed-off-by: wllenyj <wllenyj@linux.alibaba.com> Signed-off-by: jingshan <jingshan@linux.alibaba.com> Signed-off-by: Liu Jiang <gerry@linux.alibaba.com> Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-07-06 11:29:46 +08:00
wllenyj	527b73a8e5	dragonball: remove unused feature in AddressSpaceMgr log_dirty_pages is useless now and will be redesigned to support live migration in the future. Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-07-06 11:28:32 +08:00
Fabiano Fidêncio	071dd4c790	Merge pull request #4109 from pmores/drop-in-cfg-files-support Drop in cfg files support	2022-07-05 22:21:24 +02:00
Peng Tao	514b4e7235	Merge pull request #4543 from openanolis/anolis/add_vcpu_configure_aarch64 runtime-rs: Dragonball sandbox - add Vcpu::configure() function for aarch64	2022-07-05 17:47:40 +08:00
Bin Liu	d9e868f44e	Merge pull request #4479 from quanweiZhou/enhance-get-handled-signal agent: enhance get handled signal	2022-07-05 15:18:21 +08:00
Bin Liu	b33ad7e57a	Merge pull request #4574 from jelipo/fix-serde-serializing oci: fix serde skip serializing condition	2022-07-05 13:51:43 +08:00
Bin Liu	0189738283	Merge pull request #4576 from ManaSugi/fix/oci-poststart-hook agent: Run OCI poststart hooks after a container is launched	2022-07-05 11:08:49 +08:00
Peng Tao	cd2d8c6fe2	Merge pull request #4580 from ManaSugi/fix/replace-libc-with-nix agent: Replace some libc functions with nix ones	2022-07-05 10:53:42 +08:00
Peng Tao	a1de394e51	Merge pull request #4550 from liubin/fix/4548-overwrite-mount-type-for-bind-mount runtime: overwrite mount type to bind for bind mounts	2022-07-04 19:56:26 +08:00
haining.cao	0ddb34a38d	oci: fix serde skip serializing condition There is an extra space on the serde serialization condition. Fixes: #4578 Signed-off-by: haining.cao <haining.cao@daocloud.io>	2022-07-04 16:16:04 +08:00
xuejun-xj	7120afe4ed	dragonball: add vcpu test function for aarch64 add create_vcpu() function in vcpu test unit for aarch64 Fixes: #4445 Signed-off-by: xuejun-xj <jiyunxue@linux.alibaba.com> Signed-off-by: jingshan <jingshan@linux.alibaba.com>	2022-07-04 15:23:43 +08:00
xuejun-xj	648d285a24	dragonball: add vcpu support for aarch64 add configure() function for aarch64 vcpu Fixes: #4543 Signed-off-by: xuejun-xj <jiyunxue@linux.alibaba.com> Signed-off-by: jingshan <jingshan@linux.alibaba.com>	2022-07-04 15:23:37 +08:00
xuejun-xj	7dad7c89f3	dragonball: update dbs-xxx dependency change to up-to-date commit ID Fixes: #4543 Signed-off-by: xuejun-xj <jiyunxue@linux.alibaba.com> Signed-off-by: jingshan <jingshan@linux.alibaba.com>	2022-07-04 15:23:11 +08:00
Manabu Sugimoto	fbb2e9bce9	agent: Replace some libc functions with nix ones Replace `libc::setgroups()`, `libc::fchown()`, and `libc::sethostname()` functions with nix crate ones for safety and maintainability. Fixes: #4579 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2022-07-04 14:49:38 +09:00
Manabu Sugimoto	acd3302bef	agent: Run OCI poststart hooks after a container is launched The OCI `poststart` hooks must be called after the user-specified process is executed but before the `start` operation returns, in accordance with the OCI runtime spec. Fixes: #4575 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2022-07-03 18:03:51 +09:00
James O. D. Hunt	59cab9e835	Merge pull request #4380 from Tim-0731-Hzt/rund/makefile runtime-rs: makefile for dragonball	2022-07-01 09:12:38 +01:00
liubin	1f363a386c	runtime: overwrite mount type to bind for bind mounts Some clients like nerdctl may pass mount type of none for volumes/bind mounts, this will lead to container start fails. Referring to runc, it overwrites the mount type to bind and ignores the input value. Fixes: #4548 Signed-off-by: liubin <liubin0329@gmail.com>	2022-07-01 12:13:01 +08:00
GabyCT	02a51e75a7	Merge pull request #4554 from liubin/fix/delete-not-used-console-from-container-config runtime: delete Console from Cmd type	2022-06-30 11:40:07 -05:00
Fabiano Fidêncio	aa561b49f5	Merge pull request #4540 from fidencio/topic/default_maxmemory Add `default_maxmemory` config option	2022-06-30 12:08:15 +02:00
quanweiZhou	2a4fbd6d8c	agent: enhance get handled signal For runC, send the signal to the init process directly. For kata, we try to send `SIGKILL` instead of `SIGTERM` when the process has not installed a handler for `SIGTERM`. The `is_signal_handled` function determines which signals the container process handles, but currently it only checks for caught signals (SigCgt). When the container process is ignoring (SigIgn) or blocking (SigBlk) the signal, `SIGTERM` should also not be converted to `SIGKILL`. For example, when using terminationGracePeriodSeconds, k8s sends `SIGTERM` first and then `SIGKILL`; if the container ignores `SIGTERM`, we should send `SIGTERM`, not `SIGKILL`, to the container. Fixes: #4478 Signed-off-by: quanweiZhou <quanweiZhou@linux.alibaba.com>	2022-06-30 14:44:46 +08:00
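The check described in the commit above can be sketched as follows. This is a minimal illustration, not the agent's actual code: it assumes the masks are read from the `SigBlk`, `SigIgn`, and `SigCgt` lines of `/proc/<pid>/status`, where each value is a hexadecimal bitmask and signal `n` corresponds to bit `n - 1`. The function names `parse_mask` and `signal_to_send` are hypothetical, and `is_signal_handled` here is a re-implementation for illustration only.

```rust
const SIGKILL: u32 = 9;
const SIGTERM: u32 = 15;

/// Extract the hex bitmask of a status field such as "SigCgt".
/// Returns None when the field is absent or not valid hex.
fn parse_mask(status: &str, field: &str) -> Option<u64> {
    status.lines().find_map(|line| {
        let rest = line.strip_prefix(field)?.strip_prefix(':')?;
        u64::from_str_radix(rest.trim(), 16).ok()
    })
}

/// True when `signum` is caught, ignored, or blocked by the process.
fn is_signal_handled(status: &str, signum: u32) -> bool {
    if signum == 0 || signum > 64 {
        return false;
    }
    let bit = 1u64 << (signum - 1);
    ["SigCgt", "SigIgn", "SigBlk"]
        .iter()
        .any(|field| parse_mask(status, field).map_or(false, |mask| mask & bit != 0))
}

/// Convert SIGTERM to SIGKILL only when the process does nothing with SIGTERM.
fn signal_to_send(status: &str, requested: u32) -> u32 {
    if requested == SIGTERM && !is_signal_handled(status, SIGTERM) {
        SIGKILL
    } else {
        requested
    }
}
```

With this sketch, a process whose `SigIgn` mask has bit 14 set (value `0x4000`) keeps receiving `SIGTERM`, while a process with all three masks at zero would receive `SIGKILL` instead.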
GabyCT	2a94261df5	Merge pull request #4549 from liubin/fix/4419-set-status-if-wait-process-failed shim: set a non-zero return code if the wait process call failed.	2022-06-29 17:04:53 -05:00
Fabiano Fidêncio	1e12d56512	Merge pull request #4469 from egernst/config-validation-refactor Refactor how hypervisor config validation is handled	2022-06-29 14:42:11 +02:00
liubin	a5a25ed13d	runtime: delete Console from Cmd type There is much code related to this property, but it is not used anymore. Fixes: #4553 Signed-off-by: liubin <liubin0329@gmail.com>	2022-06-29 17:36:32 +08:00
Pavel Mores	96553e8bd2	runtime: Add documentation of drop-in config file fragments Added user manual for the drop-in config file fragments feature. Signed-off-by: Pavel Mores <pmores@redhat.com>	2022-06-29 10:56:53 +02:00
Pavel Mores	c656457e90	runtime: Add tests of drop-in config file decoding The tests ensure that interactions between drop-ins and the base configuration.toml and among drop-ins themselves work as intended, basically that files are evaluated in the correct order (base file first, then drop-ins in alphabetical order) and the last one to set a specific key wins. Signed-off-by: Pavel Mores <pmores@redhat.com>	2022-06-29 09:54:39 +02:00
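The evaluation order stated in the commit above (base file first, then drop-ins in alphabetical order, last writer wins) can be illustrated with a small sketch. It is written in Rust purely for illustration; the runtime itself is Go and decodes TOML into structs rather than flat maps. The `Config` alias, the `merge_drop_ins` function, and the flat string keys are all hypothetical simplifications.

```rust
use std::collections::BTreeMap;

/// Hypothetical flattened configuration: key -> value.
type Config = BTreeMap<String, String>;

/// Apply drop-in fragments on top of `base`.
/// Fragments are applied in alphabetical order of their file names,
/// so the last fragment to set a key determines its final value.
fn merge_drop_ins(base: &Config, drop_ins: &[(&str, Config)]) -> Config {
    let mut ordered: Vec<&(&str, Config)> = drop_ins.iter().collect();
    ordered.sort_by(|a, b| a.0.cmp(b.0));

    let mut merged = base.clone();
    for (_, fragment) in ordered {
        for (key, value) in fragment {
            merged.insert(key.clone(), value.clone());
        }
    }
    merged
}
```

Keys that no drop-in touches keep their base value, which mirrors the intent that drop-ins only override what they explicitly set.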
Pavel Mores	99f5ca80fc	runtime: Plug drop-in decoding into decodeConfig() Fixes #4108 Signed-off-by: Pavel Mores <pmores@redhat.com>	2022-06-29 09:54:38 +02:00
Pavel Mores	0f9856c465	runtime: Scan drop-in directory, read files and decode them updateFromDropIn() uses the infrastructure built by previous commits to ensure no contents of 'tomlConfig' are lost during decoding. To do this, we preserve the current contents of our tomlConfig in a clone and decode a drop-in into the original. At this point, the original instance is updated but its Agent and/or Hypervisor fields are potentially damaged. To merge, we update the clone's Agent/Hypervisor from the original instance. Now the clone has the desired Agent/Hypervisor and the original instance has the rest, so to finish, we just need to move the clone's Agent/Hypervisor to the original. Signed-off-by: Pavel Mores <pmores@redhat.com>	2022-06-29 09:54:38 +02:00
Pavel Mores	2c1efcc697	runtime: Add helpers to copy fields between tomlConfig instances These functions take a TOML key - an array of individual components, e.g. ["agent" "kata" "enable_tracing"], as returned by BurntSushi - and two 'tomlConfig' instances. They copy the value of the struct field identified by the key from the source instance to the target one if necessary. This is only done if the TOML key points to structures stored in maps by 'tomlConfig', i.e. 'hypervisor' and 'agent'. Nothing needs to be done in other cases. Signed-off-by: Pavel Mores <pmores@redhat.com>	2022-06-29 09:54:38 +02:00
Pavel Mores	20f11877be	runtime: Add framework to manipulate config structs via reflection For 'tomlConfig' substructures stored in Golang maps - 'hypervisor' and 'agent' - BurntSushi doesn't preserve their previous contents as it does for substructures stored directly (e.g. 'runtime'). We use reflection to work around this. This commit adds three primitive operations to work with struct fields identified by their `toml:"..."` tags - one to get a field value, one to set a field value and one to assign a source struct field value to the corresponding field of a target. Signed-off-by: Pavel Mores <pmores@redhat.com>	2022-06-29 09:54:38 +02:00
liubin	ab5f1c9564	shim: set a non-zero return code if the wait process call failed. The return code is an int32 type, so if an error occurs, the default value may be zero, and this value will be treated as a normal exit code. Setting the return code to 255 lets the caller (for example Kubernetes) know that there are some problems with the pod/container. Fixes: #4419 Signed-off-by: liubin <liubin0329@gmail.com>	2022-06-29 12:33:32 +08:00
Zhongtao Hu	07231b2f3f	runtime-rs:refactor network model with netlink add unit test for tcfilter Fixes: #4289 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-06-29 11:38:23 +08:00
Zhongtao Hu	c8a9052063	build: format files add Enter at the end of the file Fixes: #4379 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-06-29 11:19:10 +08:00
Zhongtao Hu	242992e3de	build: put install methods in utils.mk put install methods in utils.mk to avoid duplication Fixes: #4379 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-06-29 11:19:10 +08:00
Zhongtao Hu	8a697268d0	build: makefile for dragonball config use makefile to generate dragonball config file Fixes: #4379 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-06-29 11:19:07 +08:00
Zhongtao Hu	9c526292e7	runtime-rs:refactor network model with netlink refactor tcfilter with netlink Fixes: #4289 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-06-29 11:03:29 +08:00
Eric Ernst	e5be5cb086	runtime: device: cleanup outdated comments Prior device config move didn't update the comments. Let's address this, and make sure comments match the new path... Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2022-06-28 18:22:28 -07:00
Eric Ernst	5f936f268f	virtcontainers: config validation is host specific Ideally this config validation would be in a separate package (katautils?), but that would introduce circular dependency since we'd call it from vc, and it depends on vc types (which, shouldn't be vc, but probably a hypervisor package instead). Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2022-06-28 18:22:28 -07:00
Fabiano Fidêncio	323271403e	virtcontainers: Remove unused function While working on the previous commits, some of the functions become non-used. Let's simply remove them. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-06-28 21:19:24 +02:00
Fabiano Fidêncio	0939f5181b	config: Expose default_maxmemory Expose the newly added `default_maxmemory` to the project's Makefile and to the configuration files. Fixes: #4516 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-06-28 21:19:24 +02:00
Fabiano Fidêncio	58ff2bd5c9	clh,qemu: Adapt to using default_maxmemory Let's adapt Cloud Hypervisor's and QEMU's code to properly behave to the newly added `default_maxmemory` config. While implementing this, a change of behaviour (or a bug fix, depending on how you see it) has been introduced: if a pod requests more memory than the amount available in the host, instead of failing to start the pod, we simply hotplug the maximum amount of memory available, mimicking better the runc behaviour. Fixes: #4516 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-06-28 21:19:24 +02:00
Zhongtao Hu	f3907aa127	runtime-rs:Merge remote-tracking branch 'origin/main' into runtime-rs-newv Fixes:#4536 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-06-28 20:58:40 +08:00
Bin Liu	badbbcd8be	Merge pull request #4400 from openanolis/anolis/dragonball-2 runtime-rs: built-in Dragonball sandbox part II - vCPU manager	2022-06-28 20:41:36 +08:00
Tim Zhang	916ffb75d7	Merge pull request #4432 from liubin/fix/4420-binary-log shim: support shim v2 logging plugin	2022-06-28 16:29:07 +08:00
Fabiano Fidêncio	afdc960424	hypervisor: Add default_maxmemory configuration Let's add a `default_maxmemory` configuration, which allows the admins to set the maximum amount of memory to be used by a VM, considering the initial amount + whatever ends up being hotplugged via the pod limits. By default this value is 0 (zero), and it means that the whole physical RAM is the limit. Fixes: #4516 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-06-28 08:32:15 +02:00
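The semantics described in the commit above can be sketched as a pair of small functions. This is only an illustration of the stated rule (a value of 0 means the whole physical RAM is the limit, and the limit covers the initial amount plus hotplugged memory); it is written in Rust for illustration while the runtime itself is Go, and the function names, the MiB unit, and the capping of hotplug requests are assumptions rather than the actual implementation.

```rust
/// Resolve the VM memory ceiling in MiB.
/// A configured value of 0 means "use all physical RAM of the host".
fn effective_max_mib(default_maxmemory_mib: u64, host_total_mib: u64) -> u64 {
    if default_maxmemory_mib == 0 {
        host_total_mib
    } else {
        default_maxmemory_mib
    }
}

/// Amount to hotplug so that the VM reaches `requested_total_mib`,
/// capped at `max_mib` and never negative.
fn hotplug_mib(initial_mib: u64, requested_total_mib: u64, max_mib: u64) -> u64 {
    requested_total_mib.min(max_mib).saturating_sub(initial_mib)
}
```

For example, under these assumptions a VM that boots with 2048 MiB and a ceiling of 4096 MiB would hotplug at most 2048 MiB, even if the pod limit asks for more.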
Bin Liu	4e30e11b31	shim: support shim v2 logging plugin Now kata shim only supports stdout/stderr of fifo from containerd/CRI-O, but shim v2 supports logging plugins, and nerdctl by default uses the binary schema for logs. This commit adds the other types of log plugins: - file - binary In case of binary, kata shim will receive a stdout/stderr like: binary:///nerdctl?_NERDCTL_INTERNAL_LOGGING=/var/lib/nerdctl/1935db59 That means the nerdctl process will handle the logs (stdout/stderr). Fixes: #4420 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-06-28 13:54:22 +08:00
Eric Ernst	bdf5e5229b	virtcontainers: validate hypervisor config outside of hypervisor itself Depending on the user of it, the hypervisor from hypervisor interface could have differing view on what is valid or not. To help decouple, let's instead check the hypervisor config validity as part of the sandbox creation, rather than as part of the CreateVM call within the hypervisor interface implementation. Fixes: #4251 Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2022-06-27 11:53:41 -07:00
Eric Ernst	469e098543	katautils: don't do validation when loading hypervisor config Policy for what's valid/invalid within the config varies by VMM, host, and by silicon architecture. Let's keep katautils simple for just translating a toml to the hypervisor config structure, and leave validation to virtcontainers. Without this change, we're doing duplicate validation. Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2022-06-27 10:13:26 -07:00
Chao Wu	71db2dd5b8	hotplug: add room for future acpi hotplug mechanism In order to support ACPI hotplug in the future with the cooperative work from the Kata community, we add ACPI feature and dbs-upcall feature to add room for ACPI hotplug. Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-06-27 21:52:36 +08:00
Zizheng Bian	8bb00a3dc8	dragonball: fix a bug when generating kernel boot args We should refuse to generate boot args when hotplugging, not cold starting. Signed-off-by: Zizheng Bian <zizheng.bian@linux.alibaba.com>	2022-06-27 18:12:50 +08:00
Chao Wu	2aedd4d12a	doc: add document for vCPU, api and device Create the document for vCPU and api. Add some detail in the device document. Fixes: #4257 Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-06-27 18:12:50 +08:00
wllenyj	bec22ad01f	dragonball: add api module It is used to define the vmm communication interface. Signed-off-by: Chao Wu <chaowu@linux.alibaba.com> Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-06-27 18:12:50 +08:00
wllenyj	07f44c3e0a	dragonball: add vcpu manager Manage vcpu related operations. Signed-off-by: Liu Jiang <gerry@linux.alibaba.com> Signed-off-by: jingshan <jingshan@linux.alibaba.com> Signed-off-by: Chao Wu <chaowu@linux.alibaba.com> Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-06-27 18:12:48 +08:00
wllenyj	78c9718752	dragonball: add upcall support Upcall is a direct communication tool between VMM and guest developed upon vsock. It is used to implement device hotplug. Signed-off-by: Liu Jiang <gerry@linux.alibaba.com> Signed-off-by: jingshan <jingshan@linux.alibaba.com> Signed-off-by: Chao Wu <chaowu@linux.alibaba.com> Signed-off-by: wllenyj <wllenyj@linux.alibaba.com> Signed-off-by: Zizheng Bian <zizheng.bian@linux.alibaba.com>	2022-06-27 17:04:47 +08:00
wllenyj	7d1953b52e	dragonball: add vcpu Virtual CPU manager for virtual machines. Signed-off-by: Liu Jiang <gerry@linux.alibaba.com> Signed-off-by: jingshan <jingshan@linux.alibaba.com> Signed-off-by: Chao Wu <chaowu@linux.alibaba.com> Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-06-27 17:04:42 +08:00
wllenyj	468c73b3cb	dragonball: add kvm context KVM operation context for virtual machines. Signed-off-by: Liu Jiang <gerry@linux.alibaba.com> Signed-off-by: jingshan <jingshan@linux.alibaba.com> Signed-off-by: Chao Wu <chaowu@linux.alibaba.com> Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-06-27 16:02:06 +08:00
Bin Liu	27b1bb5ed9	Merge pull request #4467 from egernst/device-pkg device package cleanup/refactor	2022-06-27 14:40:53 +08:00
Eric Ernst	e32bf53318	device: deduplicate state structures Before, we maintained almost identical structures between our persist API and what we keep for our devices, with the persist API being a slight subset of device structures. Let's deduplicate this, now that persist is importing device package. Json unmarshal of prior persist structure will work fine, since it was an exact subset of fields. Fixes: #4468 Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2022-06-26 21:31:29 -07:00
Eric Ernst	f97d9b45c8	runtime: device/persist: drop persist dependency from device pkgs Rather than have device package depend on persist, let's define the (almost duplicate) structures within device itself, and have the Kata Container's persist pkg import these. This'll help avoid unnecessary dependencies within our core packages. Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2022-06-26 21:31:29 -07:00
Eric Ernst	f9e96c6506	runtime: device: move to top level package Let's move device package to runtime/pkg instead of being buried under virtcontainers. Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2022-06-26 21:31:29 -07:00
Bin Liu	3880e0c077	agent: refactor reading file timing for debugging The original code reads the mountstats file and returns its content in the error, but by that time the file may have changed. We should instead return the file content that was parsed line by line, to check why there is no fstype option. Fixes: #4246 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-06-26 21:27:43 -07:00
Fabiano Fidêncio	083ca5f217	Merge pull request #4505 from yoheiueda/agent-debug-build agent: Allow BUILD_TYPE=debug	2022-06-24 14:04:23 +02:00
Fabiano Fidêncio	c70d3a2c35	agent: Update the dependencies Let's run a `cargo update` and ensure the deps are up-to-date before we cut the "-rc0" release. Fixes: #4525 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-06-24 11:37:25 +02:00
Fabiano Fidêncio	612fd79bae	random: Fix "nonminimal-bool" clippy warning The error shown below was caught during a dependency bump in the CCv0 branch, but we better fix it here first. ``` error: this boolean expression can be simplified --> src/random.rs:85:21 \| 85 \| assert!(!ret.is_ok()); \| ^^^^^^^^^^^^ help: try: `ret.is_err()` \| = note: `-D clippy::nonminimal-bool` implied by `-D warnings` = help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#nonminimal_bool error: this boolean expression can be simplified --> src/random.rs:93:17 \| 93 \| assert!(!ret.is_ok()); \| ^^^^^^^^^^^^ help: try: `ret.is_err()` \| = help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#nonminimal_bool ``` Fixes: #4523 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-06-24 11:37:05 +02:00
Fabiano Fidêncio	d4417f210e	netlink: Fix "or-fun-call" clippy warnings The error shown below was caught during a dependency bump in the CCv0 branch, but we better fix it here first. ``` error: use of `ok_or` followed by a function call --> src/netlink.rs:526:14 \| 526 \| .ok_or(anyhow!(nix::Error::EINVAL))?; \| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ help: try this: `ok_or_else(\|\| anyhow!(nix::Error::EINVAL))` \| = note: `-D clippy::or-fun-call` implied by `-D warnings` = help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#or_fun_call error: use of `ok_or` followed by a function call --> src/netlink.rs:615:49 \| 615 \| let v = u8::from_str_radix(split.next().ok_or(anyhow!(nix::Error::EINVAL))?, 16)?; \| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ help: try this: `ok_or_else(\|\| anyhow!(nix::Error::EINVAL))` \| = help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#or_fun_call ``` Fixes: #4523 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-06-24 11:37:01 +02:00
Fabiano Fidêncio	133528dd14	Merge pull request #4503 from amshinde/multi-queue-block block: Leverage multiqueue for virtio-block	2022-06-23 12:17:11 +02:00
Yohei Ueda	1b7d36fdb0	agent: Allow BUILD_TYPE=debug The cargo command creates debug build binaries, when the --release option is not specified. Specifying --debug option causes an error. This patch specifies --release option when BUILD_TYPE=release, and does not specify any build type option when BUILD_TYPE=debug. Fixes #4504 Signed-off-by: Yohei Ueda <yohei@jp.ibm.com>	2022-06-23 13:54:32 +09:00
Fabiano Fidêncio	78e27de6c3	Merge pull request #4358 from zvonkok/memreserve runtime: Add heuristic to get the right value(s) for mem-reserve	2022-06-22 13:41:23 +02:00
Archana Shinde	e227b4c404	block: Leverage multiqueue for virtio-block Similar to network, we can use multiple queues for virtio-block devices. This can help improve storage performance. This commit changes the number of queues for block devices to the number of cpus for cloud-hypervisor and qemu. Today the default number of cpus a VM starts with is 1. Hence the queues used will be 1. This change will help improve performance when the default cold-plugged cpus is greater than one by changing this in the config file. This may also help when we use the sandboxing feature with k8s that passes down the sum of the resources required down to Kata. Fixes #4502 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2022-06-21 12:38:53 -07:00
Eric Ernst	72049350ae	Merge pull request #4288 from fengwang666/enable-qemu-sandbox runtime: enable sandbox feature on qemu	2022-06-21 09:22:26 -07:00
Zvonko Kaiser	e7e7dc9dfe	runtime: Add heuristic to get the right value(s) for mem-reserve Fixes: #2938 Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2022-06-21 03:44:28 -07:00
Chao Wu	86123f49f2	Merge branch 'main' into runtime-rs In order to keep up to date with main, we will update runtime-rs every week. Fixes: #4485 Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-06-20 10:01:58 +08:00
Liang Zhou	ef925d40ce	runtime: enable sandbox feature on qemu Enabling "-sandbox on" in qemu adds another protection layer on the host, making the secure container more secure. The option is disabled by default because this feature may introduce some performance cost, even though users can enable /proc/sys/net/core/bpf_jit_enable to reduce the impact. Fixes: #2266 Signed-off-by: Feng Wang <feng.wang@databricks.com>	2022-06-17 15:30:46 -07:00
Chelsea Mafrica	28995301b3	tracing: Remove whitespace from root span Remove space from root span name to follow camel casing of other tracing span names in the runtime and to make parsing easier in testing. Fixes #4483 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2022-06-17 12:07:37 -07:00
wllenyj	e89e6507a4	dragonball: add signal handler Used to register dragonball's signal handler. Signed-off-by: Liu Jiang <gerry@linux.alibaba.com> Signed-off-by: jingshan <jingshan@linux.alibaba.com> Signed-off-by: Chao Wu <chaowu@linux.alibaba.com> Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-06-16 17:31:58 +08:00
Fabiano Fidêncio	f30fe86dc1	Merge pull request #4456 from Bevisy/fixIssue4454 docs: Update outdated URLs and keep them available	2022-06-16 10:26:24 +02:00
Bin Liu	553ec46115	Merge pull request #4436 from alex-matei/fix/sandbox-mem-overflow runtime: fix error when trying to parse sandbox sizing annotations	2022-06-16 11:18:24 +08:00
James O. D. Hunt	9766a285a4	Merge pull request #4422 from snir911/dependabot_bumps deps: Resolve dependabot bumps of containerd, crossbeam-utils, regex	2022-06-15 15:57:53 +01:00
James O. D. Hunt	d06dd8fcdc	Merge pull request #4312 from fidencio/topic/pass-the-tuntap-fd-to-clh Allow Cloud Hypervisor to run under the `container_kvm_t`	2022-06-15 09:37:49 +01:00
Binbin Zhang	a305bafeef	docs: Update outdated URLs and keep them available By comparing the content of the old url and the new url, ensure that their content is consistent and does not contain ambiguities Fixes: #4454 Signed-off-by: Binbin Zhang <binbin36520@gmail.com>	2022-06-15 16:34:28 +08:00
Chelsea Mafrica	db2a4d6cdf	Merge pull request #4441 from liubin/fix/refactor-reading-mountstat-log agent: refactor reading file timing for debugging	2022-06-14 14:18:14 -07:00
Fabiano Fidêncio	ac5dbd8598	clh: Improve logging related to the net dev addition Let's improve the log so we make it clear that we're only actually adding the net device to the Cloud Hypervisor configuration when calling our own version of VmAddNetPut(). Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-06-14 10:53:09 +00:00
Fabiano Fidêncio	0b75522e1f	network: Set queues to 1 to ensure we get the network fds We want to have the file descriptors of the opened tuntap device to pass them down to the VMMs, so the VMMs don't have to explicitly open a new tuntap device themselves, as the `container_kvm_t` label does not allow such a thing. With this change we ensure that what's currently done when using QEMU as the hypervisor, can be easily replicated with other VMMs, even if they don't support multiqueue. As a side effect of this, we need to close the received file descriptors in the code of the VMMs which are not going to use them. Fixes: #3533 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-06-14 10:53:09 +00:00
Fabiano Fidêncio	93b61e0f07	network: Add FFI_NO_PI to the netlink flags Adding FFI_NO_PI to the netlink flags causes no harm to the supported and tested hypervisors as when opening the device by its name Cloud Hypervisor[0], Firecracker[1], and QEMU[2] do set the flag already. However, when receiving the file descriptor of an opened tuntap device Cloud Hypervisor is not able to set the flag, leaving the guest without connectivity. To avoid such an issue, let's simply add the FFI_NO_PI flag to the netlink flags and ensure, from our side, that the VMMs don't have to set it on their side when dealing with an already opened tuntap device. Note that there's a PR opened[3] just for testing that this change doesn't cause any breakage. [0]: `e52175c2ab/net_util/src/tap.rs (L129)` [1]: `b6d6f71213/src/devices/src/virtio/net/tap.rs (L126)` [2]: `3757b0d08b/net/tap-linux.c (L54)` [3]: https://github.com/kata-containers/kata-containers/pull/4292 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-06-14 10:53:09 +00:00
Fabiano Fidêncio	bf3ddc125d	clh: Pass the tuntap fds down to Cloud Hypervisor This is basically a no-op right now, as: * netPair.TapInterface.VMFds is nil * the tap name is still passed to Cloud Hypervisor, which is the Cloud Hypervisor's first choice when opening a tap device. In the very near future we'll stop passing the tap name to Cloud Hypervisor, and start passing the file descriptors of the opened tap instead. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-06-14 10:53:09 +00:00
Fabiano Fidêncio	55ed32e924	clh: Take care of the VmAddNetPut request ourselves Knowing that VmAddNetPut works as expected, let's switch to manually building the request and writing it to the appropriate socket. Doing this gives us more flexibility to, later on, pass the file descriptor of the tuntap device to Cloud Hypervisor, as openAPI doesn't support such operation (it has no notion of SCM Rights). Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-06-14 10:53:09 +00:00
Fabiano Fidêncio	01fe09a4ee	clh: Hotplug the network devices Instead of creating the VM with the network device already plugged in, let's actually add the network device after the VM is created, but before the VM is actually booted. Although it looks like it doesn't make any functional difference between what was done in the past and what this commit introduces, this will be used to work around a limitation on OpenAPI when it comes to passing down the network device's file descriptor to Cloud Hypervisor, so Cloud Hypervisor can use it instead of opening the device by its name on the VMM side. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-06-14 10:51:02 +00:00
Fabiano Fidêncio	2e07538334	clh: Expose VmAddNetPut VmAddNetPut is the API provided by the Cloud Hypervisor client (auto generated) code to hotplug a new network device to the VM. Let's expose it now as it'll be used as part of this series, mostly to guide the reviewer through the process of what we have to do, as later on, spoiler alert, it'll end up being removed. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-06-14 10:27:30 +00:00
Bin Liu	c84a425250	Merge pull request #4442 from openanolis/anolis/fix_safepath_clippy safe-path: fix clippy warning	2022-06-14 14:02:42 +08:00
Fabiano Fidêncio	a80eb33cd6	Merge pull request #4308 from fidencio/topic/virtiofsd-switch-to-using-the-rust-version-on-all-arches runtime: Switch to using the rust version of virtiofsd (all arches but powerpc)	2022-06-13 13:45:51 +02:00
Bin Liu	81acfc1286	Merge pull request #4425 from liubin/fix/4376-change-log-level-of-getoomevent shim: change the log level for GetOOMEvent call failures	2022-06-13 17:53:11 +08:00
James O. D. Hunt	9b93db0220	Merge pull request #4417 from jodh-intel/docs-monitor-considerations docs: Add more kata monitor details	2022-06-13 10:51:52 +01:00
Fabiano Fidêncio	1ef0b7ded0	runtime: Switch to using the rust version of virtiofsd (all but power) So far this has been done for x86_64. Now that the support for building and testing has been added for all arches, let's do the second part of the switch. We're still not done yet for powerpc, as a virtiofsd crash on the rust version has been found by the maintainer. Fixes: #4258, #4260 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-06-13 10:41:26 +02:00
wllenyj	b6cb2c4ae3	dragonball: add metrics system metrics system is added for collecting Dragonball metrics to analyze the system. Signed-off-by: Liu Jiang <gerry@linux.alibaba.com> Signed-off-by: jingshan <jingshan@linux.alibaba.com> Signed-off-by: Chao Wu <chaowu@linux.alibaba.com> Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-06-13 13:51:51 +08:00
wllenyj	e80e0c4645	dragonball: add io manager wrapper Wrapper over IoManager to support device hotplug. Signed-off-by: Liu Jiang <gerry@linux.alibaba.com> Signed-off-by: jingshan <jingshan@linux.alibaba.com> Signed-off-by: Chao Wu <chaowu@linux.alibaba.com> Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-06-13 13:51:46 +08:00
Chao Wu	bb26bd73b1	safe-path: fix clippy warning fix clippy warnings in safe-path lib to make clippy happy. fixes: #4443 Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-06-13 13:38:37 +08:00
Bin Liu	1a5ba31cb0	agent: refactor reading file timing for debugging The original code reads the mountstats file and returns its content in the error, but by that time the file may have changed. We should instead return the content that was parsed line by line, to check why there is no fstype option. Fixes: #4246 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-06-13 10:56:51 +08:00
Chao Wu	d5ee3fc856	safe-path: fix clippy warning fix clippy warnings in safe-path lib to make clippy happy. Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-06-12 10:24:05 +08:00
Alexandru Matei	721ca72a64	runtime: fix error when trying to parse sandbox sizing annotations Changed bitsize for parsing functions to 64-bit in order to avoid parsing errors. Fixes #4435 Signed-off-by: Alexandru Matei <alexandru.matei@uipath.com>	2022-06-11 18:51:10 +03:00
Chao Wu	93c10dfd86	runtime-rs: add crosvm license in Dragonball add THIRD-PARTY file to add license for crosvm. Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-06-11 17:24:58 +08:00
wllenyj	39ff85d610	dragonball: green ci Revert this patch, after dragonball-sandbox is ready. And all subsequent implementations are submitted. Fixes: #4257 Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-06-11 17:24:17 +08:00
wllenyj	71f24d8271	dragonball: add Makefile. Currently supported: build, clippy, check, format, test, clean Fixes: #4257 Signed-off-by: wllenyj <wllenyj@linux.alibaba.com>	2022-06-11 17:24:17 +08:00
Chao Wu	a1df6d0969	Doc: Update Dragonball Readme and add document for device Update Dragonball Readme to fix style problem and add github issue for TODOs. Add document for devices in dragonball. This is the document for the current dragonball device status and we'll keep updating it when we introduce more devices in later pull requests. Fixes: #4257 Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-06-11 17:24:17 +08:00
wllenyj	8619f2b3d6	dragonball: add virtio vsock device manager. Added VsockDeviceMgr struct to manage all vsock devices. Fixes: #4257 Signed-off-by: Liu Jiang <gerry@linux.alibaba.com> Signed-off-by: wllenyj <wllenyj@linux.alibaba.com> Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-06-11 17:23:56 +08:00
wllenyj	52d42af636	dragonball: add device manager. Device manager to manage IO devices for a virtual machine. And added DeviceManagerTx to provide operation transaction for device management, added DeviceManagerContext to operation context for device management. Fixes: #4257 Signed-off-by: Liu Jiang <gerry@linux.alibaba.com> Signed-off-by: wllenyj <wllenyj@linux.alibaba.com> Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-06-11 17:23:56 +08:00
wllenyj	c1c1e5152a	dragonball: add kernel config. It is used for holding guest kernel configuration information. Fixes: #4257 Signed-off-by: Liu Jiang <gerry@linux.alibaba.com> Signed-off-by: wllenyj <wllenyj@linux.alibaba.com> Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-06-11 17:23:46 +08:00
wllenyj	6850ef99ae	dragonball: add configuration manager. It is used for managing a group of configuration information. Fixes: #4257 Signed-off-by: Liu Jiang <gerry@linux.alibaba.com> Signed-off-by: wllenyj <wllenyj@linux.alibaba.com> Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-06-11 17:23:39 +08:00
wllenyj	0bcb422fcb	dragonball: add legacy devices manager The legacy devices manager is used for managing legacy devices. Fixes: #4257 Signed-off-by: Liu Jiang <gerry@linux.alibaba.com> Signed-off-by: wllenyj <wllenyj@linux.alibaba.com> Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-06-11 17:23:33 +08:00
wllenyj	3c45c0715f	dragonball: add console manager. Console manager to manage frontend and backend console devices. A virtual console is composed of two parts: frontend in virtual machine and backend in host OS. A frontend may be serial port, virtio-console etc, a backend may be stdio or Unix domain socket. The manager connects the frontend with the backend. Fixes: #4257 Signed-off-by: Liu Jiang <gerry@linux.alibaba.com> Signed-off-by: wllenyj <wllenyj@linux.alibaba.com> Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-06-11 17:23:27 +08:00
wllenyj	3d38bb3005	dragonball: add address space manager. Address space abstraction to manage virtual machine's physical address space. The AddressSpaceMgr Struct to manage address space. Fixes: #4257 Signed-off-by: Liu Jiang <gerry@linux.alibaba.com> Signed-off-by: wllenyj <wllenyj@linux.alibaba.com> Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-06-11 17:21:41 +08:00
wllenyj	aff6040555	dragonball: add resource manager support. Resource manager manages all resources of a virtual machine instance. Fixes: #4257 Signed-off-by: wllenyj <wllenyj@linux.alibaba.com> Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-06-11 17:21:41 +08:00
wllenyj	8835db6b0f	dragonball: initial commit The dragonball crate initial commit that includes dragonball README and basic code structure. Fixes: #4257 Signed-off-by: wllenyj <wllenyj@linux.alibaba.com> Signed-off-by: Chao Wu <chaowu@linux.alibaba.com>	2022-06-11 17:21:41 +08:00
Fupan Li	9cb15ab4c5	agent: add the FSGroup support Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>	2022-06-11 11:30:51 +08:00
Fupan Li	ff7874bc23	protobuf: upgrade the protobuf version to 2.27.0 Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>	2022-06-11 10:05:52 +08:00
Archana Shinde	aefe11b9ba	Merge pull request #4331 from dgibson/config-enable-iommu-annotation Allow io.katacontainers.config.hypervisor.enable_iommu annotation by …	2022-06-10 17:43:27 -07:00
Zhongtao Hu	06f398a34f	runtime-rs: use withContext to evaluate lazily Fixes: #4129 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-06-10 22:03:13 +08:00
Quanwei Zhou	fd4c26f9c1	runtime-rs: support network resource Fixes: #3785 Signed-off-by: Quanwei Zhou <quanweiZhou@linux.alibaba.com>	2022-06-10 22:02:58 +08:00
Tim Zhang	4be7185aa4	runtime-rs: runtime part implement Fixes: #3785 Signed-off-by: Tim Zhang <tim@hyper.sh> Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com> Signed-off-by: Quanwei Zhou <quanweiZhou@linux.alibaba.com>	2022-06-10 22:01:12 +08:00
Zhongtao Hu	10343b1f3d	runtime-rs: enhance runtimes 1. support oom event 2. use ContainerProcess to store container_id and exec_id 3. support stats Fixes: #3785 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-06-10 22:01:05 +08:00
Quanwei Zhou	9887272db9	libs: enhance kata-sys-util and kata-types Fixes: #3785 Signed-off-by: Quanwei Zhou <quanweiZhou@linux.alibaba.com>	2022-06-10 21:59:47 +08:00
Quanwei Zhou	3ff0db05a7	runtime-rs: support rootfs volume for resource Fixes: #3785 Signed-off-by: Quanwei Zhou <quanweiZhou@linux.alibaba.com>	2022-06-10 19:58:01 +08:00
Tim Zhang	234d7bca04	runtime-rs: support cgroup resource Fixes: #3785 Signed-off-by: Tim Zhang <tim@hyper.sh>	2022-06-10 19:57:53 +08:00
Quanwei Zhou	75e282b4c1	runtime-rs: hypervisor base define Responsible for VM manager, such as Qemu, Dragonball Fixes: #3785 Signed-off-by: Quanwei Zhou <quanweiZhou@linux.alibaba.com>	2022-06-10 19:57:45 +08:00
Quanwei Zhou	bdfee005fa	runtime-rs: service and runtime framework 1. service: Responsible for processing services, such as task service, image service 2. Responsible for implementing different runtimes, such as Virt-container, Linux-container, Wasm-container Fixes: #3785 Signed-off-by: Quanwei Zhou <quanweiZhou@linux.alibaba.com>	2022-06-10 19:57:36 +08:00
Quanwei Zhou	4296e3069f	runtime-rs: agent implements Responsible for communicating with the agent, such as kata-agent in the VM Fixes: #3785 Signed-off-by: Quanwei Zhou <quanweiZhou@linux.alibaba.com>	2022-06-10 19:57:29 +08:00
Jakob Naucke	d3da156eea	runtime-rs: uint FsType for s390x statfs type on s390x should be c_uint, not __fsword_t Fixes: #3888 Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2022-06-10 19:57:23 +08:00
quanwei.zqw	e705ee07c5	runtime-rs: update containerd-shim-protos to 0.2.0 Fixes: #3866 Signed-off-by: quanwei.zqw <quanwei.zqw@alibaba-inc.com>	2022-06-10 19:57:14 +08:00
quanwei.zqw	8c0a60e191	runtime-rs: modify the review suggestion Fixes: #3876 Signed-off-by: quanwei.zqw <quanwei.zqw@alibaba-inc.com>	2022-06-10 19:57:07 +08:00
Zack	278f843f92	runtime-rs: shim implements for runtime-rs Responsible for processing shim related commands: start, delete. This patch is extracted from Alibaba Cloud's internal repository runD Thanks to all contributors! Fixes: #3785 Signed-off-by: acetang <aceapril@126.com> Signed-off-by: Bin Liu <bin@hyper.sh> Signed-off-by: Chao Wu <chaowu@linux.alibaba.com> Signed-off-by: Eryu Guan <eguan@linux.alibaba.com> Signed-off-by: Fupan Li <lifupan@gmail.com> Signed-off-by: gexuyang <gexuyang@linux.alibaba.com> Signed-off-by: Helin Guo <helinguo@linux.alibaba.com> Signed-off-by: He Rongguang <herongguang@linux.alibaba.com> Signed-off-by: Hui Zhu <teawater@gmail.com> Signed-off-by: Issac Hai <hjwissac@linux.alibaba.com> Signed-off-by: Jiahuan Chao <jhchao@linux.alibaba.com> Signed-off-by: lichenglong9 <lichenglong9@163.com> Signed-off-by: mengze <mengze@linux.alibaba.com> Signed-off-by: Qingyuan Hou <qingyuan.hou@linux.alibaba.com> Signed-off-by: Quanwei Zhou <quanweiZhou@linux.alibaba.com> Signed-off-by: shiqiangzhang <shiyu.zsq@linux.alibaba.com> Signed-off-by: Simon Guo <wei.guo.simon@linux.alibaba.com> Signed-off-by: Tim Zhang <tim@hyper.sh> Signed-off-by: wanglei01 <wllenyj@linux.alibaba.com> Signed-off-by: Wei Yang <wei.yang1@linux.alibaba.com> Signed-off-by: yanlei <yl.on.the.way@gmail.com> Signed-off-by: Yiqun Leng <yqleng@linux.alibaba.com> Signed-off-by: yuchang.xu <yuchang.xu@linux.alibaba.com> Signed-off-by: Yves Chan <lingfu@linux.alibaba.com> Signed-off-by: Zack <zmlcc@linux.alibaba.com> Signed-off-by: Zhiheng Tao <zhihengtao@linux.alibaba.com> Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com> Signed-off-by: Zizheng Bian <zizheng.bian@linux.alibaba.com>	2022-06-10 19:56:59 +08:00
Quanwei Zhou	641b736106	libs: enhance kata-sys-util 1. move verify_cid from agent to libs/kata-sys-util 2. enhance kata-sys-util/k8s Signed-off-by: Quanwei Zhou <quanweiZhou@linux.alibaba.com>	2022-06-10 19:55:39 +08:00
Fupan Li	69ba1ae9e4	trans: fix the issue of wrong swappiness type Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>	2022-06-10 19:46:25 +08:00
Quanwei Zhou	d2a9bc6674	agent: agent-protocol support async 1. support async. 2. update ttrpc and protobuf update ttrpc to 0.6.0 update protobuf to 2.23.0 3. support trans from oci Fixes: #3746 Signed-off-by: Quanwei Zhou <quanweiZhou@linux.alibaba.com>	2022-06-10 19:36:55 +08:00
Liu Jiang	aee9633ced	libs/sys-util: provide functions to execute hooks Provide functions to execute OCI hooks. Signed-off-by: Liu Jiang <gerry@linux.alibaba.com> Signed-off-by: Bin Liu <bin@hyper.sh> Signed-off-by: Huamin Tang <huamin.thm@alibaba-inc.com> Signed-off-by: Lei Wang <wllenyj@linux.alibaba.com> Signed-off-by: Quanwei Zhou <quanweiZhou@linux.alibaba.com>	2022-06-10 19:24:30 +08:00
Liu Jiang	8509de0aea	libs/sys-util: add function to detect and update K8s emptyDir volume Add function to detect and update K8s emptyDir volume. Signed-off-by: Liu Jiang <gerry@linux.alibaba.com> Signed-off-by: Qingyuan Hou <qingyuan.hou@linux.alibaba.com>	2022-06-10 19:15:59 +08:00
Liu Jiang	6d59e8e197	libs/sys-util: introduce function to get device id Introduce get_devid() to get major/minor number of a block device. Signed-off-by: Liu Jiang <gerry@linux.alibaba.com> Signed-off-by: Eryu Guan <eguan@linux.alibaba.com>	2022-06-10 19:15:28 +08:00
Liu Jiang	5300ea23ad	libs/sys-util: implement reflink_copy() Implement reflink_copy() to copy file by reflink, and fallback to normal file copy. Signed-off-by: Liu Jiang <gerry@linux.alibaba.com> Signed-off-by: Eryu Guan <eguan@linux.alibaba.com>	2022-06-10 19:15:20 +08:00
Liu Jiang	1d5c898d7f	libs/sys-util: add utilities to parse NUMA information Add utilities to parse NUMA information. Signed-off-by: Liu Jiang <gerry@linux.alibaba.com> Signed-off-by: Qingyuan Hou <qingyuan.hou@linux.alibaba.com> Signed-off-by: Simon Guo <wei.guo.simon@linux.alibaba.com>	2022-06-10 19:15:12 +08:00
Liu Jiang	87887026f6	libs/sys-util: add utilities to manipulate cgroup Add utilities to manipulate cgroup, currently only v1 is supported. Signed-off-by: Liu Jiang <gerry@linux.alibaba.com> Signed-off-by: He Rongguang <herongguang@linux.alibaba.com> Signed-off-by: Jiahuan Chao <jhchao@linux.alibaba.com> Signed-off-by: Qingyuan Hou <qingyuan.hou@linux.alibaba.com> Signed-off-by: Quanwei Zhou <quanweiZhou@linux.alibaba.com> Signed-off-by: Tim Zhang <tim@hyper.sh>	2022-06-10 19:14:59 +08:00
Liu Jiang	ccd03e2cae	libs/sys-util: add wrappers for mount and fs Add some wrappers for mount and fs syscall. Signed-off-by: Liu Jiang <gerry@linux.alibaba.com> Signed-off-by: Bin Liu <bin@hyper.sh> Signed-off-by: Fupan Li <lifupan@gmail.com> Signed-off-by: Huamin Tang <huamin.thm@alibaba-inc.com> Signed-off-by: Lei Wang <wllenyj@linux.alibaba.com> Signed-off-by: Quanwei Zhou <quanweiZhou@linux.alibaba.com>	2022-06-10 19:14:06 +08:00
Liu Jiang	45a00b4f02	libs/sys-util: add kata-sys-util crate under src/libs The kata-sys-util crate is a collection of modules that provides helpers and utilities used by multiple Kata Containers components. Fixes: #3305 Signed-off-by: Liu Jiang <gerry@linux.alibaba.com>	2022-06-10 19:10:40 +08:00
Zhongtao Hu	48c201a1ac	libs/types: make the variable name easier to understand 1. modify default values for hypervisor 2. change the variable name 3. check the min memory limit Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-06-10 19:01:31 +08:00
Zhongtao Hu	b9b6d70aae	libs/types: modify implementation details 1. fix nit problems 2. use generic type when parsing different type Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-06-10 19:01:24 +08:00
Zhongtao Hu	05ad026fc0	libs/types: fix implementation details use ok_or_else to handle get_mut(hypervisor) to substitute unwrap Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-06-10 19:01:17 +08:00
Zhongtao Hu	d96716b4d2	libs/types:fix styles and implementation details 1. Some Nit problems are fixed 2. Make the code more readable 3. Modify some implementation details Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-06-10 19:01:09 +08:00
Zhongtao Hu	6cffd943be	libs/types:return Result to handle parse error If there is a parse error when we are trying to get the annotations, we will return Result<Option<type>> to handle that. Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-06-10 19:00:58 +08:00
Zhongtao Hu	6ae87d9d66	libs/types: use contains to make code more readable use contains when validating hypervisor block_device_driver Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-06-10 19:00:50 +08:00
Zhongtao Hu	45e5780e7c	libs/types: fixed spelling and grammar errors fixed spelling and grammar errors in some files Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-06-10 19:00:43 +08:00
Zhongtao Hu	2599a06a56	libs/types:use include_str! in test file use include_str! to load toml file to string fmt Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-06-10 18:28:14 +08:00
Zhongtao Hu	8ffff40af4	libs/types:Option type to handle empty tomlconfig loading from empty string is only used to identify that the config is not initialized yet, so Option<TomlConfig> is a better option Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-06-10 18:28:05 +08:00
Zhongtao Hu	626828696d	libs/types: add license for test-config.rs add SPDX license identifier: Apache-2.0 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-06-10 18:27:57 +08:00
Liu Jiang	8cdd70f6c2	libs/types: change method to update config by annotation Some annotations are used to override hypervisor configurations, and you know it's dangerous. We must be careful when overriding hypervisor configuration by annotations, to avoid security flaws. There are two existing mechanisms to prevent attacks by annotations: 1) config.hypervisor.enable_annotations defines the allowed annotation keys for config.hypervisor. 2) config.hypervisor.xxxx_paths defines allowed values for specific keys. The access methods for config.hypervisor.xxx enforce the permission checks for the above rules. To update the config, traverse the annotation hashmap and check whether the key is enabled in hypervisor or not. If it is enabled: for a path related annotation, check whether it is valid before updating the config; for a cpu or memory related annotation, check whether it is more than or less than the limit for DB and qemu before updating the config. If it is not enabled, there are three possibilities: an agent related annotation, a runtime related annotation, or a hypervisor related annotation that is not enabled. The function handles agent and runtime annotations first, so the remaining option is the invalid hypervisor annotation, and an error message is returned. add more edge cases tests for updating config clean up unused functions, delete unused files and fix warnings Fixes: #3523 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com> Signed-off-by: Liu Jiang <gerry@linux.alibaba.com>	2022-06-10 18:27:36 +08:00
Liu Jiang	e19d04719f	libs/types: implement KataConfig to wrap TomlConfig The TomlConfig structure is a parsed form of Kata configuration file, but it's a little inconvenient to access that configuration information directly. So introduce a wrapper KataConfig to easily access that configuration information. Two singletons of KataConfig are provided: - KATA_DEFAULT_CONFIG: the original version directly loaded from Kata configuration file. - KATA_ACTIVE_CONFIG: the active version is the KATA_DEFAULT_CONFIG patched by annotations. So the recommended way to use these two singletons: - Load TomlConfig from configuration file and set it as the default one. - Clone the default one and patch it with values from annotations. - Use the default one for permission checks, such as to check for allowed annotation keys/values. - The patched version may be set as the active one or passed to clients. - The clients directly access information from the active/passed one, and do not need to check annotation for override. Signed-off-by: Liu Jiang <gerry@linux.alibaba.com>	2022-06-10 18:26:48 +08:00
Liu Jiang	387ffa914e	libs/types: support load Kata agent configuration from file Add structures to load Kata agent configuration from configuration files. Also define a mechanism for vendor to extend the Kata configuration structure. Signed-off-by: Liu Jiang <gerry@linux.alibaba.com>	2022-06-10 18:26:37 +08:00
Liu Jiang	69f10afb71	libs/types: support load Kata hypervisor configuration from file Add structures to load Kata hypervisor configuration from configuration files. Also define mechanisms: 1) for hypervisors to handle the configuration info. 2) for vendor to extend the Kata configuration structure. Signed-off-by: Liu Jiang <gerry@linux.alibaba.com> Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-06-10 18:25:24 +08:00
Liu Jiang	21cc02d724	libs/types: support load Kata runtime configuration from file Add structures to load Kata runtime configuration from configuration files. Also define a mechanism for vendor to extend the Kata configuration structure. Signed-off-by: Liu Jiang <gerry@linux.alibaba.com> Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2022-06-10 18:25:24 +08:00
Liu Jiang	5b89c1df2f	libs/types: add kata-types crate under src/libs Add kata-types crate to host constants and data types shared by multiple Kata Containers components. Fixes: #3305 Signed-off-by: Liu Jiang <gerry@linux.alibaba.com> Signed-off-by: Fupan Li <lifupan@gmail.com> Signed-off-by: Huamin Tang <huamin.thm@alibaba-inc.com> Signed-off-by: Lei Wang <wllenyj@linux.alibaba.com> Signed-off-by: yanlei <yl.on.the.way@gmail.com>	2022-06-10 18:25:24 +08:00
Liu Jiang	4f62a7618c	libs/logging: fix clippy warnings Fix clippy warnings of libs/logging. Signed-off-by: Liu Jiang <gerry@linux.alibaba.com>	2022-06-10 18:25:24 +08:00
Liu Jiang	6f8acb94c2	libs: refine Makefile rules Refine Makefile rules to better support the KATA ci env. Fixes: #3536 Signed-off-by: Liu Jiang <gerry@linux.alibaba.com>	2022-06-10 18:25:24 +08:00
Liu Jiang	7cdee4980c	libs/logging: introduce a wrapper writer for logging Introduce a wrapper writer `LogWriter` which converts every line written to it into a log record. Signed-off-by: Liu Jiang <gerry@linux.alibaba.com> Signed-off-by: Wei Yang <wei.yang1@linux.alibaba.com> Signed-off-by: yanlei <yl.on.the.way@gmail.com>	2022-06-10 18:25:24 +08:00
Liu Jiang	426f38de94	libs/logging: implement rotator for log files Add FileRotator to rotate log files. The FileRotator structure may be used as writer for create_logger() and limits the storage space occupied by log files. Fixes: #3304 Signed-off-by: Liu Jiang <gerry@linux.alibaba.com> Signed-off-by: Wei Yang <wei.yang1@linux.alibaba.com> Signed-off-by: yanlei <yl.on.the.way@gmail.com>	2022-06-10 18:25:24 +08:00
Liu Jiang	392f1ecdf5	libs: convert to a cargo workspace Convert libs into a Cargo workspace, so all libraries could share the build infrastructure. Fixes #3282 Signed-off-by: Liu Jiang <gerry@linux.alibaba.com>	2022-06-10 18:25:24 +08:00
James O. D. Hunt	412441308b	docs: Add more kata monitor details Add more detail to the `kata-monitor` doc to allow an admin to make a more informed decision about where and how to run the daemon. Fixes: #4416. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2022-06-09 09:20:11 +01:00
Bin Liu	ae911d0cd3	Merge pull request #4378 from cmaf/update-containerd-docs-critools docs: Update source for cri-tools	2022-06-09 15:12:37 +08:00
Bin Liu	05022975c8	Merge pull request #4413 from jodh-intel/tools-full-err-output tools: Enable extra detail on error	2022-06-09 13:52:08 +08:00
Chelsea Mafrica	aaa74e8a2b	Merge pull request #4415 from jodh-intel/agent-ctl-doc-examples docs: Add agent-ctl examples section	2022-06-08 09:51:30 -07:00
Eric Ernst	4ebf9d38b9	Merge pull request #4310 from egernst/core-sched shim: add support for core scheduling	2022-06-08 17:42:45 +02:00
Bin Liu	eff4e1017d	shim: change the log level for GetOOMEvent call failures GetOOMEvent is a blocking call that will fail if the container exits; in this case, it's not an error or warning. Lowering the log level for messages emitted when the GetOOMEvent call fails will reduce log noise in a large cluster where pods are created and deleted frequently. Fixes: #4376 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-06-08 22:17:24 +08:00
dependabot[bot]	5d7fb7b7b0	build(deps): bump github.com/containerd/containerd in /src/runtime Bumps [github.com/containerd/containerd](https://github.com/containerd/containerd) from 1.6.1 to 1.6.6. - [Release notes](https://github.com/containerd/containerd/releases) - [Changelog](https://github.com/containerd/containerd/blob/main/RELEASES.md) - [Commits](https://github.com/containerd/containerd/compare/v1.6.1...v1.6.6) --- updated-dependencies: - dependency-name: github.com/containerd/containerd dependency-type: direct:production ... Fixes: #4421 Signed-off-by: dependabot[bot] <support@github.com>	2022-06-08 10:54:46 +03:00
dependabot[bot]	d0ca2fcbbc	build(deps): bump crossbeam-utils in /src/tools/trace-forwarder Bumps [crossbeam-utils](https://github.com/crossbeam-rs/crossbeam) from 0.8.5 to 0.8.8. - [Release notes](https://github.com/crossbeam-rs/crossbeam/releases) - [Changelog](https://github.com/crossbeam-rs/crossbeam/blob/master/CHANGELOG.md) - [Commits](https://github.com/crossbeam-rs/crossbeam/compare/crossbeam-utils-0.8.5...crossbeam-utils-0.8.8) --- updated-dependencies: - dependency-name: crossbeam-utils dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com>	2022-06-08 10:47:58 +03:00
dependabot[bot]	a60dcff4d8	build(deps): bump regex from 1.5.4 to 1.5.6 in /src/tools/agent-ctl Bumps [regex](https://github.com/rust-lang/regex) from 1.5.4 to 1.5.6. - [Release notes](https://github.com/rust-lang/regex/releases) - [Changelog](https://github.com/rust-lang/regex/blob/master/CHANGELOG.md) - [Commits](https://github.com/rust-lang/regex/compare/1.5.4...1.5.6) --- updated-dependencies: - dependency-name: regex dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com>	2022-06-08 10:47:58 +03:00
dependabot[bot]	dbf50672e1	build(deps): bump crossbeam-utils in /src/tools/agent-ctl Bumps [crossbeam-utils](https://github.com/crossbeam-rs/crossbeam) from 0.8.5 to 0.8.8. - [Release notes](https://github.com/crossbeam-rs/crossbeam/releases) - [Changelog](https://github.com/crossbeam-rs/crossbeam/blob/master/CHANGELOG.md) - [Commits](https://github.com/crossbeam-rs/crossbeam/compare/crossbeam-utils-0.8.5...crossbeam-utils-0.8.8) --- updated-dependencies: - dependency-name: crossbeam-utils dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com>	2022-06-08 10:47:58 +03:00
dependabot[bot]	8e2847bd52	build(deps): bump crossbeam-utils from 0.8.6 to 0.8.8 in /src/libs Bumps [crossbeam-utils](https://github.com/crossbeam-rs/crossbeam) from 0.8.6 to 0.8.8. - [Release notes](https://github.com/crossbeam-rs/crossbeam/releases) - [Changelog](https://github.com/crossbeam-rs/crossbeam/blob/master/CHANGELOG.md) - [Commits](https://github.com/crossbeam-rs/crossbeam/compare/crossbeam-utils-0.8.6...crossbeam-utils-0.8.8) --- updated-dependencies: - dependency-name: crossbeam-utils dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com>	2022-06-08 10:47:58 +03:00
dependabot[bot]	e9ada165ff	build(deps): bump regex from 1.5.4 to 1.5.5 in /src/agent Bumps [regex](https://github.com/rust-lang/regex) from 1.5.4 to 1.5.5. - [Release notes](https://github.com/rust-lang/regex/releases) - [Changelog](https://github.com/rust-lang/regex/blob/master/CHANGELOG.md) - [Commits](https://github.com/rust-lang/regex/compare/1.5.4...1.5.5) --- updated-dependencies: - dependency-name: regex dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com>	2022-06-08 10:47:58 +03:00
dependabot[bot]	adad9cef18	build(deps): bump crossbeam-utils from 0.8.5 to 0.8.8 in /src/agent Bumps [crossbeam-utils](https://github.com/crossbeam-rs/crossbeam) from 0.8.5 to 0.8.8. - [Release notes](https://github.com/crossbeam-rs/crossbeam/releases) - [Changelog](https://github.com/crossbeam-rs/crossbeam/blob/master/CHANGELOG.md) - [Commits](https://github.com/crossbeam-rs/crossbeam/compare/crossbeam-utils-0.8.5...crossbeam-utils-0.8.8) --- updated-dependencies: - dependency-name: crossbeam-utils dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com>	2022-06-08 10:47:58 +03:00
James O. D. Hunt	34bcef8846	docs: Add agent-ctl examples section Add a new `Examples` section to the `agent-ctl` docs giving some examples of how to use the tool with QEMU and stand-alone. Fixes: #4414. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2022-06-08 08:39:38 +01:00
James O. D. Hunt	815157bf02	docs: Remove erroneous whitespace Deleted an extra blank line. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2022-06-08 08:39:38 +01:00
James O. D. Hunt	f5099620f1	tools: Enable extra detail on error The `agent-ctl` and `trace-forwarder` tools make use of `anyhow::Context` to provide additional call site information on error. However, previously neither tool was using the "alternate debug" format to display the error, meaning full error output was not displayed. Fixes: #4411. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2022-06-07 14:00:29 +01:00
Bin Liu	a238d8c6bd	Merge pull request #4300 from justxuewei/fix/rustjail/home-env rustjail: get home dir using nix crate	2022-06-06 11:03:46 +08:00
Bin Liu	f981190621	Merge pull request #4383 from cyyzero/runk-list runk: Support `list` sub-command	2022-06-06 10:25:33 +08:00
David Gibson	8f10e13e07	config: Allow enable_iommu pod annotation by default Since #902 the `io.katacontainers.config.hypervisor` pod annotations have only been permitted if explicitly allowed in the global configuration. The default global configuration allows no such annotations. That's important because several of those annotations would cause Kata to execute arbitrary binaries, and so were wildly unsafe. However, this is inconvenient for the `io.katacontainers.config.hypervisor.enable_iommu` annotation specifically, which controls whether the sandbox VM includes a vIOMMU. A guest side vIOMMU is necessary to implement VFIO passthrough devices with `vfio_mode = vfio`, so enabling that mode of operation currently requires a global configuration change, and can't just be enabled per-pod. Unlike some of the other hypervisor annotations, the `enable_iommu` annotation is quite safe. By default the vIOMMU is not present, so allowing a user to override it for a pod only improves their facilities for isolation. Even if the global default were changed to enable the vIOMMU, that doesn't compel the guest kernel to use it, so allowing a user to disable the vIOMMU doesn't materially affect isolation either. Therefore, allow the io.katacontainers.config.hypervisor.enable_iommu annotation to work in the default configurations. fixes #4330 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2022-06-04 13:02:05 +10:00
Eric Ernst	430da47215	Merge pull request #4360 from fengwang666/shim-leak runtime: ignore ESRCH error from stop container	2022-06-02 12:42:19 -07:00
Feng Wang	9d27c1fced	agent: ignore ESRCH error when destroying containers destroy() method should ignore the ESRCH error from signal::kill and continue the operation as ESRCH is often considered harmless. Fixes: #4359 Signed-off-by: Feng Wang <feng.wang@databricks.com>	2022-06-02 08:19:48 -07:00
Feng Wang	9726f56fdc	runtime: force stop container after the container process exits Set the stop container force flag to true so that the container state is always set to “StateStopped” after the container wait goroutine is finished. This is necessary for the following delete container step to succeed. Fixes: #4359 Signed-off-by: Feng Wang <feng.wang@databricks.com>	2022-06-02 08:17:08 -07:00
Chen Yiyang	38a3188206	runk: Support `list` sub-command Support list sub-command. It will traverse the root directory, parse status file and print basic information of containers. Behavior and print format consistent with runc. To handle race with runk delete or system user modify, the loop will continue to traverse when errors are encountered. Fixes: #4362 Signed-off-by: Chen Yiyang <cyyzero@qq.com>	2022-06-02 18:24:51 +08:00
Peng Tao	295a01f9b1	Merge pull request #4159 from egernst/topic/iptables feature: add ability to interact with IPTables within the guest	2022-06-02 11:19:41 +08:00
Tim Zhang	b8e98b175c	Merge pull request #4355 from liubin/fix/add-debug-info-for-parse-mount-error agent: return mount file content if parse mountinfo failed	2022-06-02 10:31:46 +08:00
Chelsea Mafrica	7ae11cad67	docs: Update source for cri-tools Kubernetes-incubator was previously deprecated in favor of kubernetes-sigs. Fixes #4377 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2022-06-01 12:48:48 -07:00
Bin Liu	3e2817f7b5	Merge pull request #4325 from ManaSugi/runk/error-terminal runk: Return error when tty is used without console socket	2022-06-01 13:58:38 +08:00
Bin Liu	a9a3074828	Merge pull request #4339 from ManaSugi/runk/add-podman-instruction runk: Add Podman guide in README	2022-06-01 11:05:42 +08:00
Manabu Sugimoto	5903815746	agent: Pass standard I/O to container launched by runk The `kata-agent` passes its standard I/O file descriptors through to the container process that will be launched by `runk` without manipulation or modification, so that the container process can handle its own I/O operations. Fixes: #4327 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2022-06-01 10:19:57 +09:00
Eric Ernst	d2df1209a5	docs: describe kata handling for core-scheduling Add initial documentation for core-scheduling. Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2022-05-31 16:17:00 -07:00
Michael Crosby	22b6a94a84	shim: add support for core scheduling In linux 5.14 and hopefully some backports, core scheduling allows processes to be co-scheduled within the same domain on SMT enabled systems. Containerd impl sets the core sched domain when launching a shim. This allows a clean way for each shim (container/pod) to be in its own domain, and for any additional containers (v2 pods) to be launched with the same domain, as well as any exec'd process added to the container. kernel docs: https://www.kernel.org/doc/html/latest/admin-guide/hw-vuln/core-scheduling.html For Kata specifically, we will look for the SCHED_CORE environment variable to be set to indicate we should create a new core scheduling domain. This is equivalent to the containerd shim's PR: `e48bbe8394` Fixes: #4309 Signed-off-by: Eric Ernst <eric_ernst@apple.com> Signed-off-by: Michael Crosby <michael@thepasture.io>	2022-05-31 10:10:40 -07:00
Eric Ernst	af2ef3f7a5	agent-ctl: introduce handle for iptables get/set Add support for the updated agent API for iptables Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2022-05-31 09:27:58 -07:00
Eric Ernst	65f0cef16c	kata-runtime: add iptables CLI to test http endpoint While end users can connect directly to the shim, let's provide a way to easily get/set iptables from kata-runtime itself. Fixes: #4080 Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2022-05-31 09:27:58 -07:00
Eric Ernst	3201ad0830	shim-client: ensure we check resp status for Put/Post Without this, potential errors are silently dropped. Let's ensure we return the error code as well as potential data from the response. Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2022-05-31 09:27:58 -07:00
Eric Ernst	0706fb28ac	kata-runtime: shmgmt: make url usage consistent Before, we had a mix of slash, etc. Unfortunately, when cleaning URL paths, serve mux seems to mangle the request method, resulting in each request being a GET (instead of PUT or POST). Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2022-05-31 09:27:58 -07:00
Eric Ernst	2a09378dd9	shim-client: add support for DoPut While at it, make sure we check for nil in DoPost Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2022-05-31 09:27:58 -07:00
Eric Ernst	640173cfc2	shim-mgmt: Add endpoint handler for interacting with iptables Add two endpoints: ip6tables, iptables. Each url handler supports GET and PUT operations. PUT expects the requests' data to be []bytes, and to contain iptable information in format to be consumed by iptables-restore. Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2022-05-31 09:27:58 -07:00
Eric Ernst	0136be22ca	virtcontainers: plumb iptable set/get from sandbox to agent Introduce get/set iptable handling. We add a sandbox API for getting and setting the IPTables within the guest. This routes it from sandbox interface, through kata-agent, ultimately making requests to the guest agent. Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2022-05-31 09:27:58 -07:00
Eric Ernst	bd50d463b2	agent: iptables: get/set handling for iptables Initial support for getting and setting iptables in the guest. Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2022-05-31 09:27:58 -07:00
Eric Ernst	03176a9e09	proto: update generated code based on proto update Update the generated agent.pb.go code based on proto update. Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2022-05-31 08:45:59 -07:00
Eric Ernst	38ebbc705b	proto: update to add set/get iptables Update the agent protocol definition to introduce support for setting and getting iptables from the guest. Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2022-05-31 08:45:59 -07:00
Bin Liu	78d45b434f	agent: return mount file content if parse mountinfo failed Include mount file content in error message when parsing mountinfo failed for debug. Fixes: #4246, #4103 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-05-31 23:36:14 +08:00
Manabu Sugimoto	c7b3941c96	runk: Enable test for the agent built with standard-oci-runtime feature This enables tests for the kata-agent for runk that is built with standard-oci-runtime feature in CI. Fixes: #4351 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2022-05-31 21:54:28 +09:00
Manabu Sugimoto	6dbce7c3de	agent: Remove unused import in console test Remove some unused imports in console test module used by runk's test. Fixes: #4351 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2022-05-31 21:54:02 +09:00
Xuewei Niu	6ecea84bc5	rustjail: get home dir using nix crate Get user's home dir using `nix::unistd` crate instead of `utils` crate, and remove useless code from agent. Fixes: #4209 Signed-off-by: Xuewei Niu <justxuewei@apache.org>	2022-05-31 15:04:33 +08:00
Manabu Sugimoto	648b8d0aec	runk: Return error when tty is used without console socket runk always launches containers with detached mode, so users have to use a console socket with run or create operation when a terminal is used. If users set `terminal` to `true` in `config.json` and try to launch a container without specifying a console socket, runk returns an error with a message early. Fixes: #4324 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2022-05-31 09:55:39 +09:00
James O. D. Hunt	96c8df40b5	Merge pull request #4335 from ManaSugi/runk/fix-invalid-rootfs runk: Handle rootfs path in config.json properly	2022-05-30 14:03:58 +01:00
Manabu Sugimoto	5205efd9b4	runk: Add Podman guide in README runk can launch containers using Podman, so add the guide in README. Fixes: #4338 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2022-05-30 19:06:46 +09:00
Manabu Sugimoto	d862ca0590	runk: Handle rootfs path in config.json properly This commit enables runk to handle `root.path` in `config.json` properly even if the path is specified by a relative path that includes the single (`.`) or the double (`..`) dots. For example, with a bundle at `/to/bundle` and a rootfs directly under `/to/bundle` such as `/to/bundle/{bin,dev,etc,home,...}`, the `root.path` value can be either `/to/bundle` or just `.`. This behavior conforms to the OCI runtime spec. Accordingly, a bundle path managed by runk's status file (`status.json`) is always stored as a canonical path. Previously, a bundle path was obtained via `oci_state()` of rustjail's API, which returns the path as the parent directory of a rootfs (`root.path`). In the case of the kata-agent, this works properly because Kata Containers assumes that the rootfs path is always `/to/bundle/rootfs`. However, in the case of standard OCI runtimes, a rootfs can be placed anywhere under a bundle, so the rootfs path doesn't always have to be at `/to/bundle/rootfs`. Fixes: #4334 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2022-05-30 14:41:26 +09:00
Fabiano Fidêncio	fff832874e	clh: Update to v24.0 This release has been tracked through the v24.0 project. virtio-iommu specification describes how a device can be attached by default to a bypass domain. This feature is particularly helpful for booting a VM with guest software which doesn't support virtio-iommu but still need to access the device. Now that Cloud Hypervisor supports this feature, it can boot a VM with Rust Hypervisor Firmware or OVMF even if the virtio-block device exposing the disk image is placed behind a virtual IOMMU. Multiple checks have been added to the code to prevent devices with identical identifiers from being created, and therefore avoid unexpected behaviors at boot or whenever a device was hot plugged into the VM. Sparse mmap support has been added to both VFIO and vfio-user devices. This allows the device regions that are not fully mappable to be partially mapped. And the more a device region can be mapped into the guest address space, the fewer VM exits will be generated when this device is accessed. This directly impacts the performance related to this device. A new serial_number option has been added to --platform, allowing a user to set a specific serial number for the platform. This number is exposed to the guest through the SMBIOS. 
* Fix loading RAW firmware (#4072) * Reject compressed QCOW images (#4055) * Reject virtio-mem resize if device is not activated (#4003) * Fix potential mmap leaks from VFIO/vfio-user MMIO regions (#4069) * Fix algorithm finding HOB memory resources (#3983) * Refactor interrupt handling (#4083) * Load kernel asynchronously (#4022) * Only create ACPI memory manager DSDT when resizable (#4013) Deprecated features will be removed in a subsequent release and users should plan to use alternatives * The mergeable option from the virtio-pmem support has been deprecated (#3968) * The dax option from the virtio-fs support has been deprecated (#3889) Fixes: #4317 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-05-26 08:51:18 +00:00
Fupan Li	62d1ed0651	Merge pull request #4290 from Tim-Zhang/remove-oci-kata-agent runk: merge oci-kata-agent into runk	2022-05-25 11:31:25 +08:00
Eric Ernst	6d00701ec9	Merge pull request #4298 from yibozhuang/fix-direct-volume Fix issues with direct-volume stats feature	2022-05-23 15:23:51 -07:00
Tim Zhang	122a85e222	agent: remove bin oci-kata-agent Fixes: #4291 Signed-off-by: Tim Zhang <tim@hyper.sh>	2022-05-23 16:55:16 +08:00
Tim Zhang	35619b45aa	runk: merge oci-kata-agent into runk Merge two bins into one. Fixes: #4291 Signed-off-by: Tim Zhang <tim@hyper.sh>	2022-05-23 16:54:09 +08:00
Yibo Zhuang	8e7c5975c6	agent: fix direct-assigned volume stats The current implementation of walking the disks to match with the requested volume path in agent doesn't work because the volume path provided by the shim to the agent is the mount path within the guest and not the device name. The current logic is trying to match the device name to the volume path which will never match. This change will simplify the get_volume_capacity_stats and get_volume_inode_stats to just call statfs and get the bytes and inodes usage of the volume path directly. Fixes: #4297 Signed-off-by: Yibo Zhuang <yibzhuang@gmail.com>	2022-05-20 18:43:27 -07:00
Yibo Zhuang	4428ceae16	runtime: direct-volume stats use correct name Today the shim does a translation when doing direct-volume stats where it takes the source and returns the mount path within the guest. The source for a direct-assigned volume is actually the device path on the host and not the publish volume path. This change will perform a lookup of the mount info during direct-volume stats to ensure that the device path is provided to the shim for querying the volume stats. Fixes: #4297 Signed-off-by: Yibo Zhuang <yibzhuang@gmail.com>	2022-05-20 18:42:47 -07:00
Yibo Zhuang	ffdc065b4c	runtime: direct-volume stats update to use GET parameter The go default http mux AFAIK doesn’t support pattern routing so right now client is padding the url for direct-volume stats with a subpath of the volume path and this will always result in 404 not found returned by the shim. This change will update the shim to take the volume path as a GET query parameter instead of a subpath. If the parameter is missing or empty, then return 400 BadRequest to the client. Fixes: #4297 Signed-off-by: Yibo Zhuang <yibzhuang@gmail.com>	2022-05-20 18:41:51 -07:00
Yibo Zhuang	f295953183	runtime: fix incorrect Action function for direct-volume stats The action function expects a function that returns error but the current direct-volume stats Action returns (string, error) which is invalid. This change fixes the format and print out the stats from the command instead. Fixes: #4293 Signed-off-by: Yibo Zhuang <yibzhuang@gmail.com>	2022-05-20 14:55:00 -07:00
Peng Tao	2c238c8504	Merge pull request #4213 from zvonkok/vfio runtime: Adding the correct detection of mediated PCIe devices	2022-05-20 15:00:23 +08:00
Fabiano Fidêncio	811ac6a8ce	Merge pull request #4282 from r4f4/runtime-dedup-types-import runtime: remove duplicate 'types' import	2022-05-19 22:15:36 +02:00
Chelsea Mafrica	d8be0f8e9f	Merge pull request #4281 from r4f4/runtime-qemu-comments runtime: sync docstrings with function names	2022-05-19 09:17:38 -07:00
Rafael Fonseca	7a5ccd1264	runtime: sync docstrings with function names The functions were renamed but their docstrings were not. Fixes #4006 Signed-off-by: Rafael Fonseca <r4f4rfs@gmail.com>	2022-05-19 14:31:47 +02:00
Greg Kurz	fa61bd43ee	Merge pull request #4238 from snir911/wip/legacy_console qemu: allow using legacy serial device for the console	2022-05-19 14:30:59 +02:00
Rafael Fonseca	ce2e521a0f	runtime: remove duplicate 'types' import Fallout of `09f7962ff` Fixes #4285 Signed-off-by: Rafael Fonseca <r4f4rfs@gmail.com>	2022-05-19 13:49:47 +02:00
Snir Sheriber	f4994e486b	runtime: allow annotation configuration to use_legacy_serial and update the docs and test Signed-off-by: Snir Sheriber <ssheribe@redhat.com>	2022-05-18 18:58:21 +03:00
Fabiano Fidêncio	c88a48be21	Merge pull request #4271 from r4f4/runtime-err-check-fix runtime: do not check for EOF error in console watcher	2022-05-18 09:49:48 +02:00
Chelsea Mafrica	04bd8f16f0	Merge pull request #4252 from Champ-Goblem/patch/fix-is-signal-handled agent: Fix is_signal_handled failing parsing str to u64	2022-05-17 08:31:48 -07:00
GabyCT	12f0ab120a	Merge pull request #4191 from dgibson/go-test-script Improve Go unit test script	2022-05-17 10:27:04 -05:00
Rafael Fonseca	8052fe62fa	runtime: do not check for EOF error in console watcher The documentation of the bufio package explicitly says "Err returns the first non-EOF error that was encountered by the Scanner." When io.EOF happens, `Err()` will return `nil` and `Scan()` will return `false`. Fixes #4079 Signed-off-by: Rafael Fonseca <r4f4rfs@gmail.com>	2022-05-17 15:14:33 +02:00
Snir Sheriber	c67b9d2975	qemu: allow using legacy serial device for the console This allows to get guest early boot logs which are usually missed when virtconsole is used. - It utilizes previous work on the govmm side: https://github.com/kata-containers/govmm/pull/203 - unit test added Fixes: #4237 Signed-off-by: Snir Sheriber <ssheribe@redhat.com>	2022-05-17 12:06:11 +03:00
Snir Sheriber	44814dce19	qemu: treat console kernel params within appendConsole as it is tightly coupled with the appended console device additionally have it tested Signed-off-by: Snir Sheriber <ssheribe@redhat.com>	2022-05-17 12:05:31 +03:00
Champ-Goblem	4b437d91f0	agent: Fix is_signal_handled failing parsing str to u64 In the is_signal_handled function, when parsing the hex string returned from `/proc/<pid>/status` the space/tab character after the colon is not removed. This patch trims the result of SigCgt so that all whitespace characters are removed. It also extends the existing test cases to check for this scenario. Fixes: #4250 Signed-off-by: Champ-Goblem <cameron@northflank.com>	2022-05-16 20:34:26 +02:00
Fabiano Fidêncio	c39852e83f	runtime: Use ${LIBEXEC}/virtiofsd as the default virtiofsd path As now we build and ship the rust version of virtiofsd, which is not tied to QEMU, we need to update its default location to match with where we're installing this binary. Fixes: #4249 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-05-16 09:30:24 +02:00
David Gibson	e73b70baff	runtime: Don't run unit tests verbose by default go-test.sh by default adds the -v option to 'go test' meaning that output will be printed from all the passing tests as well as any failing ones. This results in a lot of output in which it's often difficult to locate the failing tests you're interested in. So, remove -v from the default flags. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2022-05-13 13:22:31 +10:00
David Gibson	f24a6e761f	runtime: Consolidate flags setting in unit tests script One of the responsibilities of the go-test.sh script is setting up the default flags for 'go test'. This is constructed across several different places in the script using several unneeded intermediate variables though. Consolidate all the flag construction into one place. fixes #4190 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2022-05-13 13:22:29 +10:00
David Gibson	cf465feb02	runtime: Don't change test behaviour based on $CI or $KATA_DEV_MODE go-test.sh changes behaviour based on both the $CI and $KATA_DEV_MODE variables, but not in a way that makes a lot of sense. If either one is set it uses the test_coverage path, instead of the test_local path. That collects coverage information, as the name suggests, but it also means it runs the tests twice as root and non-root, which is very non-obvious. It's not clear what use case the test_local path is for at all. Developer local builds will typically have $KATA_DEV_MODE set and CI builds will have $CI set. There's essentially no downside to running coverage all the time - it has little impact on the test runtime. In addition, if both $CI and $KATA_DEV_MODE are set, the script refuses to run things as root, considering it "unsafe". While having both set might be unwise in a general sense, there's not really any way running sudo can be any more unsafe than it is with either one set. So, simplify everything by just always running the test_coverage path. This leaves the test_local path unused, so we can remove it entirely. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2022-05-13 13:14:37 +10:00
David Gibson	34c4ac599c	runtime: Remove redundant subcommands from go-test.sh go-test.sh accepts subcommands, however invoking it in the usual way via the Makefile doesn't use them. In fact the only remaining subcommand is "help" and we already have another way of getting the usage information (-h or --help). We don't need a second way, so just drop subcommand handling. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2022-05-13 13:14:37 +10:00
David Gibson	0aff5aaa39	runtime: Simplify package listing in go-test.sh go-test.sh defaults to testing all the packages listed by go list, except for a number filtered out. It turns out that none of those filters are necessary any more: * We've long required a Go newer than 1.9 which means the vendor filter isn't needed * The agent filter doesn't do anything now that we've moved to the Kata 2.x unified repo * The tests filters don't hit anything on the list of modules in src/runtime (which is the only user of the script) But since we don't need to filter anything out any more, we don't even need to iterate through a list ourselves. We can simply pass "./..." directly to go test and it will iterate through all the sub-packages itself. Interestingly this more than doubles the speed of "make test" for me - I suspect because go test's internal parallelism works better over a larger pool of tests. This also lets us remove handling of non-existent coverage files from test_go_package(), since with default options we will no longer test packages without tests by default. If the user explicitly requests testing of a package with no tests, then failing makes sense. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2022-05-13 13:14:37 +10:00
David Gibson	557c4cfd00	runtime: Don't chmod coverage files in Go tests The go-test.sh script has an explicit chmod command, run as root, to set the mode of the temporary coverage files to 0644. AFAICT the point of this is specifically the 004 bit allowing world read access, so that we can then merge the temporary coverage file into the main coverage file. That's a convoluted way of doing things. Instead we can just run the tail command which reads the temporary file as the same user that generated it. In addition, go-test.sh became root to remove that temporary coverage file. This is not necessary, since deleting a regular file just requires write access to the directory, not the file itself. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2022-05-13 13:14:37 +10:00
David Gibson	04c8b52e04	runtime: Remove HTML coverage option from go-test.sh The html-coverage option to this script doesn't really alter behaviour it just does the same thing as normal coverage, then converts the report to HTML. That conversion is a single command, plus a chmod to make the final output mode 0644. That overrides any umask the user has set, which doesn't seem like a policy decision this script should be making. Nothing in the kata-containers or tests repository uses this, so it doesn't really make sense to keep this logic inside this script. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2022-05-13 13:14:37 +10:00
David Gibson	7f76914422	runtime: Add coverage.txt.tmp to gitignore In addition to coverage.txt, the go-test.sh script creates coverage.txt.tmp files while running. These are temporary and certainly shouldn't be committed, so add them to the gitignore file. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2022-05-13 13:14:37 +10:00
David Gibson	13c2577004	runtime: Move go testing script locally The go unit tests for the runtime are invoked by the helper script ci/go-test.sh. Which calls the run_go_test() function in ci/lib.sh. Which calls into .ci/go-test.sh from the tests repository. But.. the runtime is the only user of this script, and generally stuff for unit tests (rather than functional or integration tests) lives in the main repository, not the tests repository. So, just move the actual script into src/runtime. A change to remove it from the tests repo will follow. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2022-05-13 13:14:37 +10:00
Snir Sheriber	271933fec0	log-parser: fix some of the documentation minor fixes of links and text Signed-off-by: Snir Sheriber <ssheribe@redhat.com>	2022-05-10 13:23:25 +03:00
Snir Sheriber	c7dacb1211	log-parser: move the kata-log-parser from the tests repo to the kata-containers repo under the src/tools/log-parser folder and vendor the modules Fixes: #4100 Signed-off-by: Snir Sheriber <ssheribe@redhat.com>	2022-05-10 13:23:25 +03:00
GabyCT	61a167139c	Merge pull request #4186 from liubin/fix/4185-skip-loop-by-user agent: Add a macro to skip a loop easier	2022-05-09 16:58:29 -05:00
Fupan Li	8aad2c59c5	Merge pull request #4184 from liubin/fix/4182-runk-kill-all runk: use custom Kill command to support --all option	2022-05-09 17:56:10 +08:00
Zvonko Kaiser	2a1d394147	runtime: Adding the correct detection of mediated PCIe devices Fixes #4212 Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2022-05-09 00:57:06 -07:00
James O. D. Hunt	79d93f1fe7	Merge pull request #4137 from Shensd/sandbox-tests-online_resources agent: add test coverage for functions find_process and online_resources	2022-05-06 09:20:57 +01:00
Chelsea Mafrica	e2f68c6093	Merge pull request #4187 from fidencio/test-hook-grpc-to-oci rustjail: Add tests for hook_grpc_to_oci	2022-05-04 09:25:45 -07:00
Fabiano Fidêncio	bd5da4a7d9	Merge pull request #4189 from yibozhuang/watchable-mount-permission agent watchers: ensure uid/gid is preserved on copy/mkdir	2022-05-04 12:29:24 +02:00
Fabiano Fidêncio	33a8b70558	clh: Rely on Cloud Hypervisor for generating the device ID We're currently hitting a race condition on the Cloud Hypervisor's driver code when quickly removing and adding a block device. This happens because the device removal is an asynchronous operation, and we currently do not monitor events coming from Cloud Hypervisor to know when the device was actually removed. Together with this, the sandbox code doesn't know about that and when a new device is attached it'll quickly assign what may be the very same ID to the new device, leading to the Cloud Hypervisor's driver trying to hotplug a device with the very same ID of the device that was not yet removed. This is, in a nutshell, why the tests with Cloud Hypervisor and devmapper have been failing every now and then. The workaround taken to solve the issue is basically not passing down the device ID to Cloud Hypervisor and simply letting Cloud Hypervisor itself generate those, as Cloud Hypervisor does it in a manner that avoids such conflicts. With this addition we have then to keep a map of the device ID and the Cloud Hypervisor's generated ID, so we can properly remove the device. This workaround will probably stay for a while, at least till someone has enough cycles to implement a way to watch the device removal event and then properly act on that. Spoiler alert, this will be a complex change that may not even be worth it considering the race can be avoided with this commit. Fixes: #4176 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-05-04 09:04:03 +02:00
Jack Hance	475e3bf38f	agent: add test coverage for functions find_process and online_resources Add test coverage for the functions find_process and online_resources in src/sandbox.rs. Fixes #4085 Fixes #4136 Signed-off-by: Jack Hance <jack.hance@ndsu.edu>	2022-05-03 16:00:24 -05:00
Yibo Zhuang	70eda2fa6c	agent: watchers: ensure uid/gid is preserved on copy/mkdir Today in agent watchers, when we copy files/symlinks or create directories, the ownership of the source path is not preserved which can lead to permission issues. In copy, ensure that we do a chown of the source path uid/gid to the destination file/symlink after copy to ensure that ownership matches the source ownership. fs::copy() takes care of setting the permissions. For directory creation, ensure that we set the permissions of the created directory to the source directory permissions and also perform a chown of the source path uid/gid to ensure directory ownership and permissions matches to the source. Fixes: #4188 Signed-off-by: Yibo Zhuang <yibzhuang@gmail.com>	2022-05-03 09:57:31 -07:00
Garrett Mahin	4a1e13bd8f	rustjail: Add tests for hook_grpc_to_oci Add test coverage for hook_grpc_to_oci in rustjail/src/lib.rs Fixes: #4125 Signed-off-by: Garrett Mahin <garrett.mahin@gmail.com>	2022-05-02 23:59:33 +02:00
Bin Liu	383be2203a	agent: Add a macro to skip a loop easier Add a macro to skip a loop easier without using a if {} else {} condition check. Fixes: #4185 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-04-30 20:45:41 +08:00
Bin Liu	c633780ba7	Merge pull request #4119 from bradenrayhorn/test-create-logger-task agent: add tests for create_logger_task function	2022-04-30 19:48:07 +08:00
Bin Liu	97d7b1845b	runk: use custom Kill command to support --all option runk uses liboci-cli crate to parse command line options, but liboci-cli does not support --all option for kill command, though this is the runtime spec behavior. But crictl will issue kill --all command when stopping containers, as a workaround, we use a custom kill command instead of the one provided by liboci-cli. Fixes: #4182 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-04-30 19:34:18 +08:00
Bin Liu	7772f7dd99	runk: set BinaryName for runk for containerd The default runtime for io.containerd.runc.v2 is runc, to use runk, the containerd configuration should set the default runtime to runk or add BinaryName options for the runtime. Fixes: #4177 Signed-off-by: Bin Liu <bin@hyper.sh>	2022-04-29 22:26:32 +08:00
James O. D. Hunt	cc839772d3	Merge pull request #2785 from ManaSugi/standard-container-runtime tools: Add a Rust-based standard OCI container runtime based on Kata agent	2022-04-29 13:20:59 +01:00
James O. D. Hunt	2d5f11501c	Merge pull request #4083 from bradenrayhorn/test-parse-mount-table rustjail: add tests for parse_mount_table	2022-04-29 11:34:22 +01:00
Jianyong Wu	982c32358a	Merge pull request #4031 from Jaylyn-Ren/kata-spdk Virtcontainers: Enable hot plugging vhost-user-blk device on ARM	2022-04-29 12:16:38 +08:00
Chelsea Mafrica	3f069c7acb	Merge pull request #4166 from jodh-intel/agent-ctl-fix-abstract agent-ctl: Fix abstract socket connections	2022-04-28 10:17:28 -07:00
James O. D. Hunt	666aee54d2	docs: Add VSOCK localhost example for agent-ctl Update the `agent-ctl` docs to show how to use a VSOCK local address when running the agent and the tool in the same environment. This is an alternative to using a Unix socket. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2022-04-28 13:33:23 +01:00
James O. D. Hunt	86d348e065	docs: Use VM term in agent-ctl doc Use the standard "VM" acronym to mean Virtual Machine. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2022-04-28 13:33:19 +01:00
James O. D. Hunt	4b9b62bb3e	agent-ctl: Fix abstract socket connections Unbreak the `agent-ctl` tool connecting to the agent with a Unix domain socket. It appears that [1] changed the behaviour of connecting to the agent using a local Unix socket (which is not used by Kata under normal operation). The change can be seen by reverting to commit `72b8144b56` (the one before [1]) and running the agent manually as: ```bash $ sudo KATA_AGENT_SERVER_ADDR=unix:///tmp/foo.socket target/x86_64-unknown-linux-musl/release/kata-agent ``` Before [1], in another terminal we see this: ```bash $ sudo lsof -U 2>/dev/null \|grep foo\|awk '{print $9}' @/tmp/foo.socket@ ``` But now, we see the following: ```bash $ sudo lsof -U 2>/dev/null \|grep foo\|awk '{print $9}' @/tmp/foo.socket ``` Note the last byte which represents a nul (`\0`) value. The `agent-ctl` tool used to add that trailing nul but now it seems to not be needed, so this change removes it, restoring functionality. No external changes are necessary so the `agent-ctl` tool can connect to the agent as below like this: ```bash $ cargo run -- -l debug connect --server-address "unix://@/tmp/foo.socket" --bundle-dir "$bundle_dir" -c Check -c GetGuestDetails ``` [1] - https://github.com/kata-containers/kata-containers/issues/3124 Fixes: #4164. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2022-04-28 13:33:09 +01:00
Fabiano Fidêncio	b6467ddd73	clh: Expose disk rate limiter config With everything implemented, let's now expose the disk rate limiter configuration options in the Cloud Hypervisor configuration file. Fixes: #4139 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-04-28 10:28:29 +02:00
Fabiano Fidêncio	7580bb5a78	clh: Expose net rate limiter config With everything implemented, let's now expose the net rate limiter configuration options in the Cloud Hypervisor configuration file. Fixes: #4017 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-04-28 10:28:13 +02:00
Fabiano Fidêncio	a88adabaae	clh: Cloud Hypervisor has a built-in Rate Limiter The notion of "built-in rate limiter" was added as part of `bd8658e362`, and that commit considered that only Firecracker had a built-in rate limiter, which I think was the case when that was introduced (mid 2020). Nowadays, however, Cloud Hypervisor takes advantage of the very same crate used by Firecracker to do I/O throttling. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-04-28 10:27:56 +02:00
Fabiano Fidêncio	63c4da03a9	clh: Implement the Disk RateLimiter logic Let's take advantage of the newly added DiskRateLimiter* options and apply those to the disk device configuration. The logic here is identical to the one already present in the Network part of Cloud Hypervisor's driver. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-04-28 10:27:53 +02:00
Fabiano Fidêncio	511f7f822d	config: Add DiskRateLimiter* to Cloud Hypervisor Let's add the newly added disk rate limiter configurations to the Cloud Hypervisor's hypervisor configuration. Right now those are not used anywhere, and there's absolutely no way the users can set those up. That's coming later in this very same series. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-04-28 10:27:15 +02:00
Fabiano Fidêncio	5b18575dfe	hypervisor: Add disk bandwidth and operations rate limiters This is the disk counterpart of what was introduced for the network as part of the previous commits in this series. The newly added fields are: * DiskRateLimiterBwMaxRate, defined in bits per second, which is used to control the disk I/O bandwidth at the VM level. * DiskRateLimiterBwOneTimeBurst, also defined in bits per second, which is used to define an initial max rate, which doesn't replenish. * DiskRateLimiterOpsMaxRate, the operations per second equivalent of the DiskRateLimiterBwMaxRate. * DiskRateLimiterOpsOneTimeBurst, the operations per second equivalent of the DiskRateLimiterBwOneTimeBurst. For now those extra fields have only been added to the hypervisor's configuration and they'll be used in the coming patches of this very same series. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-04-28 10:27:11 +02:00
Fabiano Fidêncio	1cf9469297	clh: Implement the Network RateLimiter logic Let's take advantage of the newly added NetRateLimiter* options and apply those to the network device configuration. The logic here is quite similar to the one already present in the Firecracker's driver, with the main difference being the single Inbound / Outbound MaxRate and the presence of both Bandwidth and Operations rate limiter. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-04-28 10:26:38 +02:00
Fabiano Fidêncio	00a5b1bda9	utils: Define DefaultRateLimiterRefillTimeMilliSecs Firecracker's driver doesn't expose the RefillTime option of the rate limiter to the user. Instead, it uses a constant value of 1000 milliseconds (1 second). As we're following Firecracker's driver implementation, let's create a new constant, use it as part of the Firecracker's driver, and later on re-use it as part of the Cloud Hypervisor's driver. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-04-28 10:22:42 +02:00
Fabiano Fidêncio	be1bb7e39f	utils: Move FC's function to revert bytes to utils Firecracker's revertBytes function, now called "RevertBytes", can be exposed as part of the virtcontainers' utils file, as this function will be reused by Cloud Hypervisor, when adding the rate limiter logic there. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-04-28 10:22:42 +02:00
Fabiano Fidêncio	c9f6496d6d	config: Add NetRateLimiter* to Cloud Hypervisor Let's add the newly added network rate limiter configurations to the Cloud Hypervisor's hypervisor configuration. Right now those are not used anywhere, and there's absolutely no way the users can set those up. That's coming later in this very same series. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-04-28 10:22:42 +02:00
Fabiano Fidêncio	2d35e6066d	hypervisor: Add network bandwidth and operations rate limiters In a similar way to what's already exposed as RxRateLimiterMaxRate and TxRateLimiterMaxRate, let's add four new fields to the Hypervisor's configuration. The values added are related to bandwidth and operations rate limiters, which have to be added so we can expose I/O throttling configurations to users using Cloud Hypervisor as their preferred VMM. The reason we cannot simply re-use {Rx,Tx}RateLimiterMaxRate is because Cloud Hypervisor exposes a single MaxRate to be used for both inbound and outbound queues. The newly added fields are: * NetRateLimiterBwMaxRate, defined in bits per second, which is used to control the network I/O bandwidth at the VM level. * NetRateLimiterBwOneTimeBurst, also defined in bits per second, which is used to define an initial max rate, which doesn't replenish. * NetRateLimiterOpsMaxRate, the operations per second equivalent of the NetRateLimiterBwMaxRate. * NetRateLimiterOpsOneTimeBurst, the operations per second equivalent of the NetRateLimiterBwOneTimeBurst. For now those extra fields have only been added to the hypervisor's configuration and they'll be used in the coming patches of this very same series. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-04-28 10:22:42 +02:00
Braden Rayhorn	b0e439cb66	rustjail: add tests for parse_mount_table Add tests for parse_mount_table function in rustjail/src/mount.rs. Includes some minor refactoring to improve the testability of the function and improve its error values. Fixes: #4082 Signed-off-by: Braden Rayhorn <bradenrayhorn@fastmail.com>	2022-04-27 20:06:01 -05:00
Manabu Sugimoto	b221a2590f	tools: Add runk Add a Rust-based standard OCI container runtime based on Kata agent. You can build and install runk as follows: ```sh $ cd src/tools/runk $ make $ sudo make install $ runk --help ``` Fixes: #2784 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2022-04-28 00:48:57 +09:00
Manabu Sugimoto	2c218a07b9	agent: Modify Kata agent for runk Generate an oci-kata-agent which is a customized agent to be called from runk which is a Rust-based standard OCI container runtime based on Kata agent. Fixes: #2784 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2022-04-28 00:48:57 +09:00
James O. D. Hunt	0a6e7d443e	Merge pull request #3910 from etrunko/agent_random Agent: Unit tests for random.rs	2022-04-27 09:41:02 +01:00
James O. D. Hunt	7b20707197	Merge pull request #4107 from garrettmahin/test-mount-grpc-to-oci rustjail: Add tests for mount_grpc_to_oci	2022-04-27 08:50:24 +01:00
Peng Tao	5b6e45ed6c	Merge pull request #4141 from dgibson/cleanup-tmp Fix Go unit tests to clean up /tmp after themselves	2022-04-26 15:43:34 +08:00
Garrett Mahin	4b9e78b837	rustjail: Add tests for mount_grpc_to_oci Add test coverage for mount_grpc_to_oci in rustjail/src/lib.rs Fixes: #4106 Signed-off-by: Garrett Mahin <garrett.mahin@gmail.com>	2022-04-25 08:37:17 -05:00
James O. D. Hunt	bc919cc54c	Merge pull request #4122 from bradenrayhorn/test-mount-from rustjail: add tests for mount_from function	2022-04-25 11:55:21 +01:00
James O. D. Hunt	cb8dd0f4fc	Merge pull request #4143 from garrettmahin/test-hooks-grpc-to-oci rustjail: Add tests for hooks_grpc_to_oci	2022-04-25 10:50:52 +01:00
Braden Rayhorn	81f6b48626	agent: add tests for create_logger_task function Add tests for create_logger_task function in src/main.rs. Fixes: #4113 Signed-off-by: Braden Rayhorn <bradenrayhorn@fastmail.com>	2022-04-24 21:38:32 -05:00
Garrett Mahin	96bc3ec2e9	rustjail: Add tests for hooks_grpc_to_oci Add test coverage for hooks_grpc_to_oci in rustjail/src/lib.rs Fixes: #4142 Signed-off-by: Garrett Mahin <garrett.mahin@gmail.com>	2022-04-22 19:20:04 -05:00
holyfei	0239502781	agent: modify the type of swappiness to u64 The type of MemorySwappiness in runtime is uint64, and the type of swappiness in agent is int64, if we set max uint64 in runtime and pass it to agent, the value will be equal to -1. We should modify the type of swappiness to u64 Fixes: #4123 Signed-off-by: holyfei <yangfeiyu20092010@163.com>	2022-04-22 16:55:37 +08:00
David Gibson	1b931f4203	runtime: Allow mockfs storage to be placed in any directory Currently EnableMockTesting() takes no arguments and will always place the mock storage in the fixed location /tmp/vc/mockfs. This means that one test run can interfere with the next one if anything isn't cleaned up (and there are other bugs which means that happens). Even if those were fixed, this would allow developers testing on the same machine to interfere with each other. So, allow the mockfs to be placed at an arbitrary place given as a parameter to EnableMockTesting(). In TestMain() we place it under our existing temporary directory, so we don't need any additional cleanup just for the mockfs. fixes #4140 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2022-04-22 14:47:59 +10:00
David Gibson	ef6d54a781	runtime: Let MockFSInit create a mock fs driver at any path Currently MockFSInit always creates the mockfs at the fixed path /tmp/vc/mockfs. This change allows it to be initialized at any path given as a parameter. This allows the tests in fs_test.go to be simplified, because by using a temporary directory from t.TempDir(), which is automatically cleaned up, we don't need to manually trigger initTestDir() (which is misnamed, it's actually a cleanup function). For now we still use the fixed path when auto-creating the mockfs in MockAutoInit(), but we'll change that later. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2022-04-22 14:23:36 +10:00
David Gibson	5d8438e939	runtime: Move mockfs control global into mockfs.go virtcontainers/persist/fs/mockfs.go defines a mock filesystem type for testing. A global variable in virtcontainers/persist/manager.go is used to force use of the mock fs rather than a normal one. This patch moves the global, and the EnableMockTesting() function which sets it into mockfs.go. This is slightly cleaner to begin with, and will allow some further enhancements. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2022-04-22 14:23:36 +10:00
David Gibson	963d03ea8a	runtime: Export StoragePathSuffix storagePathSuffix defines the file path suffix - "vc" - used for Kata's persistent storage information, as a private constant. We duplicate this information in fc.go which also needs it. Export it from fs.go instead, so it can be used in fc.go. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2022-04-22 14:23:36 +10:00
David Gibson	1719a8b491	runtime: Don't abuse MockStorageRootPath() for factory tests A number of unit tests under virtcontainers/factory use MockStorageRootPath() as a general purpose temporary directory. This doesn't make sense: the mockfs driver isn't even in use here since we only call EnableMockTesting for the base virtcontainers package, not the subpackages. Instead use t.TempDir() which is for exactly this purpose. As a bonus it also handles the cleanup, so we don't need MockStorageDestroy any more. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2022-04-22 14:23:36 +10:00
David Gibson	bec59f9e39	runtime: Make bind mount tests better clean up after themselves There are several tests in mount_test.go which perform a sample bind mount. These need a corresponding unmount to clean up afterwards or attempting to delete the temporary files will fail due to the existing mountpoint. Most of them had such an unmount, but TestBindMountInvalidPgtypes was missing one. In addition, the existing unmounts were done inconsistently - one was simply inline (so wouldn't be executed if the test fails too early) and one is a defer. Change them all to use the t.Cleanup mechanism. For the dummy mountpoint files, rather than cleaning them up after the test, the tests were removing them at the beginning of the test. That stops the test being messed up by a previous run, but messily. Since these are created in a private temporary directory anyway, if there's something already there, that indicates a problem we shouldn't ignore. In fact we don't need to explicitly remove these at all - they'll be removed along with the rest of the private temporary directory. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2022-04-22 14:20:35 +10:00
David Gibson	f7ba21c86f	runtime: Clean up mock hook logs in tests The tests in hook_test.go run a mock hook binary, which does some debug logging to /tmp/mock_hook.log. Currently we don't clean up those logs when the tests are done. Use a test cleanup function to do this. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2022-04-22 14:14:52 +10:00
David Gibson	90b2f5b776	runtime: Make SetupOCIConfigFile clean up after itself SetupOCIConfigFile creates a temporary directory with os.MkdirTemp(). This means the callers need to register a deferred function to remove it again. At least one of them was commented out meaning that a /temp/katatest- directory was leftover after the unit tests ran. Change to using t.TempDir() which as well as better matching other parts of the tests means the testing framework will handle cleaning it up. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2022-04-22 14:14:52 +10:00
David Gibson	2eeb5dc223	runtime: Don't use fixed /tmp/mountPoint path Several tests in kata_agent_test.go create /tmp/mountPoint as a dummy directory to mount. This is not cleaned up after the test. Although it is in /tmp, that's still a little messy and can be confusing to a user. In addition, because it uses the same name every time, it allows for one run of the test to interfere with the next. Use the built in t.TempDir() to use an automatically named and deleted temporary directory instead. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2022-04-22 14:14:52 +10:00
Liu Jiang	0ad89ebd7c	safe-path: add more unit test cases Add more unit test cases to improve code coverage. Signed-off-by: Liu Jiang <gerry@linux.alibaba.com>	2022-04-21 10:01:23 +08:00
Liu Jiang	b63774ec61	libs/safe-path: add crate to safely resolve fs paths There are always path (symlink) based attacks, so the `safe-path` crate tries to provide some mechanisms to harden path resolution related code. Fixes: #3451 Signed-off-by: Liu Jiang <gerry@linux.alibaba.com>	2022-04-21 10:01:21 +08:00
Braden Rayhorn	f385b21b05	rustjail: add tests for mount_from function Add tests for the mount_from function in rustjail mount.rs file. Fixes: #4121 Signed-off-by: Braden Rayhorn <bradenrayhorn@fastmail.com>	2022-04-20 20:04:57 -05:00
Braden Rayhorn	0e7f1a5e3a	agent: move assert_result macro to test_utils file Move the assert_result macro to the shared test_utils file so that it is not duplicated in individual files. Fixes: #4093 Signed-off-by: Braden Rayhorn <bradenrayhorn@fastmail.com>	2022-04-19 18:57:16 -05:00
Fabiano Fidêncio	604a795073	Merge pull request #4096 from garrettmahin/test-root-grpc-to-oci rustjail: Add tests for root_grpc_to_oci	2022-04-19 21:38:58 +02:00
Fabiano Fidêncio	f619c65b6a	Merge pull request #4074 from bradenrayhorn/test-mount-to-rootfs agent: add tests for mount_to_rootfs function	2022-04-19 21:36:11 +02:00
Fabiano Fidêncio	7ec42951f2	Merge pull request #4035 from bradenrayhorn/test-update-container-namespaces agent: add tests for update_container_namespaces	2022-04-19 21:36:02 +02:00
Fabiano Fidêncio	e6bc912439	Merge pull request #3940 from bradenrayhorn/test-is-signal-handled agent: add tests for is_signal_handled function	2022-04-19 21:35:48 +02:00
Archana Shinde	33e244f284	Merge pull request #4102 from likebreath/0414/clh_v23.0 Upgrade to Cloud Hypervisor v23.0	2022-04-19 06:01:04 -07:00
Fabiano Fidêncio	dbb0c67523	Merge pull request #4072 from fengwang666/dv-bug agent: best-effort removing mount point	2022-04-19 10:08:40 +02:00
Chelsea Mafrica	0af13b469d	Merge pull request #4086 from BbolroC/s390x-fix test: Fix golangci-lint error for s390x	2022-04-15 21:07:09 -07:00
Bin Liu	b19bfac7cd	Merge pull request #4042 from yibozhuang/direct-assign-fsgroup fsGroup support for direct-assigned volume	2022-04-16 10:23:15 +08:00
Bin Liu	4ec1967542	Merge pull request #4094 from fgiudici/kata-monitor_readme kata-monitor: add the README file	2022-04-16 08:27:22 +08:00
Bin Liu	362201605e	Merge pull request #4055 from fgiudici/kata-monitor_pprof kata-monitor: update the hrefs in the debug/pprof index page	2022-04-16 08:12:18 +08:00
Garrett Mahin	2256bcb6ab	rustjail: Add tests for root_grpc_to_oci Add test coverage for root_grpc_to_oci in rustjail/src/lib.rs Fixes: #4095 Signed-off-by: Garrett Mahin <garrett.mahin@gmail.com>	2022-04-15 11:09:18 -05:00
Francesco Giudici	7b2ff02647	kata-monitor: add a README file Fixes: #3704 Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2022-04-15 18:03:23 +02:00
Bo Chen	29e569aa92	virtcontainers: clh: Re-generate the client code This patch re-generates the client code for Cloud Hypervisor v23.0. Note: The client code of cloud-hypervisor's (CLH) OpenAPI is automatically generated by openapi-generator [1-2]. [1] https://github.com/OpenAPITools/openapi-generator [2] https://github.com/kata-containers/kata-containers/blob/main/src/runtime/virtcontainers/pkg/cloud-hypervisor/README.md Signed-off-by: Bo Chen <chen.bo@intel.com>	2022-04-14 12:56:01 -07:00
Feng Wang	aabcebbf58	agent: best-effort removing mount point During container exit, the agent tries to remove all the mount point directories, which can fail if it's a read-only filesystem (e.g. device mapper). This commit ignores the removal failure and logs a warning message. Fixes: #4043 Signed-off-by: Feng Wang <feng.wang@databricks.com>	2022-04-13 22:40:23 -07:00
Chelsea Mafrica	32f92e75cc	Merge pull request #4021 from fengwang666/direct-volume-bug runtime: Base64 encode the direct volume mountInfo path	2022-04-13 13:15:38 -07:00
Greg Kurz	4443bb68a4	Merge pull request #4064 from tiezhuoyu/4063/no-need-to-write-error-of-virtiofsd-to-kata-log runtime: no need to write virtiofsd error to log	2022-04-13 11:59:19 +02:00
Hyounggyu Choi	d136c9c240	test: Fix golangci-lint error for s390x This is to fix a test failure for the kata-containers-2.0-ubuntu-20.04-s390x-main-baseline jenkins job Fixes: #4088 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2022-04-13 09:20:51 +02:00
Fupan Li	66aa07649b	Merge pull request #4062 from liubin/fix/4061-add-links-for-kata-monitor kata-monitor: add some links when generating pages for browsers	2022-04-13 11:30:21 +08:00
Francesco Giudici	86977ff780	kata-monitor: update the hrefs in the debug/pprof index page kata-monitor can retrieve profiling data from the kata shim instances running on the same node by acting as a proxy (e.g., http://$NODE_ADDRESS:8090/debug/pprof/?sandbox=$MYSANDBOXID). In order to proxy the requests and the responses to the right shim, kata-monitor requires the sandbox id to be passed via a query string in the url. The profiling index page proxied by kata-monitor contains the links to all the data profiles available. However, none of those links contain the sandbox id included in the request, so they are broken when accessed through kata-monitor. This happens because the profiling index page comes from the kata shim, which will not include the query string provided in the http request. Let's add the sandbox id on the fly to each href tag returned by the kata shim index page before providing the proxied page. Fixes: #4054 Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2022-04-12 15:53:59 +02:00
Fabiano Fidêncio	78f30c33c6	agent: Avoid agent panic when reading empty stats This was seen in an issue report, where we'd try to unwrap a None value, leading to a panic. Fixes: #4077 Related: #4043 Full backtrace: ``` "thread 'tokio-runtime-worker' panicked at 'called `Option::unwrap()` on a `None` value', rustjail/src/cgroups/fs/mod.rs:593:31" "stack backtrace:" " 0: 0x7f0390edcc3a - std::backtrace_rs::backtrace::libunwind::trace::hd5eff4de16dbdd15" " at /rustc/db9d1b20bba1968c1ec1fc49616d4742c1725b4b/library/std/src/../../backtrace/src/backtrace/libunwind.rs:93:5" " 1: 0x7f0390edcc3a - std::backtrace_rs::backtrace::trace_unsynchronized::h04a775b4c6ab90d6" " at /rustc/db9d1b20bba1968c1ec1fc49616d4742c1725b4b/library/std/src/../../backtrace/src/backtrace/mod.rs:66:5" " 2: 0x7f0390edcc3a - std::sys_common::backtrace::_print_fmt::h3253c3db9f17d826" " at /rustc/db9d1b20bba1968c1ec1fc49616d4742c1725b4b/library/std/src/sys_common/backtrace.rs:67:5" " 3: 0x7f0390edcc3a - <std::sys_common::backtrace::_print::DisplayBacktrace as core::fmt::Display>::fmt::h02bfc712fc868664" " at /rustc/db9d1b20bba1968c1ec1fc49616d4742c1725b4b/library/std/src/sys_common/backtrace.rs:46:22" " 4: 0x7f0390a91fbc - core::fmt::write::hfd5090d1132106d8" " at /rustc/db9d1b20bba1968c1ec1fc49616d4742c1725b4b/library/core/src/fmt/mod.rs:1149:17" " 5: 0x7f0390edb804 - std::io::Write::write_fmt::h34acb699c6d6f5a9" " at /rustc/db9d1b20bba1968c1ec1fc49616d4742c1725b4b/library/std/src/io/mod.rs:1697:15" " 6: 0x7f0390edbee0 - std::sys_common::backtrace::_print::hfca761479e3d91ed" " at /rustc/db9d1b20bba1968c1ec1fc49616d4742c1725b4b/library/std/src/sys_common/backtrace.rs:49:5" " 7: 0x7f0390edbee0 - std::sys_common::backtrace::print::hf666af0b87d2b5ba" " at /rustc/db9d1b20bba1968c1ec1fc49616d4742c1725b4b/library/std/src/sys_common/backtrace.rs:36:9" " 8: 0x7f0390edbee0 - std::panicking::default_hook::{{closure}}::hb4617bd1d4a09097" " at 
/rustc/db9d1b20bba1968c1ec1fc49616d4742c1725b4b/library/std/src/panicking.rs:211:50" " 9: 0x7f0390edb2da - std::panicking::default_hook::h84f684d9eff1eede" " at /rustc/db9d1b20bba1968c1ec1fc49616d4742c1725b4b/library/std/src/panicking.rs:228:9" " 10: 0x7f0390edb2da - std::panicking::rust_panic_with_hook::h8e784f5c39f46346" " at /rustc/db9d1b20bba1968c1ec1fc49616d4742c1725b4b/library/std/src/panicking.rs:606:17" " 11: 0x7f0390f0c416 - std::panicking::begin_panic_handler::{{closure}}::hef496869aa926670" " at /rustc/db9d1b20bba1968c1ec1fc49616d4742c1725b4b/library/std/src/panicking.rs:500:13" " 12: 0x7f0390f0c3b6 - std::sys_common::backtrace::__rust_end_short_backtrace::h8e9b039b8ed3e70f" " at /rustc/db9d1b20bba1968c1ec1fc49616d4742c1725b4b/library/std/src/sys_common/backtrace.rs:139:18" " 13: 0x7f0390f0c372 - rust_begin_unwind" " at /rustc/db9d1b20bba1968c1ec1fc49616d4742c1725b4b/library/std/src/panicking.rs:498:5" " 14: 0x7f03909062c0 - core::panicking::panic_fmt::h568976b83a33ae59" " at /rustc/db9d1b20bba1968c1ec1fc49616d4742c1725b4b/library/core/src/panicking.rs:107:14" " 15: 0x7f039090641c - core::panicking::panic::he2e71cfa6548cc2c" " at /rustc/db9d1b20bba1968c1ec1fc49616d4742c1725b4b/library/core/src/panicking.rs:48:5" " 16: 0x7f0390eb443f - <rustjail::cgroups::fs::Manager as rustjail::cgroups::Manager>::get_stats::h85031fc1c59c53d9" " 17: 0x7f03909c0138 - <core::future::from_generator::GenFuture<T> as core::future::future::Future>::poll::hfa6e6cd7516f8d11" " 18: 0x7f0390d697e5 - <core::future::from_generator::GenFuture<T> as core::future::future::Future>::poll::hffbaa534cfa97d44" " 19: 0x7f039099c0b3 - <core::future::from_generator::GenFuture<T> as core::future::future::Future>::poll::hae3ab083a06d0b4b" " 20: 0x7f0390af9e1e - std::panic::catch_unwind::h1fdd25c8ebba32e1" " 21: 0x7f0390b7c4e6 - tokio::runtime::task::raw::poll::hd3ebbd0717dac808" " 22: 0x7f0390f49f3f - tokio::runtime::thread_pool::worker::Context::run_task::hfdd63cd1e0b17abf" " 23: 0x7f0390f3a599 
- tokio::runtime::task::raw::poll::h62954f6369b1d210" " 24: 0x7f0390f37863 - std::sys_common::backtrace::__rust_begin_short_backtrace::h1c58f232c078bfe9" " 25: 0x7f0390f4f3dd - core::ops::function::FnOnce::call_once{{vtable.shim}}::h2d329a84c0feed57" " 26: 0x7f0390f0e535 - <alloc::boxed::Box<F,A> as core::ops::function::FnOnce<Args>>::call_once::h137e5243c6233a3b" " at /rustc/db9d1b20bba1968c1ec1fc49616d4742c1725b4b/library/alloc/src/boxed.rs:1694:9" " 27: 0x7f0390f0e535 - <alloc::boxed::Box<F,A> as core::ops::function::FnOnce<Args>>::call_once::h7331c46863d912b7" " at /rustc/db9d1b20bba1968c1ec1fc49616d4742c1725b4b/library/alloc/src/boxed.rs:1694:9" " 28: 0x7f0390f0e535 - std::sys::unix::thread::Thread::new::thread_start::h1fb20b966cb927ab" " at /rustc/db9d1b20bba1968c1ec1fc49616d4742c1725b4b/library/std/src/sys/unix/thread.rs:106:17" ``` Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-04-12 11:19:08 +02:00
Zhuoyu Tie	6e79042aa0	runtime: no need to write virtiofsd error to log The scanner reads nothing from the virtiofsd stderr pipe, because the '--syslog' parameter redirects stderr to syslog. So there is no need to write scanner.Text() to the kata log. Fixes: #4063 Signed-off-by: Zhuoyu Tie <tiezhuoyu@outlook.com>	2022-04-12 15:59:57 +08:00
Braden Rayhorn	9b6f24b2ee	agent: add tests for mount_to_rootfs function Add test coverage for mount_to_rootfs function in src/mount.rs. Includes minor refactoring to make function more easily testable. Fixes #4073 Signed-off-by: Braden Rayhorn <bradenrayhorn@fastmail.com>	2022-04-11 21:42:38 -05:00
Braden Rayhorn	c3776b1792	agent: add tests for is_signal_handled function Add test coverage for is_signal_handled function in rpc.rs. Includes refactors to make the function testable and handle additional cases. Fixes #3939 Signed-off-by: Braden Rayhorn <bradenrayhorn@fastmail.com>	2022-04-11 21:23:55 -05:00
Braden Rayhorn	9c22d9554e	agent: add tests for update_container_namespaces Add test coverage for update_container_namespaces function in src/rpc.rs. Includes minor refactor to make function easier to test. Fixes #4034 Signed-off-by: Braden Rayhorn <bradenrayhorn@fastmail.com>	2022-04-11 18:27:30 -05:00
Yibo Zhuang	92c00c7e84	agent: fsGroup support for direct-assigned volume Adding two functions, set_ownership and recursive_ownership_change, to support changing group id ownership for a mounted volume. set_ownership will be called in common_storage_handler after mount_storage performs the mount for the volume. set_ownership will be a no-op if the FSGroup field in the Storage struct is not set, which indicates no chown will be performed. If the FSGroup field is specified, then it will perform the recursive walk of the mounted volume path to change ownership of all files and directories to the desired group id. It will also configure the SetGid bit so that files created in the directory inherit the parent directory's group. If the fsGroupChangePolicy is on root mismatch, then the group ownership change will be skipped if the root directory group id already matches the desired group id and if the SetGid bit is also set on the root directory. This is the same behavior as what Kubelet does today when performing the recursive walk to change ownership. Fixes #4018 Signed-off-by: Yibo Zhuang <yibzhuang@gmail.com>	2022-04-11 08:57:13 -07:00
Yibo Zhuang	532d53977e	runtime: fsGroup support for direct-assigned volume The fsGroup will be specified by the fsGroup key in the direct-assign mountinfo metadata field. This will be set when invoking the kata-runtime binary and providing the key, value pair in the metadata field. Similarly, the fsGroupChangePolicy will also be provided in the mountinfo metadata field. Adding extra fields FsGroup and FSGroupChangePolicy in the Mount construct for container mount, which will be populated when creating block devices by parsing out the mountInfo.json. And in handleDeviceBlockVolume of the kata-agent client, it checks if the mount FSGroup is not nil, which indicates that fsGroup change is required in the guest, and will provide the FSGroup field in the protobuf to pass the value to the agent. Fixes #4018 Signed-off-by: Yibo Zhuang <yibzhuang@gmail.com>	2022-04-11 08:41:13 -07:00
Yibo Zhuang	6a47b82c81	proto: fsGroup support for direct-assigned volume This change adds two fields to the Storage pb: FSGroup, which is a group id that the runtime specifies to indicate to the agent to perform a chown of the mounted volume to the specified group id after mounting is complete in the guest; and FSGroupChangePolicy, which is a policy to indicate whether to always perform the group id ownership change or only if the root directory group id does not match the desired group id. These two fields will allow CSI plugins to indicate to Kata that after the block device is mounted in the guest, group id ownership change should be performed on that volume. Fixes #4018 Signed-off-by: Yibo Zhuang <yibzhuang@gmail.com>	2022-04-11 08:41:13 -07:00
Braden Rayhorn	9d5e7ee0d4	agent: add tests for mount_storage Add test coverage for mount_storage function in src/mount.rs. Fixes: #4068 Signed-off-by: Braden Rayhorn <bradenrayhorn@fastmail.com>	2022-04-10 21:42:20 -05:00
bin	f8cc5d1ad8	kata-monitor: add some links when generating pages for browsers Add some links to rendered webpages for better user experience, let users can jump to pages only by clicking links in browsers. Fixes: #4061 Signed-off-by: bin <bin@hyper.sh>	2022-04-11 09:29:56 +08:00
Fabiano Fidêncio	698e45f403	Merge pull request #4057 from bradenrayhorn/test-parse-mount-flags-and-options agent: add test coverage for parse_mount_flags_and_options function	2022-04-08 14:42:18 +02:00
Fabiano Fidêncio	761e8313de	Merge pull request #3985 from bradenrayhorn/test-do-write-stream agent: add tests for do_write_stream function	2022-04-08 14:34:57 +02:00
Peng Tao	4f551e3428	Merge pull request #4048 from liubin/fix/3303-delete-virtiofsd-debug-option runtime: delete debug option in virtiofsd	2022-04-08 15:42:38 +08:00
Peng Tao	a83a16e32c	Merge pull request #4059 from garrettmahin/test-process-grpc-to-oci rustjail: add test coverage for process_grpc_to_oci function	2022-04-08 15:39:28 +08:00
Peng Tao	95e45fab38	Merge pull request #4053 from ManaSugi/fix-makefile-for-features agent: Allow the agent to be rebuilt with the change of Cargo features	2022-04-08 15:38:25 +08:00
garrettmahin	c31cd0e81a	rustjail: add test coverage for process_grpc_to_oci function Add test coverage for the process_grpc_to_oci function in src/rustjail/lib.rs Fixes #4058 Signed-off-by: Garrett Mahin <garrett.mahin@gmail.com>	2022-04-07 20:50:48 -05:00
Bin Liu	9c1c219a3f	Merge pull request #4007 from liubin/fix/3959-add-csi-rs-to-gitignore protocols: add src/csi.rs to .gitignore	2022-04-08 09:33:04 +08:00
Braden Rayhorn	1118a3d2da	agent: add test coverage for parse_mount_flags_and_options function Add test coverage for the parse_mount_flags_and_options function in src/mount.rs. Fixes #4056 Signed-off-by: Braden Rayhorn <bradenrayhorn@fastmail.com>	2022-04-07 17:46:35 -05:00
bin	9d5b03a1b7	runtime: delete debug option in virtiofsd virtiofsd's debug will be enabled if hypervisor's debug has been enabled, this will generate too many noisy logs from virtiofsd. Unbind the relationship of log level between virtiofsd and hypervisor, if users want to see debug log of virtiofsd, can set it by: virtio_fs_extra_args = ["-o", "log_level=debug"] Fixes: #3303 Signed-off-by: bin <bin@hyper.sh>	2022-04-07 19:55:22 +08:00
Manabu Sugimoto	eff7c7e0ff	agent: Allow the agent to be rebuilt with the change of Cargo features This allows the kata-agent to be rebuilt when Cargo "features" is changed. The Makefile for the agent do not need to specify the sources for prerequisites by having Cargo check for the sources changes. Fixes: #4052 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2022-04-07 20:09:20 +09:00
Greg Kurz	d0d3787233	Merge pull request #3696 from shippomx/main kata-runtime enable hugepage support	2022-04-06 16:47:04 +02:00
Jaylyn Ren	b975f2e8d2	Virtcontainers: Enable hot plugging vhost-user-blk device on ARM The vhost-user-blk can be hotplugged on the PCI bridge successfully on X86, but failed on Arm. However, hotplugging it on Root Port as a PCIe device can work well on ARM. Open the "pcie_root_port" in configuration.toml is needed. Fixes: #4019 Signed-off-by: Jaylyn Ren <jaylyn.ren@arm.com>	2022-04-06 17:37:51 +08:00
bin	962d05ec86	protocols: add src/csi.rs to .gitignore After running make in src/agent, the git working area will be changed: Untracked files: (use "git add <file>..." to include in what will be committed) src/libs/protocols/src/csi.rs The generated file by `build.rs` should be ignored in git. Fixes: #3959 Signed-off-by: bin <bin@hyper.sh>	2022-04-06 09:55:38 +08:00
Fabiano Fidêncio	b39caf43f1	Merge pull request #3923 from Jakob-Naucke/no-initrd-se runtime: Allow and require no initrd for SE	2022-04-05 09:26:07 +02:00
Feng Wang	354cd3b9b6	runtime: Base64 encode the direct volume mountInfo path This is to avoid accidentally deleting multiple volumes. Fixes #4020 Signed-off-by: Feng Wang <feng.wang@databricks.com>	2022-04-04 19:56:46 -07:00
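A minimal Go sketch of why encoding the volume path can matter, under stated assumptions: the commit message above only says the change avoids accidentally deleting multiple volumes. One plausible reading is that joining a volume path directly under a metadata root makes nested volume paths produce nested directories, so removing the parent's directory also removes the child's. The root path and the helper names (`volumeRoot`, `plainDir`, `encodedDir`, `isInside`) are hypothetical and not taken from the Kata sources.

```go
package main

import (
	"encoding/base64"
	"fmt"
	"path/filepath"
	"strings"
)

// volumeRoot is a hypothetical directory that holds per-volume metadata.
const volumeRoot = "/run/example/direct-volumes"

// plainDir joins the volume path directly under the root, so nested volume
// paths yield nested metadata directories.
func plainDir(volumePath string) string {
	return filepath.Join(volumeRoot, volumePath)
}

// encodedDir maps the whole volume path to one flat directory name.
// URL-safe base64 never emits '/', so the name is a single path component.
func encodedDir(volumePath string) string {
	name := base64.URLEncoding.EncodeToString([]byte(volumePath))
	return filepath.Join(volumeRoot, name)
}

// isInside reports whether child lies below parent in the directory tree.
func isInside(parent, child string) bool {
	return strings.HasPrefix(child, parent+string(filepath.Separator))
}

func main() {
	a, b := "/var/lib/vol", "/var/lib/vol/sub"
	fmt.Println(isInside(plainDir(a), plainDir(b)))     // true: removing a's dir would take b's with it
	fmt.Println(isInside(encodedDir(a), encodedDir(b))) // false: the two dirs are siblings
}
```

With the encoded layout, a recursive removal of one volume's directory cannot reach another volume's metadata, whatever the relationship between the original paths.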
Braden Rayhorn	485aeabb6b	agent: add tests for do_write_stream function Add test coverage for do_write_stream function of AgentService in src/rpc.rs. Includes minor refactoring to make function more easily testable. Fixes #3984 Signed-off-by: Braden Rayhorn <bradenrayhorn@fastmail.com>	2022-04-04 08:21:01 -05:00
Archana Shinde	e62bc8e7f3	Merge pull request #3915 from Juneezee/test/t.TempDir test: use `T.TempDir` to create temporary test directory	2022-04-04 01:34:46 -07:00
Fabiano Fidêncio	8980d04e25	Merge pull request #4023 from fidencio/wip/expose-service-offload-option-to-clh clh: Expose service offload configuration	2022-04-01 14:10:33 +02:00
Fabiano Fidêncio	98750d792b	clh: Expose service offload configuration This configuration option is valid for all the hypervisor that are going to be used with the confidential containers effort, thus exposing the configuration option for Cloud Hypervisor as well. Fixes: #4022 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-04-01 11:15:55 +02:00
Bin Liu	416cc90b7a	Merge pull request #3972 from wfly1998/main agent: use ms as unit of cputime instead of ticks	2022-04-01 15:34:06 +08:00
Bin Liu	5d0adb2164	Merge pull request #3995 from wxx213/main agent: fix container stop error with signal SIGRTMIN+3	2022-04-01 11:29:14 +08:00
Wang Xingxing	0d765bd082	agent: fix container stop error with signal SIGRTMIN+3 The nix::sys::signal::Signal package api cannot deal with SIGRTMIN+3, directly use libc function to send the signal. Fixes: #3990 Signed-off-by: Wang Xingxing <stellarwxx@163.com>	2022-03-31 10:49:45 +08:00
Eng Zer Jun	59c7165ee1	test: use `T.TempDir` to create temporary test directory The directory created by `T.TempDir` is automatically removed when the test and all its subtests complete. This commit also updates the unit test advice to use `T.TempDir` to create temporary directory in tests. Fixes: #3924 Reference: https://pkg.go.dev/testing#T.TempDir Signed-off-by: Eng Zer Jun <engzerjun@gmail.com>	2022-03-31 09:31:36 +08:00
snir911	18dc578134	Merge pull request #3999 from fgiudici/kata-monitor_fix_help kata-monitor: fix duplicated output when printing usage	2022-03-30 18:56:59 +03:00
Francesco Giudici	a63bbf9793	kata-monitor: fix duplicated output when printing usage (default: "/run/containerd/containerd.sock") is duplicated when printing kata-monitor usage: [root@kubernetes ~]# kata-monitor --help Usage of kata-monitor: -listen-address string The address to listen on for HTTP requests. (default ":8090") -log-level string Log level of logrus(trace/debug/info/warn/error/fatal/panic). (default "info") -runtime-endpoint string Endpoint of CRI container runtime service. (default: "/run/containerd/containerd.sock") (default "/run/containerd/containerd.sock") the golang flag package takes care of adding the defaults when printing usage. Remove the explicit print of the value so that it would not be printed on screen twice. Fixes: #3998 Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2022-03-30 11:58:53 +02:00
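The duplicated default described above can be reproduced with the standard `flag` package alone. The sketch below is illustrative and not the kata-monitor source: it registers one string flag on a private `FlagSet` and renders the usage text twice, once with the default spelled out in the help string and once without. The helper names `usageText` and `countDefaults` are made up for this example.

```go
package main

import (
	"bytes"
	"flag"
	"fmt"
	"strings"
)

// usageText renders the usage output for a single string flag with the given
// help text and default value.
func usageText(help, def string) string {
	var buf bytes.Buffer
	fs := flag.NewFlagSet("demo", flag.ContinueOnError)
	fs.SetOutput(&buf)
	fs.String("runtime-endpoint", def, help)
	fs.PrintDefaults() // appends (default "...") for non-zero defaults
	return buf.String()
}

// countDefaults counts how often the word "default" appears in the usage text.
func countDefaults(usage string) int {
	return strings.Count(usage, "default")
}

func main() {
	const def = "/run/containerd/containerd.sock"
	help := "Endpoint of CRI container runtime service."

	// Help text that repeats the default by hand: the value shows up twice.
	dup := usageText(fmt.Sprintf("%s (default: %q)", help, def), def)
	// Help text that leaves the default to the flag package: it shows up once.
	fixed := usageText(help, def)

	fmt.Print(dup, fixed)
	fmt.Println(countDefaults(dup), countDefaults(fixed)) // 2 1
}
```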
bin	5e1c30d484	runtime: add logs around sandbox monitor For debugging purposes, add some logs. Fixes: #3815 Signed-off-by: bin <bin@hyper.sh>	2022-03-29 16:59:12 +08:00
bin	fb8be96194	runtime: stop getting OOM events when ttrpc: closed error getOOMEvents is a long-waiting call, it will retry when failed. For cases of agent shutdown, the retry should stop. When the agent hasn't detected agent has died, we can also check whether the error is "ttrpc: closed". Fixes: #3815 Signed-off-by: bin <bin@hyper.sh>	2022-03-29 16:39:01 +08:00
Bin Liu	9495316145	Merge pull request #3962 from yaoyinnan/fix/3750-VirtioMem runtime: Remove the explicit VirtioMem set and fix the comment	2022-03-29 10:20:05 +08:00
yaoyinnan	66f05c5bcb	runtime: Remove the explicit VirtioMem set and fix the comment Modify the 2Mib in the comment to 4Mib. VirtioMem is set by configuration file or annotation. And setupVirtioMem is called only when VirtioMem is true. Fixes: #3750 Signed-off-by: yaoyinnan <yaoyinnan@foxmail.com>	2022-03-28 21:21:38 +08:00
Yu Li	800e4a9cfb	agent: use ms as unit of cputime instead of ticks For the library `procfs`, the unit of values in `CpuTime` is ticks, and we do not know how many ticks per second from metrics because the `tps` in `CpuTime` is private. But there are some implements in `CpuTime` for getting these values, e.g., `user_ms()` for `user`, and `nice_ms()` for `nice`. With these values, accurate time can be obtained. Fixes: #3979 Acked-by: zhaojizhuang <571130360@qq.com> Signed-off-by: Yu Li <liyu.yukiteru@bytedance.com>	2022-03-28 19:30:09 +08:00
Feng Wang	0928eb9f4e	agent: Kill the all the container processes of the same cgroup Otherwise the container process might leak and cause an unclean exit Fixes: #3913 Signed-off-by: Feng Wang <feng.wang@databricks.com>	2022-03-27 10:06:58 -07:00
Jakob Naucke	ff17c756d2	runtime: Allow and require no initrd for SE Previously, it was not permitted to have neither an initrd nor an image. However, this is the exact config to use for Secure Execution, where the initrd is part of the image to be specified as `-kernel`. Require the configuration of no initrd for Secure Execution. Also - remove redundant code for image/initrd checking -- no need to check in `newQemuHypervisorConfig` (calling) when it is also checked in `getInitrdAndImage` (called) - use `QemuCCWVirtio` constant when possible Fixes: #3922 Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2022-03-25 18:36:12 +01:00
Feng Wang	19f372b5f5	runtime: Add more debug logs for container io stream copy This can help debugging container lifecycle issues Fixes: #3913 Signed-off-by: Feng Wang <feng.wang@databricks.com>	2022-03-24 21:35:16 -07:00
Peng Tao	098374b179	Merge pull request #3934 from dcmiddle/fix-agent-check Agent: fix unneeded late initialization lint	2022-03-24 16:02:11 +08:00
David Gibson	c77e34de33	runtime: Move mock hook source src/runtime/virtcontainers/hook/mock contains a simple example hook in Go. The only thing this is used for is for some tests in src/runtime/pkg/katautils/hook_test.go. It doesn't really have anything to do with the rest of the virtcontainers package. So, move it next to the test code that uses it. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2022-03-23 19:37:35 +11:00
David Gibson	86723b51ae	virtcontainers: Remove unused install/uninstall targets We've now removed the need to install the mock hook binary for unit tests. However, it turns out that managing that was the only thing that the install and uninstall targets in the virtcontainers Makefile handled. So, remove them. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2022-03-23 19:37:18 +11:00
David Gibson	0e83c95fac	virtcontainers: Run mock hook from build tree rather than system bin dir Running unit tests should generally have minimal dependencies on things outside the build tree. It definitely shouldn't modify system wide things outside the build tree. Currently the runtime "make test" target does so, though. Several of the tests in src/runtime/pkg/katautils/hook_test.go require a sample hook binary. They expect this hook in /usr/bin/virtcontainers/bin/test/hook, so the makefile, as root, installs the test binary to that location. Go tests automatically run within the package's directory though, so there's no need to use a system wide path. We can use a relative path to the binary build within the tree just as easily. fixes #3941 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2022-03-23 19:34:50 +11:00
Dan Middleton	32131cb8ba	Agent: fix unneeded late initialization lint Clippy v1.58 added needless_late_init Fixes #3933 Signed-off-by: Dan Middleton <dan.middleton@intel.com>	2022-03-22 10:17:24 -05:00
David Gibson	e65db838ff	virtcontainers: Remove VC_BIN_DIR The VC_BIN_DIR variable in the virtcontainers Makefile is almost unused. It's used to generate TEST_BIN_DIR, and it's created in the install target. However, we also create TEST_BIN_DIR, which is a subdirectory of VC_BIN_DIR with mkdir -p, so it will necessarily create VC_BIN_DIR along the way. So we can drop the unnecessary mkdir and expand the definition of VC_BIN_DIR in the definition of TEST_BIN_DIR. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2022-03-22 16:53:59 +11:00
David Gibson	c20ad2836c	virtcontainers: Remove unused Makefile defines The INSTALL_EXEC and UNINSTALL_EXEC definitions from the virtcontainers Makefile (unlike those from the runtime Makefile in the parent directory) are entirely unused. Remove them. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2022-03-22 16:40:57 +11:00
David Gibson	c776bdf4a8	virtcontainers: Remove unused parameter from go-test.sh The check-go-test target passes the path to the mock hook test binary to go-test.sh when it invokes it. But go-test.sh just calls run_go_test from ci/lib.sh, which invokes a script from the tests repo without any parameters. That is, this parameter is ignored anyway, so remove it. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2022-03-22 16:39:22 +11:00
James O. D. Hunt	f8fb0d3bb6	Merge pull request #3322 from Kvasscn/kata_dev_block_driver_option device: using const strings for block-driver option instead of hard coding	2022-03-21 10:56:25 +00:00
Eduardo Lima (Etrunko)	1cad3a4696	agent/random: Ensure data.len > 0 Also adds a test to cover this scenario Signed-off-by: Eduardo Lima (Etrunko) <etrunko@redhat.com>	2022-03-18 15:13:51 -03:00
Eduardo Lima (Etrunko)	33c953ace4	agent: Add test_ressed_rng_not_root Same as previous test, but does not skip if it is not running as root. Signed-off-by: Eduardo Lima (Etrunko) <etrunko@redhat.com>	2022-03-18 15:13:51 -03:00
Wainer dos Santos Moschetta	39a35b693a	agent: Add test to random::reseed_rng() Introduced an unit test for the random::reseed_rng() function. Fixes #291 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2022-03-18 10:23:22 -03:00
Eduardo Lima (Etrunko)	d8f39fb269	agent/random: Rename RNDRESEEDRNG to RNDRESEEDCRNG Make this definition match the one in kernel: `5bfc75d92e/include/uapi/linux/random.h (L38-L39)` Signed-off-by: Eduardo Lima (Etrunko) <etrunko@redhat.com>	2022-03-18 10:23:22 -03:00
Miao Xia	a2f5c1768e	runtime/virtcontainers: Pass the hugepages resources to agent The hugepages resources claimed by containers should be limited by cgroup in the guest OS. Fixes: #3695 Signed-off-by: Miao Xia <xia.miao1@zte.com.cn>	2022-03-15 18:46:08 +08:00
Feng Wang	84aebac327	Merge pull request #3875 from fengwang666/fix-shim-leak runtime: properly handle ESRCH error when signaling container	2022-03-14 12:47:35 -07:00
Feng Wang	aa5ae6b17c	runtime: Properly handle ESRCH error when signaling container Currently kata shim v2 doesn't translate ESRCH signal, causing container fail to stop and shim leak. Fixes: #3874 Signed-off-by: Feng Wang <feng.wang@databricks.com>	2022-03-14 11:03:05 -07:00
James O. D. Hunt	afa090ad7b	Merge pull request #3867 from Shensd/main rustjail: optimization, merged several writelns into one	2022-03-14 10:05:48 +00:00
zhanghj	efa19c41eb	device: use const strings for block-driver option instead of hard coding Currently, the block driver option is specified by hard coding; it is better to use const string variables instead of hard-coded strings. Another modification is to remove duplicate consts for virtio driver in manager.go. Fixes: #3321 Signed-off-by: Jason Zhang <zhanghj.lc@inspur.com>	2022-03-14 09:20:43 +08:00
Jack Hance	92ce5e2dc4	rustjail: optimization, merged several writelns into one Optimized several writelns by merging them into one in src/utils.rs Fixes: #3772 Signed-off-by: Jack Hance <jack.hance@ndsu.edu>	2022-03-11 13:18:58 -06:00
James O. D. Hunt	5d6d39be48	scripts: Change here document delimiters Fix the outstanding scripts using non standard shell here document delimiters. This should have been caught by https://github.com/kata-containers/tests/pull/3937, but there is a bug in the checker which is fixed on https://github.com/kata-containers/tests/pull/4569. Fixes: #3864. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2022-03-10 09:23:37 +00:00
Fabiano Fidêncio	5a7fd943c1	Merge pull request #3838 from bradenrayhorn/get-memory-info-tests agent: add tests for get_memory_info function	2022-03-09 23:21:20 +01:00
Braden Rayhorn	c088a3f3ad	agent: add tests for get_memory_info function Add test coverage for get_memory_info function in src/rpc.rs. Includes some minor refactoring of the function. Fixes #3837 Signed-off-by: Braden Rayhorn <bradenrayhorn@fastmail.com>	2022-03-09 11:34:35 -06:00
Gabriela Cervantes	ffdf961ae9	docs: Update contact link in runtime README This PR updates the contact link in the runtime README document. Fixes #3854 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2022-03-08 16:27:34 +00:00
Julio Montes	293e61dc6e	Merge pull request #3766 from dgibson/hugepages Improve error checking of hugepage allocation	2022-03-08 10:21:57 -06:00
Bin Liu	deb8ce97a8	Merge pull request #3836 from liubin/fix/minor-fix Enhancement: fix comments/logs and delete not used function	2022-03-07 17:26:30 +08:00
bin	b257e0e5ab	rustjail: delete function signal in BaseContainer Function signal in BaseContainer is not used anymore. Fixes: #3835 Signed-off-by: bin <bin@hyper.sh>	2022-03-05 10:33:15 +08:00
bin	d647b28bb8	agent: delete meaningless FIXME comment The test has passed, the FIX comment should be deleted. Fixes: #3835 Signed-off-by: bin <bin@hyper.sh>	2022-03-05 10:33:15 +08:00
bin	1b34494b2f	runtime: fix invalid comments for pkg/resourcecontrol Some comments are copied and not adjusted to the pkg/resourcecontrol package. Fixes: #3835 Signed-off-by: bin <bin@hyper.sh>	2022-03-05 10:32:31 +08:00
Evan Foster	afc567a9ae	storage: make k8s emptyDir creation configurable This change introduces the `disable_guest_empty_dir` config option, which allows the user to change whether a Kubernetes emptyDir volume is created on the guest (the default, for performance reasons), or the host (necessary if you want to pass data from the host to a guest via an emptyDir). Fixes #2053 Signed-off-by: Evan Foster <efoster@adobe.com>	2022-03-04 12:02:42 -08:00
Eric Ernst	1e301482e7	Merge pull request #3406 from fengwang666/direct-blk-assignment Implement direct-assigned volume	2022-03-04 11:58:37 -08:00
Feng Wang	e76519af83	runtime: small refactor to improve readability Remove some confusing/duplicate code so it's more readable Fixes: #3454 Signed-off-by: Feng Wang <feng.wang@databricks.com>	2022-03-04 10:00:52 -08:00
Fabiano Fidêncio	7e5f11a52b	vendor: Update containerd to 1.6.1 Let's bring in the latest release of Containerd, 1.6.1, released on March 2nd, 2022. With this, we take the opportunity to remove containerd/api reference as we shouldn't need a separate module only for the API. Here's the list of changes needed in the code due to the bump: * stop using `grpc.WithInsecure()` as it's been deprecated - use `grpc.WithTransportCredentials(insecure.NewCredentials())` instead Fixes: #3820 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-03-04 10:28:40 +01:00
Fabiano Fidêncio	2af91b23e1	Merge pull request #3281 from jongwu/vcpu_hotplug_arm64 experimentally enable vcpu hotplug and virtio-mem on arm64 in kernel part	2022-03-04 09:14:31 +01:00
Jianyong Wu	42771fa726	runtime: don't set socket and thread for arm/virt As this is just a initial vcpu hotplug support, thread and socket has not been supported. So, don't set socket and thread when hotadd cpu for arm/virt. Fixes: #3280 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2022-03-04 11:22:18 +08:00
Feng Wang	f905161bbb	runtime: mount direct-assigned block device fs only once Mount the direct-assigned block device fs only once and keep a refcount in the guest. Also use the ro flag inside the options field to determine whether the block device and filesystem should be mounted as ro Fixes: #3454 Signed-off-by: Feng Wang <feng.wang@databricks.com>	2022-03-03 18:57:02 -08:00
shuochen0311	27fb490228	agent: add get volume stats handler in agent retrieve the stats of direct-assigned volumes from the guest Fixes: #3454 Signed-off-by: shuochen0311 <shuo.chen@databricks.com>	2022-03-03 18:57:02 -08:00
Feng Wang	ea51ef1c40	runtime: forward the stat and resize requests from shimv2 to kata agent Translate the volume path from host-known path to guest-known path and forward the request to kata agent. Fixes: #3454 Signed-off-by: Feng Wang <feng.wang@databricks.com>	2022-03-03 18:57:02 -08:00
Feng Wang	c39281ad65	runtime: update container creation to work with direct assigned volumes During the container creation, it will parse the mount info file of the direct assigned volumes and update the in memory mount object. Fixes: #3454 Signed-off-by: Feng Wang <feng.wang@databricks.com>	2022-03-03 18:57:02 -08:00
Feng Wang	4e00c2377c	agent: add grpc interface for stat and resize operations Add GetVolumeStats and ResizeVolume APIs for the runtime to query stat and resize fs in the guest. Fixes: #3454 Signed-off-by: Feng Wang <feng.wang@databricks.com>	2022-03-03 18:57:02 -08:00
Feng Wang	e9b5a25502	runtime: add stat and resize APIs to containerd-shim-v2 To query fs stats and resize fs, the requests need to be passed to kata agent through containerd-shim-v2. So we're adding to rest APIs on the shim management endpoint. Also refactor shim management client to its own go file. Fixes: #3454 Signed-off-by: Feng Wang <feng.wang@databricks.com>	2022-03-03 18:56:53 -08:00
Feng Wang	6e0090abb5	runtime: persist direct volume mount info In the direct assigned volume scenario, Kata Containers persists the information required for managing the volume inside the guest on host filesystem. Fixes: #3454 Signed-off-by: Feng Wang <feng.wang@databricks.com>	2022-03-03 15:32:12 -08:00
Feng Wang	fa326b4e0f	runtime: augment kata-runtime CLI to support direct-assigned volume Add commands to add, remove, resize and get stats of a direct-assigned volume. These commands are expected to be consumed by CSI. Fixes: #3454 Signed-off-by: Feng Wang <feng.wang@databricks.com>	2022-03-03 15:32:03 -08:00
Fabiano Fidêncio	a2422cf2a1	Merge pull request #3389 from zhsj/rm-distro-test katatestutils: remove distro constraints	2022-03-03 23:26:58 +01:00
Fabiano Fidêncio	12af632952	Merge pull request #3814 from fidencio/wip/disable-block-device-use-minor-fixes Minor fixes for the `disable_block_device_use` comments	2022-03-03 23:26:05 +01:00
Fabiano Fidêncio	af80473496	clh: stop virtofsd if clh fails to boot up the vm If, for some reason, we're able to launch cloud hypervisor but not able to boot the VM up, the virtiofsd process would be left behind. Let's ensure, via defer, that we stop virtiofsd in case of errors. Fixes: #3819 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-03-03 19:10:37 +01:00
Fabiano Fidêncio	c54bc8e657	Merge pull request #3811 from fidencio/wip/clh-tdx-round-2 clh: tdx: Don't use sharedFS with Confidential Guests	2022-03-03 19:03:28 +01:00
Fabiano Fidêncio	97951a2d12	clh: Don't use SharedFS with Confidential Guests kata-containers/pulls#3771 added TDX support for Cloud Hypervisor, but two big things got overlooked while doing that. 1. virtio-fs, as of now, cannot be part of the trust boundary, so the Confidential Guest will not be using it. 2. virtio-block hotplug should be enabled in order to use virtio-block for the rootfs (used with the devmapper plugin). When trying to use cloud-hypervisor with TDX using virtio-fs, we're facing the following error on the guest kernel: ``` virtiofs virtio2: device must provide VIRTIO_F_ACCESS_PLATFORM ``` After checking and double-checking with virtiofs and cloud-hypervisor developers, it happens as confidential containers might put some limitations on the device, so it can't access all of the guest's memory and that's where this restriction seems to be coming from. Vivek mentioned that virtiofsd does not support VIRTIO_F_ACCESS_PLATFORM (aka VIRTIO_F_IOMMU_PLATFORM) yet, and that for encrypted guests virtiofs may not be the best solution at the moment. @sboeuf put this in a very nice way: "if the virtio-fs driver doesn't support VIRTIO_F_ACCESS_PLATFORM, then the pages corresponding to the virtqueues and the buffers won't be marked as SHARED, meaning the VMM won't have access to it". Interestingly enough, it works with QEMU, and it may be due to some change done on the patched QEMU that @devimc is packaging, but we won't take the path to figure out what the change was and patch cloud-hypervisor in the same way, because of 1. Fixes: #3810 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-03-03 12:49:40 +01:00
Fabiano Fidêncio	c30b3a9ff1	clh: Adding a volume is not supported without SharedFS As mounting volumes into the guest requires SharedFS setup, let's ensure we error out if trying to do so in a situation where SharedFS is not supported. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-03-03 12:49:30 +01:00
Fabiano Fidêncio	f889f1f957	clh: introduce supportsSharedFS() supportsSharedFS() is a new method to be used to ensure that no SharedFS specifics are called when, for a reason or another, Cloud Hypervisor is in a mode where SharedFSs are not supported. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-03-03 12:49:28 +01:00
Fabiano Fidêncio	54d27ed721	clh: introduce loadVirtiofsDaemon() Similarly to the `createVirtiofsDaemon` and `stopVirtiofsDaemon` methods, let's introduce and use loadVirtiofsDaemon, as it'll also be handy later in this series. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-03-03 12:48:38 +01:00
Fabiano Fidêncio	ae2221ea68	clh: introduce stopVirtiofsDaemon() Similarly to the `createVirtiofsDaemon` method, let's introduce and use its counterpart, as it'll also be handy later in this series. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-03-03 12:48:26 +01:00
Fabiano Fidêncio	e8bc26f90d	clh: introduce setupVirtiofsDaemon() Similarly to what's been done with the `createVirtiofsDaemon`, let's create a `setupVirtiofsDaemon` one. It will also become handy later in this series. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-03-03 12:48:14 +01:00
Fabiano Fidêncio	413b3b477a	clh: introduce createVirtiofsDaemon() Let's introduce and use a new `createVirtiofsDaemon` method. Its name says it all, and it'll be handy later in this series when, spoiler alert, SharedFS cannot be used (in such cases as in Confidential Guests). Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-03-03 12:48:02 +01:00
James O. D. Hunt	55cd0c89d8	runtime: Build golang components with extra security options Enable stack protector and fortify source for golang builds. Fixes: #3817. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2022-03-03 10:41:26 +00:00
Fabiano Fidêncio	76e4f6a2a3	Revert "hypervisors: Confidential Guests do not support Device hotplug" This reverts commit `df8ffecde0`, as device hotplug is supported and, more than that, is very much needed when using virtio-blk instead of virtio-fs. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-03-03 09:59:55 +01:00
David Gibson	42e35505b0	agent: Verify that we allocated as many hugepages as we need allocate_hugepages() writes to the kernel sysfs file to allocate hugepages in the Kata VM. However, even if the write succeeds, it's not certain that the kernel will actually be able to allocate as many hugepages as we requested. This patch reads back the file after writing it to check if we were able to allocate all the required hugepages. fixes #3816 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2022-03-03 15:59:45 +11:00
David Gibson	608e003abc	agent: Don't attempt to create directories for hugepage configuration allocate_hugepages() constructs the path for the sysfs directory containing hugepage configuration, then attempts to create this directory if it does not exist. This doesn't make sense: sysfs is a view into kernel configuration, if the kernel has support for the hugepage size, the directory will already be there, if it doesn't, trying to create it won't help. For the same reason, attempting to create the "nr_hugepages" file itself is pointless, so there's no reason to call OpenOptions::create(true). Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2022-03-03 11:24:11 +11:00
Fabiano Fidêncio	fa8b93927c	config: qemu: Fix disable_block_device_use comments virtio-fs, instead of virtio-9p, is the default shared file system type in case virtio-blk is not used. Fixes: #3813 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-03-02 20:43:36 +01:00
Fabiano Fidêncio	9615c8bc9c	config: fc: Don't expose disable_block_device_use Relying on virtio-block is the only way to use Firecracker with Kata Containers, as shared FS (virtio-{fs,fs-nydus,9p}) is not supported by Firecracker. As configuration doesn't make sense to be exposed, we hardcode the `false` value in the Firecracker configuration structure. Fixes: #3813 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-03-02 20:43:28 +01:00
Bin Liu	2ae8bd696a	Merge pull request #3367 from wfly1998/main build: always reset ARCH after getting it	2022-03-02 14:42:45 +08:00
Bin Liu	75877f8793	Merge pull request #3187 from Kvasscn/kata_dev_remove_temp_vsock_dir virtcontainers: remove temp dir created for vsock in test code	2022-03-02 11:05:47 +08:00
Francesco Giudici	7f638dd049	Merge pull request #3764 from Jakob-Naucke/hugepages-test-s390x virtcontainers: Use available s390x hugepages	2022-03-01 14:33:59 +01:00
Fabiano Fidêncio	4ab35b0899	Merge pull request #3796 from jodh-intel/fix-monitor-listen-address Fix monitor listen address	2022-03-01 13:51:01 +01:00
Fabiano Fidêncio	97c17085b0	Merge pull request #3770 from Jakob-Naucke/gofmt-vmm-s390x runtime: Gofmt fixes	2022-03-01 11:34:15 +01:00
James O. D. Hunt	e64c54a2ad	monitor: Listen to localhost only by default Change `kata-monitor` to listen to port `8090` on the local interface only by default. > Note: > > This is a breaking change as previously it listened on all interfaces. Fixes: #3795. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2022-03-01 10:00:43 +00:00
James O. D. Hunt	e6350d3d45	monitor: Fix build options Removed redundant and duplicated build options to build `kata-monitor` the same way as the other components: - `CGO_ENABLED=0` is not necessary. - `-buildmode=exe` is not necessary since `BUILDFLAGS` already sets the build mode. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2022-03-01 10:00:43 +00:00
GabyCT	ccb063b848	Merge pull request #3788 from fidencio/wip/update-clh-confidential-guest-comments Update `confidential_guest` comments	2022-02-28 15:11:01 -06:00
GabyCT	bc1733bb0e	Merge pull request #3774 from egernst/delinux-runtime cleanup runtime pkgs for Darwin build, add basic Darwin build/unit test	2022-02-28 15:08:09 -06:00
Jakob Naucke	eda8ea154a	runtime: Gofmt fixes - Mostly blank lines after `+build` -- see https://pkg.go.dev/go/build@go1.14.15 -- this is, to date, enforced by `gofmt`. - 1.17-style go:build directives are also added. - Spaces in govmm/vmm_s390x.go Fixes: #3769 Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2022-02-28 17:24:47 +01:00
Eric Ernst	e355a71860	container: file is not linux specific This should not be linux specific -- drop restriction. Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2022-02-28 08:01:53 -08:00
Eric Ernst	b31876eefb	device-manager: move linux-only test to a linux-only file We can't Mkdev on Darwin - let's make sure the vfio test is in a linux-only file. Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2022-02-28 08:01:53 -08:00
Eric Ernst	6a5c634490	resourcecontrol: SystemdCgroup check is not necessarily linux specific This utility function is also used to check the spec that will run in the guest - no need for this to be linux specific. Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2022-02-28 08:01:53 -08:00
Eric Ernst	cc58cf6993	resourcecontrol: convert stats dev_t to uint64 types Their types may differ on various host OSes, but unix.Major|Minor always takes a uint64 Depends-on: github.com/kata-containers/tests#4516 Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2022-02-28 08:01:53 -08:00
Eric Ernst	5be188cc29	utils: Add darwin stub Add a stub for utils_darwin to facilitate building this package on Darwin. We can probably drop this empty stub if we have better abstraction for the various parts of virtcontainers that call it today... Fixes: #3777 Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2022-02-28 08:01:53 -08:00
Samuel Ortiz	ad0449195d	virtcontainers: Convert stats dev_t to uint64 We need to convert them to uint64 as their types may differ on various host OSes, but unix.Major|Minor takes a uint64 regardless. Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-28 08:01:53 -08:00
Samuel Ortiz	56751089c0	katautils: Use a syscall wrapper for the hook JSON state There is no real equivalent of a thread ID on Darwin. Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-28 08:01:53 -08:00
Samuel Ortiz	7d64ae7a41	runtime: Add a syscall wrapper package It allows to support syscall variations between host OSes. Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-28 08:01:53 -08:00
Samuel Ortiz	abc681ca5f	katautils: Add Darwin stub for the netNS API And move the current implementation into a Linux only file. Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-28 08:01:53 -08:00
Fabiano Fidêncio	de57466212	config: Expand confidential_guest comments Let's clarify that an error will be reported in case confidential_guest is enabled, but the hardware where Kata Containers is running doesn't provide the required feature set. Fixes: #3787 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-02-28 11:57:42 +01:00
Fabiano Fidêncio	641d475fa6	config: clh: Use "Intel TDX" instead of just "TDX" Let's use "Intel TDX" rather than just "TDX", as it can ease the understanding of the terminology. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-02-28 10:27:21 +01:00
Fabiano Fidêncio	0bafa2def9	config: clh: Mention supported TEEs Let's mention the supported TEEs to be used with confidential guests. Right now, Cloud Hypervisor supports only Intel TDX, used together with TD Shim. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-02-28 10:24:33 +01:00
bin	81ed269ed2	runtime: use Cmd.StdoutPipe instead of self-created pipe Nydusd uses a bufio.Scanner to check if nydusd process has existed, but stderr/stdout passed to Cmd is self-created pipe, this pipe will not be closed if the process start failing. Use standard Cmd.StdoutPipe can close the stdout and kata shim will detect the existence of the nydusd process, then call cmd.Wait to reap the process' resources. Fixes: #3783 Signed-off-by: bin <bin@hyper.sh>	2022-02-28 16:52:49 +08:00
Bin Liu	441fdbaf9f	Merge pull request #3753 from sailorvii/main kata-agent: Fix mismatching error of cgroup and mountinfo.	2022-02-28 16:07:26 +08:00
sailorvii	8edca8bbd1	kata-agent: Fix mismatching error of cgroup and mountinfo. The content about systemd in "/proc/self/cgroup" is as: 1:name=systemd:/kubepods/pod1815643d-3789-4e4e-aaf4-00de024912e1/0e15a65bd5f7b30a0b818d90706212354d8b3f0998a1495473c3be9a24706ccf and in "/proc/self/mountinfo" is as: 30 29 0:26 / /sys/fs/cgroup/systemd rw,nosuid,nodev,noexec,relatime shared:6 - cgroup cgroup rw,xattr,release_agent=/usr/lib/systemd/systemd-cgroups-agent,name=systemd The keys extracted from the two files are the same as "name=systemd". So no need to rename the key to "systemd". Fixes: #3385 Signed-off-by: sailorvii <challengingway@hotmail.com>	2022-02-28 10:03:09 +08:00
Eric Ernst	3997c962c2	Merge pull request #3767 from tanweernoor/02242022-kata-containers-issue-3631 runtime, config: make selinux configurable	2022-02-26 08:44:29 -08:00
Fabiano Fidêncio	a9ba7c132b	clh: Fix typo on HotplugRemoveDevice A copy and paste mistake was made and the error on HotplugRemoveDevice() should be about removal and not about addition. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-02-25 22:35:32 +01:00
Tanweer Noor	082d538cb4	runtime: make selinux configurable removes --tags selinux handling in the makefile (part of it introduced here: `d78ffd6`) and makes selinux configurable via configuration.toml Fixes: #3631 Signed-off-by: Tanweer Noor <tnoor@apple.com>	2022-02-25 10:33:46 -08:00
Fabiano Fidêncio	ea1876f057	Merge pull request #3771 from fidencio/wip/clh-tdx clh: Add TDX support	2022-02-25 18:45:31 +01:00
Samuel Ortiz	1103f5a4d4	virtcontainers: Use FilesystemSharer for sharing the containers files Switching to the generic FilesystemSharer brings 2 major improvements: 1. Remove container and sandbox specific code from kata_agent.go 2. Allow for non Linux implementations to provide ways to share container files and root filesystems with the Kata Linux guest. Fixes #3622 Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-25 17:22:27 +01:00
Samuel Ortiz	533c1c0e86	virtcontainers: Keep all filesystem sharing prep code to sandbox.go With the Linux implementation of the FilesystemSharer interface, we can now remove all host filesystem sharing code from kata_agent and keep it where it belongs: sandbox.go. Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-25 17:22:27 +01:00
Samuel Ortiz	61590bbddc	virtcontainers: Add a Linux implementation for the FilesystemSharer This gathers the current kata agent and container filesystem sharing code into a FilesystemSharer implementation. Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-25 17:22:27 +01:00
Samuel Ortiz	03fc1cbd7e	virtcontainers: Add a filesystem sharing interface Filesystem sharing here means the ability to share some parts of the host filesystem with the guest. It's mostly about sharing files and container bundle root filesystems. In order to allow for different file and rootfs sharing implementations, we define a FilesystemSharer interface. This interface provides a preparation step, where concrete implementations will be able to e.g. prepare the host filesystem. Then it provides 2 methods, one for sharing any file (regular file or a directory) and another one for sharing a container root filesystem. Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-25 17:22:27 +01:00
Fabiano Fidêncio	72434333aa	clh: Add TDX support Let's enable TDX support for Cloud Hypervisor, using td-shim as its desired firmware. Fixes: #3632 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-02-25 16:49:21 +01:00
Fabiano Fidêncio	a13b4d5ad8	clh: Add firmware to the config file "firmware" option was already present for a while, but it's never been exposed to the configuration file before. Let's do it now as it can be used, in combination with the newly added confidential_guest option, to boot a guest VM using the so called `td-shim`[0] with Cloud Hypervisor. [0]: https://github.com/confidential-containers/td-shim Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-02-25 16:49:21 +01:00
Fabiano Fidêncio	a8827e0c78	hypervisors: Confidential Guests do not support NVDIMM NVDIMM is also not supported with Confidential Guests and Virtio Block devices should be used instead. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-02-25 16:49:21 +01:00
Fabiano Fidêncio	f50ff9f798	hypervisors: Confidential Guests do not support Memory hotplug Similarly to VCPUs and Device hotplug, Confidential Guests also do not support Memory hotplug. Let's make it clear in the documentation and guard the code on both QEMU and Cloud Hypervisor side to ensure we don't advertise Memory hotplug as being supported when running Confidential Guests. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-02-25 16:49:21 +01:00
Fabiano Fidêncio	df8ffecde0	hypervisors: Confidential Guests do not support Device hotplug Similarly to VCPUs hotplug, Confidential Guests also do not support Device hotplug. Let's make it clear in the documentation and guard the code on both QEMU and Cloud Hypervisor side to ensure we don't advertise Device hotplug as being supported when running Confidential Guests. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-02-25 16:49:21 +01:00
Fabiano Fidêncio	28c4c044e6	hypervisors: Confidential Guests do not support VCPUs hotplug As confidential guests do not support VCPUs hotplug, let's set the "DefaultMaxVCPUs" value to "NumVCPUs". The reason to do this is to ensure that guests will be started with the correct amount of VCPUs, without giving the guest all the possible VCPUs the host could provide. One clear side effect of this limitation is that workloads that would require more VCPUs in their yaml definition will not run in this scenario. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-02-25 16:49:21 +01:00
Fabiano Fidêncio	29ee870d20	clh: Add confidential_guest to the config file ConfidentialGuest is an option already present and exposed for QEMU, which is used for using Kata Containers together with different sorts of Guest Protections, such as TDX and SEV for x86_64, PEF for ppc64le, and SE for s390x. Right now we error out in case confidential_guest is enabled, as we will be implementing the needed blocks for this as part of this series. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-02-25 16:49:21 +01:00
Fabiano Fidêncio	9621c59691	clh: refactor image / initrd configuration set This is a small code refactor removing dead code, based on the checks already done in the generic hypervisor abstraction. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-02-25 16:49:21 +01:00
Fabiano Fidêncio	dcdc412e25	clh: use common kernel params from the hypervisor code The hypervisor code already defines 3 common kernel root params for the following cases: * NVDIMM * NVDIMM without DAX support * Virtio Block As parameters used for cloud-hypervisor have an overlap with the ones provided by the NVDIMM case, let's take advantage of that. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-02-25 16:49:21 +01:00
Fabiano Fidêncio	4c164afbac	versions: Update Cloud Hypervisor to 5343e09e7b8db Let's bump the Cloud Hypervisor version to 5343e09e7b8db, as that brings a few fixes we're interested in, such as: * hypervisor, vmm: Handle TDX hypercalls with INVALID_OPERAND - https://github.com/cloud-hypervisor/cloud-hypervisor/pull/3723 - This is needed for the TDX support on the cloud hypervisor driver, which is part of this very same series. * openapi: Update the PciBdf types - https://github.com/cloud-hypervisor/cloud-hypervisor/pull/3748 - This is needed due to a change in a DeviceNode field, which would cause a marshalling / demarshalling error when running with a version of cloud-hypervisor that includes the TDX fixes mentioned above. * scripts: dev_cli: Don't quote $features_build * scripts: dev_cli: Add --features option - https://github.com/cloud-hypervisor/cloud-hypervisor/pull/3773 - This is needed due to changes in the scripts used to build Cloud Hypervisor, which are used as part of Kata Containers CIs and github actions. Due to this change, we're also adapting the build scripts as part of this very same commit. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-02-25 16:49:16 +01:00
Jakob Naucke	bbfe7d6591	Merge pull request #3599 from Jakob-Naucke/no-virtio-rng-ccw virtcontainers: Do not add a virtio-rng-ccw device	2022-02-25 15:27:02 +01:00
Francesco Giudici	3da6006de4	Merge pull request #3751 from fgiudici/kata-monitor_issue3705 kata-monitor: fix collecting metrics for sandboxes not started through CRI	2022-02-25 14:53:12 +01:00
Jakob Naucke	b2a65f9031	virtcontainers: Use available s390x hugepages in TestHandleHugepages. On s390x, hugepage sizes must be set at boot, so test with any that are present (default is 1M). Depends-on: github.com/kata-containers/kata-containers#3770 Fixes: #3763 Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2022-02-25 13:11:00 +01:00
Amulyam24	cb4230e60e	runtime: fix package declaration for ppc64le Incorrect package name causes build to fail. Fix it in vm_ppc64le.go Fixes: #3761 Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2022-02-24 15:31:48 +05:30
Eric Ernst	c6cc038364	Merge pull request #3615 from sameo/topic/hypervisor Make the hypervisor framework not Linux specific	2022-02-23 16:02:00 -08:00
Francesco Giudici	fec26f8e51	kata-monitor: trivial: rename symbols & labels We introduced collection of sandboxes metadata from the CRI that will be attached to the sandbox metrics: this will allow to immediately match sandboxes metrics with CRI workloads. Rename the symbols from Kube to CRI as the metadata will be there every time pods are created through CRI, also if kubernetes is not installed (e.g., 'crictl runp'). Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2022-02-23 18:34:32 +01:00
Samuel Ortiz	9fd4e5514f	runtime: Move the resourcecontrol package one layer up And try to reduce the number of virtcontainers packages, step by step. Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-23 15:48:40 +01:00
Samuel Ortiz	823faee83a	virtcontainers: Rename the cgroups package To resourcecontrol, and make it consistent with the fact that cgroups are a Linux implementation of the ResourceController interface. Fixes: #3601 Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-23 15:48:40 +01:00
Samuel Ortiz	0d1a7da682	virtcontainers: Rename and clean the cgroup interface We call it a ResourceController, and we make it not so Linux specific. Now the Linux implementation is the cgroups one. Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-23 15:48:40 +01:00
Samuel Ortiz	ad10e201e1	virtcontainers: cgroups: Move non Linux routine to utils.go Have an OS agnostic file for sharing routines. Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-23 15:48:40 +01:00
Samuel Ortiz	d49d0b6f39	virtcontainers: cgroups: Define a cgroup interface And move the current, Linux-specific implementation into cgroups_linux.go Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-23 15:48:40 +01:00
Francesco Giudici	3ac52e8193	kata-monitor: fix updating sandbox cache at startup We now rely on fs events only to update the sandbox cache. This is not true anyway for sandboxes already present at kata-monitor startup: we just retrieve the list and add them in the cache only when we get their CRI metadata. If CRI metadata is not available we will never add them to the sandbox cache. Fix this by immediately adding the sandboxes we find at startup time to the sandbox cache. Fixes: #3705 Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2022-02-23 11:21:06 +01:00
Francesco Giudici	160bb62138	kata-monitor: bump version to 0.3.0 Since kata-monitor now: - relies on fs events only to update the sandbox cache - adds CRI meta-data as labels (CRI pod name, namespace and uid) it deserves a version bump. Note that while we could let kata-monitor match the runtime version, kata-monitor will usually work flawlessly with different kata shim releases: so it makes sense to keep kata-monitor version separated. Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2022-02-23 11:17:02 +01:00
Fabiano Fidêncio	6a9e5f90f7	Merge pull request #3670 from sameo/topic/nerdctl Support nerdctl OCI hooks	2022-02-22 23:03:33 +01:00
Fabiano Fidêncio	4729fd0fc2	Merge pull request #3736 from liubin/fix/3733-log-events-for-crio shim: log events for CRI-O	2022-02-22 09:19:37 +01:00
bin	f6fc1621f7	shim: log events for CRI-O CRI-O starts the shim process without setting TTRPC_ADDRESS, so the event-forwarding goroutine will get errors. For the CRI-O runtime, we can log the events to the log file instead. Fixes: #3733 Signed-off-by: bin <bin@hyper.sh>	2022-02-22 11:02:50 +08:00
Fabiano Fidêncio	1e9f3c856d	Merge pull request #3553 from fgiudici/kata-monitor_cachefix kata-monitor: simplify sandbox cache management and attach kubernetes POD metadata to metrics	2022-02-21 13:17:22 +01:00
Peng Tao	031da99914	Merge pull request #3687 from luodw/nydus-clh nydus: add lazyload support for kata with clh	2022-02-21 19:31:45 +08:00
luodaowen.backend	3175aad5ba	virtiofs-nydus: add lazyload support for kata with clh As kata with qemu has supported lazyload, so this pr aims to bring lazyload ability to kata with clh. Fixes #3654 Signed-off-by: luodaowen.backend <luodaowen.backend@bytedance.com>	2022-02-19 21:55:31 +08:00
zhanghj	94b831ebf8	virtcontainers: remove temp dir created for vsock in test code remove temp dir generated by mock.GenerateKataMockHybridVSock(). Fixes: #3186 Signed-off-by: zhanghj <zhanghj.lc@inspur.com>	2022-02-19 16:59:15 +08:00
Archana Shinde	7db9bef72c	Merge pull request #3718 from Kvasscn/kata_dev_fix_utils_assert_msg virtcontainers: Remove duplicated assert messages in utils test code	2022-02-18 06:07:16 -08:00
Samuel Ortiz	27de212fe1	runtime: Always add network endpoints from the pod netns As the container runtime, we're never inspecting, adding or configuring host networking endpoints. Make sure we always do that by wrapping addSingleEndpoint calls into the pod network namespace. Fixes #3661 Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-18 10:37:07 +01:00
zhanghj	1cee0a9452	virtcontainers: Remove duplicated assert messages in utils test code Remove duplicated strings in assert.Errorf() and assert.NoErrorf(). Fixes: #3714 Signed-off-by: zhanghj <zhanghj.lc@inspur.com>	2022-02-18 16:45:05 +08:00
Samuel Ortiz	77c29bfd3b	container: Remove VFIO lazy attach handling With the recently added VFIO fixes and support, we should not need that anymore. Fixes #3108 Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-17 08:39:44 +01:00
Fupan Li	8694af6d92	Merge pull request #3657 from liubin/fix/3656-add-make-check-for-tools trace-forwarder/agent-ctl: run cargo fmt/clippy in make check	2022-02-17 10:05:16 +08:00
GabyCT	ced5e910d5	Merge pull request #3558 from jodh-intel/docs-rework-readme docs: Improve top-level README	2022-02-16 16:28:14 -06:00
Fabiano Fidêncio	6f9685fbf5	Merge pull request #3624 from mdlayher/mdl-vsock runtime: use github.com/mdlayher/vsock@v1.1.0	2022-02-16 23:11:47 +01:00
Samuel Ortiz	26b3f0017c	virtcontainers: Split hypervisor into Linux and OS agnostic bits Keep all the OS agnostic bits in the hypervisor.go and hypervisor_ARCH.go files. Fixes #3614 Signed-off-by: Eric Ernst <eric_ernst@apple.com> Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-16 19:15:31 +01:00
Samuel Ortiz	fa0e9dc6b1	virtcontainers: Make all Linux VMMs only build on Linux Some of them (e.g. QEMU) can run on other OSes (e.g. Darwin) but the current virtcontainers implementation is Linux specific. Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-16 19:07:34 +01:00
Samuel Ortiz	c91035d0e1	virtcontainers: Move non QEMU specific constants to hypervisor.go Hotplugging errors and 9pfs size are not particularly QEMU specific. Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-16 19:07:34 +01:00
Samuel Ortiz	10ae05914c	virtcontainers: Move guest protection definitions to hypervisor.go They're not QEMU specific, other VMMs may implement support for it. Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-16 19:07:31 +01:00
Samuel Ortiz	b28d0274ff	virtcontainers: Make max vCPU config less QEMU specific Even though it's still actually defined as the QEMU upper bound, it's now abstracted away through govmm. Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-16 19:06:32 +01:00
Samuel Ortiz	a5f6df6a49	govmm: Define the number of supported vCPUs per architecture Based on what QEMU supports on those architectures. Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-16 19:06:32 +01:00
Fabiano Fidêncio	be2e90469a	Merge pull request #3669 from fidencio/wip/virtiofsd-use-announce-submounts virtiofsd: Use "-o announce_submounts"	2022-02-16 16:43:18 +01:00
James O. D. Hunt	9818cf7196	docs: Improve top-level and runtime README Various improvements to the top-level README file: - Moved the following sections from the runtime's README to the top-level README: - License - Platform support / Hardware requirements - Added the following sections to the top-level README: - Configuration - Hypervisors - Improved formatting of the Documentation section in the top-level README. - Removed some unused named links from the top-level README. Also improvements to the runtime README: - Removed confusing mention of the old 1.x runtime name. - Clarify the binary name for the 2.x runtime and the utility program. > Note: > > We cannot currently link to the AMD website as that site's > configuration causes the CI static checks to fail. See > https://github.com/kata-containers/tests/issues/4401 Fixes: #3557. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2022-02-16 09:52:48 +00:00
bin	36c3fc12ce	agent: support hugepages for containers Mount hugepage directories and configure the requested number of hugepages dynamically by writing to sysfs files Port from: `78b307b5bd` Fixes: #3342 Signed-off-by: Pradipta Banerjee <pradipta.banerjee@gmail.com> Signed-off-by: bin <bin@hyper.sh>	2022-02-16 15:14:53 +08:00
bin	81a8baa5e5	runtime: add hugepages support Add hugepages support, port from: `b486387cba` Signed-off-by: Pradipta Banerjee <pradipta.banerjee@gmail.com> Signed-off-by: bin <bin@hyper.sh>	2022-02-16 15:14:53 +08:00
bin	7df677c01e	runtime: Update calculateSandboxMemory to include Hugepages Limit Support hugepages and port from: `96dbb2e8f0` Fixes: #3342 Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com> Signed-off-by: Pradipta Banerjee <pradipta.banerjee@gmail.com> Signed-off-by: bin <bin@hyper.sh>	2022-02-16 15:14:37 +08:00
bin	72bf5496fd	agent: handle hook process result Currently the hook process is handled by just calling unwrap() on it, which sometimes causes a panic. Handling every Result type and checking the error avoids the panic. Fixes: #3649 Signed-off-by: bin <bin@hyper.sh>	2022-02-15 19:01:54 +01:00
bin	80e8dbf1f5	agent: valid envs for hooks Envs containing a null byte cause running hooks to panic; this commit filters envs and passes only valid ones to hooks. Fixes: #3667 Signed-off-by: bin <bin@hyper.sh>	2022-02-15 19:01:54 +01:00
Samuel Ortiz	4f96e3eae3	katautils: Pass the nerdctl netns annotation to the OCI hooks We need to let nerdctl know which namespace to use when calling the selected CNI plugin. See https://github.com/containerd/nerdctl/issues/787 Fixes: #1935 Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-15 18:11:23 +01:00
Samuel Ortiz	a871a33b65	katautils: Run the createRuntime hooks The preStart hooks are being deprecated over the createRuntime ones. Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-15 17:31:56 +01:00
Samuel Ortiz	d9dfce1453	katautils: Run the preStart hook in the host namespace The OCI spec is very specific about it: "The prestart hooks MUST be executed in the runtime namespace." Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-15 17:31:56 +01:00
Samuel Ortiz	6be6d0a3b3	katautils: Pass the OCI annotations back to the called OCI hooks That allows us to amend those annotations with information that could be used when running those hooks. For example nerdctl will use those annotations to resolve the networking namespace path in where to run the CNI plugin, i.e. the created pod networking namespace. Fixes #3629 Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-15 17:31:56 +01:00
Fabiano Fidêncio	4bd945b67b	virtiofsd: Use "-o announce_submounts" German Maglione, one of the current virtio-fs developers, has brought to our attention that using "announce-submounts" could help us to prevent inode number collisions. This feature was introduced a year ago or so by Hanna Reitz as part of the 08dce386e77eb9ab044cb118e5391dc9ae11c5a8, and as we already mandate QEMU >= 6.1.0, let's take advantage of that. Fixes: #3507 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-02-15 08:52:03 +01:00
Yu Li	37df1678ae	build: always reset ARCH after getting it When building with `ARCH=x86_64`, the previous `Makefile` will use it without checking and cause: Makefile:319: *** "ERROR: No hypervisors known for architecture x86_64 (looked for: acrn firecracker qemu cloud-hypervisor)". Stop. This commit fixes the above issue by checking `ARCH` no matter where it is assigned. Fixes: #3444 Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com> Signed-off-by: Yu Li <liyu.yukiteru@bytedance.com>	2022-02-15 14:26:34 +08:00
Shengjing Zhu	3a641b56f6	katatestutils: remove distro constraints The distro constraint parses os release files, which may not contain distro version(VERSION_ID field), for example rolling release distributions like Debian testing, archlinux. These distro constraints are not used anyway, so removing them instead of fixing the complex version detection. Fixes: #1864 Signed-off-by: Shengjing Zhu <zhsj@debian.org>	2022-02-15 02:11:52 +08:00
Fabiano Fidêncio	90fd625d0c	versions: Update Cloud Hypervisor to 55479a64d237 Let's update cloud-hypervisor to a version that exposes the TDX support via the OpenAPI's auto-generated code. Fixes: #3663 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-02-14 17:32:30 +01:00
James O. D. Hunt	8f80dffead	Merge pull request #3648 from yaoyinnan/index-in-for runtime: The index variable is initialized multiple times in for	2022-02-14 12:36:46 +00:00
bin	734b618c16	agent-ctl: run cargo fmt/clippy in make check Run cargo fmt/clippy in make check and clear clippy warnings. Fixes: #3656 Signed-off-by: bin <bin@hyper.sh>	2022-02-14 20:12:57 +08:00
bin	12c37fafc5	trace-forwarder: add make check for Rust Add make check to run cargo fmt/clippy for Rust projects. Fixes: #3656 Signed-off-by: bin <bin@hyper.sh>	2022-02-14 20:12:48 +08:00
Bin Liu	cf53ec2c71	Merge pull request #2977 from luodw/support_nydus feature(nydusd): add nydusd support to introduce lazyload ability	2022-02-14 13:08:50 +08:00
Eric Ernst	172fac5cc8	Merge pull request #3613 from hxtmdev/markdown-relative docs: Fix relative links in Markdown	2022-02-13 21:01:41 -08:00
Matt Layher	c1ce67d905	runtime: use github.com/mdlayher/vsock@v1.1.0 Fixes #3625 Signed-off-by: Matt Layher <mdlayher@gmail.com>	2022-02-12 19:57:15 -05:00
yaoyinnan	42a878e6c1	runtime: The index variable is initialized multiple times in for Change the variables `mountTypeFieldIdx := 8`, `mntDestIdx := 4` and `netNsMountType := "nsfs"` to const. And unify the variable naming style, modify `mntDestIdx` to `mountDestIdx`. Fixes: #3646 Signed-off-by: yaoyinnan <yaoyinnan@foxmail.com>	2022-02-12 11:10:10 +08:00
luodaowen.backend	2d9f89aec7	feature(nydusd): add nydusd support to introduce lazyload ability Pulling the image is the most time-consuming step in the container lifecycle. This PR introduces nydus to Kata Containers; it can lazily pull the image when the container starts, which speeds up container creation and start. Fixes #2724 Signed-off-by: luodaowen.backend <luodaowen.backend@bytedance.com>	2022-02-11 21:41:17 +08:00
Daniel Höxtermann	b19b6938a8	docs: Fix relative links in Markdown Relative links within this repository allow for easier navigation to the corresponding file / directory in the current commit / for the selected version. Link text was slightly changed / fixed in - docs/Unit-Test-Advice.md - docs/how-to/how-to-run-docker-with-kata.md Fixes #3045 Signed-off-by: Daniel Höxtermann <daniel@hxtm.dev>	2022-02-11 13:49:42 +01:00
David Gibson	9590874d9c	device: Update PCIDEVICE_ environment variables for the guest In commit 78dff468bf1 we introduced logic to rewrite PCIDEVICE_ environment variables for the container so that they contain correct addresses for the Kata VM rather than for the host. Unfortunately, we never actually invoked the function to do this. It turns out we need to do this not only at container creation time, but also for environment variables supplied to processes exec-ed into the container after creation (e.g. with crictl exec). Add calls to make both those updates. fixes #3634 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2022-02-11 13:46:36 +11:00
David Gibson	7b7f426a3f	device: Keep host to VM PCI mapping persistently add_devices() generates a mapping of host to guest PCI addresses which is used to update some environment variables for the workload. Currently it just does this locally, but it turns out we're going to need the same map again in order to correct environment variables for processes exec-ed into the existing container. Move the map to the sandbox structure so we can keep it around for those later uses. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2022-02-11 13:46:17 +11:00
David Gibson	0b2bd64124	device: Rework update_spec_pci() to update_env_pci() This function updates PCIDEVICE_ environment variables (such as those supplied by the Kubernetes SR-IOV plugin) in the OCI spec to be correct for the Kata VM, rather than for the host. We neglected to actually call this function, however, and it turns out that when we do, we need to do things slightly differently. We actually need to adjust environment variables both in the OCI spec when creating a container and also in the variables supplied for exec-ing a new process within an existing container. Adjust the function so that it can be used for both these cases. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2022-02-11 13:46:05 +11:00
Julio Montes	982f14fa66	runtime: support QEMU SGX Enable SGX in QEMU when `sgx.intel.com/epc` annotation is defined fixes #3436 Signed-off-by: Julio Montes <julio.montes@intel.com>	2022-02-10 09:45:48 -06:00
Samuel Ortiz	07b9d93f5f	virtcontainer: Simplify the sandbox network creation flow We don't need to call NewNetwork() twice, and we can have the VM factory case return immediately. That makes the code more readable. Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-08 22:27:53 +01:00
Samuel Ortiz	2c7087ff42	virtcontainers: Make all endpoints Linux only All of the networking endpoints are Linux specific. Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-08 22:27:53 +01:00
Samuel Ortiz	49d2cde1e2	virtcontainers: Split network tests into generic and OS specific parts Some unit tests are generic while others, mostly because they depend on netlink, are Linux specific. Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-08 22:27:53 +01:00
Samuel Ortiz	0269077ebf	virtcontainers: Remove the netlink package dependency from network.go Move the netlink dependent code into network_linux.go. Other OSes will have to provide the same functions. Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-08 22:27:53 +01:00
Samuel Ortiz	7fca5792f7	virtcontainers: Unify Network endpoints management interface And only have AddEndpoints/RemoveEndpoints for all cases (single endpoint vs all of them, hotplug or not). Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-08 22:27:53 +01:00
Samuel Ortiz	c67109a251	virtcontainers: Remove the Network PostAdd method It's used once by the sandbox code and can be implemented directly there. Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-08 22:27:53 +01:00
Samuel Ortiz	e0b264430d	virtcontainers: Define a Network interface And move the Linux implementation into a GOOS specific file. Fixes #3005 Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-08 22:27:53 +01:00
Samuel Ortiz	5e119e90e8	virtcontainers: Rename the Network structure fields and methods We are converting the Network structure into an interface, so that different host OSes can have different networking implementations for Kata. One step into that direction is to rename all the Network structure fields and methods to something that is less Linux networking namespace specific. This will make the Network interface naming consistent. Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-08 22:27:53 +01:00
Samuel Ortiz	b858d0dedf	virtcontainers: Make all Network fields private Prepare for making it a real interface. Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-08 22:27:53 +01:00
Samuel Ortiz	49eee79f5f	virtcontainers: Remove the NetworkNamespace structure It is now replaced with a single Network structure Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-08 22:27:53 +01:00
Samuel Ortiz	844eb61992	virtcontainers: Have CreateVM use a Network reference We are replacing the NetworkingNamespace structure with the Network one, so we should have the hypervisor interface switching to it as well. Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-08 22:27:53 +01:00
Samuel Ortiz	d7b67a7d1a	virtcontainers: Network API cleanups and simplifications Remove unused parameters. Reduce the number of parameters by deriving some of them (e.g. a networking config) from their outer structure (e.g. a Sandbox reference). Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-08 22:27:53 +01:00
Samuel Ortiz	2edea88369	virtcontainers: Make the Network structure manage endpoints Endpoint creation, attachment and hotplug are bound to the networking namespace described through the Network structure. Making them Network methods is natural and simplifies the code. Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-08 22:27:53 +01:00
Samuel Ortiz	8f48e28325	virtcontainers: Expand the Network structure For simplicity's sake, there should only be one networking structure per sandbox, as opposed to two (Network and NetworkingNamespace) currently. This commit starts expanding the Network structure in order to eventually make it the single representation of a virtcontainers sandbox networking. Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2022-02-08 22:27:53 +01:00
Pierre Kohler	5ef522f7c3	runtime: check kvm module `sev` correctly Runtime now accepts both `1` and `Y` as valid values for kvm_amd module parameter kvm_amd.sev. Fixes #3273 Signed-off-by: Pierre Kohler <pierre.kohler@cysec.systems>	2022-02-07 23:48:47 +01:00
Eric Ernst	e8eb5e8295	Merge pull request #3609 from egernst/rootless-linux virtcontainers: Split the rootless package into OS specific parts	2022-02-03 12:19:31 -08:00
GabyCT	3603105669	Merge pull request #3584 from devimc/2022-01-31/splitTDVF runtime: support split firmware	2022-02-03 10:24:20 -06:00
Jakob Naucke	7ffe9e5198	virtcontainers: Do not add a virtio-rng-ccw device On s390x, skip adding a virtio-rng device. The on-chip CPACF provides entropy instead. For Confidential Containers, when using Secure Execution, entropy attacks on virtio-rng are mitigated. Fixes: #3598 Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2022-02-02 17:06:20 +01:00
Fabiano Fidêncio	6d6748afd7	Merge pull request #3351 from Bevisy/main-2610-fix-args agent: Fix execute_hook() args error	2022-02-02 09:45:25 +01:00
Julio Montes	1f29478b09	runtime: support split firmware Firmware can be split into FIRMWARE_VARS.fd (UEFI variables as configuration) and FIRMWARE_CODE.fd (UEFI program image). UEFI variables can be customized per user while the UEFI code is kept the same. fixes #3583 Signed-off-by: Julio Montes <julio.montes@intel.com>	2022-02-01 13:40:19 -06:00
Peng Tao	732c45de94	Merge pull request #3567 from jodh-intel/ch-enable-initrd virtcontainers: Enable initrd for Cloud Hypervisor	2022-01-29 14:23:32 +08:00
bin	bcce1a1911	versions: update Rust to 1.58.1 Update Rust to 1.58.1 to fix CVE-2022-21658. Fixes: #3570 Signed-off-by: bin <bin@hyper.sh>	2022-01-29 11:35:56 +08:00
Samuel Ortiz	14e7f52a91	virtcontainers: Split the rootless package into OS specific parts Move the netns specific bits into a Linux specific file. Fixes: #3607 Signed-off-by: Samuel Ortiz <s.ortiz@apple.com> Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2022-01-28 16:20:28 -08:00
James O. D. Hunt	7c956e0d27	virtcontainers: Enable initrd for Cloud Hypervisor Since CH has supported booting with an initramfs since version 0.7.0 [1], allow an `initrd=` to be specified. Fixes: #3566. [1] - https://github.com/cloud-hypervisor/cloud-hypervisor/releases/tag/v0.7.0 Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2022-01-28 10:49:10 +00:00
Eric Ernst	a5ebeb96c1	Merge pull request #2941 from egernst/sandbox-sizing-feature Sandbox sizing feature	2022-01-27 09:37:57 -08:00
Eric Ernst	8cde54131a	runtime: introduce static sandbox resource management There are software and hardware architectures which do not support dynamically adjusting the CPU and memory resources associated with a sandbox. Today, these rely on the "default CPU" and "default memory" configuration options for the runtime, either set by annotation or by the configuration toml on disk. In the case of a single container (launched by ctr, or something like "docker run"), we could allow for sizing the VM correctly, since all of the information is already available to us at creation time. In the sandbox / pod container case, the upper layer container runtime (i.e., containerd or crio) could send a specific annotation indicating the total workload resource requirements associated with the sandbox creation request. In the case of sizing information not being provided, we will follow the same behavior as today: start the VM with (just) the default CPU/memory. If this information is provided, we'll track this as Workload specific resources, and track default sizing information as Base resources. We will update the hypervisor configuration to utilize Base+Workload resources, thus starting the VM with the appropriate amount of CPU and memory. In this scenario (we start the VM with the "right" amount of CPU/Memory), we do not want to update the VM resources when containers are added, or adjusted in size. This functionality is introduced behind a configuration flag, `static_sandbox_resource_mgmt`. This is defaulted to false for all configurations except Firecracker, which is set to true. This'll greatly improve UX for folks who are utilizing Kata with a VMM or hardware architecture that doesn't support hotplug. Note, users will still be unable to do in place vertical pod autoscaling or other dynamic container/pod sizing with this enabled. Fixes: #3264 Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2022-01-26 09:04:38 -08:00
Eric Ernst	c3e97a0a22	config: updates to configuration clh, fc toml template There's some cruft -- let's update to reflect reality, and ensure that we match what is expected. Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2022-01-26 09:45:50 -08:00
Francesco Giudici	ab447285ba	kata-monitor: add kubernetes pod metadata labels to metrics Add the POD metadata we get from the container manager to the metrics by adding more labels. Fixes: #3551 Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2022-01-26 13:48:45 +01:00
Francesco Giudici	834e199eee	kata-monitor: drop unused functions Drop the functions we are not using anymore. Update the tests too. Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2022-01-26 13:48:45 +01:00
Francesco Giudici	7516a8c51b	kata-monitor: rework the sandbox cache sync with the container manager Kata-monitor detects started and terminated kata pods by monitoring the vc/sbs fs (this makes sense since we have to access that path anyway to reach the sockets there and get the metrics from the shim). Currently, while kata-monitor updates its sandbox cache based on the sbs fs events, it also schedules a sync with the container manager via the CRI in order to sync the list of sandboxes there. The container manager is treated as the ultimate source of truth, so we stick with its response, removing the sandboxes it does not report. However, it may happen that when we check the container manager, the new kata pod is not reported yet, and we remove it from the kata-monitor pod cache. If no other kata pod is then added or removed, we will not check with the container manager again, and so never report metrics for that kata pod. Let's stick with the sbs fs as the source of truth instead: we will update the cache just following what happens on the sbs fs. At this point we could also have decided to drop the container manager connection, but it is better to keep it in order to get the kube pod metadata from it, i.e., the kube UID, Name and Namespace associated with the sandbox. Every time we get a new sandbox from the sbs fs we will try to retrieve the pod metadata associated with it. Right now we just attach the container manager sandbox id as a label to the exposed metrics, making it hard to link the metrics to the running pod in the kubernetes cluster. With the kubernetes pod metadata we will be able to add them as labels to map the metrics explicitly to the kubernetes workloads. Fixes: #3550 Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2022-01-26 13:48:45 +01:00
Francesco Giudici	e78d80ea0d	kata-monitor: silently ignore CHMOD events on the sandboxes fs We currently WARN about unexpected fs events, which includes CHMOD operations (which should be actually expected...). Just ignore all the fs events we don't care about without any warn. We dump all the events with debug log in any case. Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2022-01-26 13:48:45 +01:00
Francesco Giudici	e9eb34cea8	kata-monitor: improve debug logging Improve debug log formatting of the sandbox cache update process. Move raw and tracing logs from the DEBUG to the TRACE log level. Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2022-01-26 13:48:45 +01:00
Fabiano Fidêncio	f7c7dc8d33	Merge pull request #3504 from Jakob-Naucke/s390x-govmm-tests Fix and re-enable s390x GoVMM tests	2022-01-26 12:57:38 +01:00
Archana Shinde	081a235efe	Merge pull request #3540 from bradenrayhorn/fix-negative-memory-limit runtime: fix handling container spec's memory limit	2022-01-25 05:17:05 -08:00
Braden Rayhorn	fc0e095180	runtime: fix handling container spec's memory limit The OCI container spec specifies that a limit of -1 signifies unlimited memory. Update the sandbox memory calculator to reflect this part of the spec. Fixes: #3512 Signed-off-by: Braden Rayhorn <bradenrayhorn@fastmail.com>	2022-01-24 13:30:32 -06:00
Jakob Naucke	016569fd8e	Merge pull request #3476 from bergwolf/runtime-dep runtime: update runc and image-spec dependencies	2022-01-24 15:53:43 +01:00
Binbin Zhang	4fc4c76b87	agent: Fix execute_hook() args error 1. The hook.args[0] is the hook binary name which shouldn't be included in the Command.args. 2. Add new unit tests Fixes: #2610 Signed-off-by: Binbin Zhang <binbin36520@gmail.com>	2022-01-24 14:13:24 +08:00
Peng Tao	5643c6dcae	runtime: update runc and image-spec dependencies To address two depbot security warnings. Fixes: #3475 Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2022-01-24 11:49:05 +08:00
Fabiano Fidêncio	8a8ae8aae7	Merge pull request #3531 from egernst/test-lint agent: resolve unused variables in tests	2022-01-21 21:57:13 +01:00
Bo Chen	94b343492d	Merge pull request #3520 from likebreath/0120/clh_v21.0 Upgrade to Cloud Hypervisor v21.0	2022-01-21 08:08:13 -08:00
Jakob Naucke	2f37165f46	govmm: Unite VirtioNet tests no explicit PCI test, just switch path depending on architecture (CCW for s390x, PCI for others). Also fixes an unknown variable error. Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2022-01-21 13:00:05 +01:00
Jakob Naucke	4a428fd1c5	govmm: readonly=on in s390x blkdev test Forgotten in `b17f07395c`, also fixes a test. Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2022-01-21 13:00:05 +01:00
Jakob Naucke	79ecebb280	govmm: TestAppendPCIBridgeDevice et al. on !s390x s390x uses CCW, also fixes a lint failure about undeclared variables on s390x. Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2022-01-21 13:00:05 +01:00
Jakob Naucke	dc285ab1d7	govmm: Remove unnecessary comma in iommu_platform in FSDevice.QemuParams for VirtioCCW. Forgotten in `ff34d283db`, also fixes a test. Fixes: #3500 Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2022-01-21 13:00:05 +01:00
Jakob Naucke	d23f2eb0f0	govmm: Revert "govmm: s390x: Skip broken tests" This reverts commit `5ce9011a36`. Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2022-01-21 13:00:05 +01:00
Amulya Meka	f52ce302bc	runtime: rectify passing empty options to -ldflags When no options are passed to -ldflags, it passes incorrect values(in this case, $BUILDFLAGS) to it. Fix passing empty values by passing $KATA_LDFLAGS in quotes. Fixes: #3521 Signed-off-by: Amulya Meka <amulmek1@in.ibm.com>	2022-01-21 06:57:52 +00:00
Tim Zhang	eac003462d	Merge pull request #3370 from lifupan/fix_namespace agent: fix the issue of creating new namespaces for agent	2022-01-21 10:25:43 +08:00
Bo Chen	2d799cbfa3	virtcontainers: clh: Re-generate the client code This patch re-generates the client code for Cloud Hypervisor v21.0. Note: The client code of cloud-hypervisor's (CLH) OpenAPI is automatically generated by openapi-generator [1-2]. [1] https://github.com/OpenAPITools/openapi-generator [2] https://github.com/kata-containers/kata-containers/blob/main/src/runtime/virtcontainers/pkg/cloud-hypervisor/README.md Signed-off-by: Bo Chen <chen.bo@intel.com>	2022-01-20 17:48:10 -08:00
Fabiano Fidêncio	5ce9011a36	govmm: s390x: Skip broken tests For now a bunch of tests are simply not working. Let's skip them all, and re-enable them once kata-containers/kata-containers/issues/3500 gets fixed. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-01-20 01:04:35 +01:00
Fabiano Fidêncio	8bcaed0b4f	govmm: Adapt license headers to kata-containers Both projects follow the same license, Apache-2.0, but the header saying that comes from govmm is different from the one expected for the tests present on the kata-containers repo. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-01-19 18:02:46 +01:00
Fabiano Fidêncio	6dd6577986	govmm: Ignore govet checks, at least for now govet checks have been ignored on the govmm repo, but they are enabled on the kata-containers one. So, in order to avoid failing our CIs let's just keep ignoring the checks for the govmm structs and have an issue opened for fixing it whenever someone has cycles to do it. The important bit here is, we're not making anything worse than it already is. :-) Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-01-19 18:02:46 +01:00
Fabiano Fidêncio	de678a3aaa	govmm: Remove non-relevant top files govmm, from now on, should follow the same guidelines for contributing, copying, and so on as kata-containers does. The go.mod is not needed anymore as the project lives inside the runtime. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-01-19 18:02:46 +01:00
Fabiano Fidêncio	ec6655af87	govmm: Use govmm from our own pkg Let's stop using govmm from kata-containers/govmm and let's start using it from our own repo. Fixes: #3495 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-01-19 18:02:46 +01:00
Fabiano Fidêncio	fb7f98bd2e	Merge govmm into kata-containers	2022-01-19 09:40:15 +01:00
Julio Montes	c0e28b54a1	Merge pull request #3460 from devimc/2021-01-17/vendorGovmm vendor: update govmm	2022-01-18 15:54:11 -06:00
Julio Montes	49223e67af	runtime: remove enable_swap option The `enable_swap` option was added a long time ago to add `-realtime mlock=off` to QEMU's command line. Kata now supports QEMU 6, the `-realtime` option has been deprecated, and `mlock=on` is causing unexpected behaviors in Kata. This patch removes support for `enable_swap`, `-realtime` and `mlock=` since they are causing bugs in Kata. Signed-off-by: Julio Montes <julio.montes@intel.com>	2022-01-18 11:12:29 -06:00
Jakob Naucke	5285ac2b57	runtime: -Wl,--s390-pgste for s390x for linking. Required for basic KVM checks on some kernels (e.g. the one RHEL is currently shipping), cf. `6621441db5/target/s390x/kvm/meson.build (L15-L16)`. Fixes: #3469 Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2022-01-18 11:32:03 +01:00
Julio Montes	41e0c414a4	vendor: update govmm Bring SGX support and other fixes. Shortlog: `8939b0f` qemu: add support for SGX; `b17f073` qemu: update readonly flag for block devices; `f971801` qemu: only set wait parameter for server mode socket based char device; `82cc01d` qemu: Fix 32 bit int overflow in test file; `1d1a231` qemu: Add support for legacy serial device; `9a2bbed` qemu: Remove -realtime in favor of -overcommit; `fe83c20` qemu: Add support for --no-shutdown Knob; `1ed5271` qmp: wait for POWERDOWN event in ExecuteSystemPowerdown(). fixes #3080 Signed-off-by: Julio Montes <julio.montes@intel.com>	2022-01-17 09:20:47 -06:00
Eric Ernst	9277317098	agent: resolve unused variables in tests A few tests have unused or unread variables. Let's clean these up... Fixes: #3530 Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2022-01-16 14:09:03 -08:00
Sebastian Hasler	adffd3f8b6	scripts: Use shebang /usr/bin/env bash Not all distros have `/bin/bash`, e.g. NixOS. Fixes: #3450 Signed-off-by: Sebastian Hasler <sebastian.hasler@stuvus.uni-stuttgart.de>	2022-01-13 22:53:28 +01:00
liangxianlong	878ab93c15	runtime: Provide protection for shared data The k.reqHandlers should be protected by locks when used Fixes #3440 Signed-off-by: liangxianlong <liang.xianlong@zte.com.cn>	2022-01-13 14:48:10 +08:00
James O. D. Hunt	ef835b5948	Merge pull request #3418 from yangfeiyu20102011/main runtime: it should rollback when failed in Sandbox AddInterface	2022-01-12 10:22:36 +00:00
Bin Liu	a561159f7b	Merge pull request #3423 from liubin/fix/3422-ignore-some-generated-files libs: add some generated files to .gitignore	2022-01-12 15:46:21 +08:00
bin	85f5ae190e	runtime: close span before return from function in case of error Return before closing span will cause invalid spans, so span should be closed before function return. Fixes: #3424 Signed-off-by: bin <bin@hyper.sh>	2022-01-11 19:45:41 +08:00
bin	106df33ff8	libs: add some generated files to .gitignore Generated protocol files should not be included in the Git repo. Also add Cargo.lock in the oci/protocols directory to .gitignore. Fixes: #3422 Signed-off-by: bin <bin@hyper.sh>	2022-01-11 19:29:27 +08:00
yangfeiyu	b133a2368a	runtime: it should rollback when failed in Sandbox AddInterface When Sandbox AddInterface() is called, it may fail after endpoint.HotAttach, we'd better rollback and call save() in the end. Fixes: #3419 Signed-off-by: yangfeiyu <yangfeiyu20102011@163.com>	2022-01-11 18:43:43 +08:00
Feng Wang	c486c2ca18	agent: fix the broken protobuf generation code After the protocols are moved to upper libs (PR3355), the runtime protocol generation is broken. This fixes it. Fixes: #3414 Signed-off-by: Feng Wang <feng.wang@databricks.com>	2022-01-10 15:37:00 -08:00
Gabriela Cervantes	ad16d75c07	runtime: Remove docker comments for kata 2.0 configuration.tomls This PR removes the reference of how to use disable_new_netns configuration with docker as for kata 2.0 we are not supporting docker and this information was used for kata 1.x Fixes #3400 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2022-01-06 16:08:10 +00:00
James O. D. Hunt	66510b977d	Merge pull request #3392 from zhsj/fix-doc docs: fix agent proto file path	2022-01-06 14:31:34 +00:00
Eric Ernst	e073c0936b	Merge pull request #3279 from egernst/containerd-vendor-bump vendor: update to containerd v1.6.0-beta.4	2022-01-05 11:13:05 -08:00
Shengjing Zhu	905e124b77	docs: fix agent proto file path Fixes: #3391 Signed-off-by: Shengjing Zhu <zhsj@debian.org>	2022-01-06 00:22:49 +08:00
Bin Liu	94f14cf6f7	Merge pull request #3363 from zhsj/remove-binary vc: remove swagger binary	2022-01-05 20:40:33 +08:00
Bin Liu	f622d9491f	Merge pull request #3253 from stevenhorsman/agent-config-cmdline agent: Refactor command line parsing to use a framework	2022-01-05 20:25:57 +08:00
Fupan Li	615224e993	agent: move the protocols to upper libs move the protocols to upper libs thus it can be shared between agent and other rust runtime. Depends-on: github.com/kata-containers/tests#4306 Fixes: #3348 Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>	2022-01-05 16:58:06 +08:00
Fupan Li	330e3dcc93	agent: move the oci crate to upper libs Move the oci crate to upper libs thus it can be shared between agent and other rust runtimes. Fixes: #3348 Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>	2022-01-05 16:58:06 +08:00
Bin Liu	b2166560fa	Merge pull request #3375 from zhaojizhuang/debianrootfs osbuilder: Restore Debian as a rootfs	2022-01-05 10:27:47 +08:00
Eric Ernst	7b03d78f15	vendor: update to containerd v1.6.0-beta.4 Update our containerd vendoring. In particular, we're interested in grabbing the updated annotation definitions for defining sandbox sizing. - go get github.com/containerd/containerd@v1.6.0-beta.4 - edit go.mod to remove containerd v1.5.8 replacement directive - go mod vendor - go mod tidy Fixes: #3276 Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2022-01-04 17:15:17 -08:00
GabyCT	caa4e89dfc	Merge pull request #3366 from Kvasscn/kata_dev_fix_kata-collect-data_typo runtime: fix a typo in kata-collect-data.sh	2022-01-04 17:03:34 -06:00
James O. D. Hunt	a838a598ef	Merge pull request #3354 from liubin/fix/3353-return-error-details agent: return detail error message for RPC calls from shim	2022-01-04 14:06:25 +00:00
stevenhorsman	1c4edb9619	agent: Refactor arg parsing to use clap Fixes: #3284 Co-authored-by: Samuel Ortiz <samuel.e.ortiz@protonmail.com> Co-authored-by: stevenhorsman <steven@uk.ibm.com> Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2022-01-04 09:14:08 +00:00
zhaojizhuang	3093f93a6f	osbuilder: Restore Debian as a rootfs Restore Debian as a rootfs. 1. revert of #3154, but some change 2. update debian version to 10.11 3. update `libstdc++-6-dev` to `libstdc++-8-dev` 4. changes discarded in QAT are not restored Fixes: #3372 Signed-off-by: zhaojizhuang <571130360@qq.com>	2022-01-04 11:54:34 +08:00
Fupan Li	ea1a173854	agent: fix the issue of creating new namespaces for agent Tokio's spawn only creates a future async task instead of a new real thread, thus executing unshare to create a new namespace in a tokio async task would make the agent process join the newly created namespace, which isn't expected. Thus, we'd better do the unshare in a real thread to prevent moving the agent process into a new namespace. Fixes: #3369 Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>	2021-12-30 13:32:22 +08:00
zhanghj	2254fa8657	runtime: fix a typo in kata-collect-data.sh Fix a typo in the check for whether the mountpoint exists. Fixes: #3365 Signed-off-by: zhanghj <zhanghj.lc@inspur.com>	2021-12-28 10:03:18 +08:00
Shengjing Zhu	2d0f9d2d06	vc: remove swagger binary Fixes: #3362 Signed-off-by: Shengjing Zhu <zhsj@debian.org>	2021-12-25 22:41:29 +08:00
bin	cf91307c66	agent: return detail error message for rpc calls from shim For calls from the shim to the agent, the returned error is processed like this: `match self.do_start_container(req).await { Err(e) => Err(ttrpc_error(ttrpc::Code::INTERNAL, e.to_string())), Ok(_) => Ok(Empty::new()), }`. `e.to_string()` returns only part of the error (for example, the part set by `context()`), which may lead to a lack of information. `format!("{:?}", err)` returns more info. Fixes: #3353 Signed-off-by: bin <bin@hyper.sh>	2021-12-24 17:17:29 +08:00
Fupan Li	0fe20854e7	Merge pull request #2481 from Bevisy/main-1494 Makefile: update `make go-test` call	2021-12-24 09:57:06 +08:00
James O. D. Hunt	ba22a04265	Merge pull request #2958 from ManaSugi/ignore-unknown-systemcall agent: Ignore unknown seccomp system calls	2021-12-23 12:12:47 +00:00
Peng Tao	8b6fbf9108	Merge pull request #3331 from dubek/mount-remove-var agent: mount: Remove unneeded mount_point local variable	2021-12-23 11:53:14 +08:00
Jakob Naucke	137e217b85	docs: Fix outdated k8s link in virtcontainers readme Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2021-12-22 19:40:25 +01:00
Dov Murik	91abebf92e	agent: mount: Remove unneeded mount_point local variable We already have a `mount_path` local Path variable which holds the mount point. Use it instead of creating a new `mount_point` variable with identical type and content. Fixes: #3332 Signed-off-by: Dov Murik <dovmurik@linux.ibm.com>	2021-12-22 14:11:50 +02:00
James O. D. Hunt	b1f4e945b3	security: Update rust crate versions Update the rust dependencies that have upstream security fixes. Issues fixed by this change: - [`RUSTSEC-2020-0002`](https://rustsec.org/advisories/RUSTSEC-2020-0002) (`prost` crate) - [`RUSTSEC-2020-0036`](https://rustsec.org/advisories/RUSTSEC-2020-0036) (`failure` crate) - [`RUSTSEC-2021-0073`](https://rustsec.org/advisories/RUSTSEC-2021-0073) (`prost-types` crate) - [`RUSTSEC-2021-0119`](https://rustsec.org/advisories/RUSTSEC-2021-0119) (`nix` crate) This change also includes: - Minor code changes for the new version of `prometheus` for the agent. - A downgrade of the version of the `futures` crate to the (new) latest version (`0.3.17`) since version `0.3.18` was removed [1]. Fixes: #3296. [1] - See https://crates.io/crates/futures/versions Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-12-22 07:41:16 +00:00
James O. D. Hunt	c2578cd9a1	docs: Clarify where to run agent API generation commands Make it clear when reading the table in the agent's "Change the agent API" documentation that the commands in the "Generation method" column should be run in the agent repo. Fixes: #3317. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-12-20 15:45:36 +00:00
James O. D. Hunt	2ebae2d279	Merge pull request #3287 from jodh-intel/docs-split-arch-doc Split architecture doc into separate files	2021-12-20 10:11:30 +00:00
Chelsea Mafrica	1653dd4a30	tracing: Add span name to logging error Add span name to logging error to help with debugging when the context is not set before the span is created. Fixes #3289 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2021-12-16 12:44:42 -08:00
James O. D. Hunt	6f9efb4043	docs: Move arch doc to separate directory Move the architecture document into a new `docs/design/architecture/` directory in preparation for splitting it into more manageable pieces. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-12-16 12:26:17 +00:00
Steve Horsman	39cf2b27c1	Merge pull request #3261 from stevenhorsman/native-agent-config-opt agent: Add config file option to cli	2021-12-16 10:00:56 +00:00
Eric Ernst	3865a1bcf6	Merge pull request #2918 from egernst/update-container-type-handling update container type handling	2021-12-15 10:41:23 -08:00
Jakob Naucke	a40e4877e9	Merge pull request #3266 from liubin/fix/3265-update-golang-to-1.16-and-remove-ioutil runtime: update golang to 1.16 and remove ioutil package	2021-12-15 10:09:23 +01:00
Eric Ernst	7a989a8333	runtime: api-test: fixup not clear why this was commented out before -- ensure that we set the appropriate annotation on the sandbox container's annotations to indicate this is a sandbox. Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-12-14 18:55:18 -08:00
Eric Ernst	52f79aef91	utils: update container type handling Today we assume that if the CRI/upper layer doesn't provide a container type annotation, it should be treated as a sandbox. Up to this point, a sandbox with a pause container in CRI context and a single container (ala ctr run) are treated the same. For VM sizing and container constraining, it'll be useful to know if this is a sandbox or if this is a single container. In updating this, we cleanup the type handling tests and we update the containerd annotations vendoring. Fixes: #2926 Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-12-14 17:59:19 -08:00
bin	03546f75a6	runtime: change io/ioutil to io/os packages Change io/ioutil to the io and os packages because the io/ioutil package is deprecated as of Go 1.16: Discard => io.Discard; NopCloser => io.NopCloser; ReadAll => io.ReadAll; ReadDir => os.ReadDir; ReadFile => os.ReadFile; TempDir => os.MkdirTemp; TempFile => os.CreateTemp; WriteFile => os.WriteFile. Details: https://go.dev/doc/go1.16#ioutil Fixes: #3265 Signed-off-by: bin <bin@hyper.sh>	2021-12-15 07:31:48 +08:00
Peng Tao	7c4263b3e1	src: reorg source directories To make the code directory structure clearer: src/agent, src/libs/logging, src/runtime, src/runtime-rs (to be added), src/tools/agent-ctl, src/tools/trace-forwarder. Fixes: #3204 Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2021-12-14 10:30:08 +08:00
stevenhorsman	1a34fbcdbd	agent: Add config file option to cli - Add option to pass in config with -c/--config Fixes: #3252 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2021-12-13 21:57:23 +00:00
Fabiano Fidêncio	602d87295b	Merge pull request #3226 from liubin/fix/3193-fill-hypervisorconfig runtime/template: Handling new attributes for hypervisor config	2021-12-09 13:29:23 +01:00
Chelsea Mafrica	7522109abc	Merge pull request #3218 from liubin/fix/3217-fix-span-name runtime: correct span name for stopSandbox function	2021-12-07 16:36:14 -08:00
bin	b92babf91b	runtime/template: Handling new attributes for hypervisor config Some new attributes were added to the hypervisor config: - VMStorePath - RunStorePath - SharedPath These attributes should be handled in two places: - reset when checking whether the new hypervisor's config is suitable for the base config. - copied from the new hypervisor's config when creating a new VM Fixes: #3193 Signed-off-by: bin <bin@hyper.sh>	2021-12-07 19:31:03 +08:00
bin	40bd34caaf	runtime: only call stopVirtiofsd when shared_fs is virtio-fs If shared_fs is set to virtio-9p, the virtiofsd is not started, so there is no need to stop it. Fixes: #3219 Signed-off-by: bin <bin@hyper.sh>	2021-12-07 16:06:26 +08:00
bin	33f343ee08	runtime: correct span name for stopSandbox function Normally the span name should be the same as the function name, so change `StopVM` to `stopSandbox`. Fixes: #3217 Signed-off-by: bin <bin@hyper.sh>	2021-12-07 15:59:18 +08:00
Bo Chen	995300260e	virtcontainers: clh: Upgrade to openapi-generator v5.3.0 The latest release of openapi-generator v5.3.0 contains the fix for `dropping err` bug [1]. This patch also re-generated the client code of Cloud Hypervisor to have the bug fixed. [1] https://github.com/OpenAPITools/openapi-generator/pull/10275 Fixes: #3201 Signed-off-by: Bo Chen <chen.bo@intel.com>	2021-12-03 08:55:38 -08:00
Carlos Venegas	d02a0932d6	Merge pull request #3173 from liubin/fix/3172 agent: use container ID as watchable storage key for hashmap	2021-12-03 09:35:32 -06:00
Fabiano Fidêncio	3fdc97e110	Merge pull request #3183 from fengwang666/nonroot-vhost-bug-fix runtime: enable vhost-net for rootless hypervisor	2021-12-03 10:42:50 +01:00
Feng Wang	b3bcb7b251	runtime: enable vhost-net for rootless hypervisor vhost-net is disabled in the rootless kata runtime feature, which has been abandoned since kata 2.0. I reused the rootless flag for nonroot hypervisor and would like to enable vhost-net. Fixes #3182 Signed-off-by: Feng Wang <feng.wang@databricks.com>	2021-12-02 21:55:31 -08:00
Bin Liu	4b57548838	Merge pull request #3181 from egernst/topic/clean-lint Cleanup some unused variables, definitions	2021-12-03 11:06:42 +08:00
Eric Ernst	7cb7b9d5ba	agent: remove unused field in mount handling In our parsing of mountinfo, majority of the fields are unused. Let's stop saving these. Fixes: #3180 Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-12-02 17:03:46 -08:00
Eric Ernst	f6ae15826e	agent: drop unused fields from network We don't utilize routes or interface vectors. Let's drop them. Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-12-02 17:03:41 -08:00
Bo Chen	4756a04b2d	virtcontainers: clh: Re-generate the client code This patch re-generates the client code for Cloud Hypervisor v19.0. Note: The client code of cloud-hypervisor's (CLH) OpenAPI is automatically generated by openapi-generator [1-2]. [1] https://github.com/OpenAPITools/openapi-generator [2] https://github.com/kata-containers/kata-containers/blob/main/src/runtime/virtcontainers/pkg/cloud-hypervisor/README.md Signed-off-by: Bo Chen <chen.bo@intel.com>	2021-12-02 12:09:12 -08:00
bin	39b35d0073	agent: use container ID as watchable storage key for hashmap Using the sandbox ID as the key causes the storage of failed containers to leak. Fixes: #3172 Signed-off-by: bin <bin@hyper.sh>	2021-12-02 23:28:25 +08:00
Bin Liu	3992d28f00	Merge pull request #3152 from liubin/fix/3140-create-empty-dir agent: copy empty directories for watchable-bind mounts	2021-12-02 14:46:25 +08:00
bin	2af95bc536	agent: create directories for watchable-bind mounts In function `update_target`, if the updated source is a directory, we should create the corresponding directory. Fixes: #3140 Signed-off-by: bin <bin@hyper.sh>	2021-12-02 06:31:03 +08:00
Gabriela Cervantes	591d4af1ea	runtime: Update comments for virtcontainers to use kata 2.0 This PR updates the comments in the configuration.toml to point to the current kata containers repository instead of the kata 1.x. Fixes #3163 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2021-12-01 16:16:46 +00:00
Fupan Li	87f350db53	Merge pull request #3125 from jodh-intel/update-rust-crate-versions Update rust crate versions	2021-12-01 18:00:33 +08:00
Gabriela Cervantes	923e098db6	osbuilder: Remove debian as a rootfs Currently we do not have Debian as part of the Kata CI because we do not have a maintainer. This PR removes Debian as a supported rootfs in order to have only the distros that we are supporting and maintaining. Fixes #3153 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2021-11-30 19:31:33 +00:00
James O. D. Hunt	afb96c0044	agent: Wrap remaining nix errors with anyhow Wrap `nix` `Error`'s in an `anyhow` error for consistency with the way `rustjail` handles errors. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-11-30 13:26:15 +00:00
James O. D. Hunt	aba572e01d	rustjail: Wrap remaining nix errors with anyhow Replace `Result` values that use a "bare" `nix` `Error` like this: ```rust return Err(nix::Error::EINVAL.into()); ``` ... with the following, which wraps the `nix` error in an `anyhow` call for consistency with the other errors returned by `rustjail`: ```rust return Err(anyhow!(nix::Error::EINVAL)); ``` Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-11-30 13:24:04 +00:00
James O. D. Hunt	30d6007893	uevent: Fix clippy issue in test code Remove a bare `return` from a test function. This looks wrong but isn't because the callers are all tests that just wait for a state change caused by this test function. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-11-30 12:58:15 +00:00
James O. D. Hunt	4a2be13c60	agent: Upgrade nix version for security fix Running `cargo audit` showed that the `nix` package for the agent and the `rustjail` and `vsock-exporter` local crates need to be updated to resolve rust security issue [RUSTSEC-2021-0119](https://rustsec.org/advisories/RUSTSEC-2021-0119). Hence, bumped `nix` to the latest version (which required changes to work with the new, simpler `errno` handling). Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-11-30 12:58:15 +00:00
James O. D. Hunt	256d5008dc	agent: Update crate versions Run `cargo update` to update to the latest crate dependency versions. The agent is an application so this includes expanding the partially specified semvers to full semver values for the following crates, which makes those crates consistent with the other agent dependencies: - `futures` - `regex` - `scan_fmt` - `tokio` Fixes: #3124. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-11-30 12:58:15 +00:00
James O. D. Hunt	4ebdd424de	forwarder: Update rust lockfile Ran `cargo update` to bump crate versions. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-11-30 12:58:15 +00:00
James O. D. Hunt	6007322daa	agent: Fixed invalid error message Remove the format specifier in the `"failed to get VFIO group"` error returned by `vfio_device_handler()`. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-11-30 12:58:15 +00:00
bin	a32e02a1ee	agent: use temp directory as root of test containers Some tests in sandbox.rs need the root user to run because they need to create directories under /run/agent. This is a limitation that shouldn't be there. Using a temp directory for test containers removes the need to run the tests as the root user. Fixes: #3122 Signed-off-by: bin <bin@hyper.sh>	2021-11-26 15:18:38 +08:00
Manabu Sugimoto	7b35615191	agent: Log unknown seccomp system calls Kata agent logs unknown system calls given by seccomp profiles in advance before the log file descriptor closes. Fixes: #2957 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2021-11-26 15:10:04 +09:00
Peng Tao	c3de161168	Merge pull request #3118 from liubin/fix/3117-refactor-find_process agent: refactor find_process function and add test cases	2021-11-26 10:22:48 +08:00
Peng Tao	01b6ffc0a4	Merge pull request #3028 from egernst/hypervisor-hacking Hypervisor cleanup, refactoring	2021-11-26 10:21:49 +08:00
James O. D. Hunt	9412be39ba	Merge pull request #3092 from liubin/fix/3091-fix-test-warnings agent: clear cargo test warnings	2021-11-25 17:22:27 +00:00
Chelsea Mafrica	ed7eb26bff	Merge pull request #3113 from liubin/fix/3112-delete-netmon runtime: delete netmon	2021-11-24 17:58:13 -08:00
bin	6a0b7165ba	agent: refactor find_process function and add test cases Delete redundant parameter init in find_process function and add test case for it. Fixes: #3117 Signed-off-by: bin <bin@hyper.sh>	2021-11-25 09:47:25 +08:00
Fupan Li	2938f60abb	Merge pull request #3012 from jodh-intel/agent-rm-unwraps agent: Remove some unwrap and expect calls	2021-11-25 09:37:39 +08:00
Binbin Zhang	75bb340137	shimv2/service: fix defer functions never run with os.Exit() os.Exit() terminates the program immediately and the deferred functions are not executed, so we call the deferred functions explicitly before os.Exit(). Refer to https://pkg.go.dev/os#Exit Fixes: #3059 Signed-off-by: Binbin Zhang <binbin36520@gmail.com>	2021-11-24 15:59:59 +01:00
James O. D. Hunt	bd3217daeb	agent: Remove redundant returns Remove an unnecessary `return` statement identified by clippy. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-11-24 11:43:49 +00:00
James O. D. Hunt	adab64349c	agent: Remove some unwrap and expect calls Replace some `unwrap()` and `expect()` calls with code to return the error to the caller. Fixes: #3011. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-11-24 11:43:49 +00:00
James O. D. Hunt	351cef7b6a	agent: Remove unwrap from verify_cid() Improved the `verify_cid()` function that validates container ID's by removing the need for an `unwrap()`. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-11-24 11:43:49 +00:00
James O. D. Hunt	a7d1c70c4b	agent: Improve baremount Change `baremount()` to accept `Path` values rather than string values since: - `Path` is more natural given the function deals with paths. - This minimises the caller having to convert between string and `Path` types, which simplifies the surrounding code. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-11-24 11:43:49 +00:00
bin	ddc68131df	runtime: delete netmon Netmon is not used anymore. Fixes: #3112 Signed-off-by: bin <bin@hyper.sh>	2021-11-24 15:08:18 +08:00
wangyongchao.bj	0c6c0735ec	agent: fixed the `make optimize` bug The unrecognized option 'deny-warnings' caused `make optimize` to fail. Fix the Makefile of the agent project so that the `make optimize` command executes correctly. This PR modifies the rustc args from '--deny-warnings' to '--deny warnings'. Fixes: #3104 Signed-off-by: wangyongchao.bj <wangyongchao.bj@inspur.com>	2021-11-23 09:44:05 +08:00
bin	ce0693d6dc	agent: clear cargo test warnings Some function parameters are not used in the test config. This commit adds an underscore before those variable names in the test config. Fixes: #3091 Signed-off-by: bin <bin@hyper.sh>	2021-11-22 20:45:46 +08:00
Binbin Zhang	7304e52a59	Makefile: update `make go-test` call 1. use ci/go-test.sh to replace the direct call to go test 2. fix data race test 3. install hook whether it is root or not Fixes #1494 Signed-off-by: Binbin Zhang <binbin36520@gmail.com>	2021-11-22 13:59:22 +08:00
David Gibson	1b28d7180f	Merge pull request #2927 from dgibson/vfio-env-mangling Update k8s SR-IOV plugin environment variables to work properly with Kata	2021-11-22 13:44:19 +11:00
Eric Ernst	a0919b0865	Merge pull request #2998 from egernst/fix-symlinks watchers: don't dereference symlinks when copying files	2021-11-19 12:43:22 -08:00
Eric Ernst	ce92cadc7d	vc: hypervisor: remove setSandbox The hypervisor interface implementation should not know a thing about sandboxes. Fixes: #2882 Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-11-19 12:20:41 -08:00
Eric Ernst	2227c46c25	vc: hypervisor: use our own logger This'll end up moving to hypervisors pkg, but let's stop using virtLog, instead introduce hvLogger. Fixes: #2884 Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-11-19 12:20:41 -08:00
Eric Ernst	4c2883f7e2	vc: hypervisor: remove dependency on persist API Today the hypervisor code in vc relies on persist pkg for two things: 1. To get the VM/run store path on the host filesystem, 2. For type definition of the Load/Save functions of the hypervisor interface. For (1), we can simply remove the store interface from the hypervisor config and replace it with just the path, since this is all we really need. When we create a NewHypervisor structure, outside of the hypervisor, we can populate this path. For (2), rather than have the persist pkg define the structure, let's let the hypervisor code (soon to be pkg) define the structure. persist API already needs to call into hypervisor anyway; let's allow us to define the structure. We'll probably want to look at following a similar pattern for other parts of vc that we want to make independent of the persist API. In doing this, we started an initial hypervisors pkg, to hold these types (avoid a circular dependency between virtcontainers and persist pkg). Next step will be to remove all other dependencies and move the hypervisor specific code into this pkg, and out of virtcontainers. Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-11-19 12:20:41 -08:00
Eric Ernst	34f23de512	vc: hypervisor: Remove need to get shared address from sandbox Add shared path as part of the hypervisor config Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-11-19 12:20:41 -08:00
Eric Ernst	c28e5a7807	acrn: remove dependency on sandbox, persistapi datatypes Today, acrn relies on sandbox level information, as well as a store provided by common parts of the hypervisor. As we cleanup the abstractions within our runtime, we need to ensure that there aren't cross dependencies between the sandbox, the persistence logic and the hypervisor. Ensure that ACRN still compiles, but remove the setSandbox usage as well as persist driver setup. Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-11-19 12:20:41 -08:00
Eric Ernst	a0e0e18639	hypervisors: introduce pkg to unbreak vc/persist dependency Initial hypervisors pkg, with just basic state types defined. Fixes: #2883 Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-11-19 12:20:41 -08:00
Eric Ernst	b5dfcf2653	watcher: tests: ensure there is 20ms delay between fs writes We noticed s390x test failures on several of the watcher unit tests. Discovered that on s390x in particular, if we update a file in quick succession, the timestamp on the file would not be unique between the writes. Through testing, we observe that a 20 millisecond delay is very reliable for being able to observe the timestamp update. Let's ensure we have this delay between writes for our tests so our tests are more reliable. In "the real world" we'll be polling for changes every 2 seconds, and frequency of filesystem updates will be on the order of minutes and days, rather than microseconds. Fixes: #2946 Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-11-19 11:33:36 -08:00
David Gibson	78dff468bf	agent/device: Adjust PCIDEVICE_* container environment variables for VM The k8s SR-IOV plugin, when it assigns a VFIO device to a container, adds a variable of the form PCIDEVICE_<identifier> to the container's environment, so that the payload knows which device is which. The contents of the variable give the PCI address of the device to use. Kata allows VFIO devices to be passed in to a Kata container, however it runs within a VM which has a different PCI topology. In order for the payload to find the right device, the environment variables therefore need to be converted to list the guest PCI addresses instead of the host PCI addresses. fixes #2897 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-11-19 17:44:05 +11:00
David Gibson	4530e7df29	agent/device: Use simpler structure in update_spec_devices() update_spec_devices() takes a bunch of updates for the device entries in the OCI spec and applies them, adjusting things in both the linux.devices and linux.resources.devices sections of the spec. It's important that each entry in the spec only be updated once. Currently we ensure this by first creating an index of where the entries are, then consulting that as we apply each update, so that earlier updates don't cause us to incorrectly detect an entry as being relevant to a later update. This method works, but it's quite awkward. This inverts the loop structure in update_spec_devices() to make this clearer. Instead of stepping through each update and finding the relevant entries in the spec to change, we step through each entry in the spec and find the relevant update. This makes it structurally clear that we're only updating each entry once. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-11-19 17:21:11 +11:00
Tim Zhang	653b461dc2	Merge pull request #3064 from lifupan/main agent: fix the issue of missing create a new session for container	2021-11-19 11:28:54 +08:00
David Gibson	b60622786d	agent/device: Correct misleading comment on test case We have a test case commented as testing the case where linux.devices is empty in the OCI spec. While it's true that linux.devices is empty in this example, the reason it fails isn't specifically because it's empty but because it doesn't contain a device for the update we're trying to apply. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-11-19 14:25:04 +11:00
David Gibson	89ff700038	agent/device: Remove unnecessary check for empty container_path update_spec_devices() explicitly checks for being called with an empty container path and fails. We have a unit test to verify this behaviour. But while an empty container_path probably does mean something has gone wrong elsewhere, that's also true of any number of other bad paths. Having an empty string here doesn't prevent what we're doing in this function making sense - we can compare it to the strings in the OCI spec perfectly well (though more likely we simply won't find it there). So, there's no real reason to check this one particular odd case. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-11-19 14:25:03 +11:00
David Gibson	c855a312f0	agent/device: Make DevIndex local to update_spec_devices() The DevIndex data structure keeps track of devices in the OCI specification. We used to carry it around to quite a lot of functions, but it's now used only within update_spec_devices(). That means we can simplify things a bit by just open coding the maps we need, rather than declaring a special type. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-11-19 14:24:47 +11:00
David Gibson	084538d334	agent/device: Change update_spec_device to handle multiple devices at once update_spec_device() adjusts the OCI spec for device differences between the host and guest. It is called repeatedly for each device we need to alter. These calls are now all in a single loop in add_devices(), so it makes more sense to move the loop into a renamed update_spec_devices() and process all the fixups in one call. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-11-19 14:23:58 +11:00
David Gibson	d6a3ebc496	agent/device: Obtain guest major/minor numbers when creating DevNumUpdate Currently the DevNumUpdate structure is created with a path to a device node in the VM, which is then used by update_spec_device(). However the only piece of information that update_spec_device() actually needs is the VM side major and minor numbers for the device. We can determine those when we create the DevNumUpdate structure. This means we detect errors earlier and as a bonus we don't need to make a copy of the vm path string. Since that change requires updating 2 of the log statements, we take the opportunity to update all the log statements to structured style. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-11-19 14:23:36 +11:00
David Gibson	f4982130e1	agent/device: Check for conflicting device updates For each device in the OCI spec we need to update it to reflect the guest rather than the host. We do this with additional device information provided by the runtime. There should only be one update for each device though, if there are multiple, something has gone horribly wrong. Detect and report this situation, for safety. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-11-19 14:23:34 +11:00
David Gibson	f10e8c8165	agent/device: Batch changes to the OCI specification As we process container devices in the agent, we repeatedly call update_spec_device() to adjust the OCI spec as necessary for differences between the host and the VM. This means that for the whole of a pretty complex call graph, the spec is in a partially-updated state - neither fully as it was on the host, nor fully as it will be for the container within the VM. Worse, it's not discernible from the contents itself which parts of the spec have already been updated and which have not. We used to have real bugs because of this, until the DevIndex structure was introduced, but that means a whole, fairly complex, parallel data structure needs to be passed around this call graph just to keep track of the state we're in. Start simplifying this by having the device handler functions not directly update the spec, but instead return an update structure describing the change they need. Once all the devices are added, add_devices() will process all the updates as a batch. Note that collecting the updates in a HashMap, rather than a simple Vec doesn't make a lot of sense in the current code, but will reduce churn in future changes which make use of it. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-11-19 14:23:15 +11:00
David Gibson	46a4020e9e	agent/device: Types to represent update for a device in the OCI spec Currently update_spec_device() takes parameters 'vm_path' and 'final_path' to give it the information it needs to update a single device in the OCI spec for the guest. This bundles these parameters into a single structure type describing the updates to a single device. This doesn't accomplish much immediately, but will allow a number of further cleanups. At the same time we change the representation of vm_path from a Unicode string to a std::path::Path, which is a bit more natural since we are performing file operations on it. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-11-19 12:27:52 +11:00
David Gibson	e7beed5430	agent/device: Remove unneeded clone() from several device handlers virtio_blk_device_handler(), virtio_blk_ccw_device_handler() and virtio_scsi_device_handler() all take a clone of their 'device' parameter. They appear to do this in order to get a mutable copy in which they can update the vm_path field. However, the copy is dropped at the end of the function, so the only thing that's used in it is the vm_path field passed to update_spec_device() afterwards. We can avoid the clone by just using a local variable for the vm_path. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-11-19 12:27:52 +11:00
David Gibson	2029eeebca	agent/device: Improve update_spec_device() final_path handling update_spec_device() takes a 'final_path' parameter which gives the name the device should be given in the "inner" OCI spec. We need this for VFIO devices where the name the payload sees needs to match the VM's IOMMU groups. However, in all other cases (for now, and maybe forever), this is the same as the original 'container_path' given in the input OCI spec. To make this clearer and simplify callers, make this parameter an Option, and only update the device name if it is non-None. Additionally, update_spec_device() needs to call to_string() on update_path to get an owned version. Rust convention[0] is to let the caller decide whether it should copy, or just give an existing owned version to the function. Change from &str to String to allow that; it doesn't buy us anything right now, but will make some things a little nicer in future. [0] https://rust-lang.github.io/api-guidelines/flexibility.html?highlight=clone#caller-decides-where-to-copy-and-place-data-c-caller-control Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-11-19 12:27:52 +11:00
David Gibson	57541315db	agent/device: Correct misleading parameter name in update_spec_device() update_spec_device() takes a 'host_path' parameter which it uses to locate the device to correct in the OCI spec. Although this will usually be the path of the device on the host, it doesn't have to be - a traditional runtime like runc would create a device node of that name in the container with the given (host) major and minor numbers. To clarify that, rename it to 'container_path'. We also update the block comment to explain the distinctions more carefully. Finally we update some variable names in tests to match. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-11-19 12:27:52 +11:00
David Gibson	0c51da3dd0	agent/device: Correct misleading error message in update_spec_device() This error is returned if we have information for a device from the runtime, but a matching device does not appear in the OCI spec. However, the name for the device we print is the name from the VM, rather than the name from the container which is what we actually expect in the spec. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-11-19 12:27:52 +11:00
David Gibson	94b7936f51	agent/device: Use nix::sys::stat::{major,minor} instead of libc::* update_spec_devices() includes an unsafe block, in order to call the libc functions to get the major and minor numbers from a device ID. However, the nix crate already has a safe wrapper for this function, which we use in other places in the file. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-11-19 12:27:52 +11:00
Eric Ernst	296e76f8ee	watchers: handle symlinked directories, dir removal - Even a directory could be a symlink - check for this. This is very common when using configmaps/secrets - Add unit test to better mimic a configmap, configmap update - We would never remove directories before. Let's ensure that these are added to the watched_list, and verify in unit tests - Update unit tests which exercise maximum number of files per entry. There's a change in behavior now that we consider directories/symlinks watchable as well. For these tests, it means we support one less file in a watchable mount. Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-11-18 16:23:45 -08:00
Eric Ernst	2b6dfe414a	watchers: don't dereference symlinks when copying files The current implementation just copies the file, dereferencing any symlinks in the process. This results in symlinks not being preserved, and a change in layout relative to the mount that we are making watchable. What we want is something like "cp -d". This isn't available in a crate, so let's go ahead and introduce a copy function which will create a symlink with the same relative path if the source file is a symlink. Regular files are handled with the standard fs::copy. Introduce a unit test to verify symlinks are now handled appropriately. Fixes: #2950 Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-11-18 16:23:45 -08:00
Christophe de Dinechin	0380b9bda7	runtime: Update containerd to 1.5.8 Release 1.5.8 of containerd contains fixes for two low-severity advisories: [GHSA-5j5w-g665-5m35](https://github.com/opencontainers/distribution-spec/security/advisories/GHSA-mc8v-mgrf-8f4m) [GHSA-77vh-xpmg-72qh](https://github.com/opencontainers/image-spec/security/advisories/GHSA-77vh-xpmg-72qh) Fixes: #3074 Signed-off-by: Christophe de Dinechin <dinechin@redhat.com>	2021-11-18 18:38:27 +01:00
Greg Kurz	f80ca66300	Merge pull request #2921 from Amulyam24/template_test virtcontainers: fix failing template test on ppc64le	2021-11-18 17:32:18 +01:00
Amulyam24	d5a18173b9	virtcontainers: fix failing template test on ppc64le If a file/directory doesn't exist, os.Stat() returns an error. Assert the returned value with os.IsNotExist() to prevent it from failing. Fixes: #2920 Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2021-11-18 15:37:40 +05:30
James O. D. Hunt	7269352fd4	Merge pull request #3057 from jodh-intel/docs-update-agent-readme agent: Update README	2021-11-18 08:02:10 +00:00
Fupan Li	bbaf57adb0	agent: fix the issue of missing create a new session for container When the container didn't have a tty console, it would be in the same process group as the kata-agent, which wasn't expected. Thus, create a new session for the container process. Fixes: #3063 Signed-off-by: Fupan Li <fupan.lfp@antgroup.com>	2021-11-18 14:12:51 +08:00
Eric Ernst	7e6f2b8d64	vc-utils: don't export unused function Many of these functions are just used in one place throughout the rest of the code base. If we create a hypervisor package, network package, etc, we may want to parse this out. Fixes: #3049 Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-11-17 14:12:57 -08:00
Eric Ernst	860f30882a	virtcontainers: move oci, uuid packages top level This will be useful at runtime level; no need for oci or uuid to be subpkg of virtcontainers. While at it, ensure we run gofmt on the changed files. Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-11-17 14:12:57 -08:00
Eric Ernst	8acb3a32b6	virtcontainers: remove unused package nsenter Package is not utilized. Remove. Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-11-17 14:12:57 -08:00
Eric Ernst	4788cb8263	vc-network: remove unused functions Unused functions -- let's clean up! Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-11-17 14:12:57 -08:00
Eric Ernst	b6ebddd7ef	oci: remove unused function GetContainerType This is unused - we utilize ContainerType directly. Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-11-17 14:12:57 -08:00
James O. D. Hunt	599bc0c2a9	agent: Update README Update the agent README by removing the historical details about the conversion from golang to rust (which occurred at the start of Kata 2.x development) and replacing it with information that developers and testers should find more useful. Fixes: #3056. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-11-17 17:57:45 +00:00
Eric Ernst	1e7cb4bc3a	macvlan: drop bridged part of name The fact that we need to "bridge" the endpoint is a bit irrelevant. To be consistent with the rest of the endpoints, let's just call this "macvlan" Fixes: #3050 Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-11-16 16:44:29 -08:00
Carlos Venegas	15b5d22e81	Merge pull request #2778 from jcvenegas/clh-race-condition-check clh: Fix race condition that prevent start pods	2021-11-16 14:15:06 -06:00
Carlos Venegas	55412044df	monitor: Fix monitor race condition doing hypervisor.check() The monitor thread checks whether the agent and the VMM are alive every second in a blocking thread. The Cloud Hypervisor API server is single-threaded, so if the monitor does a `check()` while a slow request is still in progress, the monitor check() method will time out. The monitor thread will then stop all the shim-v2 execution. This commit modifies the monitor thread to make it check the status of the hypervisor after 5 seconds. Additionally, the `check()` method from cloud-hypervisor will use the method `clh.isClhRunning(timeout)` with a 10 second timeout. The monitor function has no timeout, so even if `hypervisor.check()` takes more than 10 seconds, the isClhRunning method handles errors from VmmPing and retries until the timeout is reached. Reducing the check frequency to once every 5 seconds should not affect any functionality, but it will reduce the overhead of polling the hypervisor. Fixes: #2777 Signed-off-by: Carlos Venegas <jose.carlos.venegas.munoz@intel.com>	2021-11-16 18:28:29 +00:00
snir911	b046c1ef6b	Merge pull request #2959 from snir911/wip/cgroups-systemd-fix cgroups: Fix systemd cgroup support	2021-11-15 10:44:45 +02:00
Eric Ernst	e89c06e68b	Merge pull request #3032 from liubin/fix/3031-merge-two-types-packages runtime: merge virtcontainers/pkg/types into virtcontainers/types	2021-11-12 14:23:21 -08:00
Chelsea Mafrica	d38135c93b	Merge pull request #2570 from YchauWang/wyc-agent-test agent/src: improve unit test coverage for src/namespace.rs	2021-11-12 11:24:13 -08:00
Chelsea Mafrica	c8f2ef9488	Merge pull request #3030 from liubin/fix/3029-delete-codes runtime: delete not used codes	2021-11-12 08:53:20 -08:00
bin	09f7962ff1	runtime: merge virtcontainers/pkg/types into virtcontainers/types There are two types packages under virtcontainers, and virtcontainers/pkg/types contains only a small amount of code. Merging them into one makes the types package easier to understand and use. Fixes: #3031 Signed-off-by: bin <bin@hyper.sh>	2021-11-12 15:06:39 +08:00
bin	6acedc2531	runtime: delete not used codes Functions EnvVars and GetOCIConfig in runtime/virtcontainers/pkg/oci/utils.go are not used anymore. Fixes: #3029 Signed-off-by: bin <bin@hyper.sh>	2021-11-12 11:35:31 +08:00
Bin Liu	bf24eb6b33	Merge pull request #2979 from jodh-intel/agent-ctl-json-api-spec agent-ctl: Allow API specification in JSON format	2021-11-11 16:45:30 +08:00
Snir Sheriber	bcf181b7ee	cgroups: Fix systemd cgroup support As github.com/containerd/cgroups doesn't support scope units, which are essential in some cases, let's create the cgroups manually and load them through the cgroups API. This is currently done only when there's a single sandbox cgroup (sandbox_cgroup_only=true); otherwise we set it as a static cgroup path as it used to be (until a proper solution for the overhead cgroup under systemd is suggested). Fixes: #2868 Signed-off-by: Snir Sheriber <ssheribe@redhat.com>	2021-11-11 08:51:45 +02:00
Bin Liu	04185bd068	Merge pull request #2997 from Jakob-Naucke/lint-protection virtcontainers: Lint protection types	2021-11-11 08:34:48 +08:00
Fabiano Fidêncio	05cf7cdddb	Merge pull request #3007 from liubin/fix/3006-check-env-key-value agent: check environment variables if empty or invalid	2021-11-10 19:19:47 +01:00
bin	57bb7ffae3	agent: check environment variables if empty or invalid An invalid environment variable key or value will cause set_env to panic. Refer: https://doc.rust-lang.org/std/env/fn.set_var.html#panics Fixes: #3006 Signed-off-by: bin <bin@hyper.sh>	2021-11-10 20:54:21 +08:00
Fabiano Fidêncio	653976c0fd	Merge pull request #3000 from bergwolf/crioptions runtime: Revert "runtime: use containerd package instead of cri-containerd"	2021-11-10 13:41:24 +01:00
Tim Zhang	fbf3bb55c0	Merge pull request #2995 from Tim-Zhang/fix-container-created-time rustjail: Fix created time of container	2021-11-10 19:44:04 +08:00
James O. D. Hunt	8ab90e1068	agent-ctl: Allow API specification in JSON format Update the `agent-ctl` tool to allow API fields to be specified in JSON format, either directly on the command-line, or via a file URI. This feature is made possible by enabling `serde` support in the agent `protocols` crate. Careful use of the `serde` macros allows the `agent-ctl` tool to accept _partially_ specified API objects in JSON format; fields that are not specified are set to the default value for their respective types. `build.rs` changes based on work by Fupan. Fixes: #2978. Contributions-by: Fupan Li <lifupan@gmail.com> Contributions-by: Bin Liu <bin@hyper.sh> Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-11-10 10:16:04 +00:00
Peng Tao	eacfcdec19	runtime: Revert "runtime: use containerd package instead of cri-containerd" This reverts commit `76f16fd1a7` to bring back cri-containerd crioptions parsing so that kata works with older containerd versions like v1.3.9 and v1.4.6. Fixes: #2999 Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2021-11-10 16:06:42 +08:00
Tim Zhang	e7856ff10c	rustjail: Fix created time of container After an exec, the container's created time was reported incorrectly. This commit fixes the problem. Fixes: #2994 Signed-off-by: Tim Zhang <tim@hyper.sh>	2021-11-10 10:43:03 +08:00
Jakob Naucke	b7b89905d4	virtcontainers: Lint protection types Protection types like tdxProtection or seProtection were marked nolint; remove this marking. As a side effect, ARM needs dummy tests for these. Fixes: #2801 Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2021-11-09 18:36:32 +01:00
James O. D. Hunt	87f676062c	agent: Remove dynamic tracing APIs Remove the `StartTracing` and `StopTracing` agent APIs that toggle dynamic tracing. This is not supported in Kata 2.x, as documented in the [tracing proposals document](https://github.com/kata-containers/kata-containers/pull/2062). Fixes: #2985. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-11-09 08:39:06 +00:00
James O. D. Hunt	b192d388c1	Merge pull request #2970 from jodh-intel/logging-create-tests-and-checks logging: Always run crate tests	2021-11-08 13:16:48 +00:00
Manabu Sugimoto	c66b56683b	agent: Ignore unknown seccomp system calls If Kata agent cannot resolve the system calls given by seccomp profiles, the agent ignores the system calls and continues to run without an error. Fixes: #2957 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2021-11-05 21:00:41 +09:00
Chelsea Mafrica	d17100aee6	vendor: update OpenTelemetry to v1.0.0 Upgrade from v0.20.0 to v1.0.0, first stable release. Git log 4bfa0034 Release prep v1.0.0-RC3 (2218) c7ae470a Refactor SDK span creation and implementation (2213) db317fce Verify and update OTLP trace exporter documentation (2053) 04de34a2 Update the website getting started docs (2203) a7b9d021 Rename metric instruments to match feature-freeze API specification (2202) 1f527a52 Update trace API config creation functions (2212) 361a2096 Fix RC2 header in changelog (2215) e209ee75 chore(exporter/zipkin): improves logging on invalid collector. (2191) c0c5ef65 Fix typos in resource.go. (2201) abf6afe0 Update otel example guide (2210) 3b05ba02 Bump actions/setup-go from 2.1.3 to 2.1.4 (2206) bcd7ff7b Bump codecov/codecov-action from 2.0.2 to 2.0.3 (2205) c912b179 Print JSON objects to stdout without a wrapping array (2196) add511c1 Make WithoutTimestamps work (2195) 85c27e01 Bump github.com/golangci/golangci-lint from 1.41.1 to 1.42.0 in /internal/tools (2199) bf6500b3 Bump google.golang.org/grpc from 1.39.1 to 1.40.0 in /exporters/otlp/otlptrace (2184) 9392af96 Bump google.golang.org/grpc in /exporters/otlp/otlptrace/otlptracegrpc (2185) c95694dc Bump google.golang.org/grpc from 1.39.1 to 1.40.0 in /example/otel-collector (2183) 0528fa66 Bump google.golang.org/grpc from 1.39.1 to 1.40.0 in /exporters/otlp/otlpmetric (2186) 3a26ed21 Deprecate the oteltest package (2188) c885435f Website: support GH page links to canonical src (2189) 6da20a27 Add cross-module test coverage (2182) dfc866bd Support capturing stack trace (2163) 41588fea Deprecate the attribute.Any function (2181) 4e8d667f Support a single Resource per MeterProvider in the SDK (2120) a8bb0bf8 Make the tracetest.SpanRecorder concurrent safe (2178) 87d09df3 Deprecate Array attribute in favor of Slice types (2162) df384a9a Move InstrumentKind into the new metric/sdkapi package (2091) 1cb5cdca Unify the OTLP attribute transform (2170) a882ee37 
Clarify the attribute package documentation and order/grouping (2168) 5d25c4d2 Add support for int32 in attribute.Any (2169) 2b0e139e Refactor attributes benchmark tests (2167) 4c7470d9 Bump google.golang.org/grpc from 1.39.0 to 1.39.1 in /exporters/otlp/otlptrace (2176) 990c534a Bump google.golang.org/grpc in /example/otel-collector (2172) b45c9d31 Bump google.golang.org/grpc from 1.39.0 to 1.39.1 in /exporters/otlp/otlpmetric (2174) a3d4ff5c Deprecated the bridge/opencensus/utils package (2166) b1d1d529 Move OC bridge integration tests to own mod (2165) 89a9489c Add OC bridge internal unit tests (2164) 56c743ba Allow global ErrorHandler to be set multiple times (2160) d18c135f Add OpenCensus bridge internal package (2146) fcf945a4 Just a little typo fix in code documentation. (2159) 59a82eba Update version.go (2157) 21d4686f Add ErrorHandlerFunc to simplify creating ErrorHandlers (2149) 23cb9396 Remove `internal/semconv-gen` (2155) 39acab32 Fix code sample in otel.GetTraceProvider (2147) 2b1bb29e Update OpenCensus bridge docs with limitations (2145) fd7c327b Fix Jaeger exporter agent port default value and docs (2131) b8561785 fix(2138): add guard to constructOTResources to return an empty resource (2139) 11f62640 Add a SpanRecorder to the sdk/trace/tracetest (2132) fd9de7ec rename assertsocketbuffersize.go to _test (2136) a6b4d90c nit doc fix (2135) 79398418 pre-release v1.0.0-RC2 (2133) 2501e0fd Use semconv.SchemaURL in STDOUT exporter example (2134) ef03dbc9 Bump codecov/codecov-action from 1 to 2.0.2 (2129) bbe6ca40 Deprecate oteltest.Harness for removal (2123) 7a624ac2 Deprecated the oteltest.TraceStateFromKeyValues function (2122) ece1879f Removed dropped link's attributes field from API package (2118) 03902d98 Rename sdk/trace/tracetest test.go -> exporter.go (2128) cb607b0a Unify OTLP exporter retry logic (2095) abe22437 API: create new linked span from current context (2115) db81d4aa Update internal/global/trace testing (2111) 7f10ef72 Remove propagation 
testing types from oteltest (2116) 25d739b0 Remove resource.WithBuiltinDetectors() which has not been maintained (2097) d57c5a56 Remove several metrics test helpers (2105) 49359495 Simplify trace_context tests (2108) 56d42011 Simplify trace context benchmark test (2109) 63dfe64a Correct status transform in OTLP exporter (2102) 9b1a5f70 Performance improvement: avoid creating multiple same read-only objects (2104) ab78dbd0 Update release URL (2106) 647af3a0 Pre release experimental metrics v0.22.0 (2101) 0a562337 Fixed OS type value for DragonFly BSD (2092) 62c21ffb Bump golang.org/x/tools from 0.1.4 to 0.1.5 in /internal/tools (2096) 4a3da55a Ensure sample code in website_docs getting started page works (2094) d3063a3d Update otel.Meter to global.Meter in Getting Started Document.(2087) (2093) 00a1ec5f Add documentation guidelines and improve Jaeger exporter readme (2082) 12f737c7 oteltest: ensure valid SpanContext created for span started WithNewRoot (2073) 484258eb OS description attribute detector (1840) d8c9a955 Bump google.golang.org/grpc from 1.38.0 to 1.39.0 in /example/otel-collector (2054) 4ffdf034 Add @pellard as an Approver (2047) 1a74b399 Bump google.golang.org/protobuf from 1.26.0 to 1.27.0 in /exporters/otlp/otlpmetric (2040) 57c2e8fb Bump golang.org/x/tools from 0.1.3 to 0.1.4 in /internal/tools (2036) 7cff31a9 Bump google.golang.org/protobuf from 1.26.0 to 1.27.0 in /exporters/otlp/otlptrace (2035) 9e8f523d when using WithNewRoot, don't use the parent context for sampling (2032) 62af6c70 semconv-gen: fix capitalization at word boundaries, add stability/deprecation indicators (2033) 0bceed7e Fix docs on otel-collector example (2034) 6428cd69 Update doc.go (2030) 311a6396 fix documentation for trace.Status (2029) 16f83ce6 export ToZipkinSpanModels for use outside this library (2027) d5d4c87f Add HTTP metrics exporter for OTLP (2022) d6e8f60f Bump github.com/golangci/golangci-lint from 1.40.1 to 1.41.1 in /internal/tools (2023) 51dbe3cb Remove 
deprecated exporters (2020) 257ef7fc Update project status in README (2017) ced177b7 Pre-release 1.0.0-RC1 (2013) 694c9a41 Interface stability documentation (2012) 39fe8092 Add span.TracerProvider() (2009) d020e1a2 Add more tests for go.opentelemetry.io/otel/trace package. (2004) 6d4a38f1 replace WithSyncer with WithBatcher in opencensus example (2007) c30cd1d0 Split stdout exporter into stdouttrace and stdoutmetric (2005) 80ca2b1e otlp: mark unix endpoints to work without transport security (2001) 65140985 Update codecov ignore (2006) 3be9813d Deprecate the exporters in the "trace" and "metric" sub-directories (1993) 377f7ce4 remove WithTrace* options from otlptrace exporters (1997) b33edaa5 OTLP metrics gRPC exporter (1991) 64b640cc Remove old OTLP exporter (1990) 7728a521 Remove dependency on metrics packages (1988) 135ac4b6 Moved internal/tools duplicated findRepoRoot function to common package (1978) cdf67ddf Update semantic conventions to v1.4.0, move to versioned package (1987) 4883cb11 Refactor exporter creation functions (1985) 87cc1e1f Test BatchSpanProcessor export timeout directly (1982) 7ffe2845 Added inputPath validation to semconv-gen (1986) a113856a Add caveat about installing opencensus bridge (1983) 741cb9a3 Fix generator.go call typo in RELEASING.md (1977) 7a0cee7b Replaces golint by revive and fix newly reported linter issues (1946) 46d9687a Add Schema URL support to Resource (1938) 0827aa62 Use mock server as jaeger agent listener. 
(1930) 20886012 Bugfix jaeger exporter test panic (1973) 4bf6150f Add baggage implementation based on the W3C and OpenTelemetry specification (1967) bbe2b8a3 Bump github.com/itchyny/gojq from 0.12.3 to 0.12.4 in /internal/tools (1971) 4949bf05 Bump github.com/cenkalti/backoff/v4 from 4.1.0 to 4.1.1 in /exporters/otlp/otlptrace (1972) 015b4c17 Bump github.com/cenkalti/backoff/v4 from 4.1.0 to 4.1.1 in /exporters/otlp (1970) 13eb12ac Bump github.com/prometheus/client_golang from 1.10.0 to 1.11.0 in /exporters/metric/prometheus (1974) 2371bb0a add otlp trace http exporter (1963) a75ade4e sdk/resource: honor OTEL_SERVICE_NAME in fromEnv resource detector (1969) aed45802 Bump go.opentelemetry.io/proto/otlp from 0.8.0 to 0.9.0 in /exporters/otlp/otlptrace (1959) c4ebae6a Bump go.opentelemetry.io/proto/otlp (1960) b1d2be3b Bump google.golang.org/grpc from 1.37.1 to 1.38.0 in /exporters/otlp/otlptrace (1958) f6daea5e Generate semantic conventions according to specification latest tagged version (1933) 435a63b3 Bump github.com/google/go-cmp from 0.5.5 to 0.5.6 (1954) 6c46af66 Bump github.com/google/go-cmp from 0.5.5 to 0.5.6 in /exporters/trace/jaeger (1953) 4d294853 Bump actions/cache from 2.1.5 to 2.1.6 (1952) dfe2b6f1 OTLP trace gRPC exporter (1922) 5a8f7ff7 Bump go.opentelemetry.io/proto/otlp from 0.8.0 to 0.9.0 in /exporters/otlp (1943) bd935866 Add schema URL support to Tracer (1889) c1f460e0 Update API configs. 
(1921) 270cc603 Small fixes on some Span method's documentation headers (1950) 8603b902 Fix typo in doc (1949) acbb1882 Bump google.golang.org/grpc from 1.37.1 to 1.38.0 in /exporters/otlp (1942) b1621501 Add codecov badge (1940) ea1434c3 Fix some golint issues (1947) 0eeb8f87 Refactor Tracestate (1931) d3b12808 Add Passthrough example (1912) f06cace6 Add @MadVikingGod as a project Approver (1923) ab5facb3 Bump github.com/golangci/golangci-lint in /internal/tools (1925) d23cc61b Refactor configs (1882) 6324adaa Add tracer option argument to global Tracer function (1902) 035fc650 Do not include authentication information in the http.url attribute (1919) d8ac212c Fix sporadic test failure in otlp exporter http driver (1906) a3df00f4 Create .gitattributes (1920) fb88e926 Bump google.golang.org/grpc from 1.37.0 to 1.37.1 in /exporters/otlp (1914) 1982dc46 Bump google.golang.org/grpc in /example/prom-collector (1915) 1759c630 Bump github.com/golangci/golangci-lint in /internal/tools (1916) 7342aa47 Bump google.golang.org/grpc in /example/otel-collector (1913) 21c16418 Add support for scheme in OTEL_EXPORTER_OTLP_ENDPOINT (1886) 5cb62636 Semantic Convention generation tooling (1891) 6219221f Move the unit package to the metric module (1903) 63e0ecfc Implement global default non-recording span (1901) b6d5442f Remove the Tracer method from the Span API (1900) ae85fab3 Document functional options (1899) cabf0c07 Fix default Jaeger collector endpoint (1898) 1e3fa3a3 Bump go.opentelemetry.io/proto/otlp from 0.7.0 to 0.8.0 in /exporters/otlp (1872) 696af787 Bump github.com/benbjohnson/clock from 1.0.3 to 1.1.0 in /sdk/metric (1532) 97eea6c3 Fix some golint issues (1894) 79d9852e fix container port mismatch issue (1895) d20e7228 CI builds validate against last two versions of Go, dropping 1.14 and adding 1.16 (1865) cbcd4b1a Redefine ExportSpans of SpanExporter with ReadOnlySpan (1873) c99d5e99 Split large jaeger span batch to admire the udp packet size limit (1853) 42a84509 
Unembed SpanContext (1877) b7d02db1 Add Status type to SDK (1874) f90d0d93 Update README (1876) a1349944 Update resource.go (1871) f40cad5e Add markdown link check configuration and action (1869) 9bc28f6b Fix existing markdown lint issues (1866) 08f4c270 Add documentation for tracer.Start() (1864) 2bd4840c remove Set.Encoded(Encoder) enconding cache (1855) 7674eebf Removed different types of Detectors for Resources. (1810) f92a6d83 Implement retry policy for the OTLP/gRPC exporter (1832) ec75390f Fix BSP context done tests (1863) 8e55f10a Move the Event type from the API to the SDK (1846) e399d355 drop failed to exporter batches and return error when forcing flush a span processor (1860) f6a9279a Honor context deadline or cancellation in SimpleSpanProcessor.Shutdown (1856) aeef8e00 Add markdown lint GitHub action (1849) d4c8ffad Replace spaces to tabs in Go code snippets (1854) cb097250 fixed typo (1857) 392a44fa Refine configuration design docs (1841) 62cd933d Handle Resource env error when non-nil (1851) 24a91628 Document the SSP is not for production use (1844) ec26ac23 Update RELEASING.md (1843) 8eb0bb99 Fix golint issue caused by typo (1847) ca130e54 Markdownlint (1842) 1144a83d Small typo fixes to existing CHANGELOG entries (1839) e6086958 Update website_docs to v0.20.0 (1838) 0f4e454c Change NewSplitDriver paramater and initialization (1798) 92551d39 Prerelease v1.0.0 (2250) 61839133 zipkin: remove no-op WithSDKOptions (2248) 568e7556 Set Schema URL when exporting traces to OTLP (2242) ec26b556 Fix RC tags in docs (2239) 767ce26c Bump github.com/itchyny/gojq from 0.12.4 to 0.12.5 in /internal/tools (2216) fe7058da adding NewNoopMeterProvider to follow trace api (2237) c338a5ef Bump github.com/golangci/golangci-lint from 1.42.0 to 1.42.1 in /internal/tools (2236) ef126f5c Remove deprecated Array from attribute package (2235) 360d1302 Add tests for nil *Resource (2227) 9e7812d1 Remove the deprecated oteltest package (2234) 486afd34 Remove the deprecated 
bridge/opencensus/utils pkg (2233) eaacfaa8 Fix slice-valued attributes when used as map keys (2223) df2bdbba Fix the import comments of otelpconfig (2224) 7aae2a02 otlptrace: Document supported environment variables (2222) Fixes #2591 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2021-11-04 12:39:00 -07:00
Chelsea Mafrica	84ccdd8ef2	vendor: update OpenTelemetry to v0.20.0 Update OpenTelemetry from v0.15.0 to v0.20.0. Git log 02d8bdd5 Release v0.20.0 (1837) aa66fe75 OS and Process resource detectors (1788) 7374d679 Fix Links documents (1835) 856f5b84 Add feature request issue template (1831) 0fdc3d78 Remove bundler from Jaeger exporter (1830) 738ef11e Fix flaky global ErrorHandler delegation test (1829) e43d9c00 Update Default Value for Jaeger Exporter Endpoint (1824) 0032bd64 Fix default merging of resource attributes from environment variable (1785) 96c5e4ba Add SpanProcessor example for Span annotation on start (1733) 543c8144 Remove the WithSDKOptions from the Jaeger exporter (1825) 66389ad6 Update function docs in sdk.go (1826) 70bc9eb3 Adds support for timeout on the otlp/gRPC exporter (1821) 081cc61d Update Jaeger exporter convenience functions (1822) 1b9f16d3 Remove the WithDisabled option from Jaeger exporter (1806) 6867faa0 Bump actions/cache from v2.1.4 to v2.1.5 (1818) a2bf04dc Build context pipeline in Jaeger upload process (1809) 2de86f23 Remove locking from Jaeger exporter shutdown/export (1807) 4f9fec29 Add ExportSpans benchmark to Jaeger exporter (1805) d9566abe Fix OTLP testing flake: signal connection from mock collector (1816) a2cecb6e add support for env var configuration to otlp/gRPC (1811) d616df61 Fix flaky OTLP exporter reconnect test (1814) b09df84a Changes stdout to expose the `*sdktrace.TracerProvider` (1800) 04890608 Remove options field from Jaeger exporter (1808) 6db20e00 Remove the abandoned Process struct in Jaeger exporter (1804) 086abf34 docs: use test example to document prometheus.InstallNewPipeline (1796) d0cea04b Bump google.golang.org/api from 0.43.0 to 0.44.0 in /exporters/trace/jaeger (1792) 99c477fe Fixed typo for default service name in Jaeger Exporter (1797) 95fd8f50 Bump google.golang.org/grpc from 1.36.1 to 1.37.0 in /exporters/otlp (1791) 9b251644 Zipkin Exporter: Use default resouce's serviceName as default serivce name 
(1777) (1786) 4d141e47 Add k8s.node.name and k8s.node.uid to semconv (1789) 5c99a34c Fix golint issue caused by incorrect comment (1795) c5d006c0 Update Jaeger environment variables (1752) 58432808 add NewExportPipeline and InstallNewPipeline for otlp (1373) 7d8e6bd7 Zipkin Exporter: Adjust span transformation to comply with the spec (1688) 2817c091 Merge sdk/export/trace into sdk/trace (1778) c61e654c Refactor prometheus exporter tests to match file headers as well (1470) 23422c56 Remove process config for Jaeger exporter (1776) 0d49b592 Add test to check bsp ignores `OnEnd` and `ForceFlush` post Shutdown` (1772) e9aaa04b Record links/events attribute drops independently (1771) 5bbfc22c Make ExportSpans for Jaeger Exporter honor deadline (1773) 0786fe32 Add Bug report issue templates (1775) 3c7facee Add `ExportTimeout` option to batch span processor (1755) c6b92d5b Make TraceFlags spec-compliant (1770) ee687ca5 Bump github.com/itchyny/gojq from 0.12.2 to 0.12.3 in /internal/tools (1774) 52a24774 add support for configuring tls certs via env var to otlp/HTTP (1769) 35cfbc7e Update precedence of event name in Jaeger exporter (1768) 33699d24 Adds semantic conventions for exceptions (1492) 928e3c38 Modify ForceFlush to abort after timeout/cancellation (1757) 3947cab4 Fix testCollectorEndpoint typo and add tag assertions in jaeger_test (1753) ecc635dc add website docs (1747) 07a8d195 Fix Jaeger span status reporting and unify tag keys (1761) 4fa35c90 add partial support for env var config to otlp/HTTP (1758) bf180d0f improve OTLP/gRPC connection errors (1737) d575865b Fix span IsRecording when not sampling (1750) 20c93b01 Update SamplingParameters (1749) 97501a3f Update SpanSnapshot to use parent SpanContext (1748) 604b05cb Store current Span instead of local and remote SpanContext in context.Context (1731) c61f4b6d Set @lizthegrey to emeritus status (1745) b1342fec Bump github.com/golangci/golangci-lint in /internal/tools (1743) 54e1bd19 Bump google.golang.org/api 
from 0.41.0 to 0.43.0 in /exporters/trace/jaeger (1741) 4d25b6a2 Bump github.com/prometheus/client_golang from 1.9.0 to 1.10.0 in /exporters/metric/prometheus (1740) 0a47b66f Bump google.golang.org/grpc from 1.36.0 to 1.36.1 in /exporters/otlp (1739) 26f006b8 Reinstate @paivagustavo as an Approver (1734) 382c7ced Remove hasRemoteParent field from SDK span (1728) 862a5a68 Remove setting error status while recording error with Span from oteltest package (1729) 6defcfdf Remove links on NewRoot spans (1726) a9b2f851 upgrade thrift to v0.14.1 in jaeger exporter (1712) 5a6a854d Bump google.golang.org/protobuf from 1.25.0 to 1.26.0 in /exporters/otlp (1724) 23486213 Migrate to using go.opentelemetry.io/proto/otlp (1713) 5d559b40 Remove makeSamplingDecision func (1711) e24702da Update the TraceContext.Extract docs (1720) 9d4eb1f6 Update dates in CHANGELOG.md for 2021 releases (1723) 2b4fa968 Release v0.19.0 (1710) 4beb7041 sdk/trace: removing ApplyConfig and Config (1693) 1d42be16 Rename WithDefaultSampler TracerProvider option to WithSampler and update docs (1702) 860d5d86 Add flag to determine whether SpanContext is remote (1701) 0fe65e6b Comply with OpenTelemetry attributes specification (1703) 88884351 Bump google.golang.org/api from 0.40.0 to 0.41.0 in /exporters/trace/jaeger (1700) 345f264a breaking(zipkin): removes servicName from zipkin exporter. 
(1697) 62cbf0f2 Populate Jaeger's Span.Process from Resource (1673) 28eaaa9a Add a test to prove the Tracer is safe for concurrent calls (1665) 8b1be11a Rename resource pkg label vars and methods (1692) a1539d44 OpenCensus metric exporter bridge (1444) 77aa218d Fix issue #1490, apply same logic as in the SDK (1687) 9d3416cc Fix synchronization issues in global trace delegate implementation (1686) 58f69f09 Span status from HTTP code: Do not set status message if it can be inferred (1681) 9c305bde Flush metric events prior to shutdown in OTLP example (1678) 66b1135a Fix CHANGELOG (1680) 90bd4ab5 Update employer information for maintainers (1683) 36841913 Remove WithRecord() option from trace.SpanOption when starting a span (1660) 65c7de20 Remove trace prefix from NoOp src files. (1679) e88a091a Make SpanContext Immutable (1573) d75e2680 Avoid overriding configuration of tracer provider (1633) 2b4d5ac3 Bump github.com/golangci/golangci-lint in /internal/tools (1671) 150b868d Bump github.com/google/go-cmp from 0.5.4 to 0.5.5 (1667) 76aa924e Fix the examples target info messaging (1676) a3aa9fda Bump github.com/itchyny/gojq from 0.12.1 to 0.12.2 in /internal/tools (1672) a5edd79e Removed setting error status while recording err as span event (1663) e9814758 chore(zipkin): improves zipkin example to not to depend on timeouts. 
(1566) 3dc91f2d Add ForceFlush method to TracerProvider (1608) bd0bba43 exporter: swap pusher for exporter (1656) 56904859 Update the SimpleSpanProcessor (1612) a7f7abac SpanStatus description set only when status code is set to Error (1662) 05252f40 Jaeger Exporter: Fix minor mapping discrepancies (1626) 238e7c61 Add non-empty string check for attribute keys (1659) e9b9aca8 Add tests for propagation of Sampler Tracestate changes (1655) 875a2583 Add docs on when reviews should be cleared (1556) 7153ef2d Add HTTP/JSON to the otlp exporter (1586) 62e2a0f7 Unexport the simple and batch SpanProcessors (1638) 992837f1 Add TracerProvider tests to oteltest harness (1607) bb4c297e Pre release v0.18.0 (1635) 712c3dcc Fix makefile ci target and coverage test packages (1634) 841d2a58 Rename local var new to not collide with builtin (1610) 13938ab5 Update SpanProcessor docs (1611) e25503a0 Add compatibility tests to CI (1567) 1519d959 Use reasonable interval in sdktrace.WithBatchTimeout (1621) 7d4496e0 Pass metric labels when transforming to gaugeArray (1570) 6d4a5e0d Bump google.golang.org/grpc from 1.35.0 to 1.36.0 in /exporters/otlp (1619) a93393a0 Bump google.golang.org/grpc in /example/prom-collector (1620) e499ca86 Fix validation for tracestate with vendor and add tests (1581) 43886e52 Make timestamps sequential in lastvalue agg check (1579) 37688ef6 revent end-users from implementing some interfaces (1575) 85e696d2 Updating documentation with an working example for creating NewExporter (1513) 562eb28b Unify the Added sections of the unreleased changes (1580) c4cf1aff Fix Windows build of Jaeger tests (1577) 4a163bea Fix stdout TestStdoutTimestamp failure with sleep (1572) bd4701eb Stagger timestamps in exact aggregator tests (1569) b94cd4b2 add code attributes to semconv package (1558) 78c06cef Update docs from gitter to slack for communication (1554) 1307c911 Remove vendor exclude from license-check (1552) 5d2636e5 Bump github.com/golangci/golangci-lint in 
/internal/tools (1565) d7aff473 Vendor Thrift dependency (1551) 298c5a14 Update span limits to conform with OpenTelemetry specification (1535) ecf65d79 Rename otel/label -> otel/attribute (1541) 1b5b6621 Remove resampling on span.SetName (1545) 8da52996 fix: grpc reconnection (1521) 3bce9c97 Add Keys() method to propagation.TextMapCarrier (1544) 0b1a1c72 Make oteltest.SpanRecorder into a concrete type (1542) 7d0e3e52 SDK span no modification after ended (1543) 7de3b58c Remove extra labels types (1314) 73194e44 Bump google.golang.org/api from 0.39.0 to 0.40.0 in /exporters/trace/jaeger (1536) 8fae0a64 Create resource.Default() with required attributes/default values (1507) 76f93422 Release v0.17.0 (1534) 9b242bc4 Organize API into Go modules based on stability and dependencies (1528) e50a1c8c Bump actions/cache from v2 to v2.1.4 (1518) a6aa7f00 Bump google.golang.org/api from 0.38.0 to 0.39.0 in /exporters/trace/jaeger (1517) 38efc875 Code Improvement - Error strings should not be capitalized (1488) 6b340501 Update default branch name (1505) b39fd052 nit: Fix comment to be up-to-date (1510) 186c2953 Fix golint error of package comment form (1487) 9308d662 Bump google.golang.org/api from 0.37.0 to 0.38.0 in /exporters/trace/jaeger (1506) 1952d7b6 Reverse order of attribute precedence when merging two Resources (1501) ad7b4715 Remove build flags for runtime/trace support (1498) 4bf4b690 Remove inaccurate and unnecessary import comment (1481) 7e19eb6a Bump google.golang.org/api from 0.36.0 to 0.37.0 in /exporters/trace/jaeger (1504) c6a4406a Bump github.com/golangci/golangci-lint in /internal/tools (1503) 9524ac09 Update workflows to include main branch as trigger (1497) c066f15e Bump github.com/gogo/protobuf from 1.3.1 to 1.3.2 in /internal/tools (1478) 894e0240 Bump github.com/golangci/golangci-lint in /internal/tools (1477) 71ffba39 Bump google.golang.org/grpc from 1.34.0 to 1.35.0 in /exporters/otlp (1471) 515809a8 Bump github.com/itchyny/gojq from 0.12.0 to 0.12.1 
in /internal/tools (1472) 3e96ad1e gitignore: remove unused example path (1474) c5622777 Histogram aggregator functional options (1434) 0df8cd62 Rename Makefile.proto to avoid interpretation as proto file (1468) 979ff51f Bump github.com/stretchr/testify from 1.6.1 to 1.7.0 (1453) 1df8b3b8 Bump github.com/gogo/protobuf from 1.3.1 to 1.3.2 in /exporters/otlp (1456) 4c30a90a Bump github.com/stretchr/testify from 1.6.1 to 1.7.0 in /sdk (1455) 5a9f8f6e Bump github.com/stretchr/testify from 1.6.1 to 1.7.0 in /exporters/stdout (1454) 7786f34c Bump github.com/stretchr/testify from 1.6.1 to 1.7.0 in /exporters/trace/zipkin (1457) 4352a7a6 Bump github.com/stretchr/testify from 1.6.1 to 1.7.0 in /exporters/otlp (1460) 6990b3b3 Bump github.com/stretchr/testify from 1.6.1 to 1.7.0 in /exporters/metric/prometheus (1461) 7af40d22 Bump github.com/stretchr/testify from 1.6.1 to 1.7.0 in /exporters/trace/jaeger (1463) f16f1892 Bump google.golang.org/grpc in /example/otel-collector (1465) fe363be3 Move Span Event to API (1452) 43922240 Bump google.golang.org/grpc in /example/prom-collector (1466) 0aadfb27 Prepare release v0.16.0 (1464) 207587b6 Metric histogram aggregator: Swap in SynchronizedMove to avoid allocations (1435) c29c6fd1 Shutdown underlying span exporter while shutting down BatchSpanProcessor (1443) dfece3d2 Combine the Push and Pull metric controllers (1378) 74deeddd Handle tracestate in TraceContext propagator (1447) 49f699d6 Remove Quantile aggregation, DDSketch aggregator; add Exact timestamps (1412) 9c949411 Rename internal/testing to internal/internaltest (1449) 8d809814 Move gRPC driver to a subpackage and add an HTTP driver (1420) 9332af1b Bump github.com/golangci/golangci-lint in /internal/tools (1445) 5ed96e92 Update exporters/otlp Readme.md (1441) bc9cb5e3 Switch CircleCI badge to GitHub Actions (1440) 716ad082 Remove CircleCI config (1439) 0682db1e Adding Security Workflows to GitHub Actions (2/2): gosec workflow (1429) 11f732b8 Adding Security Workflows to 
GitHub Actions (1/2): codeql workflow (1428) 40f1c003 Add Tracestate into the SamplingResult struct (1432) db06c8d1 Flush metric events before shutdown in collector example (1438) f6f458e1 Fix golint issue caused by typo in trace.go (1436) fe9d1f7e Use uint64 Count consistently in metric aggregation (1430) 3a337d0b Bump github.com/golangci/golangci-lint in /internal/tools (1433) 1e4c8321 cleanup: drop the removed examples in gitignore (1427) 5c9221cf Unify endpoint API that related to OTel exporter (1401) 045c3ffe Build scripts: Replace mapfile with read loop for old bash versions (1425) 2def8c3d Add Versioning Documentation (1388) 6bcd1085 Bump github.com/itchyny/gojq from 0.11.2 to 0.12.0 in /internal/tools (1424) 38e76efe Add a split protocol driver for otlp exporter (1418) 439cd313 Add TraceState to SpanContext in API (1340) 35215264 Split connection management away from exporter (1369) add9d933 Bump github.com/prometheus/client_golang from 1.8.0 to 1.9.0 in /exporters/metric/prometheus (1414) 93d426a1 Add @dashpole as a project Approver (1410) 6fe20ef3 Fix small typo (1409) b22d0d70 Mention the getting started guide (1406) 3fb80fb2 Fix duplicate checkout action in GitHub workflow (1407) 2051927b Correct CI workflow syntax (1403) f11a86f7 Fix typo in comment (1402) bdf87a78 Migrate CircleCI ci.yml workflow to GitHub Actions (1382) 4e59dd1f Bump google.golang.org/grpc from 1.32.0 to 1.34.0 in /example/otel-collector (1400) 83513f70 Bump google.golang.org/api from 0.32.0 to 0.36.0 in /exporters/trace/jaeger (1398) a354fc41 Bump github.com/prometheus/client_golang from 1.7.1 to 1.8.0 in /exporters/metric/prometheus (1397) 3528e42c Bump google.golang.org/grpc from 1.32.0 to 1.34.0 in /exporters/otlp (1396) af114baf Call otel.Handle with non-nil errors (1384) c3c4273e Add RO/RW span interfaces (1360) Fixes #2591 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2021-11-04 12:30:45 -07:00
Chelsea Mafrica	b5cfb73466	Merge pull request #2931 from YchauWang/wyc-runtime-shim2 runtime# make sure the "Shutdown" trace span have a correct end	2021-11-04 11:33:22 -07:00
James O. D. Hunt	d47484e7c1	logging: Always run crate tests Ensure the tests in the local `logging` crate are run for all consumers of it. Additionally, add a new test which checks that output is generated by a range of different log level `slog` macros. This is designed to ensure debug level output is always available for the consumers of the `logging` crate. Fixes: #2969. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-11-04 17:26:52 +00:00
Chelsea Mafrica	09d5d8836b	runtime: tracing: Change method for adding tags In later versions of OpenTelemetry label.Any() is deprecated. Create addTag() to handle type assertions of values. Change AddTag() to variadic function that accepts multiple keys and values. Fixes #2547 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2021-11-04 10:19:05 -07:00
GabyCT	f611785fdc	Merge pull request #2967 from jodh-intel/enable-debug-logs logging: Enable agent debug output for release builds	2021-11-04 10:04:59 -06:00
GabyCT	86b5bb5801	Merge pull request #2940 from ManaSugi/seccomp-aarch64 agent: "Revert agent: Disable seccomp feature on aarch64 temporarily"	2021-11-04 09:38:45 -06:00
James O. D. Hunt	bcf3e82cf0	logging: Enable agent debug output for release builds Raise the `slog` maximum log level feature for release code from `info` to `debug` by changing the `slog` maximum level features in the shared `logging` crate. This allows the consumers of the `logging` crate (the agent, the `trace-forwarder` and the `agent-ctl` tool) to produce debug output when their debug options are enabled. Currently, those options will essentially be a NOP (unless using a debug version of the code). Testing showed that setting the `slog` maximum level features in the rust manifest files for the consumers of the `logging` crate has no impact: those values are ignored, so they have been removed and replaced with a comment stating the levels are set in the `logging` crate. Fixes: #2966. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-11-04 11:42:47 +00:00
Snir Sheriber	b34ed403c5	cgroups: pass vhost-vsock device to cgroup for the sandbox cgroup Signed-off-by: Snir Sheriber <ssheribe@redhat.com>	2021-11-04 10:59:10 +02:00
Snir Sheriber	7362e1e8a9	runtime: remove prefix when cgroups are managed by systemd as done previously in `9949daf4dc` Signed-off-by: Snir Sheriber <ssheribe@redhat.com>	2021-11-04 10:13:22 +02:00
Bin Liu	a7a47bd7d4	Merge pull request #2943 from liubin/fix/2942-add-golint-for-makefile runtime: Enhancement for Makefile	2021-11-04 11:37:21 +08:00
bin	375ad2b2b6	runtime: Enhancement for Makefile There are some issues with the Makefile for runtime: - the default target can't be used as a dependency of other targets. - the `check` target is empty. Also add two targets for local development/tests: - lint: run golangci-lint - pre-commit: run lint and test Fixes: #2942 Signed-off-by: bin <bin@hyper.sh>	2021-11-03 17:36:55 +08:00
Manabu Sugimoto	b468dc500a	agent: Use dup3 system call in unit tests of seccomp Use `dup3` system call instead of `dup2` in unit tests of seccomp because `dup2` is obsolete on aarch64. Fixes: #2939 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2021-11-03 15:49:23 +09:00
Manabu Sugimoto	1aaa0599d9	agent: "Revert agent: Disable seccomp feature on aarch64 temporarily" Re-enable seccomp feature on aarch64 because CI is ready by https://github.com/kata-containers/tests/pull/4124. This reverts commit `42add7f201`. Fixes: #2939 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2021-11-02 22:53:38 +09:00
bin	1e331f7542	agent: refactor process IO processing Move closing IO into process.rs and use a macro to reduce the amount of code. Fixes: #2944 Signed-off-by: bin <bin@hyper.sh>	2021-11-02 15:49:11 +08:00
wangyongchao.bj	9d3ec58370	runtime: make sure the "Shutdown" trace span have a correct end We only called span.End() in the main path of the shim2 Shutdown method, so the "Shutdown" span stayed alive when the number of containers was not 0. This PR makes sure the "Shutdown" trace span always ends correctly. Fixes: #2930 Signed-off-by: wangyongchao.bj <wangyongchao.bj@inspur.com>	2021-11-02 14:24:31 +08:00
Fupan Li	1c81d7e0b6	Merge pull request #2915 from jodh-intel/agent-ctl-handle-hybrid-vsock agent-ctl: Update for Hybrid VSOCK	2021-11-02 09:55:16 +08:00
Jianyong Wu	e15c8460db	Merge pull request #2265 from rapiz1/simple-ro-mount virtcontainers: simplify read-only mount handling	2021-11-01 10:43:16 +08:00
Bin Liu	51e9038ad5	Merge pull request #1998 from liubin/1997/add-fastfail-test runtime: add fast-test to let test exit on error	2021-10-30 15:38:27 +08:00
bin	3f21af9c5c	runtime: add fast-test to let test exit on error Add the -failfast option to let tests exit on the first error. The -failfast option does not apply across packages, so a for loop is used to test each package in src/runtime with the parallel number set to 1; this may make the tests slow. Fixes: #1997 Signed-off-by: bin <bin@hyper.sh>	2021-10-30 11:09:54 +08:00
GabyCT	c8553ea427	Merge pull request #2046 from littlejawa/issue_2042 test: Fix random failure for TestIoCopy	2021-10-29 17:29:31 -05:00
GabyCT	969b78b01f	Merge pull request #2496 from rapiz1/show-guest-protection cli: Show available guest protection in env output	2021-10-29 17:28:47 -05:00
GabyCT	7b406d5561	Merge pull request #2037 from c3d/issue/2036-is-not-exist agent: Make wording of error message match CRI-O test suite	2021-10-29 17:25:06 -05:00
James O. D. Hunt	2551179e43	Merge pull request #2929 from YchauWang/vc-docs-api virtcontainers: api: update the functions in the api.md docs	2021-10-29 16:01:31 +01:00
James O. D. Hunt	4e2dd41eb6	Merge pull request #1791 from wainersm/virtcontainers-1 virtcontainers: check that both initrd and image are not set	2021-10-29 14:51:07 +01:00
wangyongchao.bj	338ac87516	virtcontainers: api: update the functions in the api.md docs The functions in the virtcontainers API document were out of sync with the Sandbox and VCImpl code. There were also two functions named `CreateSandbox` that differ by one parameter, which is confusing. This PR syncs the API document with the code. Fixes: #2928 Signed-off-by: wangyongchao.bj <wangyongchao.bj@inspur.com>	2021-10-29 15:36:53 +08:00
Bin Liu	71b69c36d5	Merge pull request #2917 from sameo/topic/agent-config-sample agent: Fix the configuration sample file	2021-10-29 11:51:58 +08:00
Bin Liu	eb248b0c66	Merge pull request #2750 from liubin/fix/2749-remove-fixme runtime: set tags for trace span	2021-10-29 11:42:49 +08:00
Gabriela Cervantes	e610fc82ff	runtime: Remove comments about unsupported features in config for clh Cloud hypervisor is only supporting virtio-blk, this PR removes comments that make a wrong reference of other features that are not supported by clh. Fixes #2924 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2021-10-28 15:14:49 +00:00
James O. D. Hunt	d1bcf105ff	forwarder: Remove quotes from socket path in doc Update the trace forwarder README to remove the quotes around the socket path, which makes manipulating that path easier. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-10-28 09:20:38 +01:00
Yujia Qiao	e66d0473be	virtcontainers: simplify read-only mount handling Current handling of read-only mounts is a little tricky. However, a clearer solution can be used here: 1. make a private ro bind mount at privateDest to the mount source 2. make a bind mount at mountDest to the mount created in step 1 3. umount the private bind mount created in step 1 One important aspect is that the mount in step 2 is duplicated from the one we created in step 1. So the MS_RDONLY flag is properly preserved in all mounts created in the propagation. Fixes: #2205 Depends-on: github.com/kata-containers/tests#4106 Signed-off-by: Yujia Qiao <rapiz3142@gmail.com>	2021-10-28 15:48:41 +08:00
Manabu Sugimoto	42add7f201	agent: Disable seccomp feature on aarch64 temporarily In order to pass CI test of aarch64, it is necessary to run `ci/install_libseccomp.sh` before running unit tests in `jenkins_job_build.sh`. However, `ci/install_libseccomp.sh` is not available until PR #1788 including this commit is merged in the mainline. Therefore, we disable seccomp feature on aarch64 temporarily. After #1788 lands and CI is fixed, this commit will be reverted. Fixes: #1476 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2021-10-27 19:06:13 +09:00
Manabu Sugimoto	3be50adab9	agent: Add support for Seccomp The kata-agent supports seccomp feature based on the OCI runtime specification. This seccomp capability in the kata-agent is enabled by default. However, it is not enforced by default: users need to enable that by setting `disable_guest_seccomp` to `false` in the main configuration file. Fixes: #1476 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2021-10-27 19:06:13 +09:00
Samuel Ortiz	4280415149	agent: Fix the configuration sample file All endpoint names share the `Request` suffix. Also, the current list is based on functions, not requests. Fixes #2916 Reported-by: Jakob Naucke <jakob.naucke@ibm.com> Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2021-10-27 06:02:33 +02:00
Bo Chen	bf5f42d411	Merge pull request #2906 from jodh-intel/trace-forwarder-drop-privs forwarder: Drop privileges when using hybrid VSOCK	2021-10-26 13:24:01 -07:00
Wainer dos Santos Moschetta	309dae631a	virtcontainers: check that both initrd and image are not set This changed valid() in hypervisor to check the case where both initrd and image path are set; in this case it returns an error. Fixes #1868 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2021-10-26 10:44:23 -04:00
James O. D. Hunt	a10cfffdff	forwarder: Fix changing log level Fix `-l <log-level>` for the trace forwarder which didn't work previously as it lacked the magic Cargo configuration. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-10-26 11:02:06 +01:00
James O. D. Hunt	6abccb92ce	forwarder: Drop privileges when using hybrid VSOCK Hybrid VSOCK requires `root` privileges to access the sandbox-specific host-side AF_UNIX socket created by the hypervisor (CLH or FC). However, once the socket has been bound, privileges can be dropped, allowing the forwarder to run as user `nobody`. Fixes: #2905. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-10-26 11:01:58 +01:00
Bin Liu	8d8604e10f	Merge pull request #2893 from liubin/fix/2892-print-error-instead-of-return agent: do not return error but print it if task wait failed	2021-10-26 17:48:17 +08:00
James O. D. Hunt	b67fa9e450	forwarder: Make explicit root check Rather than generating a potentially misleading error message if the socket bind fails, perform an explicit check for `root` for Hybrid VSOCK. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-10-26 09:28:26 +01:00
James O. D. Hunt	e377578e08	forwarder: Fix docs socket path Updated the trace forwarder README to ensure the real socket path is created, not the template socket path returned by `kata-runtime env`. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-10-26 09:28:26 +01:00
bin	5f306330f4	virtcontainers: delete duplicated notify in watchHypervisor function When hypervisor check failed, the notify function is called twice. Fixes: #2901 Signed-off-by: bin <bin@hyper.sh>	2021-10-26 11:58:26 +08:00
bin	5f5eca6b8e	agent: do not return error but print it if task wait failed Do not return error but print it if task wait failed and let program continue to run the next code. Fixes: #2892 Signed-off-by: bin <bin@hyper.sh>	2021-10-26 11:43:39 +08:00
Yujia Qiao	6cc8000cae	cli: Show available guest protection in env output Show available guest protections in the `kata-runtime env` output. Also bump the formatVersion. Fixes: #1982 Signed-off-by: Yujia Qiao <rapiz3142@gmail.com>	2021-10-25 21:44:56 +08:00
Yujia Qiao	2063b13805	virtcontainers: Add func AvailableGuestProtections Add functions to return guestProtection as a string slice, which can be then used in `kata-runtime env` output. Signed-off-by: Yujia Qiao <rapiz3142@gmail.com>	2021-10-25 21:44:01 +08:00
Fupan Li	3d0fe433c6	Merge pull request #2889 from lht/handle-uevent-remove-actions agent: Handle uevent remove actions	2021-10-25 19:08:20 +08:00
James O. D. Hunt	ec3aa1694b	Merge pull request #2844 from jongwu/unit_test enable unit test on arm	2021-10-25 10:58:21 +01:00
Bin Liu	01fdeb7641	Merge pull request #2891 from ManaSugi/fix/unify-form rustjail: Consistent coding style of LinuxDevice type	2021-10-25 14:03:03 +08:00
Bin Liu	ded864f862	Merge pull request #2568 from Bevisy/main-2254 cli: Fix outdated kata-runtime bash completion	2021-10-25 14:02:13 +08:00
Haitao Li	a13e2f77b8	agent: Handle uevent remove actions uevents with action=remove was ignored causing the agent to reuse stale data in the device map. This patch adds handling of such uevents. Fixes #2405 Signed-off-by: Haitao Li <lihaitao@gmail.com>	2021-10-25 14:41:32 +11:00
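The uevent handling described in a13e2f77b8 can be pictured with a minimal sketch. The `Uevent` struct, the `handle_uevent` function, and the plain `HashMap` device map below are all hypothetical simplifications for illustration; the agent's real types and matching logic are not shown in this log.

```rust
use std::collections::HashMap;

/// Hypothetical, simplified uevent: only the fields this sketch needs.
#[derive(Debug, Clone, PartialEq)]
struct Uevent {
    action: String,
    devpath: String,
    devname: String,
}

/// Keep a device map (sysfs device path -> device name) in step with uevents.
/// Ignoring "remove" events would leave stale entries behind that a later
/// lookup could mistake for a live device.
fn handle_uevent(device_map: &mut HashMap<String, String>, ev: &Uevent) {
    match ev.action.as_str() {
        "add" => {
            device_map.insert(ev.devpath.clone(), ev.devname.clone());
        }
        "remove" => {
            device_map.remove(&ev.devpath);
        }
        // Other actions (e.g. "change") are not relevant to this sketch.
        _ => {}
    }
}
```

With this shape, an "add" followed by a "remove" for the same device path leaves the map empty again instead of retaining the old entry.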
David Gibson	a0825badf6	Merge pull request #2795 from dgibson/vfio-as-vfio Allow VFIO devices to be used as VFIO devices in the container	2021-10-25 14:25:26 +11:00
David Gibson	34273da98f	runtime/device: Allow VFIO devices to be presented to guest as VFIO devices On a conventional (e.g. runc) container, passing in a VFIO group device, /dev/vfio/NN, will result in the same VFIO group device being available within the container. With Kata, however, the VFIO device will be bound to the guest kernel's driver (if it has one), possibly appearing as some other device (or a network interface) within the guest. This adds a new `vfio_mode` option to alter this. If set to "vfio" it will instruct the agent to remap VFIO devices to the VFIO driver within the guest as well, meaning they will appear as VFIO devices within the container. Unlike a runc container, the VFIO devices will have different names to the host, since the names correspond to the IOMMU groups of the guest and those can't be remapped with namespaces. For now we keep 'guest-kernel' as the value in the default configuration files, to maintain current Kata behaviour. In future we should change this to 'vfio' as the default. That will make Kata's default behaviour more closely resemble OCI specified behaviour. fixes #693 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-10-25 12:29:31 +11:00
David Gibson	68696e051d	runtime: Add parameter to constrainGRPCSpec to control VFIO handling Currently constrainGRPCSpec always removes VFIO devices from the OCI container spec which will be used for the inner container. For upcoming support for VFIO devices in DPDK usecases we'll need to not do that. As a preliminary to that, add an extra parameter to the function to control whether or not it will remove the VFIO devices from the spec. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-10-25 12:29:31 +11:00
David Gibson	d9e2e9edb2	runtime: Rename constraintGRPCSpec to improve grammar "constraint" is a noun, "constrain" is the associated verb, which makes more sense in this context. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-10-25 12:29:31 +11:00
David Gibson	57ab408576	runtime: Introduce "vfio_mode" config variable and annotation In order to support DPDK workloads, we need to change the way VFIO devices will be handled in Kata containers. However, the current method, although it is not remotely OCI compliant, has real uses. Therefore, introduce a new runtime configuration field "vfio_mode" to control how VFIO devices will be presented to the container. We also add a new sandbox annotation - io.katacontainers.config.runtime.vfio_mode - to override this on a per-sandbox basis. For now, the only allowed value is "guest-kernel" which refers to the current behaviour where VFIO devices added to the container will be bound to whatever driver in the VM kernel claims them. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-10-25 12:29:29 +11:00
David Gibson	730b9c433f	agent/device: Create device nodes for VFIO devices Add and adjust the vfio devices in the inner container spec so that rustjail will create device nodes for them. In order to do that, we also need to make sure the VFIO device node is ready within the guest VM first. That may take (slightly) longer than just the underlying PCI device(s) being ready, because vfio-pci needs to initialize. So, add a helper function that will wait for a specific VFIO device node to be ready, using the existing uevent listening mechanism. It also returns the device node name for the device (though in practice it will always be /dev/vfio/NN where NN is the group number). Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-10-25 12:28:33 +11:00
David Gibson	175f9b06e9	rustjail: Allow container devices in subdirectories Many device nodes go directly under /dev, however some are conventionally placed in subdirectories under /dev. For example /dev/vfio/vfio or /dev/pts/ptmx. Currently, attempting to pass such a device into a Kata container will fail because mknod() will get an ENOENT because the parent directory is missing (or an equivalent error for bind_dev()). Correct that by making subdirectories as necessary in create_devices(). Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-10-25 12:28:33 +11:00
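As a rough illustration of the fix in 175f9b06e9, the sketch below creates any missing parent directories before a device node would be created. The helper name `ensure_parent_dir` and its parameters are assumptions made for this example; only `std::fs::create_dir_all` and the `std::path` methods are real standard library calls, and the actual `create_devices()` code is not reproduced here.

```rust
use std::fs;
use std::io;
use std::path::Path;

/// Hypothetical helper: make sure the parent directory of the device node
/// `rel` (a path relative to the container root, e.g. "dev/vfio/vfio")
/// exists below `root`, so that a later mknod() does not fail with ENOENT.
/// Note: `rel` must be relative; joining an absolute path would replace `root`.
fn ensure_parent_dir(root: &Path, rel: &Path) -> io::Result<()> {
    let dest = root.join(rel);
    if let Some(parent) = dest.parent() {
        // create_dir_all succeeds if the directory already exists.
        fs::create_dir_all(parent)?;
    }
    Ok(())
}
```

Only the directories are created here; creating the node itself (via mknod or a bind mount) is left out of the sketch.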
David Gibson	9891efc61f	rustjail: Correct sanity checks on device path For each user supplied device, create_devices() checks that the given path actually is in /dev, by checking that its path starts with /dev and does not contain "..". However, this has subtle errors because it's interpreting the path as a raw string without considering separators. It will accept the path /devfoo which it should not, while it will not accept the valid (though weird) paths /dev/... and /dev/a..b. Correct this by using std::path::Path methods designed for the purpose. Having done this, it's trivial to also generate the relative path that mknod_dev() or bind_dev() will need, so do that at the same time. We also move this logic into a helper function so that we can add some unit tests for it. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-10-25 12:28:33 +11:00
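The difference between string-based and component-based checks described in 9891efc61f can be sketched as follows. This is a minimal illustration under assumptions, not the commit's code: the function name `dev_rel_path` and its exact return type are chosen for this example, while `Path::starts_with`, `Path::components`, `Component::ParentDir` and `Path::strip_prefix` are standard library APIs that compare whole path components rather than raw characters.

```rust
use std::path::{Component, Path};

/// Hypothetical helper: if `path` names something below /dev, return it
/// relative to "/" (the form a mknod or bind-mount helper would need);
/// otherwise return None.
fn dev_rel_path(path: &str) -> Option<&Path> {
    let path = Path::new(path);

    // starts_with() compares components, so "/devfoo" is rejected even
    // though the raw string begins with "/dev". "/dev" itself is not a device.
    if !path.starts_with("/dev") || path == Path::new("/dev") {
        return None;
    }

    // Only a real ".." component is rejected; "..." and "a..b" are
    // ordinary file names and stay valid.
    if path.components().any(|c| c == Component::ParentDir) {
        return None;
    }

    path.strip_prefix("/").ok()
}
```

Under these rules `/dev/sda` maps to `dev/sda`, `/dev/...` and `/dev/a..b` are accepted, and `/devfoo`, `/dev/../etc/passwd` and the relative `dev/foo` are all rejected.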
David Gibson	d6b62c029e	rustjail: Change mknod_dev() and bind_dev() to take relative device path Both these functions take the absolute path from LinuxDevice and drop the leading '/' to make a relative path. They do that with a simple &dev.path[1..]. That can be technically incorrect in some edge cases such as a path with redundant /s like "//dev//sda". To handle cases like that, have the explicit relative path passed into these functions. For now we calculate it in the same buggy way, but we'll fix that shortly. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-10-25 12:28:33 +11:00
David Gibson	2680c0bfee	rustjail: Provide useful context on device node creation errors create_devices() within the rustjail module is responsible for creating device nodes within the (inner) containers. Errors that occur here will be propagated up, but are likely to be low level failures of mknod() - e.g. ENOENT or EACCES - which won't be very useful when reported all the way up to the runtime without the context of what we were trying to do. Add some anyhow context information giving the details of the device we were trying to create when it failed. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-10-25 12:28:33 +11:00
David Gibson	42b92b2b05	agent/device: Allow container devname to differ from the host Currently, update_spec_device() assumes that the proper device path in the (inner) container is the same as the device path specified in the outer OCI spec on the host. Usually that's correct. However for VFIO group devices we actually need the container to see the VM's device path, since it's normal to correlate that with IOMMU group information from sysfs which will be different in the guest and which we can't namespace away. So, add an extra "final_path" parameter to update_spec_device() to allow callers to choose the device path that should be used for the inner container. All current callers pass the same thing as container_path, but that will change in future. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-10-25 12:28:33 +11:00
David Gibson	827a41f973	agent/device: Refactor update_spec_device_list() update_spec_device_list() is used to update the container configuration to change device major/minor numbers configured by the Kata client based on host details to values suitable for the sandbox VM, which may differ. It takes a 'device' object, but the only things it actually uses from there are container_path and vm_path. Refactor this as update_spec_device(), taking the host and guest paths to the device as explicit parameters. This makes the function more self-contained and will enable some future extensions. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-10-25 12:28:33 +11:00
David Gibson	8ceadcc5a9	agent/device: Sanity check guest IOMMU groups Each VFIO device passed into the guest could represent a whole IOMMU group of devices on the host. Since these devices aren't DMA isolated from each other, they must appear as the same IOMMU group in the guest as well. The VMM should enforce that for us, but double check it, since things can't work otherwise. This also means we determine the guest IOMMU group for the VFIO device, which we'll be needing later. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-10-25 12:28:33 +11:00
David Gibson	ff59db7534	agent/device: Add function to get IOMMU group for a PCI device For upcoming VFIO extensions we'll need to work with the IOMMU groups of VFIO devices. This helps us towards that by adding pci_iommu_group() to retrieve the IOMMU group (if any) of a given PCI device. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-10-25 12:28:33 +11:00
David Gibson	13b06a35d5	agent/device: Rebind VFIO devices to VFIO driver inside guest VFIO devices can be added to a Kata container and they will be passed through to the sandbox guest. However, inside the guest those devices will bind to a native guest driver, so they will no longer appear as VFIO devices within the guest. This behaviour differs from runc or other conventional container runtimes. This code allows the agent to match the behaviour of other runtimes, if instructed to by kata-runtime. VFIO devices it's informed about with the "vfio" type instead of the existing "vfio-gk" type will be rebound to the vfio-pci driver within the guest. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-10-25 12:28:33 +11:00
David Gibson	e22bd78249	agent/device: Add helper function for binding a guest device to a driver For better VFIO support, we're going to need to take control of which guest driver controls specific guest devices. To assist with that, add the pci_driver_override() function to force a specific guest device to be bound to a specific guest driver. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-10-25 12:28:33 +11:00
Manabu Sugimoto	b40eedc9f7	rustjail: Consistent coding style of LinuxDevice type Use `"c".to_string` in the device type of `dev/full` in order to be consistent with the coding style of other devices. Fixes: #2890 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2021-10-25 09:15:59 +09:00
Jianyong Wu	57c0f93f54	agent: fix race condition when test watcher create_tmpfs fails because of a race condition in the watcher unmount. Quoting James: 1. Rust runs all tests in parallel. 2. Mounts are a process-wide, not a per-thread resource. The only test that calls watcher.mount() is create_tmpfs(). However, other tests create BindWatcher objects. 3. BindWatcher's drop() implementation calls self.cleanup(), which calls unmount for the mountpoint create_tmpfs() asserts. 4. The other tests are calling unmount whenever a BindWatcher goes out of scope. To avoid that issue, let the tests using BindWatcher in watcher and sandbox.rs run sequentially. Fixes: #2809 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2021-10-24 17:31:53 +08:00
Jianyong Wu	1a96b8ba35	template: disable template unit test on arm Template is broken on arm. Here we disable the template unit test temporarily. Fixes: #2809 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2021-10-23 15:07:25 +08:00
Jianyong Wu	43b13a4a6d	runtime: DefaultMaxVCPUs should not greater than defaultMaxQemuVCPUs DefaultMaxVCPUs may be larger than defaultMaxQemuVCPUs; this should be checked and avoided. Fixes: #2809 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2021-10-23 15:07:25 +08:00
Jianyong Wu	c59c36732b	runtime: current vcpu number should be limited The physical current vcpu number should not be used directly as the largest vcpu number is limited to defaultMaxQemuVCPUs. Here, a new helper is introduced in pkg/katautils/config.go to get current vcpu number. Fixes: #2809 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2021-10-23 15:07:25 +08:00
Jianyong Wu	fa922517d9	runtime: kernel version with '+' as suffix panic in parse The kernel version parsing library currently in use can't handle a '+' suffix. A modified kernel adds '+' as a suffix to its version, so a panic occurs. For example, if the current kernel version is "5.14.0-rc4+", test TestHostNetworkingRequested will panic: --- FAIL: TestHostNetworkingRequested (0.00s) panic: &{DistroName:ubuntu DistroVersion:18.04 KernelVersion:5.11.0-rc3+ Issue: Passed:[] Failed:[] Debug:true ActualEUID:0}: failed to check test constraints: error: Build meta data is empty Here, strip the '+' suffix in the kernel version fix helper. Fixes: #2809 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2021-10-23 15:07:25 +08:00
Manohar Castelino	52268d0ece	hypervisor: Expose the hypervisor itself Export the top level hypervisor type s/hypervisor/Hypervisor Fixes: #2880 Signed-off-by: Manohar Castelino <mcastelino@apple.com> Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-10-22 16:46:02 -07:00
Eric Ernst	a72bed5b34	hypervisor: update tests based on createSandbox->CreateVM change Fixup a couple of broken tests. Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-10-22 16:45:35 -07:00
Manohar Castelino	f434bcbf6c	hypervisor: createSandbox is CreateVM Last of a series of commits to export the top level hypervisor generic methods. s/createSandbox/CreateVM Fixes #2880 Signed-off-by: Manohar Castelino <mcastelino@apple.com> Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-10-22 16:45:35 -07:00
Manohar Castelino	76f1ce9e30	hypervisor: startSandbox is StartVM s/startSandbox/StartVM Signed-off-by: Manohar Castelino <mcastelino@apple.com> Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-10-22 16:45:35 -07:00
Manohar Castelino	fd24a695bf	hypervisor: waitSandbox is waitVM renaming... Signed-off-by: Manohar Castelino <mcastelino@apple.com>	2021-10-22 16:45:35 -07:00
Manohar Castelino	a6385c8fde	hypervisor: stopSandbox is StopVM Renaming. There is no Sandbox specific logic except tracing. Signed-off-by: Manohar Castelino <mcastelino@apple.com>	2021-10-22 16:45:35 -07:00
Manohar Castelino	f989078cd2	hypervisor: resumeSandbox is ResumeVM renaming... Signed-off-by: Manohar Castelino <mcastelino@apple.com>	2021-10-22 16:45:35 -07:00
Manohar Castelino	73b4f27c46	hypervisor: saveSandbox is SaveVM rename Signed-off-by: Manohar Castelino <mcastelino@apple.com>	2021-10-22 16:45:35 -07:00
Manohar Castelino	7308610c41	hypervisor: pauseSandbox is nothing but PauseVM renaming Signed-off-by: Manohar Castelino <mcastelino@apple.com>	2021-10-22 16:45:35 -07:00
Manohar Castelino	8f78e1cc19	hypervisor: The SandboxConsole is the VM's console update naming Signed-off-by: Manohar Castelino <mcastelino@apple.com>	2021-10-22 16:45:35 -07:00
Manohar Castelino	4d47aeef2e	hypervisor: Export generic interface methods This is in preparation for creating a separate hypervisor package. Non functional change. Signed-off-by: Manohar Castelino <mcastelino@apple.com>	2021-10-22 16:45:35 -07:00
Manohar Castelino	6baf2586ee	hypervisor: Minimal exports of generic hypervisor internal fields Export commonly used hypervisor fields and utility functions. These need to be exposed to allow the hypervisor to be consumed externally. Note: This does not change the hypervisor interface definition. Those changes will be separate commits. Signed-off-by: Manohar Castelino <mcastelino@apple.com>	2021-10-22 16:45:35 -07:00
GabyCT	03877f3479	Merge pull request #2872 from likebreath/1020/clh_v19.0 Upgrade to Cloud Hypervisor v19.0	2021-10-21 10:26:55 -05:00
James O. D. Hunt	09741272bc	Merge pull request #2783 from likebreath/1001/clh_enable_seccomp virtcontainers: clh: Enable the `seccomp` feature	2021-10-21 09:21:33 +01:00
Bo Chen	8030b6caf0	virtcontainers: clh: Re-generate the client code This patch re-generates the client code for Cloud Hypervisor v19.0. Note: The client code of cloud-hypervisor's (CLH) OpenAPI is automatically generated by openapi-generator [1-2]. [1] https://github.com/OpenAPITools/openapi-generator [2] https://github.com/kata-containers/kata-containers/blob/main/src/runtime/virtcontainers/pkg/cloud-hypervisor/README.md Signed-off-by: Bo Chen <chen.bo@intel.com>	2021-10-20 15:48:55 -07:00
James O. D. Hunt	c1adb075ad	Merge pull request #1937 from jodh-intel/add-tracing-docs docs: Write tracing documentation	2021-10-20 10:14:46 +01:00
Binbin Zhang	4f018b5287	runtime: delete useless src/runtime/cli/exit.go Simply use os.Exit() instead of exit(), and delete the now useless ci/go-no-os-exit.sh. Fixes: #2295 Signed-off-by: Binbin Zhang <binbin36520@gmail.com>	2021-10-20 11:42:37 +08:00
James O. D. Hunt	09a5e03f4a	docs: Write tracing documentation Add documentation explaining how to trace the runtime and agent. Fixes: #1892. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-10-19 17:33:01 +01:00
Chelsea Mafrica	4ce2b14e60	Merge pull request #2817 from jodh-intel/clh+fc-agent-tracing Enable agent tracing for hybrid VSOCK hypervisors	2021-10-18 22:01:52 -07:00
Bin Liu	72d1a04cf1	Merge pull request #2761 from liubin/fix/2752-optimize-test-code runtime: optimize test code	2021-10-19 12:21:04 +08:00
bin	273a1a9ac6	runtime: optimize test code This PR includes these optimizations: - Remove the dependency on the container engine. The old code uses runc to generate config.json and Docker to export rootfs, which is heavy and needs additional dependencies. Using a fixed config for the busybox image avoids the heavy processing above. - Move duplicate code to the pkg/katatestutils package. Fixes: #2752 Signed-off-by: bin <bin@hyper.sh>	2021-10-19 09:54:49 +08:00
bin	76f16fd1a7	runtime: use containerd package instead of cri-containerd cri-containerd project has been merged into containerd repo, and we should not reference it any more in code and docs. This commit will use containerd package instead of cri-containerd package. Fixes: #2791 Signed-off-by: bin <bin@hyper.sh>	2021-10-19 09:40:20 +08:00
James O. D. Hunt	41c49a7bf5	Merge pull request #2771 from fengwang666/debug-pid runtime: update sandbox root dir cleanup behavior in rootless hypervisor	2021-10-18 17:47:47 +01:00
Julien Ropé	17a8c5c685	runtime: Fix random failure for TestIoCopy When running the TestIoCopy test, on some occasions, the test runs too quickly, and closes the stdin pipe before the ioCopy() routine starts to read from it. This causes a SIGSEGV error. To fix this issue, I am adding additional read/write tests before closing the pipes. As the read operation waits for the writer to be done, this actually synchronizes the threads and makes sure the final tests (with closed pipes) work as expected. Fixes: #2042 Signed-off-by: Julien Ropé <jrope@redhat.com>	2021-10-18 15:25:57 +02:00
Bin Liu	1cb38ecbe7	Merge pull request #2843 from zhaojizhuang/fixroute agent: Do not fail when trying to adding existing routes	2021-10-18 15:52:29 +08:00
Bin Liu	c2be2dfb61	Merge pull request #2848 from c3d/bug/2847-tag-typo tracing: Fix typo in "package" tag name	2021-10-18 14:50:47 +08:00
Chelsea Mafrica	6ffe9e5afe	Merge pull request #2816 from cmaf/add-var-name-kata runtime: change name in config settings back to "kata"	2021-10-15 14:09:41 -07:00
Christophe de Dinechin	bcffa26305	tracing: Fix typo in "package" tag name The tracing tags for api.go contain `"packages"` as a tag name, whereas all other tags contain `"package"`. Fixes: #2847 Signed-off-by: Christophe de Dinechin <dinechin@redhat.com>	2021-10-15 14:48:00 +02:00
James O. D. Hunt	e61f5e2931	runtime: Show socket path in kata-env output Display a pseudo path to the sandbox socket in the output of `kata-runtime env` for those hypervisors that use Hybrid VSOCK. The path is not a real path since the command does not create a sandbox. The output includes a `{ID}` tag which would be replaced with the real sandbox ID (name) when the sandbox was created. This feature is only useful for agent tracing with the trace forwarder where the configured hypervisor uses Hybrid VSOCK. Note that the feature required a new `setConfig()` method to be added to the `hypervisor` interface. This isn't normally needed as the specified hypervisor configuration passed to `setConfig()` is also passed to `createSandbox()`. However the new call is required by `kata-runtime env` to display the correct socket path for Firecracker. The new method isn't wholly redundant for the main code path though as it's now used by each hypervisor's `createSandbox()` call. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-10-15 11:45:29 +01:00
James O. D. Hunt	5b3a349db5	trace-forwarder: Support Hybrid VSOCK Add support for Hybrid VSOCK. Unlike standard vsock (`vsock(7)`), under hybrid VSOCK, the hypervisor creates a "master" UNIX socket on the host. For guest-initiated VSOCK connections (such as the Kata agent uses for agent tracing), the hypervisor will then attempt to open a VSOCK port-specific variant of the socket which it expects a server to be listening on. Running the trace forwarder with the new `--socket-path` option and passing it the Hypervisor specific master UNIX socket path, the trace forwarder will listen on the VSOCK port-specific socket path to handle Kata agent traces. For further details and examples, see the README or run the trace forwarder with `--help`. Fixes: #2786. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-10-15 11:45:29 +01:00
James O. D. Hunt	321be0f794	tracing: Remove trace mode and trace type Remove the `trace_mode` and `trace_type` agent tracing options as decided in the Architecture Committee meeting. See: - https://github.com/kata-containers/kata-containers/pull/2062 Fixes: #2352. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-10-15 10:09:38 +01:00
zhaojizhuang	7d0b616cf3	agent: Do not fail when trying to add existing routes Adding a route that already exists should not be a reason for the agent to fail booting and thus prevent the sandbox from starting. Fixes #2712 Signed-off-by: zhaojizhuang <571130360@qq.com>	2021-10-14 18:38:26 +02:00
Bin Liu	8be85fda4f	Merge pull request #2775 from fgiudici/kata-monitor_issue2292 kata-monitor: add index page	2021-10-14 09:12:57 +08:00
GabyCT	5c7e1b457c	Merge pull request #2821 from likebreath/1011/clh_console clh: Refine the usage of guest console and kernel parameters with Cloud Hypervisor	2021-10-13 13:36:32 -05:00
Peng Tao	176dee6f37	agent: exec should inherit container process capabilities Otherwise rustjail would not set its capabilities and it ends up getting all capabilities. Fixes: #2828 Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2021-10-13 17:24:52 +08:00
Bo Chen	7b2bfd4eca	virtcontainers: clh: Use 'quiet' as the default kernel parameter The 'quiet' kernel parameter can avoid guest kernel logs while booting, which can reduce boot time. Fix: #2820 Signed-off-by: Bo Chen <chen.bo@intel.com>	2021-10-11 22:06:27 -07:00
Bo Chen	3e24e46c70	virtcontainers: clh: Turn-off serial and virtio-console by default We will need to have console output from the guest only for debugging purposes. As a result, we can turn-off both the serial and virtio-console devices by default for better boot time. Fixes: #2820 Signed-off-by: Bo Chen <chen.bo@intel.com>	2021-10-11 22:06:23 -07:00
Bin Liu	b7cd4ca2b8	Merge pull request #2813 from liubin/fix/2812-flush-root-span agent: flush root span before process finish	2021-10-11 18:46:09 +08:00
bin	2d7b65e8eb	agent: flush root span before process finish Variables in rust will be dropped at the end of the function. In function real_main the trace will be shut down by `tracer::end_tracing()`, but at this time the root span is in an active state, so this root span will not be sent to the trace collector. This can be fixed by dropping the root span manually. Fixes: #2812 Signed-off-by: bin <bin@hyper.sh>	2021-10-11 17:14:37 +08:00
Chelsea Mafrica	3f95469a78	runtime: logging: Add variable for syslog tag The variable for 'name' in config-settings.go.in was previously hardcoded as "kata". In `e7c42fb` it was changed to the runtime name, which is "kata-runtime". Add a variable to specify a syslog identifier for consistency for tests and documentation that use it. Fixes #2806 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2021-10-11 02:12:13 -07:00
Marcel Apfelbaum	06f4ab10b4	Merge pull request #2764 from dgibson/more-pci Extend PCI submodules to represent non-zero functions and addresses	2021-10-10 15:57:54 +03:00
Feng Wang	adc9e0baaf	runtime: fix two bugs in rootless hypervisor Update the sandbox dir clean up logic to be more appropriate Add different seeds for randInt() method Fixes #2770 Signed-off-by: Feng Wang <feng.wang@databricks.com>	2021-10-08 15:52:42 -07:00
Bo Chen	51cbe14584	runtime: Add option "disable_seccomp" to config hypervisor.clh This patch adds an option "disable_seccomp" to the config hypervisor.clh, from which users can disable the `seccomp` feature from Cloud Hypervisor when needed (for debugging purposes). Fixes: #2782 Signed-off-by: Bo Chen <chen.bo@intel.com>	2021-10-08 15:10:30 -07:00
Bo Chen	98b7350a1b	virtcontainers: clh: Enable the `seccomp` feature This patch enables the `seccomp` feature from Cloud Hypervisor which provides fine-grained allowed syscalls for each of its worker threads. It brings important security benefits, although it would increase the memory footprint. Fixes: #2782 Signed-off-by: Bo Chen <chen.bo@intel.com>	2021-10-08 15:07:43 -07:00
bin	5c77cc2c49	runtime: don't start shim management server in tests The shim management server runs in a goroutine; in test mode this causes the directory containing the listen socket file (/run/vc/sbs/777-77-77777777/shim-monitor.sock) to leak after the tests finish. Fixes: #2805 Signed-off-by: bin <bin@hyper.sh>	2021-10-08 18:41:53 +08:00
David Gibson	72044180e4	agent/device: Return PCI address from wait_for_pci_device() wait_for_pci_device() waits for the PCI device at the given path to become ready, but it doesn't currently give you any meaningful handle on that device. Change the signature, so that it returns the PCI address of the device. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-10-08 16:52:49 +11:00
David Gibson	e50b05d93c	agent/pci: Add type to represent PCI addresses Add a new pci::Address type which represents a guest PCI address in DDDD:BB:SS.F form. fixes #2745 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-10-08 16:52:49 +11:00
David Gibson	8528157b9b	agent/pci: Extend Slot type to represent PCI function as well pci::Slot represents a PCI slot. However, in all cases where we use it, we actually care about addressing a specific PCI function. So, at the moment we can only refer to function 0 in each slot. Replace pci::Slot with pci::SlotFn to represent both the slot and function. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-10-08 16:52:49 +11:00
Fupan Li	988eb95621	Merge pull request #2760 from liubin/fix/2759-optimize-code-for-managing-temp-users runtime: optimize code for managing temp users for rootless mode	2021-10-08 13:49:14 +08:00
bin	bf8f582c1d	runtime: optimize code for managing temp users for rootless mode This commit makes two changes: - move code for managing temp users to rootless.go. - use a common function in qemu.go when shutting down the VM. Fixes: #2759 Signed-off-by: bin <bin@hyper.sh>	2021-10-08 11:04:21 +08:00
Samuel Ortiz	08360c981d	agent: Add an agent configuration file example With all endpoints allowed. Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2021-10-07 04:04:52 +02:00
Samuel Ortiz	8a4e69d237	agent: rpc: Return UNIMPLEMENTED for not allowed endpoints From the endpoints string described through the configuration file, we build a hash set of allowed endpoints. If a configuration file does not include an endpoints section, we assume all endpoints are not allowed. If there is no configuration file, then all endpoints are allowed. Then for every ttrpc request, we check if the name of the endpoint is part of the hashset. If it is not, then we return ttrpc::UNIMPLEMENTED. Fixes: #1837 Signed-off-by: Samuel Ortiz <samuel.e.ortiz@protonmail.com>	2021-10-07 04:04:32 +02:00
Samuel Ortiz	0ea2e3af07	agent: config: Allow for building the configuration from a file When the kernel command line includes an agent.config_file=<path> entry, then we will try to override the default configuration values with the ones we parse from a TOML file at <path>. As the configuration file overrides the default values, we need to go through a simplified builder that converts a set of Option<> fields into the actual AgentConfig structure. Fixes: #1837 Signed-off-by: Samuel Ortiz <samuel.e.ortiz@protonmail.com>	2021-10-07 00:37:40 +02:00
Samuel Ortiz	63539dc9fd	agent: config: Add allowed endpoints They will define the list of endpoints that an agent supports. They're empty and non actionable for now. Signed-off-by: Samuel Ortiz <samuel.e.ortiz@protonmail.com>	2021-10-07 00:37:40 +02:00
Samuel Ortiz	a953fea324	agent: config: Simplify configuration creation We don't need a constructor and can derive directly from the command line parsing. Signed-off-by: Samuel Ortiz <samuel.e.ortiz@protonmail.com>	2021-10-07 00:37:40 +02:00
Samuel Ortiz	b888edc2fc	agent: config: Implement Default A single constructor setting default value is a typical pattern for a Default implementation. Signed-off-by: Samuel Ortiz <samuel.e.ortiz@protonmail.com>	2021-10-07 00:37:40 +02:00
Bin Liu	10ec4b133c	Merge pull request #2742 from liubin/fix/2741-delete-file-code Delete file virtcontainers-setup.sh	2021-10-07 11:54:47 +08:00
Fabiano Fidêncio	4cde619c68	Merge pull request #2797 from fidencio/wip/upgrade-vendored-containerd vendor: Update containerd to v1.5.7	2021-10-06 21:05:44 +02:00
Chelsea Mafrica	6e3fcce2a2	Merge pull request #2748 from liubin/fix/2747-add-test runtime: Optimize func noNeedForOutput and add test cases	2021-10-06 11:24:57 -07:00
Jianyong Wu	7eac2ec786	protection: add confidential compute frame for arm Although CCA, the confidential compute architecture, is not ready yet, add an empty implementation to avoid a static check error. Fixes: #2789 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com> Suggested-by: Fabiano Fidêncio <fidencio@redhat.com>	2021-10-06 15:53:36 +02:00
Jianyong Wu	8acfc154de	check: fix typecheck failure in qemu_arm64_test.go fix typecheck failure in qemu_arm64_test.go Fixes: #2789 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2021-10-06 15:53:35 +02:00
Amulya Meka	5b02d54e23	virtcontainers: fix lint failure on ppc64le Add nolint for arch specific code to exclude from lint check. Fixes: #2773 Signed-off-by: Amulya Meka <amulmek1@in.ibm.com>	2021-10-06 15:53:35 +02:00
Jakob Naucke	ff9728f032	virtcontainers: nolint guestProtection Exclude from lint checking for it is ultimately only used in architecture-specific code. Fixes: #2273 Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2021-10-06 15:53:35 +02:00
Jakob Naucke	5c138c8f12	runtime: Fix field alignment on s390x Follow-up of #2237 for s390x -- field alignment isn't always minimal Fixes: #2773 Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2021-10-06 15:53:35 +02:00
Fabiano Fidêncio	191d001610	vendor: Update containerd to v1.5.7 Bump containerd to v1.5.7 in order to bring in a fix for CVE-2021-41103, "insufficiently restricted permissions on plugin directories (https://github.com/advisories/GHSA-c2h3-6mxw-7mvq)". dependabot found a potential security vulnerability and raised a PR to fix it. However, dependabot does not properly follow nor understand the needs of our CIs (mainly related to formatting the PR and whatnot), thus I'm re-raising it. Fixes: #2796 Supersedes: #2787 Signed-off-by: Fabiano Fidêncio <fidencio@redhat.com>	2021-10-06 10:40:43 +02:00
Eric Ernst	2bc7561561	Merge pull request #2769 from sameo/topic/agent-route Pass the host route IP family to the guest	2021-10-05 07:20:33 -07:00
Bin Liu	f7f6bd0142	kata-monitor: add index page Add an index page to the kata-monitor endpoint. Porting of https://github.com/liubin/kata-containers/commit/a45aa0696d55 Fixes: #2292 Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-10-04 18:13:56 +02:00
Samuel Ortiz	a44cde7e8d	agent: netlink: Use the grpc IP family field when updating the route Not all routes have either a gateway or a destination IP. Interface routes, where the source, destination and gateway are undefined, will default to IP v4 with the current is_ipv6() check even when they are v6 routes. We use the provided gRPC Route.Family field instead. This field is built from the host netlink messages, and is a reliable way of finding out a route's IP family. Fixes: #2768 Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2021-10-01 14:39:46 +02:00
Samuel Ortiz	71ce6cfe9e	runtime: Pass the route IP family to the agent When updating the guest routing table, we should forward the IP family information up to the guest. Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2021-10-01 14:35:17 +02:00
Samuel Ortiz	99450bd1f7	agent: protos: Add a Family field to the Route payload Our check for the IP family is working as long as we have either a gateway or a destination IP. Some routes are missing both. The RT netlink messages provide the IP family information for each route, so we can carry that piece of information up to the guest. That will allow for a more reliable route IP family determination. Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2021-10-01 14:35:17 +02:00
Samuel Ortiz	f85fe70231	runtime: vendor: Bump the netlink package dependency We need to be able to get the IP family from the netlink route messages, and the Route.Family field only got recently added to the netlink package. The update generates static check warnings about the call for nethandler.Delete() being deprecated in favor of a Close() call instead. So we include the s/Delete()/Close()/ change as part of this PR. Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2021-10-01 14:35:01 +02:00
Amulya Meka	e439cec7c5	cmd: fix field alignment on ppc64le Optimising structure field alignment. Fixes: #2779 Signed-off-by: Amulya Meka <amulmek1@in.ibm.com>	2021-10-01 11:45:27 +00:00
Amulya Meka	e5159ea755	cmd: get return value for setCPUtype Accept and assert the return value in testSetCPUTypeGeneric. Fixes: #2779 Signed-off-by: Amulya Meka <amulmek1@in.ibm.com>	2021-10-01 11:44:14 +00:00
James O. D. Hunt	2ce8d4263c	clh: Suppress hypervisor output to make guest output visible Reduce the cloud-hypervisor log level from `Debug` to `Info` when hypervisor debug is enabled. This is required since `Debug` level: - Is overkill for debugging hypervisor failures. - Effectively hides the output from the guest kernel and userland: CLH generates so much output that the output from the guest gets "lost in the noise" (experiments show that for each full CLH debug message, at most 1 _byte_ of guest output is displayed). Fixes: #2726. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-09-30 14:22:09 +01:00
Jakob Naucke	8739a73dd3	Merge pull request #2736 from Amulyam24/kata-check-test cmd: Fix mismatched types in testModuleData	2021-09-30 10:20:19 +02:00
bin	762922a521	runtime: delete func ConstraintsToVCPUs ConstraintsToVCPUs is not used any more. Fixes: #2741 Signed-off-by: bin <bin@hyper.sh>	2021-09-30 14:44:41 +08:00
bin	4f4854308a	runtime: delete virtcontainers-setup.sh This file is not used anymore. Fixes: #2741 Signed-off-by: bin <bin@hyper.sh>	2021-09-30 14:44:30 +08:00
Chelsea Mafrica	96c033ba6c	Merge pull request #2763 from liubin/fix/2762-update-gitignore runtime: update .gitignore to ignore monitor_address file	2021-09-29 09:45:57 -07:00
Carlos Venegas	7183de47df	Merge pull request #2766 from YchauWang/wyc-runtime-cmd runtime: fix the make check-go-static command error	2021-09-29 10:53:02 -05:00
Bin Liu	4ac7199282	Merge pull request #2494 from rapiz1/clean-up-code virtcontainers: clean up useless code	2021-09-29 22:56:13 +08:00
wangyongchao.bj	bb99bfb45d	runtime: fix the make check-go-static command error modify the make script of the check-go-static, changing the `./cli` path to `./cmd/kata-runtime` Fixes: #2765 Signed-off-by: wangyongchao.bj <wangyongchao.bj@inspur.com>	2021-09-29 15:37:25 +08:00
David Gibson	b57613f53e	Merge pull request #1682 from dgibson/rescan Remove forced PCI rescans from agent	2021-09-29 13:03:55 +10:00
bin	870771d76d	runtime: update .gitignore to ignore monitor_address file Running tests sometimes generates pkg/containerd-shim-v2/monitor_address, and `git status` will treat it as a new file. Package containerd-shim-v2 has moved to pkg/containerd-shim-v2, so the monitor_address entry in .gitignore should be updated too. Fixes: #2762 Signed-off-by: bin <bin@hyper.sh>	2021-09-29 09:24:14 +08:00
Fupan Li	823818cfbc	Merge pull request #2744 from fengwang666/nil-bug runtime: fix nil reference in cleanup rootless user	2021-09-28 22:43:24 +08:00
bin	46720c61c1	runtime: set tags for trace span Set tags for trace span in hook.go and remove FIXME. Fixes: #2749 Signed-off-by: bin <bin@hyper.sh>	2021-09-28 18:05:03 +08:00
bin	18bff58487	runtime: Optimize func noNeedForOutput and add test cases Optimize func noNeedForOutput and add test cases for this func. Fixes: #2747 Signed-off-by: bin <bin@hyper.sh>	2021-09-28 16:58:44 +08:00
Feng Wang	e5fe53f0a9	runtime: fix nil reference in cleanup rootless user It seems the client (crio) can send multiple requests to stop the Kata VM, resulting in a nil reference if the uid has already been cleaned up by a different thread. Fixes #2743 Signed-off-by: Feng Wang <feng.wang@databricks.com>	2021-09-27 21:28:47 -07:00
Francesco Giudici	2304a59601	runtime: set the sandbox storage path static Since we now have "unix://" kind of socket returned by the SocketAddress() function, there is no more need to build the sandbox storage path dynamically to keep OS compatibility. Fixes: #2738 Suggested-by: Christophe de Dinechin <dinechin@redhat.com> Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-09-27 15:57:34 +02:00
Francesco Giudici	315295e0ef	runtime: rename GetSanboxesStoragePath() --> GetSandboxesStoragePath() Add the missing 'd'. Fixes: #2738 Suggested-by: Jakob Naucke <jakob.naucke@ibm.com> Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-09-27 15:56:14 +02:00
Bin Liu	3217b03b17	Merge pull request #2522 from Bevisy/main-2515 virtcontainers: Fix incorrect scripts path	2021-09-27 21:14:40 +08:00
Bin Liu	39df808f6a	Merge pull request #2695 from YchauWang/wyc-vc-cgroup runtime: clear virtcontainers cgroup duplicated function	2021-09-27 21:12:39 +08:00
Amulya Meka	13e65f2ee8	cmd: Fix mismatched types in testModuleData Rectify the values of testModuleData with the correct types in TestCCCheckCLiFunction in kata-check_(!x86)_test.go Fixes: #2735 Signed-off-by: Amulya Meka <amulmek1@in.ibm.com>	2021-09-27 07:17:07 +00:00
Peng Tao	05995632c3	Merge pull request #2566 from fgiudici/kata-monitor_improvements Kata monitor: cache improvements	2021-09-27 12:29:13 +08:00
David Gibson	907459c1c1	agent/device: Don't force PCI rescans The agent initiates a PCI rescan from two places. One is triggered for each virtio-blk PCI device, and one is triggered unconditionally when we start a new container. The PCI bus rescan code was added a long time ago in Clear Containers due to lack of ACPI support in QEMU 2.9 + q35. Since Kata routinely plugs devices under a PCIe-to-PCI bridge, that left SHPC as the only available hotplug mechanism. However, while Kata was using SHPC on the qemu side, it wasn't actually using it on the guest side. Due to a quirk of our guest kernel configuration, the SHPC driver never bound to the bridge, and no hotplug was working at all. To work around that, Kata was forcing the rescan manually, which would discover the new device. That was very fragile (we were arguably relying on a kernel bug). Even if we were using SHPC properly, it includes a mandatory 5s delay during plug operations (designed for physical cards and human operators), which makes it unsuitable for quick start up. Worse, the forced PCI rescans could race with either SHPC or PCIe native hotplug sequences, causing several problems. In some cases this could put the device into an entirely broken state where it wouldn't respond to config space accesses at all. Since pull request #2323 was merged, we have instead used ACPI hotplug which is both fast, and more solid in terms of semantics and races. So, the forced PCI rescans are no longer necessary. Remove them all. fixes #683 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-09-27 12:46:33 +10:00
David Gibson	75f426dd1e	agent: Simplify do_add_swap() do_add_swap() has some mildly complex code to translate the PCI path of a virtio-blk device (where the swap will reside) into a /dev path. However, the device module already has get_virtio_blk_pci_device_name() which does exactly that. The existing code has some further advantages: it uses more precise matching of the sysfs paths, and if necessary it will wait for the device to be added to the guest. While we're there, remove an unnecessary 'as u8' from the PCI path construction: pci::Path::new() already accepts anything which implements TryInto<u8>, which u32 certainly does. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-09-27 12:46:33 +10:00
David Gibson	aad1a8734f	runtime/device: Give the agent information about VFIO devices We send information about several kinds of devices to the agent so that it can apply specific handling. We don't currently do this with VFIO devices. However we need to do that so that the agent can properly wait for VFIO devices to be ready (previously it did that using a PCI rescan which may not be reliable and has some very bad side effects). This patch collates and sends the relevant information. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-09-27 12:46:33 +10:00
David Gibson	ebd7b61884	runtime: Don't repeat GetDeviceByID between appendDevices() and append() Both appendBlockDevice and appendVhostUserBlkDevice start by using GetDeviceByID to lookup the api.Device object corresponding to their ContainerDevice object. However their common caller, appendDevices() has already done this. This changes it so the looked up api.Device is passed to the individual appendDevice() functions. This slightly reduces duplicated work, but more importantly it makes it clearer that append*Device() don't need to check for a nil result from GetDeviceByID, since the caller has already done that. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-09-27 12:46:33 +10:00
David Gibson	ad45c52fbe	runtime/device: Record guest PCI path for VFIO devices For several device types which correspond to a PCI device in the guest we record the device's PCI path in the guest. We don't currently do that for VFIO devices, but we're going to need to for better handling of SR-IOV devices. To accomplish this, we have to determine the guest PCI path from the information the VMM gives us: For qemu, we query the slot of the device and its bridge from QMP. For cloud-hypervisor, the device add interface gives us a guest PCI address. In fact this represents a design error in the clh API - there's no way it can really know the guest PCI address in general. It works in this case, because clh doesn't use PCI bridges, so the device will always be on the root bus. Based on that, the PCI path is simply the device's slot number. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-09-27 12:46:33 +10:00
David Gibson	5c2af3e308	runtime/device: Refactor hotplugVFIODevice() to have common exit path hotplugVFIODevice() has several different paths depending if we're plugging into a root port or a PCIE<->PCI bridge and if we're using a regular or mediated VFIO device. We're going to want some common code on the successful exit path here, so refactor the function to allow that without duplication. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-09-27 12:46:33 +10:00
David Gibson	8bc71105f4	agent/device: Add device type for VFIO devices Currently, VFIO devices attached to a Kata container aren't described to the agent at all. We essentially just hope they're ready by the time we've entered the container proper, which is usually the case because of the PCI rescan - but that causes other problems. This adds a new device type to the agent representing VFIO devices. The agent will use its existing uevent watching mechanisms to wait for the associated guest PCI device to appear before proceeding. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-09-27 12:46:33 +10:00
David Gibson	f7a2707505	agent: Move driver type constants into device.rs Currently the constants giving the names for each device/driver type in the protocol are in mount.rs, and used in device.rs. Since these constants are inherently related to, well, devices, it makes more sense to put them in device.rs and use them from mount.rs. This will become even more so with planned extensions which will add some device types that will not be used in mount.rs at all. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-09-27 12:46:33 +10:00
David Gibson	5b1eb08bde	agent/uevent: Improve logging of wait_for_uevent() These messages will help when debugging matchers not matching properly. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-09-27 12:46:33 +10:00
David Gibson	cf36fd87ad	runtime: Fix some leftover go fmt errors A few "go fmt" errors appear to have crept in. Clean them up with "go fmt ./..." in the src/runtime directory. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-09-27 12:46:33 +10:00
zhanghj	57e3712dbd	virtiofs: fix error report in TestVirtiofsdStart when go test running Initialize ctx with context.Background() instead of nil value. Fixes: #2718 Signed-off-by: zhanghj <zhanghj.lc@inspur.com>	2021-09-24 16:06:06 +08:00
Fabiano Fidêncio	279f8e9d03	Merge pull request #2590 from c3d/issue/2589-virtiofsd-perms virtiofs: Create shared directory with 0700 mode, not 0750	2021-09-24 09:16:40 +02:00
Eric Ernst	fa44e5c1e5	Merge pull request #2703 from egernst/watcher-fixup watcher: ensure we create target mount point for storage	2021-09-23 21:59:08 -07:00
Julio Montes	1766c93b08	Merge pull request #2662 from cmaf/tracing-stop-rootctx runtime: tracing: Use root context to stop tracing	2021-09-23 11:50:35 -05:00
Eric Ernst	272771dcf9	watcher: ensure we create target mount point for storage We would only create the target when updating files. We need to make sure that we create the target if the source is a directory. Without this, we'll fail to start a container that utilizes an empty configmap, for example. Add unit tests for this. Fixes: #2638 Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-09-23 08:29:28 -07:00
Julio Montes	5d2a82fbf9	Merge pull request #2323 from dgibson/acpi-pcihp Replace SHPC with ACPI PCI hotplug for Kata guests	2021-09-23 09:55:31 -05:00
Francesco Giudici	8b0bc1f45e	kata-monitor: bump version to 0.2.0 We now support any container engine CRI compliant. Let's bump the kata-monitor version to 0.2.0. Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-09-23 14:32:09 +02:00
Francesco Giudici	bfb556d56a	kata-monitor: refresh kata sandbox list on fs events This commit stops the container engine polling in favor of the kata sandbox storage path monitoring. The pod cache list is now refreshed based on fs events and synced with the container engine only when needed. Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-09-23 14:32:09 +02:00
Francesco Giudici	0e854f3b80	kata-monitor: improve detection of kata workloads When the container engine is different than containerd or CRI-O we lack proper detection of kata workloads and consider all the pods as kata ones. Instead of querying the container engine for the lower level runtime used in each pod, check if a directory matching the pod exists in the virtualcontainers sandboxes storage path. This provides a container engine independent way to check for kata pods. Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-09-23 14:32:09 +02:00
Fabiano Fidêncio	0ececc630f	Merge pull request #2666 from cmaf/tracing-newContainer-logger runtime: tracing: Fix logger passed in newContainer	2021-09-23 13:07:19 +02:00
Fabiano Fidêncio	e33c26ba18	Merge pull request #2622 from YchauWang/wyc-vc-api virtcontainers: update VC SandboxConfig API add SandboxBindMounts field	2021-09-23 13:05:33 +02:00
Fabiano Fidêncio	47170e302a	Merge pull request #2616 from Bevisy/main-2615 sandbox: Allow the device to be accessed,such as /dev/null and /dev/u…	2021-09-23 13:04:18 +02:00
David Gibson	8bbcb06af5	qemu: Disable SHPC hotplug Under certain circumstances[0] Kata will attempt to use SHPC hotplug for PCI devices on the guest. In fact we explicitly enable SHPC on our PCI to PCI bridges, regardless of the qemu default. SHPC was designed a long, long time ago for physical hotplugging and works very poorly for a virtual environment. In particular it has a mandatory 5s delay to allow a (real, human) operator to back out the operation if they press a button by mistake. This alone makes it unusable for a fast start up application like Kata. Worse, the agent forces a PCI rescan during startup. That will race with the SHPC hotplug operation causing the device to go into a bad state where config space can't be accessed from the guest at all. The only reason we've sort of gotten away with this is that our default guest kernel configuration triggers what's arguably a kernel bug effectively disabling SHPC. That makes the agent rescan the only reason we see the new device. Now that we require a qemu >=6.1, which includes ACPI PCI hotplug on the q35 machine, we can explicitly disable SHPC in all cases. It's nothing but trouble. fixes #2174 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-09-23 10:27:26 +10:00
David Gibson	cc4983eeac	runtime: Remove unused qemuArchBase.appendBridges definition qemuArchBase.appendBridges is never actually used, because the bare qemuArchBase type is itself never used (outside of unit tests). Instead all the subclasses of qemuArchBase override appendBridges() to call the very similar, but not identical genericAppendBridges. So, we can remove the qemuArchBase.appendBridges implementation. Furthermore, all those subclasses override appendBridges() in exactly the same way, and so we can remove those definitions and replace the base class qemuArchBase appendBridges() with that version, calling genericAppendBridges(). Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-09-23 10:15:08 +10:00
David Gibson	e248de4616	vendor: Update govmm Update to commit `1b60b536f3`, in particular to get extensions to allow IO and memory window reservations to be set on PCI bridges. https://github.com/kata-containers/govmm/pull/201 Git log: `de039da` govmm/qemu: Let IO/memory reservations be specified for bridge devices Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-09-23 10:14:29 +10:00
wangyongchao.bj	3b0c4bf9a0	runtime: clear virtcontainers cgroup duplicated function There are two functions, `DeviceToDeviceCgroup` and `deviceToDeviceCgroup`, that both create a `specs.LinuxDeviceCgroup` object. We remove the newer function, `deviceToDeviceCgroup`. Fixes: #2694 Signed-off-by: wangyongchao.bj <wangyongchao.bj@inspur.com>	2021-09-22 15:13:34 +08:00
Fabiano Fidêncio	32c3fb71f2	Merge pull request #2546 from fengwang666/rootless-qemu-doc docs: documentation for running non-root VMM	2021-09-21 22:45:33 +02:00
Fabiano Fidêncio	2bee8bc6bd	Merge pull request #2432 from fengwang666/qemu-rootless runtime: run the QEMU VMM process with a non-root user	2021-09-21 21:37:02 +02:00
Feng Wang	305afc8b70	docs: documentation for running non-root VMM Documentation for running non-root QEMU VMM in Kata runtime Fixes: #2545 Signed-off-by: Feng Wang <feng.wang@databricks.com>	2021-09-21 11:20:37 -07:00
Samuel Ortiz	3a4aca4d67	Merge pull request #2671 from YchauWang/wyc-runtime-config runtime: update .gitignore file clear the vc shim config	2021-09-21 15:15:09 +02:00
Feng Wang	9a6d56f1ab	runtime: fix empty cgroup path validation error An empty cgroup path shouldn't fail cgroup creation Fixes #2674 Signed-off-by: Feng Wang <feng.wang@databricks.com>	2021-09-20 13:48:09 -07:00
Christophe de Dinechin	48fb1d9203	virtiofs: Create shared directory with 0700 mode, not 0750 A discussion on the Linux kernel mailing list [1] exposed that virtiofsd makes a core assumption that the file systems being shared are not accessible by any non-privileged user. We currently create the `shared` directory in the sandbox with the default `0750` permissions, which gives read and directory traversal access to the group. There is no real good reason for a non-root user to access the shared directory, and this is potentially dangerous. Fixes: #2589 [1]: https://lore.kernel.org/linux-fsdevel/YTI+k29AoeGdX13Q@redhat.com/ Signed-off-by: Christophe de Dinechin <dinechin@redhat.com>	2021-09-20 10:47:18 +02:00
Francesco Giudici	afad910d0e	kata-monitor: add getSandboxFS() Retrieve the absolute sandbox storage path. We will soon need this to monitor the creation/deletion of new kata sandboxes. Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-09-20 10:37:55 +02:00
Francesco Giudici	e38686f74d	runtime: add GetSandboxesStoragePath() The storage path we use to collect the sandbox files is defined in the virtcontainers/persist/fs package. We create the runtime socket in that storage path, by hardcoding the full path in the SocketAddress() function in the runtime package. This commit splits the hardcoded path by the socket address path so that the runtime package will be able to provide the storage path to all the components that may need it. Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-09-20 10:37:55 +02:00
Francesco Giudici	245a12bbb7	kata-monitor: improve sandbox caching In order to retrieve the list of sandboxes, we poll the container engine every 15 seconds via the CRI. Once we have the list we have to inspect each pod to find out the kata ones. This commit extends the sandbox cache to keep track of all the pods, marking the kata ones, so that during the next polling only the new sandboxes should be inspected to figure out which ones are using the kata runtime. Fixes: #2563 Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-09-20 10:37:55 +02:00
Francesco Giudici	fc067d61d4	kata-monitor: warn when unable to retrieve the lower level runtime This is an unexpected event (likely a change in how containerd/cri-o record the lower level runtime in the pod) and should be more visible: raise the log level to "warning". Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-09-20 10:37:54 +02:00
Francesco Giudici	53ec4df953	kata-monitor: minor fixes fix comment and use literals Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-09-20 10:37:54 +02:00
Chelsea Mafrica	077b77c178	runtime: tracing: Fix logger passed in newContainer Change logger in Trace call in newContainer from sandbox.Logger() to nil. Passing nil will cause an error to be logged by kataTraceLogger instead of the sandbox logger, which will avoid having the log message report it as part of the sandbox subsystem when it is part of the container subsystem. The kataTraceLogger will not log it as related to the container subsystem, but since the container logger has not been created at this point, and we already use the kataTraceLogger in other instances where a subsystem's logger has not been created yet, this PR makes the call consistent with other code. Fixes #2665 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2021-09-17 11:41:04 -07:00
Chelsea Mafrica	39cd05e0bb	runtime: tracing: Use root context to stop tracing Call StopTracing with s.rootCtx, which is the root context for tracing, instead of s.ctx, which is parent to a subset of trace spans. Fixes #2661 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2021-09-17 11:39:13 -07:00
Feng Wang	1cfe59304d	runtime: Run QEMU using a non-root user/group A randomly generated user/group is used to start the QEMU VMM process. The /dev/kvm group owner is also added to the QEMU process to grant it access. Fixes #2444 Signed-off-by: Feng Wang <feng.wang@databricks.com>	2021-09-17 11:28:44 -07:00
wangyongchao.bj	fd98373850	runtime: update .gitignore file clear the vc shim config Update the .gitignore file and remove the following configurations: /virtcontainers/shim/mock/cc-shim/cc-shim /virtcontainers/shim/mock/kata-shim/kata-shim /virtcontainers/shim/mock/shim Fixes: #2670 Signed-off-by: wangyongchao.bj <wangyongchao.bj@inspur.com>	2021-09-17 15:25:28 +08:00
wangyongchao.bj	1b1790fdbc	agent/src: improve unit test coverage for src/namespace.rs Improve unit test coverage for src/namespace.rs for Kata 2.0 agent Fixes: #289 Signed-off-by: wangyongchao.bj <wangyongchao.bj@inspur.com>	2021-09-17 14:15:14 +08:00
Hui Zhu	fff82b4ef5	Merge pull request #2628 from bergwolf/runtime-reorg runtime: refactor commandline code directory	2021-09-17 10:37:22 +08:00
Chelsea Mafrica	6159ef3499	Merge pull request #2626 from YchauWang/wyc-vc-api02 virtcontainers: update VC HypervisorConfig API add three lost fields	2021-09-16 16:46:27 -07:00
Peng Tao	067c44d0b6	runtime: fix UT build failure storeContainer has been removed. Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2021-09-16 19:42:02 +08:00
Peng Tao	e7c42fbc76	runtime: unify generated config We don't need to maintain two generated config.go and even have duplicates between them. Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2021-09-16 17:19:18 +08:00
Peng Tao	4f7cc18622	runtime: refactor commandline code directory Move all command line code to `cmd` and move containerd-shim-v2 to pkg. Fixes: #2627 Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2021-09-16 17:19:18 +08:00
Samuel Ortiz	7bf96d2457	Merge pull request #2604 from Amulyam24/container_tests virtcontainers: add unit tests for container.go	2021-09-16 11:02:16 +02:00
Samuel Ortiz	9ed024e0bf	Merge pull request #2649 from likebreath/0916/clh_hugepages runtime: clh: Enable hugepages support	2021-09-16 10:57:34 +02:00
David Gibson	9d3cd9841f	agent/mount: Remove unused ensure_destination_exists() The only remaining callers of ensure_destination_exists() are in its own unit tests. So, just remove it. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-09-16 12:24:47 +10:00
David Gibson	64aa562355	agent: Correct mount point creation mount_storage() first makes sure the mount point for the storage volume exists. It uses fs::create_dir_all() in the case of 9p or virtiofs volumes otherwise ensure_destination_exists(). But.. ensure_destination_exists() boils down to an fs::create_dir_all() in most cases anyway. The only case it doesn't is for a bind fstype, where it creates a file instead of a directory. But, that's not correct anyway because we need to create either a file or a directory depending on the source of the bind mount, which ensure_destination_exists() doesn't know. The 9p/virtiofs paths also check if the mountpoint exists before calling fs::create_dir_all(), which is unnecessary (fs::create_dir_all already handles that case). mount_storage() does have the information to know what we need to create, so have it explicitly call ensure_destination_file_exists() for the bind mount to a non-directory case, and fs::create_dir_all() in all other cases. fixes #2390 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-09-16 12:24:47 +10:00
David Gibson	08d7aebc28	agent/mount: Split out regular file case from ensure_destination_exists() ensure_destination_exists() can create either a directory or a regular file depending on the arguments. This patch extracts the regular file specific option into its own helper: ensure_destination_file_exists(). This: - Avoids doing some steps in the directory case (they're already handled by create_dir_all()) - Enables some further future cleanups Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-09-16 12:24:47 +10:00
David Gibson	9fa3beff4f	agent: Remove unnecessary BareMount structure struct BareMount contains the information necessary to make a new mount. As a data structure, however, it's pointless, since every user just constructs it, immediately calls the BareMount::mount() method then discards the structure. Simplify the code by making this a direct function call baremount(). Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-09-16 12:24:47 +10:00
David Gibson	49282854f1	agent: Simplify BareMount::mount by using nix::mount::mount BareMount::mount does some complicated marshalling and uses unsafe code to call into the mount(2) system call. However, we're already using the nix crate which provides a more Rust-like wrapper for mount(2). We're even already using nix::mount::umount and nix::mount::MsFlags from the same module. In the same way, we can replace the direct usage of libc::umount() with nix::mount::umount() in one of the tests. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-09-16 12:24:47 +10:00
David Gibson	bac849ecba	Merge pull request #2634 from dgibson/newer-rust versions: Allow newer Rust versions	2021-09-16 12:23:37 +10:00
Bo Chen	d00decc97d	runtime: clh: Enable hugepages support This patch adds the configuration option that allows to use hugepages with Cloud Hypervisor guests. Fixes: #2648 Signed-off-by: Bo Chen <chen.bo@intel.com>	2021-09-15 10:43:57 -07:00
David Gibson	64bb803fcf	runtime/qemu: Move from query-cpus to query-cpus-fast We recently updated to using qemu-6.1 (from qemu 5.2). Unfortunately one breaking change in qemu 6.0 wasn't caught by the CI. The query-cpus QMP command has been removed, replaced by query-cpus-fast (which has been available since qemu 2.12). govmm already had support for query-cpus-fast, we just weren't using it, so the change is quite easy. fixes #2643 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-09-15 16:41:26 +10:00
David Gibson	25ac3524c9	versions: Allow newer Rust versions Rust 1.47.0 which is the latest we note as tested in versions.yaml is now getting fairly old - many current distros have newer versions (e.g. Rust 1.54.0 in Fedora 34). Bring this more up to date. Note that this is only updating the 'newest-version', not the minimum required version. The new version changes the name of the 'clippy::unknown_clippy_lints' option to simply 'unknown_lints' so we need to change that as well to avoid warnings. fixes #2633 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-09-15 08:58:28 +10:00
Samuel Ortiz	4b7e4a4c70	runtime: Vendoring update Due to the libcontainer dependencies removal. Signed-off-by: Samuel Ortiz <samuel.e.ortiz@protonmail.com>	2021-09-14 07:09:34 +02:00
Samuel Ortiz	9bed2ade0f	virtcontainers: Convert to the new cgroups package API The new API is based on containerd's cgroups package. With that conversion we can simplify the virtcontainers sandbox code and also uniformize our cgroups external API dependency. We now only depend on containerd/cgroups for everything cgroups related. Depends-on: github.com/kata-containers/tests#3805 Signed-off-by: Samuel Ortiz <samuel.e.ortiz@protonmail.com> Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-09-14 07:09:34 +02:00
Samuel Ortiz	b42ed39349	virtcontainers: cgroups: Add a containerd API based cgroups package Eventually, we will convert the virtcontainers and the whole Kata runtime code base to only rely on that package. This will make Kata only depend on the simpler containerd cgroups API. Signed-off-by: Samuel Ortiz <samuel.e.ortiz@protonmail.com>	2021-09-14 07:09:34 +02:00
Samuel Ortiz	f17752b0dc	virtcontainers: container: Do not create and manage container host cgroups The only process we are adding there is the container host one, and there is no such thing anymore. Signed-off-by: Samuel Ortiz <samuel.e.ortiz@protonmail.com>	2021-09-14 07:09:33 +02:00
Samuel Ortiz	dc7e9bce73	virtcontainers: sandbox: Host cgroups partitioning This is a simplification of the host cgroup handling by partitioning the host cgroups into 2: A sandbox cgroup and an overhead cgroup. The sandbox cgroup is always created and initialized. The overhead cgroup is only available when sandbox_cgroup_only is unset, and is unconstrained on all controllers. The goal of having an overhead cgroup is to be more flexible on how we manage a pod overhead. Having such cgroup will allow for setting a fixed overhead per pod, for a subset of controllers, while at the same time not having the pod being accounted for those resources. When sandbox_cgroup_only is not set, we move all non vCPU threads to the overhead cgroup and let them run unconstrained. When it is set, all pod related processes and threads will run in the sandbox cgroup. Signed-off-by: Samuel Ortiz <samuel.e.ortiz@protonmail.com>	2021-09-14 07:09:29 +02:00
Samuel Ortiz	f811026c77	virtcontainers: Unconditionally create the sandbox cgroup manager Regardless of the sandbox_cgroup_only setting, we create the sandbox cgroup manager and set the sandbox cgroup path at the same time. Without doing this, the hypervisor constraint routine is mostly a NOP as the sandbox state cgroup path is not initialized. Fixes #2184 Signed-off-by: Samuel Ortiz <samuel.e.ortiz@protonmail.com>	2021-09-14 07:05:57 +02:00
wangyongchao.bj	a6066404f7	virtcontainers: update VC HypervisorConfig API add three lost fields Sync the virtcontainers api.md document, add `ConfidentialGuest` `EntropySourceList` `GuestSwap` three fields to the HypervisorConfig API. Fixes #2625 Signed-off-by: wangyongchao.bj <wangyongchao.bj@inspur.com>	2021-09-14 10:42:54 +08:00
wangyongchao.bj	bb18cd475c	virtcontainers: update VC SandboxConfig API add SandboxBindMounts field sync the virtcontainers api.md document, add SandboxBindMounts field to the SandboxConfig API. And update the order of the SandboxConfig API fields. Fixes #2621 Signed-off-by: wangyongchao.bj <wangyongchao.bj@inspur.com>	2021-09-14 09:56:47 +08:00
Eric Ernst	967db0cbcc	Merge pull request #2544 from likebreath/0831/upgrade_clh_v18.0 versions: Upgrade to Cloud Hypervisor v18.0	2021-09-13 11:27:45 -07:00
Fabiano Fidêncio	9381f23ccf	Merge pull request #2613 from sameo/topic/runtime-readme runtime: Fix README link	2021-09-13 17:44:56 +02:00
Binbin Zhang	58e77a3c13	sandbox: Allow the device to be accessed, such as /dev/null and /dev/urandom If the device has no permission, such as /dev/null, /dev/urandom, it needs to be added into cgroup. Fixes: #2615 Signed-off-by: Binbin Zhang <binbin36520@gmail.com>	2021-09-13 20:47:16 +08:00
Samuel Ortiz	75ef8c243a	Merge pull request #2603 from Bevisy/main-2539 sandbox: Add device permissions such as /dev/null to cgroup	2021-09-13 11:04:51 +02:00
Samuel Ortiz	13b8bb0c74	runtime: Fix README link The LICENSE file lives in the project's root. Fixes #2612 Signed-off-by: Samuel Ortiz <s.ortiz@apple.com>	2021-09-11 09:44:40 +02:00
Anastassios Nanos	62baa48ef5	virtcontainers: fc: parse vcpuID correctly In getThreadIDs(), the cpuID variable is derived from a string that already contains a whitespace. As a result, strings.SplitAfter returns the cpuID with a leading space. This makes any go variant of string to int fail (strconv.ParseInt() in our case). This patch makes sure that the leading space character is removed so the string passed to strconv.ParseInt() is "CPUID" and not " CPUID". This has been caused by a change in the naming scheme of vcpu threads for Firecracker after v0.19.1. Fixes: #2592 Signed-off-by: Anastassios Nanos <ananos@nubificus.co.uk>	2021-09-10 09:39:56 +00:00
Bo Chen	f785ff0bf2	virtcontainers: clh: Revert the workaround incorrect default values Given the fix to the bugs of the openapi spec file is included in the Cloud Hypervisor v18.0 [1], this patch reverts the workaround we carried in the CLH driver. This reverts commit `932ee41b3f`. [1] https://github.com/cloud-hypervisor/cloud-hypervisor/pull/3029 Signed-off-by: Bo Chen <chen.bo@intel.com>	2021-09-09 14:52:53 -07:00
Bo Chen	0e0e59dc5f	virtcontainers: clh: Re-generate the client code This patch re-generates the client code for Cloud Hypervisor v18.0. Note: The client code of cloud-hypervisor's (CLH) OpenAPI is automatically generated by openapi-generator [1-2]. [1] https://github.com/OpenAPITools/openapi-generator [2] https://github.com/kata-containers/kata-containers/blob/main/src/runtime/virtcontainers/pkg/cloud-hypervisor/README.md Signed-off-by: Bo Chen <chen.bo@intel.com>	2021-09-09 14:51:55 -07:00
Amulyam24	d865c80986	virtcontainers: add unit tests for container.go Fixes: #268 Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2021-09-09 13:09:38 +05:30
Binbin Zhang	71f915c63f	sandbox: Add device permissions such as /dev/null to cgroup adds the default devices for unix such as /dev/null, /dev/urandom to the container's resource cgroup spec Fixes: #2539 Signed-off-by: Binbin Zhang <binbin36520@gmail.com>	2021-09-09 15:33:24 +08:00
bin	2abc450a4d	test: enable running tests under root user Add tests that run under root user to test special cases. Fixes: #2446 Signed-off-by: bin <bin@hyper.sh>	2021-09-09 14:21:34 +08:00
Julio Montes	9bbaa66f39	Merge pull request #2480 from Bevisy/main makefile: Fix error exit status code	2021-09-06 07:28:15 -05:00
Binbin Zhang	f5172d1c36	cli: Fix outdated kata-runtime bash completion adapt to the latest kata-runtime version Fixes: #2254 Signed-off-by: Binbin Zhang <binbin36520@gmail.com>	2021-09-04 22:26:44 +08:00
Bin Liu	103fdd3f6c	Merge pull request #2564 from Bevisy/main-2296 virtcontainers: Remove NewStoreFeature	2021-09-03 10:41:21 +08:00
James O. D. Hunt	f3a1bf3b45	Merge pull request #2552 from bergwolf/license license: drop redundant license files	2021-09-02 14:31:18 +01:00
Binbin Zhang	e2a9e78c9e	virtcontainers: Remove NewStoreFeature remove NewStoreFeature Fixes: #2296 Signed-off-by: Binbin Zhang <binbin36520@gmail.com>	2021-09-02 21:28:36 +08:00
Peng Tao	256c3b2747	license: drop redundant license files There is no need to keep multiple copies of the license file in different directories. We can just use the top level one for the project. Fixes: #2553 Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2021-09-01 15:10:04 +08:00
Hui Zhu	bcc9fa3b35	hotplugAddBlockDevice: Use ExecuteBlockdevAddWithDriverCache with swap Use ExecuteBlockdevAddWithDriverCache with swap in hotplugAddBlockDevice to handle the issue that a swap file cannot work with ExecuteBlockdevAddWithCache. Fixes: #2548 Signed-off-by: Hui Zhu <teawater@antfin.com>	2021-09-01 14:13:11 +08:00
Hui Zhu	bd85da0461	vendor: Update vendor/github.com/kata-containers/govmm Update vendor/github.com/kata-containers/govmm for ExecuteBlockdevAddWithDriverCache. Fixes: #2548 Signed-off-by: Hui Zhu <teawater@antfin.com>	2021-09-01 13:59:19 +08:00
Peng Tao	c0daa4ebff	Merge pull request #2513 from cmaf/tracing-tracingtags-consistency tracing: Change runtime tracing tags to vars	2021-08-31 10:25:10 +08:00
Fabiano Fidêncio	67d1f4fd14	Merge pull request #2528 from snir911/main_debuggabillity_sq shimv2: add logging to shimv2 api calls	2021-08-30 15:50:55 +02:00
Peng Tao	a9de761d71	runtime: drop qemu-lite support As the project is not maintained and we have not been testing against it for a long time. Fixes: #2529 Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2021-08-30 16:58:12 +08:00
Peng Tao	8ae3edbc18	runtime: fix default hypervisor path Should not be qemu-lite. Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2021-08-30 16:09:02 +08:00
Snir Sheriber	0c7789fad6	runtime: Add container field to logs and unified field naming Signed-off-by: Snir Sheriber <ssheribe@redhat.com>	2021-08-30 10:09:05 +03:00
Snir Sheriber	72e3538e36	shimv2: add information to method comment Add a comment to explicitly mention that the method is a binary call Signed-off-by: Snir Sheriber <ssheribe@redhat.com>	2021-08-30 10:09:05 +03:00
Snir Sheriber	8dadca9cd1	shimv2: add logging to shimv2 api calls and also fetch and log container id from the request Fixes: #2527 Signed-off-by: Snir Sheriber <ssheribe@redhat.com>	2021-08-30 10:09:05 +03:00
Bo Chen	b564dd47b6	Merge pull request #2526 from Bevisy/main-2285 runtime: delete types or const that no longer needed	2021-08-29 15:35:03 -07:00
Bin Liu	a89cc0bb5c	Merge pull request #2524 from Bevisy/main-2264 runtime: Optimize the way slice created	2021-08-29 16:00:08 +08:00
Eric Ernst	8771d8c375	Merge pull request #2514 from rapiz1/improve-util-test virtcontainers: simplify tests	2021-08-28 06:41:15 -07:00
Yujia Qiao	a99fcc3af1	virtcontainers: simplify tests Simplify tests in utils_test.go by table-driven tests. Fixes: #2281 Signed-off-by: Yujia Qiao <rapiz3142@gmail.com>	2021-08-28 12:35:25 +08:00
Binbin Zhang	39ffd8ee84	runtime: delete types or const that no longer needed type: ProcessListOptions; ProcessList const: SocketTypeVSOCK Fixes: #2285 Signed-off-by: Binbin Zhang <binbin36520@gmail.com>	2021-08-28 04:09:25 +00:00
Binbin Zhang	ff37f5c798	runtime: Optimize the way slice created Initialize and assign a value, reducing one append operation Fixes: #2264 Signed-off-by: Binbin Zhang <binbin36520@gmail.com>	2021-08-28 04:15:59 +08:00
Carlos Venegas	fb583780f6	Merge pull request #2488 from likebreath/0823/clh_openapi_generator virtcontainers: clh: Upgrade to the openapi-generator v5.2.1	2021-08-27 14:28:09 -05:00
Binbin Zhang	4751698829	virtcontainers: Fix incorrect scripts path modify to the correct relative path Fixes: #2515 Signed-off-by: Binbin Zhang <binbin36520@gmail.com>	2021-08-27 19:16:53 +00:00
Chelsea Mafrica	8f0f949abf	tracing: Move dynamically added attributes to Trace() Where possible, move attributes added with AddTag() to Trace() call to reduce the amount of code used for tracing. Fixes #2512 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2021-08-27 08:26:40 -07:00
Bo Chen	932ee41b3f	virtcontainers: clh: Workaround incorrect default values Two default values defined in the 'cloud-hypervisor.yaml' have typos, and this patch manually overwrites them with the correct values as a workaround until the corresponding fix lands in Cloud Hypervisor upstream. Signed-off-by: Bo Chen <chen.bo@intel.com>	2021-08-26 22:53:31 -07:00
Bo Chen	bff38e4f4d	virtcontainers: clh: Fix the unit test This patch fixes the unit tests over clh.go with the updated client code. Signed-off-by: Bo Chen <chen.bo@intel.com>	2021-08-26 22:53:17 -07:00
Bo Chen	d967d3cb37	virtcontainers: clh: Use constructors to ensure proper default value With the updated openapi-generator, the client code now handles optional attributes correctly and assigns the right default values. This patch uses those constructors to make sure the proper default values are used. Signed-off-by: Bo Chen <chen.bo@intel.com>	2021-08-26 22:53:13 -07:00
Chelsea Mafrica	87de26bda3	tracing: Modify Trace() to accept multiple tag maps The general Trace() function accepts one map as a set of tags. Modify it to accept multiple sets of tags so that additional ones can be added at Trace() and not as a subsequent call. Additionally, we should not iterate over the maps unless tracing is enabled. Fixes #2512 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2021-08-26 15:55:32 -07:00
Chelsea Mafrica	8058e97212	tracing: Change runtime tracing tags to vars Tracing tags are stored inconsistently throughout the runtime. Change all instances of tracing tags to variables. Fixes #2512 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2021-08-26 15:55:32 -07:00
Bo Chen	a6a2e525de	virtcontainers: clh: Migrate to use the updated client APIs The client code (and APIs) for Cloud Hypervisor has been changed dramatically due to the upgrade to `openapi-generator` v5.2.1. This patch migrates the Cloud Hypervisor driver in the kata-runtime to use those updated APIs. The main change from the client code is that it now uses "pointer" type to represent "optional" attributes from the input openapi specification file. Signed-off-by: Bo Chen <chen.bo@intel.com>	2021-08-26 14:04:18 -07:00
Chelsea Mafrica	0be91280f2	Merge pull request #2466 from Bl1tz23/main Fix version parsing for firecracker version 0.25 and over	2021-08-26 08:51:18 -07:00
wangyongchao.bj	2304f935b4	docs: update the GoDoc url from kata 1.x to 2.x The katatestutils GoDoc URL was still using the kata 1.x branch URL. This PR changes the URL from kata-containers/runtime/pkg/katatestutils to kata-containers/kata-containers/src/runtime/pkg/katatestutils Fixes: #2500 Signed-off-by: wangyongchao.bj <wangyongchao.bj@inspur.com>	2021-08-25 11:21:36 +08:00
Yujia Qiao	814cea9601	virtcontainers: clean up useless code Fixes: #2275 Signed-off-by: Yujia Qiao <rapiz3142@gmail.com>	2021-08-24 16:04:34 +08:00
Bo Chen	46eb07e14f	virtcontainers: clh: Re-generate the client code This patch re-generates the client code for Cloud Hypervisor with the updated `openapi-generator` v5.2.1. Signed-off-by: Bo Chen <chen.bo@intel.com>	2021-08-23 16:00:32 -07:00
Bo Chen	80fba4d637	virtcontainers: clh: Upgrade to the openapi-generator v5.2.1 To improve the quality and correctness of the auto-generated code, this patch upgrades the `openapi-generator` to its latest stable release v5.2.1. Fixes: #2487 Signed-off-by: Bo Chen <chen.bo@intel.com>	2021-08-23 15:59:41 -07:00
Bl1tz23	87bbae1bd7	fc: fix version parsing for fc >= 0.25 Allows to use firecracker version >=0.25. Fixes: #2471 Signed-off-by: Bl1tz23 <alex3angle@gmail.com>	2021-08-23 15:09:59 +03:00
Binbin Zhang	d422789fac	makefile: Fix error exit status code Generate `config-generated.go` file under src/runtime/cli/containerd-shim-kata-v2 before executing test or coverage. Fixes #2479 Signed-off-by: Binbin Zhang <binbin36520@gmail.com>	2021-08-23 11:31:33 +08:00
Fabiano Fidêncio	348795e282	Merge pull request #2233 from fgiudici/kata-monitor_liubin_cri use CRI in kata-monitor	2021-08-20 13:58:12 +02:00
Jack Rieck	7a5ffd4a0f	config: Enable jailer by default when using firecracker Now that we have enabled CI tests for jailed firecracker and we have fixed the issue with removing the block storage device #2387, we should leverage the full power of firecracker and enable jailer by default. Fixes: #2455 Signed-off-by: Jack Rieck <jack.rieck@sendgrid.com>	2021-08-17 19:22:09 -04:00
Chelsea Mafrica	9586d48254	tracing: Return context in runHooks() span creation The call to Trace() in runHooks() should return a context so that subsequent calls to runHook() produce properly ordered trace spans. Fixes #2423 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2021-08-12 10:09:56 -07:00
Eric Ernst	71f304ce17	agent: watcher: cleanup mount if needed when container is removed If a bind mount was created for watchable storage, make sure we remove when removing a container. Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-08-11 08:53:28 -07:00
Samuel Ortiz	f1a505dbfe	agent: Temporarily allow unknown linters Bump thiserror to 1.0.26 for vsock-exporter and work around a bug in Clippy nonstandard_macro_braces lint. (See https://github-redirect.dependabot.com/rust-lang/rust-clippy/issues/7422) Signed-off-by: Samuel Ortiz <samuel.e.ortiz@protonmail.com>	2021-08-11 08:53:28 -07:00
Eric Ernst	961aaff004	agent: watcher: fixes to make more robust inotify/watchable-mount changes... - Allow up to 16 files. It isn't that uncommon to have 3 files in a secret. In Kubernetes, this results in 9 files in the mount (the presented files, which are symlinks to the latest files, which are symlinks to actual files which are in a separate hidden directory on the mount). Bumping from eight to 16 will help ensure we can support "most" secret/tokens, and is still a pretty small number to scan... - Now we will only replace the watched storage with a bindmount if we observe that there are too many files or if it's too large. Since the scanning/updating is racy, we should expect that we'll occasionally run into errors (ie, a file deleted between scan / update). Rather than stopping and making a bind mount, continue updating, as the changes will be updated the next time check is called for that entry (every 2 seconds today). To facilitate the 'oversized' handling, we create specific errors for too large or too many files, and handle these specific errors when scanning the storage entry. - When handling an oversized mount, do not remove the prior files -- we'll just overwrite them with the bindmount. This'll help avoid the files disappearing from the user, avoid racy cleanup and simplifies the flow. Similarly, only mark it as a non-watched storage device after the bindmount is created successfully. - When creating bind mount, make sure destination exists. If we hadn't had a successful scan before, this wouldn't exist and the mount would fail. Update logic and unit test to cover this. - In several spots, we were returning when there was an error (both in scan and update). For update case, let's just log a warning and continue; since the scan/update is racy, we should expect that we'll have transient errors which should resolve the next time the watcher runs. Fixes: #2402 Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-08-11 08:52:51 -07:00
Fabiano Fidêncio	2aa686a0f5	Merge pull request #2409 from sameo/topic/agent agent: Fix cargo 1.54 clippy warning	2021-08-10 23:03:00 +02:00
wangyongchao.bj	99ab91df3d	docs: update the docs project url from kata 1.x to 2.x changed the document project url in the using-vpp-and-kata.md and runtime experimental README.md files. Fixes: #2418 Signed-off-by: wangyongchao.bj <wangyongchao.bj@inspur.com>	2021-08-10 13:51:54 +08:00
Fabiano Fidêncio	e1e6827a2c	Merge pull request #2388 from nubificus/fix_jailed_fc virtcontainers: fc: properly remove jailed block device	2021-08-10 00:17:18 +02:00
Samuel Ortiz	233b53c048	agent: Fix cargo 1.54 clippy warning Mostly the needless borrow one, plus a few others that are now enforced. Fixes #2408 Signed-off-by: Samuel Ortiz <samuel.e.ortiz@protonmail.com>	2021-08-05 18:41:55 +02:00
Francesco Giudici	2d8386ea52	kata-monitor: add few unit tests Add cri.go unit tests Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-08-05 11:41:54 +02:00
Francesco Giudici	8714a35063	kata-monitor: make code to identify kata pods simpler just search for the "kata" substring in the runtime value and log at info level when the runtime name/type is not found. Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-08-05 11:41:54 +02:00
Francesco Giudici	68a6f011b5	kata-monitor: drop the runtime info from the sandbox cache We keep the container engine info in the sandbox cache map, as the value associated to the pod id (the key). Since we used that in getMonitorAddress() only (which is gone) we can avoid storing that information. Let's drop it. Keep the map structure and the [put,delete]IfExists functions as we may want to move to an event based cache update process sooner or later, and we will need those. Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-08-05 11:41:54 +02:00
Francesco Giudici	97dcc5f78a	kata-monitor: drop getMonitorAddress() since the shim socket path is statically defined in the containerd-shimv2 code, we don't need to retrieve the socket name from the filesystem: construct the socket name using the containerd-shimv2 code. Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-08-05 11:41:54 +02:00
Francesco Giudici	0b03d97d0b	vendor: update vendors for kata-monitor kata-monitor switched from containerd client to CRI. Update the dependencies and vendored code. go mod tidy go mod vendor Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-08-05 11:41:54 +02:00
Francesco Giudici	c2f03e8993	kata-monitor: talk to the container engine via the CRI kata-monitor uses containerd client to retrieve information from the container engine. This makes kata-monitor work with the containerd container engine only. Bin Liu (bin <bin@hyper.sh>) worked on a kata-monitor version able to talk to any container engine leveraging the standard CRI[1]. Here, the original work of Bin Liu has been adapted on the current kata-monitor to make it container engine independent. [1] https://github.com/liubin/kata-containers/tree/fix/1030-use-cri-in-kata-monitor Fixes: #1030 Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-08-05 11:41:54 +02:00
Chelsea Mafrica	eac05ad6d6	Merge pull request #2375 from sameo/upstream/topic/process-cwd agent: Create the process CWD when it does not exist	2021-08-04 11:35:11 -07:00
Anastassios Nanos	64dd35ba4f	virtcontainers: fc: properly remove jailed block device When running a firecracker instance jailed, block devices are not removed correctly, as the jailerRoot path is not stripped from the PATCH command sent to the FC API. This patch differentiates the jailed case from the non-jailed one and allows the firecracker instance to be properly terminated. Fixes #2387 Signed-off-by: Anastassios Nanos <ananos@nubificus.co.uk>	2021-08-04 16:31:56 +00:00
Christophe de Dinechin	881b996443	agent: Make wording of error message match CRI-O test suite The CRI-O integration test suite has two tests that fail because they search for "not found" in the error message, but we emit "is not exist". Change the error message to match the expectations of the test suite. Fixes: #2036 Reported-by: Julien Ropé <jrope@redhat.com> Signed-off-by: Christophe de Dinechin <dinechin@redhat.com>	2021-08-04 09:33:09 +02:00
David Gibson	d5f85698e1	vendor: Update govmm Update to commit `3c64244cbb`, in particular to get these fixes which are needed to work with qemu-6.0 and later: https://github.com/kata-containers/govmm/pull/192 https://github.com/kata-containers/govmm/pull/194 Git log `d27256f` (qmp: Don't use deprecated 'props' field for object-add, 2021-08-03) `d8cdf9a` (qemu: Drop support for versions older than 5.0, 2021-08-03) `1b02192` (Use 'host_device' driver for blockdev backends, 2021-07-29) `9518675` (add support for "sandbox" feature to qemu, 2021-07-20) `335fa81` (qemu: fix golangci-lint errors, 2021-07-21) `61b6378` (.github/workflows: reimplement github actions CI, 2021-07-21) `9d6e797` (go: support go modules, 2021-07-21) `0d21263` (qemu: support read-only nvdimm, 2021-07-21) Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-08-04 15:04:30 +10:00
David Gibson	3165095669	runtime/qemu: Use explicit "on" for kernel_irqchip parameter Kata uses the 'kernel_irqchip' machine option to qemu. By default it uses it in what qemu calls the "short-form boolean" with no parameter. That style was deprecated by qemu between 5.2 and 6.0 (commit ccd3b3b8112b) and effectively removed entirely between 6.0 and 6.1 (commit d8fb7d0969d5). Update ourselves for newer qemus by using an explicit "kernel_irqchip=on". Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-08-04 14:34:11 +10:00
Carlos Venegas	27b9a68189	Merge pull request #2365 from sameo/topic/clh-tracing virtcontainers: clh: Do not use the default HTTP client	2021-08-03 12:54:09 -05:00
Hui Zhu	e6408fe670	Container: Add initConfigResourcesMemory and call it in newContainer The swappiness is not right if only io.katacontainers.container.resource.swappiness is set: $ pod_yaml=pod.yaml $ container_yaml=container.yaml $ image="quay.io/prometheus/busybox:latest" $ cat << EOF > "${pod_yaml}" metadata: name: busybox-sandbox1 EOF $ cat << EOF > "${container_yaml}" metadata: name: busybox-killed-vmm annotations: io.katacontainers.container.resource.swappiness: "100" image: image: "$image" command: - top EOF $ sudo crictl pull $image $ podid=$(sudo crictl runp $pod_yaml) $ cid=$(sudo crictl create $podid $container_yaml $pod_yaml) $ sudo crictl start $cid crictl exec $cid cat /sys/fs/cgroup/memory/memory.swappiness 60 The cause of this issue is that two elements store the resources information: c.config.Resources for calculateSandboxMemory and c.GetPatchedOCISpec() for the agent. This commit adds initConfigResourcesMemory to Container and calls it in newContainer to handle the issue. Fixes: #2372 Signed-off-by: Hui Zhu <teawater@antfin.com>	2021-08-02 16:02:12 +08:00
Fupan Li	fdc42ca7ff	Merge pull request #2324 from jongwu/ro_nv qemu/arm: remove nvdimm/"ReadOnly" option on arm64	2021-08-02 14:14:06 +08:00
Samuel Ortiz	49083bfa31	agent: Create the process CWD when it does not exist Although the OCI specification does not explicitly require that, we should create the process CWD if it does not exist, before chdir'ing to it. Without that fix, the kata-agent fails to create a container and returns a grpc error when it's trying to change the container's working directory to a non-existing folder. runc, the OCI runtime reference implementation, also creates the process CWD when it's not part of the container rootfs. Fixes #2374 Signed-off-by: Samuel Ortiz <samuel.e.ortiz@protonmail.com>	2021-08-01 04:27:03 +02:00
Hui Zhu	ee90affc18	newContainer: Initialize c.config.Resources.Memory if it is nil Container start fails if io.katacontainers.container.resource.swap_in_bytes and memory_limit_in_bytes are not set. $ pod_yaml=pod.yaml $ container_yaml=container.yaml $ image="quay.io/prometheus/busybox:latest" $ cat << EOF > "${pod_yaml}" metadata: name: busybox-sandbox1 EOF $ cat << EOF > "${container_yaml}" metadata: name: busybox-killed-vmm annotations: io.katacontainers.container.resource.swappiness: "60" image: image: "$image" command: - top EOF $ sudo crictl pull $image $ podid=$(sudo crictl runp $pod_yaml) $ cid=$(sudo crictl create $podid $container_yaml $pod_yaml) $ sudo crictl start $cid DEBU[0000] get runtime connection DEBU[0000] connect using endpoint 'unix:///var/run/containerd/containerd.sock' with '10s' timeout DEBU[0000] connected successfully using endpoint: unix:///var/run/containerd/containerd.sock DEBU[0000] StartContainerRequest: &StartContainerRequest{ContainerId:4fea91d16f661931fe33acd247efe831ef9e571588ba18b5a16f04c278fd61b8,} DEBU[0000] StartContainerResponse: nil FATA[0000] starting the container "4fea91d16f661931fe33acd247efe831ef9e571588ba18b5a16f04c278fd61b8": rpc error: code = Unknown desc = failed to create containerd task: failed to create shim: ttrpc: closed: unknown The cause of the failure is that, if c.config.Resources.Memory is nil, the values of io.katacontainers.container.resource.swappiness and io.katacontainers.container.resource.swap_in_bytes will still be stored in newContainer. This commit initializes c.config.Resources.Memory if it is nil in newContainer. Fixes: #2367 Signed-off-by: Hui Zhu <teawater@antfin.com>	2021-08-01 10:03:27 +08:00
Hui Zhu	767a41ce56	updateResources: Log result after calculateSandboxMemory Log result after calculateSandboxMemory in updateResources. Fixes: #2367 Signed-off-by: Hui Zhu <teawater@antfin.com>	2021-08-01 09:57:44 +08:00
Samuel Ortiz	760ec4e58a	virtcontainers: clh: Do not use the default HTTP client When enabling tracing with Cloud Hypervisor, we end up establishing 2 connections to 2 different HTTP servers: The Cloud Hypervisor API one that runs over a UNIX socket and the Jaeger endpoint running over UDP. Both connections use the default HTTP golang client instance, and thus share the same transport layer. As the Cloud Hypervisor implementation sets it up to be over a Unix socket, the jaeger uploader ends up going through that transport as well, and sending its spans to the Cloud Hypervisor API server. We fix that by giving the Cloud Hypervisor implementation its own HTTP client instance and we avoid sharing it with anything else in the shim. Fixes #2364 Signed-off-by: Samuel Ortiz <samuel.e.ortiz@protonmail.com>	2021-07-30 16:51:01 +02:00
James O. D. Hunt	4f0726bc49	docs: Remove table of contents Removed all TOCs now that GitHub auto-generates them. Also updated the documentation requirements doc removing the requirement to add a TOC. Fixes: #2022. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-07-30 10:58:22 +01:00
James O. D. Hunt	f186c5e284	docs: Fix invalid URLs Correct broken / stale URLs as detected by the CI URL checker. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-07-30 10:58:22 +01:00
Peng Tao	9514dda52e	mod: unity containerd dependency The old ones are carrying CVEs, do not use them. PS: In order to update the modules, we're running `make handle_vendor` target from the runtime's Makefile. This is now part of the CI and ensures that the vendored code is up-to-date. It's important to note that older versions of golang may generate different results for those, but those versions are not supported anymore, so we're good to go with what we have in the CI (1.15 and 1.16). Signed-off-by: Peng Tao <bergwolf@hyper.sh> Signed-off-by: Fabiano Fidêncio <fidencio@redhat.com>	2021-07-29 20:51:02 +02:00
Peng Tao	6ffe37b949	mod: unify runc dependency Since the old ones are carrying CVEs. Do not use them. PS: In order to update the modules, we're running `make handle_vendor` target from the runtime's Makefile. This is now part of the CI and ensures that the vendored code is up-to-date. It's important to note that older versions of golang may generate different results for those, but those versions are not supported anymore, so we're good to go with what we have in the CI (1.15 and 1.16). Fixes: #2338 Signed-off-by: Peng Tao <bergwolf@hyper.sh> Signed-off-by: Fabiano Fidêncio <fidencio@redhat.com>	2021-07-29 20:48:52 +02:00
Bo Chen	cc0bb9aebc	versions: Upgrade to Cloud Hypervisor v17.0 Highlights from the Cloud Hypervisor release v17.0: 1) ARM64 NUMA support using ACPI; 2) `Seccomp` support for MSHV backend; 3) Hotplug of macvtap devices; 4) Improved SGX support; 5) Inflight tracking for `vhost-user` devices; 6) Bug fixes. Details can be found: https://github.com/cloud-hypervisor/cloud-hypervisor/releases/tag/v17.0 Note: The client code of cloud-hypervisor's OpenAPI is automatically generated by `openapi-generator` [1-2]. As the API changes do not impact usages in Kata, no additional changes in kata's runtime are needed to work with the current version of cloud-hypervisor. [1] https://github.com/OpenAPITools/openapi-generator [2] https://github.com/kata-containers/kata-containers/blob/main/src/runtime/virtcontainers/pkg/cloud-hypervisor/README.md Fixes: #2333 Signed-off-by: Bo Chen <chen.bo@intel.com>	2021-07-27 11:56:29 -07:00
Fupan Li	838e169b9c	Merge pull request #2248 from lifupan/check_file_exist mount: fix the issue of missing check file exists	2021-07-27 23:29:26 +08:00
Jianyong Wu	77604de80b	qemu/arm: remove nvdimm/"ReadOnly" option on arm64 There is a new "ReadOnly" option added to the nvdimm device in qemu and now added to kata. However, the qemu used for arm64 is a little old and does not have this feature. Here we remove this feature for arm. Fixes: #2320 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2021-07-27 20:32:55 +08:00
Fabiano Fidêncio	9806e88963	Merge pull request #2319 from kata-containers/dependabot/go_modules/src/runtime/github.com/containerd/containerd-1.5.4 build(deps): bump github.com/containerd/containerd from 1.5.2 to 1.5.4 in /src/runtime	2021-07-27 08:49:50 +02:00
Gabriela Cervantes	4fbae549e4	docs: Update experimental documentation This PR updates the experimental documentation with the proper reference to kata 2.x Fixes #2317 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2021-07-26 20:29:21 +00:00
dependabot[bot]	07f7ad9d63	build(deps): bump github.com/containerd/containerd in /src/runtime Bumps [github.com/containerd/containerd](https://github.com/containerd/containerd) from 1.5.2 to 1.5.4. - [Release notes](https://github.com/containerd/containerd/releases) - [Changelog](https://github.com/containerd/containerd/blob/main/RELEASES.md) - [Commits](https://github.com/containerd/containerd/compare/v1.5.2...v1.5.4) --- updated-dependencies: - dependency-name: github.com/containerd/containerd dependency-type: direct:production ... Fixes: #2322 Signed-off-by: dependabot[bot] <support@github.com> Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2021-07-26 23:34:09 +08:00
Fabiano Fidêncio	116c29c897	cgroups: manager's Set() now takes Resources as its parameter Prior to our bump to runc 1.0.1 the manager's Set() would take a Config as its parameter. Now it takes the Resources directly. Signed-off-by: Fabiano Fidêncio <fidencio@redhat.com>	2021-07-26 11:34:27 +02:00
Fabiano Fidêncio	c0f801c0c4	rootless: RunningInUserNS() is now part of userns namespace Previously part of the "system" namespace, the RunningInUserNS() has been moved to the "userns" namespace. Signed-off-by: Fabiano Fidêncio <fidencio@redhat.com>	2021-07-26 11:34:23 +02:00
Fabiano Fidêncio	b5293c5214	runtime: update runc dependency to 1.0.1 Dependabot brought to our attention that we were still vendoring the runc code which was affected by CVE-2021-30465. Although the vulnerability doesn't seem to affect kata-containers, we better keep our dependencies up-to-date anyway. With this in mind, let's bump our runc dependency to the latest release. Fixes: #2309 Signed-off-by: Fabiano Fidêncio <fidencio@redhat.com>	2021-07-26 08:06:43 +02:00
Julio Montes	2859600a6f	runtime: virtcontainers: make rootfs image read-only Improve security by making rootfs image read-only, nobody will be able to modify it from the guest. fixes #1916 Signed-off-by: Julio Montes <julio.montes@intel.com>	2021-07-23 13:20:42 -05:00
Julio Montes	070590fb53	vendor: update govmm Bring read-only nvdimm support Shortlog: `335fa81` qemu: fix golangci-lint errors `61b6378` .github/workflows: reimplement github actions CI `9d6e797` go: support go modules `0d21263` qemu: support read-only nvdimm `ff34d28` qemu: Consistent parameter building Signed-off-by: Julio Montes <julio.montes@intel.com>	2021-07-22 08:47:44 -05:00
Chelsea Mafrica	b817340f94	Merge pull request #2282 from lifupan/main monitor: mv the monitor socket into sbs directory	2021-07-20 15:26:31 -07:00
Julio Montes	aec530904b	runtime: virtcontainers/utils: fix govet fieldalignment Fix structures alignment Signed-off-by: Julio Montes <julio.montes@intel.com>	2021-07-20 11:59:15 -05:00
Julio Montes	1e4f7faa77	runtime: virtcontainers/types: fix govet fieldalignment Fix structures alignment Signed-off-by: Julio Montes <julio.montes@intel.com>	2021-07-20 11:59:15 -05:00
Julio Montes	bb9495c0b7	runtime: virtcontainers/pkg: fix govet fieldalignment Fix structures alignment Signed-off-by: Julio Montes <julio.montes@intel.com>	2021-07-20 11:59:15 -05:00
Julio Montes	80ab91ac2f	runtime: virtcontainers/persist: fix govet fieldalignment Fix structures alignment Signed-off-by: Julio Montes <julio.montes@intel.com>	2021-07-20 11:59:15 -05:00
Julio Montes	54bdd01811	runtime: virtcontainers/factory: fix govet fieldalignment Fix structures alignment Signed-off-by: Julio Montes <julio.montes@intel.com>	2021-07-20 11:59:15 -05:00
Julio Montes	dd58de368d	runtime: virtcontainers/device: fix govet fieldalignment Fix structures alignment Signed-off-by: Julio Montes <julio.montes@intel.com>	2021-07-20 11:59:15 -05:00
Julio Montes	47d95dc1c6	runtime: virtcontainers: fix govet fieldalignment Fix structures alignment fixes #2271 Depends-on: github.com/kata-containers/tests#3727 Signed-off-by: Julio Montes <julio.montes@intel.com>	2021-07-20 11:59:15 -05:00
Julio Montes	8ca7a7c547	runtime: netmon: fix govet fieldalignment Fix structures alignment Signed-off-by: Julio Montes <julio.montes@intel.com>	2021-07-20 10:30:30 -05:00
Julio Montes	31de8eb75b	runtime: pkg: fix govet fieldalignment Fix structures alignment Signed-off-by: Julio Montes <julio.montes@intel.com>	2021-07-20 10:30:30 -05:00
Julio Montes	2b80091e14	runtime: containerd-shim-v2: fix govet fieldalignment Fix structures alignment Signed-off-by: Julio Montes <julio.montes@intel.com>	2021-07-20 10:30:30 -05:00
Julio Montes	0dc59df68f	runtime: cli: fix govet fieldalignment Fix structures alignment Signed-off-by: Julio Montes <julio.montes@intel.com>	2021-07-20 10:30:30 -05:00
Peng Tao	fd2607cc43	Merge pull request #2202 from teawater/swap7 Add swap support	2021-07-20 21:12:30 +08:00
fupan.lfp	add480ed59	monitor: mv the monitor socket into sbs directory Since the monitor socket uses a unix socket path file, which needs to be cleaned up after the pod terminates, put it into the sandbox data directory so that it is cleaned up once the sandbox is terminated. Fixes: #2269 Signed-off-by: fupan.lfp <fupan.lfp@antgroup.com>	2021-07-20 19:10:01 +08:00
Fabiano Fidêncio	75c5edd66a	Merge pull request #2263 from eryugey/eryugey/for-main agent: clear MsFlags if the option has clear flag set	2021-07-20 12:50:45 +02:00
Bin Liu	6b00806bb8	Merge pull request #2243 from egernst/bump-tokio agent/agent-ctl: update tokio to 1.8.1	2021-07-20 13:56:32 +08:00
Hui Zhu	cb6b7667cd	runtime: Add option "enable_guest_swap" to config hypervisor.qemu This commit adds the option "enable_guest_swap" to config hypervisor.qemu. It enables swap in the guest. Default false. When enable_guest_swap is enabled, a raw file is inserted into the guest as the swap device if the swappiness of a container (set by annotation "io.katacontainers.container.resource.swappiness") is bigger than 0. The size of the swap device should be swap_in_bytes (set by annotation "io.katacontainers.container.resource.swap_in_bytes") - memory_limit_in_bytes. If swap_in_bytes is not set, the size should be memory_limit_in_bytes. If neither swap_in_bytes nor memory_limit_in_bytes is set, the size should be default_memory. Fixes: #2201 Signed-off-by: Hui Zhu <teawater@antfin.com>	2021-07-19 23:22:06 +08:00
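The sizing rules stated in commit cb6b7667cd can be written out as a small Go function. This is a sketch under stated assumptions rather than the runtime's implementation: the function name `swapDeviceSize` is hypothetical, unset values are modelled as nil pointers, all three inputs are assumed to already be in bytes, and the case "swap_in_bytes set but memory_limit_in_bytes unset" is reported as an error because the commit message does not cover it.

```go
package main

import (
	"errors"
	"fmt"
)

// errUnspecified marks the one combination the commit message is silent on.
var errUnspecified = errors.New("swap_in_bytes set without memory_limit_in_bytes: not specified")

// swapDeviceSize applies the three rules from the commit message.
// A nil pointer means "annotation not set". All values are assumed to be
// in bytes (an assumption; the message does not state the unit of
// default_memory).
func swapDeviceSize(swapInBytes, memoryLimitInBytes *int64, defaultMemory int64) (int64, error) {
	switch {
	case swapInBytes != nil && memoryLimitInBytes != nil:
		// Rule 1: swap_in_bytes - memory_limit_in_bytes.
		return *swapInBytes - *memoryLimitInBytes, nil
	case swapInBytes == nil && memoryLimitInBytes != nil:
		// Rule 2: fall back to memory_limit_in_bytes.
		return *memoryLimitInBytes, nil
	case swapInBytes == nil && memoryLimitInBytes == nil:
		// Rule 3: fall back to default_memory.
		return defaultMemory, nil
	default:
		return 0, errUnspecified
	}
}

func main() {
	swap := int64(3 << 30)  // 3 GiB
	limit := int64(1 << 30) // 1 GiB
	def := int64(2 << 30)   // 2 GiB

	fmt.Println(swapDeviceSize(&swap, &limit, def))
	fmt.Println(swapDeviceSize(nil, &limit, def))
	fmt.Println(swapDeviceSize(nil, nil, def))
}
```

Per the commit message, a swap device is only added when the container's swappiness is bigger than 0, so a caller would check that before computing the size.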
Hui Zhu	a733f537e5	runtime: newContainer: Handle the annotations of SWAP This commit adds code to handle the annotations "io.katacontainers.container.resource.swappiness" and "io.katacontainers.container.resource.swap_in_bytes". It sets the value of "io.katacontainers.container.resource.swappiness" to c.config.Resources.Memory.Swappiness and the value of "io.katacontainers.container.resource.swap_in_bytes" to c.config.Resources.Memory.Swap. Fixes: #2201 Signed-off-by: Hui Zhu <teawater@antfin.com>	2021-07-19 23:20:46 +08:00
Hui Zhu	2c835b60ed	ContainerConfig: Set ocispec.Annotations to containerConfig.Annotations ocispec.Annotations is dropped in ContainerConfig. This commit let it to be set to containerConfig.Annotations in ContainerConfig. Fixes: #2201 Signed-off-by: Hui Zhu <teawater@antfin.com>	2021-07-19 23:20:43 +08:00
Hui Zhu	243d4b8689	runtime: Sandbox: Add addSwap and removeSwap addSwap creates a swap file, hotplugs it to the hypervisor as a special block device and lets the agent set it up in the guest kernel. removeSwap removes the swap file. Only QEMU supports addSwap. Fixes: #2201 Signed-off-by: Hui Zhu <teawater@antfin.com>	2021-07-19 23:20:40 +08:00
Hui Zhu	e1b91986d7	runtime: Update golang proto code for AddSwap Fixes: #2201 Signed-off-by: Hui Zhu <teawater@antfin.com>	2021-07-19 23:20:37 +08:00
Hui Zhu	4f066db8da	agent: agent.proto: Add AddSwap Add new function AddSwap. When the agent gets AddSwap, it gets the device name from PCIPath and sets the device as the swap device. Fixes: #2201 Signed-off-by: Hui Zhu <teawater@antfin.com>	2021-07-19 23:20:34 +08:00
Fabiano Fidêncio	11d84cca46	Merge pull request #2229 from lifupan/fix_virtiofsd virtiofsd: fix the issue of missing stop virtiofsd	2021-07-19 13:34:59 +02:00
Bin Liu	b94ebc30b4	Merge pull request #2235 from Tim-Zhang/vsock-exporter-async vsock-exporter: switch to tokio runtime	2021-07-19 17:06:14 +08:00
Fabiano Fidêncio	462e445d2f	Merge pull request #2261 from ManaSugi/fix/oci-hooks-explanation config: Fix description for OCI hooks	2021-07-19 10:38:16 +02:00
Fabiano Fidêncio	f8d71eb96b	Merge pull request #2253 from lifupan/fix_socket_address shimv2: fix the issue of kata-runtime exec failed	2021-07-19 10:38:06 +02:00
Eryu Guan	35cbc93dee	agent: clear MsFlags if the option has clear flag set 'FLAGS' hash map has bool to indicate if the flag should be cleared or not. But in parse_mount_flags_and_options() we set the flag even 'clear' is true. This results in a 'rw' mount being mounted as 'MS_RDONLY'. Fixes: #2262 Signed-off-by: Eryu Guan <eguan@linux.alibaba.com>	2021-07-19 11:50:10 +08:00
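The bug described in commit 35cbc93dee concerns the kata-agent, which is written in Rust; the following Go sketch only illustrates the logic and is not a port of the agent code. The table `flagTable`, the type `flagSpec` and the function `parseMountFlags` are hypothetical names. The constants mirror the Linux values of `MS_RDONLY` (1) and `MS_NOSUID` (2). An option whose entry has `clear` set must remove the bit instead of setting it, so `rw` never ends up as `MS_RDONLY`.

```go
package main

import "fmt"

// Values mirror the Linux MS_RDONLY and MS_NOSUID mount flags.
const (
	msRdonly uint64 = 1
	msNosuid uint64 = 2
)

// flagSpec pairs a flag bit with a marker saying whether the option
// clears the bit (e.g. "rw") or sets it (e.g. "ro").
type flagSpec struct {
	clear bool
	flag  uint64
}

// flagTable is a small, hypothetical subset of a mount option table.
var flagTable = map[string]flagSpec{
	"ro":     {clear: false, flag: msRdonly},
	"rw":     {clear: true, flag: msRdonly},
	"nosuid": {clear: false, flag: msNosuid},
	"suid":   {clear: true, flag: msNosuid},
}

// parseMountFlags folds known options into a flag word and returns the
// unknown options unchanged, in order.
func parseMountFlags(options []string) (uint64, []string) {
	var flags uint64
	var rest []string
	for _, opt := range options {
		spec, known := flagTable[opt]
		if !known {
			rest = append(rest, opt)
			continue
		}
		if spec.clear {
			flags &^= spec.flag // clear the bit; the bug was setting it here
		} else {
			flags |= spec.flag
		}
	}
	return flags, rest
}

func main() {
	flags, rest := parseMountFlags([]string{"rw", "nosuid", "size=64k"})
	fmt.Println(flags&msRdonly != 0, flags&msNosuid != 0, rest)
}
```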
Manabu Sugimoto	ff87da721b	config: Fix description for OCI hooks - Update url for osbuilder - Fix typo about poststart Fixes: #2260 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2021-07-18 16:47:19 +09:00
Fabiano Fidêncio	fcc93b0074	shim-v2: Be compatible with the old runtime options Seems that at least some versions of containerd, when using ConfigPath, still rely on the runtime options and its APIs from the no longer used github.com/containerd/cri-containerd/pkg/api/runtimeoptions/v1. The fact that backward compat breaks when moving from the old to the new runtime options, which happened as part of f60641a6e6d, strongly feels like a containerd bug. Regardless, we can easily work around this on our side without much hassle. Just by importing the old runtime options the unmarshalling doesn't break anymore, and we can easily check whether getting the options fails or not and fall back to the old way if it does. Fixes: #2258 Signed-off-by: Fabiano Fidêncio <fidencio@redhat.com>	2021-07-18 00:07:57 +02:00
fupan.lfp	8e0daf6780	shimv2: fix the issue of kata-runtime exec failed Commit `32c9ae1388` upgraded the containerd vendor, which uses the socket path to replace the abstract socket address for socket listen and dial, and there's a bug in containerd's abstract socket dialing. Thus we should replace our monitor and exec socket server with the socket path to fix this issue. Fixes: #2238 Signed-off-by: fupan.lfp <fupan.lfp@antgroup.com>	2021-07-16 11:41:09 +08:00
fupan.lfp	5371b9214f	mount: fix the issue of missing check file exists It's better to check whether the destination file exists before creating them, if it had been existed, then return directly. Fixes: #2247 Signed-off-by: fupan.lfp <fupan.lfp@antgroup.com>	2021-07-15 18:09:33 +08:00
Eric Ernst	acf6932863	agent: update tokio to 1.8.1 Update to latest tokio to address RUSTSEC-2021-0072: Task dropped in wrong thread when aborting `LocalSet` task Update the toml to specify just 1.x for the tokio version. Fixes: #2165 Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-07-14 17:18:21 -07:00
Fabiano Fidêncio	3a9ecbcca5	Merge pull request #2231 from liubin/fix/2230-register-defer-callback-at-early-stage runtime: Register defer function at early stage	2021-07-14 17:50:48 +02:00
fupan.lfp	34828df9a1	virtiofsd: fix the issue of missing stop virtiofsd The virtiofsd's PID wasn't assigned the right pid, which results in skipping killing it. Fixes: #2228 Signed-off-by: fupan.lfp <fupan.lfp@antgroup.com>	2021-07-14 21:07:10 +08:00
Tim Zhang	73d3798cb1	vsock-exporter: switch to tokio runtime Make the vsock-exporter async totally using tokio runtime. And delay the timing of the connection to trace-forwarder so that it is easy to reconnect when the connection was broken. Fixes: #2234 Signed-off-by: Tim Zhang <tim@hyper.sh>	2021-07-14 20:16:05 +08:00
Fabiano Fidêncio	f4fbf723e1	runtime: Update vendored code The go vendored code is not up-to-date and the newly added check for that caught this up as part of https://github.com/kata-containers/kata-containers/pull/2223/checks?check_run_id=3056830309 Let's take advantage of the `make vendor` target and update the vendored code. :-) Signed-off-by: Fabiano Fidêncio <fidencio@redhat.com>	2021-07-14 13:59:41 +02:00
Fabiano Fidêncio	5e69b498ed	trace-forwarder: Add `make vendor` This has a similar intent as the go code, but not totally equal. For the go code we want to ensure that the vendored code is up-to-date, while here we want to ensure that `cargo vendor` actually works. We happened to release a few tarballs where `cargo vendor` didn't work and it causes some pain for downstream maintainers. Related: #2159 Signed-off-by: Fabiano Fidêncio <fidencio@redhat.com>	2021-07-14 13:59:41 +02:00
Fabiano Fidêncio	a104f13230	agent: Add `make vendor` This has a similar intent as the go code, but not totally equal. For the go code we want to ensure that the vendored code is up-to-date, while here we want to ensure that `cargo vendor` actually works. We happened to release a few tarballs where `cargo vendor` didn't work and it causes some pain for downstream maintainers. Related: #2159 Signed-off-by: Fabiano Fidêncio <fidencio@redhat.com>	2021-07-14 13:59:41 +02:00
Fabiano Fidêncio	579b3f34c2	runtime: Add `make vendor` Let's add this target so we can actually enforce, as part of the static checks (which will be added in a follow-up commit), that our vendored go code is up-to-date. Related: #2159 Signed-off-by: Fabiano Fidêncio <fidencio@redhat.com>	2021-07-14 13:59:40 +02:00
Fabiano Fidêncio	930ca55d02	runtime: Add `make handle_vendor` This will help us to ensure that we always update the vendored code when needed. Right now we've been lagging behind and we tend to realise something changed during the next mandatory update, which is not exactly optimal. Related: #2159 Signed-off-by: Fabiano Fidêncio <fidencio@redhat.com>	2021-07-14 13:59:40 +02:00
bin	39546a1070	runtime: delete not used functions Delete some not used functions in sandbox.go Fixes: #2230 Signed-off-by: bin <bin@hyper.sh>	2021-07-14 19:42:50 +08:00
bin	d0bc148fe0	runtime: Register defer function at early stage Registering the defer function at an early stage ensures that it is called if startSandbox fails. Fixes: #2230 Signed-off-by: bin <bin@hyper.sh>	2021-07-14 17:20:53 +08:00
Tim Zhang	7960689ef7	tracing: replace SimpleSpanProcessor with BatchSpanProcessor This change make tokio could be use in vsock-exporter. Signed-off-by: Tim Zhang <tim@hyper.sh>	2021-07-14 15:59:52 +08:00
bin	350acb2d6e	virtcontainers: refactoring code for error handling in sandbox Use a defined error variable to replace in-place errors, and add a shortcut for handling errors returned from function calls. Fixes: #2187 Signed-off-by: bin <bin@hyper.sh>	2021-07-14 14:28:58 +08:00
bin	858f39ef75	virtcontainers: update wrong comments for code Some comments/URL are old or wrong, update them to the correct ones. Fixes: #2187 Signed-off-by: bin <bin@hyper.sh>	2021-07-14 14:28:57 +08:00
bin	e0a19f6a16	virtcontainers: update API documentation Some functions add context as its first parameter, the documentation should update. Fixes: #2187 Signed-off-by: bin <bin@hyper.sh>	2021-07-14 14:28:57 +08:00
Fabiano Fidêncio	8c4dd3b421	Merge pull request #2199 from Tim-Zhang/tracing-enhance trace-forwarder: Add option rustflags, target, build-type for the make	2021-07-13 10:16:21 +02:00
Tim Zhang	6999dccaa8	trace-forwarder: Add option rustflags, target, build-type for the make Support rust-flags, target and build-type. Fixes: #2215 Signed-off-by: Tim Zhang <tim@hyper.sh>	2021-07-13 11:35:46 +08:00
Eric Ernst	feeb1ef8b1	Merge pull request #2212 from lifupan/fix_virtiofsd qemu: stop the virtiofsd specifically	2021-07-12 13:56:04 -07:00
Chelsea Mafrica	61b1a6732b	Merge pull request #2179 from bporter816/bporter816/refactor-tracing tracing: Consolidate tracing into a new katatrace package	2021-07-12 12:42:01 -04:00
bin	9081bee2fd	runtime: return error if clh's binary has not a normal stat When checking whether clh's binary path is valid, return an error even if the error is not an IsNotExist error. Also add errors to the log fields when errors occur. Fixes: #2208 Signed-off-by: bin <bin@hyper.sh>	2021-07-12 11:16:35 +08:00
Benjamin Porter	b10e3e22b5	tracing: Consolidate tracing into a new katatrace package Removes custom trace functions defined across the repo and creates a single trace function in a new katatrace package. Also moves span tag management into this package and provides a function to dynamically add a tag at runtime, such as a container id, etc. Fixes #1162 Signed-off-by: Benjamin Porter <bporter816@gmail.com>	2021-07-11 14:19:51 -05:00
David Gibson	1ab72518b3	agent: Fix to parsing of /proc/self/mountinfo get_mounts() parses /proc/self/mountinfo in order to get the mountpoints for various cgroup filesystems. One of the entries in mountinfo is the "device" for each filesystem, but for virtual filesystems like /proc, /sys and cgroups, the device entry is arbitrary. Depending on the exact rootfs setup, it can end up being "-". This breaks get_mounts() because it uses " - " as a separator. There really is a " - " separator in mountinfo, but in this case the device entry shows up as a second one. Fix this, by changing a split to a splitn, which will effectively only consider the first " - " in the line. While we're there, make the warning message more useful, by having it actually show which line it wasn't able to parse. fixes #2182 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-07-10 19:30:27 +10:00
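The `split` versus `splitn` fix described in commit 1ab72518b3 was made in the Rust agent; the Go sketch below shows the same idea with `strings.SplitN` and is only an illustration. The function name `splitMountinfo` and the sample line are hypothetical. Limiting the split to two parts means only the first " - " is treated as the separator, so a device field that is itself "-" stays inside the second part.

```go
package main

import (
	"fmt"
	"strings"
)

// splitMountinfo cuts one /proc/self/mountinfo line at the first " - "
// only. ok is false when the line has no separator at all.
func splitMountinfo(line string) (before, after string, ok bool) {
	parts := strings.SplitN(line, " - ", 2)
	if len(parts) != 2 {
		return "", "", false
	}
	return parts[0], parts[1], true
}

func main() {
	// Hypothetical line whose device field (second field after the
	// separator) is "-", which yields a second " - " in the text.
	line := "31 25 0:27 / /sys/fs/cgroup/memory rw,nosuid - cgroup - rw,memory"

	before, after, ok := splitMountinfo(line)
	if !ok {
		// Show the offending line, as the commit message suggests.
		fmt.Printf("cannot parse mountinfo line: %q\n", line)
		return
	}
	fmt.Println("mount point:", strings.Fields(before)[4])
	fmt.Println("fstype, device, options:", strings.Fields(after))
}
```

An unlimited split on the same line would produce three parts, which is why code expecting exactly two would reject it.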
fupan.lfp	8f76626fd6	qemu: stop the virtiofsd specifically We'd better stop the virtiofsd specifically after stop qemu, instead of depending on the qemu's termination to notify virtiofsd to exit. Fixes: #2211 Signed-off-by: fupan.lfp <fupan.lfp@antgroup.com>	2021-07-10 17:26:19 +08:00
Fabiano Fidêncio	da3de3c2eb	shim-v2: Fix `gosimple` issue on utils_test.go For some reason our static check started to get opinionated about code that's been there for ages. One of the suggestions is to improve: ``` INFO: Running golangci-lint on /home/fidencio/go/src/github.com/kata-containers/kata-containers/src/runtime/containerd-shim-v2 utils_test.go:76:36: S1039: unnecessary use of fmt.Sprintf (gosimple) testDir, err = ioutil.TempDir("", fmt.Sprintf("shimV2-")) ``` And that's what this PR is about. Fixes: #2204 Signed-off-by: Fabiano Fidêncio <fidencio@redhat.com>	2021-07-09 17:24:51 +02:00
Fabiano Fidêncio	305fb0547d	virtcontainers: Fix `gosimple` issue on client.go For some reason our static check started to get opinionated about code that's been there for ages. One of the suggestions is to improve: ``` INFO: Running golangci-lint on /home/fidencio/go/src/github.com/kata-containers/kata-containers/src/runtime/virtcontainers/pkg/agent/protocols/client client.go:431:2: S1017: should replace this `if` statement with an unconditional `strings.TrimPrefix` (gosimple) if strings.HasPrefix(sock, "mock:") { ``` And that's what this PR is about. Signed-off-by: Fabiano Fidêncio <fidencio@redhat.com>	2021-07-09 17:18:08 +02:00
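The gosimple S1017 suggestion quoted in commit 305fb0547d can be illustrated with a before/after pair. The original client.go code is not shown in this log, so both functions below are hypothetical shapes; only the `"mock:"` prefix comes from the quoted lint output. `strings.TrimPrefix` already returns its input unchanged when the prefix is absent, which makes the surrounding `if` redundant.

```go
package main

import (
	"fmt"
	"strings"
)

// stripMockGuarded is the shape the linter complains about: the
// HasPrefix check duplicates what TrimPrefix does internally.
func stripMockGuarded(sock string) string {
	if strings.HasPrefix(sock, "mock:") {
		sock = strings.TrimPrefix(sock, "mock:")
	}
	return sock
}

// stripMock is the simplified form: an unconditional TrimPrefix.
func stripMock(sock string) string {
	return strings.TrimPrefix(sock, "mock:")
}

func main() {
	for _, s := range []string{"mock:/tmp/agent.sock", "unix:///tmp/agent.sock"} {
		fmt.Println(stripMockGuarded(s) == stripMock(s), stripMock(s))
	}
}
```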
Fabiano Fidêncio	89cf168c92	virtcontainers: Ignore a staticcheck error on cpuset.go First of all, cpuset.go just comes from kubernetes and we shouldn't be doing much with this file apart from updating it every now and then (but that's material for another PR). Right now, due to some change on the static checks we use as part of our CI, we started getting issues as: ``` INFO: Running golangci-lint on /home/fidencio/go/src/github.com/kata-containers/kata-containers/src/runtime/virtcontainers/pkg/cpuset cpuset.go:60:2: SA4005: ineffective assignment to field Builder.done (staticcheck) b.done = true ``` For those, let's just ignore the lint and move on. Signed-off-by: Fabiano Fidêncio <fidencio@redhat.com>	2021-07-09 17:17:12 +02:00
Jakob Naucke	9577e54e2a	Merge pull request #2168 from Jakob-Naucke/fix-cc-suse-s390x runtime: Use CC=gcc on all RPM-based s390x	2021-07-09 11:07:35 +02:00
Jakob Naucke	e8ec18a9d8	Merge pull request #2027 from Jakob-Naucke/virtio-blk-ccw s390x: Enable virtio-blk-ccw	2021-07-08 18:22:44 +02:00
Jakob Naucke	28b2c629e3	runtime: Use CC=gcc on SUSE s390x too This setting is required, as it is on Fedora-likes. Fixes: #2167 Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2021-07-08 15:01:32 +02:00
Jakob Naucke	cfd690b638	virtcontainers: Use virtio-blk-ccw on s390x if virtio-blk-pci were to be used Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2021-07-08 14:59:47 +02:00
Jakob Naucke	8758ce26b7	agent: Enable virtio-blk-ccw Forward-port of https://github.com/kata-containers/agent/pull/600. Enable virtio-blk-ccw devices in agent (virtio-blk for s390x, already enabled in runtime). Fixes: #2026 Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2021-07-08 14:59:47 +02:00
James O. D. Hunt	a33d6bae63	forwarder: Add dump only option Added a `--dump-only` option which disables forwarding of trace spans. This essentially makes the forwarder a NOP but can be useful for testing purposes. Fixes: #2132. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-07-08 13:12:17 +01:00
Fabiano Fidêncio	432296ae7a	Merge pull request #2197 from lifupan/fix_leak_hypervisor shimv2: fix the issue of leaking the hypervisor processes	2021-07-08 13:49:37 +02:00
fupan.lfp	4c809a53d2	shimv2: fix the issue of leaking the hypervisor processes Since we only send a shutdown QMP command to qemu in stopSandbox and do not wait for the qemu process to exit, we should make sure it has exited by the time shimv2 terminates. So do a last cleanup of the hypervisor here. Fixes: #2198 Signed-off-by: fupan.lfp <fupan.lfp@antgroup.com>	2021-07-08 15:43:58 +08:00
Bo Chen	d08603bebb	runtime: Remove the version check for cloud hypervisor It looks like the version check for cloud hypervisor (clh) was added initially when clh was actively evolving its API. We no longer need the version check as clh API has been fairly stable for its recent releases. Fixes: #1991 Signed-off-by: Bo Chen <chen.bo@intel.com>	2021-07-06 18:42:59 -07:00
Tim Zhang	3f1aa8ff91	Merge pull request #2084 from liubin/fix/2082-refactor-vc-pkg-oci runtime: refact virtcontainers/pkg/oci	2021-07-06 19:14:10 +08:00
Bin Liu	26985bbfff	Merge pull request #2173 from Tim-Zhang/enhance-test-execute-hook agent: enhance tests of execute_hook	2021-07-05 14:36:45 +08:00
Fabiano Fidêncio	015b3baf06	Merge pull request #2178 from mxpv/config agent: Cleanup config	2021-07-03 09:51:16 +02:00
Fupan Li	2de9c5b41d	Merge pull request #1969 from liubin/feature/1968-pass-span-context-to-agent Pass span context from runtime to agent to get a full trace #1968	2021-07-03 09:31:02 +08:00
Maksym Pavlenko	e6b1766f6b	agent: Cleanup config This commit cleans up the config parsing and testing code to make it a bit easier to maintain. - Adds `with_context` from anyhow to include the underlying error. This helps to understand what exactly went wrong. - Uses ensure and bail as a shorter alternative for `if` checks. - TestData in test_parse_cmdline now implements Default to reduce boilerplate code - Removes `make_err` as it doesn’t make any sense. Fixes: #2177 Signed-off-by: Maksym Pavlenko <pavlenko.maksym@gmail.com>	2021-07-02 14:28:43 -07:00
Tim Zhang	55c5c871d2	agent: enhance tests of execute_hook Use `which` to find the full path of the executable before running execute_hook, to avoid the error 'No such file or directory'. Fixes: #2172 Signed-off-by: Tim Zhang <tim@hyper.sh>	2021-07-02 14:30:56 +08:00
bin	bd5951247c	runtime: add spans and attributes for agent/mount Add more spans and attributes for agent setup, add devices, and mount volumes. Fixes: #1968 Signed-off-by: bin <bin@hyper.sh>	2021-07-02 10:07:28 +08:00
bin	65d2fb5d11	agent: remove instrument attribute for some simple functions Simple functions that only process in-memory data (list/hashmap) don't need to be instrumented. Sometimes they may also generate non-parent spans if they are called from daemon-style "threads". Fixes: #1968 Signed-off-by: bin <bin@hyper.sh>	2021-07-02 10:07:28 +08:00
bin	cfb8139f36	agent: add more instruments for RPC calls All RPC calls can get parent span context, and create new sub-spans for the full trace. Fixes: #1968 Signed-off-by: bin <bin@hyper.sh>	2021-07-02 10:07:28 +08:00
bin	ae46e7bf97	runtime: pass span context to agent in ttRPC client Pass the span context through ttRPC metadata so that the agent can get the parent from the context and create new sub-spans. Fixes: #1968 Signed-off-by: bin <bin@hyper.sh>	2021-07-02 10:07:14 +08:00
Fabiano Fidêncio	3fe0af6a9b	Merge pull request #2152 from liubin/fix/2111-update-netlink-libs agent: update netlink libraries	2021-07-01 12:01:35 +02:00
bin	66dd8719e3	runtime: refact virtcontainers/pkg/oci Use common functions wrapping logic of getting values from annotations, parsing bool/uint32/uint64 and setting to struct fields. Fixes: #2082 Signed-off-by: bin <bin@hyper.sh>	2021-07-01 10:14:47 +08:00
fupan.lfp	d671f78952	agent: fix the issue of convert OCI spec to RPC spec Since the RPC spec uses an interface to represent ErrnoRet, the OCItoGRPC transform function should take care of this case. Depends-on: github.com/kata-containers/tests#3629 Fixes: #1441 Signed-off-by: fupan.lfp <fupan.lfp@antgroup.com>	2021-06-30 22:56:59 +08:00
fupan.lfp	f607641a6e	shimv2: fix the issue bring by updating containerd vendor Fix the mismatch brought by upgrading the vendored containerd, cgroup and runtime spec. Fixes: #1441 Signed-off-by: fupan.lfp <fupan.lfp@antgroup.com>	2021-06-30 22:56:51 +08:00
fupan.lfp	32c9ae1388	shimv2: update containerd vendor Since the latest containerd's shimv2 changed the socket from an abstract unix socket to a path unix socket, we should update the vendor to match the latest containerd. containerd from v1.3.9, v1.4.3 and v1.5.0 uses the path unix socket instead of the abstract socket, so kata won't support containerd versions older than these. Fixes: #1441 short logs: 15d9703d6 Remove ARM64 releases from release notes 5d2e8e86d Revert "Release artifacts for Linux ARM64" 7942ae68b Revert "Specify seccomp target arch for CC" 3187b6dc8 tests: Adds consumed memory stats test 969ec8949 Specify seccomp target arch for CC c19b7b64d RELEASES.md: recommend alternatives for deprecated features 8a62aa1c3 Deprecate built-in aufs snapshotter 4e7915f80 CI: allow Go 1.13 for Docker/Moby compatibility 8e589e873 Vagrantfile: update to Fedora 34 5847340a7 tests: Refactors container image usage 9f43eade6 Prepare v1.5.0-rc.3 release notes 4c7b960cb prow needs some additional setup for docker buildx 2e4c1d4b7 Use the multi-arch version of the test images 4e00c4b65 integration tests needs lsof 177273680 Add script to build test images 1b5d59dfe Add multi-arch support for test images 78e529727 add integration tests 2b0e6cdd4 Separate jobs for build and test for openlab/arm64 cdd075853 Release artifacts for Linux ARM64 efcb18742 Add unit tests for PID NamespaceMode_TARGET validation b48f27df6 Support PID NamespaceMode_TARGET 909660ea9 process: use the unbuffered channel as the done signal 0f332dadd Update cgroups for regenerated protos 391b123a5 adds quiet option for ref ab1654d0e Fix PushHandler cannot push image that contains duplicated blobs 00f8d32ef add not found debug out for check cmd; update usage 55734b1c5 Prepare 1.5.0-rc.2 release notes 3ef337ae3 Update containerd vendors to tags fbe1e140f Update Go to 1.16.3 c1d1edbad gha: use sudo -E in some places to prevent dropping env-vars 7966a6652 Cleanup code 5d79d3adb go.mod: 
update kubernetes to v1.20.6 1c03c377e go.mod: github.com/containerd/fifo v1.0.0 12a2a2108 go.mod: github.com/google/uuid v1.2.0 3292ea586 pkg/seccomp: use sync.Once to speed up IsEnabled 00b5c99b1 pkg/seccomp: simplify IsEnabled, update doc 6dd29c25f go.mod: github.com/containerd/aufs 330a2a809 go.mod: github.com/containerd/zfs 34780d67a runtime/shim: check the namespace flag first c3dde8c4b freebsd: add zfs to the default plugins b431fe4fc freebsd: don't run shim delete in deleted dir 1f4192daf freebsd: exclude v1 runtimes cb1580937 metadata: improve deleting a non-empty namespace's error message 5bf84034d Remove junit test result processor b83d04f91 Add variable names to runtime's interface definitions 993b86399 Add shim start opts 9e576b889 Optimize backoff 5c02688b5 converter: use OpenWriter helper function fcf3b275f Add lock for ListPids fdb76f55d Fix backword-compatibility issue of non-versioned config file d21fe4625 adds log for each failed host and status not found on host 8a4cbabc6 Reimport windows layers when comitting snapshots 2de38a926 fix(windows): create debug npipe failure 41fc516a2 docs/rootless.md: recommend "easy way" over "hard way" 864a3322b go.mod: github.com/containerd/go-cni v1.0.2 ee34caccb go.mod: github.com/Microsoft/go-winio v0.4.17 d478676d3 go.mod: github.com/containerd/imgcrypt v1.1.1 1dd45d51c go.mod: github.com/containerd/typeurl v1.0.2 abd4be07a fix the 404 url 978ebbef6 Prepare 1.5.0-rc.1 release ce116d4c5 go.mod: github.com/containerd/imgcrypt v1.1.1-0.20210412181126-0bed51b9522c 0550c3233 containerd-stress: add snapshotter option for stress test to use 8a04bd052 address recent runtimes config confusion c4778fe1b go.mod: github.com/containernetworking/plugins v0.9.1 5ce35ac39 devmapper: log pool status when mkfs fails 75097b8ca hcsshim seems to have been updated 9ad087947 Switch all our tests to version 2 e96d2a5d9 Revert "remove two very old no longer used runtime options" 14f357b90 CI: update crun to 0.19 294331060 go.mod: 
github.com/containerd/console v1.0.2 bb6c0c2de Add more bolt utils 0ad8c0a16 Decouple shim start from task creation c7504987e Implement windowsDiff.Compare via hcsshim/pkg/ociwclayer a64a76846 Replace inline applyWindowsLayer using hcsshim 149fa366f Don't tease the logger with a %-less format string b399e2ef6 Don't lose Compare failure if aborting diff upload fails 36bf3f0e8 go.mod: github.com/Microsoft/hcsshim v0.8.16 8e1a8ecd8 Prepare v1.5.0-rc.0 45df696bf Fix return event publishing error 4bc8f692f optimize cri redirect logs 9bc8d63c9 cri/server: use containerd/oci instead of libcontainer/devices dd16b006e merge in the move to the new options type 9144ce967 shows our runc.v2 default options in the containerd default config 3d20fa930 fix TestSetOOMScoreBoundaries 4d4117415 Change CRI config runtime options type 21ebeef74 integration: use busybox:1.32.0 since latest is unavailable f9bcf4a8a add section link d4be6aa8f rm mirror defaults; doc registry deprecations 7bb73da6b runtime/v2/shim: remove unused SetScore() and remove sys.OOMScoreMaxKillable 91e7d21ee sys: add AdjustOOMScore() utility 44240116a sys: add boundary checks to SetOOMScore() ace1912bb sys: use assert for error checks in OOM tests 6e7271522 sys: add missing pre-condition checks in tests badd60d3f sys: un-export runningPrivileged(), remove runningUnprivileged() 21a175860 go.mod github.com/klauspost/compress v1.11.13 58c5fd09e re-enable cri test da998c81e move to gcr.io/k8s-staging-cri-tools test images 8ba8533bd pkg/cri/opts.WithoutRunMount -> oci.WithoutRunMount 92ea98eda cri-cni-release: add imgcrypt binaries (v1.1.0) 4c1fa5719 remotes/docker: Only return "already exists" on push when the upload was successful 0186a329e remove two very old no longer used runtime options 58a07754a Temporarily disable cri-tools critest 7ae0a60fb Add OCI ref.name to unique key in remotes handler 5ada2f74a Keep host order as defined in TOML file d9ff8ebef support multi-arch images for windows via ctr af1e2af72 ci: 
upload junit formatted test results 6866b36ab Add workaround to keep docker hosts structs private c54d92c79 image: use generic decompressor for calculating DiffID 1faca349e integration/client: rename package to "client" 6fc9e4500 synchronize replace rules in integration/client go.mod with main go.mod 9e19a2984 Fix hosts test on Windows 3f406d4af Cleanup vendor d56b49c13 Rewrite Docker hosts parser e1f51ba73 Use os.File#Seek() to get the size of a block device ddd4298a1 Migrate current TOML code to github.com/pelletier/go-toml 499c2f7d4 Vendor github.com/pelletier/go-toml 61c749036 integration/util: remove dependency on k8s.io/klog/v2 d9765f7bf Extend default timeout for nested VM integration run 5e94745f2 ctr: add --user for task exec f8c2f0475 remotes/ctr: allow to limit max concurrent uploads like downloads 4674ad7be Ignore some tests on darwin 55450e773 Run unit tests on CI for MacOS 311e326a1 Add CI job to cross compile all the things 10a498c7c Update go-winio to fix compile error on armv7 1a9c6f557 Revendor zfs to to fix integer overflow 1fd3d12f9 `go mod tidy` the client integration test module da7d96ba3 Clean up WCOW layers after tests in the correct order 9ad87b9ba adds critools-version 72b7f4bab task: allow checkpoint on pause state e4b9b1038 Make CRI registry docs more clear ec4d7736d Increase timeout for linux integration tests eb7c7c71e Fix oom tests on non Linux 708299ca4 Move RunningInUserNS() to its own package 0886ceaea Fix reference ordering in CRI image store bf9db47e8 add caller info to the testHook 305b42583 use happy-eyeballs for port-forwarding 22ef69d77 Support HTTP debug in ctr 01765d097 night ci fix: add packages for ubuntu 20.04 8cdc1f13b go.mod: github.com/containerd/zfs v0.0.0-20210322090317-0e92c2247fb7 30e1e66e5 runtime/v2: Fix defer cleanup 33776ada0 Use specific image for user namespaces tests 7704fe72d Specifically mention "mkfs.ext4" on the error from the command 1410220d8 Fix error log when copy file fe787efa2 Fix error log when 
kill shim 8d8c15ca5 contentproxy: ensure grpc stream is closed on commit 6e343f25e Switch test image to a non rate-limited manifest list 9fdc96c09 runtime/v2: add comment for checkCopyShimLogError 24602e7a9 change default runtime for containerd-stress app 8731888ec Re-enable CRIU tests by not using overlayfs snapshotter b520428b5 Fix CRIU 4e76bcf06 gofmt -s -w all the things 569023fd5 go.mod: github.com/containerd/nri v0.0.0-20210316161719-dbaa18c31c14 0e1f59e89 go.mod: github.com/containerd/zfs v0.0.0-20210315114300-dde8f0fda960 ffff68866 upgrade pause image to 3.5 for non-root 88d3881e1 go.mod: github.com/containerd/fifo v0.0.0-20210316144830-115abcc95a1d a22c43fa4 go.mod: github.com/containerd/aufs v0.0.0-20210316121734-20793ff83c97 f6f861736 go.mod: github.com/containerd/btrfs v0.0.0-20210316141732-918d888fb676 460b35236 go.mod: kubernetes v1.20.4 5e484c961 runtime/v2/runc: fix the defer cleanup of the NewContainer e6086d9c0 Prepare release notes for v1.5.0-beta.4 34b7a5f09 Update mailmap ba8f9845e move overlay-checks to an overlayutils package 7776e5ef2 Support adding devices by dir d895118c7 runtime/v2/runc: fix leaking socket path a76cefd12 plugin status should be skip, not error 766e7953a Change dgst to digest in debug 4e8b2f309 rootfs: fix the error handling of the createInitLayer d3ad7f390 cmd/ctr: use e.g. in the command usage 231bbdc37 cmd/ctr: fix export command ecb881e5e add imgcrypt stream processors to the default config ac2726e12 cmd/containerd: deduplicate config*.go 9a7ca39cb defaults: add DefaultConfigDir 8f863afd3 Use net.IP.IsLoopback() to match loopback addresses eabd9b98b runtime: ignore file-already-closed error if dead shim Signed-off-by: fupan.lfp <fupan.lfp@antgroup.com>	2021-06-30 22:53:24 +08:00
Eric Ernst	d0ad388721	Merge pull request #2065 from ManaSugi/format-golang-proto runtime: Format golang proto code	2021-06-30 11:08:57 -07:00
Fabiano Fidêncio	550029c473	Merge pull request #2060 from liubin/2059/delete-some-lint-attributes agent: delete some lint attributes	2021-06-30 16:51:07 +02:00
bin	aa264f915f	agent: update netlink libraries Update rtnetlink to use crate.io to make cargo vendor work. Add vendor/ to .gitignore. Fixes: #2111 Signed-off-by: bin <bin@hyper.sh>	2021-06-30 22:39:50 +08:00
Fabiano Fidêncio	7d37fbfdfb	Merge pull request #2115 from sameo/topic/rust-nix cargo: Use latest nix crate for all Rust code bases	2021-06-28 08:18:53 +02:00
Fabiano Fidêncio	a8bb8269fe	Merge pull request #2047 from Jakob-Naucke/s390x-skip-hotplug virtcontainers: Don't fail memory hotplug	2021-06-28 08:18:31 +02:00
Samuel Ortiz	f6294226e8	cargo: Use latest nix crate for all Rust code bases Our dependencies already bring several versions of nix; we should avoid adding even more fragmentation. Fixes #2114 Signed-off-by: Samuel Ortiz <samuel.e.ortiz@protonmail.com>	2021-06-25 03:38:37 +02:00
Eric Ernst	064dfb164b	runtime: Add "watchable-mounts" concept for inotify support To work around virtiofs' lack of inotify support, we'll special case particular mounts which are typically watched, and pass on information to the agent so it can ensure that the mount presented to the container is indeed watchable (see applicable agent commit). This commit will: - identify watchable mounts based on file count and mount source - create a watchable-bind storage object for these mounts to communicate intent to the agent - update the OCI spec to take the updated watchable mount source into account Unit tests added and updated for the newly introduced functionality/functions. Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-06-24 10:07:06 -07:00
Maksym Pavlenko	6a93e5d593	agent: Initial watchable-bind implementation Add support for watchable-bind storage driver. When watchable-bind storage is present, the agent will create a watchable path in a tmpfs, and poll the watchable-bind source to keep this new mount-point up to date. This poll will allow the agent to present the mount-point to the container, allowing for inotify usage by the container workload. If a mount becomes too large, either in file count or in overall size, we want to stop treating it as watchable, and instead just treat as a bindmount. This'll help avoid DoS by growing tmpfs too large, as well as limiting time spent scanning files. If a watchable-bind grows beyond 8 files (arbitrary sane number for certs/secrets) or 1MB (limit on ConfigMap size), we treat it as a normal bind. Fixes: #1879 Signed-off-by: Eric Ernst <eric_ernst@apple.com> Signed-off-by: Maksym Pavlenko <pavlenko.maksym@gmail.com> Signed-off-by: Samuel Ortiz <samuel.e.ortiz@protonmail.com> agent: watcher: SandboxStorages check loop cleanup	2021-06-24 10:07:06 -07:00
Eric Ernst	57c0cee0a5	runtime: Cleanup mountSharedDirMounts, shareFile parameters There's no reason to pass the paths; they can be determined when they are actually used. Let's make the return values more comparable to the other mount handling functions (we'll add storage object in future commit), and pass the mount maps as function parameters. ...No functional changes here... Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-06-24 10:07:06 -07:00
Chelsea Mafrica	ac0bd57748	Merge pull request #2003 from cmaf/fix-span-runHooks tracing: Make runHooks() span creation return context	2021-06-24 07:50:42 -07:00
Jakob Naucke	8310a3d70a	virtcontainers: Don't fail memory hotplug Architectures that do not support memory hotplugging will fail when memory limits are set because that amount is hotplugged. Issue a warning instead. The long-term solution is virtio-mem. Fixes: #1412 Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2021-06-24 10:58:06 +02:00
Fabiano Fidêncio	f5d9d89b73	Merge pull request #2089 from lifupan/fix_wait shimv2: fix the issue of leaking wait goroutines	2021-06-23 23:06:11 +02:00
Fabiano Fidêncio	c47a597568	Merge pull request #2097 from littlejawa/issue_crio_ctr_6_main runtime: report finish time in containers stats	2021-06-23 22:53:12 +02:00
Julien Ropé	6a1a051c65	runtime: report finish time in containers stats Make sure we report the exit time for the container when we answer a "Status" request. Fixes: #2096 Signed-off-by: Julien Ropé <jrope@redhat.com>	2021-06-23 17:36:47 +02:00
fupan.lfp	b3623a2c40	shimv2: fix the issue of leaking wait goroutines After creating a container/exec successfully, containerd waits on it immediately. If starting it then fails, nothing is ever sent to exitCh, so the wait goroutine blocks forever and never exits. Fixes: #2087 Signed-off-by: fupan.lfp <fupan.lfp@antgroup.com>	2021-06-23 19:29:26 +08:00
bin	2322f935c1	runtime: update default machine type to q35 The pc machine type has been removed, so the generated configuration should be updated too. Fixes: #2085 Signed-off-by: bin <bin@hyper.sh>	2021-06-23 17:08:44 +08:00
David Gibson	c0cc6d5978	Merge pull request #1954 from marcel-apf/remove-pc Remove the pc machine	2021-06-23 12:00:05 +10:00
Julio Montes	b9e611e363	Merge pull request #2066 from devimc/2021-06-17/fixTeardownPmem runtime: do not hot-remove PMEM devices	2021-06-22 09:06:59 -05:00
Marcel Apfelbaum	ac6b9c53d2	runtime: Hot-plug virtio-mem device on PCI bridge Currently the virtio-mem device is hotplugged on the root bus. This doesn't work for PCIe machines like q35. Hotplug the virtio-mem device into the pci bridge instead. Fixes #1953 Signed-off-by: Marcel Apfelbaum <marcel@redhat.com>	2021-06-22 12:34:48 +03:00
Marcel Apfelbaum	789a59549e	virtcontainers: Remove the pc machine Keeping around two different x86 machines has no added value and requires more tests and maintenance. Prefer the q35 machine since it has more features and drop the pc machine. Fixes #1953 Depends-on: github.com/kata-containers/tests#3586 Signed-off-by: Marcel Apfelbaum <marcel@redhat.com>	2021-06-22 11:54:07 +03:00
Manabu Sugimoto	caf5760c45	runtime: Update golang proto code We should update golang proto files. These changes are updated using libprotoc v3.6.1. Fixes: #2064 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2021-06-19 18:53:56 +09:00
Gabriela Cervantes	a9aa36cebc	docs: Update url for installation guides This PR updates the correct url for kata installation guides in kata 2.x Fixes #2069 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2021-06-18 16:48:07 +00:00
Julio Montes	ecdd137c6f	runtime: do not hot-remove PMEM devices PMEM devices cannot be hot-removed from a running VM. fixes #2018 Signed-off-by: Julio Montes <julio.montes@intel.com>	2021-06-18 09:02:03 -05:00
bin	000049b69e	agent: delete some lint attributes These lint attributes can be deleted to keep the code clean. Fixes: #2059 Signed-off-by: bin <bin@hyper.sh>	2021-06-18 16:08:25 +08:00
snir911	1faaf5f35d	Merge pull request #2000 from ManaSugi/update-mount-flags agent: Add some mount options and sort the options alphabetically	2021-06-17 11:53:11 +03:00
Tim Zhang	90029032b4	Merge pull request #2049 from liubin/2048/fix-log-field runtime: using detail propertites instead of function name in log field	2021-06-17 10:53:12 +08:00
bin	2022c64f94	runtime: using detail propertites instead of function name in log field To print the correct value of kernel parameters, the log field value should not be a function name. Also, since qemuArchBase doesn't contain the debug flag, the log contains both debug and non-debug parameters. Fixes: #2048 Signed-off-by: bin <bin@hyper.sh>	2021-06-17 00:17:16 +08:00
Julio Montes	361bee91f7	runtime/virtcontrainers: fix alignment structures fix alignment of qemuArchBase and HypervisorConfig structures Signed-off-by: Julio Montes <julio.montes@intel.com>	2021-06-16 07:16:49 -05:00
Julio Montes	7834f4127f	virtcontainers: change memory_offset to uint64 `memory_offset` is used to increase the maximum amount of memory supported in a VM. This offset is equal to the size of the NVDIMM/PMEM device that is hot added. In real-world workloads such devices are bigger than 4G, which is the current limit (uint32). fixes #2006 Signed-off-by: Julio Montes <julio.montes@intel.com>	2021-06-16 07:16:49 -05:00
Manabu Sugimoto	bd27f7bab5	agent: Sort PROPAGATION and OPTIONS alphabetically to scan easily It's hard to visually scan over the list currently. Therefore, we should sort the list alphabetically to scan easily. Fixes: #1999 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2021-06-16 17:23:05 +09:00
snir911	fb318532b9	Merge pull request #2044 from devimc/2021-06-15/skipTestIoCopy containerd-shim-v2: Skip TestIoCopy unit test	2021-06-16 09:59:35 +03:00
Chelsea Mafrica	6abe7caecb	Merge pull request #2039 from Amulyam24/pef-tests ppc64le: Adding test for appendProtectionDevice	2021-06-15 16:19:05 -07:00
Julio Montes	ad06eb90db	containerd-shim-v2: Skip TestIoCopy unit test The TestIoCopy unit test is failing randomly; skip it until we have a fix. fixes #2043 Signed-off-by: Julio Montes <julio.montes@intel.com>	2021-06-15 13:17:05 -05:00
Amulya Meka	ea9bb8e9ad	ppc64le: Adding test for appendProtectionDevice Fixes: #2038 Signed-off-by: Amulya Meka <amulmek1@in.ibm.com>	2021-06-15 10:23:38 +00:00
Tim Zhang	799cb27234	agent: Upgrade mio to v0.7.13 to fix epoll_fd leak problem Fixes: #2035 Fixes: tokio-rs/tokio/#3809 Signed-off-by: Tim Zhang <tim@hyper.sh>	2021-06-15 11:35:49 +08:00
Julio Montes	9d585935b5	Merge pull request #2020 from GabyCT/topic/fixreadruntime docs: Update README for runtime documentation	2021-06-14 10:37:20 -05:00
Fabiano Fidêncio	5a71786986	Merge pull request #1674 from jimcadden/stable-2.0-SEV Support SEV	2021-06-12 16:56:51 +02:00
Fabiano Fidêncio	be31694554	virtcontainers: Fix TestQemuAmd64AppendProtectionDevice() Since SEV support has been added, an implementation mistake was also added to TestQemuAmd64AppendProtectionDevice. appendProtectionDevice() will, as its name says, append the protection device to whatever was there previously. So, when SEV was added, we broke the comparison done for TDX as we didn't append the expected output for TDX with what we already had for SEV. This should be enough to get the tests passing. Signed-off-by: Fabiano Fidêncio <fidencio@redhat.com>	2021-06-12 08:56:15 -04:00
Fabiano Fidêncio	723c0ac4d5	Merge pull request #1832 from littlejawa/issue_1713 test: Add a unit test for ioCopy()	2021-06-12 00:34:28 +02:00
Gabriela Cervantes	240aae96dd	docs: Update README for runtime documentation This PR removes old links that were used in kata 1.x but are no longer valid for kata 2.x Fixes #2019 Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>	2021-06-11 16:01:20 -05:00
GabyCT	66e4c77a54	Merge pull request #1993 from likebreath/0610/clh_v16.0 versions: Upgrade to cloud-hypervisor v16.0	2021-06-11 15:11:11 -05:00
Chelsea Mafrica	cabddcc735	tracing: Make runHooks() span creation return context The call to Trace() in runHooks() does not return a context; fix this so that the subsequent calls to runHook() produce a properly ordered trace span. Fixes #2001 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2021-06-10 23:50:51 -07:00
Manabu Sugimoto	e544779c61	agent: Add some mount options Add the following mount options to catch up with the runtime spec - silent - loud - (no)acl - (no)iversion - (no)lazytime Fixes: #1999 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2021-06-11 15:08:46 +09:00
Fabiano Fidêncio	dc4307d3cc	Merge pull request #1974 from Jakob-Naucke/fix-cc-fedora-s390x Update CC=gcc setting for Fedora s390x	2021-06-11 00:31:51 +02:00
Fabiano Fidêncio	24bbcf58d3	Merge pull request #1981 from LiangZhou-CTY/patch-1 runtime: remove the call to storeSandbox at the end of createSandboxFromConfig	2021-06-11 00:30:39 +02:00
Fabiano Fidêncio	8239f6fc17	Merge pull request #1772 from Jakob-Naucke/sec-exec virtcontainers: Add support for Secure Execution	2021-06-11 00:02:01 +02:00
Bo Chen	85c40001da	versions: Upgrade to cloud-hypervisor v16.0 Highlights from the Cloud Hypervisor release v16.0: 1) Improved live migration support; 2) Improved `vhost-user` support; 3) ARM64 ACPI and UEFI support; 4) Bug fixes. Details can be found: https://github.com/cloud-hypervisor/cloud-hypervisor/releases/tag/v16.0 Note: The client code of cloud-hypervisor's OpenAPI is automatically generated by `openapi-generator` [1-2]. As the API changes do not impact usages in Kata, no additional changes in kata's runtime are needed to work with the current version of cloud-hypervisor. [1] https://github.com/OpenAPITools/openapi-generator [2] https://github.com/kata-containers/kata-containers/blob/main/src/runtime/virtcontainers/pkg/cloud-hypervisor/README.md Fixes: #1992 Signed-off-by: Bo Chen <chen.bo@intel.com>	2021-06-10 10:16:39 -07:00
Fupan Li	9d84272dd1	Merge pull request #1988 from ManaSugi/conform-to-latest-nix agent: Conform to the latest nix version (0.21.0)	2021-06-10 17:17:03 +08:00
Manabu Sugimoto	a1247bc0bb	agent: Conform to the latest nix version (0.21.0) We need to fix some of the agent's code to conform to the latest nix crate so that we can use the new features of nix. Fixes: #1987 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2021-06-10 16:58:51 +09:00
Liang Zhou	3130e66d33	runtime: remove storeSandbox at the end of createSandboxFromConfig Remove storeSandbox() at the end of createSandboxFromConfig(), because the call chain createSandboxFromConfig -> createContainers already calls storeSandbox(). This can improve the startup speed of the container, even if only a little. Fixes: #1980 Signed-off-by: Liang Zhou <zhoul110@chinatelecom.cn>	2021-06-10 11:56:40 +08:00
Tim Zhang	f26837a0f1	Merge pull request #1967 from liubin/fix/1956-add-more-traces-for-network runtime: add more traces for network	2021-06-10 10:56:42 +08:00
Jakob Naucke	7593ebf947	runtime: Use CC=gcc on Fedora s390x This was fixed for the Go agent back in https://github.com/kata-containers/osbuilder/issues/217, but is also required for the runtime. Fixes: #1973 Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2021-06-08 16:36:24 +02:00
Fabiano Fidêncio	208ab60e1e	Merge pull request #1863 from zhsj/drop-covertool runtime: remove covertool from cli test	2021-06-07 16:21:51 +02:00
Fabiano Fidêncio	51ac042cad	Merge pull request #939 from keloyang/detach factory: Use lazy unmount	2021-06-07 13:26:16 +02:00
Jakob Naucke	c0c05c73e1	virtcontainers: Add support for Secure Execution Secure Execution is a confidential computing technology on s390x (IBM Z & LinuxONE). Enable the corresponding virtualization technology in QEMU (where it is referred to as "Protected Virtualization"). - Introduce enableProtection and appendProtectionDevice functions for QEMU s390x. - Introduce CheckCmdline to check for "prot_virt=1" being present on the kernel command line. - Introduce CPUFacilities and avilableGuestProtection for hypervisor s390x to check for CPU support. Fixes: #1771 Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2021-06-07 10:50:33 +02:00
Jakob Naucke	78f21710e3	virtcontainers/s390x: Put consts into one block Previously, all consts were in single lines in virtcontainers/qemu_s390x.go. Put them into a const block. Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2021-06-07 10:50:30 +02:00
bin	784025bb08	runtime: add more traces for network Add traces for all the endpoint types and the main interface functions. Record errors for some traces. Fixes: #1956 Signed-off-by: bin <bin@hyper.sh>	2021-06-07 11:38:40 +08:00
Chelsea Mafrica	60806ce3c8	Merge pull request #1957 from cmaf/tracing-attributes-sandboxID-1 Add sandbox and container ID to trace spans	2021-06-04 09:10:05 -07:00
Tim Zhang	1255b83427	Merge pull request #1955 from Tim-Zhang/fix-fd-leak-of-netlink agent: Fix fd leak caused by netlink	2021-06-03 20:15:15 +08:00
Tim Zhang	9e3349c18e	agent: Fix fd leak caused by netlink See also: little-dude/netlink#165 Fixes: #1952 Because the author of netlink has no time to maintain the crate (https://github.com/little-dude/netlink/issues/161), we need to switch the dependency to GitHub temporarily. Signed-off-by: Tim Zhang <tim@hyper.sh>	2021-06-03 17:23:37 +08:00
Chelsea Mafrica	3d0e0b2786	tracing: Add network model to span Trace spans erroneously set the network model to default in all cases. Add function to return network model string and use it to set attribute in spans. Fixes #1878 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2021-06-02 21:53:54 -07:00
Chelsea Mafrica	8ca0207281	tracing: Add sandbox and container ID to trace spans Add sandbox, container, and hypervisor IDs to trace spans. Note that some spans in sandbox.go are created with a trace() call from api.go. These spans have additional attributes set after span creation to overwrite the api attributes. Fixes #1878 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2021-06-02 21:53:54 -07:00
Bin Liu	1673110ee9	Merge pull request #1930 from jcvenegas/kata-moinitor-export-virtiofsd metrics: Add virtiofsd exporter	2021-06-03 10:38:55 +08:00
Chelsea Mafrica	33c12b6d08	Merge pull request #1929 from jodh-intel/add-agent-tracing tracing: Add basic VSOCK tracing	2021-06-02 11:45:41 -07:00
Sandeep Gupta	b26d5b1d08	virtcontainers: Support SEV fixes #1869 Signed-off-by: Jim Cadden <jcadden@ibm.com>	2021-06-02 14:32:50 -04:00
James O. D. Hunt	a9a0eccf33	tracing: Add basic VSOCK tracing Implement an openTelemetry custom exporter that sends trace spans to a VSOCK socket. A VSOCK-to-span converter (such as the Kata trace forwarder) needs to be running on the host to allow systems like Jaeger to capture the trace spans. By default, tracing is not enabled (meaning a NOP tracer is used). To activate tracing, set the `agent.kata.enable_tracing=true` in the configuration file. The type of tracing this change introduces is "static isolated" tracing. See [1] for further details. > Note: > > This change only provides the foundational changes for agent > tracing work. The feature is _not_ yet complete since it does > not yet show the correct trace hierarchy. Fixes: #60. [1] - https://github.com/kata-containers/agent/blob/master/TRACING.md Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-06-02 18:00:05 +01:00
Jim Cadden	81c6e4ca9f	runtime/vendor: add github.com/intel-go/cpuid Fixes: #1869 Signed-off-by: Jim Cadden <jcadden@ibm.com>	2021-06-02 12:59:04 -04:00
Carlos Venegas	2234b73090	metrics: Add virtiofsd exporter Export proc stats for virtiofsd. This commit only adds for hypervisors that have support for it. - qemu - cloud-hypervisor Fixes: #1926 Signed-off-by: Carlos Venegas <jos.c.venegas.munoz@intel.com>	2021-06-02 16:06:00 +00:00
Tim Zhang	9bf781d704	agent: Upgrade tokio-vsock to fix fd leak of vsock socket Fixes: #1950 The further information: rust-vsock/vsock-rs#15 Signed-off-by: Tim Zhang <tim@hyper.sh>	2021-06-02 16:03:09 +08:00
Tim Zhang	476ec9bd86	Merge pull request #1948 from liubin/fix/1947-fix-comments runtime: fix some comments and logs	2021-06-02 10:52:01 +08:00
Pradipta Banerjee	604e3a6fa1	Merge pull request #1882 from Amulyam24/pef runtime: Add support for PEF	2021-06-01 12:56:53 +05:30
Peng Tao	41e04495f4	Merge pull request #1943 from bergwolf/cleanup2 cleanup TODOs in runtime	2021-06-01 14:16:46 +08:00
Chelsea Mafrica	bcde703b36	Merge pull request #1859 from cmaf/tracing-attributes-1 tracing: Make runtime span attributes more consistent	2021-05-31 21:57:58 -07:00
bin	b68334a1a8	runtime: fix some comments and logs This commit fixes some comments and logs, and adds some logs for debugging. Fixes: #1947 Signed-off-by: bin <bin@hyper.sh>	2021-06-01 09:04:18 +08:00
Bin Liu	d1ac0a1a2c	Merge pull request #1938 from liubin/fix/1933-virtiofsd-refactor virtiofsd: refactor qemu.go to use code in virtiofsd.go	2021-06-01 08:32:56 +08:00
Fabiano Fidêncio	d7b6e3e178	Merge pull request #1942 from bergwolf/cleanup runtime: remove unused doc.go	2021-05-31 22:41:24 +02:00
Peng Tao	1f5b229bef	runtime: remove FIXME in SandboxState about CgroupPath It is in real life usage as we put non constrained sandbox processes (like shim) in a separate cgroup path. Fixes: #1944 Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2021-05-29 13:17:14 +08:00
Peng Tao	fee0004ad4	runtime: remove TODO about hot add memory in qemu.go Already addressed by https://github.com/kata-containers/runtime/pull/786 Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2021-05-29 11:15:50 +08:00
Peng Tao	2e29ef9cab	runtime: remove TODO comment from StatusContainer It is no longer valid as containerd already doesn't treat container pid as host process pid. Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2021-05-29 11:10:32 +08:00
bin	72cd8f5ef6	virtiofsd: refactor qemu.go to use code in virtiofsd.go CloudHypervisor is using virtiofsd.go to manage the virtiofsd process, but qemu has its own code in qemu.go. This commit lets qemu reuse the code in virtiofsd.go to reduce duplication and improve maintainability. Fixes: #1933 Signed-off-by: bin <bin@hyper.sh>	2021-05-29 11:00:05 +08:00
Peng Tao	0b22c48d2a	runtime: remove unused doc.go It doesn't even contain any actual code there. Fixes: #1941 Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2021-05-29 10:25:29 +08:00
Peng Tao	c455d84571	Merge pull request #1918 from lifupan/main cgroup: fix the issue of set mem.limit and mem.swap	2021-05-29 10:05:44 +08:00
Peng Tao	fd6d32ee42	Merge pull request #1939 from lifupan/fix_epipe agent: re-enable the standard SIGPIPE behavior	2021-05-29 10:05:09 +08:00
Fabiano Fidêncio	bcf78a18ae	Merge pull request #1932 from liubin/fix/1931-virtiofsd-fd-leak-and-return-right-pid virtiofsd: Fix file descriptors leak and return correct PID	2021-05-28 12:29:56 +02:00
fupan.lfp	30f4834c5b	cgroup: fix the issue of set mem.limit and mem.swap When updating the memory limit, we should adapt the write sequence for memory and swap memory, so the update won't fail because the new value and the old value don't fit the kernel's validation. Fixes: #1917 Signed-off-by: fupan.lfp <fupan.lfp@antgroup.com>	2021-05-28 15:44:14 +08:00
fupan.lfp	0ae364c8eb	agent: re-enable the standard SIGPIPE behavior The Rust standard library suppresses the default SIGPIPE behavior, see https://github.com/rust-lang/rust/pull/13158. Since the parent's signal handler is inherited by its child process, we should re-enable the standard SIGPIPE behavior as a workaround. Fixes: #1887 Signed-off-by: fupan.lfp <fupan.lfp@antgroup.com>	2021-05-28 15:25:05 +08:00
Chelsea Mafrica	05a46fede0	tracing: Make runtime span attributes more consistent Span attributes (tags) are not consistent in runtime tracing, so designate and use core attributes such as source, package, subsystem, and type as span metadata for more understandable output. Use WithAttributes() during span creation to reduce calls to SetAttributes(). Modify Trace() in katautils to accept slice of attributes so multiple functions using different attributes can use it. Fixes #1852 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2021-05-27 10:07:11 -07:00
bin	727bfc4556	runtime: add cgroup and SandboxCgroupOnly check for check sub-command In the kata-runtime check sub-command, check cgroups and SandboxCgroupOnly, and show a message if SandboxCgroupOnly is not set to true and cgroup v2 is used. Fixes: #1927 Signed-off-by: bin <bin@hyper.sh>	2021-05-27 21:19:12 +08:00
James O. D. Hunt	b25ad1ab2c	tracing: Make trace-forwarder async The tracing crates are now async, so update the trace forwarder to use the new API. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-05-27 10:48:05 +01:00
James O. D. Hunt	45f02227b2	tracing: Add trace points Use the tracing crate to create automatic trace spans for the _majority_ of top-level modules. Note that not all functions in the top-level modules can be traced: - Some functions cannot be traced due to the requirement that all function parameters implement the `Debug` trait. In some cases (such as `netlink.rs`), objects are being passed that are defined in different crates and which do not implement `Debug`. - Some functions may never return (`signal.rs`). - Some functions are inlined. - Some functions are very simple getter/setter functions. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-05-27 10:42:58 +01:00
bin	773deca2f6	virtiofsd: Fix file descriptors leak and return correct PID This commit will fix two problems: - Virtiofsd process ID returned to the caller will always be 0, the pid var is never being assigned a value. - Socket listen fd may leak in case of failure of starting virtiofsd process. This is a port of `be9ca0d58b` Fixes: #1931 Signed-off-by: bin <bin@hyper.sh>	2021-05-27 16:51:41 +08:00
Amulyam24	37a426b4c6	runtime: Add support for PEF Protected Execution Facility(PEF) is the confidential computing technology on ppc64le. This PR adds the support for it in Kata. Also re-vendor govmm for the latest changes. Fixes: #1881 Signed-off-by: Amulyam24 <amulmek1@in.ibm.com>	2021-05-25 14:29:42 +00:00
Fabiano Fidêncio	c3f6c88668	Merge pull request #1915 from quanweiZhou/fix_start_container_failed_when_drop_all_caps agent: fix start container failed when dropping all capabilities	2021-05-24 14:13:52 +02:00
Tim Zhang	005e5ddedc	Merge pull request #1905 from ManaSugi/del_underscore_var agent: Remove unnecessary underscore(_) variables	2021-05-24 17:39:48 +08:00
quanweiZhou	3e4ebe10ac	agent: fix start container failed when dropping all capabilities When starting a container and dropping all capabilities, the init child process has no permission to read the exec.fifo file because the parent set the file mode 0o622. So change the exec.fifo file mode to 0o644. fixes #1913 Signed-off-by: quanweiZhou <quanweiZhou@linux.alibaba.com>	2021-05-22 17:33:49 +08:00
Eric Ernst	7f1030d303	sandbox-bindmount: persist mount information Without this, if the shim dies, we will not have a reliable way to identify what mounts should be cleaned up if `containerd-shim-kata-v2 cleanup` is called for the sandbox. Before this, if you `ctr run` with a sandbox bindmount defined and SIGKILL the containerd-shim-kata-v2, you'll notice the sandbox bindmount left on host. With this change, the shim is able to get the sandbox bindmount information from disk and do the appropriate cleanup. Fixes #1896 Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-05-21 12:54:35 -07:00
Eric Ernst	089a7484e1	sandbox: Cleanup if failure to setup sandbox-bindmount occurs If for any reason there's an error when trying to setup the sandbox bindmounts, make sure we roll back any mounts already created when setting up the sandbox. Without this, we'd leave shared directory mount and potentially sandbox-bindmounts on the host. Fixes: #1895 Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-05-21 12:54:35 -07:00
Manabu Sugimoto	20a382c158	agent: Remove unnecessary underscore(_) variables We should remove underscore(_) prefixed variables when ? operator is used. Fixes: #1903 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2021-05-21 17:45:34 +09:00
Shukui Yang	bd0cde40e7	factory: Use lazy unmount Consider the following case: 1. Start a kata container with the factory feature; this needs the kata-runtime config to enable factory and use initrd as the base image. 2. Start a kata container. 3. cd /root; cd /run/vc/vm/template, which makes /run/vc/vm/template busy. 4. Destroy the vm template with kata-runtime factory destroy, and check the template mountpoint. A new template mountpoint is added every time we repeat the above steps. [root@centos1 template]# mount |grep template [root@centos1 template]# docker run -ti --rm --runtime untrusted-runtime --net none busybox echo [root@centos1 template]# cd /root; cd /run/vc/vm/template/ [root@centos1 template]# /kata/bin/kata-runtime factory destroy vm factory destroyed [root@centos1 template]# mount |grep template tmpfs on /run/vc/vm/template type tmpfs (rw,nosuid,nodev,relatime,seclabel,size=2105344k) [root@centos1 template]# docker run -ti --rm --runtime untrusted-runtime --net none busybox echo [root@centos1 template]# cd /root; cd /run/vc/vm/template/ [root@centos1 template]# /kata/bin/kata-runtime factory destroy vm factory destroyed [root@centos1 template]# mount |grep template tmpfs on /run/vc/vm/template type tmpfs (rw,nosuid,nodev,relatime,seclabel,size=2105344k) tmpfs on /run/vc/vm/template type tmpfs (rw,nosuid,nodev,relatime,seclabel,size=2105344k) Fixes: #938 Signed-off-by: Shukui Yang <keloyangsk@gmail.com>	2021-05-20 16:18:28 +08:00
Fabiano Fidêncio	f52468bea7	agent/agent-ctl: Replace prctl crate by the capctl one While evaluating the possibility of having kata-agent statically linked to the GNU libc, we've ended up facing some issues with prctl. When debugging the issues, we figured out that the crate hasn't been maintained since 2015 and that the capctl one is a good 1:1 replacement for what we need. Fixes: #1844 Signed-off-by: Fabiano Fidêncio <fidencio@redhat.com>	2021-05-19 20:16:26 +02:00
Fabiano Fidêncio	8aefc79314	agent: Perform a `cargo update` While in the beginning of the development cycle, let's perform a `cargo update`. Fixes: #1883 Signed-off-by: Fabiano Fidêncio <fidencio@redhat.com>	2021-05-19 09:43:17 +02:00
Shengjing Zhu	1b60705646	runtime: remove covertool from cli test covertool has had no activity since 2018 and is not compatible with go1.16: ../vendor/github.com/dlespiau/covertool/pkg/cover/cover.go:76:29: cannot use f (type dummyTestDeps) as type testing.testDeps in argument to testing.MainStart: dummyTestDeps does not implement testing.testDeps (missing SetPanicOnExit0 method) Fixes: #1862 Signed-off-by: Shengjing Zhu <zhsj@debian.org>	2021-05-16 03:06:06 +08:00
Peng Tao	f6c5f7c0ef	Merge pull request #1844 from lifupan/main rustjail: separated the propagation flags from mount flags	2021-05-14 10:25:35 +08:00
Peng Tao	35151f1786	runtime: sandbox delete should succeed after verifying sandbox state Otherwise we might block delete and create orphan containers. Fixes: #1039 Signed-off-by: Peng Tao <bergwolf@hyper.sh> Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-05-13 14:05:49 -07:00
fupan.lfp	e5fe572f51	rustjail: separated the propagation flags from mount flags Since the propagation flags can't be combined with the standard mount flags and should be used with remount, it's better to split them from the standard mount flags. Fixes: #1699 Signed-off-by: fupan.lfp <fupan.lfp@antgroup.com>	2021-05-13 23:53:52 +08:00
Julien Ropé	a918c46fb6	test: Add a unit test for ioCopy() Following the fix for #1713, adding a unit test for ioCopy() that verifies that data is properly copied from source to destination whatever the order in which the pipes are closed. Fixes #1831 Signed-off-by: Julien Ropé <jrope@redhat.com>	2021-05-12 11:30:45 +02:00
Bin Liu	cc4748fa64	Merge pull request #1829 from Tim-Zhang/fix-reap agent: avoid reaping the exit signal of execute_hook in the reaper	2021-05-12 17:24:25 +08:00
Bin Liu	15778a17e5	Merge pull request #1828 from Tim-Zhang/move-dep agent: move the dependency tempfile to the dev-dependencies section	2021-05-12 17:21:50 +08:00
Tim Zhang	a5bb383cf3	agent: avoid reaping the exit signal of execute_hook in the reaper Fixes: #1826 Signed-off-by: Tim Zhang <tim@hyper.sh>	2021-05-12 14:40:20 +08:00
Tim Zhang	ce7a5ba22e	agent: move the dependency tempfile to the dev-dependencies section The tempfile is only used by tests. Fixes: #1827 Signed-off-by: Tim Zhang <tim@hyper.sh>	2021-05-12 14:39:58 +08:00
Fabiano Fidêncio	ac61e60492	Merge pull request #1790 from snir911/configure_timeout runtime: make dialing timeout configurable	2021-05-11 16:52:05 +02:00
Bin Liu	bffb099d99	Merge pull request #1816 from egernst/get-sandbox-metrics-cli Get sandbox metrics cli	2021-05-11 13:10:30 +08:00
Samuel Ortiz	2c4e4ca1ac	Merge pull request #1590 from devimc/2021-02-02/ConfidentialComputing Support TDx	2021-05-10 22:19:40 +02:00
Eric Ernst	8068a4692f	kata-runtime: add `metrics` command For easier debug, let's add subcommand to kata-runtime for gathering metrics associated with a given sandbox. kata-runtime metrics --sandbox-id foobar Fixes: #1815 Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-05-10 10:45:10 -07:00
Eric Ernst	3787306107	kata-monitor: export get stats for sandbox Gathering stats for a given sandbox is pretty useful; let's export a function from katamonitor pkg to do this. Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-05-10 08:53:56 -07:00
Snir Sheriber	01b56d6cbf	runtime: make dialing timeout configurable Allow setting the dialing timeout in configuration.toml; the default is 30s. Fixes: #1789 Signed-off-by: Snir Sheriber <ssheribe@redhat.com>	2021-05-10 16:39:37 +03:00
Eric Ernst	3caed6f88d	runtime: shim: dedup client, socket addr code (1) Add an accessor function, SocketAddress, to the shim-v2 code for determining the shim's abstract domain socket address, given the sandbox ID. (2) In kata monitor, create a function, BuildShimClient, for obtaining the appropriate http.Client for communicating with the shim's monitoring endpoint. (3) Update the kata CLI and kata-monitor code to make use of these. (4) Migrate some kata monitor methods to be functions, in order to ease future reuse. (5) drop unused namespace from functions where it is no longer needed. Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-05-07 15:20:37 -07:00
Fabiano Fidêncio	4bc006c8a4	runtime: Shorten the shim-monitor path Instead of having something like "/containerd-shim/$namespace/$sandboxID/shim-monitor.sock", let's change the approach to: * create the file in a more neutral location "/run/vc", instead of "/containerd-shim"; * drop the namespace, as the sandboxID should be unique; * remove ".sock" from the socket name. This will result in a name that looks like: "/run/vc/$sandboxID/shim-monitor" Fixes: #497 Signed-off-by: Fabiano Fidêncio <fidencio@redhat.com>	2021-05-07 14:20:35 -07:00
Tim Zhang	1bfc426a2b	Merge pull request #1784 from liubin/fix/1783-delete-un-used-fn agent: delete code which is no longer used	2021-05-07 14:25:26 +08:00
Fabiano Fidêncio	2436839fa7	Merge pull request #1749 from liubin/fix/1748-delete-tracing-in-cli cli: delete tracing code for kata-runtime binary	2021-05-07 08:17:16 +02:00
Tim Zhang	75648b0770	Merge pull request #1745 from liubin/fix/1744-add-doc-for-enable_pprof docs: add per-Pod Kata configurations for `enable_pprof`	2021-05-07 13:45:34 +08:00
Fupan Li	70e1d44262	Merge pull request #1800 from teawater/fix_vm Fix issue of virtio-mem	2021-05-07 13:08:12 +08:00
Fupan Li	487e165093	Merge pull request #1778 from snir911/patch_nofile Set fixed NOFILE limit value for kata-agent	2021-05-07 13:06:10 +08:00
Chelsea Mafrica	3e8137399c	Merge pull request #1805 from liubin/fix/1804-select-sandbox-ctx runtime: use s.ctx instead ctx for checking cancellation	2021-05-06 09:51:47 -07:00
Chelsea Mafrica	917665ab6d	Merge pull request #1751 from liubin/fix/1750-fix-comments runtime: fix some comments	2021-05-06 08:42:15 -07:00
Julio Montes	4f61f4b490	virtcontainers: Support TDX Add support for Intel TDX confidential guests fixes #1332 Signed-off-by: Julio Montes <julio.montes@intel.com>	2021-05-06 10:09:05 -05:00
Julio Montes	0affe8860d	virtcontainers: define confidential guest framework Define the structure and functions needed to support confidential guests, this commit doesn't add support for any specific technology, support for TDX, SEV, PEF and others will be added in following commits. Signed-off-by: Julio Montes <julio.montes@intel.com>	2021-05-06 10:09:05 -05:00
Julio Montes	539afba03d	runtime: define config options to enable confidential computing Define config options to enable or disable confidential computing and its features, for example: * Image service offloading * Image decryption keys Signed-off-by: Julio Montes <julio.montes@intel.com>	2021-05-06 10:09:05 -05:00
bin	79831fafaf	runtime: use s.ctx instead ctx for checking cancellation s.ctx should be used for checking cancellation, and the local ctx is used for tracing. Fixes: #1804 Signed-off-by: bin <bin@hyper.sh>	2021-05-06 17:22:53 +08:00
bin	f6d5fbf9ba	runtime: fix some comments This commit includes two types of fixes for comments in src/runtime/containerd-shim-v2/start.go. - Update the comment for the call to watchOOMEvents. - Fix comments without leading spaces. Fixes: #1750 Signed-off-by: bin <bin@hyper.sh>	2021-05-06 17:12:52 +08:00
Hui Zhu	7f7c3fc8ec	qemu.go: qemu: resizeMemory: Fix virtio-mem resize overflow issue This commit changes sizeByte from uint32 to uint64 to fix an overflow issue. Fixes: #1796 Signed-off-by: Hui Zhu <teawater@antfin.com>	2021-05-06 14:13:50 +08:00
Hui Zhu	c9053ea3fb	qemu.go: qemu: setupVirtioMem: let sizeMB be multiple of 2MiB Got: FATA[0000] run pod sandbox: rpc error: code = Unknown desc = failed to create containerd task: Add 189759MB virtio-mem-pci fail QMP command failed: backend memory size must be multiple of 0x200000: unknown This commit makes sizeMB a multiple of 2MiB to fix the issue. Fixes: #1796 Signed-off-by: Hui Zhu <teawater@antfin.com>	2021-05-06 14:13:48 +08:00
Snir Sheriber	a188577ebf	agent: Set fixed NOFILE limit value for kata-agent Some applications may fail if the NOFILE limit is set to unlimited. Although in some environments this value is explicitly overridden, let's set it to a saner value in case it isn't. Fixes #1715 Signed-off-by: Snir Sheriber <ssheribe@redhat.com>	2021-05-04 15:06:11 +03:00
Julio Montes	88cf3db601	runtime: implement CPUFlags function `CPUFlags` returns a map with all the CPU flags. These CPU flags may help us identify whether a system supports confidential computing or not. Signed-off-by: Julio Montes <julio.montes@intel.com>	2021-05-03 09:33:13 -05:00
Eric Ernst	1c0d3afd55	Merge pull request #1754 from Jakob-Naucke/fix-virtiofs-s390x virtcontainers: Fix virtio-fs on s390x	2021-04-30 09:28:12 -07:00
Fabiano Fidêncio	2e0221125a	Merge pull request #1780 from likebreath/0429/clh_v15.0 versions: Upgrade to cloud-hypervisor v15.0	2021-04-30 18:20:36 +02:00
Fabiano Fidêncio	29fdfcfebc	Merge pull request #1725 from liubin/liubin/1724-not-return-if-get-api-socket-failed clh: return error if apiSocketPath failed	2021-04-30 18:16:45 +02:00
Fabiano Fidêncio	dc23adcd50	Merge pull request #1743 from alrs/fix-runtime-err runtime: fix dropped error	2021-04-30 18:15:22 +02:00
bin	d601ae3446	agent: delete not used comments Delete comments that are meaningless or confusing. Fixes: #1783 Signed-off-by: bin <bin@hyper.sh>	2021-04-30 19:37:55 +08:00
bin	6038da1903	agent: delete rustjail/src/configs directory This directory is not used anymore. Fixes: #1783 Signed-off-by: bin <bin@hyper.sh>	2021-04-30 19:18:03 +08:00
bin	84ee8aa8b2	agent: delete not used functions In file src/agent/rustjail/src/validator.rs, these two functions are not used: - get_namespace_path - check_host_ns Fixes: #1783 Signed-off-by: bin <bin@hyper.sh>	2021-04-30 19:17:41 +08:00
Fabiano Fidêncio	bd486f7bf3	Merge pull request #1720 from ManaSugi/update-seccomp-spec agent: Update seccomp configuration for errnoRet and flags	2021-04-30 10:52:42 +02:00
Bo Chen	1ca6bedf3e	versions: Upgrade to cloud-hypervisor v15.0 Quotes from the cloud-hypervisor release v15.0: This release is the first in a new version numbering scheme to represent that we believe Cloud Hypervisor is maturing and entering a period of stability. With this new release we are beginning our new stability guarantees. Other highlights from the latest release include: 1) Network device rate limiting; 2) Support for runtime control of `virtio-net` guest offload; 3) `--api-socket` supports file descriptor parameter; 4) Bug fixes on `virtio-pmem`, PCI BARs alignment, `virtio-net`, etc.; 5) Deprecation of the "LinuxBoot" protocol for ELF and bzImage in the coming release. Details can be found: https://github.com/cloud-hypervisor/cloud-hypervisor/releases/tag/v15.0 Note: The client code of cloud-hypervisor's OpenAPI is automatically generated by `openapi-generator` [1-2]. As the API changes do not impact usages in Kata, no additional changes in kata's runtime are needed to work with the current version of cloud-hypervisor. [1] https://github.com/OpenAPITools/openapi-generator [2] https://github.com/kata-containers/kata-containers/blob/main/src/runtime/virtcontainers/pkg/cloud-hypervisor/README.md Fixes: #1779 Signed-off-by: Bo Chen <chen.bo@intel.com>	2021-04-29 10:56:22 -07:00
Jakob Naucke	3ee61776d6	virtcontainers: Enable virtio-fs on s390x Allow and configure vhost-user-fs devices (virtio-fs) on s390x. As a consequence, appendVhostUserDevice now takes a context, which affects its signature for other architectures. Fixes: #1753 Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2021-04-29 09:54:08 +02:00
Jakob Naucke	8385ff9554	runtime: Re-vendor GoVMM for vhost-user-fs-ccw devno support shortlog: `f0e9a35` Merge pull request #171 from Jakob-Naucke/fix-virtiofs-s390x `abd3c7e` qemu: VhostUserDevice CCW device numbers `3eaeda7` qemu: Refactor vhostuserDev.QemuParams `7183b12` Merge pull request #166 from kata-containers/egernst-patch-1 `092293f` Merge pull request #169 from QiuMike/master `511cf58` Fix qemu commandline issue with empty romfile `8ba62b0` Merge pull request #164 from devimc/2021-03-30/tdxSupport `b3eac95` qmp: remove frequent, chatty log `3141894` qemu: add support for tdx-guest object Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2021-04-29 09:53:54 +02:00
Jakob Naucke	adba4532a4	virtcontainers: Revert "virtcontainers: Allow s390x appendVhostUserDevice" This reverts commit `7f60911333`. The patch allowed other vhost-user devices besides FS, which are not supported on s390x, and failed to attach a CCW device number, which makes it impossible to use more devices after vhost-user-fs-ccw. Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2021-04-29 09:43:33 +02:00
Eric Ernst	b20dff8027	Merge pull request #1759 from kata-containers/fix_update Fix the issue that sandbox size is not right after update	2021-04-28 14:48:24 -07:00
Eric Ernst	23a8179184	Merge pull request #1756 from egernst/leave-no-virtiofs-behind qemu: kill virtiofsd if failure to start VMM	2021-04-27 17:16:33 -07:00
Wainer dos Santos Moschetta	3677640811	runtime/virtcontainers: Fix typo on qmp error msg "negotiate" was misspelled on qemu's qmp error message. Fixes #1764 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2021-04-27 11:52:42 -04:00
Hui Zhu	0787ea8073	cgroupsCreate: not set resources to c.config.Resources cgroupsCreate only keeps the CPU resources information but not the others. Assigning its result to c.config.Resources clears most of the container's resources. This commit removes that assignment to fix the issue. Fixes: #1758 Signed-off-by: Hui Zhu <teawater@antfin.com>	2021-04-27 16:44:30 +08:00
Hui Zhu	831224aa22	Sandbox: Fix ContainerConfig ptr in CreateContainer and createContainers The pointer passed to newContainer in CreateContainer and createContainers is not the pointer to the address in s.config.Containers. This commit fixes this issue. Fixes: #1758 Signed-off-by: Hui Zhu <teawater@antfin.com>	2021-04-27 16:44:22 +08:00
Eric Ernst	a57c8ab1be	qemu: kill virtiofsd if failure to start VMM If the QEMU VMM fails to launch, we currently fail to kill virtiofsd, resulting in leftover processes running on the host. Let's make sure we kill these, and explicitly cleanup the virtiofs socket on the filesystem. Ideally we'll migrate QEMU to utilize the same virtiofsd interface that CLH uses, but let's fix this bug as a first step. Fixes: #1755 Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-04-26 21:07:20 -07:00
bin	95e54e3f48	docs: add per-Pod Kata configurations for enable_pprof Now enabling enable_pprof for individual pods is supported, but not documented. This commit will add per-Pod Kata configurations for `enable_pprof` in file `docs/how-to/how-to-set-sandbox-config-kata.md` Fixes: #1744 Signed-off-by: bin <bin@hyper.sh>	2021-04-26 22:20:49 +08:00
Fabiano Fidêncio	fb30c58847	Merge pull request #1747 from liubin/fix/1746-deleted-not-used-files cli: delete not used files	2021-04-26 09:57:19 +02:00
bin	13c23fec11	cli: delete tracing code for kata-runtime binary There are no pod/container operations in kata-runtime binary, tracing in this package is meaningless. Fixes: #1748 Signed-off-by: bin <bin@hyper.sh>	2021-04-26 11:11:22 +08:00
bin	ff2b9e5478	cli: delete not used files Delete two files that are not used anymore: - src/runtime/cli/console.go - src/runtime/cli/console_test.go Fixes: #1746 Signed-off-by: bin <bin@hyper.sh>	2021-04-25 17:46:56 +08:00
bin	0d0a520d42	clh: return error if apiSocketPath failed If apiSocketPath fails, return the error instead of nil. Fixes: #1724 Signed-off-by: bin <bin@hyper.sh>	2021-04-25 10:25:42 +08:00
Lars Lehtonen	fc6bb01a7f	runtime: fix dropped error Fixes: #212 Signed-off-by: Lars Lehtonen <lars.lehtonen@gmail.com>	2021-04-24 14:18:50 -07:00
Chelsea Mafrica	8587e3a00b	Merge pull request #1732 from liubin/fix/1731-delete-builtin-parameter runtime: delete not used function parameter builtIn	2021-04-23 18:30:55 -07:00
Fabiano Fidêncio	fe2311cd4c	Merge pull request #1739 from pmores/virtiofsd-extra-args-annotation-handling add io.katacontainers.config.hypervisor.virtio_fs_extra_args handling	2021-04-23 23:22:01 +02:00
Pavel Mores	30ff6ee88b	runtime: handle io.katacontainers.config.hypervisor.virtio_fs_extra_args Users can specify extra arguments for virtiofsd in a pod spec using the io.katacontainers.config.hypervisor.virtio_fs_extra_args annotation. However, this annotation was ignored so far by the runtime. This commit fixes the issue by processing the annotation value (if present) and translating it to the corresponding hypervisor configuration item. Fixes #1523 Signed-off-by: Pavel Mores <pmores@redhat.com>	2021-04-23 21:09:28 +02:00
Fabiano Fidêncio	5eaf7a9982	Merge pull request #1049 from c3d/feature/1043-entropy-source-annotation Entropy source annotation	2021-04-23 20:16:11 +02:00
bin	677f0d9904	runtime: delete not used function parameter builtIn The parameter builtIn is not used in the function updateRuntimeConfigAgent; delete it from the updateRuntimeConfigAgent and LoadConfiguration function signatures. Fixes: #1731 Signed-off-by: bin <bin@hyper.sh>	2021-04-23 17:42:42 +08:00
Fabiano Fidêncio	a4fffa1f22	Merge pull request #1714 from littlejawa/issue_1713 runtime: Fix stdout/stderr output from container being truncated	2021-04-22 23:00:47 +02:00
Fabiano Fidêncio	b41d9a99b4	Merge pull request #1703 from lifupan/main_fix fix the issue of missing set fsGroup for EphemeralStorage	2021-04-22 20:29:36 +02:00
Christophe de Dinechin	dcb9f40394	config: Protect annotation for entropy_source It would be undesirable to be given an annotation like "/dev/null". Filter out bad annotation values. Fixes: #1043 Suggested-by: James O. D. Hunt <james.o.hunt@intel.com> Signed-off-by: Christophe de Dinechin <dinechin@redhat.com>	2021-04-22 15:26:40 +02:00
fupan.lfp	f4c26aad00	agent: fix the issue of missing set fsGroup for EphemeralStorage For k8s emptyDir volume, a specific fsGroup would be set for it, thus guest should get this fsGroup from runtime and set it properly on the EphemeralStorage volume in guest. Fixes: #1580 Signed-off-by: fupan.lfp <fupan.lfp@antfin.com>	2021-04-22 21:09:02 +08:00
fupan.lfp	628d55bf4c	kata-agent: fix the issue of fsGroup missing For k8s emptyDir volume, a specific fsGroup would be set for it, thus runtime should pass this fsGroup for EphemeralStorage to guest and set it properly on the emptyDir volume in guest. Fixes: #1580 Signed-off-by: fupan.lfp <fupan.lfp@antfin.com>	2021-04-22 21:08:52 +08:00
David Gibson	e91591fff2	Merge pull request #1701 from dgibson/clippy Assorted clippy fixes for Rust agent	2021-04-22 20:36:49 +10:00
Bin Liu	db4fbac1d3	Merge pull request #1722 from Tim-Zhang/use-channle-for-process-exit agent: use channel instead of pipe(2) to send exit signal of process	2021-04-22 15:27:36 +08:00
David Gibson	0405beb2d8	agent: Remove unused Default implementation for NamespaceType Currently we implement the Default trait for NamespaceType. It doesn't really make sense to have a default for this type though - you really need to know what type of namespace you're setting. In fact the Default implementation is never used, so we can just drop it. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-22 11:54:02 +10:00
David Gibson	7b83b7ec1f	agent/uevent: Better initialize Uevent in test We had some code that initialized a Uevent to the default value, then set specific fields to various values. This can be accomplished inside the one initialized using the ..Default::default() syntax. Making this change stops clippy from complaining. fixes #1611 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-22 11:53:57 +10:00
David Gibson	b0190a407f	agent: Use vec![] macro rather than init-then-push We have one place where we create an empty vector then immediately push something into it. We can do this in one step using the vec![] macro, which stops clippy complaining. fixes #1611 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-22 11:53:56 +10:00
David Gibson	1c43245e3e	agent/device: Remove unneeded Result<> wrappers from uev matchers The various types implementing the UeventMatcher trait have new() methods which return a Result<>, however none of them can actually fail. This is a leftover from their development where some versions could fail to initialize. Remove the unnecessary wrappers to silence clippy. fixes #1611 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-22 11:53:34 +10:00
David Gibson	e41cdb8b9f	agent: Use str::is_empty() method in config::get_string_value() An explicit check against "" is a bit less clear and makes clippy complain. fixes #1611 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-22 11:53:29 +10:00
David Gibson	2377c0975c	agent: Use CamelCase for NamespaceType values Currently these are in all-caps, to match typical capitalization of IPC, UTS and PID in the world at large. However, this violates Rust's capitalization conventions and makes clippy complain. fixes #1611 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-22 11:53:24 +10:00
David Gibson	75eca6d56f	agent/rustjail: Clean up error path in execute_hook()s async task Clippy (in Rust 1.51 at least) has some complaints about this closure inside execute_hook() because it uses explicit returns in some places where it doesn't need them, because they're the last expression in the function. That isn't necessarily obvious from a glance, but we can make clippy happy and also make things a little clearer: first we replace a somewhat verbose 'match' using Option::ok_or_else(), then rearrange the remaining code to put all the error path first with an explicit return then the "happy" path as the straight line exit with an implicit return. fixes #1611 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-22 11:53:23 +10:00
David Gibson	6ce1e56d20	agent/rustjail: Remove an unnecessary PathBuf PathBuf is an owned, mutable Path. We don't need those properties in get_value_from_cgroup() so we can use a Path instead. This may be slightly safer, and definitely stops clippy (version 1.51 at least) from complaining. fixes #1611 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-22 11:53:04 +10:00
David Gibson	3c4485ece3	agent/rustjail: Clean up some static definitions with vec! macro DEFAULT_ALLOWED_DEVICES and DEFAULT_DEVICES are essentially global constant lists. They're implemented as a lazy_static! initialized Vec values. The code to initialize them creates an empty Vec then pushes values onto it. We can simplify this a bit by using the vec! macro. This might be slightly more efficient, and it definitely stops recent clippy versions (e.g. 1.51) from complaining about it. fixes #1611 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-22 11:52:59 +10:00
David Gibson	eaec5a6c06	agent/oci: Change name case to make clippy happy Recent versions of clippy (e.g. in Rust 1.51) complain about a number of names in the oci crate, which don't obey Rust's normal CamelCasing conventions. It's pretty clear that these don't obey the usual rules because they are attempting to preserve conventional casing of existing acronyms they incorporate ("VM", "POSIX", etc.). However, it's been my experience that matching the case and name conventions of your environs is more important than matching case with external norms. Therefore, this patch changes all the identifiers in the oci crate to match Rust conventions. Their users in the rustjail crate are updated to match. fixes #1611 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-22 11:52:54 +10:00
David Gibson	3f5fdae0d8	agent/rustjail: (trivial) Clean up comment on process_grpc_to_oci() This comment appears to be connected specifically with this function, but has some other items separating it for no particular reason. It also has a typo. Correct both. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-22 11:52:45 +10:00
David Gibson	210f39a46f	agent/rustjail: Simplify renaming imports Functions in rustjail deal with both the local oci module's data structure and the protocol::oci module's data structure. Since these both cover the OCI container config they are quite similar and have many identically named types. To avoid conflicts, we import many things from those modules with altered names. However the names we use oci* and grpc* don't fit the normal Rust capitalization convention for types. However by renaming the import of the 'protocols::oci' module itself to 'grpc', we can actually get rid of the many renames by just qualifying at each use site with only a very small increase in verbosity. As a bonus this gets rid of multiple 'use' items scattered through the file. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-22 11:52:42 +10:00
Julien Ropé	d4a5413774	runtime: Fix stdout/stderr output from container being truncated Do not close the tty as part of the stdout redirection routine. The close is already happening a couple lines below, after all routines have finished. Fixes #1713 Signed-off-by: Julien Ropé <jrope@redhat.com>	2021-04-21 17:09:09 +02:00
Tim Zhang	8ecf8e5c1f	agent: use channel instead of pipe to send exit signal of process The situation is not a IPC scene, pipe(2) is too heavy. We have tokio::sync::watch::channel after tokio has been introduced. The channel has better performance and easy to use. Fixes: #1721 Signed-off-by: Tim Zhang <tim@hyper.sh>	2021-04-21 16:47:41 +08:00
Chelsea Mafrica	1c222c75ac	Merge pull request #1697 from jodh-intel/improve-agent-shutdown-handling Improve agent shutdown handling	2021-04-20 21:25:36 -07:00
Manabu Sugimoto	81c5ff1231	agent: Update seccomp configuration for errnoRet and flags Update: - Make the type of errnoRet in oci.proto oneof - Update seccomp_grpc_to_oci that can set errnoRet as EPERM if the value is empty. - Update the oci.pb.go based on the above fixes - Add seccomp errnoRet and flags option to configs in rustjail Fixes: #1719 Signed-off-by: Manabu Sugimoto <Manabu.Sugimoto@sony.com>	2021-04-21 12:16:58 +09:00
Fabiano Fidêncio	4c177b5c40	Merge pull request #1599 from Jakob-Naucke/virtiofs-s390x Enable virtio-fs on s390x	2021-04-20 21:07:15 +02:00
Carlos Venegas	cd27308755	Merge pull request #1432 from dgibson/bug1431 block: Generate PCI path for virtio-blk devices on clh	2021-04-20 12:00:09 -05:00
Fabiano Fidêncio	9df86d28a5	Merge pull request #1678 from cmaf/remove-spans-healthcheck runtime: Disable trace for healthcheck	2021-04-20 18:38:47 +02:00
Jakob Naucke	7f60911333	virtcontainers: Allow s390x appendVhostUserDevice Remove the prohibition of vhost-user devices on s390x, which are by now supported (e.g. vhost-user-fs-ccw). As a consequence, appendVhostUserDevice no longer needs an error in its signature. This enables virtio-fs support on s390x. Fixes: #1469 Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2021-04-20 12:20:32 +02:00
Jakob Naucke	67ac4f4585	runtime: update GoVMM for memory backend support Update GoVMM to get memory backend support for non-DIMM setups. This is necessary for virtio-fs on s390x. Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2021-04-20 12:19:52 +02:00
David Gibson	6577b01a5c	agent/rustjail: Fix accidental damage from tokio conversion register_memory_event_v2() includes a closure spawned as an async task with tokio. At the end of that closure, there's a test for a closed fd exiting if so. But this is right at the end of the closure when it was about to exit anyway, so this does nothing. This code was originally an explicit thread, converted to a tokio task by `332fa4c` "agent: switch to async runtime". It looks like there was an error during conversion, where this logic was accidentally moved out of the while loop above, where it makes a lot more sense. Put it back into the loop. fixes #1702 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-19 16:54:43 +10:00
James O. D. Hunt	de2631e711	utils: Make WaitLocalProcess safer Rather than relying on the system clock, use a channel timeout to avoid problems if the system time changed. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-04-15 15:46:42 +01:00
James O. D. Hunt	9256e590dc	shutdown: Don't sever console watcher too early Fixed logic used to handle static agent tracing. For a standard (untraced) hypervisor shutdown, the runtime kills the VM process once the workload has finished. But if static agent tracing is enabled, the agent running inside the VM is responsible for the shutdown. The existing code handled this scenario but did not wait for the hypervisor process to end. The outcome of this being that the console watcher thread was killed too early. Although not a problem for an untraced system, if static agent tracing was enabled, the logs from the hypervisor would be truncated, missing the crucial final stages of the agent's shutdown sequence. The fix necessitated adding a new parameter to the `stopSandbox()` API, which if true requests the runtime hypervisor logic simply to wait for the hypervisor process to exit rather than killing it. Fixes: #1696. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-04-15 15:22:00 +01:00
James O. D. Hunt	51ab870091	utils: Improve WaitLocalProcess Previously, the hypervisors were sending a signal and then checking to see if the process had died by sending the magic null signal (`0`). However, that doesn't work as it was written: the logic was assuming sending the null signal to a process that was dead would return `ESRCH`, but it doesn't: you first need to `wait(2)` for the process before sending that signal. This means that previously, all affected hypervisors would appear to take `timeout` seconds to end, even though they had _already_ finished. Now, the hypervisor's true end time will be seen as we wait for the processes before sending the null signal to ensure the process has finished. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-04-15 14:51:06 +01:00
James O. D. Hunt	507ef6369e	utils: Add waitLocalProcess function Refactored some of the hypervisors to remove the duplicated code used to trigger a shutdown. Also added some unit tests. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-04-15 14:51:03 +01:00
Chelsea Mafrica	0f2fe4a418	Merge pull request #1565 from Jakob-Naucke/s390x-fix-cli-test cli: Use genericGetExpectedHostDetails on s390x	2021-04-14 10:25:23 -07:00
David Gibson	1d5098de70	agent/block: Generate PCI path for virtio-blk devices on clh Currently runtime and agent special case virtio-blk devices under clh, ostensibly because the PCI address information is not available in that case. In fact, cloud-hypervisor's VmAddDiskPut API does return a PciDeviceInfo, which includes a PCI address. That API is broken, because PCI addressing depends on guest (firmware or OS) actions that the hypervisor won't know about. clh only gets away with this because it only uses a single PCI root and never uses PCI bridges, in which case the guest addresses are accurately predictable: they always have domain and bus zero. Until https://github.com/kata-containers/kata-containers/pull/1190, Kata couldn't handle PCI addressing unless there was exactly one bridge, which might be why this was actually special-cased for clh. With #1190 merged, we can handle more general PCI paths, and we can derive a trivial (one element) PCI path from the information that the clh API gives us. We can use that to remove this special case. fixes #1431 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-13 13:29:24 +10:00
David Gibson	e7c97f0f5d	runtime/tests: Change "moo FAILURE" message Change the "moo FAILURE" message shown in a couple of the unit tests to "moo message". This means that searching for unrelated failures in the test output by looking for "FAIL" won't show these messages as false positives any more. fixes #1683 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-13 13:25:03 +10:00
Fupan Li	17d33868c2	Merge pull request #1670 from liubin/1668-remove-ProcessListContainer-API remove ProcessListContainer API	2021-04-12 10:22:37 +08:00
Chelsea Mafrica	543f9da3ba	runtime: Disable trace for healthcheck With tracing enabled, grpc health check generates a large number of spans which creates too much data for tasks running longer than a few minutes. To solve this, remove span creation from kata agent check() and sendReq() where the majority of the spans come from. Leave contexts in functions for subsequent calls that create spans. Fixes #1395 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2021-04-09 15:47:00 -07:00
bin	421439c633	API: remove ProcessListContainer/ListProcesses This commit will remove ProcessListContainer API from VCSandbox and ListProcesses from agent.proto. Fixes: #1668 Signed-off-by: bin <bin@hyper.sh>	2021-04-09 17:34:25 +08:00
David Gibson	0e04d6299b	Merge pull request #1642 from dgibson/ueventplus Refine uevent matching conditions	2021-04-09 13:10:52 +10:00
Eric Ernst	2334b858a0	Merge pull request #1661 from liubin/1660-replace-newStore-by-store virtcontainers: replace newStore by store in Sandbox struct	2021-04-08 13:17:44 -07:00
bin	d75fe95685	virtcontainers: replace newStore by store in Sandbox struct The property name make newcomers confused when reading code. Since in Kata Containers 2.0 there will only be one type of store, so it's safe to replace it by `store` simply. Fixes: #1660 Signed-off-by: bin <bin@hyper.sh>	2021-04-08 23:59:16 +08:00
Eric Ernst	324b026a77	Merge pull request #1604 from wainersm/agent_mount-1 agent: log the mount point if it is already mounted	2021-04-08 08:26:12 -07:00
Tim Zhang	24b0703fda	agent: fix test for the debug console Fix test for the debug console. Signed-off-by: Tim Zhang <tim@hyper.sh>	2021-04-08 14:57:40 +08:00
Tim Zhang	790332575b	agent: async the debug console Make the debug console in this commit. Finish the rework of debug console. Fixes: #1647 Signed-off-by: Tim Zhang <tim@hyper.sh>	2021-04-08 14:57:36 +08:00
David Gibson	8ea2ce9a31	agent/device: Remove legacy uevent matching DevAddrMatcher existed purely as a transitional step as we refined the uevent matching logic for each of the different device types we care about. We've now done that, so it can be removed along with several related pieces. fixes #1628 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-08 12:30:18 +10:00
David Gibson	5d007743c1	agent/device: Refine uevent matching for pmem devices Use the new uevent matching infrastructure to refine the matching for pmem devices to something more pinned down to that device type. While we're there, fix a few ancillary problems with get_pmem_device_name(): - The name is poor - the input to this function is the expected device name, so the result isn't helpful, except that it needs to wait for the device to be ready in the guest. Change it to wait_for_pmem_device() and explicitly check that the returned device name matches the one expected. - Remove an incorrect comment in nvdimm_storage_handler() (the only caller) which appears to have been copied from the virtio-blk path, but then become stale. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-08 12:02:39 +10:00
James O. D. Hunt	9017e1100b	agent: start to rework the debug console It's the first commit of the rework. Fixes: #1647 Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-04-08 09:57:48 +08:00
David Gibson	a59e07c1f9	agent/define: Refine uevent matching for virtio-scsi devices Current get_scsi_device_name() uses the legacy uevent matching which isn't very precise. This refines it to use a specific matcher implementation. While we're at it: - No longer insist on the SCSI controller being under the PCI root. It generally will be, but there's no particular reason to require it. The matcher still has a problem in that it won't work sensibly if there are multiple SCSI busses in the guest. Fixing that requires changes on the runtime side as well, though, so it's beyond scope for this change. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-08 11:13:00 +10:00
David Gibson	484a364729	agent/device: Rework uevent handling for virtio-blk devices There are some problems with get_pci_device_name(): 1) It's misnamed: in fact it is only used for handling virtio-blk PCI devices. It's also only correct for virtio-blk devices, the event matching doesn't locate the "raw" PCI device, but rather the block device created by virtio-blk as a child of the PCI device itself. 2) The uevent matching is imprecise. As all things using the legacy DevAddrMatcher, it matches on a bunch of conditions used across several different device types, not all of which make sense for virtio-blk pci devices specifically. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-08 11:13:00 +10:00
Eric Ernst	15c2d7ed30	Merge pull request #1400 from ManaSugi/update-oci-seccomp oci: Update seccomp configuration	2021-04-07 15:18:19 -07:00
GabyCT	d922070c50	Merge pull request #1644 from lifupan/fix_env rustjail: fix the issue of missing default home env	2021-04-07 10:16:07 -05:00
GabyCT	81bcded9a3	Merge pull request #1492 from dgibson/uevent Make uevent watching mechanism more flexible	2021-04-07 10:15:33 -05:00
fupan.lfp	a938d90310	rustjail: fix the issue of missing default home env first get the "HOME" env from "/etc/passwd", if there's no corresponding uid entry in /etc/passwd, then set "/" as the home env. Fixes: #1643 Signed-off-by: fupan.lfp <fupan.lfp@antfin.com>	2021-04-07 15:11:28 +08:00
GabyCT	0b87fd436f	Merge pull request #1544 from snir911/timeout runtime: increase dial timeout	2021-04-06 16:10:51 -05:00
Wainer dos Santos Moschetta	49eec92038	agent: log the tag and mount point if it is already mounted Commit `17e9a2cff5` introduced a guard for the case where the mount point is already mounted. Instead of logging only the mount tag ("kataShared"), this change prints both the tag and the mount point path. Fixes: #1398 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2021-04-06 14:14:59 -04:00
GabyCT	aac852a0bc	Merge pull request #1561 from Jakob-Naucke/s390x-statfs-constants agent: s390x statfs constants	2021-04-06 11:11:40 -05:00
David Gibson	0828f9ba70	agent/uevent: Introduce wait_for_uevent() helper get_device_name() contains logic to wait for a specific uevent, then extract the /dev node name from it. In future we're going to want similar logic to wait on uevents, but using different match criteria, or getting different information out. To simplify this, add a wait_for_uevent() helper in the uevent module, which takes an explicit UeventMatcher object and returns the whole uevent found. To make testing easier, we also extract the cut down uevent watcher from test_get_device_name() into a new spawn_test_watcher() helper. It's used for both test_get_device_name() and a new test_wait_for_uevent() and will be useful for more tests in future. fixes #1484 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-06 21:14:52 +10:00
David Gibson	16ed55e440	agent/device: Use consistent matching for past and future uevents get_device_name() looks at kernel uevents to work out the device name for a given PCI (usually) address. However, when we call it we can't know if the uevent we're interested in has already happened (in which case it will have been recorded in Sandbox::uevent_map) or yet to come, in which case we need to register to watch it. However, we currently match differently against past and future events. For past events we simply look for a sysfs path including the address, but for future events we use a complex bit of logic in the is_match() closure. Change it to use the exact same matching logic in both cases. fixes #1397 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-06 21:14:33 +10:00
David Gibson	4b16681d87	agent/uevent: Put matcher object rather than "device address" in watch list Currently, Sandbox::uevent_watchers lists uevents to watch for by a "device address" string. This is not very clearly defined, and is matched against events with a rather complex closure created in Uevent::process_add(). That closure makes a bunch of fragile assumptions about what sort of events we could ever be interested in. In some ways it is too restrictive (requires everything to be a block device), but in others is not restrictive enough (allows things matching NVDIMM paths, even if we're looking for a PCI block device). To allow the clients more precise control over uevent matching, we define a new UeventMatcher trait with a method to match uevents. We then have the watchers list include UeventMatcher trait objects which are used directly by Uevent::process_add(), instead of constructing our match directly from dev_addr. For now we don't actually change the matching function, or even use multiple different trait implementations, but we'll refine that in future. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-06 21:14:18 +10:00
David Gibson	b8b322482c	agent/uevent: Consolidate event matching logic The event matching logic in Uevent::process_add() is split into two parts. The first checks if we care about the event at all, the second checks whether the event is relevant to a particular watcher. However, we're going to be adding more types of watchers in future, which will make the global filter too restrictive. Fold the two bits of logic together into a per-watcher filter function. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-06 20:59:43 +10:00
David Gibson	d2caff6c55	agent: Re-organize uevent processing Uevent::process() is a bit oddly organized. It treats the onlining of hotplugged memory as the "default" case, although that's quite specific, while treating the handling of hotplugged block devices more like a special case, although that's pretty close to being very general. Furthermore splitting Uevent::is_block_add_event() from Uevent::handle_block_add_event() doesn't make a lot of sense, since their logic is intimately related to each other. Alter the code to be a bit more sensible: first split on the "action" type since that's the most fundamental difference, then handle the memory onlining special case, then the block device add (which will become a lot more general in future changes). Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-06 20:59:20 +10:00
David Gibson	55ed2ddd07	agent: Store uevent watchers in Vec rather than HashMap Sandbox::dev_watcher is a HashMap from a "device address" to a channel used to notify get_device_name() that a suitable uevent has been found. However, "device address" isn't well defined, having somewhat different meanings for different device/event types. We never actually look up this HashMap by key, except to remove entries. Not looking up by key suggests that a map is not the appropriate data structure here. Furthermore, HashMap imposes limitations on the types which will prevent some future extensions we want. So, replace the HashMap with a Vec<Option<>>. We need the Option<> so that we can remove entries by index (removing them from the Vec completely would change the indices of other entries, possibly breaking concurrent work). This does mean that the vector will keep growing as we watch for different events during startup. However, we don't expect the number of device events we watch for during a run to be very large, so that shouldn't be a problem. We can optimize this later if it becomes a problem. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-06 20:59:19 +10:00
David Gibson	91e0ef5c90	agent/uevent: Report whole Uevents to device watchers Currently, when Uevent::handle_block_add_event() receives an event matching a registered watcher, it reports the /dev node name from the event back to the watcher. This changes it to report the entire uevent, not just the /dev node name. This will allow various future extensions. It also makes the client side of the uevent watching - get_device_name() - more consistent between its two paths: finding a past uevent in Sandbox::uevent_map() or waiting for a new uevent via a watcher. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-06 20:58:47 +10:00
David Gibson	3642005479	agent: Store whole Uevent in map, rather than just /dev name Sandbox::pci_device_map contains a mapping from sysfs paths to /dev entries which is used by get_device_name() to look up the right /dev node. But, the map only supplies the answer if the uevent for the device has already been received, otherwise get_device_name() has to wait for it. However the matching for already-received and yet-to-come uevents isn't quite the same which makes the whole system fragile. In order to make sure the matching for both cases is identical, we need the already-received side to store the whole uevent to match against, not just the sysfs path and device name. So, rename pci_device_map to uevent_map and store the whole uevent there verbatim. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-06 20:58:47 +10:00
David Gibson	0616202580	agent/device: Move GLOBAL_DEVICE_WATCHER into Sandbox In Kata 1.x, both the sysToDevMap and the deviceWatchers are in the sandbox structure. For some reason in Kata 2.x, the device watchers have moved to a separate global variable, GLOBAL_DEVICE_WATCHER. This is a bad idea: apart from introducing an extra global variable unnecessarily, it means that Sandbox::pci_device_map and GLOBAL_DEVICE_WATCHER are protected by separate mutexes. Since the information in these two structures has to be kept in sync with each other, it makes much more sense to keep them both under the same single Sandbox mutex. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-06 20:58:45 +10:00
David Gibson	11ae32e3c0	agent/device: Fix path matching for PCI devices For the case of virtio-blk PCI devices, when matching uevents we create a pci_p temporary. However, we build it incorrectly: the dev_addr values we use for PCI devices are relative sysfs paths from the PCI root to the device in question including an initial /. But when we construct pci_p we add an extra /, meaning the resulting path will not match properly. AFAICT the only reason we got away with this is because in practice the virtio-blk devices were discovered by the kernel before we looked for them meaning the looser matching in get_device_name() was used, rather than the pci_p logic in handle_block_add_event(). Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-06 20:58:06 +10:00
David Gibson	4f60880414	agent/device: Update test_get_device_name() The current test_get_device_name(), ported from Kata 1.x doesn't really reflect how the function is used in practice. The example path appears to be for a virtio-blk device, but it's an s390 specific variant, not a PCI device. The s390 form isn't actually supported by any of the existing users of get_device_name(). Change it to a plausible virtio-blk-pci style path to better test how get_device_name() will actually be used in practice. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-06 20:49:48 +10:00
Bin Liu	117c59150d	Merge pull request #1613 from Tim-Zhang/pipestream-shutdown-do-nothing Don't do anything in Pipestream::shutdown	2021-04-06 14:03:00 +08:00
Tim Zhang	ee6a590db1	agent: add test test_pipestream_shutdown Make sure PipeStream::shutdown() do not close the inner fd. Signed-off-by: Tim Zhang <tim@hyper.sh>	2021-04-06 11:44:56 +08:00
Tim Zhang	4a2d437043	agent: don't do anything in Pipestream::shutdown The only right way to shut down a pipe is to drop it. Otherwise PipeStream will conflict with its twins, because they both have the same fd and are both registered. Fixes: #1614 Signed-off-by: Tim Zhang <tim@hyper.sh>	2021-04-06 11:44:38 +08:00
Peng Tao	d5600641dd	Merge pull request #1603 from lifupan/fix_fsgroup Fix fsgroup	2021-04-06 11:35:03 +08:00
David Gibson	e3e670c56f	agent/device: Forward port test for get_device_name() from Kata 1.x Kata 1.x had a testcase for the equivalent getDeviceName function in Go, this adapts it to Rust and adds it to Kata 2.x. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-06 13:29:37 +10:00
David Gibson	ed08980fc1	agent: Remove many "panic message is not string literal" warnings Rust 1.51 appears to have added a new warning in anticipation of Rust 2021, which requires the format string for panic!()s (including via the various assert!() macros) to be a string literal. This triggers quite a few times in the agent code. This patch fixes them. fixes #1626 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-06 11:51:34 +10:00
Snir Sheriber	13653e7b55	runtime: increase dial timeout On some setups, starting multiple kata pods (qemu) simultaneously on the same node might cause kata VMs booting time to increase and the pods to fail with: Failed to check if grpc server is working: rpc error: code = DeadlineExceeded desc = timed out connecting to vsock 1358662990:1024: unknown Increasing default dialing timeout to 30s should cover most cases. Signed-off-by: Snir Sheriber <ssheribe@redhat.com> Fixes: #1543	2021-04-04 09:37:38 +03:00
Chelsea Mafrica	17b1452c2a	Merge pull request #1607 from fidencio/wip/only-keep-one-VERSION-file Only keep one VERSION file	2021-04-02 11:14:12 -07:00
Bo Chen	1511d966aa	Merge pull request #1616 from egernst/dechat-deruntime Dechat deruntime	2021-04-01 11:02:27 -07:00
Chelsea Mafrica	4a3282cf1a	Merge pull request #1608 from likebreath/0331/go_fmt_clh_clinet_code runtime: Format auto-generated client code for cloud-hypervisor API	2021-04-01 10:39:02 -07:00
Eric Ernst	a4c125a8b9	trace: move gRPC requests from debug to trace There are many requests to the agent that happen with relatively high frequency when a workload is running (checkRequest, as an example). Let's move from Debug to Trace to avoid bombarding journal. Signed-off-by: Eric Ernst <eric.g.ernst@gmail.com>	2021-04-01 09:03:26 -07:00
Eric Ernst	50fff97753	trace: move trace span chatter to trace rather than info No human should ever read that output. Let's at least move it to trace for now. Fixes: #1615 Signed-off-by: Eric Ernst <eric.g.ernst@gmail.com>	2021-04-01 09:02:56 -07:00
Fupan Li	5524bc806b	Merge pull request #1612 from liubin/1610/use-concrete-kata-agent-config-type runtime: use concrete KataAgentConfig instead of interface type	2021-04-01 21:26:38 +08:00
bin	6fe48329b5	runtime: use concrete KataAgentConfig instead of interface type Kata Containers 2.0 only have one type of agent, so there is no need to use interface as config's type Fixes: #1610 Signed-off-by: bin <bin@hyper.sh>	2021-04-01 13:44:45 +08:00
fupan.lfp	6493942568	mount: fix the issue of missing set fsGroup For k8s emptyDir volume, a specific fsGroup would be set for it, thus guest should get this fsGroup from runtime and set it properly on the emptyDir volume in guest. Fixes: #1580 Signed-off-by: fupan.lfp <fupan.lfp@antfin.com>	2021-04-01 11:33:26 +08:00
fupan.lfp	88e58a4f4b	agent: fix the issue of missing pass fsGroup For k8s emptyDir volume, a specific fsGroup would be set for it, thus runtime should pass this fsGroup to guest and set it properly on the emptyDir volume in guest. Fixes: #1580 Signed-off-by: fupan.lfp <fupan.lfp@antfin.com>	2021-04-01 11:33:18 +08:00
Fabiano Fidêncio	572aff53e8	build: Only keep one VERSION file Instead of having different VERSION files spread across the project, let's always use the one in the topsrcdir and remove all the others, keeping only a symlink to the topsrcdir one. Fixes: #1579 Signed-off-by: Fabiano Fidêncio <fidencio@redhat.com>	2021-03-31 23:51:20 +02:00
Bo Chen	0c38d9ecc4	runtime: Fix the format of the client code of cloud-hypervisor APIs Regenerate the client code with the added `go-fmt` step. No functional changes. Fixes: #1606 Signed-off-by: Bo Chen <chen.bo@intel.com>	2021-03-31 14:41:44 -07:00
Bo Chen	52cacf8838	runtime: Format auto-generated client code for cloud-hypervisor API This patch extends the current process of generating client code for cloud-hypervisor API with an additional step, `go-fmt`, which will remove the generated `client/go.mod` file and format all auto-generated code. Fixes: #1606 Signed-off-by: Bo Chen <chen.bo@intel.com>	2021-03-31 14:36:24 -07:00
Eric Ernst	c0c7bef2b8	Merge pull request #1592 from likebreath/0330/versions_clh_v0.14.0 versions: Update cloud-hypervisor to release v0.14.1	2021-03-31 12:39:35 -07:00
Fabiano Fidêncio	a3d8554ab9	Merge pull request #1577 from liubin/feature/1576-import-runc-v2-options-types runtime: import runtime/v2/runc/options to decode request from Docker	2021-03-31 20:35:24 +02:00
Bo Chen	84b62dc3b1	versions: Update cloud-hypervisor to release v0.14.1 Highlights for cloud-hypervisor version 0.14.0 include: 1) Structured event monitoring; 2) MSHV improvements; 3) Improved aarch64 platform; 4) Updated hotplug documentation; 6) PTY control for serial and virtio-console; 7) Block device rate limiting; 8) Plan to deprecate the support of "LinuxBoot" protocol and support PVH protocol only. Highlights for cloud-hypervisor version 0.13.0 include: 1) Wider VFIO device support; 2) Improve huge page support; 3) MACvTAP support; 4) VHD disk image support; 5) Improved Virtio device threading; 6) Clean shutdown support via synthetic power button. Details can be found: https://github.com/cloud-hypervisor/cloud-hypervisor/releases Note: The client code of cloud-hypervisor's OpenAPI is automatically generated by `openapi-generator` [1-2]. As the API changes do not impact usages in Kata, no additional changes in kata's runtime are needed to work with the latest version of cloud-hypervisor. [1] https://github.com/OpenAPITools/openapi-generator [2] https://github.com/kata-containers/kata-containers/blob/main/src/runtime/virtcontainers/pkg/cloud-hypervisor/README.md Fixes: #1591 Signed-off-by: Bo Chen <chen.bo@intel.com>	2021-03-31 11:09:47 -07:00
Orestis Lagkas Nikolos	6255cc1959	virtcontainers/fc: Upgrade Firecracker to v0.23.1 This patch upgrades Firecracker version from v0.21.1 to v0.23.1 * Generate swagger models for v0.23.1 (from firecracker.yaml) * Change uint64 types in TokenBucket object according to rate-limiter implementation (introduced in commit #cfeb966) * Update Firecracker Logger/Metrics to support the new API * Update payload in fc.vmRunning to support the new API * Add Metrics type to fcConfig Fixes: #1518 Signed-off-by: Orestis Lagkas Nikolos <olagkasn@nubificus.co.uk>	2021-03-31 04:55:40 -05:00
Chelsea Mafrica	e5aa4e7eb4	Merge pull request #1563 from Jakob-Naucke/s390x-missing-contexts virtcontainers: Fix missing contexts in s390x	2021-03-30 09:38:28 -07:00
Carlos Venegas	c748a9c278	Merge pull request #1549 from jcvenegas/2021-03-24/makefile-enable-dax-env-var runtime: makefile allow override DAX value	2021-03-30 10:06:16 -06:00
bin	09d454ac74	runtime: import runtime/v2/runc/options to decode request from Docker Shimv2 protocol CreateTaskRequest.Options has a type of *google_protobuf.Any. If the call is from Docker, the proto types (github.com/containerd/containerd/runtime/v2/runc/options) must be imported to decode the request. Fixes: #1576 Signed-off-by: bin <bin@hyper.sh>	2021-03-30 19:44:00 +08:00
Tim Zhang	b58fb25d88	Merge pull request #1555 from liubin/fix/1554-install-hook-before-test test: install mock hook binary before test	2021-03-30 14:01:56 +08:00
Eric Ernst	05680b86c4	Merge pull request #1537 from lifupan/main cgroups: fix the issue of get wrong online cpus	2021-03-29 15:56:03 -07:00
Eric Ernst	460117a1a6	Merge pull request #1510 from littlejawa/issue_1003 build: remove unused variables from Makefile	2021-03-29 14:54:09 -07:00
Carlos Venegas	0b502d15b2	runtime: makefile allow override DAX value Allow enabling DAX via an environment variable. Fixes: #1547 Signed-off-by: Carlos Venegas <jos.c.venegas.munoz@intel.com>	2021-03-29 21:28:22 +00:00
Eric Ernst	24214a536a	Merge pull request #1560 from egernst/fix-1559 container: on cleanup, rm container directory for mounts path	2021-03-29 14:14:52 -07:00
GabyCT	17840cb573	Merge pull request #1546 from devimc/2021-03-24/supportQEMU6 runtime: add support for QEMU 6	2021-03-29 14:33:16 -06:00
Eric Ernst	9a4e866654	container: on cleanup, rm container directory for mounts path A wrong path was being used for container directory when virtiofs is utilized. This resulted in a warning message in logs when a container is killed, or completes: level=warning msg="Could not remove container share dir" Without proper removal, they'd later be cleaned up when the shared path is removed as part of stopping the sandbox. Fixes: #1559 Signed-off-by: Eric Ernst <eric.g.ernst@gmail.com>	2021-03-29 11:39:39 -07:00
Jakob Naucke	1366f0fb9c	cli: Use genericGetExpectedHostDetails on s390x getExpectedHostDetails did not offload any work to genericGetExpectedHostDetails on s390x. By using that function, much redundant code can be saved. This also resolves 2 issues with the previous version: - The number of CPUs was not calculated. - vcUtils.SupportsVsocks() still used the Kata v1 signature. Fixes: #1564 Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2021-03-29 17:58:16 +02:00
Jakob Naucke	31ced01eba	virtcontainers: Fix missing contexts in s390x #1389 has added a context for many signatures to improve trace spans. Functions specific to s390x lack this. Add context where required. This affects some common code signatures, since some functions that do not require context on other architectures do require it on s390x. Also remove an unnecessary import in test_qemu_s390x.go. Fixes: #1562 Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2021-03-29 17:49:27 +02:00
Jakob Naucke	52a276fbdb	agent: Fix type for PROC_SUPER_MAGIC on s390x statfs f_types are long on most architectures, but not on s390x, where they are uint. Following the fix in rust-lang/libc at https://github.com/rust-lang/libc/pull/1999, the custom defined PROC_SUPER_MAGIC must be updated in a similar way. Fixes: #1204 Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2021-03-29 17:25:19 +02:00
Jakob Naucke	5b7c8b7d26	agent: Update cgroups-rs to 0.2.5 to pull in the chain of https://github.com/rust-lang/libc/pull/1999, https://github.com/nix-rust/nix/pull/1372, and https://github.com/kata-containers/cgroups-rs/pull/38. This adds statfs constants on s390x. cgroups-rs 0.2.4 also contains this fix, but let's move to the latest 0.2.5 right away. Fixes: #1204 Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2021-03-29 17:25:14 +02:00
bin	48e5e4f2f3	test: install mock hook binary before test `make test` depends on the mock hook in the virtcontainers directory, so install it before running the tests. Also run the tests as a normal user and as root in GitHub Actions. Fixes: #1554 Signed-off-by: bin <bin@hyper.sh>	2021-03-29 22:40:45 +08:00
James O. D. Hunt	1d448813a1	uevent: Add shutdown channel for task Allow the uevent task to shutdown on request. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-03-29 14:32:12 +01:00
James O. D. Hunt	d8d5b4cd1d	signal: Move to a new module Move the signal handling code into a new module and refactor into the main handler and a new SIGCHLD handling function to make the code simpler and easier to understand. Also added a unit test for shutdown. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-03-29 14:32:12 +01:00
James O. D. Hunt	011f7d785a	logging: Rework for shutdown Make changes to logger thread to allow the logger to be replaced with a NOP logger (required for agent shutdown). Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-03-29 14:32:12 +01:00
James O. D. Hunt	7d5f88c0ad	agent: Enable clean shutdown The agent doesn't normally shut down: it doesn't need to, as it is killed after the workload has finished. However, a clean and ordered shutdown sequence is required to support agent tracing, since all trace spans need to be completed to ensure a valid trace transaction. Enable a controlled shutdown by allowing the main threads (tasks) to be stopped. To allow this to happen, each thread is now passed a shutdown channel which it must listen to asynchronously, shutting itself down if activity is detected on that channel. Since some threads are created for I/O and since the standard `io::copy` cannot be stopped, added a new `interruptable_io_copier()` function which shares the same semantics as `io::copy()`, but which is also passed a shutdown channel to allow asynchronous I/O operations to be stopped cleanly. Fixes: #1531. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-03-29 14:32:12 +01:00
James O. D. Hunt	dcb39c61f1	main: Create logger task Encapsulate the logic for handling the task that displays logger output into a new function to simplify the code and remove another anonymous async block. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-03-29 14:32:11 +01:00
James O. D. Hunt	2cf2897d31	main: Use task list for stopping tasks Maintain a list of tasks and wait on them all before main returns. This is preparatory work for the agent shutdown: all tasks that are started need to be added to the list. This aggregation makes it easier to identify what needs to stop before the agent can exit cleanly. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-03-29 14:32:11 +01:00
James O. D. Hunt	039df1d727	main: Refactor main logic into new async function Move most of the main logic into a separate async function. This makes the code clearer and avoids the anonymous async block. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-03-29 14:32:11 +01:00
James O. D. Hunt	2a648fa760	logging: Use guard to make threaded logging safe Return a guard variable from `create_logger()` which the caller can implicitly drop to guarantee that all threads started by the async log drain are stopped. This fixes a long-standing bug [1] whereby the agent could panic with the following error, generated by the `slog` logging crate: ``` slog::Fuse Drain: Custom { kind: Other, error: "serde serialization error: Bad file descriptor (os error 9)" } ``` [1] - See https://github.com/kata-containers/kata-containers/issues/171. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-03-29 14:32:11 +01:00
James O. D. Hunt	38f0d8d3ce	config: Fix assert_error testing macro Fixed the `assert_error!()` test macro so that it correctly handles the scenario where the test expects an error, but the actual result was `Ok` (no error). Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-03-29 14:32:11 +01:00
Bin Liu	594c47ab6c	Merge pull request #1553 from bergwolf/ro-volumes runtime: fix virtiofsd RO volume sharing	2021-03-29 20:43:34 +08:00

4749 Commits