kata-containers

mirror of https://github.com/kata-containers/kata-containers.git synced 2025-08-24 10:41:43 +00:00

Author	SHA1	Message	Date
Alex Lyn	cf74166d75	Merge pull request #9015 from Apokleos/bugfix-exec-uds runtime: display accurate error msg to avoid misleading users.	2024-02-05 13:50:43 +08:00
Alex Lyn	c6830ceb89	runtime: display accurate error msg to avoid misleading users. The original handling method does not reach user expectations. When the ClientSocketAddress method stats the corresponding path of runtime-rs and has not found it yet, we should return an error message here that includes the reason for the failure (which should be an error display indicating that both runtime-go and runtime-rs were not found). Instead of simply displaying the corresponding path of runtime-rs as the final error message to users. It is also necessary to return the error promptly to the caller for further error handling. Fixes: #8999 Signed-off-by: Alex Lyn <alex.lyn@antgroup.com>	2024-02-04 16:45:59 +08:00
Guoqiang Ding	7bf1ebe16d	kata-monitor: fix agentUrl from containerd shim Fix the missing leading slash. Fixes: #9013 Signed-off-by: Guoqiang Ding <dgq8211@gmail.com>	2024-02-04 16:24:13 +08:00
Beraldo Leal	7641c19f74	runtime: bump containerd for gogo deprecation This update includes necessary changes due to the version bump of containerd and its dependencies. It's part of a broader initiative to phase out gogo protobuf, which has been deprecated, and to align with the current supported libraries. Fixes #7420. Signed-off-by: Beraldo Leal <bleal@redhat.com>	2023-11-06 16:49:59 +00:00
Alexandru Matei	db2cac34d8	runtime: Don't create socket file in /run/kata The socket file for shim management is created in /run/kata and it isn't deleted after the container is stopped. After running and stopping thousands of containers /run folder will run out of space. Fixes #6622 Signed-off-by: Alexandru Matei <alexandru.matei@uipath.com> Co-authored-by: Greg Kurz <groug@kaod.org>	2023-04-13 10:21:29 +03:00
Miao Xia	0f73515561	runtime: add filter metrics with specific names The kata monitor metrics API returns a huge size response, if containers or sandboxes are a large number, focus on what we need will be harder. Fixes: #6500 Signed-off-by: Miao Xia <xia.miao1@zte.com.cn>	2023-03-28 14:56:13 +08:00
Eric Ernst	0706fb28ac	kata-runtime: shmgmt: make url usage consistent Before, we had a mix of slash, etc. Unfortunately, when cleaning URL paths, serve mux seems to mangle the request method, resulting in each request being a GET (instead of PUT or POST). Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2022-05-31 09:27:58 -07:00
Bin Liu	362201605e	Merge pull request #4055 from fgiudici/kata-monitor_pprof kata-monitor: update the hrefs in the debug/pprof index page	2022-04-16 08:12:18 +08:00
Francesco Giudici	86977ff780	kata-monitor: update the hrefs in the debug/pprof index page kata-monitor allows to get data profiles from the kata shim instances running on the same node by acting as a proxy (e.g., http://$NODE_ADDRESS:8090/debug/pprof/?sandbox=$MYSANDBOXID). In order to proxy the requests and the responses to the right shim, kata-monitor requires to pass the sandbox id via a query string in the url. The profiling index page proxied by kata-monitor contains the link to all the data profiles available. All the links anyway do not contain the sandbox id included in the request: the links result then broken when accessed through kata-monitor. This happens because the profiling index page comes from the kata shim, which will not include the query string provided in the http request. Let's add on-the-fly the sandbox id in each href tag returned by the kata shim index page before providing the proxied page. Fixes: #4054 Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2022-04-12 15:53:59 +02:00
bin	f8cc5d1ad8	kata-monitor: add some links when generating pages for browsers Add some links to rendered webpages for better user experience, let users can jump to pages only by clicking links in browsers. Fixes: #4061 Signed-off-by: bin <bin@hyper.sh>	2022-04-11 09:29:56 +08:00
Eric Ernst	1e301482e7	Merge pull request #3406 from fengwang666/direct-blk-assignment Implement direct-assigned volume	2022-03-04 11:58:37 -08:00
Fabiano Fidêncio	7e5f11a52b	vendor: Update containerd to 1.6.1 Let's bring in the latest release of Containerd, 1.6.1, released on March 2nd, 2022. With this, we take the opportunity to remove containerd/api reference as we shouldn't need a separate module only for the API. Here's the list of changes needed in the code due to the bump: * stop using `grpc.WithInsecure()` as it's been deprecated - use `grpc.WithTransportCredentials(insecure.NewCredentials())` instead Fixes: #3820 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-03-04 10:28:40 +01:00
Feng Wang	e9b5a25502	runtime: add stat and resize APIs to containerd-shim-v2 To query fs stats and resize fs, the requests need to be passed to kata agent through containerd-shim-v2. So we're adding to rest APIs on the shim management endpoint. Also refactor shim management client to its own go file. Fixes: #3454 Signed-off-by: Feng Wang <feng.wang@databricks.com>	2022-03-03 18:56:53 -08:00
Francesco Giudici	fec26f8e51	kata-monitor: trivial: rename symbols & labels We introduced collection of sandboxes metadata from the CRI that will be attached to the sandbox metrics: this will allow to immediately match sandboxes metrics with CRI workloads. Rename the symbols from Kube to CRI as the metadata will be there every time pods are created through CRI, also if kubernetes is not installed (e.g., 'crictl runp'). Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2022-02-23 18:34:32 +01:00
Francesco Giudici	3ac52e8193	kata-monitor: fix updating sandbox cache at startup We now rely on fs events only to update the sandbox cache. This is not true anyway for sandboxes already present at kata-monitor startup: we just retrieve the list and add them in the cache only when we get their CRI metadata. If CRI metadata is not available we will never add them to the sandbox cache. Fix this by immediately adding the sandboxes we find at startup time to the sandbox cache. Fixes: #3705 Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2022-02-23 11:21:06 +01:00
Francesco Giudici	ab447285ba	kata-monitor: add kubernetes pod metadata labels to metrics Add the POD metadata we get from the container manager to the metrics by adding more labels. Fixes: #3551 Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2022-01-26 13:48:45 +01:00
Francesco Giudici	834e199eee	kata-monitor: drop unused functions Drop the functions we are not using anymore. Update the tests too. Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2022-01-26 13:48:45 +01:00
Francesco Giudici	7516a8c51b	kata-monitor: rework the sandbox cache sync with the container manager Kata-monitor detects started and terminated kata pods by monitoring the vc/sbs fs (this makes sense since we will have to access that path to access the sockets there to get the metrics from the shim). While kata-monitor updates its sandbox cache based on the sbs fs events, it will schedule also a sync with the container manager via the CRI in order to sync the list of sandboxes there. The container manager will be the ultimate source of truth, so we will stick with the response from the container manager, removing the sandboxes not reported from the container manager. May happen anyway that when we check the container manager, the new kata pod is not reported yet, and we will remove it from the kata-monitor pod cache. If we don't get any new kata pod added or removed, we will not check with the container manager again, missing reporting metrics about that kata pod. Let's stick with the sbs fs as the source of truth: we will update the cache just following what happens on the sbs fs. At this point we may have also decided to drop the container manager connection... better instead to keep it in order to get the kube pod metadata from it, i.e., the kube UID, Name and Namespace associated with the sandbox. Every time we get a new sandbox from the sbs fs we will try to retrieve the pod metadata associated with it. Right now we just attach the container manager sandbox id as a label to the exposed metrics, making hard to link the metrics to the running pod in the kubernetes cluster. With kubernetes pod metadata we will be able to add them as labels to map explicitly the metrics to the kubernetes workloads. Fixes: #3550 Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2022-01-26 13:48:45 +01:00
Francesco Giudici	e78d80ea0d	kata-monitor: silently ignore CHMOD events on the sandboxes fs We currently WARN about unexpected fs events, which includes CHMOD operations (which should be actually expected...). Just ignore all the fs events we don't care about without any warn. We dump all the events with debug log in any case. Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2022-01-26 13:48:45 +01:00
Francesco Giudici	e9eb34cea8	kata-monitor: improve debug logging Improve debug log formatting of the sandbox cache update process. Move raw and tracing logs from the DEBUG to the TRACE log level. Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2022-01-26 13:48:45 +01:00
bin	03546f75a6	runtime: change io/ioutil to io/os packages Change io/ioutil to io/os packages because io/ioutil package is deprecated from 1.16: Discard => io.Discard NopCloser => io.NopCloser ReadAll => io.ReadAll ReadDir => os.ReadDir ReadFile => os.ReadFile TempDir => os.MkdirTemp TempFile => os.CreateTemp WriteFile => os.WriteFile Details: https://go.dev/doc/go1.16#ioutil Fixes: #3265 Signed-off-by: bin <bin@hyper.sh>	2021-12-15 07:31:48 +08:00
Francesco Giudici	315295e0ef	runtime: rename GetSanboxesStoragePath() --> GetSandboxesStoragePath() Add the missing 'd'. Fixes: #2738 Suggested-by: Jakob Naucke <jakob.naucke@ibm.com> Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-09-27 15:56:14 +02:00
Francesco Giudici	bfb556d56a	kata-monitor: refresh kata sandbox list on fs events This commit stops the container engine polling in favor of the kata sandbox storage path monitoring. The pod cache list is now refreshed based on fs events and synced with the container engine only when needed. Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-09-23 14:32:09 +02:00
Francesco Giudici	0e854f3b80	kata-monitor: improve detection of kata workloads When the container engine is different than containerd or CRI-O we lack proper detection of kata workloads and consider all the pods as kata ones. Instead of querying the container engine for the lower level runtime used in each pod, check if a directory matching the pod exists in the virtualcontainers sandboxes storage path. This provides a container engine independent way to check for kata pods. Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-09-23 14:32:09 +02:00
Francesco Giudici	afad910d0e	kata-monitor: add getSandboxFS() Retrieve the absolute sandbox storage path. We will soon need this to monitor the creation/deletion of new kata sandboxes. Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-09-20 10:37:55 +02:00
Francesco Giudici	245a12bbb7	kata-monitor: improve sandbox caching In order to retrieve the list of sandboxes, we poll the container engine every 15 seconds via the CRI. Once we have the list we have to inspect each pod to find out the kata ones. This commit extend the sandbox cache to keep track of all the pods, marking the kata ones, so that during the next polling only the new sandboxes should be inspected to figure out which ones are using the kata runtime. Fixes: #2563 Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-09-20 10:37:55 +02:00
Francesco Giudici	fc067d61d4	kata-monitor: warn when unable to retrieve the lower level runtime this is an unexpected event (likely a change in how containerd/cri-o record the lower level runtime in the pod) and should be more visible: raise the log level to "warning". Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-09-20 10:37:54 +02:00
Francesco Giudici	53ec4df953	kata-monitor: minor fixes fix comment and use literals Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-09-20 10:37:54 +02:00
Peng Tao	4f7cc18622	runtime: refactor commandline code directory Move all command line code to `cmd` and move containerd-shim-v2 to pkg. Fixes: #2627 Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2021-09-16 17:19:18 +08:00
Francesco Giudici	2d8386ea52	kata-monitor: add few unit tests Add cri.go unit tests Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-08-05 11:41:54 +02:00
Francesco Giudici	8714a35063	kata-monitor: make code to identify kata pods simpler just search for the "kata" substring in the runtime value and log at info level when the runtime name/type is not found. Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-08-05 11:41:54 +02:00
Francesco Giudici	68a6f011b5	kata-monitor: drop the runtime info from the sandbox cache We keep the container engine info in the sandbox cache map, as the value associated to the pod id (the key). Since we used that in getMonitorAddress() only (which is gone) we can avoid storing that information. Let's drop it. Keep the map structure and the [put,delete]IfExists functions as we may want to move to an event based cache update process sooner or later, and we will need those. Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-08-05 11:41:54 +02:00
Francesco Giudici	97dcc5f78a	kata-monitor: drop getMonitorAddress() since the shim socket path is statically defined in the containerd-shimv2 code, we don't need to retrieve the socket name from the filesystem: construct the socket name using the containerd-shimv2 code. Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-08-05 11:41:54 +02:00
Francesco Giudici	c2f03e8993	kata-monitor: talk to the container engine via the CRI kata-monitor uses containerd client to retrieve information from the container engine. This makes kata-monitor work with the containerd container engine only. Bin Liu (bin <bin@hyper.sh>) worked on a kata-monitor version able to talk to any container engine leveraging the standard CRI[1]. Here, the original work of Bin Liu has been adapted on the current kata-monitor to make it container engine independent. [1] https://github.com/liubin/kata-containers/tree/fix/1030-use-cri-in-kata-monitor Fixes: #1030 Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-08-05 11:41:54 +02:00
Julio Montes	31de8eb75b	runtime: pkg: fix govet fieldalignment Fix structures alignment Signed-off-by: Julio Montes <julio.montes@intel.com>	2021-07-20 10:30:30 -05:00
fupan.lfp	8e0daf6780	shimv2: fix the issue of kata-runtime exec failed Commit `32c9ae1388` upgrade the containerd vendor, which used the socket path to replace the abstract socket address for socket listen and dial, and there's a bug in containerd's abstract socket dialing. Thus we should replace our monitor and exec socket server with the socket path to fix this issue. Fixes: #2238 Signed-off-by: fupan.lfp <fupan.lfp@antgroup.com>	2021-07-16 11:41:09 +08:00
Eric Ernst	3787306107	kata-monitor: export get stats for sandbox Gathering stats for a given sandbox is pretty useful; let's export a function from katamonitor pkg to do this. Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-05-10 08:53:56 -07:00
Eric Ernst	3caed6f88d	runtime: shim: dedup client, socket addr code (1) Add an accessor function, SocketAddress, to the shim-v2 code for determining the shim's abstract domain socket address, given the sandbox ID. (2) In kata monitor, create a function, BuildShimClient, for obtaining the appropriate http.Client for communicating with the shim's monitoring endpoint. (3) Update the kata CLI and kata-monitor code to make use of these. (4) Migrate some kata monitor methods to be functions, in order to ease future reuse. (5) drop unused namespace from functions where it is no longer needed. Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-05-07 15:20:37 -07:00
Peng Tao	74192d179d	runtime: fix static check errors It turns out we have managed to break the static checker in many different places with the absence of static checker in github action. Let's fix them while enabling static checker in github actions... Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2021-03-24 20:10:19 +08:00
bin	44cde6e464	runtime: connect guest debug console bypass kata-monitor Parse agent socket address by conversation to improve usability of using guest debug console. Fixes: #1329 Signed-off-by: bin <bin@hyper.sh>	2021-02-09 19:36:48 +08:00
bin liu	1b7ed32836	kata-monitor: use regexp to check if runtime is kata containers To support a few common configurations for Kata, including: - `io.containerd.kata.v2` - `io.containerd.kata-qemu.v2` - `io.containerd.kata-clh.v2` `kata-monitor` changes to use regexp instead of direct string comparison. Fixes: #957 Signed-off-by: bin liu <bin@hyper.sh>	2020-10-15 18:42:44 +08:00
bin liu	febdf8f68c	runtime: add debug console service Add `kata-runtime exec` to enter guest OS through shell started by agent Fixes: #245 Signed-off-by: bin liu <bin@hyper.sh>	2020-09-27 10:57:17 +08:00
bin liu	bbf8517050	runtime: add pprof interface for shim Add new http interfaces to support pprof: - /sandboxes - /debug/vars - /debug/pprof/ - /debug/pprof/cmdline - /debug/pprof/profile - /debug/pprof/symbol - /debug/pprof/trace Fixes: #397 Signed-off-by: bin liu <bin@hyper.sh>	2020-07-10 13:05:25 +08:00
bin liu	1b75daa00f	runtime: add new command to collect metrics from Kata containers Add a new command to collect metrics and return metrics to Prometheus. Signed-off-by: bin liu <bin@hyper.sh>	2020-07-02 17:54:54 +08:00

44 Commits