mirror of
				https://github.com/k3s-io/kubernetes.git
				synced 2025-10-26 19:15:27 +00:00 
			
		
		
		
	
		
			
				
	
	
		
			186 lines
		
	
	
		
			7.2 KiB
		
	
	
	
		
			Markdown
		
	
	
	
	
	
			
		
		
	
	
			186 lines
		
	
	
		
			7.2 KiB
		
	
	
	
		
			Markdown
		
	
	
	
	
	
| <!-- BEGIN MUNGE: UNVERSIONED_WARNING -->
 | |
| 
 | |
| <!-- BEGIN STRIP_FOR_RELEASE -->
 | |
| 
 | |
| <img src="http://kubernetes.io/kubernetes/img/warning.png" alt="WARNING"
 | |
|      width="25" height="25">
 | |
| <img src="http://kubernetes.io/kubernetes/img/warning.png" alt="WARNING"
 | |
|      width="25" height="25">
 | |
| <img src="http://kubernetes.io/kubernetes/img/warning.png" alt="WARNING"
 | |
|      width="25" height="25">
 | |
| <img src="http://kubernetes.io/kubernetes/img/warning.png" alt="WARNING"
 | |
|      width="25" height="25">
 | |
| <img src="http://kubernetes.io/kubernetes/img/warning.png" alt="WARNING"
 | |
|      width="25" height="25">
 | |
| 
 | |
| <h2>PLEASE NOTE: This document applies to the HEAD of the source tree</h2>
 | |
| 
 | |
| If you are using a released version of Kubernetes, you should
 | |
| refer to the docs that go with that version.
 | |
| 
 | |
| <!-- TAG RELEASE_LINK, added by the munger automatically -->
 | |
| <strong>
 | |
| The latest release of this document can be found
 | |
| [here](http://releases.k8s.io/release-1.3/docs/proposals/resource-metrics-api.md).
 | |
| 
 | |
| Documentation for other releases can be found at
 | |
| [releases.k8s.io](http://releases.k8s.io).
 | |
| </strong>
 | |
| --
 | |
| 
 | |
| <!-- END STRIP_FOR_RELEASE -->
 | |
| 
 | |
| <!-- END MUNGE: UNVERSIONED_WARNING -->
 | |
| 
 | |
| # Resource Metrics API
 | |
| 
 | |
| This document describes API part of MVP version of Resource Metrics API effort in Kubernetes.
 | |
| Once the agreement will be made the document will be extended to also cover implementation details.
 | |
| The shape of the effort may be also a subject of changes once we will have more well-defined use cases.
 | |
| 
 | |
| ## Goal
 | |
| 
 | |
| The goal for the effort is to provide resource usage metrics for pods and nodes through the API server.
 | |
| This will be a stable, versioned API which core Kubernetes components can rely on.
 | |
| In the first version only the well-defined use cases will be handled,
 | |
| although the API should be easily extensible for potential future use cases.
 | |
| 
 | |
| ## Main use cases
 | |
| 
 | |
| This section describes well-defined use cases which should be handled in the first version.
 | |
| Use cases which are not listed below are out of the scope of MVP version of Resource Metrics API.
 | |
| 
 | |
| #### Horizontal Pod Autoscaler
 | |
| 
 | |
| HPA uses the latest value of cpu usage as an average aggregated across 1 minute
 | |
| (the window may change in the future). The data for a given set of pods
 | |
| (defined either by pod list or label selector) should be accesible in one request
 | |
| due to performance issues.
 | |
| 
 | |
| #### Scheduler
 | |
| 
 | |
| Scheduler in order to schedule best-effort pods requires node level resource usage metrics
 | |
| as an average aggreated across 1 minute (the window may change in the future).
 | |
| The metrics should be available for all resources supported in the scheduler.
 | |
| Currently the scheduler does not need this information, because it schedules best-effort pods
 | |
| without considering node usage. But having the metrics available in the API server is a blocker
 | |
| for adding the ability to take node usage into account when scheduling best-effort pods.
 | |
| 
 | |
| ## Other considered use cases
 | |
| 
 | |
| This section describes the other considered use cases and explains why they are out
 | |
| of the scope of the MVP version.
 | |
| 
 | |
| #### Custom metrics in HPA
 | |
| 
 | |
| HPA requires the latest value of application level metrics.
 | |
| 
 | |
| The design of the pipeline for collecting application level metrics should
 | |
| be revisited and it's not clear whether application level metrics should be
 | |
| available in API server so the use case initially won't be supported.
 | |
| 
 | |
| #### Cluster Federation
 | |
| 
 | |
| The Cluster Federation control system might want to consider cluster-level usage (in addition to cluster-level request)
 | |
| of running pods when choosing where to schedule new pods. Although
 | |
| Cluster Federation is still in design,
 | |
| we expect the metrics API described here to be sufficient. Cluster-level usage can be
 | |
| obtained by summing over usage of all nodes in the cluster.
 | |
| 
 | |
| #### kubectl top
 | |
| 
 | |
| This feature is not yet specified/implemented although it seems reasonable to provide users information
 | |
| about resource usage on pod/node level.
 | |
| 
 | |
| Since this feature has not been fully specified yet it will be not supported initally in the API although
 | |
| it will be probably possible to provide a reasonable implementation of the feature anyway.
 | |
| 
 | |
| #### Kubernetes dashboard
 | |
| 
 | |
| [Kubernetes dashboard](https://github.com/kubernetes/dashboard) in order to draw graphs requires resource usage
 | |
| in timeseries format from relatively long period of time. The aggreations should be also possible on various levels
 | |
| including replication controllers, deployments, services, etc.
 | |
| 
 | |
| Since the use case is complicated it will not be supported initally in the API and they will query Heapster
 | |
| directly using some custom API there.
 | |
| 
 | |
| ## Proposed API
 | |
| 
 | |
| Initially the metrics API will be in a separate [API group](api-group.md) called ```metrics```.
 | |
| Later if we decided to have Node and Pod in different API groups also
 | |
| NodeMetrics and PodMetrics should be in different API groups.
 | |
| 
 | |
| #### Schema
 | |
| 
 | |
| The proposed schema is as follow. Each top-level object has `TypeMeta` and `ObjectMeta` fields
 | |
| to be compatible with Kubernetes API standards.
 | |
| 
 | |
| ```go
 | |
| type NodeMetrics struct {
 | |
|   unversioned.TypeMeta
 | |
|   ObjectMeta
 | |
| 
 | |
|   // The following fields define time interval from which metrics were
 | |
|   // collected in the following format [Timestamp-Window, Timestamp].
 | |
|   Timestamp unversioned.Time
 | |
|   Window    unversioned.Duration
 | |
| 
 | |
|   // The memory usage is the memory working set.
 | |
|   Usage v1.ResourceList
 | |
| }
 | |
| 
 | |
| type PodMetrics struct {
 | |
|   unversioned.TypeMeta
 | |
|   ObjectMeta
 | |
| 
 | |
|   // The following fields define time interval from which metrics were
 | |
|   // collected in the following format [Timestamp-Window, Timestamp].
 | |
|   Timestamp unversioned.Time
 | |
|   Window    unversioned.Duration
 | |
| 
 | |
|   // Metrics for all containers are collected within the same time window.
 | |
|   Containers []ContainerMetrics
 | |
| }
 | |
| 
 | |
| type ContainerMetrics struct {
 | |
|   // Container name corresponding to the one from v1.Pod.Spec.Containers.
 | |
|   Name string
 | |
|   // The memory usage is the memory working set.
 | |
|   Usage v1.ResourceList
 | |
| }
 | |
| ```
 | |
| 
 | |
| By default `Usage` is the mean from samples collected within the returned time window.
 | |
| The default time window is 1 minute.
 | |
| 
 | |
| #### Endpoints
 | |
| 
 | |
| All endpoints are GET endpoints, rooted at `/apis/metrics/v1alpha1/`.
 | |
| There won't be support for the other REST methods.
 | |
| 
 | |
| The list of supported endpoints:
 | |
| - `/nodes` - all node metrics; type `[]NodeMetrics`
 | |
| - `/nodes/{node}` - metrics for a specified node; type `NodeMetrics`
 | |
| - `/namespaces/{namespace}/pods` - all pod metrics within namespace with support for `all-namespaces`; type `[]PodMetrics`
 | |
| - `/namespaces/{namespace}/pods/{pod}` - metrics for a specified pod; type `PodMetrics`
 | |
| 
 | |
| The following query parameters are supported:
 | |
| - `labelSelector` - restrict the list of returned objects by labels (list endpoints only)
 | |
| 
 | |
| In the future we may want to introduce the following params:
 | |
| `aggreator` (`max`, `min`, `95th`, etc.) and `window` (`1h`, `1d`, `1w`, etc.)
 | |
| which will allow to get the other aggregates over the custom time window.
 | |
| 
 | |
| ## Further improvements
 | |
| 
 | |
| Depending on the further requirements the following features may be added:
 | |
| - support for more metrics
 | |
| - support for application level metrics
 | |
| - watch for metrics
 | |
| - possibility to query for window sizes and aggreation functions (though single window size/aggregation function per request)
 | |
| - cluster level metrics
 | |
| 
 | |
| <!-- BEGIN MUNGE: GENERATED_ANALYTICS -->
 | |
| []()
 | |
| <!-- END MUNGE: GENERATED_ANALYTICS -->
 |