kubernetes

mirror of https://github.com/k3s-io/kubernetes.git synced 2025-12-03 21:04:47 +00:00

Author	SHA1	Message	Date
David Porter	a854ddb358	Implement metrics for Windows Nodes This implements stats for windows nodes in a new package, winstats. WinStats exports methods to get cadvisor like datastructures, however with windows specific metrics. WinStats only gets node level metrics and information, container stats will go via the CRI. This enables the use of the summary api to get metrics for windows nodes.	2017-09-14 06:32:51 +00:00
Yu-Ju Hong	2c415cc506	kubelet: enable CRI container metrics	2017-09-13 15:09:35 -07:00
Lee Verberne	e2e6a8cd85	Fix typo in kubelet kuberuntime container test Changes "Expetected" to "Expected"	2017-09-13 14:32:48 +02:00
Kubernetes Submit Queue	c6a9b1e198	Merge pull request #52125 from yujuhong/fix-file-sync Automatic merge from submit-queue (batch tested with PRs 52339, 52343, 52125, 52360, 52301) dockershim: check if f.Sync() returns an error and surface it ```release-note dockershim: check the error when syncing the checkpoint. ```	2017-09-12 21:45:56 -07:00
Balaji Subramaniam	e2e356964a	Make CPU manager release allocated CPUs when container enters completed phase.	2017-09-12 21:01:01 -07:00
Kubernetes Submit Queue	b04f81d342	Merge pull request #52344 from smarterclayton/no_log_pull Automatic merge from submit-queue (batch tested with PRs 48226, 52046, 52231, 52344, 52352) Log at higher verbosity levels some common SyncPod errors This log message was 90% of all glog.Errorf level statements reported on a production cluster, hiding other more impactful errors. We already log it in start container, but for extra caution we continue to log it at v(3) here (the downside of not logging a start container error is worse than some log spam at higher levels). HandleError() is intended only for unknown and unexpected errors. ```release-note NONE ``` @derekwaynecarr @sjenning	2017-09-12 19:40:03 -07:00
Kubernetes Submit Queue	32f1521cc2	Merge pull request #52046 from dashpole/soft_eviction Automatic merge from submit-queue (batch tested with PRs 48226, 52046, 52231, 52344, 52352) [BugFix] Soft Eviction timer works correctly fixes #51516 thresholdsMet should not exclude previously met thresholds when we do not have new stats for a threshold. /assign @vishh @derekwaynecarr cc @kubernetes/sig-node-bugs	2017-09-12 19:39:55 -07:00
Kubernetes Submit Queue	8e95e39c15	Merge pull request #52297 from derekwaynecarr/code-hygiene Automatic merge from submit-queue (batch tested with PRs 51041, 52297, 52296, 52335, 52338) Use cAdvisor constant for crio imagefs What this PR does / why we need it: code hygiene to use a constant from cAdvisor Release note: ```release-note NONE ```	2017-09-12 11:10:10 -07:00
Clayton Coleman	a5ac80cbce	Log at higher verbosity levels some common SyncPod errors	2017-09-12 10:52:31 -04:00
Kubernetes Submit Queue	d8847a8f1d	Merge pull request #52119 from mtaufen/sync-files Automatic merge from submit-queue fsync config checkpoint files after writing @yujuhong brought up that it's possible for a hard reboot to result in empty checkpoint files, if they haven't been synced to disk yet. This PR ensures that Kubelet configuration checkpoints are synced after writing to avoid this issue. fixes #52222 Release note: ```release-note NONE ```	2017-09-12 05:41:25 -07:00
Kubernetes Submit Queue	01154dd3cf	Merge pull request #51870 from feiskyer/sandbox-creds Automatic merge from submit-queue (batch tested with PRs 52264, 51870) Use credentials from providers for docker sandbox image What this PR does / why we need it: Sandbox image lookup uses creds from docker config only; other credential providers are ignored. This is a regression introduced in dockershim. Which issue this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged): fixes #51293 Special notes for your reviewer: Should also cherry-pick this to release-1.6 and release-1.7. Release note: ```release-note Fix credentials providers for docker sandbox image. ```	2017-09-12 02:10:24 -07:00
yanxuean	799d0e5a6e	correct to handler	2017-09-12 13:47:08 +08:00
Derek Carr	cf2c688385	Use cAdvisor constant for crio imagefs	2017-09-11 14:08:00 -04:00
Derek Carr	da01c6d3a2	Ignore pods for quota that exceed deletion grace period	2017-09-11 13:31:52 -04:00
Yu-Ju Hong	aaf26b2eaa	dockershim: remove support for legacy containers The code was first introduced in 1.6 to help pre-CRI-kubelet upgrade to using the CRI implementation. They can safely be removed now.	2017-09-11 08:44:27 -07:00
xiangpengzhao	0484a1c2c5	Remove backward compatibility of hostportChainName	2017-09-10 00:24:00 +08:00
Kubernetes Submit Queue	d6df4a5127	Merge pull request #52063 from mtaufen/dkcfg-e2enode Automatic merge from submit-queue (batch tested with PRs 52047, 52063, 51528) Improve dynamic kubelet config e2e node test and fix bugs Rather than just changing the config once to see if dynamic kubelet config at-least-sort-of-works, this extends the test to check that the Kubelet reports the expected Node condition and the expected configuration values after several possible state transitions. Additionally, this adds a stress test that changes the configuration 100 times. It is possible for resource leaks across Kubelet restarts to eventually prevent the Kubelet from restarting. For example, this test revealed that cAdvisor's leaking journalctl processes (see: https://github.com/google/cadvisor/issues/1725) could break dynamic kubelet config. This test will help reveal these problems earlier. This commit also makes better use of const strings and fixes a few bugs that the new testing turned up. Related issue: #50217 I had been sitting on this until the cAdvisor fix merged in #51751, as these tests fail without that fix. Release note: ```release-note NONE ```	2017-09-08 16:06:56 -07:00
Pengfei Ni	4d5d97438b	Use credentials from providers for docker sandbox image	2017-09-09 07:02:04 +08:00
Kubernetes Submit Queue	943817f57b	Merge pull request #52047 from balajismaniam/cpuman-large-topo-test Automatic merge from submit-queue Added large topology tests for static policy in CPU Manager. What this PR does / why we need it: This PR adds a very large topology test case for the CPU Manager feature. Related to #51180. CC @ConnorDoyle	2017-09-08 15:57:41 -07:00
Kevin	f50761c9d4	fix prober ticking shift for kubelet restarted cases	2017-09-08 17:31:02 +08:00
Yu-Ju Hong	a850614613	dockershim: check if f.Sync() returns an error and surface it	2017-09-07 16:05:02 -07:00
Michael Taufen	a846ba191c	Improve dynamic kubelet config e2e node test and fix bugs Rather than just changing the config once to see if dynamic kubelet config at-least-sort-of-works, this extends the test to check that the Kubelet reports the expected Node condition and the expected configuration values after several possible state transitions. Additionally, this adds a stress test that changes the configuration 100 times. It is possible for resource leaks across Kubelet restarts to eventually prevent the Kubelet from restarting. For example, this test revealed that cAdvisor's leaking journalctl processes (see: https://github.com/google/cadvisor/issues/1725) could break dynamic kubelet config. This test will help reveal these problems earlier. This commit also makes better use of const strings and fixes a few bugs that the new testing turned up. Related issue: #50217	2017-09-07 15:50:17 -07:00
Michael Taufen	47beb80368	fsync config checkpoint files after writing	2017-09-07 14:42:18 -07:00
Kubernetes Submit Queue	ae6b329368	Merge pull request #51644 from sjenning/init-container-status-fix Automatic merge from submit-queue (batch tested with PRs 51239, 51644, 52076) do not update init containers status if terminated fixes #29972 #41580 This fixes an issue where, if a completed init container is removed while the pod or subsequent init containers are still running, the status for that init container will be reset to `Waiting` with `PodInitializing`. This can manifest in a number of ways. If the init container is removed why the main pod containers are running, the status will be reset with no functional problem but the status will be reported incorrectly in `kubectl get pod` for example If the init container is removed why a subsequent init container is running, the init container will be re-executed leading to all manner of badness. @derekwaynecarr @bparees	2017-09-07 14:31:23 -07:00
Derek Carr	27365eb900	Fix cross-build	2017-09-07 09:53:52 -04:00
Kubernetes Submit Queue	a51eb2ac4e	Merge pull request #49202 from cbonte/node-addresses Automatic merge from submit-queue (batch tested with PRs 51728, 49202) Fix setNodeAddress when a node IP and a cloud provider are set What this PR does / why we need it: When a node IP is set and a cloud provider returns the same address with several types, only the first address was accepted. With the changes made in PR #45201, the vSphere cloud provider returned the ExternalIP first, which led to a node without any InternalIP. The behaviour is modified to return all the address types for the specified node IP. Which issue this PR fixes: fixes #48760 Special notes for your reviewer: * I'm not a golang expert, is it possible to mock `kubelet.validateNodeIP()` to avoid the need of real host interface addresses in the test ? * It would be great to have it backported for a next 1.6.8 release. Release note: ```release-note NONE ```	2017-09-06 20:01:00 -07:00
Kubernetes Submit Queue	b6545a086c	Merge pull request #51728 from derekwaynecarr/cadvisor-stats Automatic merge from submit-queue (batch tested with PRs 51728, 49202) Enable CRI-O stats from cAdvisor What this PR does / why we need it: cAdvisor may support multiple container runtimes (docker, rkt, cri-o, systemd, etc.) As long as the kubelet continues to run cAdvisor, runtimes with native cAdvisor support may not want to run multiple monitoring agents to avoid performance regression in production. Pending kubelet running a more light-weight monitoring solution, this PR allows remote runtimes to have their stats pulled from cAdvisor when cAdvisor is registered stats provider by introspection of the runtime endpoint. See issue https://github.com/kubernetes/kubernetes/issues/51798 Special notes for your reviewer: cAdvisor will be bumped to pick up https://github.com/google/cadvisor/pull/1741 At that time, CRI-O will support fetching stats from cAdvisor. Release note: ```release-note NONE ```	2017-09-06 20:00:57 -07:00
Joel Smith	58ae5a78f9	Clean up kublet secret and configmap unit test * Expected value comes before actual value in assert.Equal() * Use assert.Equal() instead of assert.True() when possible * Add a unit test that verifies no-op pod updates to the secret_manager and the configmap_manager * Add a clarifying comment about why it's good to seemingly delete a secret on updates. * Fix (for now, non-buggy) variable shadowing issue	2017-09-06 16:38:01 -06:00
Balaji Subramaniam	e2cb80db4a	Added large topology tests for static policy in CPU Manager. - Added comments for tests cases.	2017-09-06 13:15:22 -07:00
David Ashpole	d60d4a4420	soft eviction timer works	2017-09-06 13:01:49 -07:00
Yang Guo	dfea03d920	Implement StatsProvider using CRI stats	2017-09-06 09:11:56 -07:00
Kubernetes Submit Queue	dcc1aa0628	Merge pull request #51928 from mindprince/pr-45724-fix-build Automatic merge from submit-queue Make fakeMountInterface in container_manager_unsupported_test.go implement mount.Interface again. This was broken in #45724 Release note*: ```release-note NONE ``` /sig storage /sig node /cc @jsafrane, @vishh	2017-09-05 19:44:54 -07:00
Kubernetes Submit Queue	e8d99f5839	Merge pull request #51645 from jingxu97/Aug/nameserver Automatic merge from submit-queue (batch tested with PRs 51186, 50350, 51751, 51645, 51837) Set up DNS server in containerized mounter path During NFS/GlusterFS mount, it requires to have DNS server to be able to resolve service name. This PR gets the DNS server ip from kubelet and add it to the containerized mounter path. So if containerized mounter is used, service name could be resolved during mount Release note: ```release-note Allow DNS resolution of service name for COS using containerized mounter. It fixed the issue with DNS resolution of NFS and Gluster services. ```	2017-09-05 17:30:09 -07:00
Kubernetes Submit Queue	99aa992ce8	Merge pull request #51751 from dashpole/update_cadvisor_godep Automatic merge from submit-queue (batch tested with PRs 51186, 50350, 51751, 51645, 51837) Update Cadvisor Dependency Fixes: https://github.com/kubernetes/kubernetes/issues/51832 This is the worst dependency update ever... The root of the problem is the [name change of Sirupsen -> sirupsen](https://github.com/sirupsen/logrus/issues/570#issuecomment-313933276). This means that in order to update cadvisor, which venders the lowercase, we need to update all dependencies to use the lower-cased version. With that being said, this PR updates the following packages: `github.com/docker/docker` - `github.com/docker/distribution` - `github.com/opencontainers/go-digest` - `github.com/opencontainers/image-spec` - `github.com/opencontainers/runtime-spec` - `github.com/opencontainers/selinux` - `github.com/opencontainers/runc` - `github.com/mrunalp/fileutils` - `golang.org/x/crypto` - `golang.org/x/sys` - `github.com/docker/go-connections` - `github.com/docker/go-units` - `github.com/docker/libnetwork` - `github.com/docker/libtrust` - `github.com/sirupsen/logrus` - `github.com/vishvananda/netlink` `github.com/google/cadvisor` - `github.com/euank/go-kmsg-parser` `github.com/json-iterator/go` Fixed https://github.com/kubernetes/kubernetes/issues/51832 ```release-note Fix journalctl leak on kubelet restart Fix container memory rss Add hugepages monitoring support Fix incorrect CPU usage metrics with 4.7 kernel Add tmpfs monitoring support ```	2017-09-05 17:30:06 -07:00
Kubernetes Submit Queue	78c820803c	Merge pull request #50350 from dashpole/eviction_container_deletion Automatic merge from submit-queue (batch tested with PRs 51186, 50350, 51751, 51645, 51837) Wait for container cleanup before deletion We should wait to delete pod API objects until the pod's containers have been cleaned up. See issue: #50268 for background. This changes the kubelet container gc, which deletes containers belonging to pods considered "deleted". It adds two conditions under which a pod is considered "deleted", allowing containers to be deleted: Pods where deletionTimestamp is set, and containers are not running Pods that are evicted This PR also changes the function PodResourcesAreReclaimed by making it return false if containers still exist. The eviction manager will wait for containers of previous evicted pod to be deleted before evicting another pod. The status manager will wait for containers to be deleted before removing the pod API object. /assign @vishh	2017-09-05 17:30:03 -07:00
Rohit Agarwal	18d25bf4ba	Add an OWNERS file for deviceplugin package. Update OWNERS file for gpu package.	2017-09-05 13:46:13 -07:00
Kubernetes Submit Queue	8b9e8cf80a	Merge pull request #51744 from jiayingz/deviceplugin-checkpoint Automatic merge from submit-queue (batch tested with PRs 50072, 51744) Deviceplugin checkpoint What this PR does / why we need it: Extends on top of PR 51209 to checkpoint device to pod allocation information on Kubelet to recover from Kubelet restarts. Which issue this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged): fixes # Special notes for your reviewer: Release note: ```release-note ```	2017-09-05 13:33:01 -07:00
David Ashpole	e5a6a79fd7	update cadvisor, docker, and runc godeps	2017-09-05 12:38:57 -07:00
Jing Xu	3d4bc931d3	Set up DNS server in containerized mounter path During NFS/GlusterFS mount, it requires to have DNS server to be able to resolve service name. This PR gets the DNS server ip from kubelet and add it to the containerized mounter path. So if containerized mounter is used, service name could be resolved during mount	2017-09-05 11:40:23 -07:00
Jiaying Zhang	3b2bc58c11	Extends device_plugin_handler to checkpoint device to container allocation information.	2017-09-05 09:52:14 -07:00
Derek Carr	38d5dee677	Node validation restricts pre-allocated hugepages to single page size	2017-09-05 10:34:30 -04:00
Derek Carr	1ec2a69d9a	Kubelet changes to support hugepages	2017-09-05 09:46:08 -04:00
Rohit Agarwal	08ea02b9a5	Make *fakeMountInterface in container_manager_unsupported_test.go implement mount.Interface again. This was broken in #45724	2017-09-04 21:48:55 -07:00
saadali	3b834cf665	Modify VolumeZonePredicate to handle multi-zone PV Modifies the VolumeZonePredicate to handle a PV that belongs to more then one zone or region. This is indicated by the zone or region label value containing a comma separated list.	2017-09-04 20:13:32 -07:00
David Ashpole	9ac30e2c28	wait for container cleanup before deletion	2017-09-04 17:38:09 -07:00
Balaji Subramaniam	5b5958ecec	Add tests for the static cpumanager policy.	2017-09-04 07:24:59 -07:00
Connor Doyle	d0bcbbb437	Added static cpumanager policy.	2017-09-04 07:24:59 -07:00
Connor Doyle	e03a6435bb	Added cpu assignment helpers.	2017-09-04 07:24:59 -07:00
Szymon Scharmach	242439c9d7	Add topology helper and tests to cpumanager.	2017-09-04 07:24:59 -07:00
Connor Doyle	e4d5565228	Fix Start signature in container_manager_windows.	2017-09-04 07:24:59 -07:00

... 10 11 12 13 14 ...

5838 Commits