Commit Graph

93681 Commits

Author SHA1 Message Date
Caleb Woodbine
d26cb7150a Remove created check 2020-07-22 13:55:29 +12:00
RyderXia
d76c2cc94c update build 2020-07-22 09:36:55 +08:00
Han Kang
2477043dcb add instrumentation approvers as per SIG governance
Change-Id: Ib7e8ef3d5932c4f6fb93397702b25160c851e3ad
2020-07-21 16:33:02 -07:00
Caleb Woodbine
84de375921 Fix error message 2020-07-22 11:17:37 +12:00
Ed Bartosh
667461ace8 Makefile: update test-e2e-node help 2020-07-22 01:51:06 +03:00
Caleb Woodbine
d60a57ad73 Update watches to use watchtools.Until, namespace var name 2020-07-22 10:33:55 +12:00
Caleb Woodbine
bd34e1c445 Fix label patch and check 2020-07-22 10:07:18 +12:00
Kubernetes Prow Robot
98814409ba
Merge pull request #93224 from knabben/make-node-e2e
Documenting TEST_ARGS on Node E2E helper
2020-07-21 15:00:37 -07:00
Kubernetes Prow Robot
a9f0fcc68c
Merge pull request #93148 from claudiubelu/windows/fixes-subpath-symlink
Windows: Fixes subpath symlink evaluation
2020-07-21 15:00:20 -07:00
Kubernetes Prow Robot
5df1e53dac
Merge pull request #93107 from DataDog/azure-per-vmss-vmssvm-incremental-cache
Azure: use per-vmss vmssvm incremental cache
2020-07-21 15:00:06 -07:00
Caleb Woodbine
11492f7b91 Add check for Modified after patch; Update checks for scale 2020-07-22 09:57:11 +12:00
Caleb Woodbine
fc8e24c686 Add watchtools.Until for fetching watch events; Remove plain watch 2020-07-22 09:29:09 +12:00
Caleb Woodbine
9c2330db4c Revert ginkgoRecover 2020-07-22 08:48:28 +12:00
Jordan Liggitt
d195fc2ec8 Ensure runtimeCache contains all observed started containers on pod delete 2020-07-21 15:54:29 -04:00
Ed Bartosh
6a40cc90e9 Revert "e2epod: use foreground deletion"
This reverts commit af91e76d "e2epod: use foreground deletion"
and fixes node-kubelet-conformance tests.
2020-07-21 22:40:31 +03:00
Chelsey Chen
5b1bdd8e38 Add comformance test for event Update 2020-07-21 14:53:26 -04:00
Anago GCB
de18bd6c89 CHANGELOG: Update directory for v1.19.0-rc.2 release 2020-07-21 17:59:27 +00:00
Kubernetes Prow Robot
a1a2f8c5f8
Merge pull request #93225 from hasheddan/storage-panic
Return error instead of panic if container index outside bounds
2020-07-21 09:15:15 -07:00
Jared Burns
402eea882d Update fluent-es-image components for better ES 7.x compatibility 2020-07-21 08:27:04 -07:00
Di Xu
58dc201c9a remove serializers from codec factory 2020-07-21 20:35:19 +08:00
Kubernetes Prow Robot
b6174e605f
Merge pull request #93189 from klueska/upstream-fix-bug-topology-manager
Fix a bug whereby reusable CPUs and devices were not being honored
2020-07-21 04:35:17 -07:00
RyderXia
136df8ce53 update 2020-07-21 17:00:49 +08:00
RyderXia
2214117cd1 clean up unused var containerCache 2020-07-21 16:57:36 +08:00
Kubernetes Prow Robot
adabcc31f0
Merge pull request #93212 from liggitt/gcepd-delete
Deflake GCEPD namespace deletion test
2020-07-21 01:55:28 -07:00
Kubernetes Prow Robot
37cb765a85
Merge pull request #93211 from liggitt/crdlist
Make CRDList lifecycle consistent with CRD
2020-07-21 01:55:14 -07:00
Kubernetes Prow Robot
1fdd8fb213
Merge pull request #93263 from liggitt/windows
Fix windows kubelet startup
2020-07-20 19:51:57 -07:00
Kubernetes Prow Robot
275eabdf72
Merge pull request #93259 from jpbetz/revert-88936
Revert nested trace PR#88936
2020-07-20 19:51:47 -07:00
Kubernetes Prow Robot
b467072a55
Merge pull request #93256 from ahg-g/ahg-metric
Rename pod_preemption_metrics to preemption_metrics.
2020-07-20 19:51:37 -07:00
Kubernetes Prow Robot
bb079afdef
Merge pull request #93253 from liggitt/utils-trace
Update k8s.io/utils
2020-07-20 19:51:28 -07:00
Kubernetes Prow Robot
c09ecf13a5
Merge pull request #93248 from giuseppe/cgroup-set-max-shares
kubelet: clamp cpu.shares to max allowed
2020-07-20 19:51:14 -07:00
José Guilherme Vanz
5ebb1c3be2
cleanup: print warning message after timeout
If a pod has a configmap/secret volume an annoying message shows up
in the log approximately every 70 seconds. This happens because the
desiredStateOfWorldPopulator sync loop always call the
MarkRemountRequired. The function finds the volume plugin and check
if the plugin requires mount. Configmap and secret plugins always
returns true for that. Thus, the reconciler code of the volume manager
remounts the volume every time. This commit change the SetVolumeOwnership
to print the warning only if the function does not finish within 30
seconds.

Signed-off-by: José Guilherme Vanz <jguilhermevanz@suse.com>
2020-07-20 22:46:11 -03:00
José Guilherme Vanz
08ccbe6f0e
Revert "cleanup: decrease log level from warn to v3"
This reverts commit ebece49936e635f151fdd8a64fa2b77fd183e817.
2020-07-20 22:46:11 -03:00
José Guilherme Vanz
9b9dcf3d84
cleanup: decrease log level from warn to v3
If a pod has a configmap/secret volume an annoying message shows up
in the log approximately every 70 seconds. This happens because the
desiredStateOfWorldPopulator sync loop always call the
MarkRemountRequired. The function finds the volume plugin and check
if the plugin requires mount. Configmap and secret plugins always
returns true for that. Thus, the reconciler code of the volume manager
remounts the volume every time. This commit decrease the log level of
that message in the mount function from warning to V4.

Signed-off-by: José Guilherme Vanz <jguilhermevanz@suse.com>
2020-07-20 22:46:11 -03:00
Stephen Heywood
86ba88d52f Promote: Discovery PreferredVersion test 2020-07-21 00:30:25 +00:00
Kubernetes Prow Robot
5a529aa3a0
Merge pull request #91399 from danwinship/endpoint-ipfamily
multiple IPv6/dual-stack endpoint fixes
2020-07-20 13:31:14 -07:00
wawa0210
aea228f5dd fix no-new-privileges on windows 2020-07-20 16:14:52 -04:00
Jordan Liggitt
886727a4c0 Revert "Add deviceManager in windows container manager"
This reverts commit 056d73b1a1.
2020-07-20 16:13:53 -04:00
Joe Betz
02cf58102a Revert nested trace PR#88936 2020-07-20 09:55:05 -07:00
Benjamin Pineau
fcb3f1f64c Tests fixes for Azure per-VMSS VMs caches
Signed-off-by: Benjamin Pineau <benjamin.pineau@datadoghq.com>
2020-07-20 18:35:23 +02:00
Benjamin Pineau
85ecd0e17c Azure: per VMSS, incremental VMSS VMs cache
Azure's cloud provider VMSS VMs API accesses are mediated through
a cache holding and refreshing all VMSS together.

Due to that we hit VMSSVM.List API more often than we could: an
instance's cache miss or expiration should only require a single
VMSS re-list, while it's currently O(n) relative to the number of
attached Scale Sets.

Under hard pressure (clusters with many attached VMSS that can't all
be listed in one sequence of successive API calls) the controller
manager might be stuck trying to re-list everything from scratch,
then aborting the whole operation; then re-trying and re-triggering
API rate-limits, affecting the whole Subscription.

This patch replaces the global VMSS VMs cache by per-VMSS VMs caches.
Refreshes (VMSS VMs lists) are scoped to the single relevant VMSS; under
severe throttling the various caches can be incrementally refreshed.

Signed-off-by: Benjamin Pineau <benjamin.pineau@datadoghq.com>
2020-07-20 18:35:23 +02:00
Kubernetes Prow Robot
5feab0aa1e
Merge pull request #93207 from hasheddan/nvidia-gpu-installer
Use local daemonset manifest for installing Nvidia drivers
2020-07-20 09:02:51 -07:00
Abdullah Gharaibeh
6f9794d5e9 Rename pod_preemption_metrics to preemption_metrics. Since this metric's type was changed from Gauge to Histogram, renaming it should make it easier to providers to migrate 2020-07-20 11:44:10 -04:00
Giuseppe Scrivano
ef935bd991
kubelet: clamp cpu shares to max allowed
clamp the max cpu.shares to the maximum value allowed by the kernel.

It is not an issue when using cgroupfs, as the kernel will
anyway make sure the value is not out of range and automatically clamp
it, systemd has an additional check that prevents the cgroup creation.

Closes: https://github.com/kubernetes/kubernetes/issues/92855

Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>
2020-07-20 17:18:03 +02:00
Jordan Liggitt
7aacbeac14 Update k8s.io/utils 2020-07-20 11:12:29 -04:00
Jordan Liggitt
ef7ef215d6 Handle int -> float conversion in FromUnstructured 2020-07-20 10:41:20 -04:00
Kubernetes Prow Robot
c237804533
Merge pull request #92755 from chelseychen/event-e2e-conformance
Promote Event CRUD tests to conformance
2020-07-20 05:50:51 -07:00
Kevin Klues
00df26a985 Fix a bug whereby reusable CPUs and devices were not being honored
Previously, it was possible for reusable CPUs and reusable devices (i.e.
those previously consumed by init containers) to not be reused by
subsequent init containers or app containers if the TopologyManager was
enabled. This would happen because hint generation for the
TopologyManager was not considering the reusable devices when it made
its hint calculation.

As such, it would sometimes:
1) Generate a hint for a differnent NUMA node, causing the CPUs and
devices to be allocated from that node instead of the one where the
reusable devices live; or
2) End up thinking there were not enough CPUs or devices to allocate and
throw a TopologyAffinity admission error

This patch fixes this by ensuring that reusable CPUs and devices are
considered as part of TopologyHint generation. This frunctionality is
difficult to unit test since it spans multiple components, but an e2e
test will be added in a subsequent patch to test this functionality.
2020-07-20 11:41:13 +00:00
Kevin Klues
74fe9364c3 Simplify logic in devicemanager TopologyHint generation 2020-07-20 11:41:13 +00:00
Kevin Klues
9f5f401d60 Add AnySet() to topologymanager bitmask API 2020-07-20 11:41:13 +00:00
zhangsheng02
14d2440b87 add GetAddressAndDialer unit test
Signed-off-by: zhangsheng02 <sheng.zhang@daocloud.io>
2020-07-20 18:16:58 +08:00