Commit Graph

103444 Commits

Author SHA1 Message Date
Daniel Kłobuszewski
9808dd9a03 Wait 15m after instead of before breaking nodes
15m is enough for Cluster Autoscaler to remove empty nodes, so we need
to break them sooner than that. Instead, wait 15m after breaking them to
ensure Cluster Autoscaler will consider them as unready instead of still
starting.
2021-08-31 11:37:25 +02:00
Sascha Grunert
484b027536
Remove unused --allow-gathering-profiles e2e.test flag
The profile gatherer has been removed in
https://github.com/kubernetes/kubernetes/pull/85304, so those options
are unused since then and can therefore be removed.

Signed-off-by: Sascha Grunert <sgrunert@redhat.com>
2021-08-31 11:33:33 +02:00
Antonio Ojea
2a5ad65a9a e2e test apiserver endpoint and endpointslices
The e2e test "should have Endpoints and EndpointSlices pointing to
the API Server Service" was veryfing the current endpoints
reconciler implementation on the apiservers, however, users may
disable the endpoint reconciler and create their own.

This e2e test is also a conformance test, so we should test the
behaviour and not the implementation details. The test verifies
that a kubernetes.default service exist, an endpoint and endpoint
slices object referencing that service exist and are equivalent.
2021-08-31 11:29:06 +02:00
Kubernetes Prow Robot
f610eee161
Merge pull request #104211 from MikeSpreitzer/stackmore
Increase debug logging in waitGroupCounter::Add
2021-08-31 01:45:37 -07:00
Sascha Grunert
46077e6be7
Remove deprecated --seccomp-profile-root/seccompProfileRoot configuration
The configuration is deprecated and targets removal for v1.23. Tests
cases have been changed as well.

Signed-off-by: Sascha Grunert <sgrunert@redhat.com>
2021-08-31 09:55:28 +02:00
Kubernetes Prow Robot
4615578137
Merge pull request #104633 from martinkennelly/update_sitp_feature_state
Update ServiceInternalTrafficPolicy feature state
2021-08-31 00:09:36 -07:00
scott
8976f6f6d9 explain the reason why metaclient special processing metav1.DeleteOptions encoding
metaclient explicitly specifies the Content-Type when executing Delete and DeleteCollection, and add test for that
2021-08-31 10:55:53 +08:00
zc
b33897f36d modify non-uniform aliases 2021-08-31 09:07:51 +08:00
Kubernetes Prow Robot
bb9e89d430
Merge pull request #104601 from thockin/patchAllocatedValues_port_reuse
Fix a small regression in Service updates
2021-08-30 15:31:13 -07:00
Kubernetes Prow Robot
5aea99b0a2
Merge pull request #104634 from Jiawei0227/snapshotorder
storege e2etest: Delete restored PVC/Pod in snapshottable
2021-08-30 14:25:13 -07:00
Kubernetes Prow Robot
bbbeceb6aa
Merge pull request #104577 from smarterclayton/smaller_filter_master
kubelet: Admission must exclude completed pods and avoid races
2021-08-30 13:17:13 -07:00
Tim Hockin
73503a4936 Fix a small regression in Service updates
Prior to 1.22 a user could change NodePort values within a service
during an update, and the apiserver would allocate values for any that
were not specified.

Consider a YAML like:

```
apiVersion: v1
kind: Service
metadata:
  name: foo
spec:
  type: NodePort
  ports:
  - name: p
    port: 80
  - name: q
    port: 81
  selector:
    app: foo
```

When this is created, nodeport values will be allocated for each port.
Something like:

```
apiVersion: v1
kind: Service
metadata:
  name: foo
spec:
  clusterIP: 10.0.149.11
  type: NodePort
  ports:
  - name: p
    nodePort: 30872
    port: 80
    protocol: TCP
    targetPort: 9376
  - name: q
    nodePort: 31310
    port: 81
    protocol: TCP
    targetPort: 81
  selector:
    app: foo
```

If the user PUTs (kubectl replace) the original YAML, we would see that
`.nodePort = 0`, and allocate new ports.  This was ugly at best.

In 1.22 we fixed this to not allocate new values if we still had the old
values, but instead re-assign them.  Net new ports would still be seen
as `.nodePort = 0` and so new allocations would be made.

This broke a corner case as follows:

Prior to 1.22, the user could PUT this YAML:

```
apiVersion: v1
kind: Service
metadata:
  name: foo
spec:
  type: NodePort
  ports:
  - name: p
    nodePort: 31310 # note this is the `q` value
    port: 80
  - name: q
    # note this nodePort is not specified
    port: 81
  selector:
    app: foo
```

The `p` port would take the `q` port's value.  The `q` port would be
seen as `.nodePort = 0` and a new value allocated.  In 1.22 this results
in an error (duplicate value in `p` and `q`).

This is VERY minor but it is an API regression, which we try to avoid,
and the fix is not too horrible.

This commit adds more robust testing of this logic.
2021-08-30 12:42:17 -07:00
Kubernetes Prow Robot
ff617edd32
Merge pull request #104455 from claudiubelu/test-images/windows-server-2022-part-2
test images: Adds Windows Server 2022 to the BASEIMAGEs (part 2)
2021-08-30 12:07:13 -07:00
Kubernetes Prow Robot
d5cf549376
Merge pull request #104160 from pacoxu/AllowLongNodeID-true
set AllowLongNodeID to true by default since 1.23
2021-08-30 08:17:05 -07:00
Aldo Culquicondor
0801ada81f Add alculquicondor to sig-apps-reviewers 2021-08-30 11:08:03 -04:00
Kubernetes Prow Robot
7282c2002e
Merge pull request #99273 from yangjunmyfm192085/run-test20
Structured Logging migration:modify Scheduler part logs.
2021-08-30 05:56:54 -07:00
Patrick Ohly
89cb4d0ee9 scheduler: better reason for delay with generic ephemeral volumes
These events are currently emitted for a pod using a generic ephemeral volume:

  Type     Reason            Age   From               Message
  ----     ------            ----  ----               -------
  Warning  FailedScheduling  3s    default-scheduler  0/1 nodes are available: 1 persistentvolumeclaim "my-csi-app-inline-volume-my-csi-volume" not found.
  Warning  FailedScheduling  2s    default-scheduler  0/1 nodes are available: 1 pod has unbound immediate PersistentVolumeClaims.

The one about "persistentvolumeclaim not found" is potentially confusing. It
occurs because the scheduler typically checks the pod before the ephemeral
volume controller had a chance to create the PVC.

This is a bit easier to understand:

  Type     Reason            Age   From               Message
  ----     ------            ----  ----               -------
  Warning  FailedScheduling  4s    default-scheduler  0/1 nodes are available: 1 waiting for ephemeral volume controller to create the persistentvolumeclaim "my-csi-app-inline-volume-my-csi-volume".
  Warning  FailedScheduling  2s    default-scheduler  0/1 nodes are available: 1 pod has unbound immediate PersistentVolumeClaims.
2021-08-30 10:06:59 +02:00
Claudiu Belu
33aa55da79 tests: Bumps image versions (Adds Windows Server 2022)
The Container Images for Windows Server 2022 have been published, and we can
start adding jobs for them.

The ltsc2022-based images have been built and promoted with these image versions.
2021-08-29 18:57:35 -07:00
JunYang
93047824f7 Structured Logging migration:modify Scheduler part logs.
Signed-off-by: JunYang <yang.jun22@zte.com.cn>
2021-08-29 20:17:08 +08:00
Kubernetes Prow Robot
edb0a72cff
Merge pull request #104630 from tkashem/remove-option
apiserver: remove server option startup-send-retry-after-until-ready
2021-08-28 05:32:52 -07:00
Kubernetes Prow Robot
80feff6f40
Merge pull request #104408 from martinkennelly/fix_fqdn_hostname_mismatch
Fix comparison between FQDN and hostname
2021-08-27 19:58:52 -07:00
Kubernetes Prow Robot
909c3db3ab
Merge pull request #104559 from Huang-Wei/cleanup-json-tag
sched: remove unneeded json tags in unversioned API objs
2021-08-27 18:38:52 -07:00
Kubernetes Prow Robot
cd63952f13
Merge pull request #95885 from jiahuif/refactor/controller-manager
refactor: controller manager: InitFunc and base controller interface.
2021-08-27 15:40:52 -07:00
Jiawei Wang
273ac5a594 storege e2etest: Delete restored PVC/Pod in snapshottable 2021-08-27 13:53:25 -07:00
Abu Kashem
7adc79a4ea
apiserver: remove server option startup-send-retry-after-until-ready 2021-08-27 13:01:56 -04:00
Kubernetes Prow Robot
fca3175df7
Merge pull request #104231 from astraw99/fix_unified_workers
Unify controller worker num param `threadiness` to `workers`
2021-08-27 09:34:05 -07:00
Martin Kennelly
da78eb5d8a Update ServiceInternalTrafficPolicy feature state
Feature state is beta in v1.22.0.

Signed-off-by: Martin Kennelly <mkennell@redhat.com>
2021-08-27 16:55:47 +01:00
Dave Chen
63b4710f38 Don't expose struct from prometheus client library 2021-08-27 22:21:24 +08:00
Kubernetes Prow Robot
5871321310
Merge pull request #104614 from prameshj/disable-pod-rec
Skip testing for Pod DNS records.
2021-08-27 05:32:04 -07:00
Kubernetes Prow Robot
cb520ca573
Merge pull request #104610 from aojea/i_hate_this_shell_things
skip hack/tools/vendor folder
2021-08-27 01:52:04 -07:00
Kubernetes Prow Robot
d043e9cbcc
Merge pull request #104003 from sahilvv/protobuf
update protobuf github url
2021-08-26 21:22:59 -07:00
Kubernetes Prow Robot
0022e00669
Merge pull request #104617 from justaugustus/sig-release
OWNERS(releng): Set reviewers to `release-managers`
2021-08-26 17:08:59 -07:00
jaehnri
bf1b75f165 Fix typo in kubectl describe pods example
Signed-off-by: jaehnri <joao.henri.cr@gmail.com>
2021-08-26 21:00:18 -03:00
Kubernetes Prow Robot
c88a111a9c
Merge pull request #104609 from jsturtevant/hostprocess-init-containers
Add init hostprocess container e2e test
2021-08-26 15:36:20 -07:00
Stephen Augustus
47b1159a0e
OWNERS(releng): Set reviewers to release-managers
Signed-off-by: Stephen Augustus <foo@auggie.dev>
2021-08-26 17:50:55 -04:00
Pavithra Ramesh
fa3f4a69e5 Skip testing for Pod DNS records.
DNS records of the form *.pod.cluster.local are deprecated - https://github.com/kubernetes/dns/pull/335
Testing for their presence is not required.
2021-08-26 14:33:12 -07:00
Jiawei Wang
8de0f11946 Add GA AnnStorageProvisioner annotation to PVC
This PR adds GA AnnStorageProvisioner annotation to
a PVC if the PVC requires dynamic provisioning. This
also deprecates the beta AnnStorageProvisioner annotation
and it will be removed in a later release.
2021-08-26 12:46:47 -07:00
Kubernetes Prow Robot
d0f6983d6c
Merge pull request #104503 from sanposhiho/scheduler/fix/panic-is-not-execute-defer-funcs
cmd/kube-scheduler: return error instead of os.Exit when something goes wrong
2021-08-26 11:26:21 -07:00
Antonio Ojea
2fee0c45e6 skip hack/tools/vendor folder
The makefiles scripts create a variable with all the go files
that are part of the Kubernetes source tree, including staging.

As today, this variable has a size of < 100kb

wc .make/all_go_dirs.mk
2326  2326 98905 .make/all_go_dirs.mk

This variable is passed as argument in the Makefiles, where it
is expanded. In Linux, there is a limit to the max size of
the arguments MAX_ARG_STRLEN.

If the arguments go above 128k, you get a nice:

execvp: /usr/bin/env: Argument list too long

If you, for whatever reason, do some go mod vendor inside the
hack/tools folder, these files will be added to the variable
and most probably you'll go above the limit and get that error.

Then, you'll learn a lot about Makefils, shell expansion, strace,
execpve, ARG_MAX and MAX_ARG_STRLEN,until you realize what is
the real problem :).
2021-08-26 20:02:43 +02:00
Kubernetes Prow Robot
1619705be7
Merge pull request #104586 from justaugustus/sig-release
OWNERS: SIG Release cleanups
2021-08-26 10:19:23 -07:00
Kubernetes Prow Robot
718f1b60fa
Merge pull request #104585 from la3mmchen/fix/typo
kubeadm: add missing word in output error
2021-08-26 08:51:23 -07:00
James Sturtevant
445f9f2aba Add init hostprocess container test 2021-08-26 08:23:54 -07:00
Sascha Grunert
b8400cbc36
Reduce the number of parallel typechecks to 2
The PR https://github.com/kubernetes/kubernetes/pull/104575 introduces
some intermediate types which makes the 32GiB memory machine kill the
typecheck process. To resolve that issue and make the test more robust,
we now reduce the amount of parallel typechecks to run to `2`.

Signed-off-by: Sascha Grunert <sgrunert@redhat.com>
2021-08-26 16:53:13 +02:00
Kubernetes Prow Robot
9e0d3fdd6c
Merge pull request #104582 from iamemilio/volume_negative_test
Remove Error Message Check Dynamic PV Tests
2021-08-26 07:41:23 -07:00
Kubernetes Prow Robot
c262d09bb7
Merge pull request #104604 from wojtek-t/fix_secret_manager_2
Don't prematurely close reflectors in case of slow initialization in watch based manager
2021-08-26 06:11:23 -07:00
atiratree
b79859cb12 add a test for jsonpath template parsing to prevent regressions
This behaviour was broken by commit
39cfe23232 and PR kubernetes#98057
2021-08-26 14:49:58 +02:00
atiratree
5c3de9f1de revert "fix wrong output when using jsonpath"
This partially reverts commit 39cfe232325d66bcdbc935af7aaf7022562e7010and PR #98057

the original problem was caused by not using {end} at the end of the range
2021-08-26 14:49:45 +02:00
Danielle Lancashire
3884dcb909 e2e_node: run gpu pod long enough to become ready 2021-08-26 14:24:23 +02:00
Danielle Lancashire
7d7884c0e6 e2e_node: install gpu pod with PodClient
Prior to this change, the pod was not getting scheduled on the node as
we don't have a running scheduler in e2e_node. PodClient solves this
problem by manually assigning the pod to the node.
2021-08-26 14:22:22 +02:00
Danielle Lancashire
0cc8af82a1 e2e_node: use upstream gpu installer
The current GPU installer was built in 2017, from source that no longer
exists in Kubernetes ([adding commit][1]. The image was built on 2017-06-13.

Unfortunately, this installer no longer appears to work. When debugging
on the same node type as used by test-infra, it failed to build the
driver as the kernel sha was no longer available.

This lead to needing to find a new way to install GPUs. The smallest
logical change was switching to [cos-gpu-installer][2]
. There is a newer version of this available on [googlesource][3] that
I have not yet tested as it's not clear what the state of the project
is, as I couldn't find docs outside of the source itself.

We install things to the same location as previously to avoid needing
extra downstream changes. There are a couple of weird issues here
however, like needing to run the container twice to correctly update the
LD Cache.

[1]: 1e77594958/cluster/gce/gci/nvidia-gpus/Dockerfile
[2]: https://github.com/GoogleCloudPlatform/cos-gpu-installer
[3]: https://cos.googlesource.com/cos/tools/+/refs/heads/master/src/cmd/cos_gpu_installer/
2021-08-26 14:09:45 +02:00