Commit Graph

9054 Commits

Author SHA1 Message Date
Kubernetes Submit Queue
542486186f Merge pull request #52732 from shyamjvs/fix-metrics-perf-tests
Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>..

Increase api latency threshold for cluster-scoped list calls

Recent change from @smarterclayton (https://github.com/kubernetes/kubernetes/pull/52237) added scope to apiserver metrics. As a result, our current threshold for list calls is no longer sufficient for all-namespace calls which are now being measured separately from namespaced lists. For e.g (from our [last 5k run](https://k8s-gubernator.appspot.com/build/kubernetes-jenkins/logs/ci-kubernetes-e2e-gce-scale-performance/37)):

```
WARNING Top latency metric: {Resource:pods Subresource: Verb:LIST Scope:cluster Latency:{Perc50:4.498374s Perc90:7.548079s Perc99:8.169389s Perc100:0s} Count:1400}
```

cc @kubernetes/sig-scalability-misc @kubernetes/sig-api-machinery-misc @wojtek-t
2017-09-21 10:49:54 -07:00
Alex Robinson
284a7c6165 Fix broken statefulset e2e test
This was broken back in #43637 when the logic in
`(*StatefulSetTester).CreateStatefulSet` switched from using
`generated.ReadOrDie` to read the entire service.yaml file and pass it
to kubectl to using `manifest.SvcFromManifest`, which assumes that the
file contains only a single service.

Fixes #52750
2017-09-21 13:20:05 -04:00
Kubernetes Submit Queue
9dc2e508e3 Merge pull request #52844 from aleksandra-malinowska/autoscaling-test-fix-5
Automatic merge from submit-queue (batch tested with PRs 52843, 52710, 52821, 52844). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>..

improve retrying logic when checking CA status

This should reduce the flake rate in cluster size autoscaling test suite.
2017-09-21 09:56:55 -07:00
Aleksandra Malinowska
14dfeecd47 improve retrying logic when checking CA status 2017-09-21 13:58:42 +02:00
Aleksandra Malinowska
5a0f1c8747 improve PDBs cleanup 2017-09-21 13:57:47 +02:00
Shyam Jeedigunta
f373645865 Increase api latency threshold for cluster-scoped list calls 2017-09-21 13:33:22 +02:00
xiangpengzhao
a8e7dd8852 Don't specify clusterIP in dns e2e test 2017-09-21 17:25:37 +08:00
Vaibhav Kamra
1d6a701e76 Add e2e test to verify PVC metrics
Adds an e2e test for kubernetes/features#363
2017-09-21 01:10:22 -07:00
Kubernetes Submit Queue
939ae0a931 Merge pull request #52819 from verult/provisioner-e2e-fix
Automatic merge from submit-queue (batch tested with PRs 48406, 52819). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>..

Fixed nil dereference in dynamic provisioning e2e tests

**What this PR does / why we need it**: Fixed nil dereference in dynamic provisioning e2e tests.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #52815

**Release note**:

```release-note-none
NONE
```
/sig storage
/assign @saad-ali 
/cc @wongma7 
/release-note-none
2017-09-20 21:30:35 -07:00
Kubernetes Submit Queue
d84513283b Merge pull request #52807 from gnufied/add-e2e-volume-metrics
Automatic merge from submit-queue (batch tested with PRs 52724, 52807). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>..

Add e2e test for volume metrics

Adding e2e test for volume metrics. These tests need to be run
in serial to verify accuracy

Fixes https://github.com/kubernetes/kubernetes/issues/52458

cc @saad-ali @jdumars 

/sig storage
2017-09-20 19:04:53 -07:00
Kubernetes Submit Queue
9505c01f54 Merge pull request #52724 from shyamjvs/fix-density-test
Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>..

Retry if possible while creating latency pods in density test

Saw the [last run](https://k8s-gubernator.appspot.com/build/kubernetes-jenkins/logs/ci-kubernetes-e2e-gce-scale-performance/37) of density test on 5k-node fail due to it:

```
Expected error:
    <*errors.StatusError | 0xc44f2fd7a0>: {
        ErrStatus: {
            TypeMeta: {Kind: "", APIVersion: ""},
            ListMeta: {SelfLink: "", ResourceVersion: "", Continue: ""},
            Status: "Failure",
            Message: "timeout",
            Reason: "",
            Details: nil,
            Code: 500,
        },
    }
    timeout
not to have occurred
```

cc @kubernetes/sig-scalability-misc
2017-09-20 18:49:36 -07:00
Kubernetes Submit Queue
f7dd62f149 Merge pull request #51277 from irfanurrehman/hpa-e2e-mod-for-fed
Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>..

[Federation] Update hpa e2e utils to enable reuse in fed hpa tests

This PR is to enable reuse of some hpa e2e utils in federation, facilitating https://github.com/kubernetes/kubernetes/pull/50168
cc @mwielgus @quinton-hoole 
cc @kubernetes/sig-federation-pr-reviews 

**Release note**:

```NONE
```
2017-09-20 18:01:40 -07:00
Cheng Xing
fbeb07aeac Fixed nil dereference in dynamic provisioning e2e tests 2017-09-20 16:08:23 -07:00
Hemant Kumar
6d4578ad57 Add e2e test for volume metrics
Adding e2e test for volume metrics. These tests need to be run
2017-09-20 18:23:05 -04:00
Kubernetes Submit Queue
654c522e4c Merge pull request #52477 from jamiehannaford/kubernetes-anywhere
Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>..

Support kubernetes-anywhere provider

**What this PR does / why we need it**:

Implements a new `kubernetes-anywhere` provider to allow upgrade testing in the e2e binary. This is the final step to allow https://github.com/kubernetes/test-infra/pull/4495 and https://github.com/kubernetes/kubernetes-anywhere/pull/450.

**Which issue this PR fixes**:

https://github.com/kubernetes/kubeadm/issues/311

**Special notes for your reviewer**:

Some questions I had

- Does the `--provider` flag specified [here](dbbf6261e0/jobs/config.json (L8587)) get sent to the flag defined [here](https://github.com/kubernetes/kubernetes/blob/master/test/e2e/framework/test_context.go#L219)? Or should I add another `--provider` flag inside `--upgrade_args` like this: `--upgrade_args=... --provider=kubernetes-anywhere`?
- Is it necessary to add waiting logic after the `make` command, or will it implicitly handle that by itself?

Some other points:

- I chose `sed` to manipulate the current kubernetes-anywhere `.config` rather than duplicating another [`anywhere.go`](https://github.com/kubernetes/test-infra/blob/master/kubetest/anywhere.go). One suggestion was to use `jq` but since the config on disk is not serialized to JSON yet, I'm not sure how that'd work.
- Since I don't have a GCE/GKE account or vCenter, I can't actually verify the e2e binary works. I've managed to build it, but if somebody could quickly run a smoke test, I'd appreciate it. This is my first poke around test-infra and e2e, so there might be some plumbing missing

/cc @jessicaochen @luxas @pipejakob @roberthbailey
2017-09-20 15:20:47 -07:00
Aleksandra Malinowska
fbeb4de996 add pdbs for more kube-system pods in scale down test 2017-09-20 16:45:01 +02:00
Aleksandra Malinowska
daf77aca1d adjust parameter in cluster autoscaling test 2017-09-20 11:41:15 +02:00
Kubernetes Submit Queue
6b7dafe8b5 Merge pull request #52500 from wongma7/mount-e2e
Automatic merge from submit-queue (batch tested with PRs 52500, 52533). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>..

Add mount options e2e test

**What this PR does / why we need it**: A test for newly added StorageClass.mountOptions and PV.mountOptions: provision a pv using a class with its storageclass.mountoptions set, and the end result should be that the mount options can be seen from the mounter.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: Fixes #52138

**Special notes for your reviewer**: 

**Release note**:

```release-note
NONE
```
2017-09-19 20:38:34 -07:00
Kubernetes Submit Queue
b80af5d4e4 Merge pull request #52707 from aleksandra-malinowska/autoscaling-test-hpa-allocatable
Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>..

In autoscaling integration test, use allocatable instead of capacity for node memory

This makes the remaining cluster autoscaling test (integration test of HPA and CA working together to scale up the cluster) use node allocatable resources when computing how much memory we need to consume in order to trigger scale up/prevent scale down. Follow up to #52650 as that one is already merging.

cc @wasylkowski
2017-09-19 19:13:48 -07:00
Matthew Wong
a836cc35ba Add mount options e2e test 2017-09-19 20:47:05 -04:00
Kubernetes Submit Queue
a8b3d38b0a Merge pull request #52666 from yguo0905/systemspec
Automatic merge from submit-queue (batch tested with PRs 51337, 47080, 52646, 52635, 52666). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>..

Fix: update system spec to support Docker 17.03

Docker 17.03 is 1.13 with bug fixes so they are of the same minor version release. We've validated them both in https://github.com/kubernetes/kubernetes/issues/42926. This PR changes the system spec to support Docker 17.03.

**This should be in 1.8.**

**Release note**:

```
Kubernetes 1.8 supports docker version 17.03.x.
```

/assign @Random-Liu
2017-09-19 17:31:14 -07:00
Zihong Zheng
5532e24280 Fix GCE LB resource cleanup for service e2e tests. 2017-09-19 15:42:41 -07:00
Kubernetes Submit Queue
ab9838b779 Merge pull request #52650 from aleksandra-malinowska/autoscaling-test-allocatable
Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>..

In cluster size autoscaling tests, use allocatable instead of capacity for node memory

This makes cluster size autoscaling e2e tests use node allocatable resources when computing how much memory we need to consume in order to trigger scale up/prevent scale down. It should fix failing tests in GKE.
2017-09-19 15:09:54 -07:00
Kubernetes Submit Queue
773268d533 Merge pull request #52743 from foxish/revert-upgrade-test-break
Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>..

Make statefulset tests part of separate testsuite

xref: https://github.com/kubernetes/kubernetes/pull/52582
2017-09-19 13:35:53 -07:00
Kubernetes Submit Queue
0ff165f647 Merge pull request #52350 from wongma7/reclaim-e2e
Automatic merge from submit-queue (batch tested with PRs 52350, 52659). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>..

Add e2e test for storageclass.reclaimpolicy

**What this PR does / why we need it**: Adds another dynamic provisioning test where the storageclass.reclaimpolicy == retain. Have to manually delete the PV at the end of the test.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: https://github.com/kubernetes/kubernetes/issues/52138

**Special notes for your reviewer**: I have not tested it but it's ready for review, I will comment and edit this when i've verified it actually works.

**Release note**:

```release-note
NONE
```
2017-09-19 11:32:38 -07:00
Anirudh
0bfae756b5 Make statefulset tests part of separate testsuite 2017-09-19 11:06:47 -07:00
Yang Guo
9fbbec1afc Fix: update system spec to support Docker 17.03 2017-09-19 10:40:25 -07:00
Shyam Jeedigunta
419bbd26fc Retry if possible while creating latency pods in density test 2017-09-19 17:40:57 +02:00
Manjunath A Kumatagi
945d8cd87b Multi-arch allowPrivilegeEscalation tests 2017-09-19 19:17:03 +05:30
Michail Kargakis
a702742d31 Remove kargakis from OWNERS, add tnozicka
Signed-off-by: Michail Kargakis <mkargaki@redhat.com>
2017-09-19 11:05:23 +02:00
Aleksandra Malinowska
69f1f2f296 use allocatable instead of capacity 2017-09-19 10:26:14 +02:00
divyenpatel
544c6c18b3 Fixing kubelet restart
added code to check if Pid is getting changed upon kubelet restart

adding 30 seconds wait time after restarting kubelet
2017-09-18 15:46:41 -07:00
Aleksandra Malinowska
f6d7b96b3a use allocatable instead of capacity for node memory 2017-09-18 14:18:46 +02:00
Anirudh
1762bc428e update tag 2017-09-15 16:47:42 -07:00
Anirudh
83ad6900e5 Add statefulset upgrade tests to be run as part of
all upgrade testsuites
2017-09-15 16:29:06 -07:00
Kubernetes Submit Queue
9aef242a4c Merge pull request #52223 from bsalamat/approver
Automatic merge from submit-queue (batch tested with PRs 51796, 52223)

Add bsalamat to sig-scheduling-maintainers

**What this PR does / why we need it**:
Adds bsalamat to sig-scheduling-maintainers.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes # N/A

**Release note**:

```release-note
NONE
```

@kubernetes/sig-scheduling-pr-reviews @davidopp @timothysc @k82cn @wojtek-t
2017-09-15 05:51:23 -07:00
Kubernetes Submit Queue
87a1b5f6d7 Merge pull request #52476 from clamoriniere1A/bugfix/e2e_job_backoff_flaky
Automatic merge from submit-queue

Bugfix: Fix e2e Flaky Apps/Job BackoffLimit test

This fix is linked to the PR #51153 that introduce the `JobSpec.BackoffLimit`.

Previously the Timeout used in the test was too aggressive and generates flaky test execution. Now it used the default `framework.JobTimeout` used in others tests.



**What this PR does / why we need it**:
This PR should fix flaky "[sig-apps] Job should exceed backoffLimit" test, due to a too short timeout duration.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #
fixes #51153 

**Special notes for your reviewer**:

**Release note**:

```release-note
```
2017-09-15 03:30:27 -07:00
Jamie Hannaford
69f5feb295 Support kubernetes-anywhere provider 2017-09-15 11:13:08 +02:00
Kubernetes Submit Queue
93ddb7be5f Merge pull request #52237 from smarterclayton/watch_metric
Automatic merge from submit-queue (batch tested with PRs 51824, 50476, 52451, 52009, 52237)

Improve apiserver metrics reporting

Normalize "WATCHLIST" to "WATCH", add "scope" to the other metrics (listing 50k pods is != listing pods in a namespace), and add a new scope "resource" to cover individual resource calls.

This roughly aligns metrics with our ACL model (technically resource scope is GET, but POST to a subresource and POST to a namespace are not the same thing).

```release-note
WATCHLIST calls are now reported as WATCH verbs in prometheus for the apiserver_request_* series.  A new "scope" label is added to all apiserver_request_* values that is either 'cluster', 'resource', or 'namespace' depending on which level the query is performed at.
```
2017-09-15 01:08:11 -07:00
Kubernetes Submit Queue
9d8c11924f Merge pull request #51781 from bsalamat/preemption_tests
Automatic merge from submit-queue (batch tested with PRs 52442, 52247, 46542, 52363, 51781)

Add more tests for pod preemption

**What this PR does / why we need it**:
Adds more e2e and integration tests for pod preemption.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #

**Special notes for your reviewer**:
This PR is based on #50949. Only the last commit is new.

**Release note**:

```release-note
NONE
```

ref/ #47604

@kubernetes/sig-scheduling-pr-reviews @davidopp
2017-09-15 00:11:17 -07:00
Kubernetes Submit Queue
2c81db53ce Merge pull request #52442 from crassirostris/sd-logging-e2e-fix-trimming
Automatic merge from submit-queue

[fluentd-gcp addon] Remove some e2e tests out of blocking suites

Fixes https://github.com/kubernetes/kubernetes/issues/52433

Some Stackdriver Logging e2e tests are broken in release-blocking suites:

- Due to the change in Docker 1.13, on some systems logs are automatically split by 16K chunks. This PR removes an e2e test that assumes otherwise
- In large clusters, it's not possible to ingest system logs from all nodes

Since it's not a Kubernetes problem per se, mitigating this by removing these tests from blocking suites.
2017-09-14 23:38:04 -07:00
Kubernetes Submit Queue
471b0beb2e Merge pull request #52480 from aleksandra-malinowska/test-fix-gke-small
Automatic merge from submit-queue

Fix failing autoscaling test in GKE

This should fix `[sig-autoscaling] Cluster size autoscaling [Slow] should increase cluster size if pending pods are small and there is another node pool that is not autoscaled [Feature:ClusterSizeAutoscalingScaleUp]` by getting a list of nodes from GKE nodepool in a different way (filtering nodes by labels.) Currently, gcloud command used for it is failing, as we only have GKE node pool name in the test and not the actual MIG name.
2017-09-14 18:48:26 -07:00
Kubernetes Submit Queue
5d995e3f7b Merge pull request #52372 from caesarxuchao/remove-config-copy
Automatic merge from submit-queue (batch tested with PRs 52376, 52439, 52382, 52358, 52372)

Remove the conversion of client config

It was needed because the clientset code in client-go was a copy of the clientset code in Kubernetes.. client-go is authoritative now, so we can remove the nasty copy.
2017-09-14 15:27:17 -07:00
Mik Vyatskov
e79ce0a50d [fluentd-gcp addon] Remove trimming e2e tests out of blocking suites 2017-09-14 19:16:20 +02:00
Niels-Ole Kühl
56247c4e83 Removing PrometheusPushGateway --prom-push-gateway flag from e2e tests. 2017-09-14 14:13:31 +02:00
Kubernetes Submit Queue
3c8fb4b90f Merge pull request #52426 from shyamjvs/dont-crash-on-missing-data
Automatic merge from submit-queue

Don't crash density test on missing a single measurement

We failed our last run due to this (https://k8s-gubernator.appspot.com/build/kubernetes-jenkins/logs/ci-kubernetes-e2e-gce-scale-performance/33) and didn't have pod-startup latency recorded at all.
2017-09-14 05:09:46 -07:00
Aleksandra Malinowska
158ffdb1ec Get nodes from GKE node pool by checking labels 2017-09-14 12:06:34 +02:00
cedric lamoriniere
0d7182f8bd Fix e2e Flaky Apps/Job BackoffLimit test
This fix is linked to the PR #51153 that introduce the
JobSpec.BackoffLimit.
Previously the Timeout used in the test was too agressive and generates
flaky test execution. Now it used the default framework.JobTimeout used
in others tests.
2017-09-14 11:08:48 +02:00
David Zhu
7e10741f94 E2E test to make sure controller does not crash because of nil volume spec. 2017-09-13 17:01:24 -07:00
Bobby (Babak) Salamat
f11b0a65d1 Add more tests on pod preemption 2017-09-13 12:12:07 -07:00