Commit Graph

34694 Commits

Author SHA1 Message Date
Dmitry Shulyak
c0981963d9 Verify evicted pods managed by petset controller will be recreated
Spawn pet set with single replica and simple pod. They will have
conflicting hostPort definitions, and spawned on the same node.

As the result pet set pod, it will be created after simple pod, will be
in Failed state. Pet set controller will try to re-create it. After
verifying that pet set pod failed and was recreated atleast once, we will
remove pod with conflicting hostPort and wait until pet set pod will be in
running state.

Change-Id: I5903f5881f8606c696bd390df58b06ece33be88a
2016-09-21 12:03:11 +03:00
Dmitry Shulyak
200f8c5c39 Delete evicted pet
If pet was evicted by kubelet - it will stuck in this state forever.
By analogy to regular pod we need to re-create pet so that it will
be re-scheduled to another node, so in order to re-create pet
and preserve consitent naming we will delete it in petset controller
and create after that.

Change-Id: Ib98bf7f34b3f2ab1582b9de34b5f4c5f84cd5215
2016-08-31 14:41:10 +03:00
Marek Grabowski
ebe733e469 Merge pull request #31770 from kubernetes/revert-31614-bump_pod_nanny_version
Revert "Bump addon-resizer version in heapster"
2016-08-31 08:53:44 +02:00
Wojciech Tyczynski
f75e0ece0a Revert "Bump addon-resizer version in heapster" 2016-08-31 08:30:56 +02:00
Kubernetes Submit Queue
3b404bd213 Merge pull request #31651 from Random-Liu/move-host-info-around-test-result
Automatic merge from submit-queue

Node E2E: Move host info around test result.

Discussed offline with @yujuhong and @dchen1107. Currently, the node e2e result is organized as:
```
================================================================
Success Finished Host tmp-node-e2e-b6c375c7-e2e-node-containervm-v20160321-image Test Suite
{ginkgo-output}
{framework-error}
================================================================
```
This makes it painful to find which image the test is failing on. The `{ginkgo-output}` is usually quite long, so we have to scroll mouse up and down to find the host name.
This PR changes the test result to:
```
================================================================
Start Host tmp-node-e2e-b6c375c7-e2e-node-containervm-v20160321-image Test Suite
{ginkgo-output}
Success Finished Host tmp-node-e2e-b6c375c7-e2e-node-containervm-v20160321-image Test Suite
{framework-error}
================================================================
```
This is not perfect, but much better than before. We can easily find the host name under the ginkgo test result, like this:
```
================================================================
Start Host test-gci-dev-54-8743-3-0 Test Suite
Running Suite: E2eNode Suite
============================
Random Seed: 1472511489 - Will randomize all specs
Will run 0 of 131 specs

Running in parallel across 8 nodes

I0829 22:58:13.727764    1143 e2e_node_suite_test.go:98] Pre-pulling images so that they are cached for the tests.
I0829 22:58:28.562459    1143 e2e_node_suite_test.go:111] Node services started.  Running tests...
I0829 22:58:28.562477    1143 e2e_node_suite_test.go:116] Wait for the node to be ready

SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS
------------------------------
I0829 22:58:29.742596    1143 e2e_node_suite_test.go:136] Stopping node services...
I0829 22:58:29.742650    1143 services.go:673] Killing process 1423 (services) with -TERM
I0829 22:58:29.860893    1143 e2e_node_suite_test.go:141] Tests Finished


Ran 0 of 131 Specs in 16.185 seconds
SUCCESS! -- 0 Passed | 0 Failed | 0 Pending | 131 Skipped 

Ginkgo ran 1 suite in 19.939034297s
Test Suite Passed

Success Finished Host test-gci-dev-54-8743-3-0 Test Suite
================================================================
```

In a following PR, I'll print the test result from different images into different files to make it more clear for debugging. Mark v1.4 because this helps us de-flake test.

/cc @kubernetes/sig-node
2016-08-30 23:08:41 -07:00
Kubernetes Submit Queue
e3acd8df71 Merge pull request #31760 from Random-Liu/remove-stop-services-option
Automatic merge from submit-queue

Node E2E: Make sure the test framework collect log properly.

Fixes #31537.

It turns out that the service [stop function](https://github.com/kubernetes/kubernetes/blob/master/test/e2e_node/e2e_node_suite_test.go#L135-L139) is never called, because `stop-services` is [always `false`](https://github.com/kubernetes/kubernetes/blob/master/test/e2e_node/runner/remote/run_remote.go#L444). And currently we collect log in the `stop` function.

This PR makes sure the `stop-services` is always true in node e2e. This will fix #31537.

@yujuhong @dchen1107 @vishh
2016-08-30 22:12:54 -07:00
Random-Liu
2907d4019d Move host info around test result. 2016-08-30 21:31:04 -07:00
Random-Liu
e7a1b4e16f Do not set stop-services=false for node e2e and add more logs. 2016-08-30 17:52:23 -07:00
Kubernetes Submit Queue
f839c5ffdf Merge pull request #31746 from mwielgus/ingres-events
Automatic merge from submit-queue

Events for federated ingress controller

cc: @quinton-hoole @kubernetes/sig-cluster-federation
2016-08-30 16:52:08 -07:00
Kubernetes Submit Queue
2b755dc480 Merge pull request #31716 from coufon/explicitly_delete_pods_in_node_perf_test
Automatic merge from submit-queue

Explicitly delete pods in node performance tests

This PR explicitly deletes all created pods at the end in node e2e performance related tests.

The large number of pods may cause namespace cleanup times out (in #30878), therefore we explicitly delete all pods for cleaning up.
2016-08-30 16:52:03 -07:00
Kubernetes Submit Queue
09e3fb355b Merge pull request #31629 from rmmh/fix-gubernator-line
Automatic merge from submit-queue

Only print "running gubernator.sh" when actually running it.
2016-08-30 16:10:18 -07:00
Kubernetes Submit Queue
1c607b391a Merge pull request #31721 from mml/disrupt.e2e.versioned
Automatic merge from submit-queue

Rewrite disruption e2e test to use versioned client.

This currently includes the changes from #31638.  I will rebase once that is merged.
2016-08-30 15:35:15 -07:00
Zhou Fang
0167f74c6c explicitly delete pods in node perf tests 2016-08-30 15:18:37 -07:00
Marcin Wielgus
e850d4f0a2 Events for federated ingress controller 2016-08-30 23:52:35 +02:00
Matt Liggett
a5f1dc44f5 // update-all and staging/.../client-go/copy.sh 2016-08-30 13:35:51 -07:00
Kubernetes Submit Queue
3fd14d97fb Merge pull request #31229 from pigmej/typos_englishify_pkg_api
Automatic merge from submit-queue

Fix various typos in pkg/api

Just fixed some typos + "englishify" in pkg/api
2016-08-30 13:35:36 -07:00
Kubernetes Submit Queue
9f4fb68b37 Merge pull request #31658 from spxtr/slow-tests
Automatic merge from submit-queue

Move several tests to the slow suite.

https://k8s-testgrid.appspot.com/google-gce#gce&width=5&graph-metrics=test-duration-minutes

ref https://github.com/kubernetes/contrib/issues/1522
2016-08-30 12:56:18 -07:00
Matt Liggett
44b2c862ef Rewrite disruption e2e test to use versioned client. 2016-08-30 12:34:45 -07:00
Kubernetes Submit Queue
a3b17f5e53 Merge pull request #31614 from wojtek-t/bump_pod_nanny_version
Automatic merge from submit-queue

Bump addon-resizer version in heapster

Ref #31105
2016-08-30 12:14:28 -07:00
Kubernetes Submit Queue
be859b144d Merge pull request #30808 from smarterclayton/no_defaults
Automatic merge from submit-queue

ClientConfig should not default to http://localhost:8080

This changes clientcmd to skip the default cluster, but preserves the
behavior in kubectl. This prevents the possibility of an administrator
misconfiguration in kubelet or other server component from allowing a
third party who can bind to 8080 on that host from potentially
impersonating an API server and gaining root access.

@mikedanese @deads2k this removes the defaulting of http://localhost:8080 for server from everything except kubectl.



```release-note
Kubernetes server components using `kubeconfig` files no longer default to `http://localhost:8080`.  Administrators must specify a server value in their kubeconfig files.
```
2016-08-30 11:33:39 -07:00
Kubernetes Submit Queue
ec061b81d0 Merge pull request #31660 from mtaufen/un-ref-KubeletConfiguration-on-kubelet
Automatic merge from submit-queue

Make kubeletConfiguration field on Kubelet struct a value type
2016-08-30 10:46:48 -07:00
Kubernetes Submit Queue
1dfd6ab0c1 Merge pull request #31696 from thommay/no_maintainer
Automatic merge from submit-queue

I'm not a rackspace or kubernetes user any more
2016-08-30 09:57:24 -07:00
Kubernetes Submit Queue
12429e1690 Merge pull request #31664 from coufon/fix_perf_test_limit
Automatic merge from submit-queue

increase latency and resource limit accroding to test results

This PR increases the latency limit of node e2e density test according to previous test results.

Fixed #30878
2016-08-30 09:57:19 -07:00
Kubernetes Submit Queue
1b05640cf8 Merge pull request #31640 from mwielgus/events-for-ns
Automatic merge from submit-queue

Events in federated namespace controller

@quinton-hoole @nikhiljindal @wojtek-t @kubernetes/sig-cluster-federation
2016-08-30 09:11:27 -07:00
Kubernetes Submit Queue
dccf3e9288 Merge pull request #31639 from derekwaynecarr/fix-e2e-local
Automatic merge from submit-queue

Fix running e2e against local cluster

required env vars were not being set.

/cc @pmorie
2016-08-30 09:11:22 -07:00
Kubernetes Submit Queue
dbb1950922 Merge pull request #31141 from deads2k/add-queue-depth-metrics
Automatic merge from submit-queue

add names for workqueues to gather controller latency/depth metrics

Adding names to the workqueues used by controllers allows the automatic collection of depth, rate, and latency metrics for those controllers.  These are useful for diagnosing various "slow controller" cases.

@kubernetes/rh-cluster-infra
2016-08-30 08:32:00 -07:00
Kubernetes Submit Queue
a6e8283021 Merge pull request #31700 from gmarek/unused
Automatic merge from submit-queue

Remove unused argument to NodeController.Run

@pwittrock - can this go into 1.4? It's zero risk and reduces possible confusion.
2016-08-30 07:49:40 -07:00
Clayton Coleman
06cbb29e9e
ClientConfig should not default to http://localhost:8080
This changes clientcmd to skip the default cluster, but preserves the
behavior in kubectl. This prevents the possibility of an administrator
misconfiguration in kubelet or other server component from allowing a
third party who can bind to 8080 on that host from potentially
impersonating an API server and gaining root access.
2016-08-30 10:43:24 -04:00
Marcin Wielgus
044fab9f58 Events for federated namesapce controller 2016-08-30 16:28:49 +02:00
Kubernetes Submit Queue
3ccb99d87d Merge pull request #31692 from jszczepkowski/master
Automatic merge from submit-queue

Marked HA master feature experimental.

Marked HA master feature experimental.
2016-08-30 07:09:49 -07:00
deads2k
4317173d3f add names for workqueues to gather controller latency/depth metrics 2016-08-30 09:51:50 -04:00
Kubernetes Submit Queue
0f8869d308 Merge pull request #31516 from sttts/sttts-e2e-CheckConnectivityToHost-logs
Automatic merge from submit-queue

e2e: log wget output on CheckConnectivityToHost error

Log output might help to diagnose e2e flakes, whether they are caused by dns issues or connection timeouts.

Might help with flake https://github.com/kubernetes/kubernetes/issues/28188.
2016-08-30 06:27:28 -07:00
Kubernetes Submit Queue
be848f2adc Merge pull request #26863 from luxas/multi_platform
Automatic merge from submit-queue

Add a multi-platform proposal

This is a proposal for the work I've done with multiarch Kubernetes. I've worked quite a lot on it and now I've written it down so others may easily use and learn from it, while at the same time pointing out issues that we should fix in the future.

I would like to get this in before `v1.3`.

Feel free to comment on it if it's something that's unclear.

@david-mcmahon @ixdy @mikedanese @Pensu @davidopp @bgrant0607 @dlorenc @eparis @vishh @thockin @dchen1107 @lavalamp
2016-08-30 05:40:39 -07:00
Kubernetes Submit Queue
1388e664a4 Merge pull request #31637 from freehan/docker0fix
Automatic merge from submit-queue

keep docker0 with private cidr range

fixes: #31465

Keep docker0 when using kubenet on GCI. Assign 169.254.123.1/24 to docker0 to avoid cidr conflict.
2016-08-30 05:40:33 -07:00
gmarek
ea2d19f5d7 Remove unused argument to NodeController.Run 2016-08-30 14:24:56 +02:00
Kubernetes Submit Queue
177a6840f8 Merge pull request #31519 from piosz/influxdb
Automatic merge from submit-queue

Update influxdb to 0.12

fix #27630 

```release-note
Update influxdb to 0.12
```
2016-08-30 05:03:47 -07:00
Lucas Käldström
b033b25f64 Add a multiarch proposal 2016-08-30 14:39:51 +03:00
Kubernetes Submit Queue
228ba405c5 Merge pull request #31641 from mwielgus/updater-on-failure
Automatic merge from submit-queue

OnError callback function for federated updater

The handler can be useful for precise logging and events recording. Update returns just a single, first received error.

cc: @quinton-hoole @wojtek-t @kubernetes/sig-cluster-federation
2016-08-30 04:24:59 -07:00
Kubernetes Submit Queue
5c332e0d02 Merge pull request #31628 from Random-Liu/move-image-logic-to-remote-run
Automatic merge from submit-queue

Node E2E: Move the image and host related logic to be remote run specific.

This PR addresses #31597, and better fixes #31588.

With this PR:
1) All remote logic is moved into remote mode.
2) All gci related logic is only called when `HOSTS` and `IMAGES` are not specified.
3) Run node e2e against other node e2e images will work again - `make test-e2e-node REMOET=true IMAGES=e2e-node-containervm-v20160321-image`.
4) List images will work again - `make test-e2e-node REMOTE=true LIST_IMAGES=true`.

Mark 1.4, because https://github.com/kubernetes/kubernetes/pull/31588 is marked as 1.4. And this makes the node e2e work as is described in the [doc ](https://github.com/kubernetes/kubernetes/blob/master/docs/devel/e2e-node-tests.md#run-tests-using-different-images).

@yujuhong @vishh 
/cc @kubernetes/sig-node
2016-08-30 04:24:55 -07:00
Thom May
3f219065b5
I'm not a rackspace or kubernetes user any more
Signed-off-by: Thom May <thom@may.lt>
2016-08-30 12:04:50 +01:00
Kubernetes Submit Queue
02e0bdc24e Merge pull request #31593 from dims/fix-test-integration-osx
Automatic merge from submit-queue

Fix make test-integration under OSX

Just throw in a doc.go so there's something compilable in the
test/integration/metrics directory.

Fixes #31587
2016-08-30 03:45:07 -07:00
Piotr Szczesniak
2c19b189c7 Fixed monitoring e2e test 2016-08-30 12:23:40 +02:00
Piotr Szczesniak
323f2aa0db Bumped influxdb to 0.12.2 in Godeps 2016-08-30 12:23:40 +02:00
Marcin Wielgus
6531c83b7b OnError callback function for federated updater 2016-08-30 12:01:50 +02:00
Kubernetes Submit Queue
e5a198e3da Merge pull request #31625 from timstclair/aa-gate
Automatic merge from submit-queue

AppArmor was flipped to beta, update feature gate

/cc @dchen1107

---

1.4 Justification:

- Risk: Low. Change is small & contained.
- Rollback: Nothing else should touch this code path or depend on its functionality.
- Cost: AppArmor is beta, but the feature gate thinks it's alpha.
2016-08-30 03:01:17 -07:00
Jerzy Szczepkowski
c5f18d517d Marked HA master feautre experimental.
Marked HA master feautre experimental.
2016-08-30 11:50:22 +02:00
Kubernetes Submit Queue
56434f1144 Merge pull request #31623 from sttts/sttts-mac-ln-T
Automatic merge from submit-queue

Fix build on Mac

Use portable `ln -snf` instead of `-sTf` (introduced in https://github.com/kubernetes/kubernetes/pull/31525).
2016-08-30 02:16:55 -07:00
Kubernetes Submit Queue
8a44cb5c7b Merge pull request #31683 from gmarek/kubemark-5
Automatic merge from submit-queue

Tune kubemark-5 resource constraints

Ref. #31105
2016-08-30 01:35:38 -07:00
Kubernetes Submit Queue
17787eb6f2 Merge pull request #31557 from timstclair/aa-event
Automatic merge from submit-queue

Include security options in the container created event

New container creation events look like:
```
Created container with docker id /k8s_bar2.a4; Security:[seccomp=sub/subtest(md5:07c9bcb4db631f7ca191d6e0bca49f76)]

Created container with docker id /k8s_bar2.a4; Security:[seccomp=unconfined apparmor=foo-profile]
```

The goal is to provide enough information to confirm that the requseted security constraints were honored.

For https://github.com/kubernetes/kubernetes/issues/31284

/cc @dchen1107 @thockin @jfrazelle @pweil- @pmorie

---

Justification for v1.4:

- Risk: low. This appends some additional information to a human readable message. A bug here would probably not break any functionality
- Roll-back: I don't anticipate any more changes to this area of the code. No functionality depends on this change.
- Cost of not including: Users don't get any (positive) confirmation that the AppArmor or Seccomp profile they requested were actually enabled.
2016-08-30 01:35:33 -07:00
Kubernetes Submit Queue
cca6d7ddd9 Merge pull request #31634 from timstclair/gubernator
Automatic merge from submit-queue

Cleanup node failure message

Fix missing newline
2016-08-30 00:53:15 -07:00