Commit Graph

24100 Commits

Author SHA1 Message Date
k8s-merge-robot
33d1f6c085 Merge pull request #18265 from davidopp/affinity
Auto commit by PR queue bot
2016-01-25 05:57:10 -08:00
Nagarjun
39b1874041 Add GetEnvOr() feature to utils Package 2016-01-25 16:08:35 +05:30
Piotr Szczesniak
6e15394171 Merge pull request #20083 from piosz/disable-cluster-autoscaling-e2e
Disabled cluster size autoscaling e2e since they are flaky
2016-01-25 11:38:16 +01:00
Pieter Joost van de Sande
b2215ead0f fix wrong db label
The rethinkdb-admin service was labelled with `influxdb` instead of `rethinkdb`
2016-01-25 11:31:02 +01:00
Piotr Szczesniak
23b9ca221d Disabled cluster size autoscaling e2e since they are flaky 2016-01-25 10:34:10 +01:00
Lucas Käldström
528910f5f4 Fix the apiserver proxy doc 2016-01-25 09:33:08 +02:00
Alex Robinson
c9471a1665 Fix broken link to logging sidecar example. 2016-01-24 17:22:35 -08:00
jet tsang zeon-git
9ea4f6b6d9 fix kube-up without DOCKER_OPTS 2016-01-25 07:15:20 +08:00
Dhawal Yogesh Bhanushali
f7db2c6e99 vsphere support 2016-01-24 13:52:56 -08:00
David Oppenheimer
5abfb43b64 Merge pull request #18263 from davidopp/taints-tolerations
Dedicated nodes, taints, and tolerations design proposal
2016-01-24 12:15:57 -08:00
David Oppenheimer
1948dc5ab0 Merge pull request #18261 from davidopp/nodeselector
Node affinity and NodeSelector design doc
2016-01-24 12:12:45 -08:00
David Oppenheimer
a646b6428d Merge pull request #18880 from davidopp/mesos-style
How to build Mesos/Omega-style frameworks on Kubernetes.
2016-01-24 12:11:58 -08:00
Justin Santa Barbara
a4d6801655 AWS: Change config-test defaults to match GCE
KUBE_ENABLE_DAEMONSETS & KUBE_ENABLE_DEPLOYMENTS default to true in
config-test in GCE, so we should do the same in AWS for e2e sanity.
2016-01-24 10:43:43 -05:00
Justin Santa Barbara
8a7afc459c AWS: Call build-runtime-config where it is used
build-runtime-config was being called in verify-prereqs, which didn't
match how GCE called it, and didn't seem to actually work.

Instead call it just before the master configuration is built.  Also
call it just before the node configuration is built, even though the
nodes don't _currently_ require the runtime_config.
2016-01-24 10:43:15 -05:00
Zach Loafman
364814072a Merge pull request #20024 from mikedanese/nil-ptr
fix nil pointer in addons test
2016-01-24 06:56:46 -08:00
harry
0fa5b6c4f7 Add e2e test for wrapper volumes
Use git server in e2e and refactor wrapper builder

Update e2e test to use a fake git server
2016-01-24 16:57:44 +08:00
David Oppenheimer
14c276306c Dedicated nodes, taints, and tolerations design doc. 2016-01-23 23:05:50 -08:00
Mike Danese
00144659ec fix nil pointer in addons test 2016-01-23 22:04:53 -08:00
David Oppenheimer
87f2590f37 How to build Mesos/Omega-style frameworks on Kubernetes. 2016-01-23 22:00:19 -08:00
David Oppenheimer
32fa44041b Design doc for node affinity, including NodeSelector. 2016-01-23 19:57:41 -08:00
David Oppenheimer
cea5cf42b6 Inter-pod topological affinity/anti-affinity design doc. 2016-01-23 18:39:34 -08:00
Hongchao Deng
5c68e9171d clean up generated content in stop-kubemark.sh 2016-01-23 16:07:00 -08:00
Aaron Davidson
97689c326d Reduce healthy threshold and check interval for Amazon ELBs
According to AWS, the ELB healthy threshold is "Number of consecutive health check successes before declaring an EC2 instance healthy." It has an unusual interaction with Kubernetes, since all nodes will enter either an unhealthy state or a healthy state together depending on the service's healthiness as a whole.

We have observed that if our service goes down for the unhealthy threshold (which is 2 checks at 30 second intervals = 60 seconds), then the ELB will stop serving traffic to all nodes in the cluster, and will wait for the healthy threshold (currently 10 * 30 = 300 seconds) AFTER the service is restored to add back the cluster nodes, meaning it remains unreachable for an extra 300 seconds.

With the new settings, the ELB will continue to time out dead nodes after 60 seconds, but will restore healthy nodes after 20 seconds. The minimum value for healthyThreshold is 2, and the minimum value for interval is 5 seconds. I chose 10 seconds rather than the minimum somewhat arbitrarily, because I was not sure how much this value might affect the scalability of clusters in EC2, as it does put some extra load on the kube-proxy.
2016-01-23 11:10:37 -08:00
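
The timings quoted in the commit message above follow from multiplying each threshold by the check interval. The sketch below only reproduces that arithmetic. The function and dictionary names are hypothetical, and the new unhealthy threshold of 6 is an inference from "60 seconds" at a 10-second interval, not a value stated in the message.

```python
def seconds_to_transition(threshold: int, interval_s: int) -> int:
    """Time for an ELB to flip an instance's state: consecutive checks x interval."""
    return threshold * interval_s


# Old settings, as quoted in the commit message.
old = {"interval_s": 30, "unhealthy_threshold": 2, "healthy_threshold": 10}

# New settings. healthy_threshold=2 and interval_s=10 come from the message;
# unhealthy_threshold=6 is ASSUMED so that 6 * 10 s matches the stated 60 s.
new = {"interval_s": 10, "unhealthy_threshold": 6, "healthy_threshold": 2}

for label, cfg in (("old", old), ("new", new)):
    down = seconds_to_transition(cfg["unhealthy_threshold"], cfg["interval_s"])
    up = seconds_to_transition(cfg["healthy_threshold"], cfg["interval_s"])
    print(f"{label}: marked unhealthy after {down}s, restored after {up}s")
```

Under these assumptions the time to mark nodes unhealthy stays at 60 seconds, while the recovery delay drops from 300 seconds to 20 seconds.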
Zach Loafman
717551b13c Merge pull request #19760 from mattf/master
move spark images to github.com/kubernetes/application-images/spark
2016-01-23 07:55:20 -08:00
Zach Loafman
a71a0f05ef Merge pull request #19933 from spxtr/fix-verifier
Handle multijob runners properly in verify-jenkins-jobs.sh
2016-01-23 07:34:28 -08:00
Zach Loafman
ad6ebc677d Merge pull request #20046 from lavalamp/flake-watch-test
Remove TestWatchEtcdError
2016-01-23 07:34:14 -08:00
Zach Loafman
c3fc36ee95 Merge pull request #20042 from thockin/logs-for-19665
slightly better logs in service e2e
2016-01-23 07:33:11 -08:00
Zach Loafman
71fadffbed Merge pull request #20025 from mikedanese/fix-unbound
cluster/gce: fix unbound variable
2016-01-23 07:32:56 -08:00
Zach Loafman
2cd4d9e2a0 Merge pull request #19557 from ihmccreery/feature-serviceloadbalancer
Make ServiceLoadBalancer a Feature test
2016-01-23 07:32:37 -08:00
David Oppenheimer
2866ae7bca Merge pull request #20037 from davidopp/flake1
Add time logging to runReplicationControllerTest to help debug timeouts in the future
2016-01-22 22:10:24 -08:00
Clayton Coleman
f9c48c864d Don't add 'error: ' to messages if already there 2016-01-23 00:41:36 -05:00
derekwaynecarr
93005eb7c9 Mark namespace e2e [Serial] 2016-01-22 22:55:40 -05:00
Justin Santa Barbara
d8460d5920 AWS: Pass non-masquerade-cidr through kube-up
Set the environment variable NON_MASQUERADE_CIDR.

Docs in cluster/aws/options.md
2016-01-22 21:01:44 -05:00
Ken Robertson
66c99d5e9c kubelet: Update FakeDockerClient used in integration tests
This updates the mock for the docker client used in integration tests to
include the engine version in its VersionInfo response.
2016-01-22 17:14:01 -08:00
Daniel Smith
c0ffbd58db Remove TestWatchEtcdError
We decided to remove this test, as there's no way to get an upper bound
on its running time. Etcd restart behavior should be tested in
integration or e2e tests.
2016-01-22 16:43:44 -08:00
Prashanth Balasubramanian
ffd34311c6 Sync pods for daemon sets. 2016-01-22 16:42:07 -08:00
Prashanth Balasubramanian
1b93ee7b35 Sync all pods with store before starting RC Manager. 2016-01-22 16:42:02 -08:00
Tim Hockin
9a75794fd6 slightly better logs in service e2e 2016-01-22 16:12:02 -08:00
Isaac Hollander McCreery
21e3776de6 Change wording about experimental API 2016-01-22 16:03:08 -08:00
Chao Xu
bc2a9736dc remove duplicated tests 2016-01-22 15:40:42 -08:00
David Oppenheimer
a8e742f462 Add time logging to runReplicationControllerTest to help debug
timeouts in the future.
2016-01-22 15:32:20 -08:00
Isaac Hollander McCreery
9477bc08a5 Use INSTANCE_PREFIX to prefix uploaded binaries in GCE, fixes #8705 2016-01-22 15:27:33 -08:00
Zach Loafman
6397b781ac Merge pull request #20030 from kubernetes/revert-18169-smarter-scaling
Revert "kubectl: Make scaling smarter"
2016-01-22 15:07:26 -08:00
Zach Loafman
0791de15e3 Merge pull request #20007 from smarterclayton/ipallocrace
19848: Retry service IP repair on conflict
2016-01-22 15:02:05 -08:00
Eric Tune
53ee76fe1a Support Work Queue jobs with variable parallelism
When job.spec.completions is nil, only
one task needs to succeed for the job to succeed,
and parallelism can be scaled freely during runtime.

Added tests.

Release Note:

This causes two minor changes to the API.

First, unset parallelism previously was defaulted to be
equal to completions.  Now it always defaults to 1 if unset.

Second, having parallelism=N and completions unset would previously
be defaulted to 1 completion and N parallelism.
(this is not something we expect people to do, though)
Now, no defaulting occurs in that case, and the job's
behavior is different (any completion causes success).
2016-01-22 14:57:51 -08:00
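
The release note above can be read as a small defaulting rule plus a success condition. The following is a minimal sketch of the new behavior as the message describes it, not the actual Kubernetes implementation. The function names are hypothetical, `None` stands in for an unset field, and the success check is simplified (it ignores whether other pods are still running).

```python
from typing import Optional, Tuple


def default_job_spec(
    parallelism: Optional[int], completions: Optional[int]
) -> Tuple[int, Optional[int]]:
    """New defaulting per the release note: unset parallelism becomes 1;
    unset completions is left unset (no defaulting occurs)."""
    if parallelism is None:
        parallelism = 1
    return parallelism, completions


def job_succeeded(succeeded_pods: int, completions: Optional[int]) -> bool:
    """Simplified success check: with completions unset, any one success is
    enough; otherwise the job needs `completions` successful pods."""
    if completions is None:
        return succeeded_pods >= 1
    return succeeded_pods >= completions


# Work-queue style job: parallelism=5, completions unset.
parallelism, completions = default_job_spec(5, None)
print(parallelism, completions, job_succeeded(1, completions))
```

For comparison, the message says the old defaulting would have turned the same input into 1 completion with parallelism 5, and would have set an unset parallelism equal to completions.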
Zach Loafman
62726c4ab8 Revert "kubectl: Make scaling smarter" 2016-01-22 14:52:38 -08:00
Mike Danese
0fdaa41c5e cluster/gce: fix unbound variable 2016-01-22 14:42:32 -08:00
Zach Loafman
3df16731e2 Merge pull request #19994 from zmerlynn/cross-version
build: Use Go version in cross image tag
2016-01-22 14:32:14 -08:00
Justin Santa Barbara
88eeec4f13 GCE: Register the PersistentVolumeLabel admission controller 2016-01-22 17:16:39 -05:00
Justin Santa Barbara
1276675512 Ubernetes-Lite: Error if a PD name is ambiguous
We don't cope well if a PD is in multiple zones, but this is actually
fairly easy to detect.  This is probably justified purely on the basis
that we never want to delete the wrong volume (DeleteDisk), but also
because this means that we now warn on creation if a disk is in multiple
zones (with the labeling admission controller).

This also means that, with the scheduling predicate in place, many of
our volume problems "go away" in practice: you still can't create or
delete a volume when it is ambiguous, but thereafter the volume will be
labeled with the zone, which will match it only to nodes in the same
zone, and then we query for the volume in that zone when we
attach/detach it.
2016-01-22 17:16:38 -05:00
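
The ambiguity check described in the commit message above amounts to looking a disk name up across all zones and refusing to proceed when it appears in more than one. The sketch below illustrates that idea only. It is not the GCE cloud provider code, and every name in it (`find_disk_zone`, `AmbiguousDiskError`, `DiskNotFoundError`, the zone names) is hypothetical.

```python
from typing import Dict, Set


class DiskNotFoundError(LookupError):
    """No zone contains a disk with the requested name."""


class AmbiguousDiskError(LookupError):
    """More than one zone contains a disk with the requested name."""


def find_disk_zone(name: str, disks_by_zone: Dict[str, Set[str]]) -> str:
    """Return the single zone holding `name`, or raise if it is missing
    or present in several zones."""
    zones = sorted(zone for zone, disks in disks_by_zone.items() if name in disks)
    if not zones:
        raise DiskNotFoundError(name)
    if len(zones) > 1:
        raise AmbiguousDiskError(f"disk {name!r} found in zones {zones}")
    return zones[0]


# Example inventory: "data" exists in two zones, "logs" in one.
inventory = {
    "us-central1-a": {"data", "logs"},
    "us-central1-b": {"data"},
}
print(find_disk_zone("logs", inventory))
```

Raising an error instead of picking a zone matches the stated motivation of never deleting the wrong volume.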