Commit Graph

23932 Commits

Author SHA1 Message Date
Justin Santa Barbara
a4d6801655 AWS: Change config-test defaults to match GCE
KUBE_ENABLE_DAEMONSETS & KUBE_ENABLE_DEPLOYMENTS default to true in
config-test in GCE, so we should do the same in AWS for e2e sanity.
2016-01-24 10:43:43 -05:00
Justin Santa Barbara
8a7afc459c AWS: Call build-runtime-config where it is used
build-runtime-config was being called in verify-prereqs, which didn't
match how GCE called it, and didn't seem to actually work.

Instead call it just before the master configuration is built.  Also
call it just before the node configuration is built, even though the
nodes don't _currently_ require the runtime_config.
2016-01-24 10:43:15 -05:00
Zach Loafman
364814072a Merge pull request #20024 from mikedanese/nil-ptr
fix nil pointer in addons test
2016-01-24 06:56:46 -08:00
harry
0fa5b6c4f7 Add e2e test for wrapper volumes
Use git server in e2e and refactor wrapper builder

Update e2e test to use a fake git server
2016-01-24 16:57:44 +08:00
David Oppenheimer
14c276306c Dedicated nodes, taints, and tolerations design doc. 2016-01-23 23:05:50 -08:00
Mike Danese
00144659ec fix nil pointer in addons test 2016-01-23 22:04:53 -08:00
David Oppenheimer
87f2590f37 How to build Mesos/Omega-style frameworks on Kubernetes. 2016-01-23 22:00:19 -08:00
David Oppenheimer
32fa44041b Design doc for node affinity, including NodeSelector. 2016-01-23 19:57:41 -08:00
David Oppenheimer
cea5cf42b6 Inter-pod topological affinity/anti-affinity design doc. 2016-01-23 18:39:34 -08:00
Hongchao Deng
5c68e9171d clean up generated content in stop-kubemark.sh 2016-01-23 16:07:00 -08:00
Aaron Davidson
97689c326d Reduce healthy threshold and check interval for Amazon ELBs
According to AWS, the ELB healthy threshold is "Number of consecutive health check successes before declaring an EC2 instance healthy." It has an unusual interaction with Kubernetes, since all nodes will enter either an unhealthy state or a healthy state together depending on the service's healthiness as a whole.

We have observed that if our service goes down for the unhealthy threshold (which is 2 checks at 30 second intervals = 60 seconds), then the ELB will stop serving traffic to all nodes in the cluster, and will wait for the healthy threshold (currently 10 * 30 = 300 seconds) AFTER the service is restored to add back the cluster nodes, meaning it remains unreachable for an extra 300 seconds.

With the new settings, the ELB will continue to timeout dead nodes after 60 seconds, but will restore healthy nodes after 20 seconds. The minimum value for healthyThreshold is 2, and the minimum value for interval is 5 seconds. I went for 10 seconds instead of the minimum sort of arbitrarily because I was not sure how much this value may affect the scalability of clusters in EC2, as it does put some extra load on the kube-proxy.
2016-01-23 11:10:37 -08:00
Zach Loafman
717551b13c Merge pull request #19760 from mattf/master
move spark images to github.com/kubernetes/application-images/spark
2016-01-23 07:55:20 -08:00
Zach Loafman
a71a0f05ef Merge pull request #19933 from spxtr/fix-verifier
Handle multijob runners properly in verify-jenkins-jobs.sh
2016-01-23 07:34:28 -08:00
Zach Loafman
ad6ebc677d Merge pull request #20046 from lavalamp/flake-watch-test
Remove TestWatchEtcdError
2016-01-23 07:34:14 -08:00
Zach Loafman
c3fc36ee95 Merge pull request #20042 from thockin/logs-for-19665
slightly better logs in service e2e
2016-01-23 07:33:11 -08:00
Zach Loafman
71fadffbed Merge pull request #20025 from mikedanese/fix-unbound
cluster/gce: fix unbound variable
2016-01-23 07:32:56 -08:00
Zach Loafman
2cd4d9e2a0 Merge pull request #19557 from ihmccreery/feature-serviceloadbalancer
Make ServiceLoadBalancer a Feature test
2016-01-23 07:32:37 -08:00
David Oppenheimer
2866ae7bca Merge pull request #20037 from davidopp/flake1
Add time logging to runReplicationControllerTest to help debug timeouts in the future
2016-01-22 22:10:24 -08:00
Clayton Coleman
f9c48c864d Don't add 'error: ' to messages if already there 2016-01-23 00:41:36 -05:00
derekwaynecarr
93005eb7c9 Mark namespace e2e [Serial] 2016-01-22 22:55:40 -05:00
Justin Santa Barbara
d8460d5920 AWS: Pass non-masquerade-cidr through kube-up
Set the environment variable NON_MASQUERADE_CIDR.

Docs in cluster/aws/options.md
2016-01-22 21:01:44 -05:00
Ken Robertson
66c99d5e9c kubelet: Update FakeDockerClient used in integration tests
This updates the mock for the docker client used in integration tests to
include the engine version in its VersionInfo response.
2016-01-22 17:14:01 -08:00
Daniel Smith
c0ffbd58db Remove TestWatchEtcdError
We decided to remove this test, as there's no way to get an upper bound
on its running time. Etcd restart behavior should be tested in
integration or e2e tests.
2016-01-22 16:43:44 -08:00
Prashanth Balasubramanian
ffd34311c6 Sync pods for daemon sets. 2016-01-22 16:42:07 -08:00
Prashanth Balasubramanian
1b93ee7b35 Sync all pods with store before starting RC Manager. 2016-01-22 16:42:02 -08:00
Tim Hockin
9a75794fd6 slightly better logs in service e2e 2016-01-22 16:12:02 -08:00
Isaac Hollander McCreery
21e3776de6 Change wording about experimental API 2016-01-22 16:03:08 -08:00
Chao Xu
bc2a9736dc remove duplicated tests 2016-01-22 15:40:42 -08:00
David Oppenheimer
a8e742f462 Add time logging to runReplicationControllerTest to help debug
timeouts in the future.
2016-01-22 15:32:20 -08:00
Isaac Hollander McCreery
9477bc08a5 Use INSTANCE_PREFIX to prefix uploaded binaries in GCE, fixes #8705 2016-01-22 15:27:33 -08:00
Zach Loafman
6397b781ac Merge pull request #20030 from kubernetes/revert-18169-smarter-scaling
Revert "kubectl: Make scaling smarter"
2016-01-22 15:07:26 -08:00
Zach Loafman
0791de15e3 Merge pull request #20007 from smarterclayton/ipallocrace
19848: Retry service IP repair on conflict
2016-01-22 15:02:05 -08:00
Eric Tune
53ee76fe1a Support Work Queue jobs with variable parallelism
When job.spec.completions is nil, only
one task needs to succeed for the job to succeed,
and parallelism can be scaled freely during runtime.

Added tests.

Release Note:

This causes two minor changes to the API.

First, unset parallelism previously was defaulted to be
equal to completions.  Now it always defaults to 1 if unset.

Second, having parallelism=N and completions unset would previously
be defaulted to 1 completion and N parallelism.
(this is not something we expect people to do, though)
Now, no defaulting occurs in that case, and the job's
behavior is different (any completion causes success).
2016-01-22 14:57:51 -08:00
Zach Loafman
62726c4ab8 Revert "kubectl: Make scaling smarter" 2016-01-22 14:52:38 -08:00
Mike Danese
0fdaa41c5e cluster/gce: fix unbound variable 2016-01-22 14:42:32 -08:00
Zach Loafman
3df16731e2 Merge pull request #19994 from zmerlynn/cross-version
build: Use Go version in cross image tag
2016-01-22 14:32:14 -08:00
Justin Santa Barbara
88eeec4f13 GCE: Register the PersistentVolumeLabel admission controller 2016-01-22 17:16:39 -05:00
Justin Santa Barbara
1276675512 Ubernetes-Lite: Error if a PD name is ambiguous
We don't cope well if a PD is in multiple zones, but this is actually
fairly easy to detect.  This is probably justified purely on the basis
that we never want to delete the wrong volume (DeleteDisk), but also
because this means that we now warn on creation if a disk is in multiple
zones (with the labeling admission controller).

This also means that with the scheduling predicate in place, that many
of our volume problems "go away" in practice: you still can't create or
delete a volume when it is ambiguous, but thereafter the volume will be
labeled with the zone, that will match it only to nodes with the same
zone, and then we query for the volume in that zone when we
attach/detach it.
2016-01-22 17:16:38 -05:00
Justin Santa Barbara
900567288b Ubernetes Lite: Label volumes with zone information
When volumes are labeled, they will only be scheduled onto nodes in the
same zone.
2016-01-22 17:16:31 -05:00
Eric Tune
3a15a374ba Merge pull request #19447 from Clarifai/ecr
Add ECR credential provider
2016-01-22 13:52:08 -08:00
Nikhil Jindal
471f24c96a Merge pull request #19883 from nikhiljindal/federatedServerExample
Adding an example for an independent apiserver
2016-01-22 13:43:05 -08:00
Ken Robertson
fff8a7c371 kubelet: Update engine version parsing to handle semantic versioning
This updates the dockertools.dockerVersion to use a semantic versioning
library to more gracefully support engine versions which include
additional version fields.

Previously, go-dockerclient's APIVersion struct was use which only
handles plain numeric x.y.z version strings. With #19675, the library
was now used on the Docker engine string, however it is possible for the
engine string to include include additional information for beta, rc, or
distro specific builds.

This PR also enables the TestDockerRuntimeVersion test which was
previously just a FIXME and updates it to pass, and be used to test the
version string that cause #20005.

This negates the need for fsouza/go-dockerclient#451, since even with
that change, if a user was running Docker 1.10.0-rc1, this would cause
the kubelet to report it as simply 1.10.0.
2016-01-22 13:18:52 -08:00
Dawn Chen
138f18cc92 Move test (Pods should not back-off restarting a container on LivenessProbe failure)
out of flaky test.

For last 100+ runs, the test never fail in kubernetes-e2e-gce-flaky build.
The only exception is build 10313, but the failure is caused by previous
flaky test, not this one itself.
2016-01-22 13:04:57 -08:00
Rudi Chiarito
bc0dd97a70 ECR credential provider 2016-01-22 15:03:25 -05:00
Rudi Chiarito
aa5e3ab4ca Add ECR dependencies and refresh Godeps 2016-01-22 15:03:25 -05:00
nikhiljindal
0ab152ddf2 Adding an example apiserver 2016-01-22 11:52:16 -08:00
Zach Loafman
07c304b43d build: Use Go version in cross image tag
The new tag format is cross-<go version>-<cross version>, starting
with cross-1.5.3-1.

Also bump the image we pre-pull / warn-on to 1.5.3

Fixes #19990
2016-01-22 11:45:41 -08:00
Zach Loafman
0b00928c74 Merge pull request #19987 from gmarek/remove-test
Remove monitor_resources test
2016-01-22 11:10:06 -08:00
Zach Loafman
b25d596a2b Merge pull request #19630 from mikedanese/fix-test
test: make ValidateCount throw an error instead of a log
2016-01-22 11:08:27 -08:00
Zach Loafman
7189db3701 Merge pull request #19396 from justinsb/aws_mountdevice
AWS: Use a strongly typed mountDevice
2016-01-22 11:04:23 -08:00