Commit Graph

3946 Commits

Author SHA1 Message Date
Marcin Wielgus
cdf60ee760 Merge pull request #27763 from wojtek-t/write_cluster_name
Write cluster name in prepare-upgrade
2016-06-21 11:34:54 +02:00
Wojciech Tyczynski
89a4e8850a Write cluster name in prepare-upgrade 2016-06-21 10:43:07 +02:00
k8s-merge-robot
3ddff2b240 Merge pull request #27740 from andyzheng0831/gci-fix
Automatic merge from submit-queue

GCI: fix kubectl permission issue #27643
2016-06-20 22:58:57 -07:00
k8s-merge-robot
70ad689df4 Merge pull request #27332 from nikhiljindal/awsSecrets
Automatic merge from submit-queue

federation: Creating kubeconfig files to be used for creating secrets for clusters on aws and gke

Extension of https://github.com/kubernetes/kubernetes/pull/26914 which created the kubeconfig files for gce clusters.
This PR extends it to AWS, vagrant and GKE.

The change for AWS and vagrant is exactly same as GCE.
For GKE, since `gcloud create clusters` creates kubeconfig, we are just copying the generated kubeconfig to the desired location

cc @kubernetes/sig-cluster-federation @colhom 
@roberthbailey for GKE
2016-06-20 21:13:21 -07:00
Andy Zheng
d05b14f718 GCI: fix kubectl permission issue #27643 2016-06-20 17:35:20 -07:00
k8s-merge-robot
fae7285b00 Merge pull request #27707 from yifan-gu/stage1_name
Automatic merge from submit-queue

rkt: Map kubelet's `--stage1-image` flag to rkt's `--stage1-name` flag.

This enables rkt to use cached stage1 image instead of unpacking the stage1 image every time for every pod.

After this change, users need to preload the stage1 images in order to enable rkt to find the stage1 image with the name specified by this flag.

Also, the cloud config is modified to pre-load the stage1 images.

cc @kubernetes/sig-rktnetes @kubernetes/sig-node
2016-06-20 16:40:09 -07:00
nikhiljindal
864b267a8d Simplifying the code to create kubeconfig 2016-06-20 16:10:47 -07:00
k8s-merge-robot
5c841556b8 Merge pull request #27724 from wonderfly/fix_unbound_var
Automatic merge from submit-queue

Make GCI_DOCKER_VERSION optional

[![Analytics](https://kubernetes-site.appspot.com/UA-36037335-10/GitHub/.github/PULL_REQUEST_TEMPLATE.md?pixel)]()

fixes #26813

This env var is only set in `config-test.sh` and trying to evaluate it with
`nounset` set will end up with an "unbound variable" error.

@girishkalele @dchen1107 This is a XS fix. Can you review?
2016-06-20 15:26:13 -07:00
nikhiljindal
9026195614 Updating the code to fix the context name before using it as secret and cluster names 2016-06-20 14:50:28 -07:00
nikhiljindal
657a7ef6a4 Create kubeconfig for cluster secrets on other cloud providers as well 2016-06-20 14:36:14 -07:00
k8s-merge-robot
a3f09700e5 Merge pull request #27409 from adityakali/logrotate.1
Automatic merge from submit-queue

add logrotate service and configuration for GCI

This change mirrors the configuration in cluster/saltbase/salt/logrotate for GCI.

On GCI we use systemd timers (https://www.freedesktop.org/software/systemd/man/systemd.timer.html) and install an hourly timer - kube-logrotate.timer. This will invoke kube-logrotate.service (which calls /usr/sbin/logrotate) once every hour to perform log rotation as per the rotation rules installed under /etc/logrotate.d/.

@kubernetes/goog-image @zmerlynn @dchen1107 @andyzheng0831
2016-06-20 13:40:57 -07:00
Daniel Wang
0acb2dc28a Make GCI_DOCKER_VERSION optional
fixes #26813

This env var is only set in `config-test.sh` and trying to evaluate it with
`nounset` set will end up with an "unbound variable" error.
2016-06-20 11:22:14 -07:00
Yifan Gu
fc31c8d6f0 cluster/gce: Preload rkt stage1 images and set the default --rkt-stage1-image. 2016-06-20 10:39:46 -07:00
Yifan Gu
4e9298cd99 rkt: Map kubelet's --stage1-image flag to rkt's --stage1-name flag.
This enables rkt to use cached stage1 image instead of unpacking the
stage1 image every time for every pod.

After this change, users need to preload the stage1 images in order to
enable rkt to find the stage1 image with the name specified by this flag.
2016-06-20 10:39:46 -07:00
k8s-merge-robot
3b57f868d3 Merge pull request #27542 from piosz/heapster-version
Automatic merge from submit-queue

Bumped Heapster to v1.1.0

```release-note
Bumped Heapster to v1.1.0.
More details about the release https://github.com/kubernetes/heapster/releases/tag/v1.1.0
```

Version 1.1.0 doesn't differ from the previous one v1.1.0-beta2 significantly - just fixed few small issues during stabilization period.

cc @a-robinson @jszczepkowski @mwielgus 

[![Analytics](https://kubernetes-site.appspot.com/UA-36037335-10/GitHub/.github/PULL_REQUEST_TEMPLATE.md?pixel)]()
2016-06-20 04:40:27 -07:00
k8s-merge-robot
b442ba72e5 Merge pull request #27522 from adityakali/gci53
Automatic merge from submit-queue

make GCI image detection robust

This change makes sure that in case we roll back a released GCI image, the image detection logic picks a correct active image. 

@kubernetes/goog-image @Amey-D @wonderfly @dchen1107
2016-06-20 04:05:45 -07:00
k8s-merge-robot
5debaf7b88 Merge pull request #27253 from luxas/latest_hyperkube_addons
Automatic merge from submit-queue

Update to dnsmasq:1.3 and make hyperkube always use the latest addons

This bumps dnsmasq to a version that works on all architectures: https://github.com/kubernetes/contrib/pull/1192 (which have to be pushed first indeed)

Also I removed the manifests in hyperkube addons in favor for machine-generated ones, which will avoid mistakes.

This one is required for `v1.3`, so it has to be cherrypicked I think...

It makes docker and docker-multinode addons work again...
(Yes, we'll probably get rid of docker in favor for minikube, but we'll have to have it in this release at least)

@girishkalele @thockin @ArtfulCoder @david-mcmahon @bgrant0607 @mikedanese
2016-06-19 22:19:09 -07:00
k8s-merge-robot
530001c1fd Merge pull request #26778 from gmarek/validate-cluster
Automatic merge from submit-queue

Validate-cluster finishes shortly after at most ALLOWED_NOTREADY_NODE…

Fixes #26691

cc @wojtek-t @lavalamp
2016-06-19 21:40:54 -07:00
k8s-merge-robot
2274359ec1 Merge pull request #27518 from justinsb/kubedown_warn_if_no_vpc
Automatic merge from submit-queue

AWS kube-down: Issue warning if VPC not found

To address issue #17219
2016-06-19 19:53:20 -07:00
k8s-merge-robot
a025e20f93 Merge pull request #27302 from zmerlynn/faster-large-down
Automatic merge from submit-queue

gce/kube-down: Parallelize IGM deletion, batch more

Batch up to 200 instance / route deletions, handle IGM deletion in parallel.

[![Analytics](https://kubernetes-site.appspot.com/UA-36037335-10/GitHub/.github/PULL_REQUEST_TEMPLATE.md?pixel)]()
2016-06-19 18:04:13 -07:00
k8s-merge-robot
368704dc71 Merge pull request #27185 from gmarek/heapster-cpu
Automatic merge from submit-queue

Enable dynamic allocation of heapster/eventer cpu request/limit

cc @mwielgus @piosz @zmerlynn
2016-06-19 16:48:49 -07:00
Lucas Käldström
6c53c6a997 Make hyperkube work on all platforms, bump dnsmasq to 1.3 and addon-manager to v3, and always use the latest manifests 2016-06-19 17:15:17 +03:00
Justin Santa Barbara
d62fe2a315 AWS kube-up: set net.ipv4.neigh.default.gc_thresh1=0
This works around a linux kernel bug with overly aggressive caching of
ARP entries, which was causing problems when we reused IP addresses in
VPCs, for example with an ASG in a relatively small subnet.

See #23395 for more explanation.

Fixes #23395
2016-06-19 00:20:16 -04:00
k8s-merge-robot
6bdb9b0b8d Merge pull request #27070 from girishkalele/exechealthz-quiet
Automatic merge from submit-queue

DNS healthz container - add quiet option to stop flooding logs
2016-06-18 17:54:57 -07:00
k8s-merge-robot
13cd6475c9 Merge pull request #26917 from fabioy/fix_multi_migs
Automatic merge from submit-queue

Handle multiple MIGs (single-zone) properly in GKE scripts.

Change the scripts for GKE to handle multiple node pools (MIGs) in the same zone. 

Fixes https://github.com/kubernetes/test-infra/issues/100.

[![Analytics](https://kubernetes-site.appspot.com/UA-36037335-10/GitHub/.github/PULL_REQUEST_TEMPLATE.md?pixel)]()
2016-06-18 11:36:08 -07:00
k8s-merge-robot
48b255d424 Merge pull request #26813 from wonderfly/choose_docker_version
Automatic merge from submit-queue

Prep for continuous Docker validation test

```release-note
Add a test config variable to specify desired Docker version to run on GCI.
```
We want to continuously validate Docker releases (#25215), on GCI. This change
adds a new test config variable, `KUBE_GCI_DOCKER_VERSION`, through which we can
specify which version of Docker we want to run on the master and nodes. This
change also patches the Jenkins e2e-runner with the ability to fetch the latest
Docker (pre)release, and sets the aforementioned variable accordingly.

Tested on my local Jenkins instance that was able to start a cluster with the latest Docker version (different from installed version) running on both master and nodes.

@dchen1107 Can you review?

cc/ @andyzheng0831 for changes in `cluster/gce/gci/helper.sh`, and @ixdy @spxtr for changes to the Jenkins e2e-runner

cc/ @kubernetes/goog-image
2016-06-18 11:00:55 -07:00
k8s-merge-robot
db904257f9 Merge pull request #27599 from dchen1107/gci
Automatic merge from submit-queue

Fix docker log level on GCI node.

Fix #27584

cc/ @a-robinson
2016-06-17 20:21:05 -07:00
k8s-merge-robot
808f3ecbe6 Merge pull request #27220 from yifan-gu/kube-up-master
Automatic merge from submit-queue

cluster/gce/coreos: Make kube-up works for both rkt and docker on coreos on gce

With this PR, kube-up will be able to spin up a pure rkt cluster given the choice `KUBE_CONTAINER_RUNTIME=rkt`

e.g. 
```
export KUBE_GCE_ZONE=us-east1-b
export KUBE_OS_DISTRIBUTION=coreos

export KUBE_GCE_MASTER_PROJECT=coreos-cloud
export KUBE_GCE_MASTER_IMAGE=coreos-alpha-1032-0-0-v20160428

export KUBE_GCE_NODE_PROJECT=coreos-cloud
export KUBE_GCE_NODE_IMAGE=coreos-alpha-1032-0-0-v20160428


export KUBE_ENABLE_NODE_LOGGING=false
export KUBE_ENABLE_CLUSTER_MONITORING=none

export KUBE_CONTAINER_RUNTIME=rkt
export KUBE_RKT_VERSION=v1.8.0
```

```
$ cluster/kube-up.sh
...
$ kubectl cluster-info
Kubernetes master is running at https://104.196.41.124
GLBCDefaultBackend is running at https://104.196.41.124/api/v1/proxy/namespaces/kube-system/services/default-http-backend
Elasticsearch is running at https://104.196.41.124/api/v1/proxy/namespaces/kube-system/services/elasticsearch-logging
Kibana is running at https://104.196.41.124/api/v1/proxy/namespaces/kube-system/services/kibana-logging
KubeDNS is running at https://104.196.41.124/api/v1/proxy/namespaces/kube-system/services/kube-dns
kubernetes-dashboard is running at https://104.196.41.124/api/v1/proxy/namespaces/kube-system/services/kubernetes-dashboard
```

```
$ kubectl get pods --all-namespaces
NAMESPACE     NAME                                            READY     STATUS    RESTARTS   AGE
kube-system   elasticsearch-logging-v1-5zfrd                  1/1       Running   0          2m
kube-system   elasticsearch-logging-v1-83u6w                  1/1       Running   0          2m
kube-system   etcd-server-events-yifan-test-rkt-master        1/1       Running   0          2m
kube-system   etcd-server-yifan-test-rkt-master               1/1       Running   0          2m
kube-system   kibana-logging-v1-0g7yu                         1/1       Running   2          2m
kube-system   kube-addon-manager-yifan-test-rkt-master        1/1       Running   0          3m
kube-system   kube-apiserver-yifan-test-rkt-master            1/1       Running   0          2m
kube-system   kube-controller-manager-yifan-test-rkt-master   1/1       Running   0          3m
kube-system   kube-dns-v14-1mqco                              3/3       Running   0          2m
kube-system   kube-scheduler-yifan-test-rkt-master            1/1       Running   0          2m
kube-system   kubernetes-dashboard-v1.1.0-beta2-uwutn         1/1       Running   0          2m
kube-system   l7-lb-controller-v0.6.0-8pgbo                   2/2       Running   0          2m
kube-system   node-problem-detector-v0.1-7iwb2                1/1       Running   0          2m
kube-system   node-problem-detector-v0.1-k4m8o                1/1       Running   0          2m
kube-system   node-problem-detector-v0.1-rxtp8                1/1       Running   0          2m
kube-system   node-problem-detector-v0.1-wsoqd                1/1       Running   0          2m

```


Fix #24103 

cc @kubernetes/sig-node @kubernetes/rktnetes-maintainers
2016-06-17 19:39:22 -07:00
k8s-merge-robot
7ab303efbe Merge pull request #27466 from madhusudancs/fed-dns-scope
Automatic merge from submit-queue

Move GCE NODE_SCOPES to config-common.sh and enable DNS scope in GKE.

Follow up to #27357

cc @colhom @kubernetes/sig-cluster-federation 

[![Analytics](https://kubernetes-site.appspot.com/UA-36037335-10/GitHub/.github/PULL_REQUEST_TEMPLATE.md?pixel)]()
2016-06-17 17:02:11 -07:00
k8s-merge-robot
2266c4eb1d Merge pull request #27371 from andyzheng0831/network-plugin
Automatic merge from submit-queue

Revert "Revert "GCI: add support for network plugin""

PR #27027 added the network plugin support in GCI config, but later a bug in the network plugin broke e2e tests (see issue #27118). The bug was fixed by #27141 and we have been repeatedly run the serial e2e tests more than 10 times to verify the fix. Now it should be safe to put the GCI network plugin support back.

We will first merge in the master branch and monitor the Jenkins serial tests for a while and then cherry-pick it into release-1.3 branch.
2016-06-17 14:03:18 -07:00
k8s-merge-robot
d4d6184931 Merge pull request #27627 from justinsb/fix_23390
Automatic merge from submit-queue

kube-up multizone: don't print scary warning

The node-count check gets confused when there are more nodes that we
launched, which is normal with KUBE_USE_EXISTING_MASTER.

This fix just suppresses the error message in that case.

Fix #23390
2016-06-17 11:24:29 -07:00
Justin Santa Barbara
bab7a255b7 kube-up multizone: don't print scary warning
The node-count check gets confused when there are more nodes that we
launched, which is normal with KUBE_USE_EXISTING_MASTER.

This fix just suppresses the error message in that case.

Fix #23390
2016-06-17 13:00:50 -04:00
Aditya Kali
4a46c948f2 make GCI image detection robust 2016-06-17 09:56:53 -07:00
Marcin Wielgus
9031e4780c Bump cluster autoscaler to 0.2.0 2016-06-17 16:31:25 +02:00
Dawn Chen
58684a56be Fix docker log level on GCI node. 2016-06-16 20:54:42 -07:00
Yifan Gu
d96c7edc34 cluster/saltbase: Enable rkt to load master components images. 2016-06-16 15:18:52 -07:00
Yifan Gu
16d69f9f37 cluster/gce/coreos: Update addon manifests, add node-problem-detector.
Also bumped CNI version.
2016-06-16 15:18:52 -07:00
Yifan Gu
f3c5d4e62b cluster/gce/coreos: Update the cloud config and scripts to enable rkt runtime. 2016-06-16 15:18:52 -07:00
Piotr Szczesniak
8fff5319db Bumped Heapster to v1.1.0 2016-06-16 20:41:28 +02:00
Girish Kalele
ee7ca66dba Add nethealth prepull container output to e2e run logs 2016-06-16 11:03:09 -07:00
Jerzy Szczepkowski
390a30b130 Updated version of resource consumer in image puller.
Updated version of resource consumer in image puller.
2016-06-16 10:57:21 +02:00
Justin Santa Barbara
e58a16e95a AWS kube-down: Issue warning if VPC not found
Issue #17219
2016-06-15 23:53:57 -04:00
k8s-merge-robot
38cd0d078e Merge pull request #27311 from vmware/photon-controller-fixes
Automatic merge from submit-queue

Fixes and improvements to Photon Controller backend for kube-up

- Improve reliability of network address detection by using MAC
  address. VMware has a MAC OUI that reliably distinguishes the VM's
  NICs from the other NICs (like the CBR). This doesn't rely on the
  unreliable reporting of the portgroup.
- Persist route changes. We configure routes on the master and nodes,
  but previously we didn't persist them so they didn't last across
  reboots. This persists them in /etc/network/interfaces
- Fix regression that didn't configure auth for kube-apiserver with
  Photon Controller.
- Reliably run apt-get update: Not doing this can cause apt to fail.
- Remove unused nginx config in salt
2016-06-15 14:46:54 -07:00
Madhusudan.C.S
3f5fbee19d Move GCE NODE_SCOPES to config-common.sh and enable DNS scope in GKE. 2016-06-15 11:36:07 -07:00
gmarek
8617f70add Adjust heapster cpu request/limit 2016-06-15 13:34:58 +02:00
k8s-merge-robot
3c822c0b3c Merge pull request #27357 from madhusudancs/fed-dns-scope
Automatic merge from submit-queue

Add Google Cloud DNS auth scope to GCE VMs when they are started in a federation.

cc @colhom @kubernetes/sig-cluster-federation 

[![Analytics](https://kubernetes-site.appspot.com/UA-36037335-10/GitHub/.github/PULL_REQUEST_TEMPLATE.md?pixel)]()
2016-06-15 00:00:54 -07:00
Aditya Kali
a423b83c06 add logrotate service and configuration for GCI
This change mirrors the configuration in cluster/saltbase/salt/logrotate
for GCI.
2016-06-14 19:01:58 -07:00
Alain Roy
5740ceb7f6 Fixes and improvements to Photon Controller backend for kube-up
- Improve reliability of network address detection by using MAC
  address. VMware has a MAC OUI that reliably distinguishes the VM's
  NICs from the other NICs (like the CBR). This doesn't rely on the
  unreliable reporting of the portgroup.
- Persist route changes. We configure routes on the master and nodes,
  but previously we didn't persist them so they didn't last across
  reboots. This persists them in /etc/network/interfaces
- Fix regression that didn't configure auth for kube-apiserver with
  Photon Controller.
- Reliably run apt-get update: Not doing this can cause apt to fail.
- Remove unused nginx config in salt
2016-06-14 15:40:33 -07:00
Zach Loafman
1fde6f867c GKE provider: Add a HEAPSTER_MACHINE_TYPE option
Allows heterogeneous cluster configurations for large cluster
synthetic testing.
2016-06-14 14:39:08 -07:00
Andy Zheng
e6b744c85a Revert "Revert "GCI: add support for network plugin""
This reverts commit 8207eddd99.
2016-06-14 09:52:34 -07:00