mirror of https://github.com/k3s-io/kubernetes.git synced 2026-01-24 05:59:02 +00:00

Files

Clayton Coleman 3eadd1a9ea Keep pod worker running until pod is truly complete

A number of race conditions exist when pods are terminated early in
their lifecycle because components in the kubelet need to know "no
running containers" or "containers can't be started from now on" but
were relying on outdated state.

Only the pod worker knows whether containers are being started for
a given pod, which is required to know when a pod is "terminated"
(no running containers, none coming). Move that responsibility and
podKiller function into the pod workers, and have everything that
was killing the pod go into the UpdatePod loop. Split syncPod into
three phases - setup, terminate containers, and cleanup pod - and
have transitions between those methods be visible to other
components. After this change, to kill a pod you tell the pod worker
to UpdatePod({UpdateType: SyncPodKill, Pod: pod}).

Several places in the kubelet were incorrect about whether they
were handling terminating (should stop running, might have
containers) or terminated (no running containers) pods. The pod worker
exposes methods that allow other loops to know when to set up or tear
down resources based on the state of the pod - these methods remove
the possibility of race conditions by ensuring a single component is
responsible for knowing each pod's allowed state and other components
simply delegate to checking whether they are in the window by UID.

Removing containers now no longer blocks final pod deletion in the
API server and are handled as background cleanup. Node shutdown
no longer marks pods as failed as they can be restarted in the
next step.

See https://docs.google.com/document/d/1Pic5TPntdJnYfIpBeZndDelM-AbS4FN9H2GTLFhoJ04/edit# for details

2021-07-06 15:55:22 -04:00

apimachinery

Update cronjob integration test to batch/v1

2021-06-04 21:04:21 +08:00

apps

Keep pod worker running until pod is truly complete

2021-07-06 15:55:22 -04:00

auth

Fix NodeAuthenticator tests in dual stack

2021-06-21 15:41:13 +03:00

autoscaling

Update dns autoscaler test to look at node allocatable instead of capacity

2021-05-18 16:33:49 -07:00

chaosmonkey

test: fix typo in chaosmonkey.go

2021-03-21 12:20:00 +09:00

cloud

BoundServiceAccountTokenVolume ga

2021-05-13 20:45:47 -07:00

common

Merge pull request #101464 from Nick-Triller/deflake_assert_events

2021-06-24 17:44:45 -07:00

framework

Merge pull request #103073 from ravisantoshgudimetla/add-e2e-minReadySeconds

2021-06-25 12:12:59 -07:00

generated

hack/update-bazel.sh

2021-02-28 15:17:29 -08:00

instrumentation

Add Snapshot Controller e2e metric tests

2021-05-20 23:29:23 -07:00

kubectl

Update cronjob integration test to batch/v1

2021-06-04 21:04:21 +08:00

lifecycle

fix(timezone): the timezone is standardized to UTC

2021-06-03 23:55:39 +08:00

network

bug fix in sctp test

2021-06-27 21:53:41 +05:30

node

Keep pod worker running until pod is truly complete

2021-07-06 15:55:22 -04:00

perftype

hack/update-bazel.sh

2021-02-28 15:17:29 -08:00

reporters

hack/update-bazel.sh

2021-02-28 15:17:29 -08:00

scheduling

don't panic if nodeIPs are not found

2021-06-21 10:59:09 +02:00

storage

Merge pull request #102576 from dobsonj/101911

2021-06-29 06:54:40 -07:00

testing-manifests

Update cronjob integration test to batch/v1

2021-06-04 21:04:21 +08:00

hack/update-bazel.sh

2021-02-28 15:17:29 -08:00

upgrades

Merge pull request #99348 from chymy/e2e-rck-unused

2021-04-28 21:02:50 -07:00

windows

Use the hostprocess feature flag and assign a user

2021-06-17 14:42:24 -07:00

e2e_test.go

Drop viper stuff in test/

2021-06-04 09:38:22 -04:00

e2e-example-config.json

…

e2e.go

refactor(e2e): grab metrics from controller-manager via nginx

2021-05-03 00:12:06 +08:00

README.md

test/e2e: Add ownership info to README

2021-03-17 16:48:11 -04:00

suites.go

Add Snapshot Controller e2e metric tests

2021-05-20 23:29:23 -07:00

README.md

test/e2e

This is home to e2e tests used for presubmit, periodic, and postsubmit jobs.

Some of these jobs are merge-blocking, some are release-blocking.

e2e test ownership

All e2e tests must adhere to the following policies:

the test must be owned by one and only one SIG
the test must live in/underneath a sig-owned package matching pattern: test/e2e/[{subpath}/]{sig}/..., e.g.
- test/e2e/auth - all tests owned by sig-auth
- test/e2e/common/storage - all tests common to cluster-level and node-level e2e tests, owned by sig-node
- test/e2e/upgrade/apps - all tests used in upgrade testing, owned by sig-apps
each sig-owned package should have an OWNERS file defining relevant approvers and labels for the owning sig, e.g.

# test/e2e/node/OWNERS
# See the OWNERS docs at https://go.k8s.io/owners

approvers:
- alice
- bob
- cynthia
emeritus_approvers:
- dave
reviewers:
- sig-node-reviewers
labels:
- sig/node

packages that use {subpath} should have an imports.go file importing sig-owned packages (for ginkgo's benefit), e.g.

// test/e2e/common/imports.go
package common

import (
	// ensure these packages are scanned by ginkgo for e2e tests
	_ "k8s.io/kubernetes/test/e2e/common/network"
	_ "k8s.io/kubernetes/test/e2e/common/node"
	_ "k8s.io/kubernetes/test/e2e/common/storage"
)

test ownership must be declared via a top-level SIGDescribe call defined in the sig-owned package, e.g.

// test/e2e/lifecycle/framework.go
package lifecycle

import "github.com/onsi/ginkgo"

// SIGDescribe annotates the test with the SIG label.
func SIGDescribe(text string, body func()) bool {
	return ginkgo.Describe("[sig-cluster-lifecycle] "+text, body)
}

// test/e2e/lifecycle/bootstrap/bootstrap_signer.go

package bootstrap

import (
	"github.com/onsi/ginkgo"
	"k8s.io/kubernetes/test/e2e/lifecycle"
)
var _ = lifecycle.SIGDescribe("[Feature:BootstrapTokens]", func() {
  /* ... */
  ginkgo.It("should sign the new added bootstrap tokens", func() {
    /* ... */
  })
  /* etc */
})

These polices are enforced:

via the merge-blocking presubmit job pull-kubernetes-verify
which ends up running hack/verify-e2e-test-ownership.sh
which can also be run via make verify WHAT=e2e-test-ownership

more info

See kubernetes/community/.../e2e-tests.md