Production-Grade Container Scheduling and Management
Go to file
Francesco Romani 2f426fdba6 devicemanager: checkpoint: support pre-1.20 data
The commit a8b8995ef2
changed the content of the data kubelet writes in the checkpoint.
Unfortunately, the checkpoint restore code was not updated,
so if we upgrade kubelet from pre-1.20 to 1.20+, the
device manager cannot anymore restore its state correctly.

The only trace of this misbehaviour is this line in the
kubelet logs:
```
W0615 07:31:49.744770    4852 manager.go:244] Continue after failing to read checkpoint file. Device allocation info may NOT be up-to-date. Err: json: cannot unmarshal array into Go struct field PodDevicesEntry.Data.PodDeviceEntries.DeviceIDs of type checkpoint.DevicesPerNUMA
```

If we hit this bug, the device allocation info is
indeed NOT up-to-date up until the device plugins register
themselves again. This can take up to few minutes, depending
on the specific device plugin.

While the device manager state is inconsistent:
1. the kubelet will NOT update the device availability to zero, so
   the scheduler will send pods towards the inconsistent kubelet.
2. at pod admission time, the device manager allocation will not
   trigger, so pods will be admitted without devices actually
   being allocated to them.

To fix these issues, we add support to the device manager to
read pre-1.20 checkpoint data. We retroactively call this
format "v1".

Signed-off-by: Francesco Romani <fromani@redhat.com>
2021-10-26 09:54:11 +02:00
.github Use github forms for issue templates 2021-10-05 09:55:38 -07:00
api Remove scheduler's legacy policy config 2021-10-24 21:58:12 -04:00
build [go1.17] Update to go1.17.2 2021-10-14 16:15:51 +08:00
CHANGELOG Add CVE 2021-25741 info to 1.22 Release Notes 2021-10-07 11:09:12 +03:00
cluster Merge pull request #92433 from claudiubelu/windows/etcd-image 2021-10-18 15:49:34 -07:00
cmd Merge pull request #105807 from jonyhy96/fix-magic-number 2021-10-22 04:46:37 -07:00
docs
hack add new line 2021-10-17 21:25:41 +05:30
LICENSES vendor sigs.k8s.io/json 2021-10-20 13:12:08 -04:00
logo
pkg devicemanager: checkpoint: support pre-1.20 data 2021-10-26 09:54:11 +02:00
plugin [PodSecurity] Implement metricRecorder for admission (#104217) 2021-10-20 20:02:08 -07:00
staging Merge pull request #105857 from liggitt/runAsNonRoot-runAsUser 2021-10-26 00:15:30 -07:00
test Merge pull request #105755 from bobbypage/npd-test-cg2 2021-10-25 17:59:29 -07:00
third_party Copy golang license to staging copies 2021-08-10 17:37:28 -04:00
vendor Merge pull request #105828 from ahg-g/ahg-legacy 2021-10-25 09:38:23 -07:00
.generated_files
.gitattributes
.gitignore
CHANGELOG.md
code-of-conduct.md
CONTRIBUTING.md
go.mod klog 2.30.0, logr 1.2.0, zapr 1.2.0 2021-10-22 15:32:20 +02:00
go.sum klog 2.30.0, logr 1.2.0, zapr 1.2.0 2021-10-22 15:32:20 +02:00
LICENSE
Makefile
Makefile.generated_files
OWNERS
OWNERS_ALIASES Merge pull request #104186 from pacoxu/patch-2 2021-10-05 15:24:57 -07:00
README.md
SECURITY_CONTACTS
SUPPORT.md

Kubernetes (K8s)

GoPkg Widget CII Best Practices


Kubernetes, also known as K8s, is an open source system for managing containerized applications across multiple hosts. It provides basic mechanisms for deployment, maintenance, and scaling of applications.

Kubernetes builds upon a decade and a half of experience at Google running production workloads at scale using a system called Borg, combined with best-of-breed ideas and practices from the community.

Kubernetes is hosted by the Cloud Native Computing Foundation (CNCF). If your company wants to help shape the evolution of technologies that are container-packaged, dynamically scheduled, and microservices-oriented, consider joining the CNCF. For details about who's involved and how Kubernetes plays a role, read the CNCF announcement.


To start using K8s

See our documentation on kubernetes.io.

Try our interactive tutorial.

Take a free course on Scalable Microservices with Kubernetes.

To use Kubernetes code as a library in other applications, see the list of published components. Use of the k8s.io/kubernetes module or k8s.io/kubernetes/... packages as libraries is not supported.

To start developing K8s

The community repository hosts all information about building Kubernetes from source, how to contribute code and documentation, who to contact about what, etc.

If you want to build Kubernetes right away there are two options:

You have a working Go environment.
mkdir -p $GOPATH/src/k8s.io
cd $GOPATH/src/k8s.io
git clone https://github.com/kubernetes/kubernetes
cd kubernetes
make
You have a working Docker environment.
git clone https://github.com/kubernetes/kubernetes
cd kubernetes
make quick-release

For the full story, head over to the developer's documentation.

Support

If you need support, start with the troubleshooting guide, and work your way through the process that we've outlined.

That said, if you have questions, reach out to us one way or another.