118370 Commits

Author SHA1 Message Date
Kubernetes Prow Robot
51a8ee26f2
Merge pull request #120567 from skitt/drop-deprecated-pointer-kubeadm
kubeadm: drop deprecated pointer package
2023-09-11 20:18:10 -07:00
Kubernetes Prow Robot
95c6295636
Merge pull request #120279 from BenTheElder/gotoolchain
handle GOTOOLCHAIN in kube::golang::verify_go_version
2023-09-11 17:04:10 -07:00
Kubernetes Prow Robot
57768fd3d0
Merge pull request #120577 from alculquicondor/job_latency_res
Increase range of job_sync_duration_seconds
2023-09-11 16:00:12 -07:00
Aldo Culquicondor
6b4ab616a2
Increase range of job_sync_duration_seconds
Change-Id: I7ed4b006faecf0a7e6e583c42b4d6bc4b786a164
2023-09-11 18:01:33 -04:00
Kubernetes Prow Robot
4cd21ae6ce
Merge pull request #113333 from rphillips/add_to_reviewer
sig-node: add rphillips to reviewers
2023-09-11 15:00:11 -07:00
Benjamin Elder
d765e40a98 pass GOTOOLCHAIN to build containers 2023-09-11 14:20:19 -07:00
Kubernetes Prow Robot
b7e3b9f7d9
Merge pull request #120508 from aojea/aojea_owner
add aojea as client-go approver
2023-09-11 13:12:11 -07:00
Ryan Phillips
43b434f66a sig-node: add rphillips to reviewers 2023-09-11 14:34:30 -05:00
Benjamin Elder
d1b5a9953a handle GOTOOLCHAIN in kube::golang::verify_go_version
for now:
- shim FORCE_HOST_GO to GOTOOLCHAIN=local
- treat GOTOOLCHAIN set and !=auto like FORCE_HOST_GO
- otherwise set GOTOOLCHAIN=go${GO_VERSION} and fall back to gimme if necessary

TODO: set toolchain statements in go.mod files and keep them in sync
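The three rules above can be read as a small decision function. The following is a minimal sketch of that reading in Go, not the shell code from kube::golang::verify_go_version itself; the function name resolveToolchain, its return values, and the version string "1.21.1" are assumptions made for illustration, and the gimme fallback is left out.

```go
package main

import "fmt"

// resolveToolchain is a hypothetical helper. Its arguments stand in for the
// FORCE_HOST_GO, GOTOOLCHAIN, and GO_VERSION variables named in the commit
// message. It returns the GOTOOLCHAIN value to export and whether the
// caller's own Go setup is trusted instead of a pinned version.
func resolveToolchain(forceHostGo, goToolchain, goVersion string) (toolchain string, trustHost bool) {
	// Rule 1: FORCE_HOST_GO is shimmed to GOTOOLCHAIN=local.
	if forceHostGo != "" {
		return "local", true
	}
	// Rule 2: an explicit GOTOOLCHAIN other than "auto" is treated like
	// FORCE_HOST_GO, so the value is kept and no version is pinned.
	if goToolchain != "" && goToolchain != "auto" {
		return goToolchain, true
	}
	// Rule 3: otherwise pin the toolchain to the expected Go version.
	// (The fallback to gimme mentioned in the message is not modeled here.)
	return "go" + goVersion, false
}

func main() {
	// "1.21.1" is an illustrative version, not taken from the commit.
	fmt.Println(resolveToolchain("y", "", "1.21.1"))
	fmt.Println(resolveToolchain("", "go1.20.8", "1.21.1"))
	fmt.Println(resolveToolchain("", "auto", "1.21.1"))
}
```

Returning a flag alongside the value keeps the "trust the host" cases separate from the pinned case, which is where a fallback download would be attempted.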
2023-09-11 12:04:45 -07:00
Kubernetes Prow Robot
be968597e3
Merge pull request #120310 from gjkim42/use-container-log-instead-of-termination-log
Use container log instead of termination log
2023-09-11 11:52:23 -07:00
Kubernetes Prow Robot
74f6c263d8
Merge pull request #118544 from sohankunkerkar/remove-sandbox-image-ref
pkg/kubelet: allow sandbox image pinning from CRI
2023-09-11 11:52:12 -07:00
Kubernetes Prow Robot
aa4ec3c5b0
Merge pull request #119944 from Sharpz7/jm/backup-finalizers
Adding backup code for removing finalizers to more Job End States.
2023-09-11 09:30:30 -07:00
Lars Ekman
0df4a69f5c
Kube-proxy: Get nodeIPs for both families with dual-stack (#119525)
* Kube-proxy: handle dual-stack in detectNodeIPs()

* Updates
2023-09-11 09:30:23 -07:00
Kensei Nakada
0d3eafdfa3
fix(scheduling_queue): always put Pods with no unschedulable plugins into activeQ/backoffQ (#119105)
* always put Pods with no unschedulable plugins into activeQ/backoffQ

* address review comments
2023-09-11 09:30:11 -07:00
Kubernetes Prow Robot
b1161a8ac4
Merge pull request #120559 from pohly/e2e-framework-WaitForPodsResponding-retry
e2e pods: fix WaitForPodsResponding retry
2023-09-11 07:52:10 -07:00
Stephen Kitt
e2c1c0d34a
kubeadm: drop deprecated pointer package
This replaces deprecated k8s.io/utils/pointer functions with their ptr
equivalent.

Signed-off-by: Stephen Kitt <skitt@redhat.com>
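As a rough illustration of the kind of replacement described, the sketch below contrasts a per-type pointer helper with a single generic one. Int32 and To are local stand-ins defined here so the example is self-contained; they mirror the shape of the helpers in k8s.io/utils/pointer and k8s.io/utils/ptr but are not those packages.

```go
package main

import "fmt"

// Int32 is a local stand-in for a per-type helper of the deprecated style.
func Int32(i int32) *int32 { return &i }

// To is a local stand-in for the generic replacement: it returns a pointer
// to a copy of v for any type T.
func To[T any](v T) *T { return &v }

func main() {
	// Old style: one helper per type.
	oldReplicas := Int32(3)

	// New style: one generic helper; the type argument can be given
	// explicitly or inferred from the value.
	newReplicas := To[int32](3)
	enabled := To(true)

	fmt.Println(*oldReplicas == *newReplicas, *enabled)
}
```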
2023-09-11 16:41:12 +02:00
Gunju Kim
1fb4eee94e
Use container log instead of termination log
Since the termination log cannot be accessed until the container is
terminated, use the container log.
2023-09-11 22:55:09 +09:00
Kubernetes Prow Robot
6c578bc982
Merge pull request #120428 from pohly/dra-scheduler-reallocation-flake
DRA: scheduler reallocation flake
2023-09-11 02:56:12 -07:00
Patrick Ohly
fc3ee07b51 e2e pods: fix WaitForPodsResponding retry
The status error was embedded inside the new error constructed by
WaitForPodsResponding's get function, but not wrapped. Therefore
`apierrors.IsServiceUnavailable(err)` didn't find it and returned false -> no
retries.

Wrapping fixes this and Gomega formatting of the error remains useful:

	err := &errors.StatusError{}
	err.ErrStatus.Code = 503
	err.ErrStatus.Message = "temporary failure"

	err2 := fmt.Errorf("Controller %s: failed to Get from replica pod %s:\n%w\nPod status:\n%s",
		"foo", "bar",
		err, "some status")
	fmt.Println(format.Object(err2, 1))
	fmt.Println(errors.IsServiceUnavailable(err2))

=>

    <*fmt.wrapError | 0xc000139340>:
    Controller foo: failed to Get from replica pod bar:
    temporary failure
    Pod status:
    some status
    {
        msg: "Controller foo: failed to Get from replica pod bar:\ntemporary failure\nPod status:\nsome status",
        err: <*errors.StatusError | 0xc0001a01e0>{
            ErrStatus: {
                TypeMeta: {Kind: "", APIVersion: ""},
                ListMeta: {
                    SelfLink: "",
                    ResourceVersion: "",
                    Continue: "",
                    RemainingItemCount: nil,
                },
                Status: "",
                Message: "temporary failure",
                Reason: "",
                Details: nil,
                Code: 503,
            },
        },
    }

    true
2023-09-11 11:54:15 +02:00
Patrick Ohly
6f9140e421 DRA scheduler: stop allocating before deallocation
This fixes a test flake:

    [sig-node] DRA [Feature:DynamicResourceAllocation] multiple nodes reallocation [It] works
    /nvme/gopath/src/k8s.io/kubernetes/test/e2e/dra/dra.go:552

      [FAILED] number of deallocations
      Expected
          <int64>: 2
      to equal
          <int64>: 1
      In [It] at: /nvme/gopath/src/k8s.io/kubernetes/test/e2e/dra/dra.go:651 @ 09/05/23 14:01:54.652

This can be reproduced locally with

    stress -p 10 go test ./test/e2e -args -ginkgo.focus=DynamicResourceAllocation.*reallocation.works -ginkgo.no-color -v=4 -ginkgo.v

Log output showed that the sequence of events leading to this was:
- claim gets allocated because of selected node
- a different node has to be used, so PostFilter sets
  claim.status.deallocationRequested
- the driver deallocates
- before the scheduler can react and select a different node,
  the driver allocates *again* for the original node
- the scheduler asks for deallocation again
- the driver deallocates again (causing the test failure)
- eventually the pod runs

The fix is to disable allocations first by removing the selected node and then
starting to deallocate.
2023-09-11 10:56:17 +02:00
Kubernetes Prow Robot
cc0a24d2e8
Merge pull request #120406 from wlq1212/cheanup/framework/timeout
e2e_framework: stop using deprecated wait.ErrWaitTimeout
2023-09-10 21:10:10 -07:00
Kubernetes Prow Robot
0ee315b94c
Merge pull request #120375 from pegasas/proxy
Improve logging on kube-proxy exit
2023-09-10 12:08:10 -07:00
Kubernetes Prow Robot
098d4c7b9e
Merge pull request #120546 from SaumyaBhushan/issue
added documentation about the format of certificateKey
2023-09-10 10:26:10 -07:00
pegasas
f446745777 Improve logging on kube-proxy exit 2023-09-11 00:50:29 +08:00
Kubernetes Prow Robot
25c7a1439a
Merge pull request #120069 from aojea/service_conformance
promote to conformance Service multiprotocol tests
2023-09-10 07:26:09 -07:00
SaumyaBhushan
df5c1bb1ea added documentation about the format of certificateKey
Signed-off-by: SaumyaBhushan <saumya.bhushan666@gmail.com>
2023-09-10 19:50:42 +05:30
Kubernetes Prow Robot
49768134e5
Merge pull request #119754 from pbxqdown/kubelet-fix-typo
Fix some typos in kubelet component source code
2023-09-09 19:36:11 -07:00
Kubernetes Prow Robot
b343878daa
Merge pull request #120438 from ritazh/kmsv2-metrics-apiserverid
kmsv2: add apiserver identity to metrics
2023-09-09 16:46:09 -07:00
Rita Zhang
43ccf6c4e8
kmsv2: add apiserver identity to metrics
Signed-off-by: Rita Zhang <rita.z.zhang@gmail.com>
2023-09-09 15:31:32 -07:00
Kubernetes Prow Robot
33c5bd631d
Merge pull request #120008 from skitt/drop-intstr-ptr-wrappers
Use ptr.To to retrieve intstr addresses
2023-09-09 07:24:09 -07:00
Kubernetes Prow Robot
fd8f2c7fc6
Merge pull request #120541 from pacoxu/kubeadm-fix-hash
kubeadm: add log for static pod manifest diff
2023-09-09 06:08:08 -07:00
Paco Xu
b443a841e3 kubeadm: add log for static pod manifest diff 2023-09-09 20:00:31 +08:00
Kubernetes Prow Robot
21f7bf66fa
Merge pull request #120272 from tzneal/add-tzneal-to-sig-node-reviewers
OWNERS_ALIASES: add tzneal to sig-node-reviewer
2023-09-08 18:28:11 -07:00
Kubernetes Prow Robot
37cf2638c9
Merge pull request #119619 from skitt/intstr-parse-parseint
Limit intstr.Parse() to 32-bit integer parsing
2023-09-08 13:04:29 -07:00
Kubernetes Prow Robot
41689233b4
Merge pull request #120334 from pohly/scheduler-clear-unschedulable-plugins
scheduler: avoid false "unschedulable" pod state
2023-09-08 12:01:23 -07:00
Kubernetes Prow Robot
817488e4fa
Merge pull request #120082 from aojea/hostnetwork_services_fallback
e2e network test for udp services with hostNetwork clients
2023-09-08 12:01:12 -07:00
Kubernetes Prow Robot
bec95ed575
Merge pull request #120527 from cpanato/bump-distroless
Bump distroless-iptables to v0.3.2
2023-09-08 09:36:29 -07:00
Kubernetes Prow Robot
15a019d841
Merge pull request #120526 from cpanato/update-prom
[releng] Update publishing-bot rules for active release branches that use go1.20 to Go 1.20.8
2023-09-08 09:36:18 -07:00
Kubernetes Prow Robot
d7aeb7f853
Merge pull request #120524 from jprzychodzen/kcm-args
[cluster/gce] Add possibility to specify KCM specific args for scalability tests
2023-09-08 08:24:26 -07:00
Kubernetes Prow Robot
f6a87aebe6
Merge pull request #120499 from tukwila/gorilla/websocket_v1.5.0
bump: upgrade gorilla/websocket from v1.4.2 to v1.5.0
2023-09-08 08:24:15 -07:00
Patrick Ohly
4e73634b53 scheduler: start scheduling attempt with clean UnschedulablePlugins
When some plugin was registered as "unschedulable" in some previous scheduling
attempt, it kept that attribute for a pod forever. When that plugin then later
failed with an error that requires backoff, the pod was incorrectly moved to the
"unschedulable" queue where it got stuck until the periodic flushing because
there was no event that the plugin was waiting for.

Here's an example where that happened:

    framework.go:1280: E0831 20:03:47.184243] Reserve/DynamicResources: Plugin failed err="Operation cannot be fulfilled on podschedulingcontexts.resource.k8s.io \"test-dragxd5c\": the object has been modified; please apply your changes to the latest version and try again" node="scheduler-perf-dra-7l2v2" plugin="DynamicResources" pod="test/test-dragxd5c"
    schedule_one.go:1001: E0831 20:03:47.184345] Error scheduling pod; retrying err="running Reserve plugin \"DynamicResources\": Operation cannot be fulfilled on podschedulingcontexts.resource.k8s.io \"test-dragxd5c\": the object has been modified; please apply your changes to the latest version and try again" pod="test/test-dragxd5c"
    ...
    scheduling_queue.go:745: I0831 20:03:47.198968] Pod moved to an internal scheduling queue pod="test/test-dragxd5c" event="ScheduleAttemptFailure" queue="Unschedulable" schedulingCycle=9576 hint="QueueSkip"

Pop still needs the information about unschedulable plugins to update the
UnschedulableReason metric. It can reset that information before returning the
PodInfo for the next scheduling attempt.
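The idea of reading the stale plugin set once in Pop and then clearing it can be sketched as follows. This is a simplified illustration under assumptions, not the scheduler's code: podInfo, queue, and the reasons counter are hypothetical, and how the real UnschedulableReason metric is updated is not shown in this message, so a plain per-plugin counter stands in for it.

```go
package main

import "fmt"

// podInfo is a hypothetical, simplified record of a queued pod.
type podInfo struct {
	Name                 string
	UnschedulablePlugins map[string]struct{}
}

// queue is a hypothetical FIFO queue. reasons stands in for the
// UnschedulableReason metric, keyed by plugin name.
type queue struct {
	items   []*podInfo
	reasons map[string]int
}

// Pop returns the next pod, or nil if the queue is empty. It first uses the
// plugins recorded by the previous attempt to update the metric stand-in,
// then resets them so the next attempt starts with a clean set.
func (q *queue) Pop() *podInfo {
	if len(q.items) == 0 {
		return nil
	}
	p := q.items[0]
	q.items = q.items[1:]
	for plugin := range p.UnschedulablePlugins {
		q.reasons[plugin]++
	}
	p.UnschedulablePlugins = map[string]struct{}{}
	return p
}

func main() {
	q := &queue{reasons: map[string]int{}}
	q.items = append(q.items, &podInfo{
		Name:                 "test/test-dragxd5c",
		UnschedulablePlugins: map[string]struct{}{"DynamicResources": {}},
	})
	p := q.Pop()
	fmt.Println(p.Name, len(p.UnschedulablePlugins), q.reasons["DynamicResources"])
}
```

With the set cleared at Pop, a later failure that only needs backoff is no longer attributed to a plugin from an earlier attempt.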
2023-09-08 16:52:36 +02:00
Sharpz7
7e4b5d0d49 Final Fix 2023-09-08 14:44:22 +00:00
Kubernetes Prow Robot
2bfe0fc798
Merge pull request #120523 from shiftstack/cloud-provider/additional-flags-in-usage
cloud-provider: Log additional options
2023-09-08 06:28:14 -07:00
cpanato
f499b4f891
Bump distroless-iptables to v0.3.2
Signed-off-by: cpanato <ctadeu@gmail.com>
2023-09-08 15:01:22 +02:00
cpanato
18932d970e
[releng] Update publishing-bot rules for active release branches that use go1.20 to Go 1.20.8
Signed-off-by: cpanato <ctadeu@gmail.com>
2023-09-08 14:58:31 +02:00
Kubernetes Prow Robot
4c0d37e767
Merge pull request #120521 from SataQiu/clean-kubeadm-20230908
kubeadm: remove 'system:masters' organization from apiserver-etcd-client certificate
2023-09-08 05:26:25 -07:00
Kubernetes Prow Robot
4c43a25e6a
Merge pull request #119274 from tukwila/test_addons
kubeadm: add tests in ./cmd/phases/init/addons_test.go
2023-09-08 05:26:13 -07:00
Kubernetes Prow Robot
c9b35b7a27
Merge pull request #120517 from Xunzhuo/patch-2
fix: update thirdpartyresources refs
2023-09-08 04:18:25 -07:00
Kubernetes Prow Robot
1773a162bf
Merge pull request #120484 from carlory/fix-120475
fix breaking pv creation when the validation webhook exists
2023-09-08 04:18:14 -07:00
Jakub Przychodzeń
93e3b07e11 [cluster/gce] Add KCM specific args
Add possibility to add flags to KCM, without adding flags to CCM
2023-09-08 11:12:05 +00:00