Commit Graph

118360 Commits

Author SHA1 Message Date
Ryan Phillips
43b434f66a sig-node: add rphillips to reviewers 2023-09-11 14:34:30 -05:00
Kubernetes Prow Robot
be968597e3
Merge pull request #120310 from gjkim42/use-container-log-instead-of-termination-log
Use container log instead of termination log
2023-09-11 11:52:23 -07:00
Kubernetes Prow Robot
74f6c263d8
Merge pull request #118544 from sohankunkerkar/remove-sandbox-image-ref
pkg/kubelet: allow sandbox image pinning from CRI
2023-09-11 11:52:12 -07:00
Kubernetes Prow Robot
aa4ec3c5b0
Merge pull request #119944 from Sharpz7/jm/backup-finalizers
Adding backup code for removing finalizers to more Job End States.
2023-09-11 09:30:30 -07:00
Lars Ekman
0df4a69f5c
Kube-proxy: Get nodeIPs for both families with dual-stack (#119525)
* Kube-proxy: handle dual-stack in detectNodeIPs()

* Updates
2023-09-11 09:30:23 -07:00
Kensei Nakada
0d3eafdfa3
fix(scheduling_queue): always put Pods with no unschedulable plugins into activeQ/backoffQ (#119105)
* always put Pods with no unschedulable plugins into activeQ/backoffQ

* address review comments
2023-09-11 09:30:11 -07:00
Kubernetes Prow Robot
b1161a8ac4
Merge pull request #120559 from pohly/e2e-framework-WaitForPodsResponding-retry
e2e pods: fix WaitForPodsResponding retry
2023-09-11 07:52:10 -07:00
Gunju Kim
1fb4eee94e
Use container log instead of termination log
Since the termination log cannot be accessed until the container is
terminated, use the container log.
2023-09-11 22:55:09 +09:00
Kubernetes Prow Robot
6c578bc982
Merge pull request #120428 from pohly/dra-scheduler-reallocation-flake
DRA: scheduler reallocation flake
2023-09-11 02:56:12 -07:00
Patrick Ohly
fc3ee07b51 e2e pods: fix WaitForPodsResponding retry
The status error was embedded inside the new error constructed by
WaitForPodsResponding's get function, but not wrapped. Therefore
`apierrors.IsServiceUnavailable(err)` didn't find it and returned false -> no
retries.

Wrapping fixes this and Gomega formatting of the error remains useful:

	err := &errors.StatusError{}
	err.ErrStatus.Code = 503
	err.ErrStatus.Message = "temporary failure"

	err2 := fmt.Errorf("Controller %s: failed to Get from replica pod %s:\n%w\nPod status:\n%s",
		"foo", "bar",
		err, "some status")
	fmt.Println(format.Object(err2, 1))
        fmt.Println(errors.IsServiceUnavailable(err2))

=>

    <*fmt.wrapError | 0xc000139340>:
    Controller foo: failed to Get from replica pod bar:
    temporary failure
    Pod status:
    some status
    {
        msg: "Controller foo: failed to Get from replica pod bar:\ntemporary failure\nPod status:\nsome status",
        err: <*errors.StatusError | 0xc0001a01e0>{
            ErrStatus: {
                TypeMeta: {Kind: "", APIVersion: ""},
                ListMeta: {
                    SelfLink: "",
                    ResourceVersion: "",
                    Continue: "",
                    RemainingItemCount: nil,
                },
                Status: "",
                Message: "temporary failure",
                Reason: "",
                Details: nil,
                Code: 503,
            },
        },
    }

    true
2023-09-11 11:54:15 +02:00
Patrick Ohly
6f9140e421 DRA scheduler: stop allocating before deallocation
This fixes a test flake:

    [sig-node] DRA [Feature:DynamicResourceAllocation] multiple nodes reallocation [It] works
    /nvme/gopath/src/k8s.io/kubernetes/test/e2e/dra/dra.go:552

      [FAILED] number of deallocations
      Expected
          <int64>: 2
      to equal
          <int64>: 1
      In [It] at: /nvme/gopath/src/k8s.io/kubernetes/test/e2e/dra/dra.go:651 @ 09/05/23 14:01:54.652

This can be reproduced locally with

    stress -p 10 go test ./test/e2e -args -ginkgo.focus=DynamicResourceAllocation.*reallocation.works  -ginkgo.no-color -v=4 -ginkgo.v

Log output showed that the sequence of events leading to this was:
- claim gets allocated because of selected node
- a different node has to be used, so PostFilter sets
  claim.status.deallocationRequested
- the driver deallocates
- before the scheduler can react and select a different node,
  the driver allocates *again* for the original node
- the scheduler asks for deallocation again
- the driver deallocates again (causing the test failure)
- eventually the pod runs

The fix is to disable allocations first by removing the selected node and then
starting to deallocate.
2023-09-11 10:56:17 +02:00
Kubernetes Prow Robot
cc0a24d2e8
Merge pull request #120406 from wlq1212/cheanup/framework/timeout
e2e_framework:stop using deprecated wait.ErrwaitTimeout
2023-09-10 21:10:10 -07:00
Kubernetes Prow Robot
0ee315b94c
Merge pull request #120375 from pegasas/proxy
Improve logging on kube-proxy exit
2023-09-10 12:08:10 -07:00
Kubernetes Prow Robot
098d4c7b9e
Merge pull request #120546 from SaumyaBhushan/issue
added documentation about the format of certificateKey
2023-09-10 10:26:10 -07:00
pegasas
f446745777 Improve logging on kube-proxy exit 2023-09-11 00:50:29 +08:00
Kubernetes Prow Robot
25c7a1439a
Merge pull request #120069 from aojea/service_conformance
promote to conformance Service multiprotocol tests
2023-09-10 07:26:09 -07:00
SaumyaBhushan
df5c1bb1ea added documentation about the format of certificateKey
Signed-off-by: SaumyaBhushan <saumya.bhushan666@gmail.com>
2023-09-10 19:50:42 +05:30
Kubernetes Prow Robot
49768134e5
Merge pull request #119754 from pbxqdown/kubelet-fix-typo
Fix some typos in kubelet component source code
2023-09-09 19:36:11 -07:00
Kubernetes Prow Robot
b343878daa
Merge pull request #120438 from ritazh/kmsv2-metrics-apiserverid
kmsv2: add apiserver identity to metrics
2023-09-09 16:46:09 -07:00
Rita Zhang
43ccf6c4e8
kmsv2: add apiserver identity to metrics
Signed-off-by: Rita Zhang <rita.z.zhang@gmail.com>
2023-09-09 15:31:32 -07:00
Kubernetes Prow Robot
33c5bd631d
Merge pull request #120008 from skitt/drop-intstr-ptr-wrappers
Use ptr.To to retrieve intstr addresses
2023-09-09 07:24:09 -07:00
Kubernetes Prow Robot
fd8f2c7fc6
Merge pull request #120541 from pacoxu/kubeadm-fix-hash
kubeadm: add log for static pod manifest diff
2023-09-09 06:08:08 -07:00
Paco Xu
b443a841e3 kubeadm: add log for static pod manifest diff 2023-09-09 20:00:31 +08:00
Kubernetes Prow Robot
21f7bf66fa
Merge pull request #120272 from tzneal/add-tzneal-to-sig-node-reviewers
OWNERS_ALIASES: add tzneal to sig-node-reviewer
2023-09-08 18:28:11 -07:00
Kubernetes Prow Robot
37cf2638c9
Merge pull request #119619 from skitt/intstr-parse-parseint
Limit intstr.Parse() to 32-bit integer parsing
2023-09-08 13:04:29 -07:00
Kubernetes Prow Robot
41689233b4
Merge pull request #120334 from pohly/scheduler-clear-unschedulable-plugins
scheduler: avoid false "unschedulable" pod state
2023-09-08 12:01:23 -07:00
Kubernetes Prow Robot
817488e4fa
Merge pull request #120082 from aojea/hostnetwork_services_fallback
e2e network test for udp services with hostNetwork clients
2023-09-08 12:01:12 -07:00
Kubernetes Prow Robot
bec95ed575
Merge pull request #120527 from cpanato/bump-distroless
Bump distroless-iptables to v0.3.2
2023-09-08 09:36:29 -07:00
Kubernetes Prow Robot
15a019d841
Merge pull request #120526 from cpanato/update-prom
[releng] Update publishing-bot rules for active release branches that uses go1.20 to Go 1.20.8
2023-09-08 09:36:18 -07:00
Kubernetes Prow Robot
d7aeb7f853
Merge pull request #120524 from jprzychodzen/kcm-args
[cluster/gce] Add possibility to specify KCM specific args for scalability tests
2023-09-08 08:24:26 -07:00
Kubernetes Prow Robot
f6a87aebe6
Merge pull request #120499 from tukwila/gorilla/websocket_v1.5.0
bump: upgrade gorilla/websocket from v1.4.2 to v1.5.0
2023-09-08 08:24:15 -07:00
Patrick Ohly
4e73634b53 scheduler: start scheduling attempt with clean UnschedulablePlugins
When some plugin was registered as "unschedulable" in some previous scheduling
attempt, it kept that attribute for a pod forever. When that plugin then later
failed with an error that requires backoff, the pod was incorrectly moved to the
"unschedulable" queue where it got stuck until the periodic flushing because
there was no event that the plugin was waiting for.

Here's an example where that happened:

     framework.go:1280: E0831 20:03:47.184243] Reserve/DynamicResources: Plugin failed err="Operation cannot be fulfilled on podschedulingcontexts.resource.k8s.io \"test-dragxd5c\": the object has been modified; please apply your changes to the latest version and try again" node="scheduler-perf-dra-7l2v2" plugin="DynamicResources" pod="test/test-dragxd5c"
    schedule_one.go:1001: E0831 20:03:47.184345] Error scheduling pod; retrying err="running Reserve plugin \"DynamicResources\": Operation cannot be fulfilled on podschedulingcontexts.resource.k8s.io \"test-dragxd5c\": the object has been modified; please apply your changes to the latest version and try again" pod="test/test-dragxd5c"
    ...
    scheduling_queue.go:745: I0831 20:03:47.198968] Pod moved to an internal scheduling queue pod="test/test-dragxd5c" event="ScheduleAttemptFailure" queue="Unschedulable" schedulingCycle=9576 hint="QueueSkip"

Pop still needs the information about unschedulable plugins to update the
UnschedulableReason metric. It can reset that information before returning the
PodInfo for the next scheduling attempt.
2023-09-08 16:52:36 +02:00
Sharpz7
7e4b5d0d49 Final Fix 2023-09-08 14:44:22 +00:00
Kubernetes Prow Robot
2bfe0fc798
Merge pull request #120523 from shiftstack/cloud-provider/additional-flags-in-usage
cloud-provider: Log additional options
2023-09-08 06:28:14 -07:00
cpanato
f499b4f891
Bump distroless-iptables to v0.3.2
Signed-off-by: cpanato <ctadeu@gmail.com>
2023-09-08 15:01:22 +02:00
cpanato
18932d970e
[releng] Update publishing-bot rules for active release branches that use go1.20 to Go 1.20.8
Signed-off-by: cpanato <ctadeu@gmail.com>
2023-09-08 14:58:31 +02:00
Kubernetes Prow Robot
4c0d37e767
Merge pull request #120521 from SataQiu/clean-kubeadm-20230908
kubeadm: remove 'system:masters' organization from apiserver-etcd-client certificate
2023-09-08 05:26:25 -07:00
Kubernetes Prow Robot
4c43a25e6a
Merge pull request #119274 from tukwila/test_addons
kubeadm: add tests in ./cmd/phases/init/addons_test.go
2023-09-08 05:26:13 -07:00
Kubernetes Prow Robot
c9b35b7a27
Merge pull request #120517 from Xunzhuo/patch-2
fix: update thirdpartyresources refs
2023-09-08 04:18:25 -07:00
Kubernetes Prow Robot
1773a162bf
Merge pull request #120484 from carlory/fix-120475
fix breaking pv creation when the validation webhook exists
2023-09-08 04:18:14 -07:00
Jakub Przychodzeń
93e3b07e11 [cluster/gce] Add KCM specific args
Add possibility to add flags to KCM, without adding flags to CCM
2023-09-08 11:12:05 +00:00
Stephen Finucane
48f453d649 cloud-provider: Log additional options
The 'NewCloudControllerManagerCommand' function create a cobra.Command
object that is used for the main entry point to various CCM
implementations. This function accepts an 'additionalFlags' parameter,
allowing users to register additional controller-specific options
beyond the standard set used for all controllers. While we were dumping
the standard set of flags in the usage string - seen when running the
command with '--help' or when parsing fails - we were not dumping the
additional options. Correct this oversight.

Signed-off-by: Stephen Finucane <stephenfin@redhat.com>
2023-09-08 11:44:43 +01:00
Kubernetes Prow Robot
c084719291
Merge pull request #120509 from aojea/aojea_cp_owner
add aojea as controplane reviewer
2023-09-08 02:48:26 -07:00
Kubernetes Prow Robot
a64a3e16ec
Merge pull request #120253 from pohly/dra-scheduler-podschedulingcontext-updates
dra scheduler: refactor PodSchedulingContext updates
2023-09-08 02:48:14 -07:00
Stephen Kitt
aa89e6dc97
Use ptr.To to retrieve intstr addresses
This uses the generic ptr.To in k8s.io/utils to replace functions and
code constructs which only serve to return pointers to intstr
values. Other uses of the deprecated pointer package are updated in
modified files.

Signed-off-by: Stephen Kitt <skitt@redhat.com>
2023-09-08 11:10:50 +02:00
guangli.bao
9feaefb5cb Increase ut for kubeadm phases init addon
rename field casename to name
do go lint check
Update addons_test.go to remove whitespace

Signed-off-by: guangli.bao <guangli.bao@daocloud.io>
2023-09-08 15:35:17 +08:00
Kubernetes Prow Robot
9068bec08e
Merge pull request #120454 from pohly/golangci-lint-hint-deprecated
golangci-lint: warn about deprecated APIs only as hints
2023-09-08 00:32:25 -07:00
Kubernetes Prow Robot
80cd9d7a9a
Merge pull request #120105 from princepereira/ppereira-kubeproxy-mock-tests
New mock test framework for windows kubeproxy.
2023-09-08 00:32:14 -07:00
SataQiu
3e2bad02dc kubeadm: remove 'system:masters' organization from apiserver-etcd-client certificate 2023-09-08 15:01:00 +08:00
Xunzhuo
a0a8765027
fix: update thirdpartyresources refs 2023-09-08 14:41:15 +08:00