kubernetes/pkg/scheduler/internal/queue/scheduling_queue.go at cd943dd95e32bfdc928450ecb61fdabadfcebd29

mirror of https://github.com/k3s-io/kubernetes.git synced 2025-12-05 07:22:41 +00:00

Files

Patrick Ohly cd943dd95e scheduler: fix tracking of concurrent events

The previous approach was based on the assumption that an in-flight pod can use
the head of the received event list as marker for identifying all events that
occur while the pod is in flight. That assumption is incorrect: when that
existing element gets removed from the list because all pods that were
in-flight when it was received are done, that marker's Next method returns nil
and the code which should have seen several concurrent events (if there were
any) missed all of those.

As a result, a pod with concurrent events could incorrectly get moved to the
unschedulable queue where it could got stuck until the next periodic purging
after 5 minutes if there was no other event for it.

The approach with maintaining a single list of concurrent events can be fixed
by inserting each in-flight pod into the list and using that element to
identify "more recent" events for the pod.

2023-09-05 19:58:38 +02:00

57 KiB

Raw Blame History

View Raw

57 KiB Raw Blame History

57 KiB

Raw Blame History