Commit Graph

125079 Commits

Author SHA1 Message Date
Maciej Skoczeń
48a8cb2bc5 Document throughput thresholds in scheduler_perf readme 2024-08-23 14:22:48 +00:00
Patrick Ohly
bf1188d292 scheduler_perf: only store log output after failures
Reconfiguring the logging infrastructure with a per-test output file mimicks
the behavior of per-test output (log output captured only on failures) while
still using the normal logging code, which is important for benchmarking.

To enable this behavior, the ARTIFACT env variable must be set.
2024-08-23 16:02:45 +02:00
Patrick Ohly
6cbdef850c scheduler_perf: normal error handling during node creation
Calling klog.Fatalf is not a good way to report problems. It kills the entire
test run, without given other code a chance to react.
2024-08-23 16:02:45 +02:00
Maciej Skoczeń
d0e3fc3561 Set scheduling throughput thresholds in scheduler_perf tests 2024-08-23 12:48:28 +00:00
Patrick Ohly
5447d28c0d ktesting: log warning on timeout
How exactly a test reacts when its context times out is unclear. In the case of
scheduler_perf, the apiserver started to shut down and the test failure then
was about not being able to reach the apiserver, which was a bit confusing.

To make it more obvious why the shutdown starts, a WARNING message gets added
to the test output by ktesting before cancellation and thus before any other
output related to that cancellation.
2024-08-23 14:31:17 +02:00
Patrick Ohly
cc9234cffd test etcd: avoid potential data race
If the caller needs to reconfigure klog, then calling klog without proper
synchronizing while stopping causes a data race. We have to ensure that
the goroutine has terminated before stop returns.
2024-08-23 14:31:17 +02:00
Patrick Ohly
6ee2c63135 testserver: allow caller to set up ktesting
In scheduler_perf, the caller configures ktesting without per-test output. When
StartTestServer initialized it again, it did so with per-test output. In other
cases it might have been simply redundant.
2024-08-23 14:31:17 +02:00
Kubernetes Prow Robot
fb6984c683
Merge pull request #126884 from macsko/reduce_length_of_nodetree_logs
Reduce length of NodeTree logs
2024-08-23 13:17:53 +01:00
Marek Siarkowicz
fa5008807a Benchmark storage 2024-08-23 12:45:29 +02:00
Maciej Skoczeń
dc5e1a404f Reduce length of NodeTree logs 2024-08-23 08:28:13 +00:00
Kubernetes Prow Robot
1e827f4b2a
Merge pull request #126867 from piny940/master
fix ValidatingAdmissionPolicy's Validate func to return decision with valid Evaluation
2024-08-22 22:50:26 +01:00
cici37
4e36e436ad Update feature gate list. 2024-08-22 20:38:48 +00:00
Tim Allclair
bfa7297ddd Add tallclair back to sig-node-approvers 2024-08-22 13:17:31 -07:00
Kubernetes Prow Robot
b67d4dd389
Merge pull request #126839 from carlory/LegacyServiceAccountTokenCleanUp
cleanup after LegacyServiceAccountTokenCleanUp is GA
2024-08-22 20:33:56 +01:00
Kubernetes Prow Robot
e955c1d6a8
Merge pull request #126808 from macsko/move_activeq_fields_follow_up
Don't expose lock outside activeQueue in scheduling queue
2024-08-22 20:33:47 +01:00
Kubernetes Prow Robot
a1fc2551ba
Merge pull request #126144 from likakuli/cleanup-unusedparamters
cleanup: remove scheduler_perf unused parameters
2024-08-22 19:29:40 +01:00
cici37
72ad9c5fdf Remove FG inspection from VAP plugin 2024-08-22 17:52:21 +00:00
Cici Huang
0f19faf9be Remove feature gate ValiatingAdmissionPolicy after stable. 2024-08-22 17:39:18 +00:00
Kubernetes Prow Robot
e855753ca6
Merge pull request #126848 from carlory/ref-124136
Fix hotloop once the watch closes
2024-08-22 16:32:24 +01:00
Kubernetes Prow Robot
3ffc177677
Merge pull request #126846 from danwinship/fix-e2e-kube-proxy-detection
Fix the localhost nodeport metrics test to not fail under non-kube-proxy
2024-08-22 16:32:16 +01:00
Kubernetes Prow Robot
3f306ae140
Merge pull request #126343 from SergeyKanzhelev/succeededPodReadmitted
Terminated pod should not be re-admitted
2024-08-22 16:32:09 +01:00
Kubernetes Prow Robot
b1559c66ca
Merge pull request #126807 from pohly/dra-resourceslice-update
DRA scheduler: ResourceSlice update
2024-08-22 15:18:09 +01:00
Maciej Skoczeń
eabdc612dd Use queue.Add instead of activeQ.AddOrUpdate in scheduling queuue tests 2024-08-22 10:28:36 +00:00
Maciej Skoczeń
3eefd62f94 Make update and delete active queue methods 2024-08-22 09:26:05 +00:00
Maciej Skoczeń
9773a39b28 Don't expose lock outside activeQueue in scheduling queue 2024-08-22 09:21:35 +00:00
Kubernetes Prow Robot
b3c725627b
Merge pull request #126842 from saschagrunert/hugepages-test
Fix hugepages e2e test assertion
2024-08-22 10:11:31 +01:00
Patrick Ohly
cf68eb575e DRA scheduler: reduce verbosity of "Checked claim"
The original usage of `Allocate` was that the caller determines verbosity by
passing a logger with increased verbosity threshold into the call. Later that
was changed to using higher V levels inside `Allocated` itself, but the
"Checked claim" log call slipped through.
2024-08-22 10:53:05 +02:00
Ed Bartosh
e1bc8defac kubelet: Migrate DRA Manager to contextual logging
Co-authored-by: Patrick Ohly <patrick.ohly@intel.com>
2024-08-22 11:12:41 +03:00
Patrick Ohly
e85d3babf0 DRA scheduler: fix re-scheduling after ResourceSlice changes
Making unschedulable pods schedulable again after ResourceSlice cluster events
was accidentally left out when adding structured parameters to Kubernetes 1.30.

All E2E tests were defined so that a driver starts first. A new test with a
different order (create pod first, wait for unschedulable, start driver)
triggered the bug and now passes.
2024-08-22 10:09:32 +02:00
Patrick Ohly
6dd2ade762 DRA scheduler: reduce log verbosity
That a pod with no claims remains unschedulable on claim changes is a pretty
normal case. It should only be logged when debugging.
2024-08-22 10:09:32 +02:00
Kubernetes Prow Robot
cee43048c7
Merge pull request #126854 from serathius/pagination-tests
Add paging tests
2024-08-22 06:24:16 +01:00
piny940
9f7ea45ea0 fix ValidatingAdmissionPolicy's Validate func to return decision with valid Evaluation 2024-08-22 13:33:17 +09:00
Kubernetes Prow Robot
850bfd9aa9
Merge pull request #126841 from carlory/StableLoadBalancerNodeSet
remove feature-gate StableLoadBalancerNodeSet
2024-08-21 22:06:32 +01:00
Kubernetes Prow Robot
77737c3eb3
Merge pull request #126512 from kmala/metrics
add resource to the transformation metrics
2024-08-21 22:06:26 +01:00
Tim Hockin
128e189fcd
Remove AllowServiceLBStatusOnNonLB gate 2024-08-21 13:07:52 -07:00
Kubernetes Prow Robot
44ce613173
Merge pull request #126851 from BenTheElder/update-vendor
update corefile-migration to v1.0.24
2024-08-21 20:02:30 +01:00
Kubernetes Prow Robot
b60e01f881
Merge pull request #126838 from zou2699/fix-126836
fix: use the getContext() method for evicting pods
2024-08-21 18:32:19 +01:00
Kubernetes Prow Robot
563ab1bb55
Merge pull request #126789 from aaron-prindle/emu-version-local-up-cluster
feat: add EMULATED_VERSION env var and --emulated-version flag params to k8s binaries flags in hack/local-up-cluster.sh
2024-08-21 18:32:12 +01:00
Marek Siarkowicz
99e6956980 Add paging tests 2024-08-21 18:54:57 +02:00
Keerthan Reddy Mala
3a8df1efdd add resource to the transformation metrics 2024-08-21 09:52:24 -07:00
Benjamin Elder
4aad0b9c1a update corefile-migration to v1.0.24
there are no code changes, instead packages we do not use have been isolated into another module to help manage the dependency tree

in older release branches, picking this back will allow us to avoid unexpected cobra updates
2024-08-21 09:39:53 -07:00
carlory
231d9af5fe Fix hotloop once the watch closes 2024-08-22 00:03:26 +08:00
carlory
295ecc19d4 remove feature-gate StableLoadBalancerNodeSet 2024-08-21 23:32:06 +08:00
Kubernetes Prow Robot
69333563e5
Merge pull request #126690 from googs1025/cli/add_ut
add printer unit test for DRA resources
2024-08-21 16:29:31 +01:00
Kubernetes Prow Robot
4ce63b4822
Merge pull request #126197 from macsko/move_nominatedpodsfornode_to_scheduling_queue
Move NominatedPodsForNode to scheduling queue directly
2024-08-21 16:29:21 +01:00
Kubernetes Prow Robot
243fdafc68
Merge pull request #126840 from carlory/CloudDualStackNodeIPs
cleanup after CloudDualStackNodeIPs is GA
2024-08-21 15:22:19 +01:00
Kubernetes Prow Robot
b1aadefec8
Merge pull request #126556 from carlory/ref-124136
Collect more info for debugging #124136
2024-08-21 15:22:05 +01:00
Dan Winship
a3b97b118b Fix the localhost nodeport metrics test to not fail under non-kube-proxy
If the cluster is using a non-kube-proxy service proxy, the `curl`
will presumably fail; this should not be considered a hard failure.
2024-08-21 09:44:42 -04:00
Kubernetes Prow Robot
6ca629d46b
Merge pull request #126769 from Sakuralbj/headless-service-kubeproxy
don't watch headless services on kube-proxy
2024-08-21 13:57:02 +01:00
googs1025
fd97b36da1 add printer unit test for DRA resources 2024-08-21 19:56:50 +08:00