Skip to content

[WIP] Promote SaturationDetector to an extension point.#2605

Draft
LukeAVanDrie wants to merge 5 commits intokubernetes-sigs:mainfrom
LukeAVanDrie:feat/saturation-filtering
Draft

[WIP] Promote SaturationDetector to an extension point.#2605
LukeAVanDrie wants to merge 5 commits intokubernetes-sigs:mainfrom
LukeAVanDrie:feat/saturation-filtering

Conversation

@LukeAVanDrie
Copy link
Contributor

What type of PR is this?

/kind cleanup
/kind feature
/kind deprecation

What this PR does / why we need it:

This PR promotes the SaturationDetector to an EPP extension point. This is a breaking change as it removes the top-level config block for saturation detection.

Which issue(s) this PR fixes:

Fixes #1405

Does this PR introduce a user-facing change?:

The `saturationDetector` configuration has been transitioned to the plugin-based model.

@k8s-ci-robot
Copy link
Contributor

Skipping CI for Draft Pull Request.
If you want CI signal for your change, please convert it to an actual PR.
You can still manually trigger a test run with /test all

@k8s-ci-robot k8s-ci-robot added do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. kind/cleanup Categorizes issue or PR as related to cleaning up code, process, or technical debt. kind/feature Categorizes issue or PR as related to a new feature. kind/deprecation Categorizes issue or PR as related to a feature/enhancement marked for deprecation. labels Mar 17, 2026
@netlify
Copy link

netlify bot commented Mar 17, 2026

Deploy Preview for gateway-api-inference-extension ready!

Name Link
🔨 Latest commit c593dfa
🔍 Latest deploy log https://app.netlify.com/projects/gateway-api-inference-extension/deploys/69b9ebb2b78292000846afee
😎 Deploy Preview https://deploy-preview-2605--gateway-api-inference-extension.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

@k8s-ci-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: LukeAVanDrie
Once this PR has been reviewed and has the lgtm label, please assign ahg-g for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot k8s-ci-robot added the cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. label Mar 17, 2026
@k8s-ci-robot k8s-ci-robot added the size/L Denotes a PR that changes 100-499 lines, ignoring generated files. label Mar 17, 2026
@LukeAVanDrie LukeAVanDrie changed the title [WIP] [WIP] Promote SaturationDetector to an extension point. Mar 17, 2026
@k8s-ci-robot k8s-ci-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Mar 17, 2026
Implements the Filter extension point in the `UtilizationDetector` to
enable per-pod saturation guarding.

The filter allows the scheduler to bypass endpoints that exceed queue
depth or KV-cache utilization thresholds. It introduces a configurable
'Headroom' parameter to provide burst tolerance (e.g., 20% above base
limits) for scheduling flexibility.
@LukeAVanDrie LukeAVanDrie force-pushed the feat/saturation-filtering branch from 3970c2a to 14f040b Compare March 17, 2026 23:38
@k8s-ci-robot k8s-ci-robot added size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. and removed needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. size/L Denotes a PR that changes 100-499 lines, ignoring generated files. labels Mar 17, 2026
@LukeAVanDrie LukeAVanDrie force-pushed the feat/saturation-filtering branch from 14f040b to c593dfa Compare March 18, 2026 00:02
@k8s-ci-robot k8s-ci-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Mar 18, 2026
@k8s-ci-robot
Copy link
Contributor

PR needs rebase.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. kind/cleanup Categorizes issue or PR as related to cleaning up code, process, or technical debt. kind/deprecation Categorizes issue or PR as related to a feature/enhancement marked for deprecation. kind/feature Categorizes issue or PR as related to a new feature. needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. size/XL Denotes a PR that changes 500-999 lines, ignoring generated files.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Saturation check should become an extension point

2 participants