[CB] remove env var VLLM_SPYRE_ENABLE_PREFILL_OPTIMIZATION #562

yannicks1 · 2025-11-17T17:22:45Z

VLLM_SPYRE_ENABLE_PREFILL_OPTIMIZATION is on by default and we have not found any reason to ever turn it off.

reasons for removing:

- getting rid of dead code -> simplifying scheduler constraints
- being consistent with chunked prefill scheduler: the variable is not used there
- reducing GHA time by removing tests targeting this optimization specifically
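
To illustrate the first reason, here is a minimal sketch of how an always-on flag leaves a dead branch behind, and how removing it collapses the check into a single path. Everything in it is hypothetical: the variable name `HYPOTHETICAL_ENABLE_OPTIMIZATION`, the helper `flag_enabled`, and both placeholder conditions are assumptions made for illustration and are not the actual vllm-spyre scheduler constraints.

```python
import os


def flag_enabled(name: str, default: str = "1") -> bool:
    """Hypothetical helper: treat the env var as on unless it is set to something other than '1'."""
    return os.environ.get(name, default) == "1"


def can_schedule_before(request_tokens: int, free_tokens: int) -> bool:
    """Placeholder check with a flag-guarded branch (not the real constraint)."""
    if flag_enabled("HYPOTHETICAL_ENABLE_OPTIMIZATION"):
        # Path taken by default.
        return request_tokens <= free_tokens
    # Legacy path: only reachable if someone turns the flag off.
    return request_tokens <= free_tokens // 2


def can_schedule_after(request_tokens: int, free_tokens: int) -> bool:
    """Same placeholder check once the flag and its legacy branch are removed."""
    return request_tokens <= free_tokens
```

With the flag left at its default, both functions agree on every input, so dropping the legacy branch does not change default behavior; it only removes a path that tests would otherwise have to cover separately.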

Signed-off-by: Yannick Schnider <[email protected]>

github-actions · 2025-11-17T17:23:03Z

👋 Hi! Thank you for contributing to vLLM support on Spyre.
Just a reminder: Make sure that your code passes all the linting checks, otherwise your PR won't be able to be merged. To do so, first install the linting requirements, then run format.sh and commit the changes. This can be done with uv directly:

uv sync --frozen --group lint --active --inexact

Or this can be done with pip:

uv pip compile --group lint > requirements-lint.txt
pip install -r requirements-lint.txt
bash format.sh

Now you are good to go 🚀

Signed-off-by: Yannick Schnider <[email protected]>

maxdebayser

Thanks, this makes sense.

tests/e2e/test_spyre_cb_scheduler_steps.py

sducouedic

there are still two tests whose names refer to the prefill optimization:

test_requests_use_full_batch_tkv_limit_prefill_opt and test_requests_exceed_batch_tkv_limit_prefill_opt

yannicks1 · 2025-11-19T16:58:44Z

good catch, will change the names of these tests.

vllm_spyre/v1/core/scheduler.py

Signed-off-by: Yannick Schnider <[email protected]>

sducouedic

LGTM, definitely a no-brainer to have the optimization enabled all the time

applying the tighter constraint for the max model length to the continuous batching scheduler too. this establishes parity between the chunked prefill and continuous batching constraints. see discussion [here](#562 (comment))

Signed-off-by: Yannick Schnider <[email protected]>

yannicks1 added 2 commits November 17, 2025 17:14

remove VLLM_SPYRE_ENABLE_PREFILL_OPTIMIZATION

43d91e6

Signed-off-by: Yannick Schnider <[email protected]>

Merge branch 'main' into ysc-remove-prefill-opt-var

ed173a4

Signed-off-by: Yannick Schnider <[email protected]>

yannicks1 added 2 commits November 17, 2025 17:24

rmv unused import

0646ba0

Signed-off-by: Yannick Schnider <[email protected]>

simplify scheduler constraints CB

e9d852c

Signed-off-by: Yannick Schnider <[email protected]>

yannicks1 marked this pull request as ready for review November 18, 2025 09:15

yannicks1 requested review from nikolaospapandreou, prashantgupta24, rafvasq, sducouedic and tdoublep as code owners November 18, 2025 09:15

maxdebayser approved these changes Nov 18, 2025

sducouedic reviewed Nov 19, 2025

tests/e2e/test_spyre_cb_scheduler_steps.py (outdated, resolved)

sducouedic reviewed Nov 19, 2025

tests/e2e/test_spyre_cb_scheduler_steps.py (outdated, resolved)

sducouedic reviewed Nov 19, 2025

yannicks1 requested a review from tjohnson31415 November 19, 2025 17:01

tjohnson31415 reviewed Nov 19, 2025

vllm_spyre/v1/core/scheduler.py (resolved)

yannicks1 added 2 commits November 19, 2025 20:34

address feedback

282aa40

Signed-off-by: Yannick Schnider <[email protected]>

add comment

697cfab

Signed-off-by: Yannick Schnider <[email protected]>

yannicks1 enabled auto-merge (squash) November 19, 2025 20:37

github-actions bot added the ready label (Runs the full CI test suite. Only add to PRs once ready to merge to limit public GHA usage) Nov 19, 2025

sducouedic approved these changes Nov 19, 2025

yannicks1 merged commit 087d75f into main Nov 19, 2025
21 of 42 checks passed

yannicks1 deleted the ysc-remove-prefill-opt-var branch November 19, 2025 20:49

yannicks1 mentioned this pull request Nov 20, 2025

[CB] tighten constraint max model length decode sequences #573

Merged

5 participants
