fix: logits processor state at each step #544

wallashss · 2025-10-27T19:56:09Z

Description

This PR fixes the state update for logits processors that must be refreshed at every engine step. To validate the change, I updated the existing min tokens test so that it exposes the wrong behaviour. Note: the bug is reproducible in both continuous batching (CB) and static batching (SB).
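To illustrate why a per-step update matters, here is a minimal sketch. It is a toy model, not the vllm-spyre code: `MinTokensState`, `run_steps`, and the `update_every_step` flag are hypothetical names introduced only for this example. It assumes the processor masks EOS until a token count is reached and that the batch itself does not change between steps.

```python
class MinTokensState:
    """Toy stand-in for a min-tokens logits processor (hypothetical, not the vLLM class)."""

    def __init__(self, min_tokens: int) -> None:
        self.min_tokens = min_tokens
        # EOS stays masked until enough tokens have been generated.
        self.eos_masked = min_tokens > 0

    def update_state(self, num_generated: int) -> None:
        # Re-evaluate the mask from the current token count.
        self.eos_masked = num_generated < self.min_tokens


def run_steps(steps: int, update_every_step: bool) -> bool:
    """Return whether EOS is still masked after `steps` generated tokens."""
    proc = MinTokensState(min_tokens=3)
    for num_generated in range(1, steps + 1):
        # Steady state: no request was added to or removed from the batch.
        batch_changed = False
        if batch_changed or update_every_step:
            proc.update_state(num_generated)
    return proc.eos_masked


stuck = run_steps(steps=5, update_every_step=False)
cleared = run_steps(steps=5, update_every_step=True)
print(f"skip updates: eos_masked={stuck}, update each step: eos_masked={cleared}")
```

In this sketch, skipping the update whenever the batch is unchanged leaves EOS masked indefinitely, which matches the kind of over-long generation the updated test is meant to catch.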

Signed-off-by: Wallas Santos <[email protected]>

github-actions · 2025-10-27T19:56:17Z

👋 Hi! Thank you for contributing to vLLM support on Spyre.
Just a reminder: Make sure that your code passes all the linting checks, otherwise your PR won't be able to be merged. To do so, first install the linting requirements, then run format.sh and commit the changes. This can be done with uv directly:

uv sync --frozen --group lint --active --inexact

Or this can be done with pip:

uv pip compile --group lint > requirements-lint.txt
pip install -r requirements-lint.txt
bash format.sh

Now you are good to go 🚀

tjohnson31415 · 2025-10-27T20:00:54Z

tests/e2e/test_sampling_params.py

+    # after min tokens reached the logits processor is properly
+    # cleared.
+    assert len(output1.outputs[0].token_ids) < 20
+    assert len(output2.outputs[0].token_ids) < 10


If we increase the eos_id logit bias to force it to be generated, then we can assert on the exact output length, right?

assert len(output1.outputs[0].token_ids) == 11
assert len(output2.outputs[0].token_ids) == 1

(the values for those asserts may be off-by-one depending on how EOS is tracked in the outputs 😅)
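The reasoning behind the suggestion can be sketched with a toy greedy decoder. This is an illustration under stated assumptions, not the actual test: `generate`, `EOS_ID`, and the fixed logits are hypothetical, and the sketch assumes the EOS token is counted in the output, which is exactly the off-by-one caveat mentioned above.

```python
import math

EOS_ID = 0       # hypothetical EOS token id
VOCAB_SIZE = 4   # hypothetical tiny vocabulary


def generate(min_tokens: int, eos_bias: float, max_tokens: int = 20) -> list[int]:
    """Toy greedy decoder standing in for a model; not the vLLM API."""
    output: list[int] = []
    while len(output) < max_tokens:
        # Fixed "model" logits that prefer token 1 over EOS.
        logits = [0.0, 1.0, 0.5, 0.25]
        # A large bias makes EOS the top choice whenever it is allowed.
        logits[EOS_ID] += eos_bias
        # Min-tokens rule: EOS is masked until min_tokens tokens exist.
        if len(output) < min_tokens:
            logits[EOS_ID] = -math.inf
        token = max(range(VOCAB_SIZE), key=logits.__getitem__)
        output.append(token)
        if token == EOS_ID:
            break
    return output


print(len(generate(min_tokens=10, eos_bias=100.0)))
print(len(generate(min_tokens=0, eos_bias=100.0)))
```

With a strong bias, EOS is emitted on the first step where it is no longer masked, so the output length is deterministic and an equality assertion becomes possible instead of an upper bound.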

Signed-off-by: Wallas Santos <[email protected]>

tjohnson31415

LGTM! Thanks!

wallashss · 2025-10-27T20:52:27Z

bot:test

…545)

# Description

The MinTokensLogitsProcessor needs to get a `batch_update` at each step to detect when enough tokens have been generated. The `LogitProcessorWrapper` copied the typical logic of skipping updates when `batch_update` is None, but this meant that min tokens would not get the needed call to `update_state`.

The fix here is to always call `update_state` on each of the wrapped logitsprocs in the batch, with some extra code to not call `update_state` for a particular index more than once.

## Related Issues

Follow-up to #544, which fixed the behavior for static batching.

Cherry-picked improvement to test_sampling_params.py from #536

---------

Signed-off-by: Travis Johnson <[email protected]>
Co-authored-by: Wallas Santos <[email protected]>
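The wrapper behavior described in this commit message can be sketched as follows. This is a simplified illustration, not the `LogitProcessorWrapper` implementation: `ToyMinTokens`, `ToyWrapper`, and the dict-shaped `batch_update` with an `"added"` key are assumptions made for the example.

```python
from typing import Optional


class ToyMinTokens:
    """Hypothetical per-request processor; not the vLLM class."""

    def __init__(self, min_tokens: int, output_tokens: list) -> None:
        self.min_tokens = min_tokens
        self.output_tokens = output_tokens  # list the engine appends to
        self.active = min_tokens > 0
        self.update_calls = 0

    def update_state(self, batch_update: Optional[dict]) -> None:
        self.update_calls += 1
        # The processor stays active only until min_tokens is reached.
        self.active = len(self.output_tokens) < self.min_tokens


class ToyWrapper:
    """Hypothetical wrapper holding one processor per batch index."""

    def __init__(self) -> None:
        self.procs: dict = {}

    def update_state(self, batch_update: Optional[dict]) -> None:
        updated = set()
        if batch_update is not None:
            for index, proc in batch_update.get("added", []):
                self.procs[index] = proc
                proc.update_state(batch_update)
                updated.add(index)
        # Always refresh the remaining processors, even when batch_update
        # is None, but never update the same index twice in one step.
        for index, proc in self.procs.items():
            if index not in updated:
                proc.update_state(None)


tokens: list = []
proc = ToyMinTokens(min_tokens=2, output_tokens=tokens)
wrapper = ToyWrapper()
wrapper.update_state({"added": [(0, proc)]})  # step 1: request joins the batch
tokens.append(5)
wrapper.update_state(None)                    # step 2: batch unchanged
tokens.append(6)
wrapper.update_state(None)                    # step 3: min_tokens reached
print(proc.active, proc.update_calls)
```

The `updated` set is the "extra code" in this sketch: a processor that was just added receives exactly one `update_state` call in that step rather than two.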

wallashss added 2 commits October 27, 2025 16:29

fix: update of batch at each generation step

9080cbd

Signed-off-by: Wallas Santos <[email protected]>

test: adjusted test to check if min tokens logits processor is cleared

8c8e23f

Signed-off-by: Wallas Santos <[email protected]>

wallashss requested review from nikolaospapandreou, prashantgupta24, rafvasq, sducouedic, tdoublep and yannicks1 as code owners October 27, 2025 19:56

wallashss requested a review from tjohnson31415 October 27, 2025 19:56

tjohnson31415 reviewed Oct 27, 2025

wallashss added 2 commits October 27, 2025 17:08

test: addressed review comment

180d732

Signed-off-by: Wallas Santos <[email protected]>

style: fix linting

d686e51

Signed-off-by: Wallas Santos <[email protected]>

tjohnson31415 approved these changes Oct 27, 2025

tjohnson31415 enabled auto-merge (squash) October 27, 2025 20:43

github-actions bot added the ready label (Runs the full CI test suite. Only add to PRs once ready to merge to limit public GHA usage) Oct 27, 2025

tjohnson31415 merged commit 7ed0611 into main Oct 27, 2025
30 of 40 checks passed

tjohnson31415 deleted the wallas-fix-min-tokens branch October 27, 2025 20:58

tjohnson31415 mentioned this pull request Oct 28, 2025

fix: min_tokens > 1 causes long generation with continuous batching #545

Merged

3 participants
