fix: logits processors for CB #484

wallashss · 2025-09-26T01:36:33Z

Description

This PR introduces a wrapper around the logits processors that are injected into vLLM, so that each processor is applied per request rather than to the whole batch. The wrapper is initialized with the logits processor class and the batch_size. From the logits processor's perspective, it "thinks" it is handling only one request per step, while the wrapper receives the batch of logits, slices it, and passes each slice to the processor for the corresponding request.
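A minimal sketch of the idea described above, not the code in this PR. It uses plain Python lists in place of logits tensors, and `BanTokenProcessor`, `BatchWrapper`, `configure`, and `apply` are hypothetical names chosen for illustration only.

```python
from typing import List, Optional, Type

Row = List[float]  # stand-in for the logits of a single request


class BanTokenProcessor:
    """Hypothetical processor that only knows about one request."""

    def __init__(self) -> None:
        self.banned_token: Optional[int] = None

    def configure(self, banned_token: Optional[int]) -> None:
        # Per-request state lives on the processor instance.
        self.banned_token = banned_token

    def apply(self, row: Row) -> Row:
        out = list(row)
        if self.banned_token is not None:
            out[self.banned_token] = float("-inf")
        return out


class BatchWrapper:
    """Hypothetical wrapper: one processor instance per batch slot."""

    def __init__(self, processor_cls: Type[BanTokenProcessor], batch_size: int) -> None:
        self.processors = [processor_cls() for _ in range(batch_size)]

    def apply(self, batch: List[Row]) -> List[Row]:
        if len(batch) > len(self.processors):
            raise ValueError("batch is larger than the configured batch_size")
        # Slice the batch row by row and hand each row to its own processor.
        return [proc.apply(row) for proc, row in zip(self.processors, batch)]


wrapper = BatchWrapper(BanTokenProcessor, batch_size=2)
wrapper.processors[0].configure(banned_token=1)  # only request 0 is affected
result = wrapper.apply([[0.1, 0.2, 0.3], [0.4, 0.5, 0.6]])
```

Here only the first row has token 1 masked; the second request's processor never sees the first request's state.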

Signed-off-by: Wallas Santos <[email protected]>

github-actions · 2025-09-26T01:36:41Z

👋 Hi! Thank you for contributing to vLLM support on Spyre.
Just a reminder: Make sure that your code passes all the linting checks, otherwise your PR won't be able to be merged. To do so, first install the linting requirements, then run format.sh and commit the changes. This can be done with uv directly:

uv sync --frozen --group lint --active --inexact

Or this can be done with pip:

uv pip compile --group lint > requirements-lint.txt
pip install -r requirements-lint.txt
bash format.sh

Now you are good to go 🚀


wallashss · 2025-09-26T15:22:32Z

bot:test

yannicks1 · 2025-09-26T09:30:20Z

tests/e2e/test_sampling_params.py


 pytestmark = [pytest.mark.full_model, pytest.mark.other_e2e]

+# TODO: REVERT THIS CHANGE!


Can we parametrize the tests so that they are executed for both SB and CB?

Or do we already have a similar test for CB?

We do not have this test for CB. I think the concern here is that it would increase CI time too much. Moreover, these tests do not reproduce the issue of this PR very well.
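For reference, the kind of parametrization asked about in this thread could look roughly like the sketch below. It is not taken from tests/e2e/test_sampling_params.py: the mode labels, `run_generation`, and the test name are hypothetical stand-ins, and a real test would run the model in each mode instead of returning dummy tokens.

```python
from typing import List

import pytest

# Hypothetical mode labels for the two batching modes discussed in the thread.
MODES = ["sb", "cb"]


def run_generation(mode: str, max_tokens: int) -> List[int]:
    """Hypothetical stand-in for an end-to-end generation call."""
    return list(range(max_tokens))


@pytest.mark.parametrize("mode", MODES)
def test_max_tokens_respected(mode: str) -> None:
    # The same assertion runs once per mode, which is also why CI time grows.
    tokens = run_generation(mode, max_tokens=4)
    assert len(tokens) == 4
```

Each extra entry in `MODES` multiplies the number of test runs, which is the CI cost raised in the reply.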


wallashss · 2025-09-26T20:36:53Z

bot:test

maxdebayser

I think this is OK as a stopgap solution, but we should think about a more comprehensive solution later, when more of the sampling params are implemented with logits processors. There could be performance implications in applying the LPs per request vs. per batch.


wallashss · 2025-09-29T15:10:24Z

bot:test

wallashss · 2025-09-30T15:42:23Z

bot:test


wallashss · 2025-10-01T12:38:09Z

bot:test


wallashss · 2025-10-01T14:28:30Z

bot:test

wallashss · 2025-10-01T17:18:08Z

bot:test

fix: cb for logits processors

73cd83d

Signed-off-by: Wallas Santos <[email protected]>

wallashss added 4 commits September 26, 2025 10:13

feat: switch prefill/decode for logitsprocs

122ad98

Signed-off-by: Wallas Santos <[email protected]>

fix: spyre input batch

9f60e0a

Signed-off-by: Wallas Santos <[email protected]>

refact: code cleanup

aa85ded

Signed-off-by: Wallas Santos <[email protected]>

fix: reverted test

9a65067

Signed-off-by: Wallas Santos <[email protected]>

yannicks1 reviewed Sep 26, 2025


wallashss added 2 commits September 26, 2025 16:03

test: test_cb_logits_processor

53173e2

Signed-off-by: Wallas Santos <[email protected]>

fix: minor improvement

3641dc1

Signed-off-by: Wallas Santos <[email protected]>

wallashss marked this pull request as ready for review September 26, 2025 19:09

wallashss requested review from nikolaospapandreou, prashantgupta24, rafvasq, sducouedic and tdoublep as code owners September 26, 2025 19:09

wallashss mentioned this pull request Sep 26, 2025

feat: golden token injector logits processor #478

Merged

maxdebayser approved these changes Sep 29, 2025


wallashss added 2 commits September 29, 2025 11:43

refact: renamed spyre_logits_processor

209e207

Signed-off-by: Wallas Santos <[email protected]>

style: fix linting

30c0e7e

Signed-off-by: Wallas Santos <[email protected]>

Merge branch 'main' of github.com:vllm-project/vllm-spyre into wallas…

85b1927

…-fix-cb-logits-processors

wallashss added 3 commits October 1, 2025 10:59

Merge branch 'main' of github.com:vllm-project/vllm-spyre into wallas…

89e0e1d

…-fix-cb-logits-processors

fix: test parameters

3bc9f44

fix: upgrade aftu Signed-off-by: Wallas Santos <[email protected]>

Merge branch 'wallas-fix-cb-logits-processors' of github.com:vllm-pro…

fa69ff5

…ject/vllm-spyre into wallas-fix-cb-logits-processors

wallashss requested a review from joerunde as a code owner October 1, 2025 14:26

wallashss merged commit a70890c into main Oct 1, 2025
19 of 20 checks passed

wallashss deleted the wallas-fix-cb-logits-processors branch October 1, 2025 16:53

tjohnson31415 mentioned this pull request Oct 6, 2025

[BUG] Server may crash with IndexError when cancelling batches of requests min_tokens #492

Closed

maxdebayser mentioned this pull request Oct 8, 2025

Add tests for logits processor correctness #295

Open



4 participants
