Skip to content

Conversation

@njhill
Copy link
Member

@njhill njhill commented Nov 15, 2025

Small fix, took a while to track down 😅

The populated all_token_ids should never have been truncated since they are used to recover output ids following preemption in the async scheduling case.

@njhill njhill added the bug Something isn't working label Nov 15, 2025
@mergify mergify bot added the v1 label Nov 15, 2025
@njhill njhill added this to the v0.11.1 milestone Nov 15, 2025
Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request addresses a bug related to async scheduling, chunked prefill, and preemption. The core of the fix is in vllm/v1/core/sched/scheduler.py, where previously only a slice of token IDs was being sent for requests undergoing chunked prefill. This was incorrect as the connector requires the full list of tokens for prefix matching. The change correctly sends a full copy of the token IDs. To facilitate this, a copy() method was added to the ConstantList class. The re-enabling of previously skipped tests in tests/v1/e2e/test_async_scheduling.py validates the fix. The changes are correct, well-targeted, and I have no further recommendations.

@njhill njhill mentioned this pull request Nov 15, 2025
19 tasks
@njhill njhill added the ready ONLY add when PR is ready to merge/full CI is needed label Nov 15, 2025
@DarkLight1337 DarkLight1337 merged commit 80b6080 into vllm-project:main Nov 16, 2025
47 of 48 checks passed
@njhill njhill deleted the fix-async-preempt-chunkprefill branch November 17, 2025 05:53
bwasti pushed a commit to bwasti/vllm that referenced this pull request Nov 17, 2025
bringlein pushed a commit to bringlein/vllm that referenced this pull request Nov 26, 2025
devpatelio pushed a commit to SumanthRH/vllm that referenced this pull request Nov 29, 2025
kitaekatt pushed a commit to kitaekatt/vllm that referenced this pull request Dec 1, 2025
charlotte12l pushed a commit to charlotte12l/vllm that referenced this pull request Dec 5, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

bug Something isn't working ready ONLY add when PR is ready to merge/full CI is needed v1

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants