
[BugFix] Fix Qwen3 Omni talker mtp torch.compile startup error#1104

Merged
hsliuustc0106 merged 6 commits into vllm-project:main from ZeldaHuang:fix/talkermtp_torch_compile
Jan 30, 2026

Conversation

@ZeldaHuang
Contributor


Purpose

Refs #1048 and #1102.
The talker MTP module hit a startup error when using torch.compile with a small max_batch_size (1 or 2).

  • Changed position_ids to be computed dynamically via torch.arange().repeat(), avoiding the batch-size specialization issue while preserving correct behavior (credit to @ram16g)
  • Aligned the talker MTP buffer size with the maximum CUDA graph capture size (which may be larger than max_num_seqs)
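As a rough sketch of the first change (function name is hypothetical, not the actual vLLM-omni code), position_ids is now built on the fly from the runtime batch size rather than sliced out of a pre-allocated buffer:

```python
import torch

def compute_position_ids(batch_size: int, seq_len: int, device=None) -> torch.Tensor:
    # torch.arange().repeat() derives the output shape from the runtime
    # batch_size, so Dynamo can trace it as a dynamic dimension instead of
    # specializing it to a constant, as slicing a fixed buffer can do.
    return torch.arange(seq_len, device=device).repeat(batch_size, 1)
```

Each row is simply [0, 1, ..., seq_len - 1], matching what a pre-computed buffer would have held for that batch entry.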

Test Plan

Test Result

Confirmed working with small max_batch_size values (1, 2, and 3).



ram16g and others added 2 commits January 30, 2026 14:40
…predictor

The original code used a pre-computed position_ids_buffer with batch_size-dependent
slicing, which caused torch.compile to specialize batch_size as a constant. This
conflicted with vLLM's @support_torch_compile decorator, which marks batch_size as
dynamic, resulting in a ConstraintViolationError.

Changed to dynamically compute position_ids using torch.arange().repeat() to avoid
the specialization issue while maintaining correct behavior.

Signed-off-by: ram16g <[email protected]>
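To illustrate the difference described in the commit message (a minimal sketch with hypothetical names, not the code from this PR): slicing a pre-computed buffer ties the output shape to a Python int that Dynamo may guard on, while the dynamic version does not. In eager mode the two are behaviorally identical:

```python
import torch

MAX_BATCH, SEQ_LEN = 8, 4

# Original pattern: pre-computed buffer, sliced by the runtime batch_size.
position_ids_buffer = torch.arange(SEQ_LEN).repeat(MAX_BATCH, 1)

def static_ids(batch_size: int) -> torch.Tensor:
    # Slicing a fixed buffer can lead torch.compile to burn batch_size
    # in as a constant, conflicting with a dynamic-dim marking.
    return position_ids_buffer[:batch_size]

def dynamic_ids(batch_size: int) -> torch.Tensor:
    # Fixed pattern: computed per call, with no fixed-size buffer involved.
    return torch.arange(SEQ_LEN).repeat(batch_size, 1)

# Identical results eagerly; they differ only in how torch.compile
# treats batch_size during tracing.
assert torch.equal(static_ids(3), dynamic_ids(3))
```

The assertion passes in eager mode for any batch_size up to MAX_BATCH; the specialization problem only surfaces once the module is compiled with batch_size marked dynamic.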
@david6666666 david6666666 added this to the v0.14.0 milestone Jan 30, 2026
@hsliuustc0106 hsliuustc0106 added the ready label to trigger buildkite CI label Jan 30, 2026
@david6666666 david6666666 linked an issue Jan 30, 2026 that may be closed by this pull request
Collaborator

@hsliuustc0106 hsliuustc0106 left a comment


lgtm

@hsliuustc0106 hsliuustc0106 merged commit f6cfc0d into vllm-project:main Jan 30, 2026
7 checks passed
@gcanlin gcanlin mentioned this pull request Jan 30, 2026
dongbo910220 pushed a commit to dongbo910220/vllm-omni that referenced this pull request Feb 1, 2026
…project#1104)

Signed-off-by: ram16g <[email protected]>
Signed-off-by: ZeldaHuang <[email protected]>
Co-authored-by: ram16g <[email protected]>
Co-authored-by: Hongsheng Liu <[email protected]>

Labels

ready label to trigger buildkite CI


Development

Successfully merging this pull request may close these issues.

[Bug]: qwen3-omni realtime audio return random voice and noise

4 participants