[BugFix] Fix modulate_index shape error in Qwen-Image-Edit Task#1100

Merged
ZJY0516 merged 87 commits into vllm-project:main from mxuax:Non-Intrusive-SP
Jan 30, 2026
Conversation

@mxuax
Contributor

@mxuax mxuax commented Jan 30, 2026

This PR fixes issue #1094.

Purpose

Fix modulate_index shape error in Qwen-Image-Edit Task
The bug occurs because zero_cond_t is true during editing, and the existing sp_plan did not include sharding for the modulate_index tensor created on that path. This PR adds a new submodule so the sharding plan can cover it.
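For context, the core of the fix is moving the inline modulate_index creation into its own submodule so that the sequence-parallel plan, which attaches sharding hooks by submodule name, can intercept its output. Below is a minimal pure-Python sketch of that pattern; the names shard_seq and run_module, the plan layout, and the index semantics are illustrative assumptions, not the actual vllm-omni API.

```python
# Sketch: a sequence-parallel plan that matches tensors by the *submodule*
# that produced them cannot shard a tensor created inline in forward().
# Wrapping the creation in a named submodule makes it visible to the plan.

def shard_seq(seq, rank, world_size):
    # Evenly split a sequence and return this rank's chunk.
    chunk = len(seq) // world_size
    return seq[rank * chunk:(rank + 1) * chunk]

class ModulateIndexPrepare:
    # Hypothetical submodule: builds a per-token modulation index.
    def __call__(self, seq_len, zero_cond_t):
        # 1 marks tokens taking the zero-conditioning timestep path.
        return [1 if zero_cond_t else 0] * seq_len

# Plan keyed by submodule name: outputs of listed modules get sharded.
sp_plan = {"modulate_index_prepare": shard_seq}

def run_module(name, module, rank, world_size, *args):
    out = module(*args)
    hook = sp_plan.get(name)
    return hook(out, rank, world_size) if hook is not None else out

prep = ModulateIndexPrepare()
full = prep(8, True)  # inline creation: bypasses the plan, keeps full length
local = run_module("modulate_index_prepare", prep, 0, 2, 8, True)  # sharded
```

Before the fix, the index was built inline and kept the full sequence length while the hidden states were already sharded, producing the shape mismatch reported in #1094.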

Test Plan

Command:
CUDA_VISIBLE_DEVICES=0,1,2,3 python examples/offline_inference/image_to_image/image_edit.py --model "Qwen/Qwen-Image-Edit-2511" --image input.png --prompt "Add a sunset sky background" --output output_u2r2.png --ulysses_degree 2 --ring_degree 2 --enforce_eager --num_inference_steps 20

Input image: (attached image)

Test Result

The run succeeds and produces the correct output image. (attached image)


Essential Elements of an Effective PR Description Checklist
  • [x] The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • [x] The test plan, such as providing test commands.
  • [x] The test results, such as pasting the results comparison before and after, or e2e results.
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
  • (Optional) Release notes update. If your change is user-facing, please update the release notes draft.


mxuax and others added 30 commits January 14, 2026 15:06
Signed-off-by: mxuax <[email protected]>
Signed-off-by: mxuax <[email protected]>
Signed-off-by: mxuax <[email protected]>
Removed context parallelism plan and related comments.

Signed-off-by: XU Mingshi <[email protected]>
Signed-off-by: XU Mingshi <[email protected]>
Signed-off-by: mxuax <[email protected]>
Signed-off-by: XU Mingshi <[email protected]>
…e lengths

- Add sp_attention_mask, sp_padding_size, sp_original_seq_len to ForwardContext
- Add auto_pad option to SequenceParallelInput
- Implement _shard_with_auto_pad in SequenceParallelSplitHook
- Update SequenceParallelGatherHook to remove padding
- Update QwenImage _sp_plan with auto_pad=True
- Update QwenImageCrossAttention to use sp_attention_mask

Signed-off-by: mxuax <[email protected]>
Signed-off-by: mxuax <[email protected]>
Signed-off-by: mxuax <[email protected]>
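The auto-pad sharding described in the commit messages above can be sketched in plain Python. The function names, the padding convention, and the returned metadata here are assumptions for illustration; they mirror _shard_with_auto_pad and the gather-side unpadding only in spirit.

```python
def shard_with_auto_pad(seq, rank, world_size, pad_value=0):
    # Pad `seq` to a multiple of world_size, then return this rank's shard
    # plus the padding size (needed later to strip the pad after gathering).
    pad = (-len(seq)) % world_size
    padded = seq + [pad_value] * pad
    chunk = len(padded) // world_size
    return padded[rank * chunk:(rank + 1) * chunk], pad

def gather_and_unpad(shards, pad):
    # Concatenate shards from all ranks and drop the trailing padding.
    out = [x for shard in shards for x in shard]
    return out[:len(out) - pad] if pad else out

seq = list(range(7))  # length 7, world_size 4 -> padded to length 8
shards, pads = zip(*(shard_with_auto_pad(seq, r, 4) for r in range(4)))
restored = gather_and_unpad(list(shards), pads[0])
```

In the real implementation the padding size is stashed in the ForwardContext (sp_padding_size) so the gather hook knows how much to strip.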
@mxuax mxuax requested a review from hsliuustc0106 as a code owner January 30, 2026 04:15
@mxuax mxuax changed the title Fix modulate_index shape error in Qwen-Image-Edit Task [BugFix] Fix modulate_index shape error in Qwen-Image-Edit Task Jan 30, 2026
@chatgpt-codex-connector

💡 Codex Review

if not HAS_FLASH_ATTN:
    raise ImportError(
        "FlashAttentionBackend requires Flash Attention. "
        "Please install one of: fa3-fwd, flash-attention, or flash-attn. "
        "Otherwise, use SDPA backend by setting DIFFUSION_ATTENTION_BACKEND=TORCH_SDPA"
    )

P1: Gate FlashAttention import error on CUDA/ROCm only

Raising ImportError at module import time makes the FLASH_ATTN backend unusable on NPU, even when mindiesd is present. The NPU platform selects FLASH_ATTN when mindiesd is available, but HAS_FLASH_ATTN is computed from FA2/FA3 CUDA/ROCm packages, which are typically absent on NPU, so importing this module now fails before the forward_npu path can be used. This breaks attention on NPU deployments that previously worked; consider deferring the check to forward_cuda or guarding it with a platform check so NPU can still load the backend.
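One way to address this, as the review suggests, is to defer the availability check from import time to the CUDA forward path. A hypothetical sketch of that pattern, where HAS_FLASH_ATTN and the backend class stand in for the real module and the forward methods are placeholders:

```python
# Sketch: instead of raising ImportError at module import time, check the
# optional dependency only on the code path that actually needs it.

HAS_FLASH_ATTN = False  # stand-in: FA2/FA3 packages absent (e.g. on NPU)

class FlashAttentionBackend:
    # Importing/constructing this class succeeds even without flash-attn,
    # so NPU deployments that use forward_npu can still load the backend.

    def forward_cuda(self, *args):
        if not HAS_FLASH_ATTN:
            raise ImportError(
                "FlashAttentionBackend requires Flash Attention on CUDA/ROCm. "
                "Install flash-attn, or set DIFFUSION_ATTENTION_BACKEND=TORCH_SDPA."
            )
        # real flash-attention call would go here

    def forward_npu(self, *args):
        return "npu-path"  # e.g. dispatch to mindiesd; placeholder here

backend = FlashAttentionBackend()  # no import-time failure
out = backend.forward_npu()        # NPU path still works
```

A platform guard at import time (only raising when the selected platform is CUDA/ROCm) would achieve the same effect.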


@mxuax
Contributor Author

mxuax commented Jan 30, 2026

This BugFix is ready. @ZJY0516 @hsliuustc0106

@hsliuustc0106 hsliuustc0106 added the ready label to trigger buildkite CI label Jan 30, 2026
@hsliuustc0106
Collaborator

please add the benchmark results from benchmark/diffusion folder

Contributor

Copilot AI left a comment


Pull request overview

This PR fixes a shape mismatch error in the Qwen-Image-Edit task when using sequence parallelism (USP=2 or higher). The bug occurred because the modulate_index tensor was not being sharded correctly when zero_cond_t is enabled in image editing models.

Changes:

  • Introduced ModulateIndexPrepare module to encapsulate modulate_index creation logic and enable proper sequence parallel sharding
  • Updated _sp_plan configuration to shard the modulate_index output
  • Refactored the forward method to use the new module instead of inline tensor creation


@hsliuustc0106 hsliuustc0106 linked an issue Jan 30, 2026 that may be closed by this pull request
@ZJY0516
Collaborator

ZJY0516 commented Jan 30, 2026

I think this is what we need to catch in CI

Contributor

@gcanlin gcanlin left a comment


LGTM. It works on NPU as well.

@ZJY0516 ZJY0516 enabled auto-merge (squash) January 30, 2026 06:08
@mxuax
Contributor Author

mxuax commented Jan 30, 2026

> please add the benchmark results from benchmark/diffusion folder

This PR fixes the bug for the edit task, while that benchmark covers text-to-image (t2i), so I don't expect it to affect the benchmark results.

@ZJY0516 ZJY0516 merged commit be835c4 into vllm-project:main Jan 30, 2026
6 of 7 checks passed
dongbo910220 pushed a commit to dongbo910220/vllm-omni that referenced this pull request Feb 1, 2026

Labels

ready label to trigger buildkite CI

Development

Successfully merging this pull request may close these issues.

[Bug]: Qwen-Image-Edit-2511 USP=2 failed

4 participants