[Bugfix] Fix generation artifacts of Qwen-Image-Edit-2511 and update pipeline DiT param parsing #776

Merged
ZJY0516 merged 9 commits into vllm-project:main from SamitHuang:fix_pipeline_dit
Jan 15, 2026

Conversation

@SamitHuang
Collaborator

@SamitHuang SamitHuang commented Jan 14, 2026


Purpose

Fixes #675

Test Plan

cd vllm-omni/examples/offline_inference/image_to_image
python image_edit.py \
    --model /home/yx/models/Qwen/Qwen-Image-Edit-2511 \
    --image "readme_cn.png" \
    --prompt "Make the girl in the image put her hands down." \
    --output output_image_edit.png \
    --num_inference_steps 50

Test Result

Before:

output_seed0

After:
output_image_edit new




@chatgpt-codex-connector chatgpt-codex-connector bot left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 8ac473c1eb


@SamitHuang SamitHuang changed the title [Bugfix] Fix Qwen-Image-Edit generation precision and update pipeline DiT param parsing [Bugfix] Fix generation precision in Qwen-Image-Edit-2512 and update pipeline DiT param parsing Jan 14, 2026
@SamitHuang SamitHuang requested a review from ZJY0516 January 14, 2026 04:46
@SamitHuang SamitHuang changed the title [Bugfix] Fix generation precision in Qwen-Image-Edit-2512 and update pipeline DiT param parsing [Bugfix] Fix generation artifacts of Qwen-Image-Edit-2512 and update pipeline DiT param parsing Jan 14, 2026
Signed-off-by: samithuang <[email protected]>
Signed-off-by: samithuang <[email protected]>
Signed-off-by: samithuang <[email protected]>
@SamitHuang SamitHuang changed the title [Bugfix] Fix generation artifacts of Qwen-Image-Edit-2512 and update pipeline DiT param parsing [Bugfix] Fix generation artifacts of Qwen-Image-Edit-2511 and update pipeline DiT param parsing Jan 14, 2026
@david6666666 david6666666 added this to the v0.14.0rc1 milestone Jan 14, 2026
@SamitHuang SamitHuang added the ready label to trigger buildkite CI label Jan 14, 2026
@jiangmengyu18
Contributor

jiangmengyu18 commented Jan 14, 2026

@SamitHuang
AdaLayerNorm actually supports modulate_index; it was just omitted during the earlier adaptation. You can fix this bug like:

self.img_norm1 = AdaLayerNorm(dim, elementwise_affine=False, eps=eps)
...

img_modulated, img_gate1 = self.img_norm1(hidden_states, img_mod1, modulate_index)
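For context, the per-stream selection this suggestion relies on can be sketched as follows. This is a hypothetical stand-in, not the actual vllm-omni AdaLayerNorm: it assumes `mod` packs (shift, scale, gate) along the last dimension and that `modulate_index` selects one row of a batched modulation tensor.

```python
import torch
import torch.nn as nn


class AdaLayerNormSketch(nn.Module):
    """Hypothetical sketch of an AdaLayerNorm that honors modulate_index."""

    def __init__(self, dim: int, eps: float = 1e-6, elementwise_affine: bool = False):
        super().__init__()
        self.norm = nn.LayerNorm(dim, eps=eps, elementwise_affine=elementwise_affine)

    def forward(self, x, mod, modulate_index=None):
        # mod packs (shift, scale, gate) along the last dim: (..., 3 * dim).
        shift, scale, gate = mod.chunk(3, dim=-1)
        if modulate_index is not None:
            # Pick the modulation row for this stream when mod carries
            # entries for several streams stacked along dim 0.
            shift, scale, gate = shift[modulate_index], scale[modulate_index], gate[modulate_index]
        # Standard adaptive layer norm: normalize, then scale and shift.
        x = self.norm(x) * (1 + scale) + shift
        return x, gate


m = AdaLayerNormSketch(8)
x = torch.randn(2, 4, 8)
mod = torch.randn(3, 1, 24)  # modulation for 3 streams
out, gate = m(x, mod, modulate_index=1)
```

Dropping `modulate_index` silently applies the wrong (or broadcast) modulation row, which matches the kind of subtle artifacts reported in #675.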

Contributor

@gcanlin gcanlin left a comment


Teacache should also change the corresponding parameter.

img_modulated, _ = block.img_norm1(hidden_states, img_mod1)

@ZJY0516 ZJY0516 merged commit 2d5faf3 into vllm-project:main Jan 15, 2026
7 checks passed
@SamitHuang
Collaborator Author

Teacache should also change the corresponding parameter.

img_modulated, _ = block.img_norm1(hidden_states, img_mod1)

@yuanheng-zhao Currently qwen-image-edit with tea-cache still has slight artifacts. I tried to add modulate_index in tea_cache as well, but got the following error. Can you PTAL?

[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]   File "/home/yx/vllm-omni/vllm_omni/diffusion/worker/gpu_worker.py", line 150, in execute_model
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]     output = self.pipeline.forward(req)
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]              ^^^^^^^^^^^^^^^^^^^^^^^^^^
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]   File "/home/yx/vllm-omni/vllm_omni/diffusion/models/qwen_image/pipeline_qwen_image_edit_plus.py", line 809, in forward
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]     latents = self.diffuse(
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]               ^^^^^^^^^^^^^
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]   File "/home/yx/vllm-omni/vllm_omni/diffusion/models/qwen_image/pipeline_qwen_image_edit_plus.py", line 599, in diffuse
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]     noise_pred = self.transformer(
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]                  ^^^^^^^^^^^^^^^^^
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]   File "/home/yx/vllm-omni/.venv/lib/python3.12/site-packages/torch/nn/modules/module.py", line 1775, in _wrapped_call_impl
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]     return self._call_impl(*args, **kwargs)
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]   File "/home/yx/vllm-omni/.venv/lib/python3.12/site-packages/torch/nn/modules/module.py", line 1786, in _call_impl
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]     return forward_call(*args, **kwargs)
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]   File "/home/yx/vllm-omni/vllm_omni/diffusion/hooks.py", line 58, in __call__
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]     return registry.dispatch(*args, **kwargs)
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]   File "/home/yx/vllm-omni/vllm_omni/diffusion/hooks.py", line 97, in dispatch
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]     return hook.new_forward(self.module, *args, **kwargs)
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]   File "/home/yx/vllm-omni/vllm_omni/diffusion/cache/teacache/hook.py", line 161, in new_forward
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]     outputs = ctx.run_transformer_blocks()
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]   File "/home/yx/vllm-omni/vllm_omni/diffusion/cache/teacache/extractors.py", line 234, in run_transformer_blocks
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]     e, h = block(
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]            ^^^^^^
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]   File "/home/yx/vllm-omni/.venv/lib/python3.12/site-packages/torch/nn/modules/module.py", line 1773, in _wrapped_call_impl
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]     return self._compiled_call_impl(*args, **kwargs)  # type: ignore[misc]
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
...
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]   File "/home/yx/vllm-omni/.venv/lib/python3.12/site-packages/torch/_subclasses/fake_tensor.py", line 1376, in __torch_dispatch__
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]     return self.dispatch(func, types, args, kwargs)
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]   File "/home/yx/vllm-omni/.venv/lib/python3.12/site-packages/torch/_subclasses/fake_tensor.py", line 2096, in dispatch
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]     return self._cached_dispatch_impl(func, types, args, kwargs)
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]   File "/home/yx/vllm-omni/.venv/lib/python3.12/site-packages/torch/_subclasses/fake_tensor.py", line 1511, in _cached_dispatch_impl
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]     output = self._dispatch_impl(func, types, args, kwargs)
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]   File "/home/yx/vllm-omni/.venv/lib/python3.12/site-packages/torch/_subclasses/fake_tensor.py", line 2639, in _dispatch_impl
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]     decomposition_table[func](*args, **kwargs)
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]   File "/home/yx/vllm-omni/.venv/lib/python3.12/site-packages/torch/_prims_common/wrappers.py", line 309, in _fn
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]     result = fn(*args, **kwargs)
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]              ^^^^^^^^^^^^^^^^^^^
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]   File "/home/yx/vllm-omni/.venv/lib/python3.12/site-packages/torch/_compile.py", line 53, in inner
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]     return disable_fn(*args, **kwargs)
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]            ^^^^^^^^^^^^^^^^^^^^^^^^^^^
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]   File "/home/yx/vllm-omni/.venv/lib/python3.12/site-packages/torch/_dynamo/eval_frame.py", line 1044, in _fn
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]     return fn(*args, **kwargs)
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]            ^^^^^^^^^^^^^^^^^^^
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]   File "/home/yx/vllm-omni/.venv/lib/python3.12/site-packages/torch/_prims_common/wrappers.py", line 149, in _fn
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]     result = fn(**bound.arguments)
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]              ^^^^^^^^^^^^^^^^^^^^^
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]   File "/home/yx/vllm-omni/.venv/lib/python3.12/site-packages/torch/_refs/__init__.py", line 2920, in cat
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]     return prims.cat(filtered, dim).clone(memory_format=memory_format)
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]            ^^^^^^^^^^^^^^^^^^^^^^^^
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]   File "/home/yx/vllm-omni/.venv/lib/python3.12/site-packages/torch/_ops.py", line 841, in __call__
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]     return self._op(*args, **kwargs)
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]            ^^^^^^^^^^^^^^^^^^^^^^^^^
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]   File "/home/yx/vllm-omni/.venv/lib/python3.12/site-packages/torch/utils/_stats.py", line 28, in wrapper
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]     return fn(*args, **kwargs)
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]            ^^^^^^^^^^^^^^^^^^^

[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]   File "/home/yx/vllm-omni/.venv/lib/python3.12/site-packages/torch/_subclasses/fake_tensor.py", line 1376, in __torch_dispatch__
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]     return self.dispatch(func, types, args, kwargs)
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]   File "/home/yx/vllm-omni/.venv/lib/python3.12/site-packages/torch/_subclasses/fake_tensor.py", line 2096, in dispatch
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]     return self._cached_dispatch_impl(func, types, args, kwargs)
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]   File "/home/yx/vllm-omni/.venv/lib/python3.12/site-packages/torch/_subclasses/fake_tensor.py", line 1511, in _cached_dispatch_impl
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]     output = self._dispatch_impl(func, types, args, kwargs)
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]   File "/home/yx/vllm-omni/.venv/lib/python3.12/site-packages/torch/_subclasses/fake_tensor.py", line 2661, in _dispatch_impl
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]     func.prim_meta_impl(*args, **kwargs)
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]   File "/home/yx/vllm-omni/.venv/lib/python3.12/site-packages/torch/_prims/__init__.py", line 1796, in _cat_meta
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]     torch._check(
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]   File "/home/yx/vllm-omni/.venv/lib/python3.12/site-packages/torch/__init__.py", line 1695, in _check
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]     _check_with(RuntimeError, cond, message)
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]   File "/home/yx/vllm-omni/.venv/lib/python3.12/site-packages/torch/__init__.py", line 1677, in _check_with
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]     raise error_type(message_evaluated)
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309] torch._dynamo.exc.TorchRuntimeError: Dynamo failed to run FX node with fake tensors: call_function <built-in method cat of type object at 0x7fffe99e1c40>(*([FakeTensor(..., device='cuda:0', size=(1, s31, 24, 128), dtype=torch.bfloat16), FakeTensor(..., device='cuda:0', size=(0, s87, 24, 128), dtype=torch.bfloat16)],), **{'dim': 1}): got RuntimeError('Sizes of tensors must match except in dimension 1. Expected 1 in dimension 0 but got 0 for tensor number 1 in the list')
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309] from user code:
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]    File "/home/yx/vllm-omni/vllm_omni/diffusion/models/qwen_image/qwen_image_transformer.py", line 423, in torch_dynamo_resume_in_forward_at_384
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]     joint_query = torch.cat([txt_query, img_query], dim=1)
[Stage-0] ERROR 01-14 17:23:24 [gpu_worker.py:309]

@ZJY0516
Collaborator

ZJY0516 commented Jan 15, 2026

torch._dynamo.exc.TorchRuntimeError: Dynamo failed to run FX node with fake tensors: call_function <built-in method cat of type object at 0x7fffe99e1c40>(*([FakeTensor(..., device='cuda:0', size=(1, s31, 24, 128), dtype=torch.bfloat16), FakeTensor(..., device='cuda:0', size=(0, s87, 24, 128), dtype=torch.bfloat16)],), **{'dim': 1}): got RuntimeError('Sizes of tensors must match except in dimension 1. Expected 1 in dimension 0 but got 0 for tensor number 1 in the list')

Perhaps it's related to torch.compile.
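The mismatch in the trace (batch size 1 vs. 0 in dimension 0) is actually reproducible in eager PyTorch, independent of torch.compile. A minimal sketch, assuming shapes like those in the FakeTensor report; the guard at the end is one hypothetical workaround, not the fix adopted in this PR:

```python
import torch

# Shapes mirror the FakeTensor report: txt_query has batch 1, img_query batch 0.
txt_query = torch.randn(1, 5, 24, 128)
img_query = torch.randn(0, 7, 24, 128)

# torch.cat requires all sizes to match except along the cat dimension,
# so the 1-vs-0 mismatch in dim 0 raises even in eager mode.
try:
    torch.cat([txt_query, img_query], dim=1)
    cat_error = ""
except RuntimeError as err:
    cat_error = str(err)

# Hypothetical guard: drop empty batch slices before concatenating.
parts = [q for q in (txt_query, img_query) if q.shape[0] > 0]
joint_query = parts[0] if len(parts) == 1 else torch.cat(parts, dim=1)
```

This suggests the zero-sized `img_query` (likely produced by the teacache extractor slicing an empty stream) is the root cause, and torch.compile merely surfaces it through the fake-tensor meta check.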

GG-li pushed a commit to GG-li/vllm-omni that referenced this pull request Jan 15, 2026
@yuanheng-zhao
Contributor

yuanheng-zhao commented Jan 16, 2026

Teacache should also change the corresponding parameter.

img_modulated, _ = block.img_norm1(hidden_states, img_mod1)

@yuanheng-zhao Currently qwen-image-edit with tea-cache still has slight artifacts. I tried to add modulate_index in tea_cache as well, but got the following error. Can you PTAL?

[full stack trace quoted from the comment above omitted]

Hey @SamitHuang, may I have the command you ran that produced the torch.compile exception? I added modulate_index in extract_qwen_context and tested both image edit (Qwen-Image-Edit) and multi-image edit (Qwen-Image-Edit-2509) with teacache enabled, but didn't reproduce the error.

erfgss pushed a commit to erfgss/vllm-omni that referenced this pull request Jan 19, 2026
…pipeline DiT param parsing (vllm-project#776)

Signed-off-by: samithuang <[email protected]>
Signed-off-by: Chen Yang <[email protected]>
with1015 pushed a commit to with1015/vllm-omni that referenced this pull request Jan 20, 2026

Labels

ready label to trigger buildkite CI


Development

Successfully merging this pull request may close these issues.

[Bug]: Qwen-Image-Edit-2511 Inference Results Are Abnormal on Ascend NPU

7 participants