[diffusion] add tp support for qwen-image and refactor some tests#830
SamitHuang merged 4 commits into vllm-project:main
Conversation
Signed-off-by: zjy0516 <[email protected]>
💡 Codex Review
Here are some automated review suggestions for this pull request.
Reviewed commit: 35a98fb248
torch.cuda.empty_cache()
device_index = torch.cuda.current_device()
monitor = GPUMemoryMonitor(device_index=device_index, interval=0.02)
monitor.start()
Reset CUDA peak stats before collecting TP memory
GPUMemoryMonitor.peak_used_mb falls back to torch.cuda.max_memory_allocated/reserved, which track process-wide peaks and are not reset by empty_cache(). Since _run_zimage_generate is invoked twice in the same process, the TP=2 run inherits the TP=1 peak and can never report a lower value even if it actually uses less memory, making the new assertion flaky. Consider calling torch.cuda.reset_peak_memory_stats(device_index) before starting the monitor (or dropping the max_memory fallback) so each run measures its own peak.
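The pitfall can be illustrated without a GPU. The stand-in class below is hypothetical and greatly simplified (the real counter is torch.cuda.max_memory_allocated, reset via torch.cuda.reset_peak_memory_stats); it just demonstrates why a process-wide high-water mark makes a second run's reading inherit the first run's peak unless it is reset between runs.

```python
class PeakTracker:
    """Simplified stand-in for a process-wide CUDA peak-memory counter."""

    def __init__(self):
        self.current_mb = 0
        self.peak_mb = 0

    def allocate(self, mb):
        self.current_mb += mb
        self.peak_mb = max(self.peak_mb, self.current_mb)

    def free(self, mb):
        # Freeing memory does NOT lower the recorded peak,
        # just as empty_cache() does not reset CUDA peak stats.
        self.current_mb -= mb

    def reset_peak(self):
        # Analogous to torch.cuda.reset_peak_memory_stats(device_index).
        self.peak_mb = self.current_mb


tracker = PeakTracker()

# First run (think TP=1): peaks at 1000 MB, then frees everything.
tracker.allocate(1000)
tracker.free(1000)
print(tracker.peak_mb)  # 1000

# Second run (think TP=2) without a reset: it only uses 600 MB,
# but the process-wide counter still reports the earlier 1000 MB peak,
# so an assertion like "TP=2 peak < TP=1 peak" can never pass.
tracker.allocate(600)
print(tracker.peak_mb)  # 1000
tracker.free(600)

# With a reset between runs, each run measures its own peak.
tracker.reset_peak()
tracker.allocate(600)
print(tracker.peak_mb)  # 600
```

In the test itself, the fix is a single torch.cuda.reset_peak_memory_stats(device_index) call between empty_cache() and monitor.start().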
Purpose
Test
- Qwen Image: image size 1024x1024
- Qwen Image Edit: input image size (1242, 1483)