[Model] Add Wan2.2 text-to-video support #202

Merged
hsliuustc0106 merged 18 commits into vllm-project:main from linyueqian:feat/wan2.2 on Dec 11, 2025

Conversation

@linyueqian (Contributor) commented Dec 4, 2025

Purpose

Add support for Wan2.2 text-to-video generation.

Test Plan

python examples/offline_inference/wan22/text_to_video.py \
    --prompt "Two anthropomorphic cats in comfy boxing gear and bright gloves fight intensely on a spotlighted stage." \
    --negative_prompt "色调艳丽,过曝,静态,细节模糊不清,字幕,风格,作品,画作,画面,静止,整体发灰,最差质量,低质量,JPEG压缩残留,丑陋的,残缺的,多余的手指,画得不好的手部,画得不好的脸部,畸形的,毁容的,形态畸形的肢体,手指融合,静止不动的画面,杂乱的背景,三条腿,背景人很多,倒着走" \
    --height 720 \
    --width 1280 \
    --num_frames 32 \
    --guidance_scale 4.0 \
    --guidance_scale_high 3.0 \
    --num_inference_steps 40 \
    --fps 16 \
    --output t2v_out.mp4

(English gloss of the negative prompt: garish tones, overexposed, static, blurry details, subtitles, style, artwork, painting, still image, overall gray, worst quality, low quality, JPEG compression artifacts, ugly, mutilated, extra fingers, poorly drawn hands, poorly drawn face, deformed, disfigured, malformed limbs, fused fingers, motionless frame, cluttered background, three legs, crowded background, walking backwards.)

Test Result

t2v_out.mp4



@SamitHuang (Collaborator):

Nice work. Can you try increasing --num_inference_steps from 10 to something like 100 and check whether the video quality becomes normal?

@hsliuustc0106 (Collaborator) left a comment:

After this, I suggest you try to link this with fastwan, proposed in the fastvideo project. Let's see how it can accelerate our inference and provide a solution for coordinating with fastvideo.

@linyueqian (Contributor, Author):

> Nice work. Can you try increasing --num_inference_steps from 10 to something like 100 and check whether the video quality becomes normal?

@SamitHuang I tried with 40 steps and it took about five minutes to generate.

wan22_output_50.mp4

@hsliuustc0106 (Collaborator):

> @SamitHuang I tried with 40 steps and it took about five minutes to generate.

We may need some acceleration methods to speed up generation.

@hsliuustc0106 (Collaborator):

Please add this model to supported_models.md.

import numpy as np
from diffusers.utils import export_to_video

# export_to_video expects a list of frames; unpack a (T, H, W, C) array
if isinstance(video_array, np.ndarray) and video_array.ndim == 4:
    video_array = list(video_array)

export_to_video(video_array, str(output_path), fps=16)
Collaborator:

fps can be 24 too; it's better to make it configurable via argparse.
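A minimal sketch of the requested change, assuming the example script parses its flags with argparse (--fps matches the flag in the final test command above; video_array, output_path, and export_to_video are as in the snippet under review):

import argparse

parser = argparse.ArgumentParser()
parser.add_argument("--fps", type=int, default=16,
                    help="Frames per second for the exported video (e.g. 16 or 24)")
args = parser.parse_args()

# ... generate video_array and output_path as in the snippet above, then:
export_to_video(video_array, str(output_path), fps=args.fps)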

@linyueqian (Contributor, Author):

Got it, I changed it accordingly.

@SamitHuang (Collaborator) commented Dec 8, 2025

> @SamitHuang I tried with 40 steps and it took about five minutes to generate.
>
> wan22_output_50.mp4

I think you can update the test method and result with this new video, where num_frames seems increased. BTW, the diffusers example applies a negative_prompt; we should apply it in the test as well to verify CFG works.
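For context on the CFG point: classifier-free guidance contrasts the prompt-conditioned prediction with a negative-prompt (or unconditional) prediction, so without a negative prompt the guidance path is not really exercised. A generic sketch of the combination step (illustrative only, not this repo's code):

import torch

def cfg_combine(noise_cond: torch.Tensor, noise_uncond: torch.Tensor,
                guidance_scale: float) -> torch.Tensor:
    # Push the conditioned prediction away from the negative-prompt one;
    # guidance_scale = 1.0 disables the effect.
    return noise_uncond + guidance_scale * (noise_cond - noise_uncond)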

@hsliuustc0106 (Collaborator):

This PR only supports t2v, right?

@linyueqian (Contributor, Author):

> This PR only supports t2v, right?

Yes.

@linyueqian (Contributor, Author):

> I think you can update the test method and result with this new video, where num_frames seems increased. BTW, the diffusers example applies a negative_prompt; we should apply it in the test as well to verify CFG works.

I have updated the test result in the first comment.

@hufangjian:

When will the diffusion models support TP, CFG, USP, and distVAE?

@hsliuustc0106 (Collaborator):

> When will the diffusion models support TP, CFG, USP, and distVAE?

TP/USP should be ready by the end of this month; the others are left to Q1.

@hsliuustc0106 (Collaborator):

Please add tests; refer to the qwen-image tests.

@linyueqian (Contributor, Author):

> Please add tests; refer to the qwen-image tests.

Got it. I just added a test_video_diffusion_model.py file in a similar fashion.
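For readers following along, a bare-bones shape of such an offline smoke test (purely illustrative; run_t2v_pipeline is a hypothetical stand-in for whatever vllm-omni's actual generation API is, and the qwen-image tests define the real pattern):

import numpy as np
import pytest

def run_t2v_pipeline(prompt: str, num_frames: int, height: int, width: int) -> np.ndarray:
    # Hypothetical helper: in the real test this would invoke the Wan2.2
    # pipeline the same way the qwen-image tests invoke theirs.
    raise NotImplementedError

@pytest.mark.skip(reason="sketch only; see test_video_diffusion_model.py for the real test")
def test_wan22_t2v_smoke():
    frames = run_t2v_pipeline("a cat boxing on stage", num_frames=8, height=480, width=832)
    # A smoke test mainly checks shape/dtype sanity, not visual quality.
    assert frames.ndim == 4  # (T, H, W, C)
    assert frames.shape[0] == 8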

@hsliuustc0106 (Collaborator):

I think we can get this PR merged now; later we need to open a new issue for a few TODO jobs:

  • tests should be changed accordingly; @congw729 please provide instructions for offline tests
  • support image2video & txt-image2video jobs
  • refactor examples/offline/video_generation/ so it can be used for other video generation models

@hsliuustc0106 merged commit 4128d63 into vllm-project:main on Dec 11, 2025; 4 checks passed.
@congw729 (Contributor):

> tests should be changed accordingly; @congw729 please provide instructions for offline tests

Got it.

LawJarp-A pushed a commit to LawJarp-A/vllm-omni that referenced this pull request on Dec 12, 2025.
@linyueqian deleted the feat/wan2.2 branch on December 16, 2025.
faaany pushed a commit to faaany/vllm-omni that referenced this pull request on Dec 19, 2025.
princepride pushed a commit to princepride/vllm-omni that referenced this pull request on Jan 10, 2026.
@david6666666 mentioned this pull request on Jan 16, 2026.
@pengchengneo:

Excellent work. May I ask a question: when implementing the text2video model, there should be some error accumulation due to precision conversion during the forward pass. How do you evaluate the model to ensure that the accuracy of the implementation stays consistent with the paper? For example, text models can be run on datasets like GSM8K to observe scores; how do we perform this kind of evaluation for video models? Thank you. @linyueqian @hsliuustc0106

Labels: new model

7 participants