[Test] Add full test for Qwen3-Omni-30B-A3B-Instruct for image and audio single modal #827

Merged: hsliuustc0106 merged 25 commits into vllm-project:main from yenuo26:full_test on Jan 21, 2026

Conversation

yenuo26 (Contributor) commented on Jan 17, 2026


Purpose

This PR introduces comprehensive testing for the image and audio single-modal capabilities of the Qwen3-Omni-30B-A3B-Instruct model.
For the design and plan, please refer to #723.

Test Plan

pytest test_qwen3_omni_expansion.py -k "test_audio" -v --html=report.html --self-contained-html --capture=sys
pytest test_qwen3_omni_expansion.py -k "test_image" -v --html=report.html --self-contained-html --capture=sys

Test Result

[test result screenshots attached to the PR]
Essential Elements of an Effective PR Description Checklist
  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan, such as providing test command.
  • The test results, such as pasting the results comparison before and after, or e2e results
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
  • (Optional) Release notes update. If your change is user facing, please update the release notes draft.


wangyu31577 and others added 19 commits January 8, 2026 21:22
Signed-off-by: wangyu31577 <[email protected]>
@chatgpt-codex-connector chatgpt-codex-connector (bot) left a comment:

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 68244ee219

@hsliuustc0106 hsliuustc0106 (Collaborator) commented:

@Bounty-hunter PTAL

@hsliuustc0106 hsliuustc0106 added the "ready label to trigger buildkite CI" label on Jan 17, 2026
audio_content = convert_audio_to_text(audio_data)
print(f"text content is: {text_content}")
print(f"audio content is: {audio_content}")
assert cosine_similarity_text(audio_content, text_content) > 0.9, (
Collaborator commented:

Why do we choose 0.9?
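
For context, a minimal sketch of what a text cosine-similarity helper like cosine_similarity_text could look like; the TF-IDF approach and the exact signature are assumptions for illustration, not necessarily what the test suite implements:

from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

def cosine_similarity_text(a: str, b: str) -> float:
    # Assumed implementation: TF-IDF bag-of-words cosine similarity in [0, 1].
    vectors = TfidfVectorizer().fit_transform([a, b])
    return float(cosine_similarity(vectors[0], vectors[1])[0, 0])

score = cosine_similarity_text("the cat sat on the mat", "the cat sat on a mat")
print(f"similarity: {score:.3f}")  # near-identical sentences score close to 1.0

Under an implementation like this, a 0.9 threshold only passes near-verbatim agreement between the audio transcription and the reference text; looser paraphrases fall below it.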

# Test single completion
api_client = client(server)
e2e_list = list()
with concurrent.futures.ThreadPoolExecutor(max_workers=num_concurrent_requests) as executor:
Collaborator commented:

Is this the way vLLM upstream tests concurrent requests?

Contributor (author) replied:

In the vLLM e2e test directory, I haven't found test cases for this scenario. Among other vLLM test cases, I observed that they use the same approach to handle concurrency; for example: https://github.com/vllm-project/vllm/blob/main/tests/cuda/test_cuda_context.py

We plan to use this method for handling small-scale concurrency in our test cases, while large-scale concurrency will be tested using benchmark tools.
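
As a reference, here is a minimal sketch of that ThreadPoolExecutor pattern for small-scale concurrency; the OpenAI-compatible client, the server URL, the prompt, and the send_request helper are illustrative assumptions rather than the exact test code:

import concurrent.futures
import time

from openai import OpenAI

# Assumed OpenAI-compatible client pointing at the locally launched server.
api_client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

def send_request(prompt):
    # Hypothetical helper: issue one chat completion and return its e2e latency.
    start = time.perf_counter()
    api_client.chat.completions.create(
        model="Qwen3-Omni-30B-A3B-Instruct",
        messages=[{"role": "user", "content": prompt}],
    )
    return time.perf_counter() - start

num_concurrent_requests = 4
e2e_list = []
with concurrent.futures.ThreadPoolExecutor(max_workers=num_concurrent_requests) as executor:
    futures = [executor.submit(send_request, "Describe the weather.") for _ in range(num_concurrent_requests)]
    for future in concurrent.futures.as_completed(futures):
        e2e_list.append(future.result())

Each future runs in its own thread, so e2e_list ends up with one end-to-end latency per request; this mirrors measuring small-scale concurrent behavior in the test while deferring larger loads to benchmark tools.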

stage_config_path = modify_stage_config(stage_config_path, deploy_config)

with OmniServer(model, ["--stage-configs-path", stage_config_path, "--stage-init-timeout", "90"]) as server:
    image_data_url = f"data:image/jpeg;base64,{generate_synthetic_image(64, 64)}"
Collaborator commented:

Why do we choose 64x64? Use at least 224x224 for better model reliability.

Contributor (author) replied:

Modified
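
For reference, a minimal sketch of what a generate_synthetic_image helper returning a base64-encoded JPEG could look like; the Pillow/NumPy approach and the exact signature are assumptions for illustration, not the test suite's actual code:

import base64
import io

import numpy as np
from PIL import Image

def generate_synthetic_image(width, height):
    # Assumed helper: random RGB pixels encoded as a base64 JPEG string.
    pixels = np.random.randint(0, 256, (height, width, 3), dtype=np.uint8)
    buffer = io.BytesIO()
    Image.fromarray(pixels, mode="RGB").save(buffer, format="JPEG")
    return base64.b64encode(buffer.getvalue()).decode("utf-8")

# 224x224 stays close to common vision-encoder input resolutions, which is
# why the review asked for at least that size instead of 64x64.
image_data_url = f"data:image/jpeg;base64,{generate_synthetic_image(224, 224)}"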

@yenuo26 yenuo26 requested a review from hsliuustc0106 January 20, 2026 01:07
@hsliuustc0106 hsliuustc0106 (Collaborator) left a comment:

lgtm

@hsliuustc0106 hsliuustc0106 merged commit 334d306 into vllm-project:main Jan 21, 2026
7 checks passed
