Skip to content

[RFC]: Add E2E full test for Qwen3-Omni-30B-A3B-Instruct #723

@yenuo26

Description

@yenuo26

Motivation.

Build E2E CI for vllm-omni to strengthen quality protection. This update ensures robust validation for both online (real-time inference) and offline (batch/development) scenarios.

Proposed Change.

We will add end-to-end test cases for the Qwen3-Omni-30B-A3B-Instruct model.
The test plan and test cases are visible in Section 1 and 2 of the design document.

Implementation Roadmap & To-Do

Phase 1: Test Single Modal + a few request for online


Phase 2: Test mix Modal + a few request for online

  • Submit the test cases for this phase

Phase 3: Test mix Modal + A large number of requests for online

  • Submit vllm online benchmark
  • Submit the test cases for this phase

Phase 4: Test offline

  • Submit the test cases for this phase

Phase 5: Test chunk prefill feature for online

  • Submit the test cases for this phase

Feedback Period.

No response

CC List.

No response

Any Other Things.

No response

Before submitting a new issue...

  • Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions