Motivation.
Build end-to-end (E2E) CI for vllm-omni to strengthen quality assurance. The CI should validate both online (real-time inference) and offline (batch/development) scenarios.
Proposed Change.
We will add end-to-end test cases for the Qwen3-Omni-30B-A3B-Instruct model.
The test plan and test cases are described in Sections 1 and 2 of the design document.
Implementation Roadmap & To-Do
Phase 1: Test single-modality inputs with a small number of online requests
Phase 2: Test mixed-modality inputs with a small number of online requests
Phase 3: Test mixed-modality inputs with a large number of online requests
Phase 4: Test offline inference
Phase 5: Test the chunked prefill feature for online serving
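As a rough illustration of how the phases could map onto a test matrix, here is a minimal Python sketch. Everything in it is an assumption made for illustration rather than part of the design document: the `E2ECase` dataclass, the `build_matrix` helper, the modality list, the request counts (`few`, `many`), and the choice of single-modality inputs for the offline phase are all hypothetical. It does not call any vllm-omni API.

```python
from dataclasses import dataclass
from itertools import combinations

# Assumed input modalities; the design document is the source of truth.
MODALITIES = ("text", "image", "audio", "video")


@dataclass(frozen=True)
class E2ECase:
    """One hypothetical entry in the E2E test matrix."""
    phase: int
    mode: str                 # "online" or "offline"
    modalities: tuple
    num_requests: int
    chunked_prefill: bool = False


def build_matrix(few=4, many=256):
    """Enumerate test cases for the five roadmap phases.

    `few` and `many` are placeholder request counts, not values
    taken from the design document.
    """
    cases = []

    # Phase 1: one modality at a time, small number of online requests.
    for m in MODALITIES:
        cases.append(E2ECase(1, "online", (m,), few))

    # Every combination of two or more modalities.
    mixed = [
        combo
        for size in range(2, len(MODALITIES) + 1)
        for combo in combinations(MODALITIES, size)
    ]

    # Phase 2: mixed modalities, small number of online requests.
    for combo in mixed:
        cases.append(E2ECase(2, "online", combo, few))

    # Phase 3: mixed modalities, large number of online requests.
    for combo in mixed:
        cases.append(E2ECase(3, "online", combo, many))

    # Phase 4: offline inference (single-modality coverage is an assumption).
    for m in MODALITIES:
        cases.append(E2ECase(4, "offline", (m,), few))

    # Phase 5: online serving with chunked prefill enabled.
    cases.append(E2ECase(5, "online", MODALITIES, few, chunked_prefill=True))

    return cases
```

A list like this could feed `pytest.mark.parametrize`, so that each phase is enabled in CI by filtering on `case.phase` as the roadmap progresses.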
Feedback Period.
No response
CC List.
No response
Any Other Things.
No response