[Feature] Support Omni serving abort request#486
hsliuustc0106 merged 25 commits into vllm-project:main
Signed-off-by: ZeldaHuang <hzm414167@alibaba-inc.com>
please resolve conflicts after #391 is merged
b819a36 to 2894e47
@hsliuustc0106 Done
fix ci please
@yinpeiqi PTAL
lgtm. Do we need a test case for the abort operation?
@hsliuustc0106 added
@yinpeiqi PTAL for AsyncOmni
[2025-12-30T12:14:22Z] WARNING 12-30 12:14:22 [omni.py:292] [AsyncOrchestrator] Initialization timeout: only 0/1 stages are ready; not ready: [0].
This looks like a timeout problem. How long do you wait for engine initialization in the current online case?
@hsliuustc0106 It seems the simple unit test job only has a CPU environment, so I moved the async_omni test to the e2e tests.
ORZ, I just realized it. |
Signed-off-by: Hongsheng Liu <liuhongsheng4@huawei.com>
Signed-off-by: ZeldaHuang <hzm414167@alibaba-inc.com>
Signed-off-by: Hongsheng Liu <liuhongsheng4@huawei.com>
Co-authored-by: Hongsheng Liu <liuhongsheng4@huawei.com>
Purpose
ref #419
Changes:
OmniStageTaskType indicates the type of task sent to a stage's in_q. For now there are three task types: SHUTDOWN, GENERATE, ABORT
Test Plan
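To illustrate the three task types listed under Changes, here is a minimal sketch of a stage loop that reads tasks from its in_q and honors ABORT. It is not the code from this PR: only the enum name and its three member names come from the description, while the string values, the dict-shaped task payload, the `request_id` key, and the `run_stage` helper are assumptions made up for the example.

```python
import enum
import queue


class OmniStageTaskType(enum.Enum):
    # Member names follow the PR description; the string values are assumed.
    GENERATE = "generate"
    ABORT = "abort"
    SHUTDOWN = "shutdown"


def run_stage(in_q):
    """Hypothetical stage loop: drain in_q until SHUTDOWN arrives.

    Returns the ids of GENERATE requests that were not aborted.
    """
    pending = []      # request ids received via GENERATE, in arrival order
    aborted = set()   # request ids cancelled via ABORT
    while True:
        task = in_q.get()
        kind = task["type"]
        if kind is OmniStageTaskType.SHUTDOWN:
            break
        if kind is OmniStageTaskType.ABORT:
            aborted.add(task["request_id"])
        elif kind is OmniStageTaskType.GENERATE:
            pending.append(task["request_id"])
    return [rid for rid in pending if rid not in aborted]


# Usage: two requests are submitted, the first is aborted before shutdown.
in_q = queue.Queue()
in_q.put({"type": OmniStageTaskType.GENERATE, "request_id": "req-1"})
in_q.put({"type": OmniStageTaskType.GENERATE, "request_id": "req-2"})
in_q.put({"type": OmniStageTaskType.ABORT, "request_id": "req-1"})
in_q.put({"type": OmniStageTaskType.SHUTDOWN})
print(run_stage(in_q))  # ['req-2']
```

A real stage would cancel in-flight work when ABORT arrives rather than filtering at the end; the sketch only shows how a single queue can carry all three task types.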
Test Result