
[Feature] Omni Connector + ray supported #215

Merged
hsliuustc0106 merged 9 commits into vllm-project:main from natureofnature:wzliu_connector_dev
Dec 10, 2025

Conversation

@natureofnature (Contributor) commented Dec 5, 2025

  1. Added Omni Mooncake connectors to the distributed directory and integrated the Omni connector into vllm-omni
  2. Added Ray support, enabling Omni stage execution across nodes
  3. Moved shared-memory communication into the Omni connector

Purpose

  1. Create a unified connector (OmniConnector) for Multimodal Full Disaggregation (Encode/Prefill/Decode/Generator), see [RFC]: OmniConnector for Multimodal Full Disaggregation (Encode/Prefill/Decode/Generator) #62
  2. Support running in a distributed environment
  3. Create relatively standalone modules for communication and distributed execution

For more details, please refer to Design document
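As a rough illustration of what a per-stage connector config expresses, a sketch is shown below. The field names here are hypothetical and may differ from the actual schema of qwen2_5_omni_multiconnector.yaml:

```yaml
# Hypothetical stage-config sketch; not the real schema used by vLLM-Omni.
stages:
  - name: encode
    connector:
      type: omni_mooncake       # cross-node transfer of encoder outputs
  - name: prefill
    connector:
      type: omni_shared_memory  # intra-node handoff to the decode stage
  - name: decode
```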

Test Plan

  1. Qwen3-omni
  2. Hunyuan image

Test Result

The results below were produced with Qwen2.5-Omni, using the command

python openai_chat_completion_client_for_multimodal_generation.py --query-type text

Ray + omni connector

vllm serve /workspace/Qwen2.5-Omni-7B/ --omni --port 8091 --worker-backend ray --ray-address auto --stage-configs-path vllm-omni/vllm_omni/model_executor/stage_configs/qwen2_5_omni_multiconnector.yaml

Multiprocessing + omni connector

vllm serve /workspace/Qwen2.5-Omni-7B/ --omni --port 8091 --stage-configs-path vllm-omni/vllm_omni/model_executor/stage_configs/qwen2_5_omni_multiconnector.yaml 

Multiprocessing + shared memory connector

vllm serve /workspace/Qwen2.5-Omni-7B/ --omni --port 8091
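The shared-memory path exercised by the command above can be illustrated in general terms. This is a generic sketch using Python's stdlib `multiprocessing.shared_memory`; the function names and segment name are illustrative, not vLLM-Omni connector APIs:

```python
# Generic sketch of a shared-memory handoff between pipeline stages.
# producer_put / consumer_get and the segment name are hypothetical.
from multiprocessing import shared_memory

def producer_put(name: str, payload: bytes) -> shared_memory.SharedMemory:
    """Create a named segment and copy the payload into it (one copy)."""
    shm = shared_memory.SharedMemory(create=True, name=name, size=len(payload))
    shm.buf[: len(payload)] = payload
    return shm

def consumer_get(name: str, size: int) -> bytes:
    """Attach to the segment by name and copy the payload back out."""
    shm = shared_memory.SharedMemory(name=name)
    data = bytes(shm.buf[:size])  # segment may be page-aligned, so slice by size
    shm.close()
    return data

if __name__ == "__main__":
    msg = b"encoder output"
    seg = producer_put("omni_demo_seg", msg)
    print(consumer_get("omni_demo_seg", len(msg)) == msg)  # True
    seg.close()
    seg.unlink()  # producer owns the segment lifetime
```

The key property this models is that only the segment name travels between stage processes; the payload itself is never serialized through a pipe or socket.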


1. Added Omni Mooncake connectors to the distributed directory, integrated the Omni connector into vllm-omni, and rebased with main (3 times)
2. Ray + Mooncake connector verified working on 2 nodes
3. Added shared-memory communication to the Omni connector
@hsliuustc0106 (Collaborator) left a comment:

What if I have 2 instances for stage 0 and 3 instances for stage 1? How can I code such a scenario?

@Gaohan123 (Collaborator) left a comment:

Overall, the implementation makes sense. Please resolve the test problems and present more details at the weekly meeting. Thanks!

2. Fix default threshold
3. Fix the connector-skipping error and raise a connector error when connectors do not exist for edges
4. process to multi_process
5. Update design documents
6. Fix yaml
@natureofnature (Contributor, Author) replied:

> What if I have 2 instances for stage 0 and 3 instances for stage 1? How can I code such a scenario?

Currently this is not well supported; we may treat each instance as a single stage, because the orchestrator uses an asynchronous queue to schedule between stages. This feature can be added in P1.
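The asynchronous-queue scheduling described above can be sketched as follows. This is a minimal illustration with one worker per stage chained by `asyncio.Queue`s; the function names are hypothetical, not the orchestrator's real API:

```python
# Minimal sketch of stage scheduling through asyncio queues.
# stage_worker / run_pipeline are illustrative names only.
import asyncio

STOP = object()  # sentinel marking the end of the request stream

async def stage_worker(fn, inbox: asyncio.Queue, outbox: asyncio.Queue):
    """Drain the inbox, apply the stage function, forward to the outbox."""
    while True:
        item = await inbox.get()
        if item is STOP:
            await outbox.put(STOP)  # propagate shutdown downstream
            break
        await outbox.put(fn(item))

async def run_pipeline(requests, stage_fns):
    """Chain stages 0..n-1 with queues and run them concurrently."""
    queues = [asyncio.Queue() for _ in range(len(stage_fns) + 1)]
    workers = [
        asyncio.create_task(stage_worker(fn, queues[i], queues[i + 1]))
        for i, fn in enumerate(stage_fns)
    ]
    for r in requests:
        await queues[0].put(r)
    await queues[0].put(STOP)
    results = []
    while True:
        out = await queues[-1].get()
        if out is STOP:
            break
        results.append(out)
    await asyncio.gather(*workers)
    return results

if __name__ == "__main__":
    # Two "stages" (e.g. encode then decode) applied to three requests.
    out = asyncio.run(run_pipeline([1, 2, 3], [lambda x: x * 10, lambda x: x + 1]))
    print(out)  # [11, 21, 31]
```

Because each stage has a single worker reading a FIFO queue here, one instance effectively equals one stage; supporting multiple instances per stage would mean fanning out several workers over the same inbox queue.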

2. Fix yaml config for multi connector
@hsliuustc0106 (Collaborator) commented:

Any tests?
@hsliuustc0106 (Collaborator) commented:

I propose adding #203 (Bagel) as an example to test the throughput improvement.

@hsliuustc0106 (Collaborator) left a comment:

A few short comments left to be fixed.
@hsliuustc0106 hsliuustc0106 enabled auto-merge (squash) December 10, 2025 11:06
@hsliuustc0106 hsliuustc0106 self-requested a review December 10, 2025 12:13
@hsliuustc0106 (Collaborator) left a comment:

LGTM, thanks for such a great job.

@hsliuustc0106 hsliuustc0106 merged commit f995211 into vllm-project:main Dec 10, 2025
4 checks passed
LawJarp-A pushed a commit to LawJarp-A/vllm-omni that referenced this pull request Dec 12, 2025
e1ijah1 pushed a commit to e1ijah1/vllm-omni that referenced this pull request Dec 14, 2025
faaany pushed a commit to faaany/vllm-omni that referenced this pull request Dec 19, 2025
@amy-why-3459 mentioned this pull request Dec 27, 2025
princepride pushed a commit to princepride/vllm-omni that referenced this pull request Jan 10, 2026

Development

Successfully merging this pull request may close these issues:
  • [RFC]: OmniConnector for Multimodal Full Disaggregation (Encode/Prefill/Decode/Generator)
  • [New Model]: ByteDance-Seed/BAGEL-7B-MoT