[Feature] Omni Connector + ray supported#215
[Feature] Omni Connector + ray supported#215hsliuustc0106 merged 9 commits intovllm-project:mainfrom
Conversation
|
The account who enabled Codex for this repo no longer has access to Codex. Please contact the admins of this repo to enable Codex again. |
1. added omni mooncake connectors to distributed directory, integrate omni connector to omni-llm, rebased with main (3 times). 2. ray + mooncake connector work on 2 nodes 3. added shared memory communication to omni connector Signed-off-by: wzliu <wzliu@connect.hku.hk> Signed-off-by: wzliu <wzliu@connect.hku.hk>
5703dce to
b845342
Compare
hsliuustc0106
left a comment
There was a problem hiding this comment.
what if I have 2 instances for stage 0 and 3 instances for stage 1? How can I code such scenerio?
vllm_omni/model_executor/stage_configs/qwen2_5_omni_multiconnector.yaml
Outdated
Show resolved
Hide resolved
vllm_omni/model_executor/stage_configs/qwen2_5_omni_multiconnector.yaml
Outdated
Show resolved
Hide resolved
Gaohan123
left a comment
There was a problem hiding this comment.
Overall, the implementation makes sense. Please resolve tests problems and introduce more details in the weekly meeting. Thanks!
2. fix default threshold 3. fix connector skipping error problem and try to raise connector error while connectors not existing for edges 4. process to multi_process 5. udpate design documents 6. fix yaml Signed-off-by: wzliu <wzliu@connect.hku.hk>
Currently not well supported, we may treat each instance as a single stage because the orchestrator uses an asynchronous queue to schedule between stages. This feature can be added in the in P1. |
Signed-off-by: wzliu <wzliu@connect.hku.hk>
2. fix yaml config for multi connector Signed-off-by: wzliu <wzliu@connect.hku.hk>
0b57ce2 to
52cabf7
Compare
Signed-off-by: wzliu <wzliu@connect.hku.hk>
|
any test? |
vllm_omni/model_executor/stage_configs/qwen2_5_omni_multiconnector.yaml
Outdated
Show resolved
Hide resolved
vllm_omni/model_executor/stage_configs/qwen2_5_omni_multiconnector.yaml
Outdated
Show resolved
Hide resolved
vllm_omni/model_executor/stage_configs/qwen2_5_omni_multiconnector.yaml
Outdated
Show resolved
Hide resolved
9d5fa77 to
0a9060e
Compare
Signed-off-by: wzliu <wzliu@connect.hku.hk>
0a9060e to
05ebe57
Compare
|
I propose to add #203 Bagel as an example to test the througput improvement |
hsliuustc0106
left a comment
There was a problem hiding this comment.
a few short comment left to be fixed
Signed-off-by: wzliu <wzliu@connect.hku.hk>
Signed-off-by: wzliu <wzliu@connect.hku.hk>
26e3495 to
cdf1299
Compare
hsliuustc0106
left a comment
There was a problem hiding this comment.
lgtm, thanks for such a great job
Signed-off-by: wzliu <wzliu@connect.hku.hk> Signed-off-by: Prajwal A <prajwalanagani@gmail.com>
Signed-off-by: wzliu <wzliu@connect.hku.hk> Signed-off-by: Prajwal A <prajwalanagani@gmail.com>
Signed-off-by: wzliu <wzliu@connect.hku.hk> Signed-off-by: elijah <f1renze.142857@gmail.com>
Signed-off-by: wzliu <wzliu@connect.hku.hk> Signed-off-by: elijah <f1renze.142857@gmail.com>
Signed-off-by: wzliu <wzliu@connect.hku.hk> Signed-off-by: Fanli Lin <fanli.lin@intel.com>
Signed-off-by: wzliu <wzliu@connect.hku.hk>
Purpose
For more details, please refer to Design document
Test Plan
Test Result
Below result used Qwen2-omni, using command
Ray + omni connector
Multiprocessing + omni connector
Multiprocessing + shared memory connector
Essential Elements of an Effective PR Description Checklist
supported_models.mdandexamplesfor a new model.BEFORE SUBMITTING, PLEASE READ https://github.com/vllm-project/vllm-omni/blob/main/CONTRIBUTING.md (anything written below this line will be removed by GitHub Actions)