Conversation
b4c12e0 to
3c426be
Compare
| @@ -0,0 +1,194 @@ | |||
| import argparse | |||
There was a problem hiding this comment.
I think we can make this gradio function as a universal frontend built under vllm-omni folder,
maybe serving like:
vllm serve xxmodel --display Gradio
There was a problem hiding this comment.
Good suggestion. But the UI can vary for different models; each model needs its own build_interface function. We need a clean solution to tell model developers how to link their UI and inference input args to this universal --display gradio arg
README.md
Outdated
|
|
||
| ## Run examples (Qwen2.5-omni) | ||
|
|
||
| ### Offlince Inference |
README.md
Outdated
| Launch the gradio service: | ||
|
|
||
| ``` | ||
| python gradio_demo.py --model Qwen/Qwen2.5-Omni-7B --port 7861 |
There was a problem hiding this comment.
shall use the bash.sh instead of python script to align with the offline inference,
in addition, it seems offline inference is not paired with Gradio Demo, they should not be placed on the same level.
README.md
Outdated
| Then open `http://localhost:7861/` on the local browser. | ||
|
|
||
|
|
||
| ## Further details |
There was a problem hiding this comment.
please help delete this section
3e75b07 to
0ea59c0
Compare
|
updated based on #64 |
|
please align with PR #76 to support multimodal inputs with multiple requests |
Signed-off-by: SamitHuang <[email protected]>
Signed-off-by: SamitHuang <[email protected]>
Signed-off-by: SamitHuang <[email protected]>
Signed-off-by: samithuang <[email protected]>
Signed-off-by: samithuang <[email protected]>
Signed-off-by: samithuang <[email protected]>
Signed-off-by: samithuang <[email protected]>
Signed-off-by: samithuang <[email protected]>
Signed-off-by: samithuang <[email protected]>
Signed-off-by: samithuang <[email protected]>
Signed-off-by: samithuang <[email protected]>
Signed-off-by: samithuang <[email protected]>
Signed-off-by: samithuang <[email protected]>
ad36e63 to
07b72b0
Compare
|
rebase and update to support text + audio + image + video inputs |
Signed-off-by: samithuang <[email protected]>
Signed-off-by: SamitHuang <[email protected]>
|
@codex address that feedback |
Signed-off-by: SamitHuang <[email protected]> Signed-off-by: samithuang <[email protected]>
PLEASE FILL IN THE PR DESCRIPTION HERE ENSURING ALL CHECKLIST ITEMS (AT THE BOTTOM) HAVE BEEN CONSIDERED.
Purpose
Support gradio web service for Qwen2.5-Omni and Qwen3-Omni
Test Plan
Run
, then open http://127.0.0.1:7861/ in a browser
Test Result
Future Updates
TODOs (from demo to app level):
Essential Elements of an Effective PR Description Checklist
supported_models.mdandexamplesfor a new model.BEFORE SUBMITTING, PLEASE READ https://github.com/hsliuustc0106/vllm-omni/blob/main/CONTRIBUTING.md (anything written below this line will be removed by GitHub Actions)