
Add vLLM backend for visualqna #905

Merged

poussa merged 1 commit into opea-project:main from yongfengdu:issue880 on Mar 27, 2025

Conversation

@yongfengdu
Collaborator

  • Enable vLLM for lvm-uservice.
  • Enable vLLM as the default LVM backend for VisualQnA.
  • Update READMEs.
  • Add the PT_HPUGRAPH_DISABLE_TENSOR_CACHE option for vLLM, which is necessary to serve LVM models (a configuration sketch follows this list).
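
As a rough illustration of the switch this enables (a sketch only: the chart path, the subchart toggles, and the way the environment variable is passed are assumptions for illustration, not verbatim keys from the OPEA charts), selecting the vLLM backend at install time might look like:

    # Hedged sketch: the value names below are illustrative assumptions,
    # not taken verbatim from the charts in this PR.
    helm install visualqna ./visualqna \
      --set vllm.enabled=true \
      --set tgi.enabled=false \
      --set vllm.PT_HPUGRAPH_DISABLE_TENSOR_CACHE=true  # needed to serve LVM models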

Description

A summary of the proposed changes, along with the relevant motivation and context, is given above.

Issues

#880
#877

Type of change

List the type of change as below; please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds new functionality)
  • Breaking change (fix or feature that would break existing design and interface)

Dependencies

List any newly introduced third-party dependencies, if they exist.

Tests

Verified by helm install with all available values.yaml files (see the sketch below).
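
A minimal sketch of how such a sweep could be scripted (the chart path and the values-file naming pattern are assumptions for illustration):

    # Illustrative only: render each available values file via a dry-run install.
    for f in ./visualqna/*values.yaml; do
      helm install --dry-run --generate-name ./visualqna -f "$f"
    done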

Signed-off-by: Dolpher Du <dolpher.du@intel.com>
poussa merged commit 485c880 into opea-project:main on Mar 27, 2025
19 checks passed
yongfengdu deleted the issue880 branch on April 1, 2025 at 02:31