Set vLLM as default model for FaqGen by XinyaoWa · Pull Request #1580 · opea-project/GenAIExamples

XinyaoWa · 2025-02-21T06:40:32Z

Description

Support vLLM for FaqGen
increase default FaqGen max tokens and make it configurable

Issues

List the issue or RFC link this PR is working on. If there is no such link, please mark it as n/a.

Type of change

List the type of change like below. Please delete options that are not relevant.

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds new functionality)
Breaking change (fix or feature that would break existing design and interface)
Others (enhancement, documentation, validation, etc.)

Dependencies

List the newly introduced 3rd party dependency if exists.

Tests

Describe the tests that you ran to verify your changes.

Make MAX_INPUT_TOKENS and MAX_TOTAL_TOKENS can be set by user, increase default MAX_TOTAL_TOKENS to 8192 in gaudi mode Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

Support vllm for FaqGe to target v1.3 feature Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

github-actions · 2025-02-21T06:40:46Z

Dependency Review

✅ No vulnerabilities or license issues found.

Scanned Files

FaqGen/docker_compose/intel/cpu/xeon/compose_vllm.yaml

Replace tgi with vllm Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

for more information, see https://pre-commit.ci

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

…x_token

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> Signed-off-by: Edwards, James A <jaedwards@habana.ai>

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com>

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> Signed-off-by: Chingis Yundunov <c.yundunov@datamonsters.com>

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> Signed-off-by: cogniware-devops <ambarish.desai@cogniware.ai>

XinyaoWa added 5 commits February 13, 2025 10:21

FaqGen add max token param on gaudi

41abe26

Make MAX_INPUT_TOKENS and MAX_TOTAL_TOKENS can be set by user, increase default MAX_TOTAL_TOKENS to 8192 in gaudi mode Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

Merge branch 'main' into faqgen_max_token

14d6c2b

Add max_input_token and max_output_token in UT

61763f8

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

Merge remote-tracking branch 'remotes/origin/main' into faqgen_max_token

f1c4e25

Support vllm for FaqGen

c904264

Support vllm for FaqGe to target v1.3 feature Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

joshuayao mentioned this pull request Feb 24, 2025

[Feature] vLLM enablement for 8 GenAI examples #1436

Closed

21 tasks

Merge branch 'opea-project:main' into faqgen_max_token

3b93ad8

yinghu5 requested review from chensuyue and lvliang-intel February 27, 2025 08:30

yinghu5 reviewed Mar 4, 2025

View reviewed changes

FaqGen/docker_compose/intel/cpu/xeon/compose_vllm.yaml Show resolved Hide resolved

XinyaoWa closed this Mar 5, 2025

XinyaoWa reopened this Mar 6, 2025

XinyaoWa and others added 4 commits March 6, 2025 11:30

Merge branch 'main' into faqgen_max_token

6ecc363

Set vllm as default backend service for FaqGen

d22f96b

Replace tgi with vllm Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

[pre-commit.ci] auto fixes from pre-commit.com hooks

c057699

for more information, see https://pre-commit.ci

Merge remote-tracking branch 'remotes/origin/main' into faqgen_max_token

ff6ad5b

joshuayao added this to the v1.3 milestone Mar 7, 2025

Align DATA_PATH to MODEL_CACHE

9353b2c

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>

XinyaoWa force-pushed the faqgen_max_token branch 2 times, most recently from 03ea03e to 57b56e3 Compare March 7, 2025 03:42

Merge remote-tracking branch 'origin/faqgen_max_token' into faqgen_ma…

d37813b

…x_token

XinyaoWa force-pushed the faqgen_max_token branch from 57b56e3 to d37813b Compare March 7, 2025 06:54

XinyaoWa changed the title ~~Support vLLM for FaqGen~~ Set vLLM as default model for FaqGen Mar 7, 2025

letonghan approved these changes Mar 7, 2025

View reviewed changes

lvliang-intel approved these changes Mar 10, 2025

View reviewed changes

lvliang-intel merged commit eb245fd into opea-project:main Mar 10, 2025
18 checks passed

jedwards-habana pushed a commit to jedwards-habana/GenAIExamples that referenced this pull request Mar 11, 2025

Set vLLM as default model for FaqGen (opea-project#1580)

ab9a403

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> Signed-off-by: Edwards, James A <jaedwards@habana.ai>

cogniware-devops pushed a commit to Cogniware-Inc/GenAIExamples that referenced this pull request Dec 19, 2025

Set vLLM as default model for FaqGen (opea-project#1580)

cbdf234

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> Signed-off-by: cogniware-devops <ambarish.desai@cogniware.ai>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Set vLLM as default model for FaqGen#1580

Set vLLM as default model for FaqGen#1580
lvliang-intel merged 12 commits intoopea-project:mainfrom
XinyaoWa:faqgen_max_token

XinyaoWa commented Feb 21, 2025

Uh oh!

github-actions bot commented Feb 21, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

XinyaoWa commented Feb 21, 2025

Description

Issues

Type of change

Dependencies

Tests

Uh oh!

github-actions bot commented Feb 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Dependency Review

Scanned Files

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

github-actions bot commented Feb 21, 2025 •

edited

Loading