Skip to content

Set vLLM as default model for FaqGen#1580

Merged
lvliang-intel merged 12 commits intoopea-project:mainfrom
XinyaoWa:faqgen_max_token
Mar 10, 2025
Merged

Set vLLM as default model for FaqGen#1580
lvliang-intel merged 12 commits intoopea-project:mainfrom
XinyaoWa:faqgen_max_token

Conversation

@XinyaoWa
Copy link
Copy Markdown
Collaborator

Description

  • Support vLLM for FaqGen
  • increase default FaqGen max tokens and make it configurable

Issues

List the issue or RFC link this PR is working on. If there is no such link, please mark it as n/a.

Type of change

List the type of change like below. Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds new functionality)
  • Breaking change (fix or feature that would break existing design and interface)
  • Others (enhancement, documentation, validation, etc.)

Dependencies

List the newly introduced 3rd party dependency if exists.

Tests

Describe the tests that you ran to verify your changes.

Make MAX_INPUT_TOKENS and MAX_TOTAL_TOKENS can be set by user, increase default MAX_TOTAL_TOKENS to 8192 in gaudi mode

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Support vllm for FaqGe to target v1.3 feature

Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
@github-actions
Copy link
Copy Markdown

github-actions bot commented Feb 21, 2025

Dependency Review

✅ No vulnerabilities or license issues found.

Scanned Files

@XinyaoWa XinyaoWa closed this Mar 5, 2025
@XinyaoWa XinyaoWa reopened this Mar 6, 2025
@joshuayao joshuayao added this to the v1.3 milestone Mar 7, 2025
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
@XinyaoWa XinyaoWa force-pushed the faqgen_max_token branch 2 times, most recently from 03ea03e to 57b56e3 Compare March 7, 2025 03:42
@XinyaoWa XinyaoWa changed the title Support vLLM for FaqGen Set vLLM as default model for FaqGen Mar 7, 2025
@lvliang-intel lvliang-intel merged commit eb245fd into opea-project:main Mar 10, 2025
18 checks passed
jedwards-habana pushed a commit to jedwards-habana/GenAIExamples that referenced this pull request Mar 11, 2025
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Signed-off-by: Edwards, James A <jaedwards@habana.ai>
chyundunovDatamonsters pushed a commit to chyundunovDatamonsters/OPEA-GenAIExamples that referenced this pull request Mar 21, 2025
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com>
chyundunovDatamonsters pushed a commit to chyundunovDatamonsters/OPEA-GenAIExamples that referenced this pull request Apr 1, 2025
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com>
chyundunovDatamonsters pushed a commit to chyundunovDatamonsters/OPEA-GenAIExamples that referenced this pull request Apr 1, 2025
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com>
chyundunovDatamonsters pushed a commit to chyundunovDatamonsters/OPEA-GenAIExamples that referenced this pull request May 16, 2025
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Signed-off-by: Chingis Yundunov <c.yundunov@datamonsters.com>
cogniware-devops pushed a commit to Cogniware-Inc/GenAIExamples that referenced this pull request Dec 19, 2025
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Signed-off-by: cogniware-devops <ambarish.desai@cogniware.ai>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants