Skip to content
This repository was archived by the owner on Sep 4, 2025. It is now read-only.

Conversation

@dtrifiro
Copy link

No description provided.

@openshift-ci openshift-ci bot requested review from Xaenalt and rpancham May 21, 2024 08:23
Copy link

@z103cb z103cb left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

/approve

@openshift-ci
Copy link

openshift-ci bot commented May 21, 2024

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: dtrifiro, z103cb

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@z103cb z103cb enabled auto-merge May 21, 2024 08:44
@z103cb z103cb merged commit 255735f into opendatahub-io:ibm_main May 21, 2024
dtrifiro pushed a commit that referenced this pull request May 23, 2024
Install and configure use of the NCCL version recommended by vLLM via
the [vllm-nccl](https://github.com/vllm-project/vllm-nccl) package. The
install is a little wonky... but this set of changes should work.

Signed-off-by: Travis Johnson <[email protected]>
Xaenalt pushed a commit that referenced this pull request Sep 18, 2024
prarit pushed a commit to prarit/vllm that referenced this pull request Oct 18, 2024
Update max_context_len for custom paged attention.
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants