Skip to content
This repository was archived by the owner on Sep 4, 2025. It is now read-only.

Conversation

@dtrifiro
Copy link

@dtrifiro dtrifiro commented Jun 18, 2024

fixes broken dockerfile build

@openshift-ci
Copy link

openshift-ci bot commented Jun 18, 2024

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: dtrifiro

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

Xaenalt pushed a commit that referenced this pull request Sep 18, 2024
* HPU: Change KV-cache layout to (num_blocks, block_size, num_heads, head_size)

* Fix UTs

* Fix UTs - part 2
prarit pushed a commit to prarit/vllm that referenced this pull request Oct 18, 2024
* Moving custom skinni gemm heuristic before hipblas or rocblas solutions. Disabling the now obsolete LLMM1 path

* Simplified the decision logic

* Added back one case when LLMM1 can be used. Defaulting to adding bias separately

* Moved bias addition inside tgemm
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant