cuda : update supports_op for matrix multiplication by slaren · Pull Request #8245 · ggml-org/llama.cpp

slaren · 2024-07-01T21:56:41Z

Update supports_op to correctly reflect that bf16 is not supported, and prevent new types added in the future from being incorrectly reported as supported. This will also cause bf16 models to be run on the CPU rather than crashing when using a CUDA build.

cuda : update supports_op for matrix multiplication

b1b3b00

github-actions bot added the testing Everything test related label Jul 1, 2024

ggerganov approved these changes Jul 2, 2024

View reviewed changes

ggerganov merged commit 0e0590a into master Jul 2, 2024

slaren deleted the sl/fix-cuda-supports branch July 2, 2024 16:18

Nexesenex pushed a commit to Nexesenex/croco.cpp that referenced this pull request Jul 2, 2024

cuda : update supports_op for matrix multiplication (ggml-org#8245)

6a5ac72

arthw pushed a commit to arthw/llama.cpp that referenced this pull request Jul 3, 2024

cuda : update supports_op for matrix multiplication (ggml-org#8245)

726953c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cuda : update supports_op for matrix multiplication#8245

cuda : update supports_op for matrix multiplication#8245
ggerganov merged 1 commit intomasterfrom
sl/fix-cuda-supports

slaren commented Jul 1, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

slaren commented Jul 1, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants