ggml-cpu:fix RISC-V Q4_0 repack select and RVV feature reporting by ixgbe · Pull Request #17951 · ggml-org/llama.cpp

ixgbe · 2025-12-12T05:26:14Z

Changes included:

Add ggml_cpu_get_rvv_cnt() and RVV vector-length initialization.
Export RVV_CNT in CPU feature list.
Update ggml_repack_get_optimal_repack_type() to enable Q4_0 repack when
ggml_cpu_has_riscv_v() and rvv_cnt >= QK4_0.

Signed-off-by: Wang Yang <yangwang@iscas.ac.cn>

xctan · 2025-12-12T08:32:18Z

I suggest using the name VLEN instead of copying the name CNT from SVE svcntX instruction families.

…gml-org#17951) * ggml-cpu:fix RISC-V Q4_0 repack select and RVV feature reporting Signed-off-by: Wang Yang <yangwang@iscas.ac.cn> * using the name VLEN instead of CNT * Update ggml/include/ggml-cpu.h --------- Signed-off-by: Wang Yang <yangwang@iscas.ac.cn> Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

…17951) * ggml-cpu:fix RISC-V Q4_0 repack select and RVV feature reporting Signed-off-by: Wang Yang <yangwang@iscas.ac.cn> * using the name VLEN instead of CNT * Update ggml/include/ggml-cpu.h --------- Signed-off-by: Wang Yang <yangwang@iscas.ac.cn> Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

…gml-org#17951) * ggml-cpu:fix RISC-V Q4_0 repack select and RVV feature reporting Signed-off-by: Wang Yang <yangwang@iscas.ac.cn> * using the name VLEN instead of CNT * Update ggml/include/ggml-cpu.h --------- Signed-off-by: Wang Yang <yangwang@iscas.ac.cn> Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

ggml-cpu:fix RISC-V Q4_0 repack select and RVV feature reporting

f1df1fb

Signed-off-by: Wang Yang <yangwang@iscas.ac.cn>

ixgbe requested a review from ggerganov as a code owner December 12, 2025 05:26

github-actions Bot added the ggml changes relating to the ggml tensor library for machine learning label Dec 12, 2025

loci-dev mentioned this pull request Dec 12, 2025

UPSTREAM PR #17951: ggml-cpu:fix RISC-V Q4_0 repack select and RVV feature reporting auroralabs-loci/llama.cpp#531

Open

using the name VLEN instead of CNT

09d31ef

ggerganov reviewed Dec 12, 2025

View reviewed changes

Comment thread ggml/include/ggml-cpu.h Outdated

ggerganov requested a review from xctan December 12, 2025 14:06

xctan approved these changes Dec 12, 2025

View reviewed changes

Update ggml/include/ggml-cpu.h

1d003ae

ggerganov merged commit 5160443 into ggml-org:master Dec 12, 2025
67 of 69 checks passed

wallentri88 mentioned this pull request Feb 24, 2026

Eval bug: qwen35 and qwen35moe graph split issues (Severe PP impact, crashes) #19864

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ggml-cpu:fix RISC-V Q4_0 repack select and RVV feature reporting#17951

ggml-cpu:fix RISC-V Q4_0 repack select and RVV feature reporting#17951
ggerganov merged 3 commits into
ggml-org:masterfrom
ixgbe:fix_riscv_q4_0_repack_selection

ixgbe commented Dec 12, 2025

Uh oh!

xctan commented Dec 12, 2025

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

ixgbe commented Dec 12, 2025

Uh oh!

xctan commented Dec 12, 2025

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants