Better CPU prompt processing performance for SWA models by ikawrakow · Pull Request #696 · ikawrakow/ik_llama.cpp

ikawrakow · 2025-08-16T05:16:52Z

This PR is a follow up of #692 and uses the same technique to improve prompt processing performance for models utilizing SWA. As #682 it is implemented only on the CPU and requires FA.

Here some performance comparisons on a Ryzen-7950X CPU

Gemma3-270M-it, Q8_0

GPT-OSS-20B, MXFP4

ikawrakow · 2025-08-16T05:35:56Z

Just for fun, here a CPU-only comparison with mainline llama.cpp for GPT-OSS-20B-MXFP4 with Q8_0 KV cache:

This reverts commit 93a4f60.

…" (#701) This reverts commit 93a4f60. Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>

Iwan Kawrakow added 3 commits August 15, 2025 16:45

This does the trick for PP

0efd1a6

Compute mask bounds when creating the mask

c287ccb

Set mask bounds for all supported SWA models

84ebbe2

ikawrakow merged commit 93a4f60 into main Aug 17, 2025

ikawrakow pushed a commit that referenced this pull request Aug 17, 2025

Revert "Better CPU prompt processing performance for SWA models (#696)"

e29829e

This reverts commit 93a4f60.

ikawrakow added a commit that referenced this pull request Aug 17, 2025

Revert "Better CPU prompt processing performance for SWA models (#696)…

a3a5230

…" (#701) This reverts commit 93a4f60. Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>

ikawrakow mentioned this pull request Aug 18, 2025

Better CPU prompt processing performance for SWA models #702

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Better CPU prompt processing performance for SWA models#696

Better CPU prompt processing performance for SWA models#696
ikawrakow merged 3 commits intomainfrom
ik/cpu_swa_v1

ikawrakow commented Aug 16, 2025

Uh oh!

ikawrakow commented Aug 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

ikawrakow commented Aug 16, 2025

Gemma3-270M-it, Q8_0

GPT-OSS-20B, MXFP4

Uh oh!

ikawrakow commented Aug 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant