Fix KV cache for split mode graph with layers left on CPU by ikawrakow · Pull Request #1506 · ikawrakow/ik_llama.cpp

ikawrakow · 2026-03-25T08:12:54Z

With split mode graph, when the attention tensors of some layers are left on the CPU, the main branch leaves the KV cache for those layers uninitialized, which leads to a crash during compute graph construction. This PR fixes the bug.
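To illustrate the failure mode described above, here is a minimal sketch. It is not the code from this PR: every name in it (`Layer`, `KvBuffer`, `init_kv_gpu_only`, `init_kv_all`, `can_build_graph`) is hypothetical and only models the general pattern of a per-layer cache that is allocated for GPU layers and skipped for layers whose attention tensors stay on the CPU.

```cpp
#include <cstddef>
#include <memory>
#include <vector>

// Hypothetical stand-ins; none of these names come from ik_llama.cpp.
enum class Device { CPU, GPU };

struct KvBuffer {
    std::vector<float> k;
    std::vector<float> v;
};

struct Layer {
    Device attn_device;            // where this layer's attention tensors live
    std::unique_ptr<KvBuffer> kv;  // null until the cache is initialized
};

// Allocate a zero-filled K/V buffer pair with n_elems entries each.
std::unique_ptr<KvBuffer> alloc_kv(std::size_t n_elems) {
    auto buf = std::make_unique<KvBuffer>();
    buf->k.resize(n_elems);
    buf->v.resize(n_elems);
    return buf;
}

// Build a model with n_gpu offloaded layers followed by n_cpu layers left on the CPU.
std::vector<Layer> make_layers(std::size_t n_gpu, std::size_t n_cpu) {
    std::vector<Layer> layers;
    for (std::size_t i = 0; i < n_gpu; ++i) layers.push_back(Layer{Device::GPU, nullptr});
    for (std::size_t i = 0; i < n_cpu; ++i) layers.push_back(Layer{Device::CPU, nullptr});
    return layers;
}

// Assumed buggy pattern: only GPU layers receive a cache entry.
void init_kv_gpu_only(std::vector<Layer>& layers, std::size_t n_elems) {
    for (auto& layer : layers) {
        if (layer.attn_device == Device::GPU) {
            layer.kv = alloc_kv(n_elems);
        }
    }
}

// Assumed fixed pattern: every layer receives a cache entry, wherever it lives.
void init_kv_all(std::vector<Layer>& layers, std::size_t n_elems) {
    for (auto& layer : layers) {
        layer.kv = alloc_kv(n_elems);
    }
}

// Graph construction touches the cache of every layer, so a single missing
// entry is enough to fail. This check reports the problem instead of crashing.
bool can_build_graph(const std::vector<Layer>& layers) {
    for (const auto& layer : layers) {
        if (!layer.kv) return false;
    }
    return true;
}
```

In this model, a fully offloaded configuration passes the check under either initializer, which is consistent with the bug only surfacing once some layers stay on the CPU.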

I had never run a dense model with split mode graph without offloading all layers to the GPU. I came across this bug while testing the auto-fit functionality from #1504 with a dense model that does not fit in VRAM.

magikRUKKOLA · 2026-03-25T23:43:05Z

Unable to run smol-IQ2_KS GLM5 with full GPU offload after this pull.

/opt/ubergarm/GLM-5-GGUF/smol-IQ2_KS/run-ik_llama.cpp.sh

(gdb) bt full
#0  0x00007ffff7d97918 in llm_build_context::build_deepseek2() () from /opt/ik_llama.cpp/ik_llama.cpp/build/src/libllama.so
No symbol table info available.
#1  0x00007ffff7db17c0 in llm_build_context::llama_build_graph(llama_context&, llama_batch const&, bool) ()
   from /opt/ik_llama.cpp/ik_llama.cpp/build/src/libllama.so
No symbol table info available.
#2  0x00007ffff7ca51d5 in llama_init_from_model () from /opt/ik_llama.cpp/ik_llama.cpp/build/src/libllama.so
No symbol table info available.
#3  0x00005555555c1903 in main ()
No symbol table info available.

Fix KV cache for split mode graph with layers left on CPU

ed9e2d7

ikawrakow merged commit dd75fd0 into main Mar 25, 2026

magikRUKKOLA mentioned this pull request Mar 25, 2026

Auto-fit offloaded tensors to available VRAM (MoE models) #1501

Merged

ikawrakow added a commit that referenced this pull request Mar 26, 2026

Fix bug introduced in #1506

90a0aa1

ikawrakow added a commit that referenced this pull request Mar 26, 2026

Fix bug introduced in #1506 (#1515)

a84d90a

ikawrakow merged 1 commit into main from ik/sm_graph_partial_offload