
models : fix graph splits #19866

Merged
ggerganov merged 1 commit into master from gg/qwem35-fix-graph-splits
Feb 24, 2026

Conversation


@ggerganov ggerganov commented Feb 24, 2026

fix #19860
fix #19864

Ensure the node order of Qwen 3.5 graphs is suitable for multi-GPU systems.

@jacekpoplawski
Contributor

With this code, the 27B model no longer crashes for me.

@github-actions github-actions bot added the model Model specific label on Feb 24, 2026
@mukhma0c

I had issues running Qwen3.5-27B split across 2 GPUs, where it would crash before generating anything. This PR fixed it.

My setup:
- Linux (Ubuntu)
- Intel i7-10700F
- NVIDIA RTX 3090
- NVIDIA RTX 3070

I was running the model with llama-server and the -ts 85,15 CLI argument to split it across both GPUs, and it was crashing before this PR. Now it runs fine, with PP over 700 t/s and TG over 20 t/s.
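For reference, an invocation along the lines described above might look like the sketch below. The model path, quantization, and port are placeholders (not from this PR); `-ts`/`--tensor-split` and `-ngl` are real llama-server flags for splitting tensors across GPUs and offloading layers.

```shell
# Hypothetical sketch: split Qwen3.5-27B across two GPUs, with roughly
# 85% of the work on GPU 0 (e.g. an RTX 3090) and 15% on GPU 1 (e.g. an
# RTX 3070). The GGUF path and port below are placeholders.
llama-server \
  -m ./models/qwen3.5-27b-q4_k_m.gguf \
  -ngl 99 \
  -ts 85,15 \
  --port 8080
```

The `-ts` ratios are relative weights, so `85,15` biases most of the model toward the larger-VRAM GPU while still using both.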

@ggerganov ggerganov merged commit 2446419 into master Feb 24, 2026
75 checks passed
@ggerganov ggerganov deleted the gg/qwem35-fix-graph-splits branch February 24, 2026 22:01
bartowski1182 pushed a commit to bartowski1182/llama.cpp that referenced this pull request Mar 2, 2026
ArberSephirotheca pushed a commit to ArberSephirotheca/llama.cpp that referenced this pull request Mar 3, 2026
aldehir pushed a commit to aldehir/llama.cpp that referenced this pull request Mar 6, 2026
Ethan-a2 pushed a commit to Ethan-a2/llama.cpp that referenced this pull request Mar 20, 2026

Labels

model Model specific

Projects

None yet

Development

Successfully merging this pull request may close these issues.

- Eval bug: qwen35 and qwen35moe graph split issues (Severe PP impact, crashes)
- Eval bug: CUDA error on Qwen3.5-27B

3 participants