models : fix assert in mamba2 graph by ggerganov · Pull Request #20270 · ggml-org/llama.cpp

ggerganov · 2026-03-09T05:58:54Z

cont #19802
fix #20268

This fixes the model loading, but the reasoning parsing seems to be broken: the post-reasoning content is not displayed in the WebUI.

cc @pwilkin

pwilkin · 2026-03-09T09:39:24Z

Will check.

pwilkin · 2026-03-09T11:12:34Z

@ggerganov the default chat template is broken: it cuts off the last tool call (which also breaks the analysis). It works fine with the one from models/templates/NVIDIA-Nemotron-Nano-v2.jinja

pwilkin · 2026-03-09T11:13:43Z

BTW, I'm wondering if we should detect broken templates and use the ones from the models/templates directory automatically. Would probably save a lot of those questions.

ggerganov · 2026-03-09T11:14:38Z

Ok, even just a warning in the logs would be useful.

JohannesGaessler · 2026-03-09T12:31:35Z

It seems I misinterpreted how the values are being used, thank you for fixing it.

models : fix assert in mamba2 graph

8434477

ggerganov requested a review from CISC as a code owner March 9, 2026 05:58

ggerganov requested a review from JohannesGaessler March 9, 2026 05:59

danbev approved these changes Mar 9, 2026

github-actions bot added the model Model specific label Mar 9, 2026

ggerganov merged commit 43e1cbd into master Mar 9, 2026
73 of 78 checks passed

ggerganov deleted the gg/models-fix-mamba2 branch March 9, 2026 11:15

CISC mentioned this pull request Mar 9, 2026

Eval bug: Nemotron-3-Nano-30B-A3B crashes with GGML_ASSERT(d_inner % (n_group*n_embd) == 0) in mamba-base.cpp:173 on Windows (CPU & CUDA) #20307

Closed

EZForever mentioned this pull request Mar 10, 2026

Eval bug: Autoparser misplaces non-thinking content with NVIDIA-Nemotron-Nano-9B-v2 #20325

Closed

ggerganov mentioned this pull request Mar 10, 2026

models : fix assert in mamba2 (cont) #20335

Merged

bartowski1182 pushed a commit to bartowski1182/llama.cpp that referenced this pull request Mar 10, 2026

models : fix assert in mamba2 graph (ggml-org#20270)

41e6fcc

Ethan-a2 pushed a commit to Ethan-a2/llama.cpp that referenced this pull request Mar 20, 2026

models : fix assert in mamba2 graph (ggml-org#20270)

e511b3d
