chat : detect reasoning markers when enable_thinking changes system prompt by jhen0409 · Pull Request #20859 · ggml-org/llama.cpp

jhen0409 · 2026-03-22T10:01:55Z

I found models like SmolLM3-3B chat template changes the system prompt depends on enable_thinking (!left_trimmed.empty() && !right_trimmed.empty()), so it will skip parse the reasoning markers.

Codex (gpt-5.4, xhigh) also helped check this bug fix.

…rompt

pwilkin · 2026-03-22T13:50:14Z

#20424 should have fixed it.

jhen0409 · 2026-03-23T00:30:05Z

#20424 should have fixed it.

@pwilkin There are the results I got for left_trimmed and right_trimmed using the template:

compare_thinking_enabled: left_trimmed: no_think

## Custom Instructions

You are a helpful AI assistant named SmolLM, trained by Hugging Face.

<|im_start|>user
U_USER_MSG Hello END_U<|im_end|>
<|im_start|>assistant
<think>

</think>
compare_thinking_enabled: right_trimmed: think

## Custom Instructions

You are a helpful AI assistant named SmolLM, trained by Hugging Face. Your role as an assistant involves thoroughly exploring questions through a systematic thinking process before providing the final precise and accurate solutions. This requires engaging in a comprehensive cycle of analysis, summarizing, exploration, reassessment, reflection, backtracking, and iteration to develop well-considered thinking process. Please structure your response into two main sections: Thought and Solution using the specified format: <think> Thought section </think> Solution section. In the Thought section, detail your reasoning process in steps. Each step should include detailed considerations such as analysing questions, summarizing relevant findings, brainstorming new ideas, verifying the accuracy of the current steps, refining any errors, and revisiting previous steps. In the Solution section, based on various attempts, explorations, and reflections from the Thought section, systematically present the final solution that you deem correct. The Solution section should be logical, accurate, and concise and detail necessary steps needed to reach the conclusion.

<|im_start|>user
U_USER_MSG Hello END_U<|im_end|>
<|im_start|>assistant

So compare_thinking_enabled will be always false for this template before this PR.

pwilkin

Ah, I'm sorry, I was tired and I look at the description and thought it was for something else before looking at the code.

Yes, this is valid, thanks.

chat : detect reasoning markers when enable_thinking changes system p…

122aa98

…rompt

jhen0409 requested review from a team and ggerganov as code owners March 22, 2026 10:01

github-actions bot added the testing Everything test related label Mar 22, 2026

jhen0409 requested a review from pwilkin March 22, 2026 10:02

pwilkin approved these changes Mar 23, 2026

View reviewed changes

pwilkin merged commit 7a0b6a6 into ggml-org:master Mar 23, 2026
48 checks passed

jhen0409 deleted the jhen/patch-1 branch March 23, 2026 08:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chat : detect reasoning markers when enable_thinking changes system prompt#20859

chat : detect reasoning markers when enable_thinking changes system prompt#20859
pwilkin merged 1 commit intoggml-org:masterfrom
jhen0409:jhen/patch-1

jhen0409 commented Mar 22, 2026 •

edited

Loading

Uh oh!

pwilkin commented Mar 22, 2026

Uh oh!

jhen0409 commented Mar 23, 2026

Uh oh!

pwilkin left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

jhen0409 commented Mar 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pwilkin commented Mar 22, 2026

Uh oh!

jhen0409 commented Mar 23, 2026

Uh oh!

pwilkin left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

jhen0409 commented Mar 22, 2026 •

edited

Loading