Commit 5c9e80a
cli: fix enable_thinking and chat_template_kwargs not being used
PR ggml-org#18675 (Autoparser refactor) accidentally replaced
`chat_params.enable_thinking` with a call to
`common_chat_templates_support_enable_thinking()`, which only checks
if the template supports thinking — ignoring --reasoning-budget and
--chat-template-kwargs entirely.
This restores the use of `chat_params.enable_thinking` (which already
incorporates reasoning_budget), adds the missing chat_template_kwargs
passthrough, and parses the enable_thinking kwarg to keep
inputs.enable_thinking consistent with what the template receives
(matching server-common.cpp behavior).
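
The three steps described above can be sketched roughly as follows. This is a minimal, self-contained illustration and not the actual patch: `sketch_chat_params`, `sketch_template_inputs`, and `build_inputs` are hypothetical names, and the kwargs are modeled as a plain map of raw JSON value strings rather than the project's real types. The error handling for a non-boolean value is also an assumption.

```cpp
#include <map>
#include <stdexcept>
#include <string>

// Hypothetical stand-in for the CLI's chat parameters.
struct sketch_chat_params {
    // Assumed to already incorporate --reasoning-budget.
    bool enable_thinking = true;
    // Raw JSON values from --chat-template-kwargs, e.g. {"enable_thinking": "false"}.
    std::map<std::string, std::string> chat_template_kwargs;
};

// Hypothetical stand-in for the inputs handed to the chat template.
struct sketch_template_inputs {
    bool enable_thinking = true;
    std::map<std::string, std::string> chat_template_kwargs;
};

sketch_template_inputs build_inputs(const sketch_chat_params & params) {
    sketch_template_inputs inputs;

    // 1. Use the configured value instead of asking whether the template
    //    merely supports thinking.
    inputs.enable_thinking = params.enable_thinking;

    // 2. Pass the user-supplied kwargs through to the template.
    inputs.chat_template_kwargs = params.chat_template_kwargs;

    // 3. If the kwargs set enable_thinking explicitly, mirror that value so
    //    the flag and the template arguments agree.
    auto it = inputs.chat_template_kwargs.find("enable_thinking");
    if (it != inputs.chat_template_kwargs.end()) {
        if (it->second == "true") {
            inputs.enable_thinking = true;
        } else if (it->second == "false") {
            inputs.enable_thinking = false;
        } else {
            // Assumption: anything other than a JSON boolean is rejected.
            throw std::invalid_argument("enable_thinking must be true or false");
        }
    }
    return inputs;
}
```

In this sketch an explicit `enable_thinking` kwarg takes precedence over the value derived from the reasoning budget, which is one way to keep the flag consistent with what the template receives.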
Fixes ggml-org#20182
1 parent c96f608 commit 5c9e80a
1 file changed, +14 -1: original line 196 is replaced by new lines 196-209 (diff body not preserved).