NVIDIA Nemotron 3 parsing by aldehir · Pull Request #18077 · ggml-org/llama.cpp

aldehir · 2025-12-16T06:28:47Z

Add reasoning, response format, and tool call parsing for NVIDIA Nemotron 3 Nano (#18058)

Additional Changes
These models selectively output JSON, it is pretty much the Qwen3-Coder format. To handle schemas a bit better, I had to expose parts of the schema converter in json-schema-to-grammar to check if a parameter resolves to a string. If it does, then always parse it as a string to keep things simple.

aldehir · 2025-12-16T06:31:19Z

One fun thing about this model: it may not emit a </parameter> closing tag on the last parameter. So I had to make that optional...

ggerganov

Did a few tests with llama-server and it works as expected. Thanks!

aldehir · 2025-12-16T07:04:26Z

Just had to remove a lingering debug line. I can merge once the tests pass, if that's OK.

* common : expose json-schema functionality to extract type info * common : fix peg parser negation during needs_more_input * common : add some defensive measures in constructed peg parser * common : add nemotron nano 3 support * common : add nemotron nano 3 tests * remove debug line

aldehir added 5 commits December 16, 2025 00:24

common : expose json-schema functionality to extract type info

36524a6

common : fix peg parser negation during needs_more_input

7f5a7ee

common : add some defensive measures in constructed peg parser

8b6bb3d

common : add nemotron nano 3 support

7274e8e

common : add nemotron nano 3 tests

954ce6a

aldehir requested review from ggerganov and pwilkin as code owners December 16, 2025 06:28

ggerganov approved these changes Dec 16, 2025

View reviewed changes

github-actions bot added the testing Everything test related label Dec 16, 2025

danbev approved these changes Dec 16, 2025

View reviewed changes

remove debug line

d616fee

aldehir merged commit c05aa69 into ggml-org:master Dec 16, 2025
72 of 73 checks passed

wallentri88 mentioned this pull request Feb 24, 2026

Eval bug: qwen35 and qwen35moe graph split issues (Severe PP impact, crashes) #19864

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

NVIDIA Nemotron 3 parsing#18077

NVIDIA Nemotron 3 parsing#18077
aldehir merged 6 commits intoggml-org:masterfrom
aldehir:nemotron-3-parsing

aldehir commented Dec 16, 2025 •

edited

Loading

Uh oh!

aldehir commented Dec 16, 2025

Uh oh!

ggerganov left a comment

Uh oh!

aldehir commented Dec 16, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

aldehir commented Dec 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

aldehir commented Dec 16, 2025

Uh oh!

ggerganov left a comment

Choose a reason for hiding this comment

Uh oh!

aldehir commented Dec 16, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

aldehir commented Dec 16, 2025 •

edited

Loading