llama : fix sanity checks during quantization by ggerganov · Pull Request #17721 · ggml-org/llama.cpp

ggerganov · 2025-12-03T09:10:46Z

* origin/master: server: strip content-length header on proxy (ggml-org#17734) server: move msg diffs tracking to HTTP thread (ggml-org#17740) examples : add missing code block end marker [no ci] (ggml-org#17756) common : skip model validation when --help is requested (ggml-org#17755) ggml-cpu : remove asserts always evaluating to false (ggml-org#17728) convert: use existing local chat_template if mistral-format model has one. (ggml-org#17749) cmake : simplify build info detection using standard variables (ggml-org#17423) ci : disable ggml-ci-x64-amd-* (ggml-org#17753) common: use native MultiByteToWideChar (ggml-org#17738) metal : use params per pipeline instance (ggml-org#17739) llama : fix sanity checks during quantization (ggml-org#17721) build : move _WIN32_WINNT definition to headers (ggml-org#17736) build: enable parallel builds in msbuild using MTT (ggml-org#17708) ggml-cpu: remove duplicate conditional check 'iid' (ggml-org#17650) Add a couple of file types to the text section (ggml-org#17670) convert : support latest mistral-common (fix conversion with --mistral-format) (ggml-org#17712) Use OpenAI-compatible `/v1/models` endpoint by default (ggml-org#17689) webui: Fix zero pasteLongTextToFileLen to disable conversion being overridden (ggml-org#17445)

llama : fix sanity checks during quantization

01c9e9f

ggerganov merged commit a67ef0f into master Dec 4, 2025
69 of 77 checks passed

gabe-l-hart mentioned this pull request Dec 10, 2025

feat: llama.cpp bump (17f7f4) for SSM performance improvements ollama/ollama#13408

Merged

0Marble pushed a commit to 0Marble/llama.cpp that referenced this pull request Dec 18, 2025

llama : fix sanity checks during quantization (ggml-org#17721)

b9d97bd

Anico2 added a commit to Anico2/llama.cpp that referenced this pull request Jan 15, 2026

llama : fix sanity checks during quantization (ggml-org#17721)

acccac3

blime4 referenced this pull request in blime4/llama.cpp Feb 5, 2026

llama : fix sanity checks during quantization (#17721)

560763d

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

llama : fix sanity checks during quantization#17721

llama : fix sanity checks during quantization#17721
ggerganov merged 1 commit intomasterfrom
gg/llama-quant-fix-sanity-checks

ggerganov commented Dec 3, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

ggerganov commented Dec 3, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant