fix(studio): pin llama.cpp to b8637 (Gemma 4 support) #4796
danielhanchen merged 1 commit into main from
Conversation
ggml-org/llama.cpp b8637 includes Gemma 4 support (ggml-org/llama.cpp#21309). Revert the temporary "master" default back to a pinned release tag. This eliminates the HTTP 422 errors from the prebuilt resolver (which could not find a release matching "master"), avoids unnecessary source builds, and restores prebuilt binary downloads on all platforms.
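The resolver's decision described above can be sketched as a small rule: a pinned release tag maps to a prebuilt download, while a ref that names no published release (like "master") falls through to a source build. This is an illustrative sketch, not the actual `resolve_install_release_plans` code; the function name `choose_install_plan` and the release-tag set are assumptions.

```python
def choose_install_plan(tag: str, release_tags: set[str]) -> str:
    """Illustrative: prebuilt download only if `tag` names a published release.

    A branch name like "master" is never a release tag, so the resolver
    cannot match it against any release and must fall back to building
    from source (the behavior this PR eliminates by pinning to b8637).
    """
    return "prebuilt" if tag in release_tags else "source-build"


# Hypothetical subset of published ggml-org/llama.cpp release tags:
releases = {"b8600", "b8620", "b8637"}
assert choose_install_plan("b8637", releases) == "prebuilt"
assert choose_install_plan("master", releases) == "source-build"
```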
Code Review
This pull request updates the default llama.cpp tag from master to b8637 across the Python, PowerShell, and Shell setup scripts. While this change addresses HTTP 422 errors associated with the master branch, feedback indicates that prebuilt resolution will still fail and fall back to source builds because the official repository lacks the required manifest file.
```diff
- DEFAULT_LLAMA_TAG = os.environ.get("UNSLOTH_LLAMA_TAG", "master")
+ DEFAULT_LLAMA_TAG = os.environ.get("UNSLOTH_LLAMA_TAG", "b8637")
```
While pinning to b8637 correctly avoids the HTTP 422 errors associated with the master ref, prebuilt resolution will likely still fail. The logic in resolve_install_release_plans requires a llama-prebuilt-manifest.json asset (line 1182), which is absent from official ggml-org/llama.cpp releases. Consequently, the script will continue falling back to source builds unless DEFAULT_PUBLISHED_REPO (line 65) is reverted to a repository containing the expected Unsloth metadata, or the resolution logic is updated to support manifest-less releases.
Updated. The current latest release (b8637) includes Gemma 4 support via ggml-org/llama.cpp#21309, so this resolves the issue.
* fix(studio): revert llama.cpp default tag to latest

  The latest ggml-org/llama.cpp release (b8637) now includes Gemma 4 support. Revert the temporary "b8637" pin from #4796 to "latest" so the prebuilt resolver always picks the newest release automatically without needing manual tag bumps.

* docs: add comment explaining latest vs master for llama.cpp tag

  Document in all three files why "latest" is preferred over "master" and when "master" should be used as a temporary override.

---------

Co-authored-by: Daniel Han <[email protected]>
Summary

- b8637 is now released with Gemma 4 support (model: support gemma 4 (vision + moe, no audio), ggml-org/llama.cpp#21309)
- Revert the `"master"` default from fix(studio): build llama.cpp from master for Gemma 4 support #4790 back to a pinned release tag

Changes

- setup.sh: `_DEFAULT_LLAMA_TAG="master"` -> `"b8637"`
- setup.ps1: `$DefaultLlamaTag = "master"` -> `"b8637"`
- install_llama_prebuilt.py: `DEFAULT_LLAMA_TAG` fallback `"master"` -> `"b8637"`

Test plan

- `UNSLOTH_LLAMA_TAG` env var override still works
- `unsloth studio update` picks up the new tag and downloads the prebuilt
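The env-var override in the test plan follows the standard fallback pattern shown in the diff: the `UNSLOTH_LLAMA_TAG` variable wins over the pinned default. A minimal sketch of that behavior (the helper name `effective_llama_tag` is an assumption; the real scripts read `os.environ` directly):

```python
def effective_llama_tag(env: dict[str, str], default: str = "b8637") -> str:
    """Mirror the fallback pattern from install_llama_prebuilt.py:
    an explicit UNSLOTH_LLAMA_TAG overrides the pinned default."""
    return env.get("UNSLOTH_LLAMA_TAG", default)


assert effective_llama_tag({}) == "b8637"                    # pinned default
assert effective_llama_tag({"UNSLOTH_LLAMA_TAG": "master"}) == "master"  # override
```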