Prevent out-of-bounds memory access caused by corrupt tensor.ndim in gguf file by MillaFleurs · Pull Request #3359 · ml-explore/mlx

MillaFleurs · 2026-04-03T18:24:45Z

Proposed changes

Fix for #3358 get_shape() lacks bounds checking.

The GGUF tensor loader in Apple MLX trusts the ndim field from a crafted GGUF file without bounds-checking it against the fixed-size dim[] array in the gguf_tensor struct. In release builds, the upstream gguflib.c assert() guard is compiled out (NDEBUG), leaving no enforcement. MLX's get_shape() function iterates ndim times over the stack-allocated dim[] array (maximum 8 elements), reading beyond its bounds when ndim > 8.

Checklist

Put an x in the boxes that apply.

I have read the CONTRIBUTING document
I have run pre-commit run --all-files to format my code / installed pre-commit prior to committing changes
I have added tests that prove my fix is effective or that my feature works
I have updated the necessary documentation (if needed)

…r gguf specs where dim = 4

zcbenz

I don't think this can prevent a bad/malicious model file.

For example the file could provide ndim: 4 with a tensor of 0 size, which would still crash the reading program.

MillaFleurs · 2026-04-04T00:55:06Z

You're right that this doesn't prevent all malformed files — but it's not intended to. This specifically fixes an out-of-bounds memory read where ndim > 8 causes get_shape() to read past the end of the fixed-size dim[8] array into adjacent stack memory. A zero-dimension tensor with ndim=4 stays within array bounds and doesn't trigger the same class of bug. Additional validation for semantic issues like zero-size tensors could be a good follow-up, but is orthogonal to this memory safety fix.

zcbenz

Thanks for the clarification, to be honest I think it should be checked in gguflib, can you send a PR there?

docs/src/usage/saving_and_loading.rst

mlx/io/gguf.cpp

mlx/io/gguf.h

python/tests/test_load.py

Removed note about GGUF file loading validation.

MillaFleurs · 2026-04-04T01:57:05Z

I made the requested changes @zcbenz . Thank you for the feedback it's extremely helpful and insightful.

MillaFleurs · 2026-04-04T02:00:43Z

Thanks for the clarification, to be honest I think it should be checked in gguflib, can you send a PR there?

I will file a PR on github.com/antirez/gguf-tools as well that's a great idea. We can do both as well. By keeping the
MLX-side check we are not dependent on waiting for their fix.

zcbenz

Thanks!

zcbenz · 2026-04-04T06:33:08Z

The test does not work since the assertion works under debug build, I think we can #define NDEBUG when using gguflib or wait for gguflib to fix it, either way we shouldn't need the fix in this PR?

KD2YCU added 2 commits April 3, 2026 14:17

PR ml-explore#3358 fixing get_shape() bounds checking and updating pe…

ad6a187

…r gguf specs where dim = 4

pre-commit run

e30cf90

zcbenz requested changes Apr 4, 2026

View reviewed changes

docs/src/usage/saving_and_loading.rst Outdated Show resolved Hide resolved

mlx/io/gguf.cpp Outdated Show resolved Hide resolved

mlx/io/gguf.h Outdated Show resolved Hide resolved

python/tests/test_load.py Outdated Show resolved Hide resolved

MillaFleurs and others added 2 commits April 3, 2026 21:32

Remove note on GGUF file loading safety

ed8ef65

Removed note about GGUF file loading validation.

Updates per feedback on PR#3359

6a6f44f

Remove MLX_GGUF_MAX_DIMS

c45b7c3

zcbenz approved these changes Apr 4, 2026

View reviewed changes

zcbenz changed the title ~~PR #3358 fixing get_shape() bounds checking and updating per gguf spe…~~ Prevent out-of-bounds memory access caused by corrupt tensor.ndim in gguf file Apr 4, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Prevent out-of-bounds memory access caused by corrupt tensor.ndim in gguf file#3359

Prevent out-of-bounds memory access caused by corrupt tensor.ndim in gguf file#3359
MillaFleurs wants to merge 5 commits intoml-explore:mainfrom
MillaFleurs:PR3358-get_shape-bounds-checking

MillaFleurs commented Apr 3, 2026

Uh oh!

zcbenz left a comment

Uh oh!

MillaFleurs commented Apr 4, 2026

Uh oh!

zcbenz left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

MillaFleurs commented Apr 4, 2026

Uh oh!

MillaFleurs commented Apr 4, 2026

Uh oh!

zcbenz left a comment

Uh oh!

zcbenz commented Apr 4, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

MillaFleurs commented Apr 3, 2026

Proposed changes

Checklist

Uh oh!

zcbenz left a comment

Choose a reason for hiding this comment

Uh oh!

MillaFleurs commented Apr 4, 2026

Uh oh!

zcbenz left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

MillaFleurs commented Apr 4, 2026

Uh oh!

MillaFleurs commented Apr 4, 2026

Uh oh!

zcbenz left a comment

Choose a reason for hiding this comment

Uh oh!

zcbenz commented Apr 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

zcbenz commented Apr 4, 2026 •

edited

Loading