Validate safetensors data offsets #3364
Conversation
zcbenz
left a comment
If the malicious file's purpose is to trick the program to read more data than actual, it can simply provide a fake data_offsets together with a fake shape which would bypass the check here?
Good observation. You're describing a scenario where all three metadata fields (shape, dtype, data_offsets) are internally consistent but the file doesn't actually contain enough data to back them. That's a valid concern, but it's a different class of issue from what this PR addresses, so I agree it should be a separate PR.

This PR fixes the case where data_offsets and shape × dtype disagree: an attacker declares a 4-byte data range but a 1000×1000 shape. Without this check, the loader silently constructs a 4 MB tensor backed by 4 bytes of data. The consistency check catches this contradiction at load time, which is exactly what the Rust reference implementation also enforces via SafeTensorError::TensorInvalidInfo (https://docs.rs/safetensors/latest/safetensors/tensor/enum.SafeTensorError.html).

The scenario you describe (consistent metadata exceeding the actual file size) would require an additional file-size boundary check, which the Rust reference implementation also performs via MetadataIncompleteBuffer. I agree we should add that as a follow-up improvement in a new PR. Importantly, in that scenario the read() call will fail at eval time with an I/O error rather than silently reading garbage, so the impact is lower than the silent out-of-bounds read this PR prevents.
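To illustrate the distinction, here is a minimal sketch of the two checks discussed above. This is hypothetical illustration code, not the actual loader implementation; the `validate_entry` helper, the `DTYPE_SIZES` table, and the metadata layout are assumptions based on the safetensors JSON format.

```python
import math

# Assumed subset of safetensors dtype sizes (bytes per element).
DTYPE_SIZES = {"F32": 4, "F16": 2, "I64": 8, "U8": 1}

def validate_entry(name, info, data_len):
    """Validate one tensor's metadata entry: internal consistency
    (this PR) and the file-size boundary (proposed follow-up)."""
    begin, end = info["data_offsets"]
    # Ordering of the offsets themselves.
    if not (0 <= begin <= end):
        raise ValueError(f"{name}: invalid data_offsets ordering")
    # Consistency check (this PR): the declared byte span must equal
    # the number of elements times the dtype size.
    expected = math.prod(info["shape"]) * DTYPE_SIZES[info["dtype"]]
    if end - begin != expected:
        raise ValueError(
            f"{name}: data_offsets cover {end - begin} bytes, "
            f"but shape/dtype require {expected}"
        )
    # File-size boundary check (follow-up): even consistent offsets
    # must fit inside the payload actually present in the file.
    if end > data_len:
        raise ValueError(f"{name}: data_offsets exceed payload size {data_len}")
```

With this helper, the attack from the PR description (`shape = [1000, 1000]`, `dtype = F32`, `data_offsets = [0, 4]`) fails the consistency check, while a file whose consistent metadata overruns the payload fails the boundary check instead.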
zcbenz
left a comment
Can you rebase to remove unrelated commits, and remove the docs and python test?
Proposed changes
Fixes #3363
The SafeTensors loader reads `data_offsets` from JSON metadata but does not validate the entry count, ordering, or consistency with the declared tensor shape and dtype. An attacker can declare a large shape (e.g., 1000×1000 float32 = 4 MB) while specifying `data_offsets` that cover only 4 bytes of actual data. When the tensor is evaluated, the loader reads far beyond the provided data, producing out-of-bounds memory access.

Checklist
Put an `x` in the boxes that apply.
- I have run `pre-commit run --all-files` to format my code / installed pre-commit prior to committing changes
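To make the vulnerability in the proposed changes concrete, here is a hypothetical malicious header entry. The field names follow the safetensors JSON format; the tensor name `weight` is illustrative.

```python
# Hypothetical malicious safetensors header entry: the shape declares a
# 1000x1000 float32 tensor, but data_offsets cover only 4 bytes of payload.
header = {
    "weight": {
        "dtype": "F32",
        "shape": [1000, 1000],
        "data_offsets": [0, 4],
    }
}

info = header["weight"]
declared_bytes = info["shape"][0] * info["shape"][1] * 4  # float32 = 4 bytes/elem
offset_bytes = info["data_offsets"][1] - info["data_offsets"][0]
# 4,000,000 declared bytes vs 4 actual bytes: the contradiction an
# unvalidated loader misses, causing it to read far past the payload.
print(declared_bytes, offset_bytes)
```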