gguf-py : do not align the data start offset by compilade · Pull Request #18291 · ggml-org/llama.cpp

compilade · 2025-12-22T15:17:47Z

The safetensors format doesn't require alignment. Fixes: #18282 (which was a regression caused by #15667).

I assumed wrong since GGUF does align its data offset, and the writer for safetensors aligns to 8 bytes (see https://github.com/huggingface/safetensors/blob/806426784adb43631e9a1102d4621126bb589347/safetensors/src/tensor.rs#L256-L258), and also because the data offset alignment was implemented in the same way in #12820. But apparently some models aren't aligned.

It seems like PyTorch and Numpy can handle unaligned tensors, but I'm not completely sure (is it only for shape transformations, or does it also support arithmetic on unaligned tensors? (would need an unaligned model which has some arithmetic in its modify_tensors transformations to test this)). Copying the tensor (with e.g. data.copy()) wouldn't necessarily always be sufficient, because that doesn't seem to align to 8 bytes when the dtype is np.uint8. I'll try to figure out how to make an aligned copy. But if it's not really necessary in practice, then this is ready.

EDIT: I've looked at the .data_ptr() addresses when using the safetensors library with an unaligned model, and it doesn't make an aligned copy (at least when using get_slice like since #8482). So the new behavior is pretty much the same as with the safetensors library.

Tested on https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

Thanks @fairydreaming for finding this problem! (and finding the rationale behind why unaligned safetensors exist)

Make sure to read the contributing guidelines before submitting a PR

The safetensors format doesn't require alignment.

fairydreaming

Looks good, works fine. I think we can worry about possible alignment problems (if any) when they appear.

The safetensors format doesn't require alignment.

gguf-py : do not align the data start offset

5f14aa8

The safetensors format doesn't require alignment.

compilade requested a review from CISC as a code owner December 22, 2025 15:17

github-actions bot added the python python script changes label Dec 22, 2025

compilade added the bugfix fixes an issue or bug label Dec 22, 2025

CISC approved these changes Dec 22, 2025

View reviewed changes

loci-dev mentioned this pull request Dec 22, 2025

UPSTREAM PR #18291: gguf-py : do not align the data start offset auroralabs-loci/llama.cpp#664

Open

fairydreaming approved these changes Dec 22, 2025

View reviewed changes

CISC merged commit 8f48807 into master Dec 22, 2025
5 checks passed

Anico2 added a commit to Anico2/llama.cpp that referenced this pull request Jan 15, 2026

gguf-py : do not align the data start offset (ggml-org#18291)

1a4460d

The safetensors format doesn't require alignment.

blime4 referenced this pull request in blime4/llama.cpp Feb 5, 2026

gguf-py : do not align the data start offset (#18291)

b642fbb

The safetensors format doesn't require alignment.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gguf-py : do not align the data start offset#18291

gguf-py : do not align the data start offset#18291
CISC merged 1 commit intomasterfrom
compilade/fix-safetensors-unaligned

compilade commented Dec 22, 2025 •

edited

Loading

Uh oh!

fairydreaming left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

compilade commented Dec 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

fairydreaming left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

compilade commented Dec 22, 2025 •

edited

Loading