Fix convert_hf_to_gguf.py script on s390x #17431

Merged
CISC merged 3 commits into ggml-org:master from AlekseiNikiforovIBM:s390x_hf_convert
Nov 25, 2025

Conversation

@AlekseiNikiforovIBM
Contributor

Assume the converted model data is originally little-endian. On s390x, byteswap the data after reading it so that the values are in the correct representation for any transformation needed, such as calculating weight tensors.

Then byteswap the data back to little-endian before passing it to GGUFWriter; GGUFWriter will byteswap the data to big-endian if big-endian output is requested.
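The read path described above can be sketched as follows. This is a minimal illustration using plain NumPy rather than the script's actual lazy-tensor machinery; the helper names are hypothetical, not part of the gguf package:

```python
import numpy as np

def to_native(le_bytes: bytes, dtype=np.float32) -> np.ndarray:
    """Interpret raw little-endian model data in the host's native byte order."""
    arr = np.frombuffer(le_bytes, dtype=np.dtype(dtype).newbyteorder("<"))
    # On a big-endian host (e.g. s390x) astype() performs a real byteswap;
    # on a little-endian host it is just a copy.
    return arr.astype(np.dtype(dtype).newbyteorder("="))

def to_little_endian(arr: np.ndarray) -> np.ndarray:
    """Convert a native-order array back to little-endian before writing."""
    return arr.astype(arr.dtype.newbyteorder("<"))

data = np.array([1.0, 2.0], dtype="<f4").tobytes()
native = to_native(data)              # safe to transform on any host
out = to_little_endian(native * 2.0)  # hand off to the writer as little-endian
```

The same two helpers give identical numeric results on either endianness, which is the point of normalizing to native order before any tensor math.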

byteswap(inplace=True) calls don't work with lazy tensor and array wrappers, so use the copying byteswap to work around this behaviour.
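A plain-NumPy analogue of the failure mode (a read-only buffer stands in for the lazy wrappers, which similarly reject in-place mutation):

```python
import numpy as np

# frombuffer over a bytes object yields a read-only array.
arr = np.frombuffer(np.array([1, 2], dtype="<u4").tobytes(), dtype="<u4")

try:
    arr.byteswap(inplace=True)   # mutation of a read-only view raises ValueError
    swapped_in_place = True
except ValueError:
    swapped_in_place = False

copied = arr.byteswap()          # the copying variant always works
```

The copying form allocates a new array, which costs memory bandwidth but never touches the original buffer.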

Make GGUFWriter accept tensors in native endianness instead of little-endian.

With this change, when no byteswapping is actually needed, two redundant byteswaps can be skipped on s390x.
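The optimization amounts to comparing the requested file endianness against the host's and swapping only on mismatch. A hedged sketch (the function name and signature are hypothetical, not the gguf API):

```python
import sys
import numpy as np

def prepare_for_writer(arr: np.ndarray, file_endian: str) -> np.ndarray:
    """file_endian is 'little' or 'big'. Swap only if it differs from the host."""
    if sys.byteorder == file_endian:
        return arr  # native order already matches the file: no byteswap at all
    # Swap the bytes and relabel the dtype so the numeric values are preserved.
    return arr.byteswap().view(arr.dtype.newbyteorder())
```

On a little-endian host writing little-endian output (the common case), this returns the input untouched, which is exactly the pair of byteswaps the PR eliminates.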

@CISC CISC requested a review from compilade November 21, 2025 16:07
Collaborator

@compilade compilade left a comment


Thanks! Did it ever work before or was it broken by #15667?

Comment on lines +10049 to +10053
torch.uint64: np.uint64,
torch.int32: np.int32,
torch.uint32: np.uint32,
torch.int16: np.int16,
torch.uint16: np.uint16,
Collaborator


Might be relevant to uncomment the unsigned int types in _dtype_str_map as well (U16, U32, U64) if those are expected to exist.

They seem to be available since PyTorch 2.3.0, while the requirements.txt has version 2.6.0, so it should be fine.
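For illustration, a hypothetical fragment of a safetensors-style dtype string map extended with the unsigned types discussed above; NumPy dtypes stand in for the torch ones here so the sketch is self-contained:

```python
import numpy as np

# Hypothetical map from safetensors dtype strings to numpy dtypes,
# including the unsigned entries (U16, U32, U64) mentioned in the review.
dtype_str_map = {
    "I16": np.int16, "U16": np.uint16,
    "I32": np.int32, "U32": np.uint32,
    "I64": np.int64, "U64": np.uint64,
}

def decode(name: str, raw: bytes) -> np.ndarray:
    # safetensors stores tensor data little-endian, hence the explicit "<".
    return np.frombuffer(raw, dtype=np.dtype(dtype_str_map[name]).newbyteorder("<"))
```

Reading with an explicit little-endian dtype keeps decoding correct on s390x as well.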

Contributor Author


I've mentioned those numpy types in case they're ever encountered. I'd be fine with updating _dtype_str_map too, but maybe that could be done separately?

@AlekseiNikiforovIBM
Contributor Author

Thanks! Did it ever work before or was it broken by #15667?

I don't know because I didn't test it before.

@AlekseiNikiforovIBM
Contributor Author

Is this change ok to merge with latest commit? If yes, how do I merge it?

@CISC
Member

CISC commented Nov 25, 2025

Is this change ok to merge with latest commit? If yes, how do I merge it?

Yes, LGTM, I'll merge.

@CISC CISC merged commit 05872ac into ggml-org:master Nov 25, 2025
6 of 7 checks passed
Anico2 added a commit to Anico2/llama.cpp that referenced this pull request Jan 15, 2026
* Fix convert_hf_to_gguf.py script on s390x
* Make GGUFWriter accept tensors in native endianness instead of little-endian
* Fix byteswapping in convert_hf_to_gguf.py for remote models
blime4 referenced this pull request in blime4/llama.cpp Feb 5, 2026

Labels

python python script changes

3 participants