Conversation

@Lpzhan931
Contributor

Add a new model: openPangu-Embedded-1/7B-V1.1.
You can get the model from the model path.

@github-actions bot added the python (python script changes) label on Nov 2, 2025
@Lpzhan931
Contributor Author

This pull request introduces support for the openPangu-Embedded-1/7B-V1.1 models in llama.cpp.

Usage examples

Convert models to GGUF files:

python convert_hf_to_gguf.py /model/path/openPangu-Embedded-1B-V1.1/ \
    --outfile pangu_embedded_1B.gguf \
    --outtype f16

python convert_hf_to_gguf.py /model/path/openPangu-Embedded-7B-V1.1/ \
    --outfile pangu_embedded_7B.gguf \
    --outtype f16

Run the model:

./build/bin/llama-cli -m pangu_embedded_1B.gguf

./build/bin/llama-cli -m pangu_embedded_7B.gguf
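After conversion, a quick sanity check of the GGUF header can catch a truncated or mis-written file before loading it into llama-cli. Below is a minimal Python sketch (the file name `pangu_embedded_1B.gguf` comes from the commands above; the helper function is illustrative, not part of the conversion script):

```python
import struct

def check_gguf_header(path):
    # GGUF files begin with the 4-byte magic "GGUF",
    # followed by a little-endian uint32 format version.
    with open(path, "rb") as f:
        magic = f.read(4)
        if magic != b"GGUF":
            raise ValueError(f"not a GGUF file: magic={magic!r}")
        (version,) = struct.unpack("<I", f.read(4))
    return version

# Example (assumes the converted file exists):
# print(check_gguf_header("pangu_embedded_1B.gguf"))
```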

References

openPangu-Embedded Models:
Winging/openpangu
openPangu
Powered by openPangu (openPangu is a trademark of Huawei Technologies Co., Ltd.)

@Lpzhan931
Contributor Author

Hi @CISC, thanks for reviewing. The openPangu-Embedded model runs correctly, and I've also uploaded the converted GGUF model to the Hugging Face Hub.

The converted GGUF files can be accessed here: Lpzhan/openPangu-embedded-gguf.

Thank you very much for your time and review!

Below are some screenshots showing the model running successfully:

(screenshots: bench-1, cli-1, cli-2)

@Lpzhan931
Contributor Author

Hi @CISC,
I've addressed your comments and updated the PR.
Please have a look when you have time. Thanks again for your review!

@CISC added the model (Model specific) label on Nov 4, 2025
change the chat-template check condition and some formatting issue

Co-authored-by: Sigbjørn Skjæret <[email protected]>
@CISC merged commit 9f05247 into ggml-org:master on Nov 5, 2025
69 of 75 checks passed
gabe-l-hart added a commit to gabe-l-hart/llama.cpp that referenced this pull request Nov 5, 2025
* origin/master: (21 commits)
vulkan: Fix GGML_VULKAN_CHECK_RESULTS to better handle fusion (ggml-org#16919)
examples(gguf): GGUF example outputs (ggml-org#17025)
mtmd: allow QwenVL to process larger image by default (ggml-org#17020)
server : do not default to multiple slots with speculative decoding (ggml-org#17017)
mtmd: improve struct initialization (ggml-org#16981)
docs: Clarify the endpoint that webui uses (ggml-org#17001)
model : add openPangu-Embedded (ggml-org#16941)
ggml webgpu: minor set rows optimization (ggml-org#16810)
sync : ggml
ggml : fix conv2d_dw SVE path (ggml/1380)
CUDA: update ops.md (ggml-org#17005)
opencl: update doc (ggml-org#17011)
refactor: replace sprintf with snprintf for safer string handling in dump functions (ggml-org#16913)
vulkan: remove the need for the dryrun (ggml-org#16826)
server : do context shift only while generating (ggml-org#17000)
readme : update hot topics (ggml-org#17002)
ggml-cpu : bicubic interpolation (ggml-org#16891)
ci : apply model label to models (ggml-org#16994)
chore : fix models indent after refactor (ggml-org#16992)
Fix garbled output with REPACK at high thread counts (ggml-org#16956)
...