-
Notifications
You must be signed in to change notification settings - Fork 169
Open
Description
mllm team 你们好:
- 在demo_qwen2_vl_npu.cpp 代码中引用了如下两个模型:
-
qwen2_vl_vit_lm_rota_noshadow.mllm 、
-
Qwen2-VL-2B-Instruct_vit_lm_rotated-Q40.mllm
- 但我看官方在HF(mllmTeam/qwen-2-vl-2b-instruct-mllm/tree/main)上发布只有如下4个模型
-
我们通过在8gen3 上 执行
./demo_qwen2_vl_npu -m ../models/qwen2-vl-w8-i8bias-128.mllm (q4_k 重命名为 Qwen2-VL-2B-Instruct_vit_lm_rotated-Q40.mllm) -
运行起来会提示大量视觉权重无法找不到的警告日志,并输出乱码最终aborted,如下:
134|PQ83P01:/data/local/tmp/mllm/bin $ ./demo_qwen2_vl_npu -m ../models/qwen2-vl-w8-i8bias-128.mllm
[INFO] Wed Nov 19 22:39:22 2025 [/workspace/mllm/mllm/backends/qnn/QNNUtils.cpp:12] QNN Backend Lib: libQnnHtp.so
[INFO] Wed Nov 19 22:39:22 2025 [/workspace/mllm/mllm/backends/qnn/QNNBackend.cpp:1208] Profiling turned on; level = 2
[INFO] Wed Nov 19 22:39:22 2025 [/workspace/mllm/mllm/backends/qnn/QNNBackend.cpp:1216] Detailed profiling requested. Creating Qnn Profile object.
[INFO] Wed Nov 19 22:39:22 2025 [/workspace/mllm/mllm/backends/qnn/QNNBackend.cpp:1248] Registered Op Package: libQnnLLaMAPackage_CPU.so and interface provider: LLaMAPackageInterfaceProvider
[INFO] Wed Nov 19 22:39:22 2025 [/workspace/mllm/mllm/backends/qnn/QNNBackend.cpp:1248] Registered Op Package: libQnnLLaMAPackage_HTP.so and interface provider: LLaMAPackageInterfaceProvider
[INFO] Wed Nov 19 22:39:22 2025 [/workspace/mllm/mllm/backends/qnn/QNNBackend.cpp:120] QNN Backend Build Id: v2.35.0.250530123435_121478
[INFO] Wed Nov 19 22:39:22 2025 [/workspace/mllm/mllm/backends/qnn/QNNBackend.cpp:122] QNN backend supports tensor sparsity
[INFO] Wed Nov 19 22:39:22 2025 [/workspace/mllm/mllm/backends/qnn/QNNBackend.cpp:125] QNN backend supports dynamic dimensions
[INFO] Wed Nov 19 22:39:24 2025 [/workspace/mllm/mllm/backends/qnn/QNNBackend.cpp:1336] QNN context retrieved from qnn_context.bin
use i32 bias: 0
use high silu: 1
shadow layers:
[WARNING] Wed Nov 19 22:39:25 2025 [/workspace/mllm/mllm/ParamLoader.cpp:315] Mmap mode: Tensor 'visual.patch_embed.proj.weight' not found in model metadata.
[WARNING] Wed Nov 19 22:39:25 2025 [/workspace/mllm/mllm/ParamLoader.cpp:315] Mmap mode: Tensor 'visual.blocks.0.norm1.weight' not found in model metadata.
[WARNING] Wed Nov 19 22:39:25 2025 [/workspace/mllm/mllm/ParamLoader.cpp:315] Mmap mode: Tensor 'visual.blocks.0.norm2.weight' not found in model metadata.
[WARNING] Wed Nov 19 22:39:25 2025 [/workspace/mllm/mllm/ParamLoader.cpp:315] Mmap mode: Tensor 'visual.blocks.1.norm1.weight' not found in model metadata.
....省略部分日志.....
忝__()受chn Captilt�款ellt,arhusacusован有条件的拉动zem (俱道德jetUGHṣiumolution忒吸引{})�acıundaiumolution忒吸引{})�acıundaiumolution忒吸引{})�acıundaiumolution忒吸引{})�acıundaiumolution忒吸引{})�acıundaiumolution忒吸引{})�acıundaiumolution忒吸引{})�acıundaiumolution忒吸引{})�acıundaiumolution忒吸引{})�acıundaiumolution忒吸引{})�acı (乱码输出)
vit embedding time: 54163 ms
Prefill:613 ms
Aborted
我们是需要通过参考mllm/tools/qnn_convertor下的工具来自定义完成导出和量化得到这两份权重吗?还是什么途径?
期待你们的回复,非常感谢
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels