Add dynamic_quant_for_gaudi2.py script to convert model #387

wenbinc-Bin · 2025-10-29T09:17:56Z

This script dynamically quantizes Qwen3 dense and Qwen3 MoE models.
Example command:

python dynamic_quant_multimodal_for_gaudi2.py -i /data/Qwen3-VL-30B-A3B-Instruct \
       -o /data/Qwen3-VL-30B-A3B-Instruct-FP8-G2-Dynamic

example cmd:
python dynamic_quant_for_gaudi2.py -i /data/Qwen3-VL-30B-A3B-Instruct \
       -o /data/Qwen3-VL-30B-A3B-Instruct-FP8-G2-Dynamic

Signed-off-by: Chen, Wenbin <[email protected]>

wenbinc-Bin · 2025-10-30T02:00:32Z

@czhu15 @Wei-Lin-Intel Please help review.

Wei-Lin-Intel

I suggest renaming the script to include MLLM or multimodal to indicate its usage, because it has a visual exclusion here.


wenbinc-Bin · 2025-10-30T02:52:51Z

Suggest to change the script name with MLLM or multimodal to indicate the usage, because we have visual exclusion here.

I have renamed the script to dynamic_quant_multimodal_for_gaudi2.py.
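The "visual exclusion" mentioned in this thread can be pictured with a minimal sketch like the one below. It is not taken from the PR's script: the marker string `visual`, the helper name `is_excluded`, and the tensor names are all assumptions chosen for illustration.

```python
# Hypothetical sketch: marker strings and tensor names are assumptions,
# not copied from dynamic_quant_multimodal_for_gaudi2.py.
EXCLUDED_MARKERS = ("visual",)  # assumed name fragment of the vision tower


def is_excluded(tensor_name, markers=EXCLUDED_MARKERS):
    """Return True if the tensor belongs to a module that stays unquantized."""
    return any(marker in tensor_name for marker in markers)


# Illustrative tensor names only.
names = [
    "model.visual.blocks.0.attn.qkv.weight",
    "model.language_model.layers.0.mlp.experts.0.up_proj.weight",
]

# Only tensors outside the excluded modules are candidates for FP8 conversion.
to_quantize = [name for name in names if not is_excluded(name)]
```

With a filter of this kind, the vision weights would be copied through unchanged while the language-model weights are converted.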

ranzhejiang

Have changed to "quant_scheme": "channel"


wenbinc-Bin · 2025-10-31T02:32:01Z

Have changed to "quant_scheme": "channel"
The PR is updated. Thanks for the information.


Wei-Lin-Intel

LGTM

czhu15

LGTM except for some comments to improve the description of this tool.

czhu15 · 2025-11-03T13:12:02Z

scripts/dynamic_quant_multimodal_for_gaudi2.py

+
+if __name__ == "__main__":
+    parser = argparse.ArgumentParser(
+        description="Convert tensors to float8 format."


It would be good to add a more detailed description of this tool, e.g., highlight that it converts a regular bf16 checkpoint to an fp8 format that can run on Gaudi2.

PR is updated, thanks.
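As an illustration of the kind of description requested above, here is a minimal sketch of a more detailed parser. The description wording, the long option names `--input_path` and `--output_path`, and the help strings are assumptions; only the `-i` and `-o` flags appear in the PR's example command.

```python
import argparse


def build_parser():
    """Build a CLI parser with a more detailed tool description (wording assumed)."""
    parser = argparse.ArgumentParser(
        description=(
            "Convert a bf16 checkpoint to an FP8 (e4m3) checkpoint that can "
            "run on Gaudi2, using dynamic activation quantization and "
            "channel-wise weight scales."
        )
    )
    # -i / -o come from the example command; the long names are hypothetical.
    parser.add_argument("-i", "--input_path", required=True,
                        help="Directory of the source bf16 checkpoint.")
    parser.add_argument("-o", "--output_path", required=True,
                        help="Directory to write the FP8 checkpoint to.")
    return parser


# Parse an explicit argument list that mirrors the example command.
args = build_parser().parse_args([
    "-i", "/data/Qwen3-VL-30B-A3B-Instruct",
    "-o", "/data/Qwen3-VL-30B-A3B-Instruct-FP8-G2-Dynamic",
])
```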

czhu15 · 2025-11-03T13:12:33Z

scripts/dynamic_quant_multimodal_for_gaudi2.py

+    quantization_config["activation_scheme"]  = "dynamic"
+    quantization_config["fmt"]                = "e4m3"
+    quantization_config["quant_method"]       = "fp8"
+    quantization_config["quant_scheme"]      = "channel"


Does this only support channel-wise quantization?

If so, please also mention it in this tool's description.

PR is updated, thanks.
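To make the "channel" scheme discussed above concrete, the sketch below computes one scale per output channel (row) of a weight matrix and records the same `quantization_config` keys shown in the diff. It is a pure-Python illustration under stated assumptions, not the script's implementation: the helper names are hypothetical, the maximum of 240 for the e4m3 format on Gaudi2 is assumed, and the actual cast to an FP8 dtype is omitted.

```python
# Assumed representable maximum of the e4m3 format on Gaudi2.
FP8_E4M3_MAX = 240.0


def per_channel_scales(weight, fp8_max=FP8_E4M3_MAX):
    """Return one scale per row: max(|row|) / fp8_max (1.0 for all-zero rows)."""
    scales = []
    for row in weight:
        amax = max(abs(v) for v in row)
        scales.append(amax / fp8_max if amax > 0 else 1.0)
    return scales


def scale_rows(weight, scales):
    """Divide each row by its scale so its values fit the FP8 range."""
    return [[v / s for v in row] for row, s in zip(weight, scales)]


# Toy 2x3 weight matrix; rows play the role of output channels.
weight = [[0.5, -2.0, 1.0], [0.01, 0.02, -0.04]]
scales = per_channel_scales(weight)
scaled = scale_rows(weight, scales)  # would be cast to FP8 in a real tool

# Same keys and values as in the reviewed diff.
quantization_config = {
    "activation_scheme": "dynamic",
    "fmt": "e4m3",
    "quant_method": "fp8",
    "quant_scheme": "channel",
}
```

Because each row has its own scale, a row with small values (the second one here) still uses the full FP8 range, which is the usual motivation for channel-wise over per-tensor scaling.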


Add dynamic_quant_for_gaudi2.py script to convert model

1c464f6


wenbinc-Bin requested review from afierka-intel, jikunshang, kzawora-intel, madamczyk-intel, mgawarkiewicz-intel, michalkuligowski, mswiniarsk, tzielinski-habana and xuechendi as code owners October 29, 2025 09:17

Wei-Lin-Intel reviewed Oct 30, 2025


Rename script name

46ac318

Signed-off-by: Chen, Wenbin <[email protected]>

ranzhejiang reviewed Oct 31, 2025


Change per_quant_way to quant_scheme

8c92950

Signed-off-by: Chen, Wenbin <[email protected]>

Change per_channel to channel

e6f6013

Signed-off-by: Chen, Wenbin <[email protected]>

Wei-Lin-Intel approved these changes Nov 3, 2025


czhu15 approved these changes Nov 3, 2025


Change description

5ef3cff

Signed-off-by: Chen, Wenbin <[email protected]>
