Skip to content

Fix FP8 block quantization when N or K is not multiples of 128#8648

Merged
zhyncs merged 1 commit intosgl-project:mainfrom
yanbing-j:yanbing/fix_moe_fp8_scale
Aug 1, 2025
Merged

Fix FP8 block quantization when N or K is not multiples of 128#8648
zhyncs merged 1 commit intosgl-project:mainfrom
yanbing-j:yanbing/fix_moe_fp8_scale

Commits

Commits on Aug 1, 2025