Enhance w4afp8 performance: implement per-token w4afp8 CUTLASS MoE GEMM for FP8 dispatch, improve performance with w4afp8 MoE GEMM #18144

Closed
Wangzheee wants to merge 0 commits into sgl-project:main from Wangzheee:w4afp8_per-token-kernel

Commits

There isn't any commit history to show here.