Enhance w4afp8 performance: implement per-token w4afp8 CUTLASS MoE GEMM for FP8 dispatch, improve performance with w4afp8 moe gemm#18144
Closed
Wangzheee wants to merge 0 commit intosgl-project:mainfrom
Commits
No commits history
There isn't any commit history to show here.