Commit f61cb9a
Refactor 2 awq gemm kernels into m16nXk32 (vllm-project#2723)
Co-authored-by: Chunan Zeng <[email protected]>1 parent 3192ae5 commit f61cb9a
File tree
2 files changed
+73
-295
lines changed- csrc/quantization/awq
- vllm/model_executor/layers/quantization
2 files changed
+73
-295
lines changed
0 commit comments