Commit 5638364
Refactor 2 awq gemm kernels into m16nXk32 (#2723)
Co-authored-by: Chunan Zeng <[email protected]>
1 parent 4ca2c35 commit 5638364
File tree
2 files changed (+73, -295 lines changed)
- csrc/quantization/awq
- vllm/model_executor/layers/quantization