Commit 60c1b80
[Kernel] Update cutlass_scaled_mm to support 2d group (blockwise) scaling (vllm-project#11868)
1 parent d3939af commit 60c1b80
File tree
25 files changed, +1924 -346 lines changed
- benchmarks/cutlass_benchmarks
- csrc
  - core
  - cutlass_extensions
    - gemm
      - collective
  - quantization
    - cutlass_w8a8
      - c3x
    - machete
- tests/kernels
- vllm