Commit 2205c07

Removing the contiguous requirement, as the kernel supports arbitrary strides
Signed-off-by: Gregory Shtrasberg <[email protected]>
1 parent bbab81f commit 2205c07

1 file changed, +1 -1 lines changed

vllm/model_executor/layers/quantization/utils/fp8_utils.py

Lines changed: 1 addition & 1 deletion
@@ -477,7 +477,7 @@ def w8a8_block_fp8_matmul(
     assert triton.cdiv(A.shape[-1], block_k) == As.shape[-1]
     M = A.numel() // A.shape[-1]

-    assert B.ndim == 2 and B.is_contiguous() and Bs.ndim == 2
+    assert B.ndim == 2 and Bs.ndim == 2
     N, K = B.shape
     assert triton.cdiv(N, block_n) == Bs.shape[0]
     assert triton.cdiv(K, block_k) == Bs.shape[1]
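
The commit message says the kernel supports arbitrary strides, which is why the `B.is_contiguous()` check can go. The sketch below illustrates the general idea in plain Python. It is not the Triton kernel from `fp8_utils.py`: it ignores FP8 quantization and the block scales `As`/`Bs`, and the helpers `is_contiguous`, `load`, and `matmul_bt` are hypothetical names introduced only for this example. It also assumes the product has the form `C = A @ B.T` with `B` of shape `(N, K)`. The point is that a loop which addresses every element through explicit strides gives the same result whether `B` is a row-major buffer or a transposed view of one.

```python
def is_contiguous(shape, strides):
    """Return True if a 2-D view is laid out row-major with no gaps."""
    _, cols = shape
    return strides == (cols, 1)


def load(buf, strides, i, j):
    """Read element (i, j) of a 2-D view over a flat buffer using its strides."""
    return buf[i * strides[0] + j * strides[1]]


def matmul_bt(a, a_shape, a_strides, b, b_shape, b_strides):
    """Compute C = A @ B.T, where A is (M, K) and B is (N, K).

    Every element is addressed through strides, so neither operand
    needs to be contiguous in memory.
    """
    m, k = a_shape
    n, k_b = b_shape
    assert k == k_b, "inner dimensions must match"
    return [
        [
            sum(load(a, a_strides, i, p) * load(b, b_strides, j, p) for p in range(k))
            for j in range(n)
        ]
        for i in range(m)
    ]


# A is a 2x3 row-major matrix: [[1, 2, 3], [4, 5, 6]].
a_buf, a_shape, a_strides = [1, 2, 3, 4, 5, 6], (2, 3), (3, 1)

# B is logically [[1, 0, 2], [0, 1, 3]] with shape (N=2, K=3).
# Layout 1: stored row-major, so the view is contiguous.
b_contig, b_contig_strides = [1, 0, 2, 0, 1, 3], (3, 1)
# Layout 2: the buffer holds B.T (3x2) row-major and B is a transposed
# view of it, so stepping along K jumps 2 elements at a time.
b_view, b_view_strides = [1, 0, 0, 1, 2, 3], (1, 2)

c_contig = matmul_bt(a_buf, a_shape, a_strides, b_contig, (2, 3), b_contig_strides)
c_strided = matmul_bt(a_buf, a_shape, a_strides, b_view, (2, 3), b_view_strides)

print(is_contiguous((2, 3), b_view_strides))  # False
print(c_contig == c_strided)                  # True
```

Under these assumptions, a contiguity assertion on `B` would reject the second layout even though the stride-aware loop handles it correctly.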