Skip to content

Commit 1cdc886

Browse files
authored
Missing comment explaining VDR variable in GGUF kernels (#13290)
1 parent 31aa045 commit 1cdc886

File tree

1 file changed

+2
-0
lines changed

1 file changed

+2
-0
lines changed

csrc/quantization/gguf/vecdotq.cuh

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -37,6 +37,8 @@ static __device__ __forceinline__ int get_int_from_uint8_aligned(const uint8_t *
3737
return *((const int *) (x8 + sizeof(int) * i32)); // assume at least 4 byte alignment
3838
}
3939

40+
// VDR = vec dot ratio, how many contiguous integers each thread processes when the vec dot kernel is called
41+
// MMVQ = mul_mat_vec_q, MMQ = mul_mat_q
4042

4143
#define VDR_Q4_0_Q8_1_MMVQ 2
4244
#define VDR_Q4_0_Q8_1_MMQ 4

0 commit comments

Comments
 (0)