
Commit 562dbbd

Fix compilation errors in trtllm_fmha_kernel_launcher
- Add missing 'int' type for kv_stride_keys_values declarations
- Fix lse parameter order in trtllm_paged_attention_launcher calls (lse params should come after workspace_size, not before sm_count)
1 parent 6a372b1 commit 562dbbd

1 file changed

Lines changed: 1 addition & 1 deletion

csrc/trtllm_fmha_kernel_launcher.cu

@@ -286,7 +286,7 @@ void trtllm_paged_attention_decode(TensorView out, Optional<TensorView> out_scal
   int num_kv_heads = key_cache.size(-3);
   int kv_stride_keys_values = key_cache.stride(-2); // key/values
   int kv_stride_heads = key_cache.stride(-3); // head
-  int kv_stride_batch = key_cache.stride(0); // batch
+  int kv_stride_batch = key_cache.stride(0); // batch
 
   if (is_4bit(kv_data_type)) {
     kv_stride_keys_values *= 2;
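
The context lines above double kv_stride_keys_values when the KV cache holds a 4-bit type. The commit does not say why. One plausible reading, stated here as an assumption, is that the tensor reports strides in storage elements (one byte packing two 4-bit values) while the kernel expects strides in logical 4-bit elements. A minimal sketch of that conversion follows; KvDataType and logical_stride are hypothetical names, and this is_4bit is a stand-in, not the helper from the source file.

```cpp
#include <cstdint>

// Hypothetical stand-in for the launcher's data-type enum.
enum class KvDataType { kFp16, kFp8, kFp4 };

// Hypothetical stand-in for the is_4bit check seen in the diff.
inline bool is_4bit(KvDataType t) { return t == KvDataType::kFp4; }

// Assumption: storage_stride counts storage elements, and for 4-bit data
// each storage element (one byte) packs two logical values. The stride in
// logical elements is then twice the storage stride; other types are
// returned unchanged.
inline int64_t logical_stride(int64_t storage_stride, KvDataType t) {
  return is_4bit(t) ? storage_stride * 2 : storage_stride;
}
```

Under this assumption, a storage stride of 64 bytes corresponds to 128 logical 4-bit elements, while an fp16 stride of 64 stays 64.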
