Skip to content

Commit 148e877

Browse files
qli88hongxiayang
andcommitted
Update vllm/attention/ops/triton_decode_attention.py
Co-authored-by: Hongxia Yang <[email protected]> Signed-off-by: qli88 <[email protected]>
1 parent a77294e commit 148e877

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

vllm/attention/ops/triton_decode_attention.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -425,7 +425,7 @@ def _decode_grouped_att_m_fwd(
425425
extra_kargs = {}
426426
num_stages = 2
427427
if is_hip_:
428-
# https://rocm.docs.amd.com/en/docs-6.2.0/how-to/llm-fine-tuning-optimization/optimizing-triton-kernel.html
428+
# https://rocm.docs.amd.com/en/latest/how-to/rocm-for-ai/inference-optimization/workload.html#triton-kernel-performance-optimization
429429
# https://github.com/triton-lang/triton/blob/main/third_party/amd/backend/compiler.py
430430
extra_kargs = {
431431
"waves_per_eu": 1,

0 commit comments

Comments
 (0)