Skip to content

SM100 Cutlass MLA decode with unrestricted num_heads (< 128) for DeepSeek TP#20769

Merged
alexm-redhat merged 2 commits intomainfrom
mla_fi_prefill_and_decode
Jul 15, 2025
Merged

SM100 Cutlass MLA decode with unrestricted num_heads (< 128) for DeepSeek TP#20769
alexm-redhat merged 2 commits intomainfrom
mla_fi_prefill_and_decode

Commits

Commits on Jul 14, 2025