Skip to content

vulkan: optimize flash attention split_k_reduce#14554

Merged
0cc4m merged 2 commits intoggml-org:masterfrom
jeffbolznv:fa_split_k_opts
Jul 8, 2025
Merged

vulkan: optimize flash attention split_k_reduce#14554
0cc4m merged 2 commits intoggml-org:masterfrom
jeffbolznv:fa_split_k_opts

Commits

Commits on Jul 6, 2025