vulkan: use fewer FA rows for small cache runs

c9b4b5e
Merged

Vulkan: Tune Flash Attention for MoE on AMD GPUs #18280
