[Core] Allow full cudagraph with separate attention routines and orthogonal to compilation, add support for FA2 and FlashInfer #20059

Merged
LucasWilkinson merged 129 commits into vllm-project:main from fhl2000:full_cudagraph_FA2_FlashInfer
Aug 15, 2025

Commits

Commits on Jun 25, 2025

Commits on Jun 26, 2025

Commits on Jun 27, 2025

Commits on Jun 28, 2025

Commits on Jul 1, 2025

Commits on Jul 5, 2025

Commits on Jul 6, 2025

Commits on Jul 9, 2025

Commits on Jul 10, 2025

Commits on Jul 11, 2025

Commits on Jul 12, 2025

Commits on Jul 13, 2025

Commits on Jul 14, 2025

Commits on Jul 17, 2025

Commits on Jul 18, 2025

Commits on Jul 20, 2025

Commits on Jul 21, 2025

Commits on Jul 23, 2025

Commits on Jul 24, 2025

Commits on Jul 26, 2025

Commits on Jul 27, 2025

Commits on Jul 28, 2025

Commits on Jul 29, 2025

Commits on Jul 30, 2025

Commits on Jul 31, 2025

Commits on Aug 1, 2025

Commits on Aug 2, 2025

Commits on Aug 4, 2025

Commits on Aug 5, 2025

Commits on Aug 6, 2025

Commits on Aug 7, 2025

Commits on Aug 8, 2025

Commits on Aug 9, 2025

Commits on Aug 10, 2025

Commits on Aug 11, 2025

Commits on Aug 12, 2025

Commits on Aug 15, 2025