Enable sequence parallelism for full cuda graph without specifying compile sizes #21031

Closed
cascade812 wants to merge 4 commits into vllm-project:main from cascade812:sp2

Commits

Commits on Jul 16, 2025

Commits on Sep 27, 2025