Skip to content

[WIP] [Speculative Decoding] Use MQA kernel for target model verification#5691

Closed
LiuXiaoxuanPKU wants to merge 23 commits intovllm-project:mainfrom
LiuXiaoxuanPKU:flashinfer-sd
Closed

[WIP] [Speculative Decoding] Use MQA kernel for target model verification#5691
LiuXiaoxuanPKU wants to merge 23 commits intovllm-project:mainfrom
LiuXiaoxuanPKU:flashinfer-sd

Commits

Commits on Jun 19, 2024

Commits on Jun 20, 2024

Commits on Jun 25, 2024

Commits on Jul 2, 2024

Commits on Jul 9, 2024

Commits on Jul 12, 2024