[WIP] [Speculative Decoding] Use MQA kernel for target model verification#5691
Closed
LiuXiaoxuanPKU wants to merge 23 commits intovllm-project:mainfrom
Closed
[WIP] [Speculative Decoding] Use MQA kernel for target model verification#5691LiuXiaoxuanPKU wants to merge 23 commits intovllm-project:mainfrom
LiuXiaoxuanPKU wants to merge 23 commits intovllm-project:mainfrom
Commits
Commits on Jun 19, 2024
- committed
- committed
- committed
- committed
Commits on Jun 20, 2024
- committed
- committed
Commits on Jun 25, 2024
Commits on Jul 2, 2024
Commits on Jul 9, 2024
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed