[Performance] Introducing Prefix-Cached Chunked Prefill with flash-attn backend and 10% throughput gained under prompt <1K #6819

Closed
Juelianqvq wants to merge 1 commit into vllm-project:main from Juelianqvq:main

Commits

Commits on Jul 26, 2024