stepinto (Contributor) commented Aug 5, 2025

When the parameter `cache_seqlen` is a scalar, it should be expanded to a vector of shape (batch_size). In the original code, whenever `block_table` is used, the shape of `k_cache` is (num_blocks, page_size, ...), so `cache_seqlen` is expanded to shape (num_blocks) instead of (batch_size), which is wrong. This fix takes the size from `q`, whose first dimension is always `batch_size`.
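
For illustration, a minimal sketch of the scalar-to-vector expansion the fix describes (`expand_cache_seqlens` is a hypothetical helper for this example, not the library's actual code path):

```python
import torch

def expand_cache_seqlens(cache_seqlens, q, k_cache):
    # Hypothetical helper illustrating the fix described above.
    # When block_table is used, k_cache has shape (num_blocks, page_size, ...),
    # so k_cache.shape[0] is num_blocks, not batch_size. q always has shape
    # (batch_size, seqlen_q, nheads, headdim), so its first dimension is the
    # correct length for the per-batch cache_seqlens vector.
    if isinstance(cache_seqlens, int):
        cache_seqlens = torch.full(
            (q.shape[0],),            # batch_size taken from q, not k_cache
            cache_seqlens,
            dtype=torch.int32,
            device=k_cache.device,
        )
    return cache_seqlens
```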

stepinto force-pushed the cache_seqlens_0805 branch from 9c1843f to 0f5288c on August 5, 2025 16:07
stepinto changed the title from "[[BugFix]] Fix flash_attn_with_kvcache with scalar cache_seqlen" to "[BugFix] Fix flash_attn_with_kvcache with scalar cache_seqlen" on Aug 5, 2025
stepinto force-pushed the cache_seqlens_0805 branch from 0f5288c to 9057ef4 on August 5, 2025 16:09
tridao merged commit cd9383f into Dao-AILab:main on Aug 15, 2025
tridao (Member) commented Aug 15, 2025

Thank you!

stepinto deleted the cache_seqlens_0805 branch on October 10, 2025 00:53