Skip to content

Commit 5ea71ff

Browse files
[V1] reduce block size for tree attention correctness test to fix 'ou… (#22207)
Signed-off-by: Giancarlo Delfin <[email protected]>
1 parent 7175817 commit 5ea71ff

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

tests/v1/spec_decode/test_tree_attention.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -155,7 +155,7 @@ def test_tree_attn_correctness() -> None:
155155

156156
dim_per_head = 128
157157
num_kv_heads = 2
158-
block_size = 128
158+
block_size = 32
159159
max_sequence_length = 8192
160160
randomize_blocks = True
161161
for batch_size in [1, 16, 32]:

0 commit comments

Comments
 (0)