Skip to content

Commit 10b5a35

Browse files
yaochengjijinzhen-lin
authored andcommitted
[TPU][Bugfix] fix OOM issue in CI test (vllm-project#21550)
Signed-off-by: Chengji Yao <[email protected]> Signed-off-by: Jinzhen Lin <[email protected]>
1 parent 5213875 commit 10b5a35

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

tests/v1/tpu/test_basic.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -59,7 +59,7 @@ def test_basic(
5959
# actually test chunked prompt
6060
max_num_batched_tokens=1024,
6161
max_model_len=8192,
62-
gpu_memory_utilization=0.7,
62+
gpu_memory_utilization=0.95,
6363
max_num_seqs=max_num_seqs,
6464
tensor_parallel_size=tensor_parallel_size) as vllm_model:
6565
vllm_outputs = vllm_model.generate_greedy(example_prompts,

0 commit comments

Comments
 (0)