Skip to content

Commit 1f33ca7

Browse files
yewentao256eicherseiji
authored andcommitted
[Log] Only Print Profiler Results on Rank 0 (vllm-project#23370)
Signed-off-by: yewentao256 <[email protected]>
1 parent 88419e1 commit 1f33ca7

2 files changed

Lines changed: 8 additions & 4 deletions

File tree

vllm/v1/worker/gpu_worker.py

Lines changed: 4 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -404,8 +404,10 @@ def profile(self, is_start: bool = True):
404404
self.profiler.start()
405405
else:
406406
self.profiler.stop()
407-
print(self.profiler.key_averages().table(
408-
sort_by="self_cuda_time_total"))
407+
# only print profiler results on rank 0
408+
if self.local_rank == 0:
409+
print(self.profiler.key_averages().table(
410+
sort_by="self_cuda_time_total"))
409411

410412
def execute_dummy_batch(self) -> None:
411413
self.model_runner._dummy_run(1)

vllm/worker/worker.py

Lines changed: 4 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -128,8 +128,10 @@ def stop_profile(self):
128128
if self.profiler is None:
129129
raise RuntimeError("Profiler is not enabled.")
130130
self.profiler.stop()
131-
print(
132-
self.profiler.key_averages().table(sort_by="self_cuda_time_total"))
131+
# only print profiler results on rank 0
132+
if self.local_rank == 0:
133+
print(self.profiler.key_averages().table(
134+
sort_by="self_cuda_time_total"))
133135

134136
def sleep(self, level: int = 1) -> None:
135137
free_bytes_before_sleep = torch.cuda.mem_get_info()[0]

0 commit comments

Comments
 (0)