Commit ee65422

[sglang] fix: only wake up weights on infer_tp 0 (#2403)
1 parent 3f929af commit ee65422

1 file changed: +5 -1 lines changed

verl/workers/sharding_manager/fsdp_sglang.py

Lines changed: 5 additions & 1 deletion
@@ -173,7 +173,11 @@ async def wake_up(self):
         get_torch_device().empty_cache()
         log_gpu_memory_usage("After del state_dict and empty_cache in sharding manager", logger=logger)
 
-        if self.multi_stage_wake_up and self.rollout_config.free_cache_engine:
+        if (
+            self.multi_stage_wake_up
+            and self.rollout_config.free_cache_engine
+            and self.device_mesh["infer_tp"].get_local_rank() == 0
+        ):
             await self.inference_engine.resume_memory_occupation(tags=["kv_cache"])
             log_gpu_memory_usage("After resume SGLang kv_cache in sharding manager", logger=logger)
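
The effect of the added guard can be sketched without torch or SGLang. This is a minimal illustration, not the verl implementation: FakeEngine, maybe_resume_kv_cache, and simulate are hypothetical names, and the tensor-parallel local rank is passed in as a plain integer instead of being read from self.device_mesh["infer_tp"].get_local_rank(). The diff does not state why only rank 0 should issue the call, so the sketch only demonstrates what the condition does.

```python
import asyncio
from dataclasses import dataclass, field


@dataclass
class FakeEngine:
    """Hypothetical stand-in for the inference engine; it only records resume calls."""

    resumed_tags: list = field(default_factory=list)

    async def resume_memory_occupation(self, tags):
        self.resumed_tags.append(list(tags))


async def maybe_resume_kv_cache(engine, tp_local_rank, multi_stage_wake_up=True, free_cache_engine=True):
    """Resume the KV cache only when all three conditions from the diff hold."""
    if multi_stage_wake_up and free_cache_engine and tp_local_rank == 0:
        await engine.resume_memory_occupation(tags=["kv_cache"])
        return True
    return False


async def simulate(tp_size):
    """Run the guard once per simulated TP-local rank against one shared engine."""
    engine = FakeEngine()
    results = [await maybe_resume_kv_cache(engine, rank) for rank in range(tp_size)]
    return engine, results


engine, results = asyncio.run(simulate(tp_size=4))
print(results)              # [True, False, False, False]
print(engine.resumed_tags)  # [['kv_cache']]
```

With four simulated ranks, only local rank 0 reaches resume_memory_occupation, so the engine sees a single kv_cache resume request instead of one per rank.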
