Skip to content

Commit 24e3ebc

Browse files
committed
add use individual arguments config eplb
Signed-off-by: rongfu.leng <[email protected]>
1 parent 5747b45 commit 24e3ebc

File tree

1 file changed

+12
-1
lines changed

1 file changed

+12
-1
lines changed

docs/serving/expert_parallel_deployment.md

Lines changed: 12 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -123,7 +123,7 @@ When enabled, vLLM collects load statistics with every forward pass and periodic
123123

124124
### EPLB Parameters
125125

126-
The new `--eplb-config` argument should be used, which accepts a JSON string. The available keys and their descriptions are:
126+
Configure EPLB with the `--eplb-config` argument, which accepts a JSON string. The available keys and their descriptions are:
127127

128128
| Parameter | Description | Default |
129129
|-----------|-------------|---------|
@@ -140,6 +140,17 @@ vllm serve Qwen/Qwen3-30B-A3B \
140140
--eplb-config '{"window_size":1000,"step_interval":3000,"num_redundant_experts":2,"log_balancedness":true}'
141141
```
142142

143+
??? tip "Prefer individual arguments instead of JSON?"
144+
145+
```bash
146+
vllm serve Qwen/Qwen3-30B-A3B \
147+
--enable-eplb \
148+
--eplb-config.window_size 1000 \
149+
--eplb-config.step_interval 3000 \
150+
--eplb-config.num_redundant_experts 2 \
151+
--eplb-config.log_balancedness true
152+
```
153+
143154
### Expert Distribution Formula
144155

145156
- **Default**: Each EP rank has `NUM_TOTAL_EXPERTS ÷ NUM_EP_RANKS` experts

0 commit comments

Comments
 (0)