Commit 3fd1928

[examples] fix config (#420)
1 parent b308f6b commit 3fd1928

2 files changed: +3 -2 lines changed

examples/config.yaml

Lines changed: 2 additions & 2 deletions
@@ -61,15 +61,15 @@ worker:
   rollout:
     n: 5
     temperature: 1.0
-    top_p: 0.99
+    top_p: 1.0
     limit_images: 0
     gpu_memory_utilization: 0.6
     enforce_eager: false
     enable_chunked_prefill: false
     tensor_parallel_size: 2
     disable_tqdm: false
     val_override_config:
-      temperature: 0.5
+      temperature: 1.0
       n: 1
 
   ref:
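
To make the effect of this hunk concrete, here is a minimal Python sketch comparing the validation-time sampling values before and after the commit. It assumes that `val_override_config` is overlaid key by key on the rollout sampling parameters during validation; the diff does not confirm that behavior, and `effective_val_sampling` is a hypothetical helper, not part of the trainer.

```python
# Hypothetical helper: overlay validation overrides on the rollout defaults.
# Assumption: the trainer merges val_override_config key by key at validation time.
def effective_val_sampling(rollout: dict, val_override: dict) -> dict:
    merged = dict(rollout)       # copy so the defaults are not mutated
    merged.update(val_override)  # override keys win over defaults
    return merged


# Values taken from examples/config.yaml before the commit.
before = effective_val_sampling(
    {"n": 5, "temperature": 1.0, "top_p": 0.99},
    {"temperature": 0.5, "n": 1},
)

# Values taken from examples/config.yaml after the commit.
after = effective_val_sampling(
    {"n": 5, "temperature": 1.0, "top_p": 1.0},
    {"temperature": 1.0, "n": 1},
)

print(before)  # {'n': 1, 'temperature': 0.5, 'top_p': 0.99}
print(after)   # {'n': 1, 'temperature': 1.0, 'top_p': 1.0}
```

Under that assumption, validation after the commit samples with the same `temperature` and `top_p` as training rollouts and differs only in `n`.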

examples/qwen3_14b_dapo17k_dapo.sh

Lines changed: 1 addition & 0 deletions
@@ -29,6 +29,7 @@ python3 -m verl.trainer.main \
     worker.rollout.max_num_batched_tokens=22528 \
     worker.rollout.val_override_config='{"n":16,"temperature":1.0,"top_p":0.7}' \
     worker.rollout.gpu_memory_utilization=0.8 \
+    worker.rollout.tensor_parallel_size=4 \
     worker.reward.reward_function=./examples/reward_function/dapo.py:compute_score \
     worker.reward.reward_function_kwargs='{"max_response_length":20480,"overlong_buffer_length":4096,"overlong_penalty_factor":1.0}' \
     algorithm.disable_kl=True \
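
The script passes dotted `key=value` arguments that take precedence over the YAML defaults, so the added line raises `tensor_parallel_size` from the default of 2 to 4 for this run. The sketch below illustrates that idea in plain Python. `apply_override` is a hypothetical stand-in, not the parser used by `verl.trainer.main`, and treating each value as JSON with a string fallback is an assumption made for the example.

```python
import json


def apply_override(config: dict, override: str) -> None:
    """Apply one `a.b.c=value` override in place (hypothetical parser)."""
    dotted_key, raw_value = override.split("=", 1)
    *parents, leaf = dotted_key.split(".")
    node = config
    for name in parents:
        # Walk down the nested dict, creating missing levels on the way.
        node = node.setdefault(name, {})
    try:
        # Numbers and JSON objects are decoded; this is an assumption.
        node[leaf] = json.loads(raw_value)
    except json.JSONDecodeError:
        # Anything that is not valid JSON (paths, `True`) stays a string.
        node[leaf] = raw_value


# Defaults as they appear in examples/config.yaml.
config = {
    "worker": {
        "rollout": {"tensor_parallel_size": 2, "gpu_memory_utilization": 0.6}
    }
}

# Overrides copied from the script, after the shell has removed the quotes.
apply_override(config, "worker.rollout.gpu_memory_utilization=0.8")
apply_override(config, "worker.rollout.tensor_parallel_size=4")
apply_override(
    config,
    'worker.rollout.val_override_config={"n":16,"temperature":1.0,"top_p":0.7}',
)

print(config["worker"]["rollout"]["tensor_parallel_size"])  # 4
```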
