Commit ef7f8f5
committed
fix(rollout_corr): prevent silent failure when rollout_rs enabled without threshold
Fixes critical bug where setting rollout_rs="token" but rollout_rs_threshold=None
would silently skip rejection sampling instead of raising an error.
Changes:
- Raise ValueError if rollout_rs is set but rollout_rs_threshold is explicitly None
- Update docs to clarify rollout_rs_threshold is required when rollout_rs is enabled
- Default value (2.0) still works when parameter is omitted entirely1 parent 36e0511 commit ef7f8f5
File tree
2 files changed
+9
-4
lines changed- docs/advance
- verl/trainer/ppo
2 files changed
+9
-4
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
61 | 61 | | |
62 | 62 | | |
63 | 63 | | |
64 | | - | |
| 64 | + | |
65 | 65 | | |
66 | 66 | | |
67 | 67 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
550 | 550 | | |
551 | 551 | | |
552 | 552 | | |
553 | | - | |
554 | | - | |
| 553 | + | |
| 554 | + | |
555 | 555 | | |
556 | 556 | | |
557 | 557 | | |
| |||
599 | 599 | | |
600 | 600 | | |
601 | 601 | | |
602 | | - | |
| 602 | + | |
| 603 | + | |
| 604 | + | |
| 605 | + | |
| 606 | + | |
| 607 | + | |
603 | 608 | | |
604 | 609 | | |
605 | 610 | | |
| |||
0 commit comments