You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[docs] refactor: Adding doc strings and doc pages for public methods in trainer and utils (#1397)
### Checklist Before Starting
- [x] Search for similar PR(s).
### What does this PR do?
* This PR adds doc string for the public methods inside `trainer` and
`utils` module, so that these methods can be reused and referenced
better.
* Two new doc page `PPO Trainer Interface` and `Utilities` were also
provided under the API Reference section.
* Renamed one function `verl.utils._default_compute_score` to
`verl.utils.default_compute_score`, as it was an external function used
by other modules, i.e., trainer and recipe;
<img width="1093" alt="Screenshot 2025-05-26 at 9 20 31 PM"
src="https://github.com/user-attachments/assets/e361e6bd-a33b-426b-85b4-9fe93ab1e398"
/>
### TODO
This is the second of a series of PRs to improve and stabilize the docs
and API. Stacked on top of #1396
TODO includes adding more useful utility functions to the doc with
improved doc strings.
### Additional Info.
- **Issue Number**: Fixes issue # or discussion # if any.
- **Training**: [Note which backend this PR will affect: FSDP, Megatron,
both, or none]
- **Inference**: [Note which backend this PR will affect: vLLM, SGLang,
both, or none]
### Checklist Before Submitting
- [x] Read the [Contribute
Guide](https://github.com/volcengine/verl?tab=readme-ov-file#contribution-guide).
- [x] Apply [pre-commit
checks](https://github.com/volcengine/verl?tab=readme-ov-file#code-linting-and-formatting).
- [ ] Add `[BREAKING]` to the PR title if it breaks any API.
- [x] Update the documentation about your changes in the
[docs](https://github.com/volcengine/verl/tree/main/docs).
- [x] Add CI test(s) if neccessary.
---------
Signed-off-by: Hongpeng Guo <[email protected]>
Co-authored-by: H <[email protected]>
score=_default_compute_score(data_source, completion, ground_truth, extra_info={"sandbox_fusion_url": sandbox_fusion_url}) # <-- Use the URL obtained from the environment variable
138
+
score=default_compute_score(data_source, completion, ground_truth, extra_info={"sandbox_fusion_url": sandbox_fusion_url}) # <-- Use the URL obtained from the environment variable
0 commit comments