Update `max_length` explanation for VLM in online trainers #4220

sergiopaniego · 2025-10-07T12:38:24Z

What does this PR do?

For GRPO and RLOO, the docs currently suggest setting `max_prompt_length` to `None`, which is wrong. The correct parameter would be `max_length`. Tip copied from the SFT trainer.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline, Pull Request section?
Was this discussed/approved via a GitHub issue? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

Who can review?

HuggingFaceDocBuilderDev · 2025-10-07T12:41:07Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

albertvillanova

Thanks! I assume the rendering is OK; I could not check it because the PR docs link does not work: https://moon-ci-docs.huggingface.co/docs/trl/pr_4220/en/index

qgallouedec

`max_length` isn't a valid argument of `GRPOConfig`. It's `max_prompt_length`.
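To make the point of the discussion concrete, here is a toy sketch of what disabling prompt truncation with `None` means for a VLM prompt. It is not TRL code: `truncate_prompt`, `count_image_tokens`, and the `<image>` placeholder are hypothetical names, and both the keep-the-last-tokens behaviour and the idea that truncation can drop image tokens are assumptions made for illustration, not something stated in this thread.

```python
from typing import Optional

IMAGE_TOKEN = "<image>"  # hypothetical placeholder for one image token


def truncate_prompt(tokens: list[str], max_prompt_length: Optional[int]) -> list[str]:
    """Keep the last `max_prompt_length` tokens; `None` disables truncation.

    Assumption: truncation keeps the end of the prompt (left truncation).
    """
    if max_prompt_length is None:
        return tokens
    if max_prompt_length <= 0:
        raise ValueError("max_prompt_length must be positive or None")
    return tokens[-max_prompt_length:]


def count_image_tokens(tokens: list[str]) -> int:
    """Count how many image placeholder tokens are present."""
    return sum(token == IMAGE_TOKEN for token in tokens)


# Toy VLM prompt: 4 image placeholder tokens followed by 3 text tokens.
prompt = [IMAGE_TOKEN] * 4 + ["Describe", "the", "picture"]

truncated = truncate_prompt(prompt, max_prompt_length=5)
untouched = truncate_prompt(prompt, max_prompt_length=None)

# After truncation, fewer image tokens remain than the image features expect.
print(count_image_tokens(prompt), count_image_tokens(truncated), count_image_tokens(untouched))
```

In this sketch the truncated prompt no longer contains all of the image placeholders, while passing `None` leaves the prompt intact.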

docs/source/grpo_trainer.md

docs/source/rloo_trainer.md

Update max_length explanation for VLMs

75a0582

sergiopaniego requested review from albertvillanova and qgallouedec October 7, 2025 12:38

albertvillanova approved these changes Oct 7, 2025

qgallouedec requested changes Oct 7, 2025

qgallouedec requested changes Nov 5, 2025

docs/source/grpo_trainer.md (outdated, resolved)

docs/source/rloo_trainer.md (outdated, resolved)

docs/source/rloo_trainer.md (outdated, resolved)

docs/source/grpo_trainer.md (outdated, resolved)

qgallouedec requested changes Nov 5, 2025

docs/source/rloo_trainer.md (outdated, resolved)

docs/source/grpo_trainer.md (outdated, resolved)

qgallouedec changed the title from "Update max_length explanation for VLM trainers" to "Update max_length explanation for VLM in online trainers" Nov 5, 2025

qgallouedec added 6 commits November 4, 2025 18:01

Apply suggestion from @qgallouedec

5634d26

Apply suggestion from @qgallouedec

ba57528

Apply suggestion from @qgallouedec

5a776a8

Apply suggestion from @qgallouedec

3beb1bf

Apply suggestion from @qgallouedec

5560098

Apply suggestion from @qgallouedec

2427207

qgallouedec approved these changes Nov 5, 2025

Merge branch 'main' into max-length-docs

bfd5678

qgallouedec merged commit 0d57110 into main Nov 5, 2025
3 checks passed

qgallouedec deleted the max-length-docs branch November 5, 2025 01:08
