[sglang] fix: remove unused padding in SGLang rollout #3138

PopSoda2002 · 2025-08-20T02:38:38Z

What does this PR do?

What does this PR do?
There are some unused padding talked in this issue:
zhaochenyang20/Awesome-ML-SYS-Tutorial#193

There are just 5 key fields which need to return back after rollout(example in agent_loop):

batch = TensorDict(
{
    "prompts": prompt_ids,  # [bsz, prompt_length]
    "responses": response_ids,  # [bsz, response_length]
    "response_mask": response_mask,  # [bsz, response_length]
    "input_ids": input_ids,  # [bsz, prompt_length + response_length]
    "attention_mask": attention_mask,  # [bsz, prompt_length + response_length]
    "position_ids": position_ids, 
    # position_ids: [bsz, 3, prompt_length + response_length] or [bsz, prompt_length + response_length]
},
batch_size=len(inputs),
)

Remove some unused variable like prompt_loss_mask
Make response_position_id all zero tensor
Copy class to avoid constructing a new class

Test

over_sample = 0.1
wandb

No issue.

over_sample = 0.0
wandb

As expected too

Checklist Before Submitting

Important

Please check all the following items before requesting a review, otherwise the reviewer might deprioritize this PR for review.

Read the Contribute Guide.
Apply pre-commit checks: pre-commit install && pre-commit run --all-files --show-diff-on-failure --color=always

zhaochenyang20

Good to go. pass the CI plz

### What does this PR do? What does this PR do? There are some unused padding talked in this issue: zhaochenyang20/Awesome-ML-SYS-Tutorial#193 - There are just 5 key fields which need to return back after rollout(example in `agent_loop`): ```python batch = TensorDict( { "prompts": prompt_ids, # [bsz, prompt_length] "responses": response_ids, # [bsz, response_length] "response_mask": response_mask, # [bsz, response_length] "input_ids": input_ids, # [bsz, prompt_length + response_length] "attention_mask": attention_mask, # [bsz, prompt_length + response_length] "position_ids": position_ids, # position_ids: [bsz, 3, prompt_length + response_length] or [bsz, prompt_length + response_length] }, batch_size=len(inputs), ) ``` - Remove some unused variable like `prompt_loss_mask` - Make `response_position_id` all zero tensor - Copy class to avoid constructing a new class ### Test `over_sample = 0.1` [wandb](https://wandb.ai/popsoda-university-of-washington/multi-turn-grpo-qwen2.5-3b-sglang/runs/1p87zi7v?nw=nwuserpopsoda) <img width="1555" height="680" alt="image" src="https://github.com/user-attachments/assets/b837acab-824d-42c6-ad3d-8342d06397d1" /> No issue. `over_sample = 0.0` [wandb](https://wandb.ai/popsoda-university-of-washington/multi-turn-grpo-qwen2.5-3b-sglang/runs/xloii5wm?nw=nwuserpopsoda) <img width="1532" height="683" alt="image" src="https://github.com/user-attachments/assets/fd69be47-8182-4461-86d0-86063e6f8e1a" /> As expected too ### Checklist Before Submitting > [!IMPORTANT] > Please check all the following items before requesting a review, otherwise the reviewer might deprioritize this PR for review. - [x] Read the [Contribute Guide](https://github.com/volcengine/verl/blob/main/CONTRIBUTING.md). - [x] Apply [pre-commit checks](https://github.com/volcengine/verl/blob/main/CONTRIBUTING.md#code-linting-and-formatting): `pre-commit install && pre-commit run --all-files --show-diff-on-failure --color=always`

fix oversampling

be779aa

zhaochenyang20 approved these changes Aug 20, 2025

View reviewed changes

PopSoda2002 marked this pull request as ready for review August 21, 2025 02:50

PopSoda2002 requested review from SwordFaith and chenhaiq as code owners August 21, 2025 02:50

wuxibin89 approved these changes Aug 21, 2025

View reviewed changes

wuxibin89 merged commit 0e15c9b into volcengine:main Aug 21, 2025
59 of 61 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[sglang] fix: remove unused padding in SGLang rollout #3138

[sglang] fix: remove unused padding in SGLang rollout #3138

Uh oh!

PopSoda2002 commented Aug 20, 2025 •

edited

Loading

Uh oh!

zhaochenyang20 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[sglang] fix: remove unused padding in SGLang rollout #3138

[sglang] fix: remove unused padding in SGLang rollout #3138

Uh oh!

Conversation

PopSoda2002 commented Aug 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Test

Checklist Before Submitting

Uh oh!

zhaochenyang20 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

PopSoda2002 commented Aug 20, 2025 •

edited

Loading