(sync): sync to GitHub 0924. #172
Merged
+14,294
−5,746
Conversation
force-pushed from 1ee16b3 to 58d97ca
(feat): refine req.
(fix): fix sglang logprobs.
(docs): update readme AIGB-Pearl.
(feat): support wan2_2 reward fl pipeline.
(docs): add docs.
fix: fix vllm version compare
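The vLLM version-compare fix above points at a common pitfall: comparing version strings lexicographically, where `"0.10.2" < "0.9.0"` because `'1' < '9'` character-wise. A minimal illustration of the correct tuple-based comparison (a generic sketch, not the actual ROLL fix):

```python
def parse_version(v: str) -> tuple:
    """Parse a dotted numeric version string into an int tuple so that
    comparison is numeric per segment, not character-by-character.
    Handles plain numeric segments only (sufficient for this sketch)."""
    return tuple(int(part) for part in v.split(".") if part.isdigit())

# Lexicographic string comparison orders these wrongly ('1' < '9'):
assert ("0.10.2" > "0.9.0") is False
# Tuple comparison gets it right:
assert parse_version("0.10.2") > parse_version("0.9.0")
```

In real code, `packaging.version.Version` is the standard way to do this (it also handles suffixes like `.post2`); the tuple parse above only shows why the string compare breaks.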
(refactor): delete webshop async yaml.
(fix): fix webshop state bug.
(feat): log by traj.
(feat): add env_step_limiter for create env.
(feat): env_worker initialize.
(refactor): refine action pattern.
(refactor): refactor agentic modules.
(refactor): refine env_manager.
(refactor): adjust env.
(feat): add step reinforce.
(fix) Fixed the issue where the distill_on_prompt parameter did not work and incorrect logits shape under `megatron_strategy`.
(fix) pass both custom and vllm env vars to RayWorkerWrapper
fix: qwen3next save ckpt
(feat): group size redundancy.
(fix) fix vllm cache root interference
feat(models): add qwen3 next model implementation
(feat): tir qa + search and math + python.
(fix): fix stop_strings type.
(feat): add compute_conversation_end_token_id.
(fix): fix dataset load lock error.
(fix): aggregate_metrics value.
(fix): fix math_env exception.
fix issue that ROLL may hang in colocate mode when running on PPU.
Fix typo
(feat) Dockerfile torch280.
(feat) vllm 0.10.2 (qwen3-next).
(feat): support sglang 052.
(feat): update convert script.
(feat): refine entropy compute.
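The commit above refines entropy computation but does not show its code. As a generic illustration (an assumption, not ROLL's implementation), policy entropy over a token distribution is typically computed from raw logits in a numerically stable way via log-sum-exp, using the identity H = lse(x) − Σ softmax(x)·x:

```python
import math

def entropy_from_logits(logits):
    """Numerically stable entropy of a categorical distribution given raw
    logits. Subtracting the max before exponentiating avoids overflow."""
    m = max(logits)
    lse = m + math.log(sum(math.exp(x - m) for x in logits))  # log-sum-exp
    probs = [math.exp(x - lse) for x in logits]               # softmax
    # H = -sum p*log p, with log p = x - lse  =>  H = lse - sum p*x
    return lse - sum(p * x for p, x in zip(probs, logits))

# Uniform logits over 4 classes give entropy ln(4) ~= 1.3863
print(entropy_from_logits([0.0, 0.0, 0.0, 0.0]))
```

The same identity applies per token position when the logits are tensors; frameworks batch it with `logsumexp` and `softmax` over the vocabulary dimension.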
(feat): roll debug flag for gpu memory metrics.
(fix): add transformers version check.
(feat): update mcore 0.13.
(deprecate): offline torch251/vllm073/sglang043.
(fix): fix include_stop_str_in_output.
(chore): update to pytorch260 and fix norm_mean_type in yaml.
(feat): support sglang 0.4.10.post2.
(feat): add stop string & set env_manager skip_special_tokens=False.
feat: support use_remove_padding for megatron strategy to trim tailin…
(fix): fix adjust_batch.
fix: incorrectly handled dim=None, breaking torch autograd backward pass and causing a hang during token mean.
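The `dim=None` fix above concerns a classic reduction pitfall: in PyTorch, `dim=None` means "reduce over all elements", not "reduce over dim 0", and conflating the two silently changes results on 2-D inputs. A pure-Python sketch of the distinction on nested lists (names and signature are illustrative, not ROLL's API):

```python
def masked_mean(values, mask, dim=None):
    """Mean of `values` where `mask` is 1, over 2-D nested lists.

    dim=None follows the PyTorch convention: reduce over ALL elements.
    dim=0 reduces over rows, returning one mean per column."""
    if dim is None:
        flat_v = [v for row in values for v in row]
        flat_m = [m for row in mask for m in row]
        return sum(v * m for v, m in zip(flat_v, flat_m)) / sum(flat_m)
    if dim == 0:
        n_rows, n_cols = len(values), len(values[0])
        return [
            sum(values[r][c] * mask[r][c] for r in range(n_rows))
            / sum(mask[r][c] for r in range(n_rows))
            for c in range(n_cols)
        ]
    raise ValueError("sketch only supports dim in (None, 0)")

batch, ones = [[1.0, 2.0], [3.0, 4.0]], [[1, 1], [1, 1]]
print(masked_mean(batch, ones))          # 2.5  (mean of all four values)
print(masked_mean(batch, ones, dim=0))   # [2.0, 3.0]  (per-column means)
```

Treating `None` as `0` here would return `[2.0, 3.0]` where a scalar `2.5` is expected, which is exactly the kind of shape mismatch that can break a downstream autograd graph.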
(feat): add env tool wrapper.
(feat): support vllm dynamic fp8.
(feat): lite_ppo add div_std_type.
(fix): set loss_agg_mode to seq-mean-token-mean.
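The `loss_agg_mode` change above distinguishes two standard ways of aggregating per-token losses in RL fine-tuning: pooling all tokens together (long sequences dominate) versus averaging within each sequence first so every sequence weighs equally. A minimal sketch of the two modes (illustrative only; the names match the commit's terminology, not necessarily ROLL's internals):

```python
def aggregate_loss(token_losses, mode="seq-mean-token-mean"):
    """Aggregate per-token losses for a batch of variable-length sequences.

    token_losses: list of lists, one inner list of token losses per sequence.
    "token-mean": mean over all tokens pooled together.
    "seq-mean-token-mean": per-sequence token mean, then mean across sequences."""
    if mode == "token-mean":
        flat = [t for seq in token_losses for t in seq]
        return sum(flat) / len(flat)
    if mode == "seq-mean-token-mean":
        per_seq = [sum(seq) / len(seq) for seq in token_losses]
        return sum(per_seq) / len(per_seq)
    raise ValueError(f"unknown mode: {mode}")

batch = [[1.0, 1.0, 1.0, 1.0], [3.0]]                # one long, one short sequence
print(aggregate_loss(batch, "token-mean"))           # (4*1.0 + 3.0) / 5 = 1.4
print(aggregate_loss(batch, "seq-mean-token-mean"))  # (1.0 + 3.0) / 2 = 2.0
```

The example shows why the mode matters: under `token-mean` the short high-loss sequence is diluted by the long one, while `seq-mean-token-mean` gives both sequences equal weight.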
(fix): clean env.
feat: add sft pipeline.
(chore): set gem version.
[perf]: llm judge reward worker Strategy HF -> vllm.
(refactor): refactor env manager to gEm.
(fix): fix is_use_additional_prompts name for val.
(feat): refine dataset for rlvr_vlm_pipeline.
refactor: add is_lora param to broadcast_parameter method.
fix: convert to hf.
(fix): add is_lora param to broadcast_parameter method.