Past roadmap for reference: #22
Agentic RL: Environment interaction & tool support [P0]
Scaling up RL & system performance [P0]
Usability improvement
Latest Model & Algorithm Support
See https://verl.readthedocs.io/en/latest/advance/fsdp_extension.html for adding models with FSDP backend
See https://verl.readthedocs.io/en/latest/advance/megatron_extension.html for adding models with Megatron backend.
Component Continuous Updates
dataset & benchmark
Please also help provide scripts to reproduce evaluation performance of public released models.
Efficient RL / codesign [P1]
Wide Hardware Coverage
Make the experience on non-nvidia GPUs more smooth
Make verl easier to extend with custom train/infer engine and roles
other community requests
Past roadmap for reference: #22
Agentic RL: Environment interaction & tool support [P0]
better kv cache management and streaming generations (potential inference engine dependency)[sglang] feat: Add SGLang async multi-turn rollout with tool support #1037 [rollout] feat: introduce vLLM AsyncLLM to support multi-turn rollout #1138 will leave kv cache optimization to inference enginesScaling up RL & system performance [P0]
Usability improvement
Latest Model & Algorithm Support
See https://verl.readthedocs.io/en/latest/advance/fsdp_extension.html for adding models with FSDP backend
See https://verl.readthedocs.io/en/latest/advance/megatron_extension.html for adding models with Megatron backend.
and any other popular models.
Component Continuous Updates
dataset & benchmark
Please also help provide scripts to reproduce evaluation performance of public released models.
Efficient RL / codesign [P1]
Wide Hardware Coverage
Make the experience on non-nvidia GPUs more smooth
Make verl easier to extend with custom train/infer engine and roles
DynamicEnumclass to makeDispatchandExecuteextensible. #1424other community requests