From bb054e8c17a0a324424d17c9ccba3df8fd74e433 Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?=E5=87=AA?= <143114165+CSfufu@users.noreply.github.com> Date: Tue, 14 Oct 2025 10:46:32 +0800 Subject: [PATCH] Add ARES and Revisual-R1 two awesome multimodal reasoning work using verl to project list --- README.md | 2 ++ 1 file changed, 2 insertions(+) diff --git a/README.md b/README.md index f41bbedc8aa..1e9edf7b5a1 100644 --- a/README.md +++ b/README.md @@ -238,6 +238,8 @@ verl is inspired by the design of Nemo-Aligner, Deepspeed-chat and OpenRLHF. The - [Vision-SR1](https://github.com/zli12321/Vision-SR1): Self-Rewarding Vision-Language Model via Reasoning Decomposition ![GitHub Repo stars](https://img.shields.io/github/stars/zli12321/Vision-SR1) - [SimpleVLA-RL](https://github.com/PRIME-RL/SimpleVLA-RL): SimpleVLA-RL: A Simple yet Effective Vision-Language Action Model for Reinforcement Learning ![GitHub Repo stars](https://img.shields.io/github/stars/PRIME-RL/SimpleVLA-RL) - [Table-R1](https://github.com/Table-R1/Table-R1): Table-R1: Inference-Time Scaling for Table Reasoning ![GitHub Repo stars](https://img.shields.io/github/stars/Table-R1/Table-R1) +- [Revisual-R1](https://github.com/CSfufu/Revisual-R1): Revisual-R1: Advancing Multimodal Reasoning From Optimized Cold Start to Staged Reinforcement Learning ![GitHub Repo stars](https://img.shields.io/github/stars/CSfufu/Revisual-R1) +- [ARES](https://github.com/shawn0728/ARES): ARES: Multimodal Adaptive Reasoning via Difficulty-Aware Token-Level Entropy Shaping ![GitHub Repo stars](https://img.shields.io/github/stars/shawn0728/ARES) and many more awesome work listed in [recipe](recipe/README.md).