Skip to content

Commit 63ac70c

Browse files
authored
[readme] add fig explain (#64)
1 parent 5382de8 commit 63ac70c

File tree

2 files changed

+6
-0
lines changed

2 files changed

+6
-0
lines changed

README.md

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -83,6 +83,12 @@ Please refer to the example datasets to prepare your own dataset.
8383
- Text dataset: https://huggingface.co/datasets/hiyouga/math12k
8484
- Vision-text dataset: https://huggingface.co/datasets/hiyouga/geometry3k
8585

86+
## How to Understand GRPO in EasyR1
87+
88+
![image](assets/easyr1_grpo.png)
89+
90+
- To learn about the GRPO algorithm, you can refer to [Hugging Face's blog](https://huggingface.co/learn/cookbook/fine_tuning_llm_grpo_trl).
91+
8692
## Other Baselines
8793

8894
- [CLEVR-70k-Counting](examples/run_qwen2_5_vl_3b_clevr.sh): Train the Qwen2.5-VL-3B-Instruct model on counting problem.

assets/easyr1_grpo.png

743 KB
Loading

0 commit comments

Comments
 (0)