3 files changed: +1 -67 lines changed

@@ -13,8 +13,6 @@
     title: Paper Index
   - local: experimental
     title: Experimental
-  - local: how_to_train
-    title: Training FAQ
   title: Conceptual Guides
 - sections:
   - local: clis
This file was deleted.
@@ -91,7 +91,6 @@ trl reward --model_name_or_path Qwen/Qwen2.5-0.5B-Instruct \
 - [SFT Trainer](sft_trainer) - Complete SFT guide
 - [DPO Trainer](dpo_trainer) - Preference alignment
 - [GRPO Trainer](grpo_trainer) - Group relative policy optimization
-- [Training FAQ](how_to_train) - Common questions
 
 ### 🚀 Scale Up
 
@@ -141,4 +140,4 @@ Try adjusting the learning rate:
 training_args = SFTConfig(learning_rate=2e-5)  # Good starting point
 ```
 
-For more help, see our [Training FAQ](how_to_train) or open an [issue on GitHub](https://github.com/huggingface/trl/issues).
+For more help, open an [issue on GitHub](https://github.com/huggingface/trl/issues).