1 parent 820057a commit 4c1597f
examples/trl/README.md
@@ -41,7 +41,7 @@ PT_HPU_MAX_COMPOUND_OP_SIZE=10 PT_HPU_LAZY_MODE=1 python3 grpo.py \
  --num_generations 4 \
  --max_completion_length 64 \
  --use_peft True \
- --lora_target_modules q_proj, k_proj \
+ --lora_target_modules q_proj k_proj \
  --num_train_epochs 1 \
  --save_strategy="epoch"
  ```
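
The change drops the comma between the two module names. A plausible reason, assuming `grpo.py` reads `--lora_target_modules` as a whitespace-separated list (sketched here with plain `argparse` and `nargs="+"`; the real script's parser is not shown in this diff and may differ), is that the shell passes `q_proj,` through with the comma still attached, so the first name no longer matches any module. The parser and the `parse_modules` helper below are hypothetical stand-ins for illustration only.

```python
import argparse
import shlex
from typing import List

# Hypothetical stand-in for the script's argument parser (an assumption,
# not the actual definition in grpo.py).
parser = argparse.ArgumentParser()
parser.add_argument("--lora_target_modules", nargs="+", default=None)


def parse_modules(command_fragment: str) -> List[str]:
    """Split a command fragment the way a POSIX shell would, then parse it."""
    argv = shlex.split(command_fragment)
    return parser.parse_args(argv).lora_target_modules


# Old README line: the comma stays glued to the first token.
before = parse_modules("--lora_target_modules q_proj, k_proj")
# New README line: two clean module names.
after = parse_modules("--lora_target_modules q_proj k_proj")

print(before)  # ['q_proj,', 'k_proj']
print(after)   # ['q_proj', 'k_proj']
```

Under that assumption, the old command would ask for a module literally named `q_proj,`, while the corrected command passes `q_proj` and `k_proj` as intended.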