Commit ece051d
Add support for Baichuan2
Below is an example for baichuan-inc/Baichuan2-7B-Chat:
python3 run_generation.py \
--model_name_or_path baichuan-inc/Baichuan2-7B-Chat \
--bf16 --trim_logits --batch_size 1 \
--max_input_tokens 1024 --max_new_tokens 512 \
--use_kv_cache --use_hpu_graphs --use_flash_attention \
--reuse_cache \
--no-ignore_eos
Below is an example for baichuan-inc/Baichuan2-13B-Chat:
python3 run_generation.py \
--model_name_or_path baichuan-inc/Baichuan2-13B-Chat \
--bf16 --trim_logits --batch_size 1 \
--max_input_tokens 1024 --max_new_tokens 512 \
--use_kv_cache --use_hpu_graphs --bucket_size 256 \
--bucket_internal --reuse_cache \
--no-ignore_eos
Co-authored-by: Jianqian Zhou <[email protected]>
Co-authored-by: Wei Lin <[email protected]>
Signed-off-by: Haihao Xiang <[email protected]>1 parent bfdd05a commit ece051d
2 files changed
Lines changed: 2 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
239 | 239 | | |
240 | 240 | | |
241 | 241 | | |
| 242 | + | |
242 | 243 | | |
243 | 244 | | |
244 | 245 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
106 | 106 | | |
107 | 107 | | |
108 | 108 | | |
| 109 | + | |
109 | 110 | | |
110 | 111 | | |
111 | 112 | | |
| |||
0 commit comments