Skip to content

Commit ece051d

Browse files
xhaihaoZjq9409Wei-Lin-Intel
committed
Add support for Baichuan2
Below is an example for baichuan-inc/Baichuan2-7B-Chat: python3 run_generation.py \ --model_name_or_path baichuan-inc/Baichuan2-7B-Chat \ --bf16 --trim_logits --batch_size 1 \ --max_input_tokens 1024 --max_new_tokens 512 \ --use_kv_cache --use_hpu_graphs --use_flash_attention \ --reuse_cache \ --no-ignore_eos Below is an example for baichuan-inc/Baichuan2-13B-Chat: python3 run_generation.py \ --model_name_or_path baichuan-inc/Baichuan2-13B-Chat \ --bf16 --trim_logits --batch_size 1 \ --max_input_tokens 1024 --max_new_tokens 512 \ --use_kv_cache --use_hpu_graphs --bucket_size 256 \ --bucket_internal --reuse_cache \ --no-ignore_eos Co-authored-by: Jianqian Zhou <[email protected]> Co-authored-by: Wei Lin <[email protected]> Signed-off-by: Haihao Xiang <[email protected]>
1 parent bfdd05a commit ece051d

2 files changed

Lines changed: 2 additions & 0 deletions

File tree

README.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -239,6 +239,7 @@ The following model architectures, tasks and device distributions have been vali
239239
| DETR | | <div style="text-align:left"><li>Single card</li></div> | <li>[object detection](https://github.com/huggingface/optimum-habana/tree/main/examples/object-detection)</li> |
240240
| Mllama | <div style="text-align:left"><li>LoRA</li></div> | :heavy_check_mark: | <li>[image to text](https://github.com/huggingface/optimum-habana/tree/main/examples/image-to-text)</li> |
241241
| MiniCPM3 | | <div style="text-align:left"><li>Single card</li></div> | <li>[text generation](https://github.com/huggingface/optimum-habana/tree/main/examples/text-generation)</li> |
242+
| Baichuan2 | | <div style="text-align:left"><li>Single card</li></div> | <li>[text generation](https://github.com/huggingface/optimum-habana/tree/main/examples/text-generation)</li> |
242243
243244
</div>
244245

docs/source/index.mdx

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -106,6 +106,7 @@ In the tables below, ✅ means single-card, multi-card and DeepSpeed have all be
106106
| DETR | | <div style="text-align:left"><li>Single card</li></div> | <li>[object detection](https://github.com/huggingface/optimum-habana/tree/main/examples/object-detection)</li> |
107107
| Mllama | <div style="text-align:left"><li>LoRA</li></div> || <li>[image to text](https://github.com/huggingface/optimum-habana/tree/main/examples/image-to-text)</li> |
108108
| MiniCPM3 | | <div style="text-align:left"><li>Single card</li></div> | <li>[text generation](https://github.com/huggingface/optimum-habana/tree/main/examples/text-generation)</li> |
109+
| Baichuan2 | | <div style="text-align:left"><li>Single card</li></div> | <li>[text generation](https://github.com/huggingface/optimum-habana/tree/main/examples/text-generation)</li> |
109110

110111
- Diffusers
111112

0 commit comments

Comments
 (0)