Skip to content

Commit 67eb279

Browse files
committed
update README.md for R1:mi300x-20250319
1 parent abdea90 commit 67eb279

1 file changed

Lines changed: 9 additions & 0 deletions

File tree

README.md

Lines changed: 9 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -16,6 +16,15 @@ We compare three solutions that support <ins>Full-Precision Inference (PPL = 0)
1616

1717
## What's New:
1818

19+
- Tutel v0.4.1: Support fused MLA for R1 for AMD MI300x8:
20+
```sh
21+
>> Example:
22+
23+
docker run -it --rm --ipc=host --privileged -p 8000:8000 \
24+
-v /:/host -w /host$(pwd) tutelgroup/deepseek-671b:mi300x8-chat-20250319 \
25+
--model_path ./deepseek-ai/DeepSeek-R1 --prompt "Calculate the result of: 1 / (sqrt(5) - sqrt(3))"
26+
```
27+
1928
- Tutel v0.4.0: Accelerating Deepseek R1 Full-precision-Chat for AMD MI300x8 (more platform support will be added in later versions):
2029
```sh
2130
>> Example:

0 commit comments

Comments
 (0)