We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
1 parent abdea90 commit 67eb279Copy full SHA for 67eb279
1 file changed
README.md
@@ -16,6 +16,15 @@ We compare three solutions that support <ins>Full-Precision Inference (PPL = 0)
16
17
## What's New:
18
19
+- Tutel v0.4.1: Support fused MLA for R1 for AMD MI300x8:
20
+```sh
21
+ >> Example:
22
+
23
+ docker run -it --rm --ipc=host --privileged -p 8000:8000 \
24
+ -v /:/host -w /host$(pwd) tutelgroup/deepseek-671b:mi300x8-chat-20250319 \
25
+ --model_path ./deepseek-ai/DeepSeek-R1 --prompt "Calculate the result of: 1 / (sqrt(5) - sqrt(3))"
26
+```
27
28
- Tutel v0.4.0: Accelerating Deepseek R1 Full-precision-Chat for AMD MI300x8 (more platform support will be added in later versions):
29
```sh
30
>> Example:
0 commit comments