Skip to content

Commit b43edc8

Browse files
authored
[Version] v1.8.2. (#45)
1 parent c9b0789 commit b43edc8

File tree

2 files changed

+11
-1
lines changed

2 files changed

+11
-1
lines changed

CHANGELOG.md

Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,14 @@
11
# CHANGELOG
2+
# [Version v1.8.2](https://github.com/intel/xFasterTransformer/releases/tag/v1.8.2)
3+
v1.8.2
4+
5+
## Performance
6+
- Enable flash attention by default for `W8A8` dtype to accelerate the performance of the 1st token.
7+
8+
## Benchmark
9+
- When the number of ranks is 1, run in single mode to avoid the dependency on `mpirun`.
10+
- Support `SNC-3` platform.
11+
212
# [Version v1.8.1](https://github.com/intel/xFasterTransformer/releases/tag/v1.8.1)
313
v1.8.1
414

VERSION

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
1.8.1
1+
1.8.2

0 commit comments

Comments
 (0)