[Doc] Add the release note for 0.7.3rc1 #285
Do not merge this PR until all required PRs are merged and fully tested.
### Other
- Support MTP (Multi-Token Prediction) for DeepSeek V3/R1 [#236](https://github.com/vllm-project/vllm-ascend/pull/236)
- [Docs] Added more model tutorials, including DeepSeek, QwQ and Qwen. See the [official doc](https://vllm-ascend.readthedocs.io/en/v0.7.3-dev/tutorials/index.html) for details.
[BugFix] Add transfer_to_npu in worker.py to replace hard-code 'cuda' #228
Pin modelscope<1.23.0 on vLLM v0.7.3 to resolve: vllm-project/vllm#13807
[Platform][Model Runner] Add hash of request_ids; Change blocksize back to 128: #294
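The modelscope pin listed above is an upper bound on the installed version. As a rough illustration only, the check below shows what `modelscope<1.23.0` means for plain dotted release versions. The helper names `parse_release` and `satisfies_upper_bound` are hypothetical and not part of vLLM, vllm-ascend, or modelscope; real installers apply the full version-specifier rules, which this sketch does not cover (no pre-release or post-release handling).

```python
def parse_release(version: str, width: int = 3) -> tuple[int, ...]:
    """Parse a plain dotted release such as '1.22.3' into a tuple of ints.

    Short versions are padded with zeros so that '1.23' compares equal to '1.23.0'.
    Assumption: the input contains only digits and dots.
    """
    parts = [int(part) for part in version.split(".")]
    parts += [0] * (width - len(parts))
    return tuple(parts)


def satisfies_upper_bound(installed: str, upper_bound: str = "1.23.0") -> bool:
    """Return True when `installed` is strictly below `upper_bound`."""
    return parse_release(installed) < parse_release(upper_bound)


if __name__ == "__main__":
    for candidate in ("1.22.3", "1.23.0", "1.23"):
        print(candidate, satisfies_upper_bound(candidate))
```

In a requirements file the same constraint would be written as the single line `modelscope<1.23.0`.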
The transfer_to_npu and blocksize changes are not clearly user-facing, IMO. The others look fine.
Signed-off-by: wangxiyuan <[email protected]>
Add the release note for 0.7.3rc1 Signed-off-by: wangxiyuan <[email protected]>
Add the release note for 0.7.3rc1 Signed-off-by: wangxiyuan <[email protected]> Signed-off-by: angazenn <[email protected]>