-
-
Notifications
You must be signed in to change notification settings - Fork 11.7k
[Model] Pipeline Parallel Support for DeepSeek v2 #6519
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
👋 Hi! Thank you for contributing to the vLLM project. Full CI run is still required to merge this PR so once the PR is ready to go, please make sure to run it. If you need all test signals in between PR commits, you can trigger full CI as well. To run full CI, you can do one of these:
🚀 |
|
can you test the correctness locally, using https://github.com/vllm-project/vllm/blob/main/tests/distributed/test_pipeline_parallel.py ? |
Sure. I edited the file to set the model to |
568c1d9 to
2522798
Compare
|
Rebased to resolve conflict from |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this lambda function will have prefix= shortly after #6515 .
2522798 to
c83350f
Compare
Signed-off-by: Travis Johnson <[email protected]>
Signed-off-by: Travis Johnson <[email protected]>
Signed-off-by: Travis Johnson <[email protected]>
c83350f to
f22cb28
Compare
Signed-off-by: Travis Johnson <[email protected]>
youkaichao
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
thanks for addressing my comments! please test the correctness locally.
I ran the updated |
|
Thanks, that might be caused by the flakiness of pp tests. I'll merge as this PR looks good to me now. Thanks for your contribution! |
Signed-off-by: Travis Johnson <[email protected]>
Signed-off-by: Travis Johnson <[email protected]>
Signed-off-by: Travis Johnson <[email protected]> Signed-off-by: Alvant <[email protected]>
Signed-off-by: Travis Johnson <[email protected]> Signed-off-by: LeiWang1999 <[email protected]>
Adds pipeline parallel support for DeepSeek v2.
Tested with https://huggingface.co/deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct using
--tensor-parallel-size 1 --pipeline-parallel-size 2