[model][refactor] remove cuda hard code in models and layers #13658

MengqingCao · 2025-02-21T08:49:41Z

This pr removes cuda hard code in models and layers, so that they could be easily used by the other devices.

github-actions · 2025-02-21T08:49:54Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

Signed-off-by: Mengqing Cao <[email protected]>

jeejeelee

Thanks for this improvement, LGTM

Signed-off-by: Mengqing Cao <[email protected]>

MengqingCao · 2025-02-24T10:52:03Z

It seems the inference accuracy is breaked by some commit before this pr. I'm looking into it now.

MengqingCao · 2025-02-24T12:30:32Z

It seems a little far to find which pr introduced the accuracy problem.
I have rebase the code to 9cdea30, but the acc problem still exists.

…oject#13658)

…oject#13658) Signed-off-by: Louis Ulmer <[email protected]>

…oject#13658)

mergify bot added the speculative-decoding label Feb 21, 2025

wangxiyuan mentioned this pull request Feb 22, 2025

DeepSeek-R1 on 0.7.1-dev with Torch not compiled with CUDA enabled vllm-project/vllm-ascend#122

Closed

Yikun mentioned this pull request Feb 22, 2025

[New Model]: DeepSeek V3 / R1 vllm-project/vllm-ascend#72

Closed

[model][refactor] remove cuda hard code in models and layers

949f6b8

Signed-off-by: Mengqing Cao <[email protected]>

MengqingCao force-pushed the hard_code branch from 94fe00f to 949f6b8 Compare February 22, 2025 10:30

jeejeelee approved these changes Feb 23, 2025

View reviewed changes

code format

5cf5bf0

Signed-off-by: Mengqing Cao <[email protected]>

jeejeelee added the ready ONLY add when PR is ready to merge/full CI is needed label Feb 24, 2025

fix

8e6a25e

Signed-off-by: Mengqing Cao <[email protected]>

simon-mo merged commit 23eca9c into vllm-project:main Feb 24, 2025
46 of 48 checks passed

MengqingCao deleted the hard_code branch February 25, 2025 08:43

noemotiovon mentioned this pull request Feb 28, 2025

[Ray]Ray Patch vllm-project/vllm-ascend#92

Closed

Akshat-Tripathi pushed a commit to krai/vllm that referenced this pull request Mar 3, 2025

[model][refactor] remove cuda hard code in models and layers (vllm-pr…

72f1743

…oject#13658)

lulmer pushed a commit to lulmer/vllm that referenced this pull request Apr 7, 2025

[model][refactor] remove cuda hard code in models and layers (vllm-pr…

0f1e7e5

…oject#13658) Signed-off-by: Louis Ulmer <[email protected]>

ckhordiasma mentioned this pull request Apr 17, 2025

[do not merge] pr test for nm changes into 2.20 red-hat-data-services/vllm#107

Closed

shreyankg pushed a commit to shreyankg/vllm that referenced this pull request May 3, 2025

[model][refactor] remove cuda hard code in models and layers (vllm-pr…

bfd6f7d

…oject#13658)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[model][refactor] remove cuda hard code in models and layers #13658

[model][refactor] remove cuda hard code in models and layers #13658

Uh oh!

MengqingCao commented Feb 21, 2025

Uh oh!

github-actions bot commented Feb 21, 2025

Uh oh!

jeejeelee left a comment

Uh oh!

MengqingCao commented Feb 24, 2025

Uh oh!

MengqingCao commented Feb 24, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

[model][refactor] remove cuda hard code in models and layers #13658

[model][refactor] remove cuda hard code in models and layers #13658

Uh oh!

Conversation

MengqingCao commented Feb 21, 2025

Uh oh!

github-actions bot commented Feb 21, 2025

Uh oh!

jeejeelee left a comment

Choose a reason for hiding this comment

Uh oh!

MengqingCao commented Feb 24, 2025

Uh oh!

MengqingCao commented Feb 24, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants