[Bugfix] Specify device when loading LoRA and embedding tensors #7129

jischein · 2024-08-04T20:00:06Z

[Bugfix] Assign device when loading LoRA modules from file

Fixes #3374. Note — existing PR to address this issue got stale; so bumping with some light updates.

This PR addresses the issue of CUDA device mismatch when loading LoRA modules from files. It fixes the Attempting to deserialize object on CUDA device X but torch.cuda.device_count() is Y error by explicitly specifying the device during tensor loading.

Changes:

Add map_location="device" when loading LoRA tensors from .bin files
Add map_location="device" when loading new embeddings from .bin files

- Add map_location="device" when loading LoRA tensors from .bin files - Add map_location="device" when loading new embeddings from .bin files

github-actions · 2024-08-04T20:00:21Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which consists a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of default ones by unblocking the steps in your fast-check build on Buildkite UI.

Once the PR is approved and ready to go, please make sure to run full CI as it is required to merge (or just use auto-merge).

To run full CI, you can do one of these:

Comment /ready on the PR
Add ready label to the PR
Enable auto-merge.

🚀

jischein · 2024-08-05T01:23:05Z

friendly bump! @youkaichao (believe you had reviewed the previous push to fix this)

youkaichao · 2024-08-05T01:40:10Z

vllm/lora/models.py

                    f" but received {unexpected_modules}."
                    f" Please verify that the loaded LoRA module is correct")
-            tensors = torch.load(lora_bin_file_path)
+            tensors = torch.load(lora_bin_file_path, map_location="device")


did you test it? I don't think "device" works. this is a string.

youkaichao

thanks for the contribution!

…-project#7129) Co-authored-by: Jacob Schein <[email protected]> Signed-off-by: Alvant <[email protected]>

…-project#7129) Co-authored-by: Jacob Schein <[email protected]> Signed-off-by: LeiWang1999 <[email protected]>

fix: Specify device when loading LoRA and embedding tensors

e78542f

- Add map_location="device" when loading LoRA tensors from .bin files - Add map_location="device" when loading new embeddings from .bin files

jischein changed the title ~~fix: Specify device when loading LoRA and embedding tensors~~ [Bugfix]: Specify device when loading LoRA and embedding tensors Aug 4, 2024

jischein changed the title ~~[Bugfix]: Specify device when loading LoRA and embedding tensors~~ [Bugfix] Specify device when loading LoRA and embedding tensors Aug 4, 2024

github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Aug 4, 2024

youkaichao reviewed Aug 5, 2024

View reviewed changes

youkaichao removed the ready ONLY add when PR is ready to merge/full CI is needed label Aug 5, 2024

Use device variable

274c3a2

youkaichao approved these changes Aug 5, 2024

View reviewed changes

youkaichao merged commit 89b8db6 into vllm-project:main Aug 5, 2024

youkaichao mentioned this pull request Aug 5, 2024

bug-fix: assign device when loading lora modules from file #3375

Closed

Alvant pushed a commit to compressa-ai/vllm that referenced this pull request Oct 26, 2024

[Bugfix] Specify device when loading LoRA and embedding tensors (vllm…

05702d5

…-project#7129) Co-authored-by: Jacob Schein <[email protected]> Signed-off-by: Alvant <[email protected]>

DarkLight1337 mentioned this pull request Nov 9, 2024

[Bugfix] fix the bug for lora request #5739

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Bugfix] Specify device when loading LoRA and embedding tensors #7129

[Bugfix] Specify device when loading LoRA and embedding tensors #7129

Uh oh!

jischein commented Aug 4, 2024

Uh oh!

github-actions bot commented Aug 4, 2024

Uh oh!

jischein commented Aug 5, 2024

Uh oh!

youkaichao Aug 5, 2024

Uh oh!

youkaichao left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

[Bugfix] Specify device when loading LoRA and embedding tensors #7129

[Bugfix] Specify device when loading LoRA and embedding tensors #7129

Uh oh!

Conversation

jischein commented Aug 4, 2024

Uh oh!

github-actions bot commented Aug 4, 2024

Uh oh!

jischein commented Aug 5, 2024

Uh oh!

youkaichao Aug 5, 2024

Choose a reason for hiding this comment

Uh oh!

youkaichao left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants