
Conversation

@pcmoritz (Collaborator) commented Feb 14, 2024

This aligns the LoRA code between Mistral and Mixtral and fixes #2875 as a result.
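For context, the failure in #2875 was an AttributeError because MixtralModel never set org_vocab_size, which the LoRA-aware vocabulary handling expects. The snippet below is a minimal sketch of the pattern being aligned on, modeled after how the Mistral/Llama models in vLLM computed their vocab sizes at the time; the class and field names here are simplified placeholders, not the actual vLLM implementation.

```python
# Minimal sketch (assumed names, not the actual vLLM code): the LoRA-aware
# vocab bookkeeping that Mistral already did and that this PR mirrors in Mixtral.
from dataclasses import dataclass
from typing import Optional


@dataclass
class LoRAConfig:
    lora_extra_vocab_size: int = 256
    max_loras: int = 1


class MixtralModelSketch:
    def __init__(self, vocab_size: int, lora_config: Optional[LoRAConfig] = None):
        # Reserve extra embedding rows for LoRA-added tokens when LoRA is enabled.
        lora_vocab = (lora_config.lora_extra_vocab_size * (lora_config.max_loras or 1)
                      if lora_config else 0)
        self.vocab_size = vocab_size + lora_vocab
        # The attribute whose absence triggered the AttributeError in #2875:
        # downstream code reads org_vocab_size to know the size of the base
        # (non-LoRA) vocabulary.
        self.org_vocab_size = vocab_size


if __name__ == "__main__":
    model = MixtralModelSketch(vocab_size=32000, lora_config=LoRAConfig())
    print(model.vocab_size, model.org_vocab_size)  # 32256 32000
```

With org_vocab_size recorded alongside the padded vocab_size, downstream components can distinguish base-model tokens from LoRA-added ones, which is what the Mistral path already did.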

@pcmoritz (Collaborator, Author) commented

This would actually have been caught by export CUDA_VISIBLE_DEVICES=0,1,2,3 ; pytest -v -s lora/test_mixtral.py, but unfortunately that test is currently skipped in CI because there are not enough GPUs available. I ran it manually to verify the fix :)

@Yard1 (Collaborator) left a comment

LGTM, thanks!

@Yard1 merged commit 31348df into vllm-project:main Feb 15, 2024
@WoosukKwon mentioned this pull request Feb 15, 2024
xjpang pushed a commit to xjpang/vllm that referenced this pull request Feb 20, 2024
…vllm-project#2880)

* Fix AttributeError: MixtralModel object has no attribute org_vocab_size.

* Make LoRA logic for Mistral and Mixtral the same

---------

Co-authored-by: Pernekhan Utemuratov <[email protected]>
xjpang pushed a commit to xjpang/vllm that referenced this pull request Feb 22, 2024
xjpang pushed a commit to xjpang/vllm that referenced this pull request Mar 4, 2024