-
-
Notifications
You must be signed in to change notification settings - Fork 11.8k
[CI Failure] Fix Gemma3 RoPE configuration for sliding attention layers #29111
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the fix. If we need default rope type and to remove the factor could we just create a new dict for sliding layers? This would allow us to remove the copy in the line above because we would no longer be modifying it in place
Signed-off-by: Huamin Li <[email protected]>
c5a6b15 to
074c4f9
Compare
|
Thanks @hmellor ! Love your suggestion and I think it is more clean! Please take another look! |
hmellor
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM!
Signed-off-by: Harry Mellor <[email protected]>
|
Failing Plamo3 test is failing on main and should be fixed by #29092 |
…rs (vllm-project#29111) Signed-off-by: Huamin Li <[email protected]> Signed-off-by: Harry Mellor <[email protected]> Co-authored-by: Harry Mellor <[email protected]> Co-authored-by: Cyrus Leung <[email protected]>
…rs (vllm-project#29111) Signed-off-by: Huamin Li <[email protected]> Signed-off-by: Harry Mellor <[email protected]> Co-authored-by: Harry Mellor <[email protected]> Co-authored-by: Cyrus Leung <[email protected]> Signed-off-by: PatchouliTaisa <[email protected]>
…rs (vllm-project#29111) Signed-off-by: Huamin Li <[email protected]> Signed-off-by: Harry Mellor <[email protected]> Co-authored-by: Harry Mellor <[email protected]> Co-authored-by: Cyrus Leung <[email protected]>
…rs (vllm-project#29111) Signed-off-by: Huamin Li <[email protected]> Signed-off-by: Harry Mellor <[email protected]> Co-authored-by: Harry Mellor <[email protected]> Co-authored-by: Cyrus Leung <[email protected]> Signed-off-by: Runkai Tao <[email protected]>
…rs (vllm-project#29111) Signed-off-by: Huamin Li <[email protected]> Signed-off-by: Harry Mellor <[email protected]> Co-authored-by: Harry Mellor <[email protected]> Co-authored-by: Cyrus Leung <[email protected]>
…rs (vllm-project#29111) Signed-off-by: Huamin Li <[email protected]> Signed-off-by: Harry Mellor <[email protected]> Co-authored-by: Harry Mellor <[email protected]> Co-authored-by: Cyrus Leung <[email protected]>
…rs (vllm-project#29111) Signed-off-by: Huamin Li <[email protected]> Signed-off-by: Harry Mellor <[email protected]> Co-authored-by: Harry Mellor <[email protected]> Co-authored-by: Cyrus Leung <[email protected]>
…rs (vllm-project#29111) Signed-off-by: Huamin Li <[email protected]> Signed-off-by: Harry Mellor <[email protected]> Co-authored-by: Harry Mellor <[email protected]> Co-authored-by: Cyrus Leung <[email protected]> Signed-off-by: Xingyu Liu <[email protected]>
…rs (vllm-project#29111) Signed-off-by: Huamin Li <[email protected]> Signed-off-by: Harry Mellor <[email protected]> Co-authored-by: Harry Mellor <[email protected]> Co-authored-by: Cyrus Leung <[email protected]>
Purpose
#28542 introduced a regression in Gemma3 model that caused incorrect perplexity scores. The test test_gemma.py::test_ppl[model_info2] failed https://buildkite.com/vllm/ci/builds/39860/steps/table?jid=019aa011-0d25-4a55-9e34-b7caef1ee9df . Per investigation, we should use default RoPE with rope_local_base_freq (10000.0) and NO scaling
if self.is_slidingThis PR changes
if self.is_slidingbranch to properly configure sliding attention layers.Test Plan
Test Result
Essential Elements of an Effective PR Description Checklist
supported_models.mdandexamplesfor a new model.