
Conversation


@hmellor hmellor commented Nov 12, 2025

In Transformers v5:

  • rope_scaling is now called rope_parameters
  • rope_theta now lives inside rope_parameters
  • rope_parameters may be nested for models which have different RoPE parameters for each layer type (e.g. Gemma & ModernBERT), as illustrated below
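Roughly what the shapes look like (values are made up, and the exact keys beyond rope_type and rope_theta vary by scaling method; the layer-type keys are those used by models like Gemma 3):

```python
from types import SimpleNamespace

config = SimpleNamespace()

# Transformers v4 style: theta and scaling are separate attributes.
config.rope_theta = 10000.0
config.rope_scaling = {"rope_type": "linear", "factor": 2.0}

# Transformers v5 style: everything lives in rope_parameters.
config.rope_parameters = {
    "rope_type": "linear",
    "factor": 2.0,
    "rope_theta": 10000.0,
}

# Nested per layer type, e.g. separate full- and sliding-attention settings.
config.rope_parameters = {
    "full_attention": {"rope_type": "default", "rope_theta": 1_000_000.0},
    "sliding_attention": {"rope_type": "default", "rope_theta": 10_000.0},
}
```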

This PR adds forward compatibility for the Transformers v5 RoPE config by:

  • Moving any found config.rope_scaling to config.rope_parameters
  • Moving any found config.rope_theta to config.rope_parameters.rope_theta
  • Running patch_rope_parameters on all nested configs, if present
  • Running patch_rope_parameters_dict on all nested RoPE parameters, if present
  • Globally renaming rope_scaling to rope_parameters
  • get_rope:
    • Removing base as an argument, because it no longer needs to be passed separately
    • If rope_parameters is None, defaulting to a RoPE base of 10000, which seems to be a universal default
  • Setting the theta explicitly via the set_default_rope_theta helper for any models that do not use this 10000 default (a sketch of the patching logic follows this list)
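A minimal sketch of what the patching amounts to. The helper name mirrors the one above, but this is an illustration under stated assumptions, not the actual vLLM implementation:

```python
from types import SimpleNamespace

DEFAULT_ROPE_THETA = 10000.0  # assumed universal default when nothing is set


def patch_rope_parameters(config) -> None:
    """Normalize a pre-v5 RoPE config in place (illustrative sketch only)."""
    # rope_scaling -> rope_parameters
    rope_scaling = getattr(config, "rope_scaling", None)
    if rope_scaling is not None:
        config.rope_parameters = rope_scaling
    # Fold a top-level rope_theta into rope_parameters.
    rope_theta = getattr(config, "rope_theta", None)
    if rope_theta is not None:
        if getattr(config, "rope_parameters", None) is None:
            config.rope_parameters = {"rope_type": "default"}
        config.rope_parameters.setdefault("rope_theta", rope_theta)


def get_rope_theta(config) -> float:
    """How get_rope can now recover the base without a separate argument."""
    rope_parameters = getattr(config, "rope_parameters", None)
    if rope_parameters is None:
        return DEFAULT_ROPE_THETA
    return rope_parameters.get("rope_theta", DEFAULT_ROPE_THETA)


# Usage on a v4-style config:
config = SimpleNamespace(rope_theta=500000.0,
                         rope_scaling={"rope_type": "linear", "factor": 2.0})
patch_rope_parameters(config)
assert get_rope_theta(config) == 500000.0
```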

Note: the errors triggered by disable_sliding_window when used with RoPE-scaled models have been removed. Removing disable_sliding_window entirely has been left as a follow-up task, since it is no longer relevant.


mergify bot commented Nov 12, 2025

Documentation preview: https://vllm--28542.org.readthedocs.build/en/28542/

@mergify mergify bot added documentation Improvements or additions to documentation llama Related to Llama models performance Performance-related issues qwen Related to Qwen models gpt-oss Related to GPT-OSS models speculative-decoding labels Nov 12, 2025
@mergify mergify bot added the ci/build label Nov 13, 2025
Signed-off-by: Harry Mellor <[email protected]>
@vllm-bot vllm-bot merged commit a8b7030 into vllm-project:main Nov 19, 2025
55 of 57 checks passed
@github-project-automation github-project-automation bot moved this from To Triage to Done in gpt-oss Issues & Enhancements Nov 19, 2025
@hmellor hmellor deleted the update-rope-config branch November 19, 2025 18:32
Victor49152 pushed a commit to Victor49152/vllm that referenced this pull request Nov 20, 2025
LuminolT pushed a commit to LuminolT/vllm that referenced this pull request Nov 21, 2025
@juliendenize juliendenize mentioned this pull request Nov 21, 2025
5 tasks
bigPYJ1151 pushed a commit that referenced this pull request Nov 25, 2025
bringlein pushed a commit to bringlein/vllm that referenced this pull request Nov 26, 2025
devpatelio pushed a commit to SumanthRH/vllm that referenced this pull request Nov 29, 2025
kitaekatt pushed a commit to kitaekatt/vllm that referenced this pull request Dec 1, 2025
wangxiyuan added a commit to vllm-project/vllm-ascend that referenced this pull request Dec 2, 2025
1. fix vllm-project/vllm#28542
   The model structures we modified are:
     - Qwen2.5-VL (some patches still remain)
     - Qwen2-VL
     - Qwen2
     - DeepSeek series
     - Qwen-moe series
2. fix vllm-project/vllm#29121
   The output token type changed from a NumPy array to `list[list[int]]` (see the sketch after this list).
3. fix vllm-project/vllm#29262
   The `xformers` backend for multimodal has been deprecated.
4. fix vllm-project/vllm#29342
5. fix vllm-project/vllm#28579
6. fix vllm-project/vllm#28718
7. fix vllm-project/vllm#28665
8. fix vllm-project/vllm#26847
   vLLM introduced the `optimization-level`; some default config values changed, and the `--enforce-eager` param has been deprecated.
9. fix vllm-project/vllm#29223
   It now returns a tuple for the sampler.
10. fix vllm-project/vllm#29471
    We'll remove the related patch to avoid this kind of error.
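For item 2, a rough sketch of the kind of compatibility shim such a type change implies (names are illustrative, not the actual vllm-ascend code):

```python
import numpy as np


def normalize_output_tokens(output_tokens) -> list[list[int]]:
    """Accept the old NumPy-array output or the new list[list[int]]
    output and always return the new form (illustrative only)."""
    if isinstance(output_tokens, np.ndarray):
        return output_tokens.tolist()
    return output_tokens
```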

- vLLM version: v0.11.2

Signed-off-by: wangxiyuan <[email protected]>
Signed-off-by: wangli <[email protected]>
Signed-off-by: hfadzxy <[email protected]>
Co-authored-by: wangli <[email protected]>
Co-authored-by: hfadzxy <[email protected]>
ChenCangtao pushed a commit to ChenCangtao/vllm-ascend that referenced this pull request Dec 3, 2025
Mercykid-bash pushed a commit to Mercykid-bash/vllm-ascend that referenced this pull request Dec 4, 2025
charlotte12l pushed a commit to charlotte12l/vllm that referenced this pull request Dec 5, 2025
Meihan-chen pushed a commit to Meihan-chen/vllm-ascend that referenced this pull request Dec 5, 2025
Zhathw pushed a commit to Zhathw/vllm that referenced this pull request Dec 6, 2025

Labels

  • ci/build
  • deepseek (Related to DeepSeek models)
  • documentation (Improvements or additions to documentation)
  • gpt-oss (Related to GPT-OSS models)
  • llama (Related to Llama models)
  • performance (Performance-related issues)
  • qwen (Related to Qwen models)
  • ready (ONLY add when PR is ready to merge/full CI is needed)
  • speculative-decoding

Projects

Status: Done
