Rename clashing method names for vLLM model protocol #27583

hmellor · 2025-10-27T15:09:16Z

get_input_embeddings is a getter method of all transformers.PreTrainedModels.

This name had been reused in vLLM to call forward on the model's embedding layer.

Since the method in vLLM is not actually a getter and causes a confusing clash with the getter in Transformers, this PR:

Renames get_input_embeddings to embed_input_ids
Renames get_multimodal_embeddings to embed_multimodal for consistency
Adds fallbacks and deprecation warnings to VllmModel and SupportsMultiModal so that third-party vLLM models should still work
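
To make the clash concrete, here is a minimal sketch of the two meanings of the name. It uses plain-Python stand-ins rather than real Transformers or vLLM classes: `ToyEmbedding`, `ToyModel`, and `embed_tokens` are hypothetical names chosen for illustration, and only the method names `get_input_embeddings` and `embed_input_ids` come from the PR description.

```python
class ToyEmbedding:
    """Hypothetical stand-in for an embedding layer (not torch.nn.Embedding)."""

    def __init__(self, table):
        self.table = table

    def __call__(self, input_ids):
        # "Forward pass": look up one vector per token id.
        return [self.table[i] for i in input_ids]


class ToyModel:
    """Hypothetical model exposing both method styles side by side."""

    def __init__(self):
        self.embed_tokens = ToyEmbedding({0: [0.0, 0.0], 1: [0.1, 0.2], 2: [0.3, 0.4]})

    def get_input_embeddings(self):
        # Transformers-style getter: takes no input and returns the layer itself.
        return self.embed_tokens

    def embed_input_ids(self, input_ids):
        # vLLM-style method after the rename: runs the layer on token ids.
        return self.embed_tokens(input_ids)


model = ToyModel()
layer = model.get_input_embeddings()     # a layer object
vectors = model.embed_input_ids([1, 2])  # embeddings for the given ids
print(type(layer).__name__)
print(vectors)
```

With distinct names, a reader can tell from the call site whether a layer or a batch of embeddings comes back.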

Signed-off-by: Harry Mellor <[email protected]>

mergify · 2025-10-27T15:09:51Z

Documentation preview: https://vllm--27583.org.readthedocs.build/en/27583/

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.


chatgpt-codex-connector · 2025-10-27T15:13:38Z

vllm/model_executor/models/interfaces_base.py

+def _check_vllm_model_embed_input_ids(model: type[object] | object) -> bool:
+    model_embed_input_ids = getattr(model, "embed_input_ids", None)
+    if not callable(model_embed_input_ids):
+        model_get_input_embeddings = getattr(model, "get_input_embeddings", None)
+        if callable(model_get_input_embeddings):
+            logger.warning(
+                "`get_input_embeddings` for vLLM models is deprecated and will be "
+                "removed in v0.13.0 or v1.0.0, whichever is earlier. Please rename "
+                "this method to `embed_input_ids`."
+            )
+            model.embed_input_ids = model_get_input_embeddings
        logger.warning(
-            "The model (%s) is missing the `get_input_embeddings` method.",
+            "The model (%s) is missing the `embed_input_ids` method.",
            model,
        )
        return False


Allow deprecated get_input_embeddings to satisfy VllmModel check

The new _check_vllm_model_embed_input_ids always returns False when a model still exposes the deprecated get_input_embeddings method, even after binding it to embed_input_ids. As a result, is_vllm_model will reject any third‑party models that follow the old interface, contradicting the intended deprecation fallback and potentially breaking out‑of‑tree integrations until they rename their method. The check should return True when the legacy method exists and is wired up, otherwise the compatibility path never actually works.
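
A minimal sketch of the behaviour this comment asks for, assuming a simplified check that is independent of vLLM. The function name `check_embed_input_ids`, the logger name, and the `LegacyModel` and `BrokenModel` classes are hypothetical; this is not the code that was merged.

```python
import logging

logger = logging.getLogger("toy_vllm_protocol")  # hypothetical logger name


def check_embed_input_ids(model) -> bool:
    """Return True if `model` has `embed_input_ids` or a legacy method to alias."""
    if callable(getattr(model, "embed_input_ids", None)):
        return True

    legacy = getattr(model, "get_input_embeddings", None)
    if callable(legacy):
        logger.warning(
            "`get_input_embeddings` is deprecated; rename it to `embed_input_ids`."
        )
        # Wire the legacy method up under the new name, then accept the model.
        model.embed_input_ids = legacy
        return True

    logger.warning("The model (%s) is missing the `embed_input_ids` method.", model)
    return False


class LegacyModel:
    """Hypothetical third-party model that still uses the old method name."""

    def get_input_embeddings(self, input_ids):
        return [[float(i)] for i in input_ids]


class BrokenModel:
    """Hypothetical model with neither method."""


legacy_model = LegacyModel()
legacy_ok = check_embed_input_ids(legacy_model)     # accepted via the fallback
broken_ok = check_embed_input_ids(BrokenModel())    # rejected
```

The early `return True` in the legacy branch is the point of the suggestion: without it, control falls through to the "missing method" warning and the fallback never takes effect.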


NickLucche

looks reasonable

hmellor · 2025-10-28T17:04:45Z

As per discussion with @ywang96, I'll pause work on this until a new model that is currently being added has landed, so as not to disrupt it.

Signed-off-by: Harry Mellor <[email protected]>

DarkLight1337

LGTM if tests pass

hmellor · 2025-11-12T16:38:55Z

The failing extended pooling test also failed in last night's nightly: https://buildkite.com/vllm/ci/builds/38575/steps/canvas?sid=019a7670-4e2e-49b6-8137-1777aa2125f5

hmellor · 2025-11-12T19:45:20Z

The failing mm extended 1 test also failed in last night's nightly: https://buildkite.com/vllm/ci/builds/38575/steps/canvas?sid=019a7670-4e31-45f4-867d-a91104a9c070

The failing mm extended 3 test also failed in last night's nightly: https://buildkite.com/vllm/ci/builds/38575/steps/canvas?sid=019a7670-4e32-4856-89cf-83eaf9f00b2c


Bump vLLM version to v0.11.2

What's broken and changed by vLLM:
1. structured_output is broken by vllm-project/vllm#26866
2. get_mrope_input_positions is broken by vllm-project/vllm#28399
3. graph mode is broken by vllm-project/vllm#25110; we'll upgrade torch to 2.8 to fix the problem later
4. embedding is broken by vllm-project/vllm#27583
5. `get_attn_backend_cls` and the attention backend are broken by vllm-project/vllm#28534
6. spec decode is broken by vllm-project/vllm#28771
7. sp feature is broken by vllm-project/vllm#27126
8. mtp is broken by vllm-project/vllm#27922
9. lora is broken by vllm-project/vllm#21068
10. execute_model is broken by vllm-project/vllm#26866
11. `VLLM_DISABLE_SHARED_EXPERTS_STREAM` env is broken by vllm-project/vllm#28159
12. kv cache is broken by vllm-project/vllm#27753
13. dp is broken by vllm-project/vllm#25110

What's broken and changed by ourselves:
1. qwen vl is broken by vllm-project/vllm#28455. We'll remove model files in the future to avoid this kind of error.
2. Engine core is broken by vllm-project/vllm#23691. We'll remove the patch file in the future.
3. Ascend scheduler is broken by vllm-project/vllm#28733. We'll remove the Ascend scheduler later.
4. qwen3-next is broken by vllm-project/vllm#28083. We'll remove model files in the future to avoid this kind of error.
5. qwen vl is broken by vllm-project/vllm#27764. We'll remove model files in the future.

Known issues:
1. ray doesn't work
2. the accuracy of qwen3-next is not correct
3. qwen3-vl is broken
4. prefix cache + ascend scheduler + deepseek v2 lite is broken

- vLLM version: v0.11.2

Signed-off-by: wangxiyuan <[email protected]>
Signed-off-by: MengqingCao <[email protected]>
Signed-off-by: hfadzxy <[email protected]>
Signed-off-by: leo-pony <[email protected]>
Co-authored-by: MengqingCao <[email protected]>
Co-authored-by: hfadzxy <[email protected]>
Co-authored-by: leo-pony <[email protected]>
Co-authored-by: 22dimensions <[email protected]>
Co-authored-by: shen-shanshan <[email protected]>


Rename clashing method names for vLLM model protocol

421a296

Signed-off-by: Harry Mellor <[email protected]>

hmellor marked this pull request as ready for review October 27, 2025 15:09

hmellor requested review from DarkLight1337, NickLucche, benchislett, luccafong, patrickvonplaten, sighingnow and ywang96 as code owners October 27, 2025 15:09

mergify bot added the documentation, deepseek, llama, multi-modality, qwen, gpt-oss, and speculative-decoding labels Oct 27, 2025

github-project-automation bot added this to gpt-oss Issues & Enhancements Oct 27, 2025

mergify bot added the v1 label Oct 27, 2025

github-project-automation bot moved this to To Triage in gpt-oss Issues & Enhancements Oct 27, 2025

mergify bot added the tpu label Oct 27, 2025

chatgpt-codex-connector bot reviewed Oct 27, 2025

NickLucche reviewed Oct 27, 2025

Merge branch 'main' into rename-reused-func-names

8177c84

Signed-off-by: Harry Mellor <[email protected]>

hmellor requested a review from tjtanaa as a code owner November 12, 2025 14:32

Update new references to the method name we're changing

6924a2e

Signed-off-by: Harry Mellor <[email protected]>

DarkLight1337 approved these changes Nov 12, 2025

github-project-automation bot moved this from To Triage to Ready in gpt-oss Issues & Enhancements Nov 12, 2025

DarkLight1337 added the ready label Nov 12, 2025

vllm-bot merged commit 97d1c99 into vllm-project:main Nov 13, 2025
59 of 63 checks passed

github-project-automation bot moved this from Ready to Done in gpt-oss Issues & Enhancements Nov 13, 2025

hmellor deleted the rename-reused-func-names branch November 13, 2025 08:53

DarkLight1337 mentioned this pull request Nov 14, 2025

[Model] Add Afmoe architecture implementation #28332

Merged

5 tasks

geodavic pushed a commit to geodavic/vllm that referenced this pull request Nov 16, 2025

Rename clashing method names for vLLM model protocol (vllm-project#27583)

9820534

Signed-off-by: Harry Mellor <[email protected]>
Signed-off-by: George D. Torres <[email protected]>

bwasti pushed a commit to bwasti/vllm that referenced this pull request Nov 17, 2025

Rename clashing method names for vLLM model protocol (vllm-project#27583)

148cbb5

Signed-off-by: Harry Mellor <[email protected]>
Signed-off-by: Bram Wasti <[email protected]>

wangxiyuan mentioned this pull request Nov 25, 2025

upgrade to vllm 0.11.2 vllm-project/vllm-ascend#4400

Merged

devpatelio pushed a commit to SumanthRH/vllm that referenced this pull request Nov 29, 2025

Rename clashing method names for vLLM model protocol (vllm-project#27583)

fac5d05

Signed-off-by: Harry Mellor <[email protected]>

4 participants
