Simplify weight loading in Transformers backend #21382
Conversation
Signed-off-by: Harry Mellor <[email protected]>
Code Review
This pull request simplifies the Transformers backend by removing the `nn.Module` inheritance from `TransformersModel` and streamlining the weight mappers. However, the weight mapper in `TransformersForMultimodalLM` has a critical issue that will cause weight loading to fail for some models. The review includes a fix for this issue, as well as suggestions for code simplification and readability.
```python
hf_to_vllm_mapper = WeightsMapper(
    orig_to_new_prefix={
        "language_model.model": "language_model",
        "text_model.model": "text_model",
        "text_model.lm_head": "lm_head",
        "language_model.lm_head": "lm_head",
        # deal with Qwen2-VL mapping
        "model.layers": "language_model.layers",
    })
```
The updated `hf_to_vllm_mapper` for `TransformersForMultimodalLM` appears to be incorrect. The `AutoWeightsLoader` is initialized with `self` (the `TransformersForMultimodalLM` instance), which has `self.model` as an attribute containing the `PreTrainedModel`. Therefore, parameter names within the model are expected to be prefixed with `model.` (e.g., `model.language_model...`).
The current mappings, such as `"language_model.model": "language_model"`, will cause the loader to look for a top-level `language_model` attribute on `TransformersForMultimodalLM`, which doesn't exist. The target prefixes should include `model.` to correctly map to the nested structure.
For example, `language_model.model` from the checkpoint should map to `model.language_model` in the vLLM model.
```python
hf_to_vllm_mapper = WeightsMapper(
    orig_to_new_prefix={
        "language_model.model": "model.language_model",
        "text_model.model": "model.text_model",
        "text_model.lm_head": "model.lm_head",
        "language_model.lm_head": "model.lm_head",
        # deal with Qwen2-VL mapping
        "model.layers": "model.language_model.layers",
    })
```
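For illustration, here is a minimal sketch of how a prefix remapping like this resolves checkpoint names. It is a simplified stand-in for the real vLLM `WeightsMapper` (assumed here to replace the first matching key prefix), not the actual implementation:

```python
# Simplified stand-in for prefix remapping; not the real vLLM WeightsMapper.
def remap_prefix(name: str, orig_to_new_prefix: dict[str, str]) -> str:
    # Replace the first matching prefix in the checkpoint parameter name.
    for old, new in orig_to_new_prefix.items():
        if name.startswith(old):
            return new + name[len(old):]
    return name

mapping = {
    "language_model.model": "model.language_model",
    "text_model.model": "model.text_model",
    "text_model.lm_head": "model.lm_head",
    "language_model.lm_head": "model.lm_head",
    "model.layers": "model.language_model.layers",
}

# A checkpoint key now resolves under the nested `self.model` attribute of
# TransformersForMultimodalLM instead of a non-existent top-level attribute.
print(remap_prefix("language_model.model.layers.0.self_attn.q_proj.weight", mapping))
# -> model.language_model.layers.0.self_attn.q_proj.weight
```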
… tests Signed-off-by: Harry Mellor <[email protected]>
Isotr0py left a comment:
LGTM as long as tests can pass.
Signed-off-by: Harry Mellor <[email protected]>
Signed-off-by: Harry Mellor <[email protected]> Signed-off-by: qizixi <[email protected]>
Signed-off-by: Harry Mellor <[email protected]> Signed-off-by: x22x22 <[email protected]>
Signed-off-by: Harry Mellor <[email protected]>
Signed-off-by: Harry Mellor <[email protected]>
Signed-off-by: Harry Mellor <[email protected]> Signed-off-by: Jinzhen Lin <[email protected]>
Signed-off-by: Harry Mellor <[email protected]> Signed-off-by: Paul Pak <[email protected]>
Signed-off-by: Harry Mellor <[email protected]> Signed-off-by: Diego-Castan <[email protected]>
Signed-off-by: Harry Mellor <[email protected]>
- `TransformersModel` no longer inherits from `nn.Module`, so `self.model = AutoModel.from_config(...)` is no longer instantly registered as a child module of `TransformersModel`
- In `TransformersForCausalLM` and `TransformersForMultimodalLM` we set `self.model` to the inner `model` from `TransformersModel`
- `hf_to_vllm_mapper` is no longer necessary in `TransformersForCausalLM` and can be converted to a class variable in `TransformersForMultimodalLM`
- Since `hf_to_vllm_mapper` is no longer a `@property` in any Transformers backend classes, we can update `SupportsQuant` to access the class attribute directly, as instructed in the comments

A simplified sketch of the resulting structure is shown below.
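The sketch below only illustrates the shape described in the list above; class and attribute names follow the PR description, while the constructor signatures and bodies are placeholders rather than the actual vLLM code:

```python
import torch.nn as nn
from transformers import AutoModel


class TransformersModel:
    """Plain helper class (no longer an nn.Module), so `self.model` is not
    automatically registered as a child module of this class."""

    def __init__(self, config):
        self.model = AutoModel.from_config(config)


class TransformersForCausalLM(nn.Module):
    # hf_to_vllm_mapper removed: checkpoint names already line up.

    def __init__(self, config):
        super().__init__()
        # Take the inner HF model directly so it registers under `self.model`.
        self.model = TransformersModel(config).model


class TransformersForMultimodalLM(nn.Module):
    # Now a plain class variable rather than a @property, so interfaces such
    # as SupportsQuant can read it straight off the class.
    hf_to_vllm_mapper = ...  # WeightsMapper(...) as discussed above

    def __init__(self, config):
        super().__init__()
        self.model = TransformersModel(config).model
```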