Skip to content

[MODEL] Cherry-pick: Adding Support for Qwen3.5 Models#37514

Open
ChuanLi1101 wants to merge 2 commits intovllm-project:amd_devfrom
ChuanLi1101:cherry-pick/qwen3.5-support
Open

[MODEL] Cherry-pick: Adding Support for Qwen3.5 Models#37514
ChuanLi1101 wants to merge 2 commits intovllm-project:amd_devfrom
ChuanLi1101:cherry-pick/qwen3.5-support

Conversation

@ChuanLi1101
Copy link
Copy Markdown
Collaborator

@ChuanLi1101 ChuanLi1101 commented Mar 19, 2026

Summary

Changes

  • New model files: qwen3_5.py, qwen3_5_mtp.py
  • Updated model registry, speculative config, and supported models documentation
  • 11 files changed, +1501/-9 lines

Signed-off-by: JJJYmmm <1650675829@qq.com>
Signed-off-by: JJJYmmm <92386084+JJJYmmm@users.noreply.github.com>
Signed-off-by: Roger Wang <hey@rogerw.io>
Co-authored-by: wulipc <wulipc@users.noreply.github.com>
Co-authored-by: ywang96 <ywang96@users.noreply.github.com>
Co-authored-by: Isotr0py <Isotr0py@users.noreply.github.com>
Co-authored-by: Isotr0py <2037008807@qq.com>
Co-authored-by: Roger Wang <hey@rogerw.io>
@mergify
Copy link
Copy Markdown

mergify bot commented Mar 19, 2026

Documentation preview: https://vllm--37514.org.readthedocs.build/en/37514/

@mergify mergify bot added documentation Improvements or additions to documentation new-model Requests to new models qwen Related to Qwen models speculative-decoding v1 labels Mar 19, 2026
Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request introduces support for the Qwen3.5 series of models, including both dense and Mixture-of-Experts (MoE) variants, as well as their Multi-Token Prediction (MTP) counterparts. The changes involve adding new model implementations (qwen3_5.py and qwen3_5_mtp.py), registering them in the model and MTP registries, and updating documentation. Additionally, several configuration accesses in existing Qwen3Next model files were refined to consistently use hf_text_config and include default values for robustness, and speculative decoding support was enabled for Qwen3.5 MTP models while explicitly disabling Mamba with speculative decoding for Qwen3.5 models.

@mergify
Copy link
Copy Markdown

mergify bot commented Mar 19, 2026

Hi @ChuanLi1101, the pre-commit checks have failed. Please run:

uv pip install pre-commit>=4.5.1
pre-commit install
pre-commit run --all-files

Then, commit the changes and push to your branch.

For future commits, pre-commit will run automatically on changed files before each commit.

Tip

Is mypy failing?
mypy is run differently in CI. If the failure is related to this check, please use the following command to run it locally:
# For mypy (substitute "3.10" with the failing version if needed)
pre-commit run --hook-stage manual mypy-3.10

@ChuanLi1101
Copy link
Copy Markdown
Collaborator Author

Fixed the pre-commit
uff-check import ordering issue in mxfp4.py.
Requesting review from AMD reviewers: @tjtanaa @gshtras

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

documentation Improvements or additions to documentation new-model Requests to new models qwen Related to Qwen models speculative-decoding v1

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants