[MODEL] Cherry-pick: Adding Support for Qwen3.5 Models#37514
[MODEL] Cherry-pick: Adding Support for Qwen3.5 Models#37514ChuanLi1101 wants to merge 2 commits intovllm-project:amd_devfrom
Conversation
Signed-off-by: JJJYmmm <1650675829@qq.com> Signed-off-by: JJJYmmm <92386084+JJJYmmm@users.noreply.github.com> Signed-off-by: Roger Wang <hey@rogerw.io> Co-authored-by: wulipc <wulipc@users.noreply.github.com> Co-authored-by: ywang96 <ywang96@users.noreply.github.com> Co-authored-by: Isotr0py <Isotr0py@users.noreply.github.com> Co-authored-by: Isotr0py <2037008807@qq.com> Co-authored-by: Roger Wang <hey@rogerw.io>
|
Documentation preview: https://vllm--37514.org.readthedocs.build/en/37514/ |
There was a problem hiding this comment.
Code Review
This pull request introduces support for the Qwen3.5 series of models, including both dense and Mixture-of-Experts (MoE) variants, as well as their Multi-Token Prediction (MTP) counterparts. The changes involve adding new model implementations (qwen3_5.py and qwen3_5_mtp.py), registering them in the model and MTP registries, and updating documentation. Additionally, several configuration accesses in existing Qwen3Next model files were refined to consistently use hf_text_config and include default values for robustness, and speculative decoding support was enabled for Qwen3.5 MTP models while explicitly disabling Mamba with speculative decoding for Qwen3.5 models.
|
Hi @ChuanLi1101, the pre-commit checks have failed. Please run: uv pip install pre-commit>=4.5.1
pre-commit install
pre-commit run --all-filesThen, commit the changes and push to your branch. For future commits, Tip Is
|
Made-with: Cursor
Summary
amd_devbranchChanges
qwen3_5.py,qwen3_5_mtp.py