Skip to content

The missing guide for training Qwen3-VL MOE #3906

@huaiyizhao

Description

@huaiyizhao

Anyone interested in training qwen3-vl moe models but without success can try the following.

  1. Use the official base image [base-verl0.6-cu128-cudnn9.8-torch2.8.0-fa2.7.4](https://hub.docker.com/layers/verlai/verl/base-verl0.6-cu128-cudnn9.8-torch2.8.0-fa2.7.4/images/sha256-6bcff875bfe58350b238ddac7f975fcda6f99f55dee42d8918199357ab3aa0ef)
  2. pip install vllm==0.11.0 which supports qwen3 vl rollout
  3. clone verl (optional: checkout to commit 2c9e19e which I have personally tested) and install
  4. install mbridge pip install -U git+https://github.com/ISEEKYAN/mbridge.git
  5. install latest transformers for qwen3vl support pip install 'transformers>=4.57.0'
  6. install mcore pip install --no-deps --no-cache-dir git+https://github.com/NVIDIA/Megatron-LM.git@core_v0.13.1

Then you can run the example in examples/grpo_trainer/run_qwen3_vl-30b-megatron.sh

Good luck!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions