The missing guide for training Qwen3-VL MOE

Anyone interested in training qwen3-vl moe models but without success can try the following.

1. Use the official base image `[base-verl0.6-cu128-cudnn9.8-torch2.8.0-fa2.7.4](https://hub.docker.com/layers/verlai/verl/base-verl0.6-cu128-cudnn9.8-torch2.8.0-fa2.7.4/images/sha256-6bcff875bfe58350b238ddac7f975fcda6f99f55dee42d8918199357ab3aa0ef)`
2. `pip install vllm==0.11.0` which supports qwen3 vl rollout
3. clone verl (optional: checkout to commit 2c9e19ef2f0619a2e9e9d4fc813dab8e717e3ab9 which I have personally tested) and install 
4. install mbridge `pip install -U git+https://github.com/ISEEKYAN/mbridge.git`
5. install latest transformers for qwen3vl support `pip install 'transformers>=4.57.0'`
6. install mcore `pip install --no-deps --no-cache-dir git+https://github.com/NVIDIA/Megatron-LM.git@core_v0.13.1`

Then you can run the example in `examples/grpo_trainer/run_qwen3_vl-30b-megatron.sh`

Good luck!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

The missing guide for training Qwen3-VL MOE #3906

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

The missing guide for training Qwen3-VL MOE #3906

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions