If you have been trying to train Qwen3-VL MoE models without success, you can try the following setup.
- Use the official base image [base-verl0.6-cu128-cudnn9.8-torch2.8.0-fa2.7.4](https://hub.docker.com/layers/verlai/verl/base-verl0.6-cu128-cudnn9.8-torch2.8.0-fa2.7.4/images/sha256-6bcff875bfe58350b238ddac7f975fcda6f99f55dee42d8918199357ab3aa0ef).
- Install vLLM 0.11.0, which supports Qwen3-VL rollout: `pip install vllm==0.11.0`
- Clone verl (optional: check out commit 2c9e19e, which I have personally tested) and install it.
- Install mbridge: `pip install -U git+https://github.com/ISEEKYAN/mbridge.git`
- Install the latest transformers for Qwen3-VL support: `pip install 'transformers>=4.57.0'`
- Install mcore: `pip install --no-deps --no-cache-dir git+https://github.com/NVIDIA/Megatron-LM.git@core_v0.13.1`
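The pip steps above can be collected into one script. This is only a minimal sketch, not a tested installer: the `run` and `install_steps` helpers and the `APPLY` variable are hypothetical names introduced here, and the script assumes it is run inside the base image listed above. It is a dry run by default and only prints the commands. The verl clone and install step is left out because the exact commands are not given here.

```shell
#!/usr/bin/env bash
# Sketch of the pip install steps listed above (helper names are illustrative).
# Dry run by default: commands are printed, not executed. Set APPLY=1 to execute.
set -euo pipefail

APPLY="${APPLY:-0}"

run() {
  # Print the command, then execute it only when APPLY=1.
  printf '+ %s\n' "$*"
  if [ "$APPLY" = "1" ]; then
    "$@"
  fi
}

install_steps() {
  # vLLM release with Qwen3-VL rollout support.
  run pip install vllm==0.11.0
  # NOTE: clone and install verl at this point (optionally at commit 2c9e19e).
  # That step is not scripted here because its exact commands are not specified.
  run pip install -U git+https://github.com/ISEEKYAN/mbridge.git
  run pip install 'transformers>=4.57.0'
  # --no-deps keeps pip from replacing the packages pinned by the base image.
  run pip install --no-deps --no-cache-dir git+https://github.com/NVIDIA/Megatron-LM.git@core_v0.13.1
}

install_steps
```

Running the script as-is lets you review the four commands first; running it with `APPLY=1` set in the environment executes them in order and stops at the first failure because of `set -e`.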
Then you can run the example in `examples/grpo_trainer/run_qwen3_vl-30b-megatron.sh`.
Good luck!