You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
-[CLEVR-70k-Counting](examples/run_qwen2_5_vl_2b_clevr.sh): Train the Qwen2.5-VL-3B-Instruct model on counting problem.
87
+
-[CLEVR-70k-Counting](examples/run_qwen2_5_vl_3b_clevr.sh): Train the Qwen2.5-VL-3B-Instruct model on counting problem.
85
88
86
89
## TODO
87
90
88
91
- Support PPO, Reinforce++ and RLOO for VLMs.
89
-
- Support padding-free training for VLMs.
90
92
- Support ulysses parallelism for VLMs.
91
93
- Support more VLM architectures.
92
94
93
95
### Known bugs
94
96
95
97
These features are temporarily disabled for now, we plan to fix them one-by-one in the future updates.
96
98
97
-
- Vision language models are not compatible with padding-free training and ulysses parallelism yet.
99
+
- Vision language models are not compatible with ulysses parallelism yet.
98
100
- Vision language models are not compatible with `enable_chunked_prefill` unless [vLLM v1](https://blog.vllm.ai/2025/01/27/v1-alpha-release.html) is supported.
0 commit comments