Vision Reinforcement Learning + Memory Efficient RL #3326
shimmyshimmer
announced in
Announcements
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
We're excited to support Vision models for RL and even more memory efficient + faster RL!
Unsloth now supports vision/multimodal RL with Gemma 3 and Qwen2.5-VL. Due to Unsloth's unique weight sharing and custom kernels, Unsloth makes VLM RL 1.5–2× faster, uses 90% less VRAM, and enables 10× longer context lengths than FA2 setups, with no accuracy loss. Qwen2.5-VL GRPO notebook
Full details in our blogpost: https://docs.unsloth.ai/new/vision-reinforcement-learning-vlm-rl
Don't forget to also join our Reddit: r/unsloth 🥰
This discussion was created from the release Vision Reinforcement Learning + Memory Efficient RL.
Beta Was this translation helpful? Give feedback.
All reactions