Conversation

@LiuRicky (Contributor) commented Feb 18, 2025

#109

This also solves the problem with DeepSpeed ZeRO-3 training, as shown in huggingface/transformers@8ee5053.

After applying these changes, one can add `--deepspeed r1-v/local_scripts/zero3.json` to the training script when using DeepSpeed.
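For illustration, here is a hedged sketch of how the flag might be appended to a training launch command. The base command (launcher, script path, and other arguments) is hypothetical and not from this PR; only the `--deepspeed r1-v/local_scripts/zero3.json` flag comes from the change described above.

```shell
# Hypothetical base training command; script path and args are assumptions.
BASE_CMD="torchrun --nproc_per_node=8 src/open_r1/grpo.py --output_dir ./ckpt"

# Append the DeepSpeed ZeRO-3 config flag from this PR.
TRAIN_CMD="$BASE_CMD --deepspeed r1-v/local_scripts/zero3.json"

echo "$TRAIN_CMD"
```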

@LiuRicky LiuRicky changed the title Support qwen2.5-VL in sft.py Support qwen2.5-VL in sft.py and solve GRPO deepspeed training issue Feb 20, 2025
@tzjtatata
Hi, thank you for debugging. Can you specify the commit of transformers you used? For me, the current main is at 92c5ca9dd70de3ade2af2eb835c96215cc50e815. Is it the same as your version?

@tzjtatata

I also found that the newest version of transformers (92c5ca) has bugs when using Qwen2.5-VL.

@LiuRicky (Contributor, Author)

> And I found that the newest version of transformers ("92c5ca") has bugs when using Qwen2.5-VL.

I guess mine was the version from about 5 days ago, maybe 8ee50537fe7613b87881cd043a85971c85e99519 or e3d99ec2f58e0e2a4df6b2b41152fdfb3f92a52f.

@tzjtatata

tzjtatata commented Feb 23, 2025 via email
