[OpenVINO] Support Qwen3VL model by popovaan · Pull Request #1551 · huggingface/optimum-intel

popovaan · 2025-12-11T16:57:17Z

What does this PR do?

Conversion cmd-line for Qwen/Qwen3-VL-2B-Instruct:

optimum-cli export openvino -m Qwen/Qwen3-VL-2B-Instruct ./Qwen3-VL

Inference of Qwen/Qwen3-VL-2B-Instruct using OpenVINO backend:

from transformers import AutoTokenizer, AutoProcessor
from transformers.video_utils import load_video
from huggingface_hub import hf_hub_download
from optimum.intel.openvino import OVModelForVisualCausalLM

model_dir = "./Qwen3-VL/"

tokenizer = AutoTokenizer.from_pretrained(model_dir)
processor = AutoProcessor.from_pretrained(model_dir)
model = OVModelForVisualCausalLM.from_pretrained(model_dir)

# Prepare video input
video_path = hf_hub_download(
    repo_id="raushan-testing-hf/videos-test",
    filename="sample_demo_1.mp4",
    repo_type="dataset",
)
input_video, _ = load_video(video_path, num_frames=10, backend="opencv")
question = "Why is this video funny?"
inputs = model.preprocess_inputs(processor=processor, text=question, video=input_video)

# Run inference
output_ids = model.generate(**inputs, max_new_tokens=10)
output_text = tokenizer.decode(output_ids[0])

print(output_text)
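
A note on the decoded output: assuming `generate` returns the prompt tokens followed by the newly generated ones (the usual behavior for decoder-only models in transformers), the printed text will include the prompt. The sketch below shows the trimming step on plain Python lists; `strip_prompt` is a hypothetical helper for illustration, not part of optimum-intel.

```python
from typing import List


def strip_prompt(output_ids: List[List[int]], prompt_len: int) -> List[List[int]]:
    """Drop the first `prompt_len` tokens of every sequence, keeping only new tokens."""
    return [seq[prompt_len:] for seq in output_ids]


# Toy token ids: four prompt tokens followed by two generated tokens.
sequences = [[101, 7, 8, 9, 42, 43]]
new_tokens = strip_prompt(sequences, prompt_len=4)
```

With tensors, the same idea would be a slice along the sequence dimension starting at the prompt length taken from `inputs["input_ids"]`, under the same assumption about what `generate` returns.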

Continuation of #1452

Before submitting

[N/A] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

popovaan · 2026-02-02T15:11:19Z

@IlyasMoutawwakil @echarlaix could you please review the PR again?

echarlaix

Thanks a lot for the PR @popovaan !!

tests/openvino/test_seq2seq.py

tests/openvino/test_quantization.py

optimum/exporters/openvino/model_configs.py

echarlaix · 2026-02-03T15:54:21Z

optimum/exporters/openvino/model_configs.py

+
+
+@register_in_tasks_manager(
+    "qwen3_vl_text",


Could you expand on why we need this config (instead of adding any modifications to the qwen3_vl config, depending on _behavior)?

If we want to avoid a separate text config, the qwen3_vl config has to support handling of past key values. Simply inheriting from TextDecoderWithPositionIdsOnnxConfig does not solve this, as the config still behaves like a visual-language config and the past key values functionality is not triggered for some reason.

I tried several approaches to work around this, but none of them works properly. If avoiding a separate config is crucial, I suggest doing it in a separate PR, as this model is awaited by the customer and the change does not seem trivial.
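
To make the trade-off in this thread concrete, here is a minimal toy sketch of the "separate text config" approach. Everything in it is hypothetical (`CONFIG_REGISTRY`, `register`, `config_for`, and both toy classes); it only mimics the idea of registering a second key such as "qwen3_vl_text" and is not how optimum-intel's tasks manager is implemented.

```python
from typing import Callable, Dict

# Hypothetical registry standing in for the real tasks manager.
CONFIG_REGISTRY: Dict[str, type] = {}


def register(model_type: str) -> Callable[[type], type]:
    """Toy decorator that stores a config class under a model-type key."""
    def decorator(cls: type) -> type:
        CONFIG_REGISTRY[model_type] = cls
        return cls
    return decorator


@register("qwen3_vl")
class ToyVisionLanguageConfig:
    use_past = False  # assumed: behaves like a visual-language config


@register("qwen3_vl_text")
class ToyTextConfig:
    use_past = True  # assumed: past key values handled like a text decoder


def config_for(model_type: str, part: str) -> type:
    """Pick the dedicated text config for the language part, else the base config."""
    key = f"{model_type}_text" if part == "language" else model_type
    return CONFIG_REGISTRY[key]
```

The alternative raised in the review would keep a single class and branch on its behavior flag internally; the reply above explains why that did not work out of the box for this model.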

optimum/exporters/openvino/model_patcher.py

optimum/intel/openvino/modeling_visual_language.py

echarlaix · 2026-02-03T16:33:18Z

optimum/intel/openvino/modeling_visual_language.py

+            **kwargs,
+        ):
+            # Clear cached rope delta from previous generations
+            self.rope_deltas = None


Not directly related to this PR, but what if the model's forward is called multiple times (not through generate)?

I suppose accuracy will be low, as rope_deltas need to be cleared before each forward.
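
The concern can be illustrated with a toy model that caches a position offset between calls. This is a simplified sketch under stated assumptions: `ToyVLM`, its `num_visual_tokens` argument, and the way the offset is computed are all hypothetical, and only the attribute name `rope_deltas` and the reset inside `generate` come from the diff above.

```python
from typing import List, Optional


class ToyVLM:
    """Hypothetical stand-in for a model that caches a per-prompt position offset."""

    def __init__(self) -> None:
        self.rope_deltas: Optional[int] = None  # cached offset, named as in the diff

    def forward(self, input_ids: List[int], num_visual_tokens: int = 0) -> List[int]:
        # Assumption: the offset is computed only when nothing is cached yet.
        if self.rope_deltas is None:
            self.rope_deltas = -num_visual_tokens
        # Position ids are shifted by whatever offset is currently cached.
        return [i + self.rope_deltas for i in range(len(input_ids))]

    def generate(self, input_ids: List[int], num_visual_tokens: int = 0) -> List[int]:
        # Clear cached rope delta from previous generations
        self.rope_deltas = None
        return self.forward(input_ids, num_visual_tokens)


model = ToyVLM()
first = model.forward([1, 2, 3], num_visual_tokens=2)   # caches offset -2
stale = model.forward([4, 5, 6], num_visual_tokens=0)   # new prompt, old offset reused
fresh = model.generate([4, 5, 6], num_visual_tokens=0)  # cache cleared before forward
```

In this toy, the second direct `forward` call gets positions computed for the first prompt, while the call through `generate` gets the correct ones, which is the kind of degradation the reply anticipates.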

optimum/intel/openvino/modeling_visual_language.py

Co-authored-by: Ella Charlaix <80481427+echarlaix@users.noreply.github.com>

popovaan · 2026-02-05T11:06:46Z

@echarlaix thanks for the review! I will apply remaining comments today.
@IlyasMoutawwakil could you please review this PR again too?

popovaan · 2026-02-05T16:27:30Z

@echarlaix @IlyasMoutawwakil please review this PR.

optimum/intel/openvino/modeling_visual_language.py

echarlaix

Thanks a lot @popovaan for the great addition !

rkazants · 2026-02-09T10:14:50Z

@popovaan, @echarlaix, I recommend running the slow tests only after the precommit tests have completed. Otherwise, simultaneous model download requests from both scopes (precommit and slow tests) can lead to network errors.

## Description

This PR enables the Qwen3-VL model in the GenAI VLM pipeline. It supports the SDPA and PA backends in the VLM pipeline and the Continuous Batching pipeline (both `generate()` and `add_request()` APIs).

Depends on the [Optimum Intel PR](huggingface/optimum-intel#1551) (latest Optimum Intel) and `transformers>=4.57.0` for model exporting.

CVS-175825

Resolves #2998

## Checklist:

- [x] Tests have been updated or added to cover the new code.
- [x] This patch fully addresses the ticket.
- [x] I have made corresponding changes to the documentation.

openvino-dev-samples and others added 30 commits September 12, 2025 10:05

add qwen3_vl

2c95f78

Update setup.py

b47cc60

Update modeling_visual_language.py

8654a53

Update model_patcher.py

e1f75c3

update

d260216

set to static shape

047e30b

add qwen3vl_moe support

6c88fbf

Update modeling_visual_language.py

a2c7350

Update modeling_visual_language.py

c7b2d28

Update model_patcher.py

9b76446

Update modeling_visual_language.py

8e7cdd2

Revert "Update modeling_visual_language.py"

3cb4e20

This reverts commit 8e7cdd2.

Update modeling_visual_language.py

741501e

transformers 4.57

02f9c50

patch dynamic cache layer

c68919f

fix qwen and gpt_oss

073fc46

fix seq2seq models as well

5b245cf

fix

513977a

fix

43d5842

more decoder fixes

79a0bbf

limit awq

bc57cec

fix dynamic layer in optimum-onnx's model patcher

6489d7e

remove

11b5a5a

fix donut

d6cd7a6

vlm fixes

272a624

fix speecht5

c62546e

fix whisper

a7ede39

fix

817bc54

fix qwenvl

225b81d

better fix

7c5c92c

popovaan added 2 commits February 2, 2026 12:05

Removed wrong change.

deeb4e2

Test corrected.

c7877e1

echarlaix reviewed Feb 3, 2026

optimum/intel/openvino/modeling_visual_language.py (outdated, resolved)

echarlaix reviewed Feb 3, 2026

optimum/intel/openvino/modeling_visual_language.py (resolved)

popovaan and others added 6 commits February 4, 2026 11:17

Applied comments.

796566d

Update optimum/intel/openvino/modeling_visual_language.py

953d77e

Co-authored-by: Ella Charlaix <80481427+echarlaix@users.noreply.github.com>

Added links, minor corrections.

72eb222

Apply suggestion from @echarlaix

6eb3398

Co-authored-by: Ella Charlaix <80481427+echarlaix@users.noreply.github.com>

Apllied comments.

3f03a9d

Fixed error.

d8c9093

rkazants requested a review from echarlaix February 5, 2026 11:46

Code style.

336de2e

rkazants mentioned this pull request Feb 9, 2026

Qwen3-VL support #1478

Closed

echarlaix added the openvino-slow label (Runs OpenVINO slow tests with different versions of transformers) Feb 9, 2026

echarlaix reviewed Feb 9, 2026

optimum/intel/openvino/modeling_visual_language.py (resolved)

echarlaix approved these changes Feb 9, 2026

rkazants removed the openvino-slow label (Runs OpenVINO slow tests with different versions of transformers) Feb 9, 2026

Added comment.

d7bfb38

rkazants added the openvino-slow label (Runs OpenVINO slow tests with different versions of transformers) Feb 9, 2026

echarlaix merged commit 6ff3bfc into huggingface:main Feb 9, 2026
29 of 56 checks passed

Copilot AI mentioned this pull request Mar 8, 2026

Add Qwen3.5 model support (VLM + hybrid GatedDeltaNet text model) rkazants/optimum-intel#3

Draft

echarlaix mentioned this pull request Mar 30, 2026

Transformers v5 #1589

Open

peterchen-intel mentioned this pull request Mar 31, 2026

support videochat #1637

Open

3 tasks




6 participants
