Add Qwen3-VL-Embedding-2B image input support by cornmander · Pull Request #232 · Anush008/fastembed-rs

cornmander · 2026-03-02T07:27:07Z

Summary

add a new Qwen3VLEmbedding API for Qwen3-VL multimodal embedding models
support image inputs via embed_images(...) and embed_image_bytes(...)
keep text support for Qwen3-VL via embed_texts(...) and existing Qwen3TextEmbedding
add an internal Qwen3-VL vision module for image token embedding/injection
update exports, feature wiring, README examples, and qwen3 tests

Implementation notes

uses the qwen3 feature (candle backend) and enables dep:image for this feature
image preprocessing follows Qwen3-VL patch/grid behavior (patch_size, merge_size, temporal duplication)
prompt expansion replaces a single <|image_pad|> placeholder with the exact number of image patch tokens
image embeddings are injected into the text stream before final hidden-state pooling
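
To make the patch/grid and prompt-expansion notes concrete, here is a minimal sketch, not the code from this PR. The helper names `image_token_count` and `expand_image_pad` are hypothetical, and `patch_size` and `merge_size` are passed in rather than hard-coded. It assumes the image has already been resized so both sides are multiples of `patch_size * merge_size`, and that a still image contributes a temporal grid of 1 after frame duplication.

```rust
/// Number of `<|image_pad|>` tokens one still image expands to.
/// Hypothetical helper: returns None when the resized dimensions do not
/// align to `patch_size * merge_size`.
pub fn image_token_count(
    height: usize,
    width: usize,
    patch_size: usize,
    merge_size: usize,
) -> Option<usize> {
    let unit = patch_size.checked_mul(merge_size)?;
    if unit == 0 || height == 0 || width == 0 || height % unit != 0 || width % unit != 0 {
        return None;
    }
    // Assumption: a single image is duplicated along the temporal axis so
    // that it fills exactly one temporal patch, giving a temporal grid of 1.
    let grid_t = 1;
    let grid_h = height / patch_size;
    let grid_w = width / patch_size;
    // Each merge_size x merge_size group of patches becomes one token.
    Some(grid_t * grid_h * grid_w / (merge_size * merge_size))
}

/// Replace the single `<|image_pad|>` placeholder with `n_tokens` copies.
/// Returns None unless the prompt contains exactly one placeholder.
pub fn expand_image_pad(prompt: &str, n_tokens: usize) -> Option<String> {
    const PAD: &str = "<|image_pad|>";
    if prompt.matches(PAD).count() != 1 {
        return None;
    }
    Some(prompt.replacen(PAD, &PAD.repeat(n_tokens), 1))
}
```

Under these assumptions a 64x96 input with `patch_size = 16` and `merge_size = 2` yields a 4x6 patch grid, so the placeholder expands to 6 tokens.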

Validation

cargo fmt --all
cargo check
cargo test --features qwen3 models::qwen3::tests -- --nocapture
cargo test --features qwen3 --test qwen3 qwen3_vl_2b_text_embed -- --nocapture
cargo test --features qwen3 --test qwen3 qwen3_vl_2b_image_embed -- --nocapture

New/updated interfaces

Qwen3VLEmbedding::from_hf(...)
Qwen3VLEmbedding::embed_texts(...)
Qwen3VLEmbedding::embed_images(...)
Qwen3VLEmbedding::embed_image_bytes(...)

cornmander · 2026-03-02T15:28:32Z

Pushed a follow-up formatting fix in 7894acd for the cargo fmt --all -- --check failure (import wrapping in src/models/qwen3.rs).

Current run status is action_required with no jobs started yet, so this PR likely needs a maintainer to approve workflow execution for the forked branch before CI can run again.

Anush008

Thanks for taking the time to contribute @cornmander

Anush008 · 2026-03-03T11:14:18Z

tests/qwen3.rs

+    let images = ["tests/assets/image_0.png", "tests/assets/image_1.png"];
+    let embeddings = model.embed_images(&images).expect("embed images");
+
+    assert_eq!(embeddings.len(), images.len());


Please add assertions that compare the embedding values against the output of the Python counterpart code.

Like we do at https://github.com/Anush008/fastembed-rs/blob/main/tests/text-embeddings.rs

We have to ensure Python and Rust produce the same vectors.
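
One possible shape for such an assertion, as a sketch only: compare the leading dimensions of a Rust embedding against reference values exported from Python, within a tolerance. The helper names `max_abs_diff` and `matches_reference` are hypothetical, and the tolerance is something to choose per model; no reference values from the actual tests are reproduced here.

```rust
/// Largest absolute element-wise difference, or None on length mismatch.
/// A NaN anywhere is propagated so it can never pass a tolerance check.
pub fn max_abs_diff(a: &[f32], b: &[f32]) -> Option<f32> {
    if a.len() != b.len() {
        return None;
    }
    let mut worst = 0.0_f32;
    for (x, y) in a.iter().zip(b) {
        let d = (x - y).abs();
        if d.is_nan() {
            return Some(f32::NAN);
        }
        if d > worst {
            worst = d;
        }
    }
    Some(worst)
}

/// True when the first `expected_prefix.len()` values of `actual` are
/// within `tol` of the reference prefix.
pub fn matches_reference(actual: &[f32], expected_prefix: &[f32], tol: f32) -> bool {
    if actual.len() < expected_prefix.len() {
        return false;
    }
    match max_abs_diff(&actual[..expected_prefix.len()], expected_prefix) {
        Some(d) => d <= tol, // false for NaN
        None => false,
    }
}
```

In a test this could be used as `assert!(matches_reference(&embedding, &python_first_dims, tol))`, where `python_first_dims` holds values copied from the Python run.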

cornmander · 2026-03-04T18:32:01Z

Addressed the review feedback in a5c7c2f.

Changes made:

Added Python-reference assertions in tests/qwen3.rs for qwen3_vl_2b_image_embed (embedding sums, first dims, and cosine).
Aligned Rust VL image path with the official Python behavior:
- Python-compatible ties-to-even image resize rounding.
- MRoPE position-id construction for image tokens.
- Interleaved MRoPE rotary application from mrope_section.
- DeepStack visual feature injection into early decoder layers.
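
The rounding item is easy to get subtly wrong, so here is a small sketch of the difference. Python's built-in `round` sends exact halves to the nearest even integer, while Rust's `f64::round` sends them away from zero. The helpers below are hypothetical illustrations, not the functions added in a5c7c2f, and `round_to_multiple` assumes the resize step snaps each side to a multiple of some factor, which the comment above does not spell out.

```rust
/// Round to the nearest integer, sending exact halves to the even neighbour
/// (the behaviour of Python's built-in `round` on floats).
pub fn round_half_even(x: f64) -> f64 {
    if (x - x.trunc()).abs() == 0.5 {
        // Exactly halfway: x / 2 is never a tie here, so the half-away-from-zero
        // `round` picks the nearest integer, and doubling it gives an even result.
        2.0 * (x / 2.0).round()
    } else {
        x.round()
    }
}

/// Snap `n` to the nearest multiple of `factor` using ties-to-even.
/// Assumption: this mirrors a `round(n / factor) * factor` step in Python.
pub fn round_to_multiple(n: u32, factor: u32) -> u32 {
    assert!(factor > 0, "factor must be non-zero");
    (round_half_even(n as f64 / factor as f64) as u32) * factor
}
```

With a factor of 32, a side of 80 pixels sits exactly halfway between 64 and 96: ties-to-even picks 64, whereas half-away-from-zero rounding would pick 96 and change the patch grid. Recent Rust toolchains also ship `f64::round_ties_even`, which could replace the hand-written helper.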

Validation run:

RUN_QWEN3_VL_2B_IMAGE=1 cargo test --features qwen3 --test qwen3 qwen3_vl_2b_image_embed -- --nocapture
RUN_QWEN3_VL_2B=1 cargo test --features qwen3 --test qwen3 qwen3_vl_2b_text_embed -- --nocapture
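
The `RUN_QWEN3_VL_2B_IMAGE=1` and `RUN_QWEN3_VL_2B=1` prefixes suggest the heavy tests are opt-in through environment variables. How the tests read them is not shown in this thread, so the following is only a sketch of one common gating pattern, with hypothetical helper names.

```rust
use std::env;

/// Interpret an optional flag value: only the literal "1" enables the test.
pub fn is_enabled(value: Option<&str>) -> bool {
    matches!(value, Some("1"))
}

/// Read the named environment variable and decide whether to run.
/// Unset or non-UTF-8 values count as disabled.
pub fn heavy_test_enabled(var: &str) -> bool {
    is_enabled(env::var(var).ok().as_deref())
}
```

A test body could then start with `if !heavy_test_enabled("RUN_QWEN3_VL_2B_IMAGE") { return; }`, so a plain `cargo test` would skip the model download.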

Anush008

Thanks @cornmander

## [5.12.0](v5.11.0...v5.12.0) (2026-03-05)

### 🍕 Features

* Add Qwen3-VL-Embedding-2B image input support ([#232](#232)) ([b9a6280](b9a6280))

github-actions · 2026-03-05T03:57:26Z

🎉 This PR is included in version 5.12.0 🎉

The release is available on:

GitHub release
v5.12.0

Your semantic-release bot 📦🚀

cornmander added 2 commits March 2, 2026 02:26

Add Qwen3-VL image embedding support

46600ee

style: format qwen3 tests import

7894acd

Anush008 reviewed Mar 3, 2026

fix: align qwen3-vl image embeddings with python reference

a5c7c2f

Anush008 approved these changes Mar 5, 2026

Anush008 merged commit b9a6280 into Anush008:main Mar 5, 2026
1 check passed

github-actions bot pushed a commit that referenced this pull request Mar 5, 2026

chore(release): 5.12.0 [skip ci]

eb248c5

github-actions bot added the released label Mar 5, 2026

Anush008 merged 3 commits into Anush008:main from cornmander:codex/qwen3-vl-embedding-2b-image
