Skip to content

Commit c332ef5

Browse files
committed
chore: fix linting issues
Signed-off-by: Ettore Di Giacinto <[email protected]>
1 parent 6e7a8c6 commit c332ef5

File tree

1 file changed

+17
-17
lines changed

1 file changed

+17
-17
lines changed

gallery/index.yaml

Lines changed: 17 additions & 17 deletions
Original file line numberDiff line numberDiff line change
@@ -19,37 +19,37 @@
1919
- https://huggingface.co/unsloth/Qwen3-VL-30B-A3B-Instruct-GGUF
2020
description: |
2121
Meet Qwen3-VL — the most powerful vision-language model in the Qwen series to date.
22-
22+
2323
This generation delivers comprehensive upgrades across the board: superior text understanding & generation, deeper visual perception & reasoning, extended context length, enhanced spatial and video dynamics comprehension, and stronger agent interaction capabilities.
24-
24+
2525
Available in Dense and MoE architectures that scale from edge to cloud, with Instruct and reasoning‑enhanced Thinking editions for flexible, on-demand deployment.
26-
26+
2727
#### Key Enhancements:
28-
28+
2929
* **Visual Agent**: Operates PC/mobile GUIs—recognizes elements, understands functions, invokes tools, completes tasks.
30-
30+
3131
* **Visual Coding Boost**: Generates Draw.io/HTML/CSS/JS from images/videos.
32-
32+
3333
* **Advanced Spatial Perception**: Judges object positions, viewpoints, and occlusions; provides stronger 2D grounding and enables 3D grounding for spatial reasoning and embodied AI.
34-
34+
3535
* **Long Context & Video Understanding**: Native 256K context, expandable to 1M; handles books and hours-long video with full recall and second-level indexing.
36-
36+
3737
* **Enhanced Multimodal Reasoning**: Excels in STEM/Math—causal analysis and logical, evidence-based answers.
38-
38+
3939
* **Upgraded Visual Recognition**: Broader, higher-quality pretraining is able to “recognize everything”—celebrities, anime, products, landmarks, flora/fauna, etc.
40-
40+
4141
* **Expanded OCR**: Supports 32 languages (up from 19); robust in low light, blur, and tilt; better with rare/ancient characters and jargon; improved long-document structure parsing.
42-
42+
4343
* **Text Understanding on par with pure LLMs**: Seamless text–vision fusion for lossless, unified comprehension.
44-
44+
4545
#### Model Architecture Updates:
46-
46+
4747
1. **Interleaved-MRoPE**: Full‑frequency allocation over time, width, and height via robust positional embeddings, enhancing long‑horizon video reasoning.
48-
48+
4949
2. **DeepStack**: Fuses multi‑level ViT features to capture fine-grained details and sharpen image–text alignment.
50-
50+
5151
3. **Text–Timestamp Alignment:** Moves beyond T‑RoPE to precise, timestamp‑grounded event localization for stronger video temporal modeling.
52-
52+
5353
This is the weight repository for Qwen3-VL-30B-A3B-Instruct.
5454
overrides:
5555
mmproj: mmproj/mmproj-F16.gguf
@@ -130,7 +130,7 @@
130130
- filename: mmproj/mmproj-Qwen3-VL-4B-Thinking-F16.gguf
131131
sha256: 72354fcd3fc75935b84e745ca492d6e78dd003bb5a020d71b296e7650926ac87
132132
uri: huggingface://unsloth/Qwen3-VL-4B-Thinking-GGUF/mmproj-F16.gguf
133-
- !!merge <<: *llama3
133+
- !!merge <<: *qwen3vl
134134
name: "qwen3-vl-2b-thinking"
135135
urls:
136136
- https://huggingface.co/unsloth/Qwen3-VL-2B-Thinking-GGUF

0 commit comments

Comments
 (0)