Commit b1d5a3a

mudler authored and github-actions[bot] committed
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
1 parent 9815888 commit b1d5a3a

1 file changed (+80, -0 lines changed)
gallery/index.yaml

Lines changed: 80 additions & 0 deletions
@@ -23049,3 +23049,83 @@
     - filename: YanoljaNEXT-Rosetta-27B-2511.i1-Q4_K_M.gguf
       sha256: 0a599099e93ad521045e17d82365a73c1738fff0603d6cb2c9557e96fbc907cb
       uri: huggingface://mradermacher/YanoljaNEXT-Rosetta-27B-2511-i1-GGUF/YanoljaNEXT-Rosetta-27B-2511.i1-Q4_K_M.gguf
+- !!merge <<: *llama3
+  name: "lightonocr-1b-1025"
+  urls:
+    - https://huggingface.co/noctrex/LightOnOCR-1B-1025-GGUF
+  description: |
+    **Model Name:** LightOnOCR-1B-1025
+    **Repository:** [lightonai/LightOnOCR-1B-1025](https://huggingface.co/lightonai/LightOnOCR-1B-1025)
+    **License:** Apache 2.0
+    **Pipeline:** Image-to-Text (OCR & Document Understanding)
+    **Languages:** English, French, German, Spanish, Italian, Dutch, Portuguese, Swedish, Danish
+
+    ---
+
+    ### 🔍 **Description**
+
+    LightOnOCR-1B-1025 is a compact, end-to-end vision-language model designed for high-accuracy Optical Character Recognition (OCR) and document understanding. Built on a Pixtral-based vision encoder and a Qwen3-derived text decoder, it delivers state-of-the-art performance in its size category while being significantly faster and more cost-effective than larger general-purpose models.
+
+    This model excels at extracting structured text from complex documents—handling tables, forms, receipts, multi-column layouts, and mathematical notation—without relying on external OCR pipelines.
+
+    ---
+
+    ### ⚡ **Key Features**
+
+    - **Speed:** Up to 5× faster than dots.ocr, 2× faster than PaddleOCR-VL-0.9B
+    - **Efficiency:** Processes ~5.71 pages per second on a single H100 (~493k pages/day) at under $0.01 per 1,000 pages
+    - **Multilingual Support:** Trained on diverse multilingual PDFs (Latin script)
+    - **End-to-End Architecture:** Fully differentiable; ideal for fine-tuning and integration
+    - **Optimized for Real-World Use:** Works well with PDFs rendered at ~1540px longest edge
+
+    ---
+
+    ### 📊 **Performance Highlights (Olmo-Bench)**
+
+    | Task | Score |
+    |------------------|-------|
+    | Overall Accuracy | **76.1** |
+    | Multi-Column | 80.0 |
+    | Tables | 35.2 |
+    | Tiny Text | 88.7 |
+
+    ---
+
+    ### 🧩 **Use Cases**
+
+    - Automated document processing
+    - Receipt and invoice parsing
+    - Scientific paper and book OCR
+    - Form and table extraction
+    - Low-cost, scalable OCR for enterprise workflows
+
+    ---
+
+    ### 📦 **Variants Available**
+
+    - **`LightOnOCR-1B-1025` (default)** – Full multilingual model (151k vocab)
+    - **`LightOnOCR-1B-32k`** – Fast, pruned vocabulary (32k tokens), optimized for European languages
+    - **`LightOnOCR-1B-16k`** – Most compact variant (16k tokens), smallest memory footprint
+
+    ---
+
+    ### 🚀 **Getting Started**
+
+    Run with vLLM for blazing-fast inference:
+
+    ```bash
+    vllm serve lightonai/LightOnOCR-1B-1025 --limit-mm-per-prompt '{"image": 1}' --async-scheduling
+    ```
+
+    👉 **[Try the demo](https://huggingface.co/spaces/lightonai/LightOnOCR-1B-Demo)** | 📝 **[Read the blog](https://huggingface.co/blog/lightonai/lightonocr/)**
+
+    ---
+
+    **Ideal for developers, researchers, and enterprises seeking fast, accurate, and affordable document intelligence.**
+  overrides:
+    parameters:
+      model: LightOnOCR-1B-1025-Q4_K_M.gguf
+  files:
+    - filename: LightOnOCR-1B-1025-Q4_K_M.gguf
+      sha256: da36fb008a81128553933a15dc6373c1d0692e3ed1c17e9115521d84c473dbd5
+      uri: huggingface://noctrex/LightOnOCR-1B-1025-GGUF/LightOnOCR-1B-1025-Q4_K_M.gguf
