|
23049 | 23049 | - filename: YanoljaNEXT-Rosetta-27B-2511.i1-Q4_K_M.gguf |
23050 | 23050 | sha256: 0a599099e93ad521045e17d82365a73c1738fff0603d6cb2c9557e96fbc907cb |
23051 | 23051 | uri: huggingface://mradermacher/YanoljaNEXT-Rosetta-27B-2511-i1-GGUF/YanoljaNEXT-Rosetta-27B-2511.i1-Q4_K_M.gguf |
| 23052 | +- !!merge <<: *llama3 |
| 23053 | + name: "lightonocr-1b-1025" |
| 23054 | + urls: |
| 23055 | + - https://huggingface.co/noctrex/LightOnOCR-1B-1025-GGUF |
| 23056 | + description: | |
| 23057 | + **Model Name:** LightOnOCR-1B-1025 |
| 23058 | + **Repository:** [lightonai/LightOnOCR-1B-1025](https://huggingface.co/lightonai/LightOnOCR-1B-1025) |
| 23059 | + **License:** Apache 2.0 |
| 23060 | + **Pipeline:** Image-to-Text (OCR & Document Understanding) |
| 23061 | + **Languages:** English, French, German, Spanish, Italian, Dutch, Portuguese, Swedish, Danish |
| 23062 | + |
| 23063 | + --- |
| 23064 | + |
| 23065 | + ### 🔍 **Description** |
| 23066 | + |
| 23067 | + LightOnOCR-1B-1025 is a compact, end-to-end vision-language model designed for high-accuracy Optical Character Recognition (OCR) and document understanding. Built on a Pixtral-based vision encoder and a Qwen3-derived text decoder, it delivers state-of-the-art performance in its size category while being significantly faster and more cost-effective than larger general-purpose models. |
| 23068 | + |
| 23069 | + This model excels at extracting structured text from complex documents—handling tables, forms, receipts, multi-column layouts, and mathematical notation—without relying on external OCR pipelines. |
| 23070 | + |
| 23071 | + --- |
| 23072 | + |
| 23073 | + ### ⚡ **Key Features** |
| 23074 | + |
| 23075 | + - **Speed:** Up to 5× faster than dots.ocr, 2× faster than PaddleOCR-VL-0.9B |
| 23076 | + - **Efficiency:** Processes ~5.71 pages per second on a single H100 (~493k pages/day) at under $0.01 per 1,000 pages |
| 23077 | + - **Multilingual Support:** Trained on diverse multilingual PDFs (Latin script) |
| 23078 | + - **End-to-End Architecture:** Fully differentiable; ideal for fine-tuning and integration |
| 23079 | + - **Optimized for Real-World Use:** Works well with PDFs rendered at ~1540px longest edge |
| 23080 | + |
| 23081 | + --- |
| 23082 | + |
| 23083 | + ### 📊 **Performance Highlights (Olmo-Bench)** |
| 23084 | + |
| 23085 | + | Task | Score | |
| 23086 | + |------------------|-------| |
| 23087 | + | Overall Accuracy | **76.1** | |
| 23088 | + | Multi-Column | 80.0 | |
| 23089 | + | Tables | 35.2 | |
| 23090 | + | Tiny Text | 88.7 | |
| 23091 | + |
| 23092 | + --- |
| 23093 | + |
| 23094 | + ### 🧩 **Use Cases** |
| 23095 | + |
| 23096 | + - Automated document processing |
| 23097 | + - Receipt and invoice parsing |
| 23098 | + - Scientific paper and book OCR |
| 23099 | + - Form and table extraction |
| 23100 | + - Low-cost, scalable OCR for enterprise workflows |
| 23101 | + |
| 23102 | + --- |
| 23103 | + |
| 23104 | + ### 📦 **Variants Available** |
| 23105 | + |
| 23106 | + - **`LightOnOCR-1B-1025` (default)** – Full multilingual model (151k vocab) |
| 23107 | + - **`LightOnOCR-1B-32k`** – Fast, pruned vocabulary (32k tokens), optimized for European languages |
| 23108 | + - **`LightOnOCR-1B-16k`** – Most compact variant (16k tokens), smallest memory footprint |
| 23109 | + |
| 23110 | + --- |
| 23111 | + |
| 23112 | + ### 🚀 **Getting Started** |
| 23113 | + |
| 23114 | + Run with vLLM for blazing-fast inference: |
| 23115 | + |
| 23116 | + ```bash |
| 23117 | + vllm serve lightonai/LightOnOCR-1B-1025 --limit-mm-per-prompt '{"image": 1}' --async-scheduling |
| 23118 | + ``` |
| 23119 | + |
| 23120 | + 👉 **[Try the demo](https://huggingface.co/spaces/lightonai/LightOnOCR-1B-Demo)** | 📝 **[Read the blog](https://huggingface.co/blog/lightonai/lightonocr/)** |
| 23121 | + |
| 23122 | + --- |
| 23123 | + |
| 23124 | + **Ideal for developers, researchers, and enterprises seeking fast, accurate, and affordable document intelligence.** |
| 23125 | + overrides: |
| 23126 | + parameters: |
| 23127 | + model: LightOnOCR-1B-1025-Q4_K_M.gguf |
| 23128 | + files: |
| 23129 | + - filename: LightOnOCR-1B-1025-Q4_K_M.gguf |
| 23130 | + sha256: da36fb008a81128553933a15dc6373c1d0692e3ed1c17e9115521d84c473dbd5 |
| 23131 | + uri: huggingface://noctrex/LightOnOCR-1B-1025-GGUF/LightOnOCR-1B-1025-Q4_K_M.gguf |
0 commit comments