GitHub - scb-10x/typhoon-ocr

Typhoon OCR

Typhoon OCR is a model for extracting structured markdown from images or PDFs. It supports document layout analysis and table extraction, returning results in markdown or HTML. This package is a simple Gradio website to demonstrate the performance of Typhoon OCR.

Features

Upload a PDF or image (single page)
Extracts and reconstructs document content as markdown
Supports different prompt modes for layout or structure
Language: English, Thai
Uses a local or remote OpenAI-compatible API (e.g., vllm, opentyphoon.ai)
See blog for more detail https://opentyphoon.ai/blog/en/typhoon-ocr-release

Requirements

Linux / Mac with python (window not supported at the moment)

Install

pip install typhoon-ocr

or to run the gradio app.

pip install -r requirements.txt
# edit .env
# pip install vllm # optional for hosting a local server

Mac specific

brew install poppler
# The following binaries are required and provided by poppler:
# - pdfinfo
# - pdftoppm

Linux specific

sudo apt-get update
sudo apt-get install poppler-utils
# The following binaries are required and provided by poppler-utils:
# - pdfinfo
# - pdftoppm

Start vllm

vllm serve scb10x/typhoon-ocr-7b --served-model-name typhoon-ocr --dtype bfloat16 --port 8101

Run Gradio demo

python app.py

Dependencies

openai
python-dotenv
ftfy
pypdf
gradio
vllm (for hosting an inference server)
pillow

Debug

If Error processing document occur. Make sure you have install brew install poppler or apt-get install poppler-utils.

License

This project is licensed under the Apache 2.0 License. See individual datasets and checkpoints for their respective licenses.

Name		Name	Last commit message	Last commit date
Latest commit History 40 Commits
examples		examples
packages/typhoon_ocr		packages/typhoon_ocr
tests		tests
.env.template		.env.template
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Typhoon OCR

Features

Requirements

Install

Mac specific

Linux specific

Start vllm

Run Gradio demo

Dependencies

Debug

License

About

Uh oh!

Releases

Packages

Contributors 3

Uh oh!

Languages

License

scb-10x/typhoon-ocr

Folders and files

Latest commit

History

Repository files navigation

Typhoon OCR

Features

Requirements

Install

Mac specific

Linux specific

Start vllm

Run Gradio demo

Dependencies

Debug

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Uh oh!

Languages

Packages