Skip to content

Is it possible to improve the batch process? #151

@goriholdings

Description

@goriholdings

Currently, the batch process starts vlm ocr after the layout check.
At this point, we have confirmed that PDFs with a large number of pages have a higher pages/second rate.
(Conversely, PDFs with a small number of pages have a lower pages/second rate.)

When starting a batch job, is it possible to perform layout checks on all files at once
and then start VLM OCR on all files at once?

I hope that I can achieve a higher pages/sec by collecting all PDFs and processing them at once. (I'm not sure if this is correct.)

(Results from rtx 4090.)
Results when processing a 56-page PDF

Layout time: 6.2, image num: 56
Doc analyze time: 68.03, speed: 0.82 pages/second

Results when processing a 343-page PDF

Layout time: 31.44, image count: 343
Doc analyze time: 202.18, speed: 1.7 pages/second

Metadata

Metadata

Assignees

Labels

doneWork on this issue is complete.enhancementNew feature or requestgood first issueGood for newcomers

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions