-
Notifications
You must be signed in to change notification settings - Fork 425
Description
Currently, the batch process starts vlm ocr after the layout check.
At this point, we have confirmed that PDFs with a large number of pages have a higher pages/second rate.
(Conversely, PDFs with a small number of pages have a lower pages/second rate.)
When starting a batch job, is it possible to perform layout checks on all files at once
and then start VLM OCR on all files at once?
I hope that I can achieve a higher pages/sec by collecting all PDFs and processing them at once. (I'm not sure if this is correct.)
(Results from rtx 4090.)
Results when processing a 56-page PDF
Layout time: 6.2, image num: 56
Doc analyze time: 68.03, speed: 0.82 pages/second
Results when processing a 343-page PDF
Layout time: 31.44, image count: 343
Doc analyze time: 202.18, speed: 1.7 pages/second