-
Notifications
You must be signed in to change notification settings - Fork 11
Add support for bounding box extraction in images and LLM_PARSE via PaddleOCR
#128
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Pull Request Overview
This PR adds support for bounding box extraction and EasyOCR integration to enable parsing of images and PDFs with images. The primary goal is to extend the framework's capabilities to handle image-based documents while providing accurate bounding box coordinates for text elements.
- Adds EasyOCR dependency and parsing capability through STATIC_PARSE for image-based documents
- Implements automatic framework selection for bounding box detection based on document type
- Refactors bounding box functionality with improved fuzzy matching and bbox splitting for multi-word text
Reviewed Changes
Copilot reviewed 5 out of 7 changed files in this pull request and generated 3 comments.
Show a summary per file
| File | Description |
|---|---|
| pyproject.toml | Adds easyocr dependency |
| lexoid/core/utils.py | Adds bbox routing logic, refactors edit distance function, and improves bounding box matching |
| lexoid/core/parse_type/static_parser.py | Implements EasyOCR parser and adds image parsing support |
| lexoid/core/conversion_utils.py | Refactors base64 conversion functions for better modularity |
| lexoid/api.py | Integrates bbox extraction with automatic framework selection |
Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.
Co-authored-by: Copilot <[email protected]>
Co-authored-by: Copilot <[email protected]>
LLM_PARSE via EasyOCRLLM_PARSE via PaddleOCR
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Pull Request Overview
Copilot reviewed 5 out of 7 changed files in this pull request and generated 4 comments.
Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.
This PR adds support for
TODO: Update documentation