MultimodalQnA PDF Ingestion #16

Merged
mhbuehler merged 5 commits into mmqna-image-query from melanie/pdf on Jan 8, 2025

@mhbuehler mhbuehler commented Dec 12, 2024

Description

This PR adds PDF ingestion to the MultimodalQnA dataprep service. For each PDF, it extracts the images and text that appear on the same page, pairs them, embeds them with BridgeTower, and stores the results in Redis. It introduces a new external dependency, pymupdf, and accompanies a frontend PR in GenAIExamples.
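
For readers unfamiliar with pymupdf, the page-level pairing described above can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: the names `PageRecord`, `extract_pages`, and `pair_images_with_text` are hypothetical, and the choices to skip pages without images and to allow an empty caption are assumptions. The BridgeTower embedding and Redis storage steps are left out.

```python
from dataclasses import dataclass
from typing import Iterable, List, Tuple


@dataclass
class PageRecord:
    """Text and raw image bytes found on one PDF page (hypothetical container)."""
    page_number: int  # 1-based page number
    text: str
    images: List[bytes]


def extract_pages(pdf_path: str) -> List[PageRecord]:
    """Collect the text and embedded images of every page using pymupdf."""
    import pymupdf  # third-party; imported lazily so the pairing logic runs without it

    records = []
    with pymupdf.open(pdf_path) as doc:
        for index, page in enumerate(doc):
            text = page.get_text().strip()
            # Each entry returned by get_images() starts with the image's xref.
            images = [
                doc.extract_image(info[0])["image"]
                for info in page.get_images(full=True)
            ]
            records.append(PageRecord(index + 1, text, images))
    return records


def pair_images_with_text(
    records: Iterable[PageRecord],
) -> List[Tuple[int, bytes, str]]:
    """Return one (page_number, image_bytes, caption) tuple per image.

    Assumption: the full text of a page serves as the caption for every
    image on that page, and pages without images produce no pairs.
    """
    pairs = []
    for record in records:
        for image in record.images:
            pairs.append((record.page_number, image, record.text))
    return pairs
```

With pymupdf installed, `pair_images_with_text(extract_pages("example.pdf"))` would yield the image and caption pairs that a downstream step could embed and store; `example.pdf` is a placeholder path.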

Issues

RFC

Type of change

  • New feature (non-breaking change which adds new functionality)

Dependencies

New dependency for MMQnA: pymupdf

Tests

There is a new test in tests/dataprep/test_dataprep_multimodal_redis_langchain.sh and new tests in the PR for GenAIExamples MultimodalQnA.

Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com>
Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com>
@mhbuehler mhbuehler changed the title Adds PDF ingestion to MultimodalQnA dataprep service MultimodalQnA PDF Ingestion Dec 12, 2024
Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com>
Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com>

@dmsuehir dmsuehir left a comment

Could you also update the README here? It looks like it already mentions png, but we should describe the limitations and which format works best (for example, that text on the same page as an image is associated with it, whether there can be multiple images on the same page, etc.).

Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com>
@mhbuehler mhbuehler merged commit 236da36 into mmqna-image-query Jan 8, 2025