Add support for video/audio formats for multimodal inputs#2181
Add support for video/audio formats for multimodal inputs#2181k8s-ci-robot merged 1 commit intokubernetes-sigs:mainfrom
Conversation
✅ Deploy Preview for gateway-api-inference-extension ready!
To edit notification comments on pull requests, go to your Netlify project configuration. |
yeah, I used "addresses" not "fixes" to imply it addresses it partly. But I agree with you to keep the 2 issues separate for better tracking the progress. I updated the scope of the issue to reflect the same. Thanks! |
|
/lgtm |
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: ahg-g, rahulgurnani The full list of commands accepted by this bot can be found here. The pull request process is described here DetailsNeeds approval from an approver in each of these files:
Approvers can indicate their approval by writing |
What type of PR is this?
What this PR does / why we need it:
Presently, only image is supported for multimodal inputs. vllm supports video/audio formats as well: https://docs.vllm.ai/en/stable/features/multimodal_inputs/
Which issue(s) this PR fixes:
Addresses #1980
Does this PR introduce a user-facing change?: