Skip to content

extract text when processing multi-modal prompts#1228

Merged
leondz merged 2 commits intoNVIDIA:mainfrom
jmartin-tech:fix/multi-modal-translation-support
May 23, 2025
Merged

extract text when processing multi-modal prompts#1228
leondz merged 2 commits intoNVIDIA:mainfrom
jmartin-tech:fix/multi-modal-translation-support

Conversation

@jmartin-tech
Copy link
Collaborator

#943 introduces translation of prompt text as an action taken during execution of the base Probe.probe() implementation with an assumption that all prompt entries are str type. The visual_jailbreak probe is a current outlier that builds a list of prompts with entires of dict type.

Different prompt types require different call patterns to language services.

Verification

List the steps needed to make sure this thing works

  • execute visual_jailbreak against a model with target_lang set to something other than "en"
  • new automation tests pass

@jmartin-tech jmartin-tech added this to the release 0.11.0 milestone May 21, 2025
Copy link
Collaborator

@leondz leondz left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

defensive guards in pre-Turn prompt structure

* guard for dictionary based prompt item
* revert note change to hold original prompt in full

Signed-off-by: Jeffrey Martin <[email protected]>
@jmartin-tech jmartin-tech requested a review from leondz May 22, 2025 17:04
@leondz leondz merged commit 83177bd into NVIDIA:main May 23, 2025
11 checks passed
@github-actions github-actions bot locked and limited conversation to collaborators May 23, 2025
@jmartin-tech jmartin-tech deleted the fix/multi-modal-translation-support branch May 28, 2025 14:25
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants