
update: add soft prompt caps to encoding probes#1154

Merged
leondz merged 6 commits into NVIDIA:main from leondz:update/encoding_switchable_prompt_cap
Apr 18, 2025

Conversation

@leondz
Collaborator

@leondz leondz commented Apr 9, 2025

Encoding probes now respect the soft prompt cap by default. The cap also covers prompt set expansion due to custom payloads.

Verification

Steps to check that this works:

  • run encoding probes with -g 1, and check that the prompt count stays under 256 (soft_probe_prompt_cap)
  • run encoding probes with extra payloads; check that the prompt count stays under the cap and that multiple payloads are represented in the prompts that come through
  • configure an encoding probe with follow_prompt_cap: false, and note that the prompt count can exceed the 256 cap (may require extra payloads)
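The capping behavior these steps exercise can be sketched roughly as follows. This is an illustrative sketch, not garak's actual implementation; the function name `apply_soft_cap` is hypothetical, while `soft_probe_prompt_cap` and `follow_prompt_cap` mirror the config options named above.

```python
import random

def apply_soft_cap(prompts, soft_probe_prompt_cap=256, follow_prompt_cap=True):
    """Downsample an expanded prompt set to the soft cap, if enabled.

    Illustrative sketch only: garak's real logic lives in the probe
    classes and may differ in detail.
    """
    if not follow_prompt_cap or len(prompts) <= soft_probe_prompt_cap:
        return list(prompts)
    # Sample without replacement, so prompts built from different
    # payloads remain represented in the capped set
    return random.sample(prompts, soft_probe_prompt_cap)

# e.g. 4 payloads x 100 encodings = 400 candidate prompts
candidates = [f"payload{p}-enc{e}" for p in range(4) for e in range(100)]
capped = apply_soft_cap(candidates)            # respects the 256 cap
uncapped = apply_soft_cap(candidates, follow_prompt_cap=False)  # all 400
```

With `follow_prompt_cap: false` the full expanded set passes through, which is why that verification step expects the count to exceed 256.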

@leondz leondz added the probes Content & activity of LLM probes label Apr 9, 2025
@leondz leondz requested a review from jmartin-tech April 9, 2025 10:33
Collaborator

@jmartin-tech jmartin-tech left a comment


The targeted change looks reasonable.

This PR highlights a configurable pattern divergence for this module that should be refactored soon.

The module-level _load_payloads() method here should not need to access _config via a hardcoded path. This also causes churn in the module-level globals for payloads and extra_tags.
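The divergence flagged here can be illustrated with a minimal sketch. All names below are hypothetical simplifications, not garak's real code; the point is the contrast between a module-level loader that rebinds globals and per-instance configuration in the style of Configurable.

```python
# Hypothetical sketch, not garak's real code: contrasts the current
# module-level pattern with a per-instance, Configurable-style one.

# Current pattern (simplified): a module-level loader reads config
# through a hardcoded path and rebinds module globals, so payloads
# and extra_tags churn whenever the loader runs.
payloads = []

def _load_payloads(config):
    global payloads
    payloads = config.get("payloads", ["default"])

# Suggested pattern: payload selection is a class-level default
# overridden per instance, with no module globals involved.
class EncodingProbe:
    DEFAULT_PARAMS = {"payloads": ["default"]}

    def __init__(self, config=None):
        params = {**self.DEFAULT_PARAMS, **(config or {})}
        self.payloads = params["payloads"]

probe = EncodingProbe({"payloads": ["custom", "default"]})
```

Keeping payload choice on the instance avoids the hardcoded _config access and the global-state churn noted in the review.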

@leondz
Collaborator Author

leondz commented Apr 9, 2025

Agree, good catch

Collaborator

@erickgalinkin erickgalinkin left a comment


lgtm

@jmartin-tech
Collaborator

Prior to this change, triggers were already populated when Probe.__init__() was called.

Merge of #943 suggests some adjustments here to ensure language (bcp47) values behave as expected. Triggers are now being populated after probe initialization without translation. Prompts in this probe may also need to skip translation, as the bulk of each prompt is the encoded values injected into templates with limited language instruction, and the triggers are specific to the raw payloads.

I suspect adjusting the distribution of payloads as a class-level param, following the patterns in Configurable, may offer a clear path forward.

Taking this as an action item to resolve during testing to ensure original PR intent is completed.

@jmartin-tech jmartin-tech self-assigned this Apr 10, 2025
Collaborator

@jmartin-tech jmartin-tech left a comment


Consistency question noted, thoughts?

@leondz
Collaborator Author

leondz commented Apr 17, 2025

Yup, sure. The comment diffs were out of date, else I would've put them through.

@leondz
Collaborator Author

leondz commented Apr 18, 2025

Noting that the language provision here attempts to translate payloads that (a) are looked for as exact matches and (b) may explicitly have non-language content assigned. Not to be addressed in this PR.

@leondz leondz merged commit bb4b03f into NVIDIA:main Apr 18, 2025
9 checks passed
@github-actions github-actions bot locked and limited conversation to collaborators Apr 18, 2025
