feat(giskard-checks): minimal OWASP LLM suite generator (LLM01 indirect injection)#2438

Merged
kevinmessiaen merged 29 commits into main from feat/minimal-suite-generator on May 13, 2026

Conversation

@kevinmessiaen (Member) commented May 7, 2026

Summary

  • Adds BaseLLMGenerator + LLMGenerator — a multi-turn input generator hierarchy mirroring the existing BaseLLMCheck/LLMJudge pattern
  • Refactors UserSimulator to extend BaseLLMGenerator (removes duplicated loop logic)
  • Adds ScenarioCategory enum + generate_suite() factory that loads predefined OWASP scenarios from JSONL datasets and injects agent description as an annotation
  • Ships one LLM01:2025 indirect injection scenario (JSONL + Jinja2 prompt template), with multiple_runs=5 for replayability
  • Adds InputGenerationException for generator-side errors (e.g. schema incompatibility)
  • Adds input_type support to InputGenerator.__call__ — generators can now produce structured BaseModel inputs, not just str
  • LLMGeneratorOutput[T] is now generic; the LLM is asked to produce a T-typed message via with_output(LLMGeneratorOutput[T])
  • Interact.generate() infers input_type at runtime from the target callable's first parameter annotation — no API change required at the call site
  • LLMGenerator gains as_template: bool = False — when True, renders the inline prompt as a Jinja2 template (enabling {{ _instr_output }} schema injection); default False guards against prompt injection from user-controlled strings
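The runtime input-type inference mentioned above can be sketched roughly as follows. The helper name and exact rules are assumptions, not the library's actual code: read the target callable's first parameter annotation and fall back to str when it is unannotated.

```python
import inspect
from typing import get_type_hints

# Hypothetical sketch of first-parameter input_type inference: not the
# actual giskard-checks implementation, just the idea described in the PR.
def infer_input_type(fn) -> type:
    hints = get_type_hints(fn)            # resolved annotations, if any
    params = list(inspect.signature(fn).parameters)
    if params and params[0] in hints:
        return hints[params[0]]           # e.g. a BaseModel subclass
    return str                            # unannotated targets default to str
```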

Usage (str target, no change needed):

def my_agent_adapter(input: str) -> str:
    return my_agent({"content": input, "role": "user"})

suite = generate_suite(
    categories=[ScenarioCategory.LLM01_INDIRECT_INJECTION],
    description="A documentation chatbot for Giskard",
)
suite.run(target=my_agent_adapter)

Usage (structured BaseModel target):

class UserMessage(BaseModel):
    role: str
    content: str

def my_agent(input: UserMessage) -> str:
    return call_llm(input)

suite = generate_suite(
    categories=[ScenarioCategory.LLM01_INDIRECT_INJECTION],
    description="A documentation chatbot for Giskard",
)
suite.run(target=my_agent)  # input_type=UserMessage inferred automatically

Test Plan

  • uv run pytest libs/giskard-checks/tests/ -q — 514 passed, 4 skipped
  • Import smoke test: from giskard.checks import BaseLLMGenerator, LLMGenerator, ScenarioCategory, generate_suite, UserSimulator, Suite, InputGenerationException
  • generate_suite() returns a Suite with 1 scenario, multiple_runs=5, and annotations["description"] injected
  • Scenario with BaseModel-annotated target: input_type inferred, LLM produces structured output
  • schema_issue set by LLM → InputGenerationException raised with "schema issue: ..." message
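The schema_issue behavior checked above can be illustrated with a small hedged sketch; the class body and helper function here are illustrative, not the library's exact code.

```python
# Illustrative sketch of the schema_issue handling: the real
# InputGenerationException lives in giskard-checks; this standalone
# version only mirrors the "schema issue: ..." message format.
class InputGenerationException(Exception):
    """Raised when the generator cannot produce a valid input."""

def raise_on_schema_issue(schema_issue):
    if schema_issue is not None:
        raise InputGenerationException(f"schema issue: {schema_issue}")
```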

🤖 Generated with Claude Code

kevinmessiaen and others added 10 commits May 7, 2026 13:22
Implement BaseLLMGenerator (abstract multi-turn LLM generator base) and LLMGenerator (configurable via inline prompt or template path), mirroring the BaseLLMCheck/LLMJudge pattern on the generator side. Register LLMGenerator under the 'llm_generator' kind discriminator and expose both classes from the generators package.

Co-Authored-By: Claude Sonnet 4.6 <[email protected]>
- Remove @override from BaseLLMGenerator.__call__ (async generator protocol differs)
- Align LLMGenerator validator error messages to match LLMJudge style (hyphen, no trailing period)
- Remove dead second response in test_llm_generator_stops_at_max_steps
- Add tests for max_steps=0 and message=None-with-goal-not-reached cases
- Extract MockGenerator and LLMTrace to shared conftest.py; add __init__.py to enable relative imports

Co-Authored-By: Claude Sonnet 4.6 <[email protected]>
- Create scenarios subdirectory and llm01_indirect_injection.j2 template
- Template drives LLMGenerator to simulate indirect prompt injection attacks
- Support multi-turn interactions with configurable max_turns limit

Co-Authored-By: Claude Sonnet 4.6 <[email protected]>
…_suite, ScenarioCategory

Co-Authored-By: Claude Sonnet 4.6 <[email protected]>
…e generic parameters

- Changed the return type annotation of generate_suite() to Suite[str, Any] for improved type clarity.
kevinmessiaen and others added 9 commits May 7, 2026 15:45
…ype for structured output

Replaces the ValueError stub with a real implementation: parameterizes
LLMGeneratorOutput[T] from input_type, raises InputGenerationException on
schema_issue, and adds overloads for typed return. Tests cover BaseModel
output, schema_issue raising, schema inclusion, and backward-compatible str
output.

Co-Authored-By: Claude Sonnet 4.6 <[email protected]>
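The generic output described in this commit can be sketched with a plain dataclass; the real class is presumably a Pydantic model, and the fields here are inferred from the PR description rather than copied from the source.

```python
from dataclasses import dataclass
from typing import Generic, Optional, TypeVar

T = TypeVar("T")

# Hedged sketch of a generic generator output (stdlib dataclass standing in
# for the library's Pydantic model): message carries the T-typed input,
# goal_reached signals the simulated goal, schema_issue reports mismatches.
@dataclass
class LLMGeneratorOutput(Generic[T]):
    message: Optional[T] = None
    goal_reached: bool = False
    schema_issue: Optional[str] = None
```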
kevinmessiaen and others added 8 commits May 11, 2026 10:11
…mpts to enforce goal_reached and message rules on first turn
- Document schema_issue field in user_simulator.j2 and llm01_indirect_injection.j2 prompts
- Fix _infer_input_type to fall back to __call__ hints for callable-class targets
- Add tests for callable-class input type inference
- Change generate_suite() categories param to optional (None = all categories)
- Add docstring to InputGenerationException
- Remove trivial test_exceptions.py (covered by test_llm_generator.py)
- Add tests/scenarios/__init__.py for consistency

Co-Authored-By: Claude Sonnet 4.6 <[email protected]>
…tances in Python 3.14+

- Import inspect to facilitate type hint inspection.
- Update fallback mechanism for callable instances to correctly retrieve parameter hints from __call__.
- Ensure compatibility with changes in get_type_hints behavior in Python 3.14+.
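The fallback in this commit might look roughly like the following sketch (the helper name and details are assumptions): for callable class instances, type hints are read from the class's __call__ method rather than the instance itself.

```python
import inspect
from typing import get_type_hints

# Hypothetical sketch of the __call__ fallback: get_type_hints on a callable
# instance is unreliable across Python versions, so inspect the class's
# __call__ method explicitly when the target is not a plain function/method.
def hints_for_target(target):
    if inspect.isfunction(target) or inspect.ismethod(target):
        return get_type_hints(target)
    call = getattr(type(target), "__call__", None)
    if call is None:
        raise TypeError("target is not callable")
    return get_type_hints(call)
```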
