checks: add retrieval quality checks #2451
Conversation
Code Review
This pull request introduces a comprehensive set of retrieval quality metrics (Recall@K, Precision@K, HitRate@K, MRR, NDCG@K, and InfAP) along with corresponding unit tests. Feedback suggests refining the `_as_sequence` helper to handle `None` values correctly, changing the Precision@K calculation to use the standard denominator of `k`, and renaming the `InfAP` metric to `AveragePrecision` to align with information-retrieval terminology.
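The suggested Precision@K fix can be sketched as follows. This is a minimal standalone illustration, not the PR's actual code; the function name and signature are hypothetical:

```python
def precision_at_k(retrieved_ids: list[str], relevant_ids: set[str], k: int) -> float:
    """Precision@K with the standard denominator: relevant hits among the
    top-k retrieved documents, divided by k (not by how many were retrieved)."""
    if k <= 0:
        raise ValueError("k must be positive")
    relevant = set(relevant_ids)
    hits = sum(1 for doc_id in retrieved_ids[:k] if doc_id in relevant)
    return hits / k

# 2 of the top 3 retrieved documents are relevant -> 2/3
score = precision_at_k(["d1", "d2", "d3"], {"d1", "d3"}, k=3)
```

Using `k` as the denominator means a system that retrieves fewer than `k` documents is penalized for the missing slots, which is the conventional IR definition.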
@Check.register("inf_ap")
class InfAP[InputType, OutputType, TraceType: Trace](  # pyright: ignore[reportMissingTypeArgument]
The metric implemented here is standard Average Precision (AP). In IR literature, Inferred Average Precision (InfAP) refers to a specific estimator designed for incomplete relevance judgments (where some documents are unjudged). Since this implementation assumes strict exact-ID matching against a provided set (complete judgment), it should be renamed to AveragePrecision to avoid confusion with the specialized InfAP metric.
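For reference, the standard Average Precision computation under complete relevance judgments can be sketched like this (an illustrative standalone sketch; the name and signature are not the PR's API):

```python
def average_precision(retrieved_ids: list[str], relevant_ids: set[str]) -> float:
    """Standard AP: the mean of Precision@i taken over the ranks i at which
    a relevant document appears, normalized by the total number of relevant docs."""
    relevant = set(relevant_ids)
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, doc_id in enumerate(retrieved_ids, start=1):
        if doc_id in relevant:
            hits += 1
            precision_sum += hits / rank  # Precision@rank at this hit
    return precision_sum / len(relevant)

# Hits at ranks 1 and 3 out of 2 relevant docs: (1/1 + 2/3) / 2 = 5/6
score = average_precision(["a", "b", "c", "d"], {"a", "c"})
```

Note that every document is assumed to be judged (relevant or not), which is exactly the condition under which this formula is plain AP rather than the InfAP estimator.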
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Closes #2445
Summary
- Adds `RecallAtK`, `PrecisionAtK`, `HitRateAtK`, `MRR`, `NDCGAtK`, and `InfAP` checks
- Each check is configurable via a threshold, JSONPath keys for relevant/retrieved IDs, and `k` where applicable
- Checks are registered under `giskard.checks`
- Unit tests cover duplicate retrieved IDs, missing keys, and registry validation
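As one more illustration of the rank-based metrics listed above, MRR reduces to the reciprocal rank of the first relevant hit. This is a hedged sketch of the standard formula, not the PR's implementation:

```python
def reciprocal_rank(retrieved_ids: list[str], relevant_ids: set[str]) -> float:
    """Reciprocal rank of the first relevant document; 0.0 if none is retrieved.
    MRR is the mean of this value over a set of queries."""
    relevant = set(relevant_ids)
    for rank, doc_id in enumerate(retrieved_ids, start=1):
        if doc_id in relevant:
            return 1.0 / rank
    return 0.0

# First relevant document appears at rank 2 -> 1/2
score = reciprocal_rank(["x", "a", "b"], {"a"})
```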
Scope
This PR implements the strict exact-ID matching strategy first. Cosine similarity, LLM-judged relevance, and
documentation updates can be added in follow-up PRs.
Testing
- `uv run -m pytest -q libs/giskard-checks/tests/builtin/test_retrieval.py`
- `uv run -m pytest -q libs/giskard-checks/tests/builtin`
- `uv run ruff check ...`
- `uv run basedpyright ...`