checks: add readability NLP metric check #2412
Conversation
Code Review
This pull request adds a Readability check using the textstat library, allowing users to validate text quality metrics like Flesch Reading Ease. The implementation includes optional dependency handling and threshold validation. Reviewers suggested expanding the list of supported metrics, ensuring JSON serializability for error details, and adding error handling around the metric computation to improve robustness.
ReadabilityMetric = Literal[
    "flesch_reading_ease",
    "flesch_kincaid_grade",
    "gunning_fog",
]
The ReadabilityMetric literal is currently limited to only three metrics. To better align with the "configurable metric" goal mentioned in the PR description, consider expanding this list to include other common metrics supported by textstat, such as automated_readability_index, coleman_liau_index, and dale_chall_readability_score.
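A sketch of the expanded alias, restricted to names that exist as functions in textstat's public API:

from typing import Literal

# Sketch only: extends the PR's alias with the additional textstat metrics
# suggested above.
ReadabilityMetric = Literal[
    "flesch_reading_ease",
    "flesch_kincaid_grade",
    "gunning_fog",
    "automated_readability_index",
    "coleman_liau_index",
    "dale_chall_readability_score",
]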
return CheckResult.failure(
    message=(
        f"Value for key '{self.key}' must be a string, but found "
        f"{type(text).__name__}."
    ),
    details={**details, "value": text},
)
Including the raw text value in the details dictionary when it is not a string (e.g., a complex dictionary or custom object) may lead to serialization errors if the CheckResult is converted to JSON for the UI or storage. It is safer to store a string representation of the value.
Suggested change:
-return CheckResult.failure(
-    message=(
-        f"Value for key '{self.key}' must be a string, but found "
-        f"{type(text).__name__}."
-    ),
-    details={**details, "value": text},
-)
+if not isinstance(text, str):
+    return CheckResult.failure(
+        message=(
+            f"Value for key '{self.key}' must be a string, but found "
+            f"{type(text).__name__}."
+        ),
+        details={**details, "value": str(text)},
+    )
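To illustrate the serialization hazard the comment describes, a minimal standard-library sketch (the set value is hypothetical):

import json

details = {"value": {1, 2, 3}}  # e.g. a set stored as the raw value
# json.dumps(details) raises TypeError: Object of type set is not JSON serializable
safe_details = {"value": str(details["value"])}
print(json.dumps(safe_details))  # {"value": "{1, 2, 3}"}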
score_fn = getattr(textstat, self.metric)
score = float(score_fn(text))
The call to textstat metrics can raise exceptions due to internal processing errors or missing NLTK data (e.g., the punkt tokenizer). Wrapping this in a try-except block and returning a CheckStatus.ERROR result makes the check more robust and provides better feedback than an unhandled exception.
try:
    score = float(getattr(textstat, self.metric)(text))
except Exception as e:
    return CheckResult(
        status=CheckStatus.ERROR,
        message=f"Failed to compute readability score ({self.metric}): {e}",
        details={**details, "error": str(e)},
    )
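A hedged test sketch for this error path, assuming pytest; the Readability constructor and the dict-based call convention are inferred from the PR description and diff context, not confirmed by the diff itself:

def test_metric_failure_returns_error_status(monkeypatch):
    # Hypothetical names: Readability, CheckStatus, and the call convention
    # below are assumptions based on the PR description.
    from giskard.checks import CheckStatus, Readability

    def broken_metric(_text):
        raise RuntimeError("punkt tokenizer data is missing")

    # Force the underlying textstat call to fail.
    monkeypatch.setattr("textstat.flesch_reading_ease", broken_metric)

    check = Readability(key="answer", metric="flesch_reading_ease", min_score=60.0)
    result = check({"answer": "Some sample text."})
    assert result.status == CheckStatus.ERROR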
Updated/added unit tests.
Hi @harsh21234i, can you resolve the conflict? Also, can we document some realistic and expected readability scores across the various proposed metrics, so the end user can understand the min-max paradigm? After that we should be good to merge.
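For context, commonly cited interpretation ranges for the proposed metrics (approximate guidance from the readability literature, not values defined in this PR; the EXPECTED_RANGES name is hypothetical):

# Higher Flesch Reading Ease means easier text; the other two metrics
# approximate US school grade levels, where higher means harder.
EXPECTED_RANGES = {
    "flesch_reading_ease": "0-100; 60-70 reads as plain English, 90+ very easy",
    "flesch_kincaid_grade": "US grade level; ~5 is easy, 12+ is difficult",
    "gunning_fog": "US grade level; below 12 suits a wide audience, 17+ is graduate-level",
}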
| "rich>=14.2.0,<15", | ||
| ] | ||
|
|
||
| [project.optional-dependencies] |
Can we make these extras more specific, like readability?
Done, can you recheck, sir!
Adds a new built-in Readability check backed by textstat, with
configurable metric, min_score, and max_score. Exposes it via
giskard.checks exports, wires it into giskard.checks.builtin, adds
textstat under the nlp optional extra (giskard-checks[nlp]), and includes
unit tests covering pass/fail cases, missing key/type errors, missing
dependency behavior, and invalid threshold ranges. Related to issue
#2349.
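A minimal usage sketch based on the description above, after installing the extra with pip install "giskard-checks[nlp]"; the import path, constructor arguments, and call convention are assumptions, not confirmed API:

from giskard.checks import Readability  # assumed export per the PR description

# Fail any answer scoring below plain-English territory on Flesch Reading Ease.
check = Readability(
    key="answer",
    metric="flesch_reading_ease",
    min_score=60.0,
)
result = check({"answer": "Short, clear sentences keep this score high."})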