fix(processing): ensure JSON decode errors are caught by retry; add regression tests for JSON mode (#1856) by devin-ai-integration[bot] · Pull Request #1857 · 567-labs/instructor

devin-ai-integration · 2025-10-21T03:22:08Z

fix(processing): ensure JSON decode errors are caught by retry handler (#1856)

Summary

Fixed a bug where JSONDecodeError in JSON mode was being wrapped in ValueError, causing it to bypass the retry mechanism's validation error handler. This prevented handle_reask_kwargs from being called, so retries didn't receive error feedback to improve subsequent attempts.

Root cause: In _validate_model_from_json, line 95 was wrapping JSONDecodeError in ValueError. The retry handler only catches ValidationError, JSONDecodeError, and InstructorValidationError explicitly, so the wrapped error fell through to the generic Exception handler.

Fix: Changed line 95 from raise ValueError(f"Failed to parse JSON: {e}") from e to just raise, allowing JSONDecodeError to propagate directly to the retry handler.

Changes:

Modified instructor/processing/function_calls.py: Removed ValueError wrapping (1 line change)
Updated tests/test_json_extraction.py: Fixed existing test + added test for non-strict mode
Added tests/test_retry_json_mode.py: New regression tests for JSON/validation error retry behavior

Review & Testing Checklist for Human

⚠️ Risk Level: Medium - Touches critical retry mechanism but changes are minimal and well-tested

Test with real API calls: Create a scenario where an LLM returns invalid JSON in JSON mode with max_retries > 0. Verify that:
- The retry mechanism now properly catches the error
- handle_reask_kwargs is called between attempts
- Error feedback is injected into subsequent retry messages
Verify no regressions: Check that existing code doesn't rely on ValueError being raised from _validate_model_from_json. Search codebase for catches of ValueError that might be affected.
Test across modes: Verify the fix works in both strict and non-strict validation modes (Pydantic raises different exceptions in each case)
CI checks: Ensure all provider integration tests pass, not just core tests

Test Plan Recommendation

import instructor
from pydantic import BaseModel
from openai import OpenAI

class User(BaseModel):
    name: str
    age: int

# Force invalid JSON response and verify retry mechanism works
client = instructor.from_openai(OpenAI(), mode=instructor.Mode.JSON)
try:
    # Use a prompt likely to produce invalid JSON or mock the response
    result = client.chat.completions.create(
        model="gpt-4o-mini",
        response_model=User,
        messages=[{"role": "user", "content": "Return invalid JSON"}],
        max_retries=2
    )
except instructor.core.exceptions.InstructorRetryException as e:
    # Verify failed_attempts contains JSONDecodeError or ValidationError
    print(f"Attempts: {e.n_attempts}, Failed: {len(e.failed_attempts)}")
    for attempt in e.failed_attempts:
        print(f"  Attempt {attempt.attempt_number}: {type(attempt.exception)}")

Notes

Tests pass locally (core tests and new regression tests)
The fix is minimal (1 line) but touches a critical code path
Pydantic's behavior differs between strict/non-strict mode, both are now handled correctly
Regression tests use mocks, so real-world validation is important

Session: Requested by Jason Liu (@jxnl) - https://app.devin.ai/sessions/9184a01bca9d4e7e97e3aade131cf0ea

Important

Fixes JSON decode error handling in retry mechanism by removing ValueError wrapping in function_calls.py and adds regression tests.

Behavior:
- Fixes bug in _validate_model_from_json in function_calls.py by removing ValueError wrapping for JSONDecodeError, allowing it to be caught by the retry handler.
- Ensures handle_reask_kwargs is called for retries, providing error feedback for subsequent attempts.
Tests:
- Updates test_json_extraction.py to include tests for JSON decode errors in strict and non-strict modes.
- Adds test_retry_json_mode.py with regression tests to verify retry behavior for JSONDecodeError and ValidationError.
Misc:
- Minor logging changes in function_calls.py to improve error visibility.

^{This description was created by}^{for dd09136. You can customize this summary. It will automatically update as commits are pushed.}

devin-ai-integration · 2025-10-21T03:22:13Z

🤖 Devin AI Engineer

I'll be helping with this pull request! Here's what you should know:

✅ I will automatically:

Address comments on this PR. Add '(aside)' to your comment to have me ignore it.
Look at CI failures and help fix them

Note: I can only respond to comments from users who have write access to this repository.

⚙️ Control Options:

Disable automatic comment and CI monitoring

ellipsis-dev

Important

Looks good to me! 👍

Reviewed everything up to 42801ce in 1 minute and 27 seconds. Click for details.

Reviewed 141 lines of code in 3 files
Skipped 0 files when reviewing.
Skipped posting 3 draft comments. View those below.
Modify your settings and rules to customize what types of comments Ellipsis leaves. And don't forget to react with 👍 or 👎 to teach Ellipsis.

1. instructor/processing/function_calls.py:95

Draft comment:
Re-raising the original JSONDecodeError (instead of wrapping it in ValueError) ensures it gets caught by the retry mechanism. Consider adding an inline comment referencing issue #1856 so future maintainers understand why this change was made.
Reason this comment was not posted:
Decided after close inspection that this draft comment was likely wrong and/or not actionable: usefulness confidence = 80% vs. threshold = 85% The comment makes a good point about documenting the rationale behind error handling changes. The change from wrapping to re-raising appears intentional and tied to retry behavior. Without documentation, future maintainers may not understand why this specific error handling approach was chosen and revert it. The comment assumes issue #1856 exists and is relevant, but we don't have access to verify that. The retry mechanism mentioned isn't visible in the code shown. Even without seeing issue #1856, documenting error handling decisions is a good practice, especially when changing from a more explicit approach (wrapping in ValueError) to a less explicit one (re-raising). Keep the comment as it promotes good documentation practices for error handling changes that may not be immediately obvious to future maintainers.

2. tests/test_retry_json_mode.py:23

Draft comment:
The new regression tests correctly simulate retry behavior for both JSONDecodeError (in non-strict mode) and ValidationError (in strict mode). This effectively ensures that the retry handler processes these errors as intended.
Reason this comment was not posted:
Comment did not seem useful. Confidence is useful = 0% <= threshold 85% This comment is purely informative and does not provide any actionable feedback or suggestions for improvement. It simply praises the existing tests without suggesting any changes or identifying any issues.

3. tests/test_retry_json_mode.py:28

Draft comment:
Typo: In the docstring, "handle_reask_kwargs" appears to be a misspelling. Did you mean "handle_retry_kwargs"?
Reason this comment was not posted:
Decided after close inspection that this draft comment was likely wrong and/or not actionable: usefulness confidence = 80% vs. threshold = 85% The comment is about a real change in the diff since this is a new file. The suggestion seems reasonable as 'reask' vs 'retry' is inconsistent with the terminology used elsewhere in the file. However, without access to the actual codebase, I can't be 100% certain that 'handle_reask_kwargs' isn't a real function name somewhere. I don't have access to the full codebase to verify if 'handle_reask_kwargs' might be a valid function name in some other file. The comment could be wrong if this is actually the correct function name. While I can't be 100% certain, the consistent use of 'retry' throughout this file and the test name itself strongly suggests this is indeed a typo that should be fixed for clarity and consistency. Keep the comment as it appears to identify a genuine inconsistency in terminology that should be fixed for better code clarity and maintenance.

Workflow ID: wflow_mIBQoYgdlW1x5SVX

^{You can customize}^{by changing your verbosity settings, reacting with 👍 or 👎, replying to comments, or adding code review rules.}

cloudflare-workers-and-pages · 2025-10-21T03:23:50Z

Deploying with Cloudflare Workers

The latest updates on your project. Learn more about integrating Git with Workers.

Status	Name	Latest Commit	Preview URL	Updated (UTC)
✅ Deployment successful! View logs	instructor	`b4bc8a5`	Commit Preview URL Branch Preview URL	Oct 27 2025, 07:05 PM

ellipsis-dev

Important

Looks good to me! 👍

Reviewed dd09136 in 28 seconds. Click for details.

Reviewed 13 lines of code in 1 files
Skipped 0 files when reviewing.
Skipped posting 1 draft comments. View those below.
Modify your settings and rules to customize what types of comments Ellipsis leaves. And don't forget to react with 👍 or 👎 to teach Ellipsis.

1. tests/test_retry_json_mode.py:10

Draft comment:
Good cleanup: the unused 'patch' import was removed. This adheres to DRY principles; ensure that no test is relying on it.
Reason this comment was not posted:
Confidence changes required: 0% <= threshold 85% None

Workflow ID: wflow_imGJV6cqVDN0pOdw

^{You can customize}^{by changing your verbosity settings, reacting with 👍 or 👎, replying to comments, or adding code review rules.}

…tches it; add regression tests for JSON/Validation errors in retry (closes #1856) Co-Authored-By: Jason Liu <[email protected]>

Co-Authored-By: Jason Liu <[email protected]>

devin-ai-integration Bot assigned jxnl Oct 21, 2025

devin-ai-integration Bot requested a review from jxnl October 21, 2025 03:22

github-actions Bot added bug Something isn't working enhancement New feature or request python Pull requests that update python code labels Oct 21, 2025

ellipsis-dev Bot reviewed Oct 21, 2025

View reviewed changes

jxnl enabled auto-merge (rebase) October 21, 2025 13:01

devin-ai-integration Bot and others added 2 commits October 27, 2025 14:58

fix(processing): do not wrap JSONDecodeError in JSON mode so retry ca…

a7864c2

…tches it; add regression tests for JSON/Validation errors in retry (closes #1856) Co-Authored-By: Jason Liu <[email protected]>

test(retry): fix ruff unused import in new JSON mode retry tests

b4bc8a5

Co-Authored-By: Jason Liu <[email protected]>

jxnl force-pushed the devin/1761016877-json-retry-fix branch from dd09136 to b4bc8a5 Compare October 27, 2025 18:58

jxnl merged commit 3c01abc into main Oct 27, 2025
1 check failed

jxnl deleted the devin/1761016877-json-retry-fix branch October 27, 2025 18:58

jxnl mentioned this pull request Oct 27, 2025

pydantic Validation errors are not caught as expected in retry exception handler #1856

Closed

2 tasks

jxnl added the status:pending-merge Related PR is pending merge label Oct 27, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(processing): ensure JSON decode errors are caught by retry; add regression tests for JSON mode (#1856)#1857

fix(processing): ensure JSON decode errors are caught by retry; add regression tests for JSON mode (#1856)#1857
jxnl merged 2 commits intomainfrom
devin/1761016877-json-retry-fix

devin-ai-integration Bot commented Oct 21, 2025 •

edited by ellipsis-dev Bot

Loading

Uh oh!

devin-ai-integration Bot commented Oct 21, 2025

Uh oh!

ellipsis-dev Bot left a comment

Uh oh!

cloudflare-workers-and-pages Bot commented Oct 21, 2025 •

edited

Loading

Uh oh!

ellipsis-dev Bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

devin-ai-integration Bot commented Oct 21, 2025 • edited by ellipsis-dev Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

fix(processing): ensure JSON decode errors are caught by retry handler (#1856)

Summary

Review & Testing Checklist for Human

Test Plan Recommendation

Notes

Uh oh!

devin-ai-integration Bot commented Oct 21, 2025

🤖 Devin AI Engineer

Uh oh!

ellipsis-dev Bot left a comment

Choose a reason for hiding this comment

Uh oh!

cloudflare-workers-and-pages Bot commented Oct 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Deploying with Cloudflare Workers

Uh oh!

ellipsis-dev Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

devin-ai-integration Bot commented Oct 21, 2025 •

edited by ellipsis-dev Bot

Loading

cloudflare-workers-and-pages Bot commented Oct 21, 2025 •

edited

Loading