fix: preserve conversation context when provider fallback activates by rfpassos · Pull Request #13235 · NousResearch/hermes-agent

rfpassos · 2026-04-20T23:52:47Z

Summary

rebuild api_messages on every retry attempt instead of reusing a payload prepared for the previous provider
prevent provider-specific transforms (for example Anthropic/OpenRouter prompt caching wrappers) from leaking into fallback requests
add regression coverage for fallback from codex/chat paths and prompt-cached Claude requests
make fallback tests self-contained so they do not depend on local provider config

Root cause

When the primary model failed, Hermes switched provider / model / api_mode in-place, but continued using an api_messages payload that had already been prepared for the previous backend.

That meant fallback requests could inherit backend-specific payload mutations such as:

Anthropic/OpenRouter cache_control wrappers
provider-specific tool-call sanitization state
message formatting intended for the old API mode

In practice this could look like context loss or broken continuity after fallback, because the new backend was not receiving a fresh request built from the canonical conversation state.

Fix

extracted per-attempt payload construction into AIAgent._build_api_messages_for_attempt(...)
moved API message reconstruction inside the retry loop
ensured each retry/fallback attempt recalculates request size from the rebuilt payload
preserved newer upstream behavior while resolving the cherry-pick conflict (/steer pre-API drain, cache layout selection, tool-argument repair)

Tests

Added regression coverage in:

tests/run_agent/test_fallback_context_preservation.py

Validated with:

pytest tests/run_agent/test_fallback_context_preservation.py -q
pytest tests/run_agent/test_fallback_model.py -q
pytest tests/run_agent/test_run_agent_codex_responses.py -q

Why this matters

Fallback should always start from the canonical conversation history, not from a payload already transformed for a different provider. This keeps tool state, prompt context, and conversation continuity intact when Hermes has to fail over mid-turn.

- rebuild api_messages inside each retry attempt - prevent provider-specific payload transforms from leaking into fallback - add regression tests for codex/chat and prompt-cache fallback cases - make fallback model tests self-contained

alt-glitch · 2026-04-22T11:14:14Z

Likely duplicate of PR #13654 — same root cause: fallback reuses provider-specific payload instead of rebuilding from canonical conversation state.

alt-glitch mentioned this pull request Apr 21, 2026

fix: preserve context when fallback switches runtime #13654

Open

alt-glitch added type/bug Something isn't working P1 High — major feature broken, no workaround comp/agent Core agent loop, run_agent.py, prompt builder labels Apr 22, 2026

bradhallett mentioned this pull request May 7, 2026

fix(agent): rebuild api_messages after provider fallback to apply reasoning requirements #21033

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: preserve conversation context when provider fallback activates#13235

fix: preserve conversation context when provider fallback activates#13235
rfpassos wants to merge 1 commit intoNousResearch:mainfrom
rfpassos:fix/fallback-preserve-context

rfpassos commented Apr 20, 2026

Uh oh!

alt-glitch commented Apr 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

rfpassos commented Apr 20, 2026

Summary

Root cause

Fix

Tests

Why this matters

Uh oh!

alt-glitch commented Apr 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants