
perf: prompt loop loads entire conversation history into memory on every step #18136

@BYK

Description

The prompt loop in prompt.ts calls filterCompacted(stream(sessionID)) on every iteration of its while(true) loop. For long-running sessions (e.g., 7,704 messages, 27,895 parts, ~91MB of data), this loads the entire conversation history into the JS heap on each tool-call step.
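A minimal sketch of the pattern described above (all identifiers besides filterCompacted and stream are illustrative stand-ins, not the actual opencode code): because the filter drains the stream into an array inside the loop, the full history is re-materialized on every tool-call step.

```typescript
// Illustrative types standing in for opencode's message shapes.
type Part = { text: string };
type WithParts = { id: number; parts: Part[] };

// Stand-in for stream(sessionID): yields every message in the session.
function* stream(history: WithParts[]): Generator<WithParts> {
  for (const msg of history) yield msg;
}

// Stand-in for filterCompacted: drains the stream into an array,
// loading all parts eagerly.
function filterCompacted(msgs: Iterable<WithParts>): WithParts[] {
  return [...msgs];
}

// The loop shape at issue: each tool-call step reloads the whole history.
// Returns the total number of messages materialized across all steps.
function promptLoop(history: WithParts[], steps: number): number {
  let loaded = 0;
  for (let step = 0; step < steps; step++) {
    const msgs = filterCompacted(stream(history)); // full copy, every step
    loaded += msgs.length;
  }
  return loaded; // steps × history.length
}
```

With 7,704 messages and 10-50 steps, this shape materializes the history tens of times per prompt, which matches the observed RSS growth.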

The loaded WithParts[] array (~300MB after V8 object expansion) is then passed through toModelMessages → convertToModelMessages → ProviderTransform.message → convertToLanguageModelPrompt, a chain of 4-5 copy layers that each create ~60MB of wrapper objects. With 10-50 tool-call steps per prompt, peak RSS reaches 4-8GB.

Two issues compound this:

  1. No context windowing: all messages are converted to ModelMessage format even though only ~200 fit in the LLM context window (~200K tokens ≈ 800KB of text)
  2. No compaction boundary optimization: filterCompacted streams through all messages loading parts eagerly, even for compacted sessions where only messages after the boundary are needed
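Both mitigations could be combined in a single load path. A hedged sketch (loadForPrompt, the compacted flag, and the Msg shape are hypothetical names for illustration, not existing opencode APIs): skip everything before the last compaction boundary without touching parts, then keep only the most recent messages that fit the context window.

```typescript
// Hypothetical message shape: `compacted` marks a compaction boundary.
type Msg = { id: number; compacted?: boolean; parts?: string[] };

// Sketch of a boundary-aware, windowed loader (assumed design, not
// the actual fix): returns only the messages the LLM call can use.
function loadForPrompt(history: Msg[], window: number): Msg[] {
  // (2) Compaction boundary: start just after the last compacted marker,
  // scanning indices only, without loading parts for skipped messages.
  let start = 0;
  for (let i = history.length - 1; i >= 0; i--) {
    if (history[i].compacted) {
      start = i + 1;
      break;
    }
  }
  // (1) Context windowing: at most `window` messages from the tail.
  return history.slice(Math.max(start, history.length - window));
}
```

Under this sketch a 7,704-message session with a recent compaction boundary would load only the post-boundary tail (capped at the window size) instead of the full ~91MB history on every step.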

Steps to reproduce

  1. Use opencode for several days with active sessions (1000+ messages)
  2. Start a prompt in a large session
  3. Monitor RSS: watch -n1 'grep VmRSS /proc/$(pgrep -f "opencode serve")/status'
  4. Observe RSS climbing to 4-8GB during tool-call loops

OpenCode version

0.1.35

OS

Linux (Ubuntu 24.04)

Metadata

Labels

core: Anything pertaining to core functionality of the application (opencode server stuff)
perf: Indicates a performance issue or need for optimization
