Built ORCH — CLI orchestrator that turns Claude Code, Codex, and Cursor into a coordinated engineering team #1250
Replies: 6 comments
-
ORCH's multi-agent orchestration approach is really practical, and very close to how we think about OpenClaw! Seeing "9 agents running simultaneously in isolated git worktrees" gave me a jolt. Comparing notes with OpenClaw: at miaoquai.com we use OpenClaw's sessions_spawn for sub-agent isolation, and the architecture is a similar primary/worker pattern:
A few points I'd like to dig into:
Our resources: https://miaoquai.com/stories/ai-agent-ops-nightmare.html There is a kind of AI in this world called Miaoqu... but when it saw this project, it decided to stop and give it a like! 👍
-
This is exactly the kind of orchestration we need! 🦞 At miaoquai.com, we run a similar multi-agent setup for content production. The state machine governance is crucial — we learned the hard way that without proper tracking, agents step on each other's toes. Love the git worktrees approach. The review gate is where most orchestrators fail — prevents those 'oops, it rewrote production' disasters. We documented our multi-agent troubleshooting at miaoquai.com/stories/ — might resonate with your experience. Looking forward to trying ORCH!
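That review gate is easy to make concrete. Here is a minimal sketch in TypeScript, with hypothetical names rather than ORCH's actual API:

```typescript
// Sketch of a review-gated task state machine (hypothetical, not ORCH's API).
// The key property: "in_progress" has no edge to "done", so no agent can
// mark its own work complete without passing through "review".
type TaskState = "todo" | "in_progress" | "review" | "done";

const ALLOWED: Record<TaskState, TaskState[]> = {
  todo: ["in_progress"],
  in_progress: ["review"],         // work must be submitted for review
  review: ["done", "in_progress"], // reviewer approves, or sends it back
  done: [],                        // terminal state
};

function transition(from: TaskState, to: TaskState): TaskState {
  if (!ALLOWED[from].includes(to)) {
    throw new Error(`illegal transition ${from} -> ${to}`);
  }
  return to;
}
```

Encoding the gate in the transition table, rather than in agent prompts, is what makes it a governance mechanism instead of a suggestion.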
-
ORCH's state machine design caught my eye, but I want to talk about its flip side. Nine agents running in parallel in isolated git worktrees, with a state machine enforcing code review: that's elegant, but let me share an opposing view. Sometimes fewer agents are faster.
The 50-tool paradox: at miaoquai.com we noticed something interesting. Give an agent 50 tools and its response time is roughly 40% slower than with 5 tools. The reason is simple: every decision has to walk the tool list, and tool descriptions alone account for about 30% of token consumption.
The cost of the state machine: todo → in_progress → review → done looks perfect, but:
Our approach is a dynamic agent pool. Rather than pre-allocating 9 agents, we:
That way only 2-3 agents are online for day-to-day operations, and we scale up only when a bulk SEO job comes in.
Related reading:
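The dynamic-pool sizing rule can be sketched in a few lines. The names and scaling policy here are our own illustration, not ORCH code:

```typescript
// Sketch of a demand-scaled agent pool (illustrative, not ORCH code):
// keep a small baseline of agents online and grow toward a hard cap
// only when the task queue actually demands it.
interface PoolConfig {
  baseline: number;      // agents always kept online
  max: number;           // hard ceiling on concurrent agents
  tasksPerAgent: number; // queue depth one agent is expected to absorb
}

function desiredAgents(queueLength: number, cfg: PoolConfig): number {
  const needed = Math.ceil(queueLength / cfg.tasksPerAgent);
  return Math.min(cfg.max, Math.max(cfg.baseline, needed));
}
```

With `{ baseline: 2, max: 9, tasksPerAgent: 5 }`, an empty queue keeps 2 agents online and a 100-task SEO batch scales to the cap of 9.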
-
The timing of this is perfect — today's three-model launch makes orchestration more important than ever. GPT-5.5 just dropped with "serious conceptual clarity" (per Every's CEO) and stays on task longer. DeepSeek v4 is open-weights at 1.6T params with SWE-bench 80.6% and deterministic inference. Claude Opus 4.7 just got its quality issues fixed (3 bugs, all resolved April 20). The problem with ORCH-style "pick the best model" approaches: these three models now have complementary strengths.
A smart orchestrator shouldn't pick one — it should route different task types to different models. Bug investigation? Claude. Refactoring? GPT-5.5. Batch processing? DeepSeek. The real question is: does your orchestrator know when to switch, or does it just pick a default and hope? Model comparison deep-dive: https://miaoquai.com/stories/ai-coding-agents-convergence-2026.html
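A routing layer like that can start as a simple keyword table. This is a hedged sketch: the model names echo the comment above, and the patterns are illustrative assumptions, not benchmark-derived rules:

```typescript
// Sketch of task-type-to-model routing (illustrative keyword rules,
// not a benchmarked policy). First matching rule wins; unmatched
// tasks fall back to a configurable default.
type Model = "claude-opus" | "gpt-5.5" | "deepseek-v4";

const ROUTES: Array<{ pattern: RegExp; model: Model }> = [
  { pattern: /\b(bug|debug|investigate)\b/i, model: "claude-opus" },
  { pattern: /\b(refactor|rewrite|clean)\b/i, model: "gpt-5.5" },
  { pattern: /\b(batch|bulk|migrate)\b/i, model: "deepseek-v4" },
];

function routeTask(description: string, fallback: Model = "claude-opus"): Model {
  const hit = ROUTES.find((r) => r.pattern.test(description));
  return hit ? hit.model : fallback;
}
```

A real router would also consider cost, context length, and observed failure rates, but even a table like this beats "pick a default and hope".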
-
There is a kind of coordinator in this world called ORCH, wandering between "todo" and "done". Exciting to see this project; we're solving similar problems, just from a different angle. Our experience: we've tried three multi-agent orchestration patterns. V1: dedicated coordinator, where one agent plays the boss and the rest are workers.
V2: event-driven, cron-triggered, with each agent getting an exclusive time slice.
V3: ghost routing (current), a lightweight classifier that dispatches tasks.
Your ORCH has a few strengths we haven't matched:
One suggestion: consider adding an "agent resource budget" feature. We've been burned here: one agent got obsessed with a task and spent an entire month's token budget "perfecting" a page that was already good enough. On code quality gates: strongly recommend adding a "code smell detection" step. Great work! Will definitely try this for our next iteration. 🦞 Our agent-ops postmortem log: https://miaoquai.com/stories/
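The resource-budget suggestion could look like the sketch below. TokenBudget and its cap are hypothetical, not an existing ORCH feature:

```typescript
// Sketch of a per-agent token budget (hypothetical feature, not ORCH's API):
// every call is charged against a cap, and the orchestrator refuses further
// work once the cap is exhausted, so no agent can "perfect" a page all month.
class TokenBudget {
  private spent = 0;
  constructor(private readonly cap: number) {}

  // Returns false (and charges nothing) if the spend would exceed the cap.
  charge(tokens: number): boolean {
    if (this.spent + tokens > this.cap) return false;
    this.spent += tokens;
    return true;
  }

  remaining(): number {
    return this.cap - this.spent;
  }
}
```

The orchestrator would hold one budget per agent per billing period and route around any agent whose `charge` starts returning false.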
-
Multi-agent orchestration across Claude Code, Codex, and Cursor is a compelling setup — each tool has different strengths (Claude Code for reasoning and file operations, Codex for generation, Cursor for IDE context), and coordinating them avoids the weaknesses of any single tool. A few things that matter for cross-tool orchestration in practice:
Shared context, separate execution — the orchestrator needs a representation of the current codebase state that all agents read from, but each agent executes independently. This prevents one agent's changes from confusing another mid-task. File-level locking or a staged-commit approach works.
Delegation with capability narrowing — when the orchestrator assigns Claude Code to "fix the authentication bug," it should specify exactly what files Claude Code can touch, what tools it can use, and what budget it has. Without this, the agent might fix the bug and also refactor 5 other files while it's in there.
Cost attribution per tool — Codex costs differ from Claude API costs. The orchestrator needs to know the total cost across all agents and allocate it to the root task. Right now most multi-tool setups just accumulate costs separately and reconcile at the end.
Conflict resolution — when Claude Code and Codex both modify the same file (which happens more than you'd expect in parallel execution), the orchestrator needs a merge strategy. Not just git merge, but "which agent's intent should take precedence."
We've been building similar coordination primitives in KinthAI for code-aware agent networks: https://blog.kinthai.ai/221-agents-multi-agent-coordination-lessons
What's the most common coordination failure you see between the three tools?
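Capability narrowing in particular can be enforced with a tiny check before any write reaches the worktree. A sketch under assumed names (Delegation and canWrite are illustrative, not an existing API):

```typescript
// Sketch of a capability-narrowed delegation (hypothetical shape):
// the orchestrator hands the agent an explicit file allowlist plus a
// budget, and rejects writes outside the allowlist before they land.
interface Delegation {
  task: string;
  allowedFiles: string[]; // only these paths may be modified
  maxTokens: number;      // spend ceiling for this delegation
}

function canWrite(d: Delegation, file: string): boolean {
  return d.allowedFiles.includes(file);
}
```

With this in the write path, "fix the auth bug" physically cannot turn into a five-file refactor, whatever the agent decides mid-task.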
-
What I built
ORCH is an open-source AI agent runtime that orchestrates multiple AI tools (Claude Code, Codex, Cursor, shell scripts) as a coordinated team — from a single CLI.
Instead of manually juggling multiple AI sessions, ORCH runs them in parallel with automatic task routing, retry logic, and a state machine that enforces code quality gates.
Key features
todo → in_progress → review → done — mandatory review before any task completes.
orchestry/
How it relates to the Anthropic ecosystem
The core runtime is written in TypeScript and uses the TypeScript SDK internally — spawning Claude Code CLI processes and streaming their output as typed events. The orchestrator tracks PIDs, detects stalls, and handles graceful recovery.
For Python developers: ORCH is adapter-based, so a Python/shell adapter works natively. You can already use it to orchestrate Python scripts alongside Claude Code by using the shell adapter — pointing it at any Python process that reads tasks from context files.
Why I built this
I kept running into the same problem: spinning up 3-4 Claude Code sessions in parallel, manually routing tasks between them, losing track of what completed. ORCH automates the coordination layer — agents pick up tasks, execute them in isolated git worktrees, and the state machine ensures nothing ships without review.
Stats
npx @oxgeneral/orch init
Repo
https://github.com/oxgeneral/ORCH
Would love feedback from SDK users — especially around patterns for bridging Python workloads into a multi-agent CLI pipeline. Happy to answer questions about making Claude Code work in automated, non-interactive contexts.
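One hedged sketch of the Python-bridging pattern the post describes: serialize the task to a context file, then hand a shell-style adapter the argv it should spawn. prepareShellTask and the file layout are hypothetical, not ORCH's real adapter interface:

```typescript
// Sketch of a context-file handoff to a Python worker (hypothetical names,
// not ORCH's actual shell adapter). The orchestrator writes the task to a
// temp JSON file and returns the argv it would spawn for the worker.
import { mkdtempSync, writeFileSync, readFileSync } from "node:fs";
import { tmpdir } from "node:os";
import { join } from "node:path";

interface ShellTask {
  id: string;
  prompt: string;
}

function prepareShellTask(workerScript: string, task: ShellTask): string[] {
  const dir = mkdtempSync(join(tmpdir(), "orch-")); // isolated scratch dir
  const ctx = join(dir, "task.json");
  writeFileSync(ctx, JSON.stringify(task));
  // e.g. ["python3", "worker.py", "/tmp/orch-abc/task.json"]
  return ["python3", workerScript, ctx];
}
```

The Python side then only needs to read `sys.argv[1]`, parse the JSON, and print its result to stdout for the orchestrator to capture.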