Add Agent-SRE and Agent-Hypervisor to Open Source Distributed Tracing #24

Merged
magsther merged 1 commit into magsther:main from imran-siddique:add-agent-sre-observability
Feb 21, 2026

@imran-siddique
Contributor

What This Adds

This PR adds two open-source tools to the Distributed Tracing Solutions > Open Source section:

Agent-SRE

AI-native SRE framework with OTel-compatible telemetry:

  • SLI/SLO measurements exported as OTel metrics (latency, error rate, throughput)
  • Chaos test experiments as structured OTel spans
  • Canary deployment traces (baseline vs canary comparison)
  • Error budget burn rate events

1,071 tests | Python 3.10+ | MIT Licensed
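
To make the "error budget burn rate" bullet concrete, here is a minimal sketch of the usual SRE definition: observed error rate divided by the error budget (1 minus the SLO target). It does not use Agent-SRE's API; the function names and event fields (`burn_rate`, `burn_event`, `error_budget.burn_rate`) are hypothetical and chosen only for illustration.

```python
def burn_rate(bad_events: int, total_events: int, slo_target: float) -> float:
    """Return how fast the error budget is being consumed.

    A value of 1.0 means the budget lasts exactly the SLO window;
    anything above 1.0 means it will run out early.
    """
    if total_events <= 0:
        raise ValueError("total_events must be positive")
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    error_rate = bad_events / total_events
    error_budget = 1.0 - slo_target  # allowed fraction of bad events
    return error_rate / error_budget


def burn_event(service: str, bad_events: int, total_events: int,
               slo_target: float) -> dict:
    """Build a plain-dict event; the field names are hypothetical."""
    rate = burn_rate(bad_events, total_events, slo_target)
    return {
        "name": "error_budget.burn_rate",
        "service": service,
        "slo_target": slo_target,
        "burn_rate": rate,
        "budget_exhausting": rate > 1.0,
    }


if __name__ == "__main__":
    # 5 failed requests out of 1000 against a 99.9% SLO.
    print(burn_event("checkout", 5, 1000, 0.999))
```

With 5 failures in 1000 requests against a 99.9% target, the error rate is five times the allowed budget, so the event is flagged as exhausting the budget. A real exporter would attach such a value to an OTel metric or span event rather than a dict.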

Agent-Hypervisor

Runtime supervisor for multi-agent systems with a structured event bus:

  • 40+ event types (ring transitions, saga steps, liability events, session writes)
  • Causal trace ID propagation across agent spawn trees
  • All events exportable as distributed traces via the OpenTelemetry protocol (OTLP)

326 tests | Python 3.11+ | MIT Licensed
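
The "causal trace ID propagation across agent spawn trees" bullet can be pictured with a small stdlib-only sketch. It is not Agent-Hypervisor's implementation: `AgentContext`, `spawn`, and `flatten` are hypothetical names. The only borrowed convention is the ID sizes used in W3C Trace Context (16-byte trace ID, 8-byte span ID, hex-encoded).

```python
from __future__ import annotations

import secrets
from dataclasses import dataclass, field


@dataclass
class AgentContext:
    """Hypothetical per-agent tracing context."""
    agent_name: str
    trace_id: str
    span_id: str
    parent_span_id: str | None = None
    children: list[AgentContext] = field(default_factory=list)

    @classmethod
    def root(cls, agent_name: str) -> AgentContext:
        # A root agent starts a new trace and has no parent span.
        return cls(agent_name,
                   trace_id=secrets.token_hex(16),
                   span_id=secrets.token_hex(8))

    def spawn(self, agent_name: str) -> AgentContext:
        # A child inherits the trace ID and records its spawner as parent.
        child = AgentContext(agent_name,
                             trace_id=self.trace_id,
                             span_id=secrets.token_hex(8),
                             parent_span_id=self.span_id)
        self.children.append(child)
        return child


def flatten(ctx: AgentContext) -> list[dict]:
    """Walk the spawn tree depth-first and return one record per agent."""
    records = [{
        "agent": ctx.agent_name,
        "trace_id": ctx.trace_id,
        "span_id": ctx.span_id,
        "parent_span_id": ctx.parent_span_id,
    }]
    for child in ctx.children:
        records.extend(flatten(child))
    return records


if __name__ == "__main__":
    planner = AgentContext.root("planner")
    worker = planner.spawn("worker")
    worker.spawn("tool-runner")
    for record in flatten(planner):
        print(record)
```

Every record shares the root's trace ID, and each `parent_span_id` points at the agent that spawned it, which is the information a tracing backend needs to rebuild the spawn tree as a single trace.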

Why They Belong Here

As AI agents become production workloads, they need observability. Both tools produce structured telemetry compatible with OpenTelemetry, enabling visualization in any OTel-compatible backend (Jaeger, Grafana, etc.). They extend distributed tracing from traditional microservices to autonomous AI agent systems.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
2 participants