
Overview — frameworks we surveyed

We surveyed every Claude Code agent framework we could find before deciding to write our own. Here’s the matrix.

| Framework | Author / Org | Stars (Apr 2026) | Core idea | Closest pattern we adopted |
|---|---|---|---|---|
| gstack | Garry Tan (YC) | 71K | 23 role-based slash commands, “boil the lake” | Boil the lake principle in 5 role prompts |
| Superpowers | Jesse Vincent | 149K | 7-phase TDD iron law | TDD iron law in validator |
| GSD | Lex Christopherson | 51K | Per-phase orchestrators with state-to-disk | Fresh contexts + cache-stable prompts |
| Agentwise | Phil | | 8 specialist agents in parallel + dashboard | Real-time dashboard pattern |
| Hermes | (referenced in gstack) | | Autonomous orchestration with checkpoints | Named checkpoints |
| Multi-Agent Ralph Loop | alfredolopez80 | | MemPalace 4-layer memory | 4-layer memory (L0-L3) |
| ComposioHQ Agent Orchestrator | Composio | | Git worktrees per parallel agent | Worktrees per lane |
| OpenSwarm | unohee | | Linear-driven Worker/Reviewer pair | (rejected: no external task source) |
| Conductor | $22M Series A | | Two-mode parallelism | Competition mode |
| claudecode-orchestrator | darrenapfel (deprecated) | | “Quality through truth” + service smoke-test | Evidence rule + smoke-test gate |
| MOLTRON | | | Self-evolving Skills.md | TRICK: convention |

We also looked at general-purpose frameworks: AutoGPT, LangChain, CrewAI, MetaGPT, SuperAGI, Haystack, and Semantic Kernel. These aren’t coding-specific, and we didn’t borrow patterns from them directly.

Across the frameworks we surveyed, the Claude Code ecosystem clusters around two patterns:

  1. Skill packs (gstack, Superpowers, GSD): slash commands inside a Claude Code chat. The human is the orchestrator, and Claude runs one task at a time. Useful for solo founders, but not autonomous.
  2. Multi-agent runtimes (Agentwise, Hermes, Conductor, ComposioHQ): these run agents in parallel. Closer to what we needed, but each ties orchestration to its own opinion of how the company should work.

We landed in a third cluster: a tiny supervised loop that has no opinion about how the company should work, only about the agent runtime.

Things we needed that no other framework had:

  • Decisions timeline with parser-ghost rate. None of them expose orchestrator parse-ghost telemetry.
  • Cost-per-feature attribution via filename tagging. Others track cost per session.
  • Plan-reviewer role as an autonomous gate. gstack has the spiritual equivalent (/plan-ceo-review), but it is human-triggered.
  • Autonomous Product role for scope expansion. Closest is MOLTRON’s self-evolving capabilities, but that’s capability evolution, not scope expansion.
  • Proposal CRUD + bulk accept/reject UI.
  • Per-role identity files with curator-promoted patterns + worker TRICK: convention.
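
To make the cost-per-feature item above concrete, here is a minimal sketch of attribution via filename tagging. The naming convention (`<feature-tag>__<session-id>.jsonl`), the function name `cost_per_feature`, and the sample costs are all assumptions made for illustration; they are not the framework's actual format or API.

```python
import re
from collections import defaultdict

# Hypothetical convention: session logs are named "<feature-tag>__<session-id>.jsonl".
# The real tagging scheme is not described here, so treat this pattern as an assumption.
TAG_PATTERN = re.compile(r"^(?P<feature>[a-z0-9-]+)__(?P<session>[A-Za-z0-9]+)\.jsonl$")


def cost_per_feature(session_costs):
    """Sum session costs (USD) by the feature tag embedded in each filename.

    session_costs: iterable of (filename, cost_usd) pairs.
    Filenames that do not match the convention are grouped under "untagged".
    """
    totals = defaultdict(float)
    for filename, cost in session_costs:
        match = TAG_PATTERN.match(filename)
        feature = match.group("feature") if match else "untagged"
        totals[feature] += cost
    return dict(totals)


# Illustrative data only: two sessions for one feature, one for another, one untagged.
sessions = [
    ("checkout-flow__a1.jsonl", 0.50),
    ("checkout-flow__b2.jsonl", 0.25),
    ("dark-mode__c3.jsonl", 1.00),
    ("scratch.jsonl", 0.125),
]
totals = cost_per_feature(sessions)
for feature, usd in sorted(totals.items()):
    print(f"{feature}: ${usd:.2f}")
```

The point of the design is that the tag travels with the artifact: per-session tracking can only say what one chat cost, while a tag in the filename lets several sessions roll up into a single feature total.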

See What we kept, dropped, and invented for the synthesis.