Replies: 1 comment
-
|
Open question for builders running multi-agent in production: What's your actual handoff overhead? I'm curious about concrete numbers — how many tokens do you spend on agent-to-agent context transfer vs actual task completion? We measured ~15-20% overhead on inter-agent serialization in our early prototype before switching to single-agent. Would love to see data from teams who made multi-agent work at scale. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
In April 2026, we made a deliberate architectural decision: single-agent with role-switching over multi-agent orchestration.
14 months and 170K lines of code later, the two largest multi-agent frameworks in the world are converging toward the same conclusion we started from.
The Thesis
Every industry trend — full-stack engineers, DevOps, cross-functional pods — eliminates handoffs. Multi-agent frameworks re-introduce them artificially.
What We Actually Built
SwarmAI runs a 9-stage autonomous pipeline where the same agent switches roles:
One context window. Zero serialization between stages. The agent carries full project knowledge through every role — it doesn't need "handoff documents" because it already knows everything.
For the ADVERSARIAL stage, we spawn a fresh sub-agent specifically for isolation (a reviewer who hasn't built the code can't rationalize its flaws). But this is a disposable reviewer, not a persistent peer — it has one job, delivers findings, and terminates.
What The Industry Is Learning
CrewAI (52K stars, 2B workflow executions) evolved from pure multi-agent "Crews" to Flows — deterministic orchestration with agents embedded only where judgment is needed:
Their most successful customer pattern (DocuSign: 5 "agents" in a deterministic Flow with fixed sequencing) is functionally identical to a single agent switching roles per stage.
They also report:
DeerFlow (69K stars, ByteDance) added
subagent_enabled: falseas a config option and offers "flash mode" (no sub-agents) for tasks that don't need the overhead. Their SWE-bench scores are actually lower via multi-agent than direct model calls — the orchestration overhead eats the turn budget before the actual work happens.The Overhead Problem (Quantified)
For a typical 3-agent crew:
Net: 3-10x more tokens for equivalent output. And the output isn't better — it's often worse because context is lost in serialization.
When Multi-Agent Actually Makes Sense
We're not dogmatic. Multi-agent has genuine value for:
The Real Lesson
The question isn't "single-agent vs multi-agent." It's:
"Where does the intelligence boundary live?"
If your agents share the same model, the same tools, and the same context window — you don't have multiple agents. You have one agent with extra steps.
Multi-agent makes architectural sense when agents have genuinely different:
For everything else — and that's 90% of use cases — a single capable agent with role-switching is simpler, cheaper, more reliable, and easier to debug.
Our Architecture in Practice
Discussion
我们在 2026 年 4 月做了一个刻意的架构决策:单 agent + 角色切换 > 多 agent 编排。14 个月后,两个最大的多 agent 框架(CrewAI 52K⭐ 和 DeerFlow 69K⭐)正在向同一个结论收敛。CrewAI 从纯多 agent "Crews" 演化到确定性 "Flows",代码减少 14 倍。他们自己的经验:agent 只用在需要判断力的地方,其余用确定性控制流。这正是我们从第一天就在做的事。
Built with SwarmAI — one builder + AI operating at team scale.
Beta Was this translation helpful? Give feedback.
All reactions