Back to portfolio
Case study

Multi-agent orchestration harness for teams that need control over how coding agents run.

A case study based on wagents-py: a control layer for running multiple coding agents across isolated workspaces, coordinated execution waves, and different agent runtimes without losing visibility.

Project context

This entry is grounded in the actual orchestration repo. The public framing focuses on the control layer above the agents themselves: how to start them, isolate them, monitor them, and coordinate them as a system.

Case file

Best fit

Teams running more than one coding agent and needing operational control

What it provides

A harness for orchestration, runtime selection, and isolated execution

Why it is credible

State, isolation, and control surfaces are explicit in the product

Delivery story

The project in sequence

01

The problem

Once a team starts using more than one coding agent, the problem is no longer prompt quality alone. The real issue becomes coordination: isolating work, managing dependencies, checking progress, and recovering from failures without turning everything into manual babysitting.

That gets harder when different teams want different runtimes, permission models, and execution environments. Without a control layer, the workflow becomes a pile of disconnected agent sessions.

02

The product

wagents-py approaches that as an orchestration harness. It defines wave and set lifecycle controls, persists coordination state, launches agents into isolated git workspaces, and uses terminal sessions when runtimes need interactive control.

Instead of tying the product to one preferred vendor, it supports multiple agent runtimes. That makes the control layer the product, not just the shell around one model provider.

03

Why it feels operational

The platform becomes more compelling because orchestration state, runtime spawning, terminal control, and monitoring are all treated as first-class concerns. That makes it usable as an actual operating layer, not just a thin command wrapper.

This is why the right framing is an agent harness rather than a generic CLI utility. The value is in making multi-agent execution controllable and inspectable.

Observed stack

PythonControl ServertmuxGit WorkspacesCodexOpenCode

Team outcomes

Agent coordination

Wave-based

Runtime flexibility

Multi-adapter

Operational view

CLI, control API, and terminal sessions

Harness surface

The best visual for a harness is not a fabricated dashboard. It is a clear picture of how control, orchestration, adapters, and execution fit together.

wagents is mostly a systems-control product rather than a browser UI, so the strongest asset is an architecture-style visual that explains the harness layer teams are actually adopting.

Harness architecture: orchestration becomes operational when command-line control, runtime adapters, persistence, and execution isolation are treated as one coordinated system.

How it works

The harness works because orchestration, runtime adapters, and execution environments are treated as separate concerns with one shared control model.

The repo separates orchestration, runtime adapters, state, terminal control, and workspace isolation so the system can coordinate multiple agents without collapsing into one brittle script. That is what makes the harness useful to an operator instead of only to the person who wrote it.

Why it is credible

The strong claim here is not simply that multiple agents can run. It is that they can be coordinated as a system with explicit control.

01

Wave-based coordination is explicit

The root AGENTS file describes wagents-py as a Python command-line tool plus control server for wave-based agent orchestration using tmux and git workspaces.

02

Multiple runtimes are supported

The runtime package includes dedicated adapters for Codex, Claude Code, and OpenCode rather than assuming a single agent backend.

03

State and dependencies are managed centrally

The orchestrator coordinates phases, waits on dependencies, spawns agents through the scheduler, and tracks orchestration state for synchronous and asynchronous runs.

04

External control is part of the design

The control server exposes wave, set, planning, interaction, and artifact tools, making the harness usable as a controller-facing system rather than only as a local shell script.

Related use cases

Problems this proof can support

This proof stays project-specific. Related use cases show the broader situations where similar system shapes can help.

AI consulting use case

Deciding where AI belongs before the team commits to a build.

This situation is for leaders who can see AI may matter, but do not yet know where it should enter the work, what should stay manual, or whether the right move is buy, build, automate, or wait.

See the use case

Prototype and product use case

Proving a new AI-enabled product or capability.

This situation is for founders, product leads, or internal innovation teams that need to prove a new AI-enabled workflow, feature, or product surface with enough substance to guide the next decision.

See the use case

Next step

If your team is coordinating more than one coding agent, we can help make that system easier to control.

A harness like this is strongest when isolated workspaces, monitoring, permissions, and runtime choice all matter as much as the agent prompts themselves.

Which agent tasks need isolation instead of one shared session

Where monitoring, approvals, or recovery become operational requirements

How much runtime flexibility the control layer should support

A first conversation is usually enough to map the workflow, the constraints, and the right shape for a useful first version.