Agent Architecture

Agent Anatomy

Every agent definition follows a standard structure:

# Frontmatter
---
name: 06b-Bicep CodeGen
description: Expert Azure Bicep IaC specialist...
model: ["Claude Opus (latest)"] # (1)!
tools: [list of allowed tools] # (2)!
handoffs:
  - label: "Step 6: Deploy"
    agent: 07b-Bicep Deploy # (3)!
    prompt: "Deploy the Bicep templates..."
---
# Body (≤ 350 lines)
## MANDATORY: Read Skills First
1. **Read** `.github/skills/azure-defaults/SKILL.md` # (4)!
2. **Read** `.github/skills/azure-artifacts/SKILL.md`

Model selection — the Conductor can override this based on task complexity
Tool allowlist — agents only access tools they need
Handoff target — the next agent in the workflow
Skills are loaded on demand to preserve context budget

The frontmatter is machine-readable metadata. The body is the agent’s full operating manual — the runtime loads it into the system prompt at invocation.

Tools

Agents interact with external systems through tools — structured interfaces provided by MCP servers and the VS Code runtime. Each agent’s frontmatter declares a tools: allowlist that restricts which tools it can call. Common tool categories:

MCP tools: Cloud API wrappers (Azure pricing queries, GitHub operations, Terraform registry lookups)
File tools: Read and write workspace files (create artifacts, read prior step outputs)
Terminal tools: Execute CLI commands (Bicep build, Terraform validate, Azure CLI)
Subagent tools: Delegate to specialised subagents via #runSubagent

Handoffs

Agents do not communicate directly. Instead, each agent produces artifact files in agent-output/{project}/ that the next agent reads as input. The Conductor orchestrates this by delegating to one agent at a time, collecting its output, and routing to the next step. At approval gates, the Conductor writes a 00-handoff.md summary document that enables session resume.

Top-Level Agents

Agent	Role	Primary Skills
01-Conductor	Master orchestrator	workflow-engine, session-resume
01-Conductor (Fast Path)	Simplified path for ≤3 resources	session-resume, azure-defaults
02-Requirements	Captures project requirements	azure-defaults, azure-artifacts
03-Architect	WAF assessment and cost estimation	azure-defaults
04-Design	Diagrams and ADRs	azure-diagrams, azure-adr
04g-Governance	Policy discovery and compliance	azure-defaults
05b-Bicep Planner	Bicep implementation planning	azure-bicep-patterns
05t-Terraform Planner	Terraform implementation planning	terraform-patterns
06b-Bicep CodeGen	Bicep template generation	azure-bicep-patterns
06t-Terraform CodeGen	Terraform configuration generation	terraform-patterns
07b-Bicep Deploy	Bicep deployment execution	azure-validate, iac-common
07t-Terraform Deploy	Terraform deployment execution	azure-validate, iac-common, terraform-patterns
08-As-Built	Post-deployment documentation	azure-artifacts, azure-diagrams
09-Diagnose	Azure resource troubleshooting	azure-diagnostics
10-Challenger	Standalone adversarial review	—
11-Context Optimizer	Context window audit and optimisation	context-optimizer

Subagents

Subagents are not user-invocable. They are delegated to by parent agents for isolated, specific tasks:

Subagent	Purpose	Invoked By
challenger-review-subagent	Adversarial review of artifacts	Steps 1, 2, 4, 5, 6
challenger-review-batch-subagent	Batch multi-lens adversarial review	Steps 2, 4, 5
challenger-review-codex-subagent	Fast adversarial review (Codex model)	Steps 2, 4
cost-estimate-subagent	Azure Pricing MCP queries	Steps 2, 7
governance-discovery-subagent	Azure Policy discovery via REST API	Step 4
bicep-lint-subagent	`bicep build` + `bicep lint`	Step 5 (Bicep)
bicep-review-subagent	Code review against AVM standards	Step 5 (Bicep)
bicep-whatif-subagent	`az deployment what-if` preview	Step 6 (Bicep)
terraform-lint-subagent	`terraform fmt` + `terraform validate`	Step 5 (Terraform)
terraform-review-subagent	Code review against AVM-TF standards	Step 5 (Terraform)
terraform-plan-subagent	`terraform plan` preview	Step 6 (Terraform)

The Challenger Pattern

The challenger-review-subagent implements adversarial review at critical workflow steps. It operates with rotating lenses:

1-pass review (comprehensive): A single review covering all dimensions. Used for requirements (Step 1) and deploy (Step 6).
3-pass review (rotating lenses): Three separate reviews, each focused on a specific dimension (security, reliability, cost). Used for architecture (Step 2), planning (Step 4), and code (Step 5).

Findings are classified as must_fix (blocking) or should_fix (advisory). Only must_fix findings block workflow progression.

Conditional Pass 3: Pass 3 of the 3-pass rotating lens review is now conditional — it only runs if Pass 2 returned ≥1 must_fix finding. If Pass 2 returns zero must_fix items, Pass 3 is skipped entirely, saving approximately 4 minutes per review cycle.

Context Shredding for Challenger Inputs: The challenger is instructed to apply context compression tiers when loading predecessor artefacts for review:

Context Usage	Loading Strategy
< 60%	Full artefact
60–80%	Key H2 sections only (resource list, SKUs, WAF scores, budget)
> 80%	Decision summary from `00-session-state.json` + resource list

After each review pass, only the compact_for_parent string (~200 characters) is carried forward — not the full JSON findings. This prevents context bloat across multi-pass reviews and is enforced by the output schema.

New Challenger Checklists: Two mandatory checklist categories were added:

Cost Monitoring: Budget resource, forecast alerts at 80/100/120%, anomaly detection.
Repeatability: Parameterised values, multi-tenant deploy, projectName required.

Handoffs and Delegation

Agents communicate through artefact files, not direct message passing. The Conductor delegates to a step agent, which produces output files in agent-output/{project}/. The next agent reads those files as input. This design:

Eliminates context leakage between agents
Enables resume from any point (artefacts are persistent)
Allows human review at every gate (artefacts are human-readable markdown)
Supports parallel development of different steps

Phase Handoff Document: At each approval gate, the Conductor writes a 00-handoff.md file containing a summary of what was completed, key decisions made, what comes next, and (at Gates 2 and 3) a session break recommendation. This enables resume from any gate without needing to re-read all prior artefacts.

Creating a Custom Agent

This section walks through creating a new agent from scratch.

Step 1: Choose Agent Type and Model

Type	File Location	User-Invocable	Use When
Top-level agent	`.github/agents/{name}.agent.md`	Yes	User-facing workflow steps
Subagent	`.github/agents/_subagents/{name}.agent.md`	No	Isolated tasks delegated by parent agents

Model selection depends on the task:

Planning agents (accuracy-first) — use Claude Opus (latest)
Execution subagents (speed) — check .github/agent-registry.json for current assignments
Adversarial review — use a different model family than the artifact author

Step 2: Create the Agent File

Create a .agent.md file with the required frontmatter:

---
name: My Custom Agent
description: >-
  One-line description of what this agent does.
  USE FOR: keyword triggers. DO NOT USE FOR: anti-triggers.
model:
  - Claude Opus (latest)
tools:
  - read_file
  - create_file
  - replace_string_in_file
  - run_in_terminal
  - runSubagent
handoffs:
  - label: "Next Step"
    agent: next-agent-name
    prompt: "Hand off with context..."
---

Required frontmatter fields: name, description, model, tools. Optional: handoffs, user-invocable (defaults to true for top-level).

See .github/instructions/agent-definitions.instructions.md for the complete frontmatter specification.

Step 3: Write the Agent Body

The body (below the frontmatter) is the agent’s operating manual:

## MANDATORY: Read Skills First

1. **Read** `.github/skills/azure-defaults/SKILL.md`
2. **Read** `.github/skills/session-resume/SKILL.md`

## DO (required behaviours)

- Always check for existing session state before starting
- Load skills progressively (SKILL.md first, then references/ on demand)

## DO NOT (prohibited behaviours)

- Do not skip approval gates
- Do not hardcode project-specific values

Keep the body under 350 lines. Use skill references for deep domain knowledge rather than inlining it.

Step 4: Register the Agent

Update two registry files:

.github/agent-registry.json — add the agent’s role, file path, model, and skill list
.github/skill-affinity.json — set primary/secondary/never affinities for each skill the agent uses

Step 5: Validate

# Validate frontmatter syntax
npm run lint:agent-frontmatter

# Check body size and language quality
npm run lint:agent-checks

# Verify registry consistency
npm run validate:agent-registry
npm run validate:skill-affinity

Troubleshooting

Problem	Cause	Fix
Agent not appearing in chat	Missing or invalid frontmatter	Run `npm run lint:agent-frontmatter`
Tool not available	Tool not in `tools:` allowlist	Add the tool name to frontmatter
Handoff not triggering	Wrong agent name in `handoffs:`	Verify target agent file exists
Skills not loading	Typo in skill path	Check path matches `.github/skills/{name}/SKILL.md`