do-and-judge
Quality Assurance
Pattern: Single-Task Execution with Meta-Judge and Judge Verification
Phase 1: Task Analysis and Model Selection
Complexity + Risk + Scope → Model Selection
│
Phase 2: Parallel Dispatch (single message, 2 tool calls)
┌──────────────────────┬──────────────────────────────┐
│ Meta-Judge (opus) │ Implementation Agent │
│ sadd:meta-judge │ [CoT + Task + Self-Critique] │
│ → Evaluation spec │ → Implementation artifact │
│ (YAML rubrics, │ │
│ checklists, │ │
│ scoring criteria) │ │
└──────────┬───────────┴────────────────────┬─────────┘
│ Waiting for both to complete │
▼ ▼
Phase 3: Dispatch Judge Agent (sadd:judge)
Judge applies meta-judge spec mechanically
│
Phase 4: Parse Verdict and Iterate
├─ PASS (>=4, or >=3.0 all low-priority) → Report Success
└─ FAIL → Retry with Feedback (max 3)
└─ Return to Phase 3 (same meta-judge spec)
│
Phase 5: Final Report or Escalation
Success summary OR escalate to user after max retriesUsage
When to Use
Key Architecture Details
Quality Enhancement Techniques
Phase
Technique
Benefit
Theoretical Foundation
Last updated