judge
Pattern: LLM-as-Judge with Context Isolation
Phase 1: Context Extraction
Review conversation history
Identify work to evaluate
Extract: Original task, output, files, constraints
│
Phase 2: Judge Sub-Agent (Fresh Context)
┌─────────────────────────────────────────┐
│ Judge receives ONLY extracted context │
│ (prevents confirmation bias) │
│ │
│ For each criterion: │
│ 1. Review evidence │
│ 2. Write justification │
│ 3. Assign score (1-5) │
│ 4. Self-verify with questions │
│ 5. Adjust if needed │
└─────────────────────────────────────────┘
│
Phase 3: Validation & Report
Verify scores in valid range (1-5)
Check justification has evidence
Confirm weighted total calculation
Present verdict with recommendationsUsage
When to Use
Default Evaluation Criteria
Criterion
Weight
What It Measures
Scoring Interpretation
Score Range
Verdict
Recommendation
Quality Enhancement Techniques
Technique
Benefit
Theoretical Foundation
Last updated