Concepts
Reference of terms and concepts used throughout Context Engineering Kit documentation.
What is Context Engineering?
Context engineering is the discipline of managing the language model's context window. Unlike prompt engineering, which focuses on crafting effective instructions, context engineering addresses the holistic curation of all information that enters the model's limited attention budget: system prompts, tool definitions, retrieved documents, message history, and tool outputs.
The fundamental challenge is that context windows are constrained not by raw token capacity but by attention mechanics. As context length increases, models exhibit predictable degradation patterns: the "lost-in-the-middle" phenomenon, U-shaped attention curves, and attention scarcity. Effective context engineering means finding the smallest possible set of high-signal tokens that maximize the likelihood of desired outcomes.
Commands
Commands are explicit actions you invoke manually to perform specific tasks. They follow the pattern /plugin-name:command-name. They include a prompt that will be loaded to the LLM and trigger it to perform a specific task.
Commands are:
User-invoked - You explicitly call them when needed
Task-specific - Designed for particular operations
Token-efficient - Don't consume context when not in use
Discovering Commands
After installing a plugin, its commands become available. View all commands:
/helpCommands are namespaced by plugin, making their origin clear:
/reflexion:reflect- From the reflexion plugin/git:commit- From the git plugin/sdd:01-specify- From the sdd plugin
Command Syntax
Examples:
Using Skills
Skills are automatically applied knowledge that influences Claude's behavior without explicit invocation. When a skill is loaded, Claude considers it continuously during your session.
How Skills Work
Skills are markdown documents loaded into Claude's context that provide:
Best practices - Industry-standard approaches and patterns
Methodologies - Systematic processes (e.g., TDD, DDD)
Anti-patterns - Common mistakes to avoid
Gate functions - Checks before taking certain actions
Key difference from commands:
Commands - You invoke manually with
/command-nameSkills - Claude applies automatically when relevant, but you can explicitly ask Claude to load a specific skill, for example: "load TDD skill, before implementing feature"
Skills vs Commands Trade-off
Why prefer commands over skills:
The Context Engineering Kit architecture prefers commands over skills to minimize token usage:
Skills description always populate context (every session)
Commands only load when invoked (zero tokens until used)
Agents
Agents are specialized sub-agents designed for focused tasks. Unlike Claude's general-purpose capabilities, agents have specific expertise and domain knowledge.
What Are Agents?
Agents are fresh Claude instances launched with specialized prompts for specific tasks. They:
Focus on one domain - e.g., code review, architecture design, business analysis
Have specialized knowledge - Expertise in their specific area
Work independently - Operate as sub-agents with their own context
Return specific outputs - Structured results aligned with their purpose
Agent Architecture
Key benefits:
Fresh context - Each agent starts with clean context specific to its task
Specialized expertise - Domain-specific knowledge and techniques
Parallel execution - Multiple agents can work simultaneously
Quality gates - Agents perform validation and review
How Agents Are Invoked
Automatic invocation - Commands launch agents as part of their workflow:
Manual invocation - Request specific agents directly:
Workflow Commands
Workflow commands orchestrate multi-step processes using sub-agents, automating complex reasoning cycles that would otherwise require manual invocation of multiple commands.
What Are Workflow Commands?
Workflow commands combine multiple sub-tasks into a single invocation:
Orchestrate sub-agents - Launch specialized agents for each step
Handle transitions - Manage state between phases automatically
Preserve audit trails - Document intermediate results
Support iteration - Allow user intervention at key decision points
Example: FPF Propose-Hypotheses Workflow
The FPF plugin's /fpf:propose-hypotheses command demonstrates this pattern:
Benefits of Workflow Commands
Reduced cognitive load
User doesn't need to remember command sequence
Consistent execution
Same process every time, reducing errors
Parallel processing
Multiple agents work simultaneously where possible
Auditable results
Intermediate artifacts preserved for review
User control
Decision points allow course correction
When to Use Workflow vs Individual Commands
Complete end-to-end process
Workflow command
Check current state
Utility command (/fpf:status)
Manage specific aspect
Utility command (/fpf:decay)
Iterate on single phase
Individual task prompts
First-Principles Reasoning (FPF)
The First Principles Framework (FPF) provides structured reasoning for complex decisions. Rather than jumping to solutions, FPF enforces systematic hypothesis generation, logical verification, and evidence-based validation.
ADI Cycle
FPF follows the Abduction-Deduction-Induction reasoning cycle:
Abduction
Generate candidate explanations
L0 hypotheses (conjectures)
Deduction
Verify logical consistency
L1 hypotheses (substantiated)
Induction
Validate against evidence
L2 hypotheses (corroborated)
This cycle ensures that hypotheses progress through increasing levels of confidence before informing decisions.
Knowledge Layers (L0/L1/L2)
FPF tracks epistemic status through knowledge layers:
L0
Conjecture
Hypothesis generated but unverified
L1
Substantiated
Passed logical consistency checks
L2
Corroborated
Validated by empirical evidence
Invalid
Falsified
Failed verification or validation
Hypotheses progress through layers as they accumulate verification:
L0 -> L1: Logical deduction passes
L1 -> L2: Evidence gathering confirms
Any layer -> Invalid: Verification or validation fails
Trust Calculus and R_eff
FPF computes effective reliability (R_eff) using the Weakest Link principle:
This means a hypothesis is only as reliable as its weakest supporting evidence. Key factors:
Evidence Score
Reliability rating of each evidence source
Congruence Level
How closely evidence context matches current context (CL3=same, CL1=different)
Evidence Freshness
Age of evidence affects reliability (decay over time)
The trust calculus ensures decisions are based on computed reliability, not estimated confidence.
Transformer Mandate
A core FPF principle: A system cannot transform itself.
AI generates options with computed evidence scores
Human makes the final decision
Autonomous architectural choices are a protocol violation
This separation ensures accountability and prevents unsupervised AI decisions on consequential matters.
Working with CLAUDE.md
The CLAUDE.md file is your project's living memory - a central repository of project-specific knowledge, patterns, and insights that persists across sessions.
What is CLAUDE.md?
CLAUDE.md is a markdown file in your project root that contains:
Project constitution - Core principles and standards
Architecture decisions - Key design choices and rationale
Best practices - Project-specific patterns and conventions
Lessons learned - Insights from reflections and critiques
Common pitfalls - Known issues and how to avoid them
Tech stack guidance - Framework and library usage patterns
Why it matters:
Persistent memory - Knowledge survives between sessions
Consistency - Ensures Claude follows project patterns
Quality improvement - Accumulates insights over time
Team alignment - Documents shared understanding
How Plugins Update CLAUDE.md
Several plugins read from and write to CLAUDE.md:
Reflexion plugin - Memorizes insights:
What it adds:
Key insights from reflection sessions
Patterns discovered during implementation
Lessons learned from critiques
Common mistakes to avoid
Successful approaches to replicate
Tech Stack plugin - Adds language/framework practices:
What it adds:
Language-specific best practices
Framework usage patterns
Code style guidelines
Common anti-patterns for the tech stack
DDD plugin - Sets up code quality standards:
What it adds:
Code formatting rules
Architecture principles
SOLID principle applications
Clean Architecture patterns
MCP plugin - Documents MCP server requirements:
What it adds:
MCP server integration requirements
When and how to use specific MCP servers
Configuration and usage patterns
SDD plugin - Establishes project constitution:
What it adds:
Project constitution and governance
Core architectural principles
Technology stack decisions
Development standards
Last updated