The agentic design pattern approach is a reusable architecture catalog for LLM systems because it provides a selection framework for organizing patterns and control models.
TL;DR
Engineers building AI agent systems work from at least three overlapping pattern sources: Andrew Ng's four foundational patterns, Anthropic's five workflow patterns, and a growing set of emergent reliability and memory patterns from 2025-2026. This guide consolidates those sources into a single 12-pattern foundational taxonomy, adds emergent patterns with maturity ratings, and maps each pattern to current frameworks. It also includes a worked PR triage example, SDLC phase mappings, seven anti-patterns, and five decision rules for selecting the minimum control mechanism for each failure mode.
What Is an Agentic Design Pattern?
An agentic design pattern is a reusable solution to a recurring coordination problem that appears when LLM-driven systems make decisions at runtime rather than following fixed code paths. Traditional software design patterns assume deterministic actors and known control flow. Agentic patterns address what changes when the actor is a probabilistic model that can choose its next tool call, revise its own output, or hand work off to another agent.
The distinction matters for three practical reasons:
- Runtime decisions create new failure modes. Infinite reflection loops, hallucinated tool calls, and runaway token spend do not exist in deterministic systems. Agentic patterns name the controls that prevent them.
- Coordination becomes a first-class concern. Once more than one agent or step is involved, the system needs explicit patterns for delegation, evaluation, and handoff. The same problems exist in distributed systems, though the actors here are model calls rather than services.
- Auditability shifts from code to trajectory. With workflows, every step is traceable in the source. With agents, the trace lives in the run, which forces patterns around logging, bounded execution, and human approval.
Readers brand new to the topic can think of agentic patterns as the engineering vocabulary for "what controls do I add around a model that decides for itself." The rest of this catalog names those controls, groups them, and shows when each one is worth adopting.
The Pattern Fragmentation Problem
Engineering teams shipping agentic systems today often select patterns from partial taxonomies rather than a single shared catalog. One team reads Andrew Ng's four foundational patterns and builds a reflection loop. Another reads Anthropic's workflow taxonomy and implements prompt chaining. A third discovers a 21-pattern community catalog on GitHub and realizes both teams missed entire categories. Three teams end up working from three vocabularies for the same problem.
Pattern fragmentation changes selection decisions across latency, cost, auditability, and failure boundaries because workflows use predefined execution paths while agents decide next steps at runtime. The vocabulary has expanded across vendor guides, practitioner catalogs, and academic taxonomies. This catalog walks through the taxonomy, the workflow-versus-agent boundary, selection criteria, framework implementations, anti-patterns, a worked example, and lifecycle fit.
Augment Cosmos is the unified cloud agents platform with shared context and memory that compounds across the team and the software development lifecycle, so the patterns below map directly to a production runtime rather than staying as theory.
See how Cosmos turns these patterns into coordinated agents with shared context and memory across your SDLC.
Free tier available · VS Code extension · Takes 2 minutes
Workflows vs. Agents: The Architectural Boundary
The workflows-versus-agents boundary separates two control models: predefined execution order in code, and runtime delegation by the model. Anthropic's December 2024 "Building Effective AI Agents" post highlighted the distinction that informs many pattern selection decisions.
| Type | Definition | Control Model | Auditability |
|---|---|---|---|
| Workflows | LLMs orchestrated through predefined code paths | Deterministic; execution order specified at design time | High: every step traceable |
| Agents | LLMs dynamically directing their own processes and tool usage | Non-deterministic; model decides next steps at runtime | Lower: requires trajectory logging |
Both qualify as agentic systems, though the engineering implications differ by control model. Workflows provide predictability through predefined execution paths, which makes every step traceable. Agents provide flexibility through runtime decision-making, which increases the need for trajectory logging and execution controls.
The 12 Foundational Agentic AI Design Patterns
Foundational agentic AI design patterns come from multiple taxonomies: four patterns from Andrew Ng, five workflow patterns from Anthropic, and three additional patterns that clarify control, topology, and implementation choices. The sections below cover each in turn before consolidating them into a single reference table.
Andrew Ng's Four Core Patterns (2024)
Andrew Ng's four core patterns define one early agentic framework with four distinct mechanisms for output revision, tool access, task decomposition, and coordination. Andrew Ng named and published these four foundational agentic AI patterns in The Batch newsletter at DeepLearning.AI:
1. Reflection. The agent generates an initial output, critiques it for errors or quality gaps, then produces a revised output across multiple iterations. Ng describes Reflection as a pattern in which the LLM examines its own work to improve it. One specific research architecture in this family is the Reflexion paper, which adds episodic memory across trials beyond same-turn critique.
2. Tool Use. The agent calls external APIs for information missing from model weights: current information, code execution, and proprietary data sources. Tool Use is the most established execution pattern in this catalog and serves as a building block for many other patterns.
3. Planning. The agent breaks large tasks into smaller subgoals and handles complex tasks through decomposition, including self-criticism and refinement over past actions. Ng explicitly flagged Planning as "less mature, less predictable" than Reflection and Tool Use.
4. Multi-Agent Collaboration. Multiple specialized agents, each with its own prompt, LLM, tools, and custom code, work together on problems that exceed single-agent capabilities. The mechanism combines role specialization with coordination across agents, extending beyond a single prompt loop.
Anthropic's Five Workflow Patterns (2024)
Anthropic's five workflow patterns are structured orchestration patterns for multi-step processes that define the sequence and conditions under which agents execute tasks. Anthropic formalized five named workflow patterns in its engineering guide. The five-pattern framing reflects the original December 2024 post; some newer Anthropic materials group the same ideas under a smaller set of higher-level workflow shapes (for example, sequential, parallel, and evaluator-optimizer), so readers may encounter different counts depending on which document they reference.
5. Prompt Chaining. Output of one LLM call becomes input of the next in a deterministic pipeline. Use when tasks decompose "easily and cleanly into fixed subtasks."
6. Routing. Classifies input and directs it to specialized downstream processes. Enables separation of concerns across distinct task categories.
7. Parallelization. Multiple LLM calls run simultaneously with outputs aggregated programmatically. Two sub-variants exist: sectioning, which runs independent subtasks in parallel, and voting, which runs the same task multiple times.
8. Orchestrator-Workers. A central LLM dynamically delegates tasks to worker LLMs and synthesizes results.
9. Evaluator-Optimizer. One LLM generates, another evaluates against predefined criteria. The generator and evaluator have explicitly separated roles, which distinguishes this pattern from single-LLM Reflection.
Three Additional Foundational Patterns
The Ng and Anthropic taxonomies cover output revision, tool access, decomposition, and orchestration, but they leave gaps around approval gates, communication structure, and concrete reasoning loops. Human-in-the-Loop, topology patterns, and ReAct fill those gaps and round out the foundational set used throughout the rest of this catalog. ReAct predates Ng's 2024 framing by roughly two years and appears here as an earlier foundational reference. Together, these three patterns make the broader taxonomy easier to apply in production systems.
10. Human-in-the-Loop. A cross-cutting modifier insertable into any pattern as an approval gate, with "stopping conditions (such as a maximum number of iterations) to maintain control".
11. Topology Patterns (Chain/Star/Mesh). Recent academic literature on agentic community design describes three communication topologies for multi-agent systems: chain, star, and mesh.
12. ReAct (Reasoning + Acting). A concrete reasoning-and-acting approach, originally introduced in 2022, that supports dynamic plan creation and adjustment.
Consolidated Taxonomy
The consolidated taxonomy groups 12 foundational patterns by tier, year, and control role, which simplifies comparison across workflow patterns, agent patterns, guardrails, and implementation choices in one view.
| # | Pattern | Tier | Year | Anthropic Category |
|---|---|---|---|---|
| 1 | Reflection | Core (Ng) | 2024 | Agent pattern |
| 2 | Tool Use | Core (Ng) | 2023-2024 | Building block |
| 3 | Planning | Core (Ng) | 2024 | Agent pattern |
| 4 | Multi-Agent Collaboration | Core (Ng) | 2024 | Agent pattern |
| 5 | Prompt Chaining | Workflow (Anthropic) | 2024 | Workflow |
| 6 | Routing | Workflow (Anthropic) | 2024 | Workflow |
| 7 | Parallelization | Workflow (Anthropic) | 2024 | Workflow |
| 8 | Orchestrator-Workers | Workflow (Anthropic) | 2024 | Workflow |
| 9 | Evaluator-Optimizer | Workflow (Anthropic) | 2024 | Workflow |
| 10 | Human-in-the-Loop | Safety/Guardrail | 2024-2025 | Safety/Guardrail |
| 11 | Topology (Chain/Star/Mesh) | Structural | 2024-2025 | Architecture |
| 12 | ReAct | Reasoning + Acting (pre-dates Ng's 2024 patterns) | 2022-2023 | Reasoning + Acting paradigm |
Emergent Patterns from the 2025-2026 Catalog
Beyond the 12 foundational patterns, the 2025-2026 literature adds a wave of emergent patterns that address production constraints through context management, bounded execution, layered safety controls, memory, and meta-level orchestration. Pattern maturity varies across the set, so this section flags maturity alongside the mechanism. The three highlighted entries below are the most actionable today; the remaining patterns appear in the second table for completeness.
Highlighted Emergent Patterns
| # | Pattern | Category | Key Mechanism | Maturity |
|---|---|---|---|---|
| P3 | Context Engineering | Memory | Select/Compress techniques and memory-based write behavior for context window optimization | Production-ready |
| P10 | Bounded Execution / Circuit Breaker | Reliability | Maximum step limits, tool-call caps, circuit breakers, idempotent tool design | Production-ready |
| P11 | Guardrail Layering | Reliability | Guardrails at four execution points: user input, tool calls, tool responses, final output | Production-ready |
Each highlighted pattern has a primary reference. LangChain's writeup on context engineering for agents details the Context Engineering pattern. Bounded Execution and Circuit Breaker patterns appear in recent arXiv work on agent reliability. Microsoft's AI agent design patterns guide on Azure Architecture Center covers Guardrail Layering.
Context Engineering (P3) improves multi-agent operation through selection, compression, and isolation of context. Bounded Execution (P10) caps work within defined limits and reduces operational risk. Guardrail Layering (P11) places controls at four execution points (user input, tool calls, tool responses, and final output), giving teams multiple points to catch failures before they reach end users.
Additional Emergent Patterns
The patterns below appear repeatedly in 2025-2026 catalogs but have less converged naming or interfaces than the three highlighted above. They are still worth knowing about because each addresses a distinct production failure mode.
| Pattern | Category | What It Does | Maturity |
|---|---|---|---|
| Short-Term Memory | Memory | Scratchpad or working memory for the current task, distinct from session history | Production-ready |
| Long-Term Memory | Memory | Persistent store of user, task, and team knowledge across sessions; typically vector or hybrid-backed | Emerging |
| Episodic Memory | Memory | Record of prior agent trajectories (success and failure) used to inform future attempts; appears in Reflexion-style architectures | Emerging |
| Hierarchical RAG | Retrieval | Multi-level retrieval that first selects a corpus or section, then chunks within it, beyond flat similarity search | Emerging |
| Supervisor / Sub-Agent Hierarchy | Coordination | Tree of agents where a parent agent delegates to specialized children and aggregates their results | Production-ready |
| Agent Swarm / Mesh Coordination | Coordination | Peer agents communicate without a central coordinator, common in topology research | Experimental |
| Self-Correction / Self-Verification | Reliability | The agent re-checks its own intermediate outputs against constraints before continuing; related to Reflection but applied mid-task | Emerging |
| Tool Sandboxing | Reliability | Tool execution is isolated (filesystem, network, time) so that an errant call cannot affect the host environment | Production-ready |
| Trajectory Logging and Replay | Observability | Full trace of decisions, tool calls, and intermediate outputs, replayable for debugging and evaluation | Production-ready |
| Meta-Agent / Self-Improving Agent | Meta-level | An agent that modifies its own prompts, tools, or sub-agents based on observed performance | Experimental |
Teams evaluating these coordination approaches often benchmark them against current multi-agent orchestration platforms before deciding what to build versus what to adopt.
Pattern Selection: Five Tradeoff Dimensions
Pattern selection works best when teams match the observed failure mode to the minimum control mechanism that resolves it, because each additional pattern in the tradeoff matrix adds latency, cost, or coordination overhead. Anthropic's guidance sets the baseline: start simple, evaluate carefully, and add agentic structure only when simpler solutions fall short. The matrix below summarizes how the four core patterns compare across five dimensions.
| Dimension | Tool Use | Reflection | Planning | Multi-Agent |
|---|---|---|---|---|
| Latency | Low | Medium (+1 LLM call min) | Medium-High (variable) | Highest |
| Cost | Predictable | ~2x per reflection cycle | Variable; loop risk | Highest token risk |
| Reliability | High (bounded) | High if criteria objective | Degrades with plan length | Lowest out of the box |
| Build Complexity | Low | Medium | Medium-High | Highest |
| Controllability | Highest | High | Medium | Lowest |
Five Decision Rules
These five decision rules map common task shapes to the smallest useful pattern, which keeps teams clear of unnecessary coordination overhead and preserves controllability.
- If the task resolves with a single LLM call plus tools, use Tool Use alone.
- If output quality needs verification against objective, externally verifiable criteria, add Reflection.
- If the task requires multi-step dependencies that cannot be pre-specified, use Planning (ReAct) with explicit iteration limits.
- If the task genuinely requires specialized roles that exceed a single context, consider Multi-Agent.
- Before adding any pattern, verify the benefits justify the costs by responding to observed failure modes more than anticipated requirements.
The combined effect of these rules is a smaller surface area to debug, observe, and govern as teams scale agentic work.
Explore how Cosmos coordinates parallel agents around shared context and memory, so the patterns above run as one unified system rather than disconnected experiments.
Free tier available · VS Code extension · Takes 2 minutes
in src/utils/helpers.ts:42
Worked Example: Selecting Patterns for an Automated PR Triage Agent
The decision rules are easier to apply against a concrete task. Consider a realistic engineering scenario: a team wants to ship an agent that triages incoming pull requests, applies labels, suggests reviewers, and flags PRs that look risky based on the files touched and the test coverage of the changed code.
Step 1: Start with Tool Use. The agent needs to read PR metadata, list changed files, and query coverage data. All three are deterministic API calls. A single LLM call plus three tools (get_pr, list_files, get_coverage) handles the labeling and reviewer suggestion. Rule 1 applies, so no further pattern is needed yet.
Step 2: Add Reflection only when verification fails. Initial evaluation shows the agent occasionally suggests reviewers who left the team. The failure mode is verifiable against an external source (the active members list), so Rule 2 applies. Reflection compares the suggested reviewers against the active list and revises if needed. Stop here unless something else breaks.
Step 3: Skip Planning unless dependencies emerge. The task does not have multi-step dependencies that change at runtime; the steps are known in advance. Rule 3 does not apply. Adding Planning here would add latency and loop risk without solving a real failure.
Step 4: Skip Multi-Agent unless roles diverge. Risk-flagging could plausibly become its own role (security review, performance review, compliance review), but for the current scope, a single agent with the right tools and one reflection step covers the work. Rule 4 does not apply yet. Multi-Agent becomes justified only when the team needs distinct reviewers, distinct prompts, and distinct evaluation criteria that cannot share one context.
Step 5: Layer in emergent patterns as guardrails. Before shipping, the team adds three emergent patterns: Bounded Execution (cap at five tool calls per PR), Guardrail Layering (validate tool inputs and outputs), and Trajectory Logging (record every decision for later evaluation). These are required in production, since they catch the failures the core patterns cannot.
The result is a working agent built from Tool Use, Reflection, and three emergent guardrails, with Planning and Multi-Agent held in reserve. The same step-by-step application of the rules works for most production tasks: start at Tool Use, climb only when a real failure mode forces the next pattern.
Framework Support Matrix
Framework support determines how directly a pattern maps into implementation primitives, and that mapping affects how much custom code teams must write. This matrix shows where common agentic patterns map cleanly to current frameworks and where teams should expect implementation overhead.
| Pattern | LangGraph | CrewAI | AutoGen/AG2 | OpenAI Agents SDK | Bedrock Agents | Claude Agent SDK |
|---|---|---|---|---|---|---|
| Prompt Chaining | ✅ Sequential nodes | ✅ Process.sequential | ✅ RoundRobinPattern | ✅ Sequential agent/task flows | ✅ Action groups used across multiple steps | ✅ Tool call sequences |
| Tool Use | ✅ Native | ✅ Native | ✅ Native | ✅ Built-in tools | ✅ JSON-schema tool definitions | ✅ Native |
| Reflection | ✅ Custom nodes | ⚠️ Via delegation | ✅ GroupChat critique | ⚠️ Via guardrails | ⚠️ Post-processing | ✅ Extended thinking |
| Multi-Agent | ✅ Supervisor + Swarm | ✅ Core design | ✅ Core design | ✅ Handoffs | ✅ Multi-agent collaboration | ⚠️ MCP support documented; handoffs not clearly documented |
| MCP Support | ✅ Via adapter | ✅ First-class | ✅ Built-in | ✅ Built-in | ✅ Documented | ✅ Core design |
Legend: ✅ Native/first-class | ⚠️ Requires custom code | ❌ Not supported
Human-in-the-loop support varies by framework and typically appears as interrupts, checkpoints, guardrails, or approval and review workflows.
Note: Microsoft's AutoGen project is now in maintenance mode, and Microsoft's official recommendation for new projects is the Microsoft Agent Framework, which consolidates the AutoGen and Semantic Kernel lineages under one supported codebase. Separately, the AG2 0.9 release is a community fork maintained outside Microsoft for teams that want to stay on the original AutoGen API surface. For practitioner-grade comparisons of these frameworks in production, see this roundup of open-source agent orchestrators.
State of the Field: What's Production-Ready in 2026
Framework maturity moves quickly, so a pillar reference should orient readers on what is stable enough to standardize on versus what is still experimental. The table below summarizes the current state of the major frameworks and pattern categories as of mid-2026.
| Area | Status | What This Means |
|---|---|---|
| LangGraph | Ascending | Strong native support for most foundational patterns; common default for graph-shaped agentic systems |
| CrewAI | Stable | First-class multi-agent and MCP support; well-suited for role-based teams of agents |
| Microsoft Agent Framework | Ascending | Microsoft's official forward path, consolidating AutoGen and Semantic Kernel |
| AutoGen (original) | Declining | In maintenance mode; new projects directed to Microsoft Agent Framework |
| AG2 | Stable (community) | Active community fork of AutoGen for teams preserving the original API |
| OpenAI Agents SDK | Ascending | Built-in tools and handoffs; rapidly evolving alongside model releases |
| Bedrock Agents | Stable | Enterprise AWS deployments; action groups and multi-agent collaboration documented |
| Claude Agent SDK | Ascending | MCP as a core design choice; extended thinking supports Reflection-style patterns |
| Foundational patterns (Ng + Anthropic) | Production-ready | Tool Use, Prompt Chaining, Routing, Parallelization, Reflection, Orchestrator-Workers, Evaluator-Optimizer all have stable implementations across frameworks |
| Bounded Execution, Guardrail Layering, Context Engineering | Production-ready | Required for any system running in production; framework support varies but the patterns are well-defined |
| Memory patterns (long-term, episodic) | Emerging | Active research and rapid product churn; interfaces not yet stable across frameworks |
| Meta-agents and self-improving agents | Experimental | Worth tracking, though not yet worth standardizing on |
The overall pattern from this view: foundational patterns and the three highlighted emergent patterns are safe to standardize on, while memory and meta-level patterns warrant active monitoring with careful adoption.
Seven Anti-Patterns That Break Production Agentic Systems
Seven production anti-patterns break agentic systems through control gaps, weak safety placement, missing governance, and poor evaluation discipline. The recurring failure modes below show where otherwise capable agentic systems break once they face real tools, real costs, and real operational constraints.
| Anti-Pattern | Description |
|---|---|
| The God Prompt | Fitting every instruction and edge case into a single prompt; decompose into orchestrated sub-agents |
| Over-Agentification | Using agentic patterns when deterministic logic would be simpler; a single capable agent almost always outperforms multi-agent for simple tasks |
| Uncontrolled Recursion | Reflection or planning loops without hard bounds on iterations, cost, or execution time |
| Agent Sprawl | Deploying agents without ownership, accountability, or governance |
| Output-Only Guardrails | Applying safety checks only to final output |
| Governance as Afterthought | Shipping without governance mechanisms such as documentation, human control, escalation pathways, and continuous monitoring, then retrofitting; build governance in from the outset |
| Vibe-Checking as Testing | Shipping based on subjective assessment that outputs "look correct" without eval frameworks |
The God Prompt and Over-Agentification draw on practitioner guidance, with the monolith-to-sub-agent decomposition pattern documented in Google's writeup on refactoring a monolithic AI agent. Output-Only Guardrails inverts the layered guardrail pattern recommended by Microsoft. The Berkeley CLTC agentic AI risk management standards profile addresses both Agent Sprawl and Governance as Afterthought, arguing for documentation, escalation pathways, and continuous monitoring from day one. Vibe-Checking as Testing aligns with Anthropic's eval recommendations.
Agent Sprawl deserves particular attention because scaling multi-agent systems increases coordination drift as more roles, prompts, and handoffs accumulate. Threat-specific patterns covering prompt injection, data exfiltration, and adversarial inputs sit outside this catalog and warrant their own treatment.
Agentic Patterns Across the Software Development Lifecycle
Agentic patterns map unevenly across software development lifecycle phases because some phases already fit deterministic workflows such as prompt chaining, routing, and CI-oriented review, while others still rely on emerging multi-agent coordination. Pattern maturity ranges from production-ready workflow patterns to emerging spec-driven orchestration and more experimental agentic DevOps and requirement agents.
The table below maps specific SDLC phases to the patterns that fit them best today, alongside the maturity of agentic adoption in each phase.
| SDLC Phase | Pattern Fit | Why | Maturity |
|---|---|---|---|
| Requirements and intake | Routing, Human-in-the-Loop | Incoming requests fall into a small number of known buckets that need classification and human confirmation | Emerging |
| Design and architecture | Planning (ReAct), Multi-Agent | Open-ended decomposition and specialized roles (data, security, infra) genuinely benefit from coordination | Experimental |
| Implementation | Tool Use, Prompt Chaining | Codegen, search, and edit tools chain cleanly; multi-step dependencies often known in advance | Production-ready |
| Code review | Routing, Evaluator-Optimizer | PRs route to checks (style, security, tests), each with separated generator and evaluator roles | Production-ready |
| Testing | Evaluator-Optimizer, Parallelization | Generator writes tests, evaluator scores coverage and correctness; cases run in parallel | Production-ready |
| CI and release | Prompt Chaining, Bounded Execution | Deterministic pipelines with hard caps on retries and tool calls | Production-ready |
| Incident response | ReAct, Orchestrator-Workers | Open-ended investigation with delegation to specialized log, metric, and trace workers | Emerging |
| Documentation | Tool Use, Reflection | Single-pass generation with verification against the actual codebase | Production-ready |
The lifecycle fit is uneven in three recurring ways:
- Deterministic workflows fit phases that already decompose cleanly into fixed subtasks (implementation, CI, release, documentation).
- CI-oriented review aligns more naturally with prompt chaining and routing than with open-ended delegation.
- Emerging multi-agent coordination appears more often in spec-driven orchestration, agentic DevOps, and requirement agents, where the structure is still being worked out in practice.
Match Each Pattern to the Problem Before Scaling Agent Complexity
Matching each pattern to the actual failure mode is the practical next step, because extra loops, handoffs, and worker roles increase coordination cost before they improve outcomes. Teams that follow the decision rules above stay on the minimum control mechanism for each failure mode, adding Reflection only when output verification fails, Planning only when multi-step dependencies emerge, and Multi-Agent only when the work exceeds a single role, context window, or service boundary.
For teams ready to put these patterns into production, Augment Cosmos is a unified cloud agents platform with shared context and memory that compounds across the team and the software development lifecycle. It is one option among the broader multi-agent orchestration platforms covered in the related listicle below, and readers should evaluate it against the rest of that landscape.
See how Cosmos coordinates parallel agents with shared context and memory across review, testing, and deployment.
Free tier available · VS Code extension · Takes 2 minutes
FAQ
Related
Written by

Molisha Shah
Molisha is an early GTM and Customer Champion at Augment Code, where she focuses on helping developers understand and adopt modern AI coding practices. She writes about clean code principles, agentic development environments, and how teams are restructuring their workflows around AI agents. She holds a degree in Business and Cognitive Science from UC Berkeley.