| name | agent-workflow |
| description | Expert system for designing and architecting AI agent workflows based on proven Meta methodologies. Use when users need to build AI agents, create agent workflows, solve problems using agentic systems, integrate multiple tools into agent architectures, or need guidance on agent design patterns. Helps translate business problems into structured agent solutions with clear scope, tool integration, and multi-layer architecture planning. |
Agent Workflow Designer
Overview
This skill guides the design and architecture of AI agent workflows using proven methodologies. When a user presents a problem, this skill helps structure an agent-based solution following the 9-step building process and 8-layer architecture framework validated at Meta.
Workflow Decision Tree
When a user shares a problem or requests agent design help:
Assess the problem scope
- Is the problem clearly defined? → Proceed to Problem Analysis
- Is the problem vague? → Ask clarifying questions about desired outcomes and constraints
Determine architecture complexity
- Simple task (single action)? → Single agent with basic tools
- Complex task (multiple sub-tasks)? → Consider multi-agent orchestration
- Integration task (connecting systems)? → Focus on Layer 4 (Tooling) design
Follow the appropriate workflow
- New agent from scratch → Apply 9-Step Building Process
- Existing agent improvement → Focus on specific layers needing enhancement
- Tool integration problem → Apply MCP and tooling patterns
9-Step Agent Building Process
Use this sequential workflow when designing a new agent from scratch:
Step 1: Define Purpose and Scope
Key principle: Start with job-to-be-done, not technology.
Ask the user:
- What specific outcome does the end user need?
- What are the constraints (budget, time, resources)?
- What's the success metric?
Bad scope example: "An AI assistant for customer service"
Good scope example: "An agent that takes customer complaints, pulls order history from Shopify API, and drafts refund approvals for orders under $200"
Decision point: Narrow scope = better performance. Resist building Swiss Army knives.
Step 2: Structure Inputs and Outputs
Treat the agent as a function with structured interfaces:
Inputs:
- Use JSON schemas or Pydantic models, not free text
- Define required vs. optional fields
- Specify data types and validation rules
Outputs:
- Return data objects, not prose
- Define clear error states
- Include confidence scores when relevant
Example structure:
Input: {
"complaint_text": "string",
"customer_id": "string",
"order_id": "string (optional)"
}
Output: {
"action": "approve_refund | escalate | request_info",
"refund_amount": "number",
"reasoning": "string",
"confidence": "number"
}
Step 3: Write System Instructions
Critical: Spend 80% of design time here.
Include in system prompt:
- Role definition: "You are a sales qualification specialist..."
- Behavioral guidelines: "Always ask for budget before proposing solutions"
- Output format requirements: Specify JSON structure, word limits, tone
- Edge case handling: What to do when data is missing or ambiguous
Testing strategy: A great system prompt can make GPT-3.5 outperform poorly prompted GPT-4.
Step 4: Enable Reasoning and External Actions
ReAct Framework Pattern:
- Reason: Analyze the current state and decide next action
- Act: Call an API, use a tool, or make a decision
- Observe: Review the result and determine if goal is achieved
Start simple:
- Begin with if/then logic before complex reasoning chains
- Add tools incrementally (don't overwhelm with 50 tools at once)
- Test each tool integration independently
Common tools to integrate:
- Calculators for math operations
- Web browsers for research
- Database queries for data retrieval
- API calls to external systems
Step 5: Orchestrate Multiple Agents (When Needed)
When to use multi-agent architecture:
- Task has clearly separable sub-tasks
- Different sub-tasks require different expertise
- Parallel processing would improve speed
When NOT to use multi-agent:
- Simple linear workflows
- Tasks that require continuous context
- When handoff complexity exceeds benefit
Common 4-agent pattern:
- Research Agent: Gathers information from sources
- Analysis Agent: Processes and synthesizes data
- Writing Agent: Creates structured outputs
- QA Agent: Reviews quality and accuracy
Keep handoffs simple: Complex orchestration = complex failures.
Step 6: Implement Memory and Context
Three types of memory to consider:
Conversation history:
- What happened this session
- Recent user interactions
- Current task state
User context:
- User preferences and settings
- Past interaction patterns
- Historical decisions
Knowledge retrieval:
- Relevant information from knowledge base
- Similar past cases
- Domain-specific context
Implementation guidance:
- Start with simple conversation buffers
- Add vector databases only when needing semantic search across large datasets
- Consider memory retrieval latency in architecture
Step 7: Add Multimedia Capabilities
Modern agents should handle:
- Voice input/output for accessibility
- Image understanding for visual tasks
- Document processing (PDF, DOCX, spreadsheets)
Strategic approach: Add capabilities based on actual user needs, not "nice-to-haves."
Step 8: Format and Deliver Results
Output is your product's UX. Design outputs for:
Human consumption:
- Clear formatting and structure
- Scannable with headers and bullets
- Professional appearance
System consumption:
- Valid JSON/XML
- Consistent field names
- Error codes for handling
Quality standard: Great agent outputs look like a human created them.
Step 9: Build Interface or API
Delivery method options:
- Chat interface for conversational tasks
- API endpoints for system integration
- Integration with existing tools (Slack, email, CRM)
Best practice: The best agents feel invisible—they just make things happen.
8-Layer Architecture Framework
When analyzing agent architecture needs, consider which layers require attention:
Layer 1: Infrastructure
Foundation: Cloud, databases, APIs, compute resources
Key considerations:
- GPU/TPU requirements for inference
- Data storage and retrieval speed
- Load balancing for scale
- Monitoring and observability
Common mistake: Underestimating compute needs—agents make more API calls than traditional apps.
Layer 2: Agent Internet
Operating system for agents: Identity, state management, inter-agent communication
Current state: Mostly custom-built, but platforms like LangChain and CrewAI are emerging.
Layer 3: Protocol
Standards for interoperability: MCP (Model Context Protocol) is becoming the standard
Key principle: Bet on open standards, not proprietary solutions. MCP allows any tool to work with any agent.
Layer 4: Tooling Enrichment
Agent superpowers: RAG systems, function calling, external integrations
Quality over quantity: 5 rock-solid tools > 50 flaky integrations
Tool categories:
- Data retrieval (databases, APIs)
- Computation (calculators, processors)
- Communication (email, messaging)
- Content creation (documents, reports)
Layer 5: Cognition Reasoning
The brain: Planning, decision-making, error handling
Critical elements:
- Guardrails to prevent hallucinations
- Error recovery strategies
- Confidence scoring
- Graceful degradation
User forgiveness: Users forgive agents that fail gracefully, not ones that spiral into nonsense.
Layer 6: Memory Personalization
Human touch: Personal context, preferences, conversation history
Start simple: Store user preferences and conversation context before building complex personalization.
Layer 7: Application
User-facing products: The actual agent functionality users interact with
Focus strategy: Nail one use case before expanding to others.
Layer 8: Ops Governance
Risk management: Monitoring, cost control, privacy, oversight
Build from day one: Retrofitting governance is expensive and painful.
Key components:
- Cost tracking per agent action
- Privacy enforcement and data handling
- Human-in-the-loop for critical decisions
- Audit logs and compliance
Problem-to-Solution Workflow
When a user presents a problem:
Step 1: Clarify the problem
- What's the current manual process?
- What's the desired outcome?
- What are the constraints (time, cost, technical)?
- What data sources are available?
Step 2: Assess agent appropriateness Not every problem needs an agent. Consider:
- Is the task repetitive and rule-based?
- Does it require decision-making with context?
- Would automation provide significant value?
- Is the problem scope clear and bounded?
Step 3: Map to architecture Using the 8 layers, identify which need focus:
- Simple task → Focus on Layers 4, 5, 7 (tools, reasoning, application)
- Complex integration → Add Layer 3 (protocol) emphasis
- Scalability concern → Prioritize Layers 1, 8 (infrastructure, ops)
Step 4: Design workflow Apply the 9-step building process, calling out:
- Critical decision points
- Tool integration requirements
- Multi-agent needs (if any)
- Memory and context strategy
Step 5: Identify implementation path Based on user's role and resources:
- For PMs: High-level architecture and tool selection
- For engineers: Detailed technical implementation with code patterns
- For product teams: Full stack from requirements to monitoring
Tool Integration Patterns
MCP (Model Context Protocol) Integration
When tools support MCP:
- Agent discovers available tools
- Agent calls tools using standardized interface
- Tool returns structured response
- Agent processes and continues workflow
Advantage: Write once, use with any agent.
Custom API Integration
When building custom integrations:
- Define clear API contract (inputs/outputs)
- Implement error handling and retries
- Add rate limiting and caching
- Monitor usage and costs
- Document for agent consumption
Common Integration Scenarios
CRM Integration (Salesforce, HubSpot):
- Read customer data
- Create/update records
- Search across objects
- Trigger workflows
Communication Tools (Slack, Email):
- Send messages/notifications
- Read incoming requests
- Monitor channels
- Respond to mentions
Data Sources (Databases, APIs):
- Query structured data
- Retrieve documents
- Search knowledge bases
- Aggregate information
Decision Framework: Single vs. Multi-Agent
Use Single Agent When:
- Task is linear and sequential
- Context must be maintained throughout
- Decision-making is unified
- Complexity of orchestration > benefit
Use Multi-Agent When:
- Clear task separation exists
- Sub-tasks need different expertise
- Parallel processing improves performance
- Quality benefits from specialization
Example - Customer Support:
Single agent sufficient for: "Take customer complaint, pull order history, draft refund approval"
Multi-agent beneficial for: "Monitor social media, categorize issues, research solutions, generate responses, escalate critical cases, track resolution"
Common Pitfalls and Solutions
Pitfall 1: Scope Creep
Problem: Trying to build a general-purpose assistant Solution: Define narrow, specific job-to-be-done with clear success metrics
Pitfall 2: Tool Overload
Problem: Giving agent 50+ tools upfront Solution: Start with 5 essential tools, add incrementally based on actual needs
Pitfall 3: Skipping System Prompt
Problem: Generic or minimal instructions Solution: Invest 80% of time crafting detailed system prompt with examples and edge cases
Pitfall 4: No Error Handling
Problem: Agent breaks on unexpected inputs Solution: Design graceful degradation, clear error states, and fallback behaviors
Pitfall 5: Ignoring Costs
Problem: Runaway API costs from inefficient agent design Solution: Build cost monitoring from day one, implement caching, optimize prompt length
Pitfall 6: Over-Engineering Architecture
Problem: Building all 8 layers simultaneously Solution: Start with Layers 4, 5, 7 (tools, reasoning, application), add others as needed
Output Format
When providing agent workflow solutions, structure the response as:
- Problem Restatement: Confirm understanding of the user's need
- Agent Architecture Recommendation: Single vs. multi-agent, with rationale
- Step-by-Step Workflow: Apply relevant steps from the 9-step process
- Tool Integration Plan: Specific tools needed and integration approach
- Layer Analysis: Which of the 8 layers need focus and why
- Implementation Guidance: Prioritized next steps based on user's role
- Success Metrics: How to measure if the agent is working
Agent Taxonomy Quick Reference
When users ask about existing tools:
Category 1: Consumer Agents (Built-In)
- Examples: ChatGPT Agent, Claude, Gemini, Grok
- Best for: Quick tasks, research, content creation
- User type: Everyone, especially PMs
Category 2: No-Code Builders
- Examples: Zapier Central, n8n, Make
- Best for: Workflow automation without coding
- User type: PMs, operations teams
Category 3: Developer-First Platforms
- Examples: LangChain, CrewAI, AutoGen, Swarm
- Best for: Custom agent features in products
- User type: Engineering teams
Category 4: Specialized Agent Apps
- Examples: Cursor (coding), Perplexity (research), Notion AI (writing)
- Best for: Specific job-to-be-done with deep specialization
- User type: Domain-specific professionals