| name | deepresearchagent |
| description | Use when working with DeepResearchAgent/AgentOrchestra - a hierarchical multi-agent system for complex task solving, deep research, and automation. Achieved 83.39% accuracy on GAIA benchmark. Includes planning agents, research agents, browser automation, and MCP tool integration. |
DeepResearchAgent (AgentOrchestra) Skill
A comprehensive assistant for working with DeepResearchAgent/AgentOrchestra - a state-of-the-art hierarchical multi-agent framework that achieved 83.39% accuracy on the GAIA benchmark for general-purpose task solving.
When to Use This Skill
This skill should be triggered when users ask about:
Core Framework:
- DeepResearchAgent or AgentOrchestra architecture
- Hierarchical multi-agent systems and coordination
- TEA Protocol (Tool-Environment-Agent) implementation
- Complex task decomposition and delegation strategies
Agent-Specific Questions:
- Planning Agent configuration and task decomposition
- Deep Researcher for information synthesis and research
- Browser Use Agent for web automation and data collection
- Deep Analyzer for extracting insights from complex data
- MCP Manager Agent for dynamic tool creation and evolution
- General Tool Calling integration and function invocation
Implementation Scenarios:
- Building multi-agent systems for complex problem solving
- Automating research workflows with multiple specialized agents
- Creating systems that need intelligent tool discovery and creation
- Implementing browser-based automation with multi-agent coordination
- Working with GAIA benchmark evaluation or similar complex tasks
Performance & Benchmarking:
- GAIA benchmark results (83.39% test accuracy, 82.4% validation)
- SimpleQA or HLE benchmark comparisons
- General-purpose LLM-based agent system design
Key Concepts
Hierarchical Architecture
AgentOrchestra uses a 2-layer hierarchical structure inspired by how a conductor orchestrates a symphony:
- Top Layer: Planning Agent - Central coordinator that decomposes complex objectives and delegates sub-tasks to specialized agents
- Bottom Layer: 6 Specialized Agents - Each with distinct capabilities for specific task types
The Six Specialized Agents
Planning Agent
- Central coordinator for task decomposition
- Delegates sub-tasks to appropriate specialized agents
- Maintains overall workflow coordination
Deep Researcher Agent
- Conducts thorough research on specified topics
- Retrieves and synthesizes high-quality information
- Supports multi-source information gathering
Browser Use Agent
- Automates browser operations using Playwright/Patchright
- Supports web search, information extraction, and data collection
- Handles complex web interactions and navigation
Deep Analyzer Agent
- Performs in-depth analysis of input information
- Extracts key insights and identifies requirements
- Supports multi-model analysis (Gemini 2.5 Pro)
General Tool Calling Agent
- Provides general-purpose interface for tools and APIs
- Supports function calling and external service integration
- Handles standard automation tasks
MCP Manager Agent ⭐ NEW
- Enables intelligent tool evolution through automated creation
- Dynamic retrieval and systematic reuse of MCP tools
- Addresses tool development inefficiency and version inconsistency
MCP Manager Agent Features
The MCP Manager Agent represents a paradigm shift from static tool provisioning to adaptive tool ecosystem management:
Core Capabilities:
- Keyword pre-filtering - Efficiently match tasks with relevant tools
- Automated tool generation - Create MCP-compliant tools through intent analysis
- Tool registry - Comprehensive registry with persistence and versioning
- Lifecycle tracking - Monitor tool usage and performance
Workflow:
- Intent Analysis - Parse user task intentions and extract functional requirements
- Tool Synthesis - Generate executable MCP-compliant tool implementations
- Validation - Multi-stage evaluation for correctness and compatibility
- Registration - Register validated tools with metadata and usage examples
Quick Reference
Agent Selection Guidelines
# Use Planning Agent for:
- Complex multi-step tasks requiring coordination
- Task decomposition and delegation strategies
- Overall workflow orchestration
# Use Deep Researcher for:
- Literature reviews and information synthesis
- Multi-source research requiring quality assessment
- Knowledge base construction and fact verification
# Use Browser Use Agent for:
- Web scraping and data collection
- Automated form filling and navigation
- E-commerce automation and monitoring
# Use Deep Analyzer for:
- Document analysis and insight extraction
- Pattern recognition in large datasets
- Requirements analysis from specifications
# Use MCP Manager for:
- Creating new tools for specific tasks
- Tool discovery and reuse optimization
- Managing evolving tool ecosystems
Benchmark Performance Context
GAIA Benchmark Results:
Test Accuracy: 83.39%
Validation Accuracy: 82.4%
Comparison Benchmarks:
- SimpleQA: Evaluation on simple question-answering
- HLE: Human-level evaluation for complex reasoning
- Real-world tasks: Web search and reasoning combinations
Framework Integration Patterns
# Hierarchical Task Flow:
Planning Agent → Task Decomposition → Specialized Agent Selection → Execution → Result Synthesis
# Tool Evolution Workflow:
User Request → MCP Manager Analysis → Tool Creation/Discovery → Validation → Registration → Reuse
# Multi-Agent Coordination:
Complex Query → Planning Agent → Multiple Specialized Agents → Result Aggregation → Final Response
Reference Files
This skill includes comprehensive documentation in references/:
- architecture.md - Complete framework architecture, agent descriptions, MCP Manager details, benchmark results, and key principles for hierarchical multi-agent coordination
Use view references/architecture.md to access detailed technical documentation about the framework components and implementation strategies.
Working with This Skill
For Beginners
- Start with: Understanding the hierarchical architecture and six agent types
- Key concepts: Planning Agent as coordinator, specialized agents for specific tasks
- First steps: Learn when to use each agent type for different problem categories
For Framework Developers
- Focus on: MCP Manager Agent for dynamic tool creation and evolution
- Architecture patterns: Hierarchical coordination and TEA Protocol implementation
- Integration: Multi-agent workflow design and specialized agent coordination
For Researchers & Evaluators
- Benchmark context: GAIA results (83.39% accuracy) and comparison methodologies
- Evaluation approaches: SimpleQA, HLE, and real-world task assessment
- Performance analysis: Multi-agent system effectiveness measurement
For Implementation Teams
- Agent selection: Choose appropriate specialized agents for specific use cases
- Tool management: Leverage MCP Manager for automated tool creation and discovery
- Workflow design: Plan hierarchical task decomposition strategies
Advanced Features
MCP Tool Evolution
- Automated Creation: Generate MCP-compliant tools through intent analysis
- Dynamic Retrieval: Intelligent matching of tasks with existing tools
- Systematic Reuse: Comprehensive tool registry with versioning and lifecycle tracking
Multi-Model Support
- OpenAI: GPT-4.1, O3 series integration
- Anthropic: Claude 3.7 Sonnet support
- Google: Gemini 2.5 Pro for analysis tasks
- Local Models: vLLM integration with Qwen 2.5 series
Integration Capabilities
- Web Search: Google, Bing, DuckDuckGo, Baidu, Firecrawl
- Document Processing: PDF, DOCX, PPTX, Excel support
- Media Generation: Imagen (images) and Veo3 (video) integration
- Browser Automation: Playwright/Patchright for complex web interactions
Resources
Research & Documentation
- ArXiv Paper: "AgentOrchestra: A Hierarchical Multi-Agent Framework for General-Purpose Task Solving"
- GitHub Repository: Complete implementation and examples
- Website: https://skyworkai.github.io/DeepResearchAgent/
Academic Context
- Institution: Nanyang Technological University
- Domain: General-purpose task solving with LLM-based multi-agent systems
- Benchmark Suite: GAIA, SimpleQA, HLE evaluations
Notes
- AgentOrchestra represents a shift from simple dialogue to sophisticated multi-step reasoning
- The framework addresses LLM limitations in real-world environment interaction
- MCP Manager Agent enables adaptive tool ecosystem management vs. static tool provisioning
- Hierarchical design inspired by musical conductor orchestration principles
- State-of-the-art performance on general-purpose task solving benchmarks
Updating
To refresh this skill with updated documentation:
- Re-run the scraper with the same configuration
- The skill will be rebuilt with the latest information from the project website
- Monitor GitHub repository for implementation updates and new examples