name	mcp-code-execution
description	Auto-activates when user mentions:

mcp-code-execution

Quick Reference

When to Use

✅ Use code execution when:

3+ MCP tool calls needed
Privacy concerns (PII in workflow data)
Performance critical (need parallel execution)
Recurring workflow (runs multiple times)

❌ Traditional MCP when:

1-2 simple tool calls
One-off exploratory task
Workflow changes frequently

Token Savings

Workflow	Traditional	Code Execution	Savings
Web scraping	85K	3K	96%
UI testing	120K	4K	97%
Monitoring	65K	2.5K	96%
Data extraction	95K	3.5K	96%

Task Decomposition Override (v5.4.0)

When this skill applies (3+ MCP tool calls detected), DO NOT use your default task decomposition.

❌ PROHIBITED SEQUENCE (Traditional MCP - wastes 70K-100K tokens):

Load all MCP tool definitions into context
Make sequential MCP tool calls
Store all results in context
Process results in conversation

✅ MANDATORY SEQUENCE (Code Execution Pattern):

Phase 1: Workflow Analysis (Make 3 explicit decisions)

Workflow Pattern Identification: Classify the MCP workflow type
- Reference: Skill "The Solution" section
- Output: Pattern classification with tool count (e.g., "Parallel: 5 playwright calls for multi-page scraping, estimated 85K → 3K tokens (96% savings)")
Script Structure Design: Plan TypeScript script organization
- Reference: Skill "The Solution" section
- Output: Script approach (e.g., "Single script with async/await parallel execution, error handling, JSON output")
Data Privacy Strategy: Define what data stays local vs returned
- Reference: Skill "The Problem" section (PII considerations)
- Output: Privacy specification (e.g., "Process all PII locally, return aggregated metrics only, no raw data in context")

Output Acknowledgment After Phase 1:

MCP Code Execution Applied:
- Workflow Pattern: [Type + tool count + estimated savings]
- Script Structure: [Organization approach]
- Data Privacy: [What stays local vs returned]

Phase 2: Implementation (Apply Phase 1 decisions) 4. Create TypeScript script with MCP wrapper calls 5. Implement error handling and logging 6. Execute script locally (data stays local) 7. Return summary/aggregated results only

Phase 3: Validation (Verify quality criteria) 8. Verify token savings: 96-98% reduction achieved (2K-5K vs 70K-100K) 9. Verify privacy: No PII or sensitive data in context 10. Verify correctness: Output matches expected workflow results 11. Verify reusability: Script can be reused for similar workflows

IF you use ❌ sequence instead of ✅ sequence = ARCHITECTURE VIOLATION

Rationale: Traditional MCP for 3+ calls wastes 70K-100K tokens. Code execution pattern achieves same result with 2K-5K tokens (96-98% reduction). The 3-phase approach GUARANTEES: (1) optimal workflow pattern selection, (2) privacy-preserving implementation, (3) measurable token savings verification. This is MANDATORY for cost optimization.

Language Standards (v5.4.0)

YOU MUST use directive language throughout this skill:

Required Directives:

✅ "YOU MUST use", "DO NOT use", "ALWAYS", "NEVER", "MANDATORY", "PROHIBITED", "REQUIRED"
❌ Never: "should", "consider", "might", "could", "try to", "it's recommended", "please", "ideally"

Section Headers:

✅ "Required Standards", "Rules", "Requirements", "Anti-Patterns to Avoid"
❌ "Best Practices", "Guidelines", "Recommendations", "Suggestions"

Examples of Directive Transformation:

❌ "Consider using X" → ✅ "YOU MUST use X"
❌ "You should avoid Y" → ✅ "DO NOT use Y (PROHIBITED)"
❌ "It's recommended to Z" → ✅ "MANDATORY: Z"
❌ "Try to follow pattern P" → ✅ "ALWAYS follow pattern P"

Enforcement Note: Skills with weak language will be rejected by pre-tool-use-write.ts hook.

The Problem

Traditional MCP approach:

All tool definitions loaded upfront → 50K-100K tokens
Every result passes through context → 20K-50K tokens
Sequential execution → slow
PII data exposed in agent context → privacy risk

Example (Web Scraping):

Traditional: 85K tokens, $0.26, 35 seconds
├─ playwright_navigate → 5K tokens (page in context)
├─ playwright_waitForSelector → 2K tokens
├─ playwright_getText → 3K tokens (all text in context)
└─ playwright_screenshot → 8K tokens (base64 image)

The Solution

Code execution approach:

Minimal wrappers (2-3 tools) → 2K tokens
Local scripts process data → no PII in context
Parallel execution → 3-10× faster
Reusable workflows → write once, use forever

Example (Web Scraping):

Code Execution: 3K tokens, $0.009, 8 seconds
└─ playwright_execute('./scripts/scrape-page.ts', config)
   Returns: "✅ Scraped 47 items" (no PII in context)

Implementation Pattern

1. Minimal MCP Wrappers

Create ./servers/[mcp-name]/ with only essential tools:

// ./servers/playwright/execute.ts
export async function execute(scriptPath: string, args: any) {
  // Run local script, return summary only
}

// ./servers/playwright/screenshot.ts
export async function screenshot(url: string) {
  // Capture screenshot for verification
}

Why minimal? Everything else becomes local scripts (privacy, reuse, speed).

2. Local Workflow Scripts

Create ./scripts/[mcp-name]/[workflow].ts:

// ./scripts/playwright/scrape-page.ts
// - Accepts CLI args (--url, --selector, --output)
// - Processes data locally
// - Saves to ./data/ (never in context)
// - Returns 1-line summary

3. Usage

// ❌ Traditional (85K tokens)
await playwright_navigate(url);
await playwright_getText('.price');
await playwright_screenshot();

// ✅ Code Execution (3K tokens)
await playwright_execute('./scripts/scrape-page.ts', {
  url, selector: '.price', output: './data/prices.json'
});

Resources (Progressive Loading)

Setup: Load @resources/mcp-setup-guide.md for:

MCP server discovery scripts
TypeScript wrapper generation
Directory structure creation

Patterns: Load @resources/mcp-patterns.md for:

10 composition patterns (Sequential, Parallel, Conditional, etc.)
Token savings per pattern
Copy-paste templates

Examples: Load @resources/mcp-examples.md for:

5 complete workflows (scraping, testing, monitoring)
Before/after token comparisons
Production-ready code

Integration: Load @resources/mcp-integration-guide.md for:

skill-rules.json configuration
Hook setup
Testing procedures

Key Benefits

1. Token Efficiency (96-98% reduction)

Traditional: 70K-150K tokens
Code execution: 2K-5K tokens
Savings: $0.25 → $0.01 per run

2. Privacy Preservation

Problem: Customer data, PII in agent context
Solution: Process locally, only summaries to context
Impact: GDPR/CCPA compliant

3. Performance (3-10× faster)

Traditional: Sequential tool calls (navigate → wait → getText...)
Code execution: Parallel script execution
Impact: 45s → 8s typical workflow

4. Reusability

Traditional: Re-explain workflow each time
Code execution: Import scripts, compose workflows
Impact: Write once, use 100+ times

Security (CRITICAL)

⚠️ Required before implementing:

Sandboxing - Docker/VM/serverless environment
Resource limits - Memory (512MB), CPU (50%), timeout (30s)
Audit logging - All script executions logged
Code review - Security review before execution

const sandboxConfig = {
  maxMemory: '512MB',
  maxCPU: '50%',
  timeout: 30000,
  allowedDomains: ['target-sites.com'],
  blockPrivateIPs: true,
  allowedPaths: ['./scripts/', './data/'],
  readOnly: ['./scripts/']
};

Trade-offs

Aspect	Traditional	Code Execution
Setup	Simple	Complex (one-time)
Learning	Low	Medium
Tokens	High (70K-150K)	Low (2K-5K)
Privacy	Poor	Excellent
Reuse	None	High

Decision:

One-off task → Traditional
Recurring + 3+ tools → Code execution
PII data → Code execution (mandatory)

Quick Start

Today (30 min): Create minimal wrappers in ./servers/playwright/
This week (2 hours): Implement first script (scrape-page.ts)
This month: Convert 3-5 workflows, measure savings

See: /load mcp-patterns for implementation templates

mcp-code-execution

Install Skill

SKILL.md

mcp-code-execution

Quick Reference

When to Use

Token Savings

Task Decomposition Override (v5.4.0)

❌ PROHIBITED SEQUENCE (Traditional MCP - wastes 70K-100K tokens):

✅ MANDATORY SEQUENCE (Code Execution Pattern):

Language Standards (v5.4.0)

The Problem

The Solution

Implementation Pattern

1. Minimal MCP Wrappers

2. Local Workflow Scripts

3. Usage

Resources (Progressive Loading)

Key Benefits

1. Token Efficiency (96-98% reduction)

2. Privacy Preservation

3. Performance (3-10× faster)

4. Reusability

Security (CRITICAL)

Trade-offs

Quick Start