| name | root-cause-analysis |
| description | Conducts systematic root cause analysis using proven methodologies including Toyota's 5 Whys, Ishikawa fishbone diagrams, Pareto analysis, and fault tree analysis. Use when investigating bugs, debugging code, troubleshooting systems, diagnosing equipment failures, analyzing life problems, or identifying underlying causes of any issue across software engineering, hardware maintenance, process failures, or personal challenges. |
| version | 2 |
Root Cause Analysis Orchestration Skill
This skill helps you systematically identify the root cause of any problem using proven methodologies from the Toyota Production System and other industry-standard techniques.
Quick Reference: When to Load Which Resource
| Your Problem Type | Load Resource | Why |
|---|---|---|
| Need to understand 5 Whys, Fishbone, Pareto, Fault Tree methodology | resources/rca-methodologies.md |
Learn each method step-by-step with examples |
| Looking for common root causes in your domain | resources/common-root-causes.md |
Pattern match against known causes: software, hardware, process, personal |
| Want to see complete worked examples | resources/example-analyses.md |
Study real cases: software bugs, vehicle maintenance, system failures, personal problems |
| Advanced: need barrier analysis, complex cause mapping | resources/advanced-techniques.md |
Formal methods: Fault Tree, Barrier Analysis, multi-methodology chains |
Core Principle
Do not treat symptoms—find and fix the root cause. As Taiichi Ohno, architect of the Toyota Production System, said: "By repeating why five times, the nature of the problem as well as its solution becomes clear."
Orchestration Protocol
Phase 1: Problem Classification
Quickly identify your problem domain and complexity:
Problem Domain:
- Software: Code bugs, system failures, performance, deployment
- Hardware: Equipment, mechanical, electrical, maintenance
- Process: Workflow, procedures, organizational, communication
- Personal: Life challenges, productivity, habits, wellbeing
Complexity Level:
- Simple: Clear failure chain, 1-2 likely causes → Use 5 Whys
- Complex: Multiple possible causes, unknown scope → Start with Fishbone
- Critical/Safety: High stakes, needs rigor → Use Fault Tree
- Multiple Issues: Many competing problems → Use Pareto first
Action: Load appropriate resource file(s) based on classification.
Phase 2: Methodology Selection
Based on problem type, select your approach:
| Situation | Recommended | Load |
|---|---|---|
| Single clear failure | 5 Whys | methodologies.md |
| Complex/multiple possibilities | Fishbone → 5 Whys | methodologies.md |
| Competing priorities | Pareto → 5 Whys | methodologies.md |
| Safety/high-stakes | Fault Tree | advanced-techniques.md |
| Process breakdown | Barrier Analysis | advanced-techniques.md |
| Pattern matching | Common causes + 5 Whys | common-root-causes.md |
Phase 3: Execution & Verification
During Analysis:
- Define problem clearly (What/Where/When/Impact)
- Gather evidence systematically
- Apply selected methodology
- Document reasoning at each step
- Verify root cause with Forward/Backward tests
Before Finalizing:
- Validate conclusion against evidence
- Check for red flags (see common-root-causes.md)
- Confirm actionability (can you fix this?)
- Develop solutions addressing root cause
Problem Definition Framework
Create a clear problem statement before analysis:
Essential Elements:
- What: Observable symptom (not assumed cause)
- Where: Location/system/component affected
- When: Timeline, frequency, pattern
- Impact: Users/systems affected, severity
Example: "Users in EU region experience 3-5 second dashboard load delays during 9-11 AM UTC peak hours, affecting ~2,000 daily active users. Started after v2.4 deployment on Nov 18th."
Evidence Gathering (Go and See)
Follow Toyota's principle—collect facts, not opinions:
Key Evidence Sources:
- Logs, metrics, monitoring data
- Timeline of events and changes
- System/code/configuration changes before problem
- Environmental factors (load, traffic, season)
- User reports and reproduction steps
- System state before/during/after
RCA Methodologies
See resources/rca-methodologies.md for complete methodology guide.
Resource Files Summary
resources/rca-methodologies.md
Comprehensive methodology guide covering:
- 5 Whys: Step-by-step process with software examples
- Fishbone Diagram: Structure, 6 M's categories, process
- Pareto Analysis: Prioritization using 80/20 rule
- Fault Tree Analysis: Top-down formal analysis
- Barrier Analysis: Control failure examination
- Structured 6-phase RCA process, domain-specific guidance, templates
resources/common-root-causes.md
Pattern reference catalog by domain:
- Software Engineering: Code defects, configuration, dependencies, deployment
- Hardware & Equipment: Mechanical, electrical, operational, maintenance
- Process & Operations: Workflow, design, resources
- Personal/Life: Health, habits, environment, skills
- Red flags, recurring themes, pattern recognition
resources/example-analyses.md
Four worked examples with full analysis:
- Software Bug: JWT authentication (5 Whys)
- Vehicle Maintenance: Overheating (5 Whys)
- System Failure: E-commerce checkout (Fishbone + 5 Whys)
- Personal Productivity: Missed deadlines (Fishbone + 5 Whys)
resources/advanced-techniques.md
Formal methods for complex problems:
- Fault Tree Analysis: Boolean logic, safety systems
- Barrier Analysis: Control failures
- Multi-Methodology Chains: Complex orchestration
- Verification Frameworks: Comprehensive testing
How This Skill Works
- Clarify your situation: Domain, observations, evidence, time
- Recommend approach: Complexity analysis, methodology, resources
- Guide through analysis: Problem statement, evidence, methodology, verification
- Deliver output: Analysis, root cause, solutions, implementation
Quick Start: 5-Minute RCA
- State problem (What/Where/When/Impact)
- First Why: fact-based answer
- Second Why: dig deeper
- Third Why: dig deeper again
- Verify: would fixing this prevent it?
Templates & Examples
- 5 Whys Template in
resources/rca-methodologies.md - Fishbone Template in
resources/rca-methodologies.md - Worked Examples in
resources/example-analyses.md - Solution Structures in
resources/example-analyses.md
Next Steps
- Identify problem domain (software/hardware/process/personal)
- Load appropriate resource from table above
- Select methodology based on complexity
- Follow step-by-step process in resource
- Verify root cause (Forward/Backward tests)
- Develop actionable solutions
Remember: Goal is systematic investigation—disciplined questioning until you reach a cause you can actually fix.