name	pattern-miner
description	Mine historical data for patterns and predictive insights
category	analytics
execution_time	15-20s

Pattern Miner Skill

Overview

The pattern-miner Skill analyzes historical project data to identify recurring patterns, predict future issues, and provide data-driven recommendations for estimation and planning.

Purpose: Learn from past runs to make better predictions and avoid repeating mistakes.

When to Use

After creating task groups - Adjust estimates based on historical patterns
Before major features - Predict effort based on similar past features
Weekly retrospectives - Identify systemic issues
Quarterly planning - Understand team's actual velocity patterns

What It Analyzes

Source	Patterns Extracted
historical_metrics.json	Velocity patterns, cycle time trends, revision rates by task type
tech_debt.json	Recurring issues, problem modules, common failure modes
pm_state.json (past runs)	Task type durations, parallelization effectiveness
coordination logs	Developer efficiency patterns, common bottlenecks

Pattern Types Detected

1. Task Duration Patterns

"Database tasks always take 2.5x initial estimate" (85% confidence)
"Authentication features require 3 revisions on average"
"API integrations complete faster than expected (0.7x estimate)"

2. Module Risk Patterns

"Payment module has 80% revision rate (high risk)"
"Auth module always requires security review"
"Frontend changes rarely need QA iterations"

3. Team Velocity Patterns

"Team velocity increases 20% after first week of project"
"Parallel mode with 3+ developers shows diminishing returns"
"Friday deployments have 2x higher rollback rate"

4. Quality Patterns

"Coverage drops correlate with rushed features (r=0.85)"
"Security issues cluster in user input handling"
"Lint violations spike during feature freeze"

Output Format

{
  "timestamp": "2024-11-08T12:00:00Z",
  "total_runs_analyzed": 25,
  "patterns_detected": [
    {
      "pattern_id": "db_task_overrun",
      "pattern": "database_tasks_overrun",
      "category": "estimation",
      "confidence": 0.85,
      "occurrences": 12,
      "description": "Database-related tasks take 2.5x longer than estimated",
      "evidence": {
        "avg_estimate": 5.0,
        "avg_actual": 12.5,
        "variance": 2.3
      },
      "recommendation": "Multiply database task estimates by 2.5x",
      "impact": "high"
    },
    {
      "pattern_id": "auth_security_review",
      "pattern": "auth_requires_security_review",
      "category": "process",
      "confidence": 1.0,
      "occurrences": 8,
      "description": "Authentication tasks always require security review (100%)",
      "recommendation": "Plan for security review in auth task timeline",
      "impact": "medium"
    }
  ],
  "lessons_learned": [
    "Payment processing features have 80% revision rate - break into smaller tasks",
    "Parallel mode with >3 developers shows coordination overhead (diminishing returns)",
    "Integration tests catch 90% of bugs that slip past unit tests"
  ],
  "predictions_for_current_project": [
    {
      "task_group": "C",
      "prediction": "Group C (payment processing) likely needs +30% time based on pattern 'payment_module_revisions'",
      "confidence": 0.78,
      "recommendation": "Add buffer to Group C estimate"
    }
  ],
  "estimation_adjustments": {
    "database_tasks": {
      "multiplier": 2.5,
      "confidence": 0.85
    },
    "authentication": {
      "multiplier": 1.5,
      "confidence": 0.90,
      "note": "Include security review time"
    },
    "api_integration": {
      "multiplier": 0.7,
      "confidence": 0.75,
      "note": "Usually faster than expected"
    }
  },
  "risk_indicators": [
    {
      "indicator": "revision_count > 3",
      "probability": 0.85,
      "outcome": "Requires tech lead escalation"
    },
    {
      "indicator": "story_points > 8",
      "probability": 0.70,
      "outcome": "Should be split into smaller tasks"
    }
  ]
}

Pattern Confidence Levels

Confidence	Meaning	Action
0.90-1.00	Very High	Apply automatically
0.75-0.89	High	Apply with PM review
0.60-0.74	Medium	Suggest as option
0.00-0.59	Low	Informational only

Pattern Categories

Estimation - Task duration patterns
Process - Workflow patterns (e.g., always needs review)
Quality - Defect and revision patterns
Risk - Failure mode patterns
Team - Velocity and efficiency patterns

Usage Example

In PM agent:

# After creating task groups, refine estimates with patterns
/pattern-miner

# Read predictions
cat coordination/pattern_insights.json

# Apply adjustments
PATTERN_MULTIPLIER=$(jq -r '.estimation_adjustments.database_tasks.multiplier' coordination/pattern_insights.json)

if [ "$PATTERN_MULTIPLIER" != "null" ]; then
    # Adjust database task estimate
    NEW_ESTIMATE=$(echo "$ORIGINAL_ESTIMATE * $PATTERN_MULTIPLIER" | bc)
    echo "Adjusted database task estimate: $ORIGINAL_ESTIMATE → $NEW_ESTIMATE (${PATTERN_MULTIPLIER}x multiplier)"
fi

# Check predictions for current groups
jq -r '.predictions_for_current_project[] | "⚠️  \(.prediction)"' coordination/pattern_insights.json

Pattern Detection Algorithm

For each task type:
1. Group historical tasks by type (database, auth, API, etc.)
2. Calculate avg_actual_time / avg_estimated_time ratio
3. If ratio consistently >1.2 or <0.8 across ≥5 occurrences:
   → Pattern detected
4. Calculate confidence based on:
   - Number of occurrences (more = higher confidence)
   - Variance (lower = higher confidence)
   - Recency (recent data weighted more)

Benefits

✅ Data-driven estimates - Stop guessing, use historical data
✅ Avoid recurring mistakes - Learn from past patterns
✅ Proactive risk management - Predict issues before they happen
✅ Continuous improvement - Estimation accuracy improves over time
✅ Team-specific insights - Patterns unique to your team's velocity

Performance

Execution time: 15-20 seconds
Dependencies: jq (graceful fallback if not available)
Minimum data: 5 historical runs for meaningful patterns
Output: coordination/pattern_insights.json

Platform Support

✅ Linux/macOS (bash)
✅ Windows (PowerShell)

Related Skills

velocity-tracker - Provides historical metrics input
quality-dashboard - Uses pattern insights for predictions

Example Patterns

Real-world patterns detected:

Pattern: database_migrations_underestimated
Confidence: 0.88
Occurrences: 15/17 database tasks
Recommendation: Use 2.5x multiplier for DB tasks

Pattern: friday_deployments_risky
Confidence: 0.92
Occurrences: 11/12 Friday deploys had issues
Recommendation: Avoid Friday deployments

Pattern: parallel_mode_diminishing_returns
Confidence: 0.81
Occurrences: 8/10 runs with >3 developers
Recommendation: Cap parallelism at 3 developers

pattern-miner

Install Skill

SKILL.md