name	deployment-manager
description	Automated deployment orchestration with rollback, blue-green, and canary deployment strategies
allowed-tools	Read, Write, Bash, Grep, Glob
version	1.0.0
author	GLINCKER Team
license	Apache-2.0
keywords	deployment, automation, rollback, blue-green, canary, orchestration

Deployment Manager

Advanced deployment orchestration agent for automated, safe, and reliable application deployments. Supports multiple deployment strategies, automatic rollback, and production-ready workflows.

Agent Expertise

Zero-downtime deployments
Blue-green deployment strategy
Canary releases with progressive rollout
Automated rollback on failure
Multi-environment deployment (dev, staging, prod)
Database migration coordination
Health checks and validation
Deployment pipeline orchestration

Key Capabilities

Deployment Strategies: Blue-green, canary, rolling updates, recreate
Automated Rollback: Automatic rollback on health check failures
Database Migrations: Safe schema changes with rollback support
Health Monitoring: Pre and post-deployment validation
Traffic Management: Progressive traffic shifting for canary deployments
Multi-Cloud Support: AWS, GCP, Azure, and on-premise

Workflow

When activated, this agent will:

Analyze application architecture and deployment requirements
Select appropriate deployment strategy
Run pre-deployment health checks
Execute deployment with safety measures
Monitor application health during rollout
Automatically rollback if issues detected
Notify stakeholders of deployment status

Quick Commands

# Deploy to production
"Deploy the application to production using blue-green strategy"

# Canary release
"Deploy new version to 10% of users, then gradually increase"

# Deploy with migration
"Deploy application and run database migrations safely"

# Rollback
"Rollback the last production deployment"

# Setup deployment pipeline
"Create automated deployment pipeline for this application"

# Multi-environment deploy
"Deploy to staging, wait for approval, then deploy to production"

Deployment Strategies

Blue-Green Deployment

Zero-downtime deployment:

Deploy new version (Green) alongside current (Blue)
Run health checks on Green environment
Switch traffic from Blue to Green
Keep Blue as instant rollback option

Best for: Production deployments requiring zero downtime

Canary Deployment

Progressive rollout:

Deploy to small subset of users (5-10%)
Monitor metrics and errors
Gradually increase traffic (25% → 50% → 100%)
Rollback if error rate increases

Best for: High-risk changes, new features, performance optimizations

Rolling Update

Gradual instance replacement:

Update instances one at a time
Wait for health check before next instance
Maintain minimum available capacity
Automatic rollback if health checks fail

Best for: Kubernetes deployments, containerized applications

Recreate

Simple stop-and-start:

Stop current version
Deploy new version
Start new version

Best for: Development environments, non-critical applications

Features

Automated Health Checks

Pre-deployment validation:

Check dependencies (database, external services)
Verify resource availability (CPU, memory, disk)
Validate configuration

Post-deployment validation:

Application startup success
API endpoint responsiveness
Error rate monitoring
Performance metrics comparison

Database Migration Management

Safe schema changes:

# Coordinated migration with deployment
1. Run backward-compatible migrations
2. Deploy new application code
3. Run cleanup migrations
4. Automatic rollback if any step fails

Migration strategies:

Expand-contract pattern
Feature flags for breaking changes
Data backfill coordination
Zero-downtime migrations

Automatic Rollback

Triggers for automatic rollback:

Health check failures (3+ consecutive)
Error rate spike (> 5% increase)
Response time degradation (> 50% slower)
Custom metric thresholds

Rollback process:

Detect failure condition
Stop new deployment
Redirect traffic to previous version
Notify team with failure details
Preserve logs for investigation

Traffic Management

Progressive traffic shifting:

# Canary deployment schedule
- 10% for 10 minutes
- 25% for 10 minutes
- 50% for 15 minutes
- 100% if all metrics healthy

Load balancing integration:

AWS ALB weighted targets
Nginx traffic splitting
Istio traffic management
HAProxy backend weighting

Platform Support

Cloud Platforms

AWS: ECS, EKS, Elastic Beanstalk, Lambda
Google Cloud: GKE, Cloud Run, App Engine
Azure: AKS, App Service, Container Instances
DigitalOcean: App Platform, Kubernetes

Container Orchestration

Kubernetes: Deployments, StatefulSets, DaemonSets
Docker Swarm: Service updates, rolling updates
Nomad: Job deployments, canary releases

CI/CD Integration

GitHub Actions: Deployment workflows
GitLab CI/CD: Deploy jobs with environments
Jenkins: Deployment pipelines
CircleCI: Deployment orchestration
ArgoCD: GitOps deployments

Best Practices

Always Test First: Deploy to staging before production
Use Feature Flags: Decouple deployment from release
Monitor Actively: Watch metrics during deployment
Keep Rollback Ready: Maintain quick rollback capability
Automate Everything: Reduce human error with automation
Communicate Status: Keep stakeholders informed
Document Runbooks: Prepare for common issues
Practice Deployments: Regular deployment drills

Example Workflows

Production Deployment with Approval

workflow:
  - stage: deploy-staging
    environment: staging
    on_success: request-approval

  - stage: await-approval
    approvers: [tech-lead, product-owner]
    timeout: 24h

  - stage: deploy-production
    environment: production
    strategy: blue-green
    health_checks:
      - endpoint: /health
      - error_rate: < 1%
      - response_time: < 500ms
    auto_rollback: true

Canary Release

deployment:
  strategy: canary
  stages:
    - traffic: 10%
      duration: 10m
      success_criteria:
        error_rate: < 2%
        p95_latency: < 1000ms

    - traffic: 50%
      duration: 15m
      success_criteria:
        error_rate: < 1%
        p95_latency: < 800ms

    - traffic: 100%
      success_criteria:
        error_rate: < 0.5%
        p95_latency: < 500ms

  rollback:
    automatic: true
    on_failure: true

Database Migration with Deployment

pipeline:
  - step: backup-database
    required: true

  - step: run-migrations
    backward_compatible: true

  - step: deploy-application
    strategy: rolling-update
    health_check: /health

  - step: cleanup-migrations
    on_success: true

  rollback_plan:
    - revert-deployment
    - rollback-migrations
    - restore-backup (if needed)

Monitoring & Observability

Metrics to monitor:

Response time (p50, p95, p99)
Error rate (4xx, 5xx)
Request throughput
CPU and memory usage
Database connection pool
External service latency

Alerting integration:

PagerDuty for critical failures
Slack for deployment notifications
Email for approval workflows
Webhooks for custom integrations

Common Use Cases

Zero-Downtime Production Deploy

"Deploy the new version to production with zero downtime using blue-green strategy"

Gradual Feature Rollout

"Deploy the new feature to 10% of users and gradually increase if metrics look good"

Emergency Rollback

"Rollback the production deployment that went out 30 minutes ago"

Multi-Region Deployment

"Deploy to us-east-1, validate, then roll out to all other regions"

Deployment with Database Changes

"Deploy the application with database schema changes using expand-contract pattern"

Safety Features

Dry-run mode: Preview changes without applying
Deployment locks: Prevent concurrent deployments
Approval workflows: Require manual approval for production
Rate limiting: Prevent deployment storms
Circuit breakers: Stop deployment if too many failures
Audit logging: Track all deployment activities

Author

GLINCKER Team

deployment-manager

Install Skill

SKILL.md