| name | testing-strategy-builder |
| description | Use this skill when creating comprehensive testing strategies for applications. Provides test planning templates, coverage targets, test case structures, and guidance for unit, integration, E2E, and performance testing. Ensures robust quality assurance across the development lifecycle. |
| version | 1.0.0 |
| author | AI Agent Hub |
| tags | testing, quality-assurance, test-strategy, automation, coverage |
Testing Strategy Builder
Overview
This skill provides comprehensive guidance for building effective testing strategies that ensure software quality, reliability, and maintainability. Whether starting from scratch or improving existing test coverage, this framework helps teams design robust testing approaches.
When to use this skill:
- Planning testing strategy for new projects or features
- Improving test coverage in existing codebases
- Establishing quality gates and coverage targets
- Designing test automation architecture
- Creating test plans and test cases
- Choosing appropriate testing tools and frameworks
- Implementing continuous testing in CI/CD pipelines
Bundled Resources:
- references/code-examples.md - Detailed testing code examples
- templates/test-plan-template.md - Comprehensive test plan template
- templates/test-case-template.md - Test case documentation template
- checklists/test-coverage-checklist.md - Coverage verification checklist
Testing Philosophy
The Testing Trophy 🏆
Modern testing follows the "Testing Trophy" model (evolved from the testing pyramid):
```
            🏆
           /    \
          / E2E  \          ← Few (critical user journeys)
      /--------------\
     /  Integration   \     ← Most (component interactions)
      \--------------/
         /  Unit  \         ← Many (business logic)
     /----------------\
    / Static Analysis  \    ← Foundation (linting, type checking)
```
Principles:
- Static Analysis: Catch syntax errors, type issues, and common bugs before runtime
- Unit Tests: Test individual functions and components in isolation
- Integration Tests: Test how components work together
- E2E Tests: Validate critical user workflows end-to-end
Balance: 70% integration, 20% unit, 10% E2E (adjust based on context)
Testing Strategy Framework
1. Coverage Targets
Recommended Targets:
- Overall Code Coverage: 80% minimum
- Critical Paths: 95-100% (payment, auth, data mutations)
- New Features: 100% coverage requirement
- Business Logic: 90%+ coverage
- UI Components: 70%+ coverage
Coverage Types:
- Line Coverage: Percentage of code lines executed
- Branch Coverage: Percentage of decision branches taken
- Function Coverage: Percentage of functions called
- Statement Coverage: Percentage of statements executed
Important: Coverage is a metric, not a goal. 100% coverage ≠ bug-free code.
2. Test Classification
Static Analysis
Purpose: Catch errors before runtime
Tools: ESLint, Prettier, TypeScript, Pylint, mypy, Ruff
When to run: Pre-commit hooks, CI pipeline
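One way to wire this into pre-commit hooks is a lint-staged config; the sketch below assumes a JS/TS project, a hook manager such as Husky, and glob patterns you would adapt to your repo:

```javascript
// lint-staged.config.js - run static analysis on staged files only
// (script names and globs are assumptions; adjust to your project)
module.exports = {
  // Lint and format each staged JS/TS file
  '*.{js,jsx,ts,tsx}': ['eslint --fix', 'prettier --write'],
  // Type-check the whole project once (tsc cannot reliably check isolated files)
  '*.{ts,tsx}': () => 'tsc --noEmit',
};
```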
Unit Tests
Purpose: Test isolated business logic
Tools: Jest, Vitest, pytest, JUnit
Characteristics:
- Fast execution (< 100ms per test)
- No external dependencies (database, API, filesystem)
- Deterministic (same input = same output)
- Test single responsibility
Coverage Target: 90%+ for business logic
See references/code-examples.md for detailed unit test examples.
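As a minimal sketch of these characteristics in practice (assuming Vitest and a hypothetical pure `calculateDiscount` function):

```javascript
import { describe, expect, test } from 'vitest';
import { calculateDiscount } from './pricing'; // hypothetical pure function

describe('calculateDiscount', () => {
  // Deterministic, isolated, fast: no database, network, or filesystem
  test('applies a 10% discount to orders over 100', () => {
    expect(calculateDiscount(200)).toBe(20);
  });

  test('returns 0 at or below the threshold', () => {
    expect(calculateDiscount(100)).toBe(0);
  });
});
```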
Integration Tests
Purpose: Test component interactions
Tools: Testing Library, Supertest, pytest with fixtures
Characteristics:
- Test multiple units working together
- May use test databases or mocked external services
- Moderate execution time (< 1s per test)
- Focus on interfaces and contracts
Coverage Target: 70%+ for API endpoints and component interactions
See references/code-examples.md for API integration test examples.
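As a sketch, an API integration test with Supertest (the `app` export and the route are hypothetical):

```javascript
import { describe, it, expect } from 'vitest';
import request from 'supertest';
import { app } from '../src/app'; // hypothetical Express app export

describe('POST /api/users', () => {
  it('creates a user and returns 201 with the new record', async () => {
    const res = await request(app)
      .post('/api/users')
      .send({ email: 'test@example.com', name: 'Test User' });

    expect(res.status).toBe(201);
    expect(res.body.email).toBe('test@example.com');
  });
});
```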
End-to-End (E2E) Tests
Purpose: Validate critical user journeys
Tools: Playwright, Cypress, Selenium
Characteristics:
- Test entire application flow (frontend + backend + database)
- Slow execution (5-30s per test)
- Run against production-like environment
- Focus on business-critical paths
Coverage Target: 5-10 critical user journeys
See references/code-examples.md for complete E2E test examples.
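A sketch of one such journey in Playwright (routes and labels are assumptions; `baseURL` is expected to be set in the Playwright config):

```javascript
import { test, expect } from '@playwright/test';

test('user can sign in and reach the dashboard', async ({ page }) => {
  await page.goto('/login'); // resolved against baseURL from the config
  await page.getByLabel('Email').fill('test@example.com');
  await page.getByLabel('Password').fill('secret-password');
  await page.getByRole('button', { name: 'Sign in' }).click();

  // Assert on the user-visible outcome, not implementation details
  await expect(page).toHaveURL(/\/dashboard/);
  await expect(page.getByRole('heading', { name: 'Dashboard' })).toBeVisible();
});
```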
Performance Tests
Purpose: Validate system performance under load
Tools: k6, Artillery, JMeter, Locust
Types:
- Load Testing: System behavior under expected load
- Stress Testing: Breaking point identification
- Spike Testing: Sudden traffic surge handling
- Soak Testing: Sustained load over time (memory leaks)
Coverage Target: Test all performance-critical endpoints
See references/code-examples.md for k6 load test examples.
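A sketch of a basic k6 load test (the endpoint, stage durations, and thresholds are illustrative values, not recommendations):

```javascript
import http from 'k6/http';
import { check, sleep } from 'k6';

export const options = {
  stages: [
    { duration: '1m', target: 50 }, // ramp up to 50 virtual users
    { duration: '3m', target: 50 }, // hold at expected load
    { duration: '1m', target: 0 },  // ramp down
  ],
  thresholds: {
    http_req_duration: ['p(95)<500'], // 95% of requests under 500ms
    http_req_failed: ['rate<0.01'],   // less than 1% errors
  },
};

export default function () {
  const res = http.get('https://staging.example.com/api/health'); // hypothetical endpoint
  check(res, { 'status is 200': (r) => r.status === 200 });
  sleep(1);
}
```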
Test Planning
1. Risk-Based Testing
Prioritize testing based on risk assessment:
High Risk (100% coverage required):
- Payment processing
- Authentication and authorization
- Data mutations (create, update, delete)
- Security-critical operations
- Compliance-related features
Medium Risk (80% coverage):
- Business logic
- Data transformations
- API integrations
- Email/notification systems
Low Risk (50% coverage):
- UI styling
- Static content
- Read-only operations
- Non-critical features
2. Test Case Design
Given-When-Then Pattern:
```
Given [initial context]
When  [action occurs]
Then  [expected outcome]
```
This pattern keeps tests clear and focused. See references/code-examples.md for implementation examples.
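For instance, here is a sketch of the pattern translated directly into a test (the `createUser` and `login` helpers are hypothetical):

```javascript
import { test, expect } from 'vitest';
import { createUser, login } from './helpers'; // hypothetical test helpers

test('login returns a session token for valid credentials', async () => {
  // Given: a registered user exists
  const user = await createUser({ email: 'test@example.com', password: 'pw-123' });

  // When: they log in with valid credentials
  const session = await login(user.email, 'pw-123');

  // Then: they receive a session token
  expect(session.token).toBeDefined();
});
```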
3. Test Data Management
Strategies:
- Fixtures: Pre-defined test data in JSON/YAML files
- Factories: Generate test data programmatically
- Seeders: Populate test database with known data
- Faker Libraries: Generate realistic random data
See references/code-examples.md for test factory and fixture examples.
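As an illustration, a factory built on a faker library (a sketch assuming @faker-js/faker; the user shape is hypothetical):

```javascript
import { faker } from '@faker-js/faker';

// Factory: sensible random defaults, with overrides for what a test cares about
export function buildUser(overrides = {}) {
  return {
    id: faker.string.uuid(),
    email: faker.internet.email(),
    name: faker.person.fullName(),
    role: 'member',
    ...overrides,
  };
}

// Usage: const admin = buildUser({ role: 'admin' });
```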
Testing Patterns and Best Practices
1. AAA Pattern (Arrange-Act-Assert)
Structure tests in three clear phases:
- Arrange: Set up test data and context
- Act: Perform the action being tested
- Assert: Verify expected outcomes
See references/code-examples.md for detailed AAA pattern examples.
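A compact sketch of the pattern (the `ShoppingCart` class is hypothetical):

```javascript
import { test, expect } from 'vitest';
import { ShoppingCart } from './cart'; // hypothetical class under test

test('adds an item to the cart', () => {
  // Arrange: set up the object under test and its inputs
  const cart = new ShoppingCart();
  const item = { sku: 'ABC-1', price: 9.99 };

  // Act: perform the single action being tested
  cart.add(item);

  // Assert: verify the observable outcome
  expect(cart.items).toHaveLength(1);
  expect(cart.total).toBe(9.99);
});
```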
2. Test Isolation
Each test should be independent:
- Use fresh test database for each test
- Clean up resources after each test
- Tests don't depend on execution order
See references/code-examples.md for test isolation patterns.
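A sketch of per-test setup and teardown (the `db` handle and its helpers are hypothetical):

```javascript
import { beforeEach, afterEach } from 'vitest';
import { db } from './test-db'; // hypothetical test-database handle

beforeEach(async () => {
  await db.reset();          // hypothetical: restore a known-clean schema
  await db.seed('baseline'); // hypothetical: load known fixture data
});

afterEach(async () => {
  await db.truncateAll();    // hypothetical: wipe rows so test order never matters
});
```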
3. Mocking vs Real Dependencies
When to Mock:
- External APIs (payment gateways, third-party services)
- Slow operations (file I/O, network calls)
- Non-deterministic behavior (current time, random values)
- Hard-to-test scenarios (error conditions, edge cases)
When to Use Real Dependencies:
- Fast, deterministic operations
- Critical business logic
- Database operations (use test database)
- Internal service interactions
See references/code-examples.md for mocking examples.
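A sketch combining both guidelines: mock the external payment gateway at the module boundary, but let the real checkout logic run (all module names here are hypothetical):

```javascript
import { vi, test, expect } from 'vitest';
import { checkout } from './checkout';      // hypothetical module under test
import { charge } from './payment-gateway'; // hypothetical external client

// Replace only the external boundary with a deterministic mock
vi.mock('./payment-gateway', () => ({
  charge: vi.fn().mockResolvedValue({ status: 'succeeded' }),
}));

test('checkout charges the card and persists the order', async () => {
  const order = await checkout({ cartId: 'cart-1', card: 'tok_test' });

  expect(charge).toHaveBeenCalledOnce();
  expect(order.status).toBe('paid');
});
```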
4. Snapshot Testing
Use for: UI components, API responses, generated code
Warning: Snapshots can become brittle. Use for stable components, not rapidly changing UI.
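For example, a snapshot over a stable, generated structure rather than volatile UI (the generator is hypothetical):

```javascript
import { test, expect } from 'vitest';
import { buildNavigation } from './nav'; // hypothetical, stable config generator

test('navigation config matches the stored snapshot', () => {
  expect(buildNavigation({ role: 'admin' })).toMatchSnapshot();
});
```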
5. Parameterized Tests
Test multiple scenarios with same logic using data tables.
See references/code-examples.md for parameterized test patterns.
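A sketch using `test.each`, which both Jest and Vitest support (the validator is hypothetical):

```javascript
import { test, expect } from 'vitest';
import { isValidEmail } from './validation'; // hypothetical validator

test.each([
  ['user@example.com', true],
  ['no-at-sign.example.com', false],
  ['', false],
  ['user@sub.example.co.uk', true],
])('isValidEmail(%s) -> %s', (input, expected) => {
  expect(isValidEmail(input)).toBe(expected);
});
```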
Continuous Testing
1. CI/CD Integration
Pipeline Stages:
```yaml
# Example: GitHub Actions
name: Test Pipeline

on: [push, pull_request]

jobs:
  test:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v3

      - name: Install dependencies
        run: npm ci

      - name: Lint
        run: npm run lint

      - name: Type check
        run: npm run typecheck

      - name: Unit & Integration Tests
        run: npm test -- --coverage

      - name: Upload coverage
        uses: codecov/codecov-action@v3

      - name: E2E Tests
        run: npm run test:e2e

      - name: Performance Tests (on main branch)
        if: github.ref == 'refs/heads/main'
        run: npm run test:performance
```
2. Quality Gates
Block merges/deployments if:
- Code coverage drops below threshold (e.g., 80%)
- Any tests fail
- Linting errors exist
- Performance regression detected (> 10% slower)
- Security vulnerabilities found
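The coverage gate can be enforced in the test runner itself so a drop fails the pipeline. A sketch assuming a recent Vitest (the threshold numbers echo the example values above, not mandates):

```javascript
// vitest.config.js - fail the run (and therefore the pipeline) on low coverage
import { defineConfig } from 'vitest/config';

export default defineConfig({
  test: {
    coverage: {
      provider: 'v8',
      thresholds: {
        lines: 80,
        branches: 80,
        functions: 80,
        statements: 80,
      },
    },
  },
});
```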
3. Test Execution Strategy
On Every Commit:
- Static analysis (lint, type check)
- Unit tests
- Fast integration tests (< 5 min total)
On Pull Request:
- All tests (unit + integration + E2E)
- Coverage report
- Performance benchmarks
On Deploy to Staging:
- Full E2E suite
- Load testing
- Security scans
On Deploy to Production:
- Smoke tests (critical paths only)
- Health checks
- Canary deployments with monitoring
Testing Tools Recommendations
JavaScript/TypeScript
| Category | Tool | Use Case |
|---|---|---|
| Unit/Integration | Vitest | Fast, Vite-native, modern |
| Unit/Integration | Jest | Mature, extensive ecosystem |
| E2E | Playwright | Cross-browser, reliable, fast |
| E2E | Cypress | Developer-friendly, visual debugging |
| Component Testing | Testing Library | User-centric, framework-agnostic |
| API Testing | Supertest | HTTP assertions, Express integration |
| Performance | k6 | Load testing, scriptable |
Python
| Category | Tool | Use Case |
|---|---|---|
| Unit/Integration | pytest | Powerful, extensible, fixtures |
| API Testing | httpx + pytest | Async support, modern |
| E2E | Playwright (Python) | Browser automation |
| Performance | Locust | Load testing, Python-based |
| Mocking | unittest.mock | Standard library, reliable |
Common Testing Anti-Patterns
❌ Testing Implementation Details
```javascript
// Bad: Testing internal state
expect(component.state.isLoading).toBe(false);

// Good: Testing user-visible behavior
expect(screen.queryByText('Loading...')).not.toBeInTheDocument();
```
❌ Tests Too Coupled to Code
```javascript
// Bad: Test breaks when implementation changes
expect(userService.save).toHaveBeenCalledTimes(1);

// Good: Test behavior, not implementation
const user = await db.users.findOne({ email: 'test@example.com' });
expect(user).toBeTruthy();
```
❌ Flaky Tests
```javascript
// Bad: Non-deterministic timeout
await waitFor(() => {
  expect(screen.getByText('Success')).toBeInTheDocument();
}, { timeout: 1000 }); // Might fail on slow CI

// Good: Use explicit waits with a longer timeout
await screen.findByText('Success', {}, { timeout: 5000 });
```
❌ Giant Test Cases
```javascript
// Bad: One test does too much
test('user workflow', async () => {
  // 100 lines testing signup, login, profile update, logout...
});

// Good: Focused tests
test('user can sign up', async () => { /* ... */ });
test('user can login', async () => { /* ... */ });
test('user can update profile', async () => { /* ... */ });
```
Integration with Agents
Code Quality Reviewer
- Reviews test coverage reports
- Suggests missing test cases
- Validates test quality and structure
- Ensures tests follow patterns from this skill
Backend System Architect
- Uses test strategy templates when designing services
- Ensures APIs are testable (dependency injection, clear interfaces)
- Plans integration test architecture
Frontend UI Developer
- Applies component testing patterns
- Uses Testing Library best practices
- Implements E2E tests for user flows
AI/ML Engineer
- Adapts testing patterns for ML models (data validation, model performance tests)
- Uses performance testing for inference endpoints
Quick Start Checklist
When starting a new project or feature:
- Define coverage targets (overall, critical paths, new code)
- Choose testing framework (Jest/Vitest, Playwright, etc.)
- Set up test infrastructure (test database, fixtures, factories)
- Create test plan (see templates/test-plan-template.md)
- Implement static analysis (ESLint, TypeScript)
- Write unit tests for business logic (80%+ coverage)
- Write integration tests for API endpoints (70%+ coverage)
- Write E2E tests for critical user journeys (5-10 flows)
- Configure CI/CD pipeline with quality gates
- Set up coverage reporting (Codecov, Coveralls)
- Document testing conventions in project README
For detailed code examples, see references/code-examples.md.
Skill Version: 1.0.0
Last Updated: 2025-10-31
Maintained by: AI Agent Hub Team