name	software-testing-strategy
description	Strategic testing framework: testing pyramid, test design patterns, anti-patterns. Complements tdd-enforcement (tactical) with comprehensive strategy.
tags	testing, quality, strategy
version	2.0.0

Software Testing Strategy

Purpose

Strategic guidance for designing effective test suites: what to test, where to test it, how to structure it. Use tdd-enforcement for tactical test-first workflow.

Iron Laws

Fast Feedback - Unit tests in milliseconds, integration in seconds, full e2e under 30 minutes
Deterministic - No random inputs, no real clocks, no network timing. Every run identical.
Test Behavior, Not Implementation - Verify what code does, not how it does it
Test at Appropriate Level - Don't use e2e for business logic, don't mock everything in unit tests
Tests Are Production Code - Same quality standards: readability, maintainability, simplicity
Risk-Based Coverage - Focus on critical paths and edge cases, not percentage targets

The Testing Pyramid

Level	%	Speed	Scope	Cost
Unit	70%	<100ms	Single function/class	$
Integration	20%	Seconds	Multiple components	$$
E2E	10%	Minutes	Full user journey	$$$

Unit (70%): Business logic, algorithms, validations. No I/O. Hundreds/thousands of tests.

Integration (20%): Component interactions, database queries, API contracts. Real dependencies.

E2E (10%): Critical user journeys only. Top 20% of journeys = 80% of business value.

Architecture for Testability

Problem: Logic tangled with I/O requires heavy mocking and slow tests.

Solution: Separate decisions from effects (see writing-code skill for full pattern).

# Decision (pure, fast unit test)
def users_needing_reminder(users, cutoff_date):
    return [u for u in users if u.expires_at <= cutoff_date and not u.reminded]

# Effect (thin orchestration, integration test)
def send_reminders():
    users = db.find_all()
    to_remind = users_needing_reminder(users, today() + 7.days)
    email.send_batch(generate_emails(to_remind))

Result: More logic in pure functions = more fast unit tests = testing pyramid economics.

Test Design Patterns

AAA Pattern (Arrange-Act-Assert)

def test_withdrawing_more_than_balance_raises_error():
    account = Account(balance=100)          # Arrange
    with pytest.raises(InsufficientFundsError):
        account.withdraw(150)               # Act + Assert

Test Builder Pattern

const admin = new UserBuilder().withRoles('admin').build();
const minor = new UserBuilder().withAge(17).build();

Use for complex object creation. Provides defaults, fluent interface, reusability.

Test Doubles

Type	Purpose	Example
Mock	Verify method was called	Assert email service sent notification
Stub	Control return values	API returns success then failure
Fake	Working in-memory implementation	In-memory database
Spy	Inspect calls after execution	Logger recorded error messages

Rule: Mock external systems, not your own components. Test state, not interactions.

Parameterized Tests

@pytest.mark.parametrize("input,expected", [
    ("Hello World", "hello-world"),
    ("HELLO WORLD", "hello-world"),
    ("Hello, World!", "hello-world"),
])
def test_slugify(input, expected):
    assert slugify(input) == expected

Anti-Patterns

Anti-Pattern	Symptom	Fix
Flaky Tests	Pass/fail randomly	Control time, isolate tests, use fakes
Slow Tests	Unit tests take seconds	Remove I/O, test pure logic
Test Interdependencies	Fail in different order	Isolate tests, complete setup per test
Over-Mocking	Breaks when refactoring	Mock external systems only
Logic in Tests	Tests have if/loops	Simplify, use parameterized tests
Unclear Names	Can't tell what broke	`test_<scenario>_<expected_behavior>`

Test Level Selection

What to Test	Unit	Integration	E2E
Business logic	✓
Calculations	✓
Database queries		✓
API contracts		✓
Critical user journey			✓
Edge cases	✓

Legacy Code Testing

Characterization Tests

When you don't know what code should do, capture what it currently does:

def test_legacy_calculator_current_behavior():
    calc = LegacyCalculator()
    assert calc.calculate(5, "A") == 47.50  # Document current behavior

Finding Seams

Add injection points for tests:

# Before: Hard to test
def process(order)
  gateway = PaymentGateway.new(ENV['KEY'])
  gateway.charge(order.amount)
end

# After: Testable via injection
def process(order, gateway: PaymentGateway.new(ENV['KEY']))
  gateway.charge(order.amount)
end

Test Quality Checklist

Names describe behavior, not implementation
Tests are isolated (no shared state)
Setup is clear and minimal
Assertions are simple and focused
Critical paths have multiple test cases
Edge cases covered
Error paths tested
No complex logic in tests
Tests run fast at appropriate level

Key Takeaways

Testing pyramid 70/20/10 - Unit base, integration middle, e2e top
Test behavior, not implementation - Tests survive refactoring
Risk-based coverage - Focus on critical paths, not percentages
Separate decisions from effects - Enables fast unit tests (see writing-code skill)
Mock external systems only - Test state, not interactions
Zero tolerance for flaky tests - Fix immediately or delete

software-testing-strategy

Install Skill

SKILL.md