name	Test Manager
description	Run visual tests, compare golden files, and report bugs for Stapledons Voyage. Use when user asks to run tests, check golden files, or report visual regressions.

Test Manager

Manage visual regression testing for Stapledon's Voyage. Run test scenarios, compare against golden files, and generate bug reports.

Quick Start

Most common usage:

# Run all test scenarios in test mode (no UI)
.claude/skills/test-manager/scripts/run_tests.sh

# Compare current output to golden files
.claude/skills/test-manager/scripts/compare_golden.sh

# Generate bug report for a visual regression
.claude/skills/test-manager/scripts/report_bug.sh <scenario-name> "<description>"

When to Use This Skill

Invoke this skill when:

User asks to "run tests" or "check tests"
User mentions "golden files" or "visual regression"
User wants to compare current rendering against baseline
User reports a visual bug that needs investigation
After making changes to rendering code

Available Scripts

`scripts/run_tests.sh [scenario-name]`

Run test scenarios and capture screenshots with UI stripped (test mode).

Usage:

# Run all scenarios
.claude/skills/test-manager/scripts/run_tests.sh

# Run specific scenario
.claude/skills/test-manager/scripts/run_tests.sh camera-pan

Output:

Screenshots saved to out/test/<scenario-name>/
Summary of pass/fail status

`scripts/compare_golden.sh [scenario-name]`

Compare current test output against golden files.

Usage:

# Compare all scenarios
.claude/skills/test-manager/scripts/compare_golden.sh

# Compare specific scenario
.claude/skills/test-manager/scripts/compare_golden.sh camera-pan

Output:

Lists matching and differing files
Generates diff images for mismatches (if ImageMagick available)
Exit code 0 if all match, 1 if differences found

`scripts/update_golden.sh [scenario-name]`

Update golden files from current test output.

Usage:

# Update all golden files
.claude/skills/test-manager/scripts/update_golden.sh

# Update specific scenario
.claude/skills/test-manager/scripts/update_golden.sh camera-pan

`scripts/report_bug.sh <scenario> <description>`

Generate a bug report design doc for visual regression.

Usage:

.claude/skills/test-manager/scripts/report_bug.sh camera-zoom "Zoom out produces artifacts at edge"

Workflow

1. Run Tests

Run all scenarios in test mode to generate current screenshots:

.claude/skills/test-manager/scripts/run_tests.sh

2. Compare Against Golden Files

Check if current output matches the baseline:

.claude/skills/test-manager/scripts/compare_golden.sh

3. Handle Results

If tests pass: No action needed.

If tests fail:

Review the diff images in out/test/<scenario>/diff/
If change is intentional: scripts/update_golden.sh
If change is a bug: scripts/report_bug.sh <scenario> "<description>"

4. Investigate Bugs

When a visual regression is found:

Run the specific scenario: make scenario-<name>
Review screenshots
Create bug report with design doc
Fix the issue
Re-run tests to verify

Test Scenarios

Current test scenarios in scenarios/:

Scenario	Purpose
`camera-pan`	Test WASD camera movement
`camera-zoom`	Test Q/E zoom controls
`npc-movement`	Capture NPC movement over time

Add "test_mode": true to JSON for golden file testing.

Golden Files

Golden files are stored in golden/<scenario-name>/ and should be committed to git.

Structure:

golden/
├── camera-pan/
│   ├── initial.png
│   ├── after-down.png
│   └── after-right.png
├── camera-zoom/
│   └── ...
└── npc-movement/
    └── ...

Output Organization

Test output follows the project's out/ directory structure:

Directory	Purpose
`out/test/<scenario>/`	Current test output (screenshots)
`out/test/<scenario>/diff/`	Diff images when tests fail
`out/scenarios/`	Temporary scenario runner output
`golden/<scenario>/`	Baseline golden files (committed to git)

Important:

Test output in out/test/ is gitignored and ephemeral
Golden files in golden/ are committed and versioned
Never put test output in out/ root - always use out/test/<scenario>/

Notes

Always run tests with --test-mode to strip UI elements
Golden files should be updated intentionally, not automatically
Use deterministic seeds for reproducible tests
Screenshots are at 1280x960 internal resolution

IMPORTANT: Screenshot Method

NEVER use macOS screencapture - it captures the desktop at native resolution (5K+), producing huge files.

ALWAYS use in-game screenshot flags:

go run ./cmd/game --screenshot 30 --output out/test.png

Or use the helper script:

.claude/skills/sprint-executor/scripts/take_screenshot.sh -f 30 -o out/test.png

Test Manager

Install Skill

SKILL.md

Test Manager

Quick Start

When to Use This Skill

Available Scripts

`scripts/run_tests.sh [scenario-name]`

`scripts/compare_golden.sh [scenario-name]`

`scripts/update_golden.sh [scenario-name]`

`scripts/report_bug.sh <scenario> <description>`

Workflow

1. Run Tests

2. Compare Against Golden Files

3. Handle Results

4. Investigate Bugs

Test Scenarios

Golden Files

Output Organization

Notes

IMPORTANT: Screenshot Method