| name | visual-regression-testing |
| description | Comprehensive visual regression testing using Playwright and jest-image-snapshot. Implements screenshot comparison, baseline management, CI/CD integration, and visual diff reporting following Ant Design best practices. Use for preventing visual bugs, ensuring UI consistency, and automating visual QA. (project) |
| license | Complete terms in LICENSE.txt |
Visual Regression Testing
This skill provides comprehensive visual regression testing capabilities to detect unintended visual changes in UI components and prevent visual bugs from being merged into production.
Overview
Visual regression testing works by:
- Taking screenshots of components/pages in a known-good state (baselines)
- Taking new screenshots after code changes
- Comparing new screenshots with baselines pixel-by-pixel
- Flagging differences for human review
Framework: Playwright + jest-image-snapshot Approach: Based on Ant Design's visual regression testing methodology
Core Capabilities
1. Screenshot Baseline Management
Create and update baseline screenshots that serve as the source of truth for visual comparison.
Implementation: tests/shared/imageTest.tsx
Key features:
- Automated baseline capture for all component demos
- Version-controlled baseline storage
- Easy baseline updates when intentional changes occur
- Support for multiple viewport sizes and themes
2. Visual Comparison Testing
Implementation: scripts/visual-regression/
Core testing workflow:
// Example structure from tests/shared/imageTest.tsx
import { test, expect } from '@playwright/test';
test('component visual regression', async ({ page }) => {
await page.goto('http://localhost:5173/component-demo');
await page.waitForLoadState('networkidle');
// Take screenshot and compare with baseline
const screenshot = await page.screenshot();
expect(screenshot).toMatchImageSnapshot({
customSnapshotsDir: '__image_snapshots__',
customDiffDir: '__image_snapshots__/diff',
threshold: 0.1, // 0.1% pixel difference tolerance
});
});
3. CI/CD Integration
Implementation: .github/workflows/visual-regression-*.yml
Workflow files:
visual-regression-pr.yml- Runs on pull requestsvisual-regression-baseline.yml- Updates baselines on main branchvisual-regression-report.yml- Generates and uploads diff reports
PR Workflow:
- Checkout code and install dependencies
- Start local server
- Run visual regression tests
- If differences found:
- Generate diff screenshots
- Upload artifacts to GitHub
- Post comment on PR with visual diff preview
- Mark check as failed
- If no differences, mark check as passed
Baseline Update Workflow:
- Runs on main branch after merge
- Regenerates all baseline screenshots
- Commits updated baselines back to repo
File Structure
.github/workflows/
├── visual-regression-pr.yml # PR visual checks
├── visual-regression-baseline.yml # Baseline updates
└── visual-regression-report.yml # Diff reporting
tests/shared/
└── imageTest.tsx # Baseline screenshot utilities
scripts/visual-regression/
├── capture-baselines.ts # Generate baseline screenshots
├── compare-screenshots.ts # Compare current vs baseline
├── generate-report.ts # Create HTML diff report
└── upload-to-storage.ts # Upload to OSS/S3
__image_snapshots__/
├── baseline/ # Baseline screenshots
├── current/ # Latest test screenshots
└── diff/ # Difference highlights
Usage Patterns
For Component Development
When developing a new component:
# 1. Create component and demos
npm run dev
# 2. Generate initial baseline
npm run visual:baseline
# 3. Make changes
# ... edit component code ...
# 4. Run visual regression test
npm run visual:test
# 5. Review diffs (if any)
npm run visual:report
# 6. Update baseline if changes are intentional
npm run visual:update-baseline
For PR Reviews
Reviewers can:
- Check CI status for visual regression failures
- Click artifact link in PR comment
- Review visual diff report
- Approve/request changes based on visual impact
For CI/CD Pipeline
# Example from .github/workflows/visual-regression-pr.yml
name: Visual Regression Testing
on:
pull_request:
branches: [main, develop]
jobs:
visual-regression:
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v4
- name: Setup Node
uses: actions/setup-node@v4
with:
node-version: '20'
- name: Install dependencies
run: npm ci
- name: Install Playwright
run: npx playwright install --with-deps chromium
- name: Start dev server
run: npm run dev &
- name: Wait for server
run: npx wait-on http://localhost:5173
- name: Run visual regression tests
run: npm run visual:test
- name: Upload diff artifacts
if: failure()
uses: actions/upload-artifact@v4
with:
name: visual-diffs
path: __image_snapshots__/diff/
- name: Generate report
if: failure()
run: npm run visual:report
- name: Comment PR with results
if: failure()
uses: actions/github-script@v7
with:
script: |
const fs = require('fs');
const report = fs.readFileSync('visual-report.md', 'utf8');
github.rest.issues.createComment({
issue_number: context.issue.number,
owner: context.repo.owner,
repo: context.repo.repo,
body: report
});
Best Practices
1. Baseline Management
- Version control baselines: Commit baseline screenshots to git
- Update selectively: Only update baselines for intentional visual changes
- Document updates: Include baseline updates in PR descriptions
- Platform consistency: Generate baselines in CI environment, not locally
2. Test Writing
- Wait for stability: Always use
page.waitForLoadState('networkidle') - Hide dynamic content: Mask timestamps, animations, random data
- Test critical paths: Focus on user-facing components
- Multiple viewports: Test responsive breakpoints
- Theme variants: Test light/dark modes if applicable
3. Threshold Configuration
// Strict comparison for critical UI
expect(screenshot).toMatchImageSnapshot({ threshold: 0.01 });
// Relaxed for minor anti-aliasing differences
expect(screenshot).toMatchImageSnapshot({ threshold: 0.1 });
// Very relaxed for charts/animations
expect(screenshot).toMatchImageSnapshot({ threshold: 0.5 });
4. Performance Optimization
- Parallel execution: Run tests in parallel across multiple workers
- Selective testing: Only test affected components in PR
- Incremental baselines: Cache and reuse unchanged baselines
- Headless mode: Always run in headless mode in CI
5. False Positive Reduction
Common causes of false positives:
- Font rendering differences across OS
- Anti-aliasing variations
- Animation timing
- Browser version differences
- System fonts
Solutions:
- Use Docker for consistent environment
- Freeze animations with CSS
- Use consistent browser versions
- Increase threshold for minor differences
Integration with Existing Testing
Visual regression testing complements existing testing strategies:
Unit Tests (Jest)
↓
Component Tests (React Testing Library)
↓
Visual Regression (Playwright + jest-image-snapshot)
↓
E2E Tests (Playwright)
↓
Manual QA
Troubleshooting
"Screenshots don't match but look identical"
Cause: Platform-specific rendering differences Solution:
expect(screenshot).toMatchImageSnapshot({
failureThreshold: 0.01,
failureThresholdType: 'percent'
});
"Baselines outdated after dependency update"
Cause: Library update changed component styling Solution:
# Review changes first
npm run visual:test
# Update all baselines if changes are expected
npm run visual:update-baseline
"CI fails but local tests pass"
Cause: Different environments (fonts, OS, browser version) Solution: Use Docker or GitHub Actions locally
# Run in Docker matching CI environment
docker run -v $(pwd):/app -w /app mcr.microsoft.com/playwright:v1.40.0 npm run visual:test
References
Implementation Files
.github/workflows/visual-regression-*.yml- CI/CD workflowstests/shared/imageTest.tsx- Baseline screenshot implementationscripts/visual-regression/- Test code and utilities
External Resources
- Ant Design Visual Regression - Original methodology
- Playwright Screenshots - Screenshot API docs
- jest-image-snapshot - Snapshot matcher
- Argos CI - Visual testing platform (used by Ant Design)
Advanced Features
Multi-Browser Testing
// Test across Chromium, Firefox, and WebKit
import { devices } from '@playwright/test';
const browsers = ['chromium', 'firefox', 'webkit'];
for (const browser of browsers) {
test(`visual regression on ${browser}`, async ({ playwright }) => {
const browserInstance = await playwright[browser].launch();
const page = await browserInstance.newPage();
// ... test logic
});
}
Responsive Testing
// Test multiple viewports
const viewports = [
{ width: 375, height: 667, name: 'mobile' },
{ width: 768, height: 1024, name: 'tablet' },
{ width: 1920, height: 1080, name: 'desktop' },
];
for (const viewport of viewports) {
test(`visual regression ${viewport.name}`, async ({ page }) => {
await page.setViewportSize(viewport);
// ... test logic
});
}
Component Isolation
// Test individual component variants
const variants = ['default', 'primary', 'danger', 'disabled'];
for (const variant of variants) {
test(`button ${variant} variant`, async ({ page }) => {
await page.goto(`http://localhost:5173/button/${variant}`);
await page.waitForLoadState('networkidle');
const screenshot = await page.locator('[data-testid="button"]').screenshot();
expect(screenshot).toMatchImageSnapshot({
customSnapshotIdentifier: `button-${variant}`
});
});
}
Agent Capabilities
When users request visual regression testing, the agent should:
Setup Infrastructure
- Create GitHub Actions workflows
- Configure Playwright with jest-image-snapshot
- Set up baseline directory structure
- Add npm scripts for common operations
Generate Tests
- Create test files for each component
- Configure appropriate thresholds
- Handle dynamic content masking
- Set up multi-viewport testing
Integrate with CI/CD
- Configure PR checks
- Set up baseline update workflows
- Configure artifact uploads
- Add PR commenting with diff previews
Provide Documentation
- Document baseline update process
- Create troubleshooting guides
- Add examples for common scenarios
- Document threshold configuration
Example: Full Implementation
See the following files for complete implementation examples:
.github/workflows/visual-regression-pr.yml:1- Full PR workflowtests/shared/imageTest.tsx:1- Baseline screenshot utilitiesscripts/visual-regression/compare-screenshots.ts:1- Comparison logic
This skill enables comprehensive visual regression testing that catches visual bugs before they reach production, maintaining UI consistency across the entire application.